Journal Article10.1016/J.KNOSYS.2021.106878
Semi-supervised anomaly detection algorithms: A comparative summary and future research directions
Miryam Elizabeth Villa-Pérez,Miguel A. Álvarez-Carmona,Octavio Loyola-González,Miguel Angel Medina-Pérez,Juan Carlos Velazco-Rossell,Kim-Kwang Raymond Choo +5 more
86
TL;DR: In this paper, the performance of 29 semi-supervised anomaly detection algorithms on 95 benchmark imbalanced databases from the KEEL repository is studied. And the authors show that BRM is a robust classifier, in terms of achieving better classification results than the other 28 state-of-the-art techniques on diverse anomaly detection problems.
read more
Abstract: While anomaly detection is relatively well-studied, it remains a topic of ongoing interest and challenge, as our society becomes increasingly interconnected and digitalized. In this paper, we focus on existing anomaly detection approaches, by empirically studying the performance of 29 semi-supervised anomaly detection algorithms on 95 benchmark imbalanced databases from the KEEL repository. These include well-established and commonly used classifiers (e.g., One-Class Support Vector Machine (ocSVM) and Isolation Forest) and recent proposals (e.g., BRM and XGBOD). Findings from our in-depth empirical study show that BRM is a robust classifier, in terms of achieving better classification results than the other 28 state-of-the-art techniques on diverse anomaly detection problems. We also observe that OCKRA, Isolation Forest, and ocSVM achieve good performance overall AUC, but poor classification results on databases where the number of objects is equal or greater than 1,460, all features are nominal, or the imbalance ratio is equal or greater than 39.14.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
A survey on machine learning methods for churn prediction
L. Geiler,Séverine Affeldt,Mohamed Nadif +2 more
TL;DR: In this article , the authors draw general guidelines from a benchmark of supervised machine learning techniques in association with widely used data sampling approaches on publicly available datasets in the context of churn prediction.
Outlier detection method based on high-density iteration
Yu Zhou,Xiaotao Hao,Dawei Yu,Jingwen Cheng,Jichun Li +4 more
TL;DR: HDIOD effectively detects both global and local outliers by iteratively comparing local kernel density with extended k-nearest neighbors.
31
Anomaly Detection and Classification in Multispectral Time Series Based on Hidden Markov Models
TL;DR: A framework for anomaly detection, localization, and classification that exploits the temporal information contained in a given season at a parcel level to detect and localize outliers using hidden Markov models (HMMs).
27
Self-supervised anomaly detection in computer vision and beyond: A survey and outlook.
Hadi Hojjati,Thi Kieu Khanh Ho,Narges Armanfard +2 more
TL;DR: This paper aims to provide a comprehensive review of the current methodologies in self-supervised anomaly detection, including the development of more effective and efficient algorithms and the integration of these techniques with other related fields, such as multi-modal learning.
25
A survey on semi-supervised graph clustering
Fatemeh Daneshfar,Sayvan Soleymanbaigi,Pedram Yamini,Mohammad Sadra Amini +3 more
TL;DR: This paper presents a comprehensive survey of Semi-Supervised Graph Clustering (SSGC) techniques, categorizing and analyzing methodologies, identifying unexplored avenues, and discussing applications in various fields, while highlighting limitations and future directions.
17
References
•Journal Article
Scikit-learn: Machine Learning in Python
Fabian Pedregosa,Gaël Varoquaux,Alexandre Gramfort,Vincent Michel,Bertrand Thirion,Olivier Grisel,Mathieu Blondel,Peter Prettenhofer,Ron Weiss,Vincent Dubourg,Jake Vanderplas,Alexandre Passos,David Cournapeau,Matthieu Brucher,Matthieu Perrot,Edouard Duchesnay +15 more
TL;DR: Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems, focusing on bringing machine learning to non-specialists using a general-purpose high-level language.
•Proceedings Article
Auto-Encoding Variational Bayes
Diederik P. Kingma,Max Welling +1 more
- 01 Jan 2014
TL;DR: A stochastic variational inference and learning algorithm that scales to large datasets and, under some mild differentiability conditions, even works in the intractable case is introduced.
An introduction to ROC analysis
TL;DR: The purpose of this article is to serve as an introduction to ROC graphs and as a guide for using them in research.
21.3K