On the Evaluation of Outlier Detection and One-Class Classification Methods

Outlier detection, a type of one-class classification problem, is an important research topic in data mining and machine learning. Its task is to identify sample points that deviate markedly from the normal data. A reliable outlier detector needs to build a model that encloses the normal data tightly. In this paper, an improved one-class SVM (OC-SVM) classifier is proposed for outlier detection problems. We name this method OC-SVM with minimum within-class scatter (OC-WCSSVM); it exploits the inner-class structure of the training set by minimizing the within-class scatter of the training data. This yields a more accurate hyperplane for outlier detection, such that the margin between the training data and the origin in a higher-dimensional space is as large as possible, while the decision boundary around the normal data is as tight as possible. Experimental results on a synthetic dataset and 10 real-world datasets demonstrate that the proposed OC-WCSSVM algorithm is effective and outperforms the compared algorithms.
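The two ideas named in the abstract above, a tight description of the normal data and the within-class scatter of the training set, can be illustrated with a much simpler model than an SVM. The sketch below is not OC-WCSSVM and not an OC-SVM: it is a plain centroid-distance detector that encloses the training data in a ball, written only to make the terms concrete. The function names, the 0.95 quantile, and the synthetic Gaussian data are all assumptions chosen for illustration.

```python
import math
import random


def centroid(points):
    """Mean of the training points, computed per dimension."""
    n = len(points)
    return [sum(p[d] for p in points) / n for d in range(len(points[0]))]


def within_class_scatter(points, center):
    """Sum of squared distances to the mean (trace of the scatter matrix)."""
    return sum(sum((x - c) ** 2 for x, c in zip(p, center)) for p in points)


def fit_ball(points, quantile=0.95):
    """Fit a ball around the normal data.

    The radius is an empirical quantile of the training distances, so a
    small fraction of training points may fall outside the ball. The
    quantile value is an illustrative assumption.
    """
    center = centroid(points)
    dists = sorted(math.dist(p, center) for p in points)
    idx = min(len(dists) - 1, int(quantile * len(dists)))
    return center, dists[idx]


def is_outlier(point, center, radius):
    """Flag a point that lies outside the fitted ball."""
    return math.dist(point, center) > radius


# Synthetic "normal" class: 200 points from a 2-D standard Gaussian.
rng = random.Random(0)
normal = [[rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)] for _ in range(200)]

center, radius = fit_ball(normal)
scatter = within_class_scatter(normal, center)
```

A smaller scatter means the training points sit closer to their mean, which is the quantity the abstract says OC-WCSSVM minimizes alongside the usual margin objective. Lowering the quantile here tightens the ball at the cost of rejecting more training points.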

One-Class Classifiers: A Review and Analysis of Suitability in the Context of Mobile-Masquerader Detection

Revue Africaine de la Recherche en Informatique et Mathématiques Appliquées ◽

10.46298/arima.1877 ◽

2007 ◽

Vol 6, April 2007, joint... ◽

Author(s):

Oleksiy Mazhelis

Keyword(s):

Classification Method ◽

Classification Methods ◽

User Characteristics ◽

Training Set ◽

Legitimate User ◽

One Class Classification

One-class classifiers, which are trained using only data from one class, are justified when data from other classes are difficult to obtain. In particular, their use is justified in mobile-masquerader detection, where user characteristics are classified as belonging to the legitimate user class or to the impostor class, and where collecting data originating from impostors is problematic. This paper systematically reviews various one-class classification methods and analyses their suitability in the context of mobile-masquerader detection. For each classification method, its sensitivity to errors in the training set, its computational requirements, and other characteristics are considered. Then, for each category of features used in masquerader detection, suitable classifiers are identified.

A literature review on one-class classification and its potential applications in big data

Journal Of Big Data ◽

10.1186/s40537-021-00514-x ◽

2021 ◽

Vol 8 (1) ◽

Author(s):

Naeem Seliya ◽

Azadeh Abdollah Zadeh ◽

Taghi M. Khoshgoftaar

Keyword(s):

Big Data ◽

Outlier Detection ◽

Novelty Detection ◽

Class Imbalance ◽

Imbalanced Datasets ◽

Minority Class ◽

Related Literature ◽

Potential Applications ◽

Multi Class Classification ◽

One Class Classification

In severely imbalanced datasets, using traditional binary or multi-class classification typically leads to bias towards the class(es) with the much larger number of instances. Under such conditions, modeling and detecting instances of the minority class is very difficult. One-class classification (OCC) is an approach to detect abnormal data points compared to the instances of the known class and can serve to address issues related to severely imbalanced datasets, which are especially common in big data. We present a detailed survey of OCC-related literature published over approximately the last decade. We group the different works into three categories: outlier detection, novelty detection, and deep learning and OCC. We closely examine and evaluate selected works on OCC such that a good cross section of approaches, methods, and application domains is represented in the survey. Commonly used techniques in OCC for outlier detection and for novelty detection, respectively, are discussed. We observed that one area largely omitted in OCC-related literature is its application context for big data and its inherently associated problems, such as severe class imbalance, class rarity, noisy data, feature selection, and data reduction. We feel the survey will be appreciated by researchers working in these areas of big data.

Implementing Multi-class Classifiers by One-class Classification Methods

The 2006 IEEE International Joint Conference on Neural Network Proceedings ◽

10.1109/ijcnn.2006.246699 ◽

2006 ◽

Cited By ~ 4

Author(s):

Tao Ban ◽

S. Abe

Keyword(s):

Classification Methods ◽

One Class Classification

Case-Based Reasoning: The Search for Similar Solutions and Identification of Outliers

Complexity ◽

10.1155/2018/9280787 ◽

2018 ◽

Vol 2018 ◽

pp. 1-12 ◽

Cited By ~ 1

Author(s):

P. S. Szczepaniak ◽

A. Duraj

Keyword(s):

Linear Regression ◽

Outlier Detection ◽

Task Type ◽

Case Based Reasoning ◽

Classification Methods ◽

Bayes Classifier ◽

Detection Process ◽

Research Gap ◽

One Class Classification Methods Based Non-Relevance Feedback Document Retrieval

2006 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology Workshops ◽

10.1109/wi-iatw.2006.98 ◽

2006 ◽

Cited By ~ 4

Author(s):

Takashi Onoda ◽

Hiroshi Murata ◽

Seiji Yamada

Keyword(s):

Relevance Feedback ◽

Document Retrieval ◽

Classification Methods ◽

One Class Classification

On simple one-class classification methods

2012 IEEE International Symposium on Information Theory Proceedings ◽

10.1109/isit.2012.6283685 ◽

2012 ◽

Cited By ~ 8

Author(s):

Zineb Noumir ◽

Paul Honeine ◽

Cédric Richard

Keyword(s):

Classification Methods ◽

One Class Classification

Outlier Detection Using One-Class Classification

Algorithms for Intelligent Systems - Applications of Advanced Computing in Systems ◽

10.1007/978-981-33-4862-2_24 ◽

2021 ◽

pp. 227-233

Author(s):

Sonali Gupta ◽

Srikanth Boddu ◽

Muthya Ambati

Keyword(s):

Outlier Detection ◽

One Class Classification