Towards Application of One-Class Classification Methods to Medical Data

The Scientific World JOURNAL ◽

10.1155/2014/730712 ◽

2014 ◽

Vol 2014 ◽

pp. 1-7 ◽

Cited By ~ 10

Author(s):

Itziar Irigoien ◽

Basilio Sierra ◽

Concepción Arenas

Keyword(s):

State Of The Art ◽

Gaussian Mixture ◽

Support Vector ◽

Support Vector Data Description ◽

Data Sets ◽

Biomedical Data ◽

Vector Data ◽

Target Class ◽

Tumor Recognition ◽

One Class Classification

In the problem of one-class classification (OCC) one of the classes, the target class, has to be distinguished from all other possible objects, considered as nontargets. In many biomedical problems this situation arises, for example, in diagnosis, image based tumor recognition or analysis of electrocardiogram data. In this paper an approach to OCC based on a typicality test is experimentally compared with reference state-of-the-art OCC techniques—Gaussian, mixture of Gaussians, naive Parzen, Parzen, and support vector data description—using biomedical data sets. We evaluate the ability of the procedures using twelve experimental data sets with not necessarily continuous data. As there are few benchmark data sets for one-class classification, all data sets considered in the evaluation have multiple classes. Each class in turn is considered as the target class and the units in the other classes are considered as new units to be classified. The results of the comparison show the good performance of the typicality approach, which is available for high dimensional data; it is worth mentioning that it can be used for any kind of data (continuous, discrete, or nominal), whereas state-of-the-art approaches application is not straightforward when nominal variables are present.

Download Full-text

Meta Learning for Few-Shot One-Class Classification

AI ◽

10.3390/ai2020012 ◽

2021 ◽

Vol 2 (2) ◽

pp. 195-208

Author(s):

Gabriel Dahia ◽

Maurício Pamplona Segundo

Keyword(s):

Feature Representation ◽

Support Vector ◽

Support Vector Data Description ◽

Target Class ◽

Feature Representations ◽

Shot Classification ◽

Training Stage ◽

Comparable Performance ◽

Meta Learning ◽

One Class Classification

We propose a method that can perform one-class classification given only a small number of examples from the target class and none from the others. We formulate the learning of meaningful features for one-class classification as a meta-learning problem in which the meta-training stage repeatedly simulates one-class classification, using the classification loss of the chosen algorithm to learn a feature representation. To learn these representations, we require only multiclass data from similar tasks. We show how the Support Vector Data Description method can be used with our method, and also propose a simpler variant based on Prototypical Networks that obtains comparable performance, indicating that learning feature representations directly from data may be more important than which one-class algorithm we choose. We validate our approach by adapting few-shot classification datasets to the few-shot one-class classification scenario, obtaining similar results to the state-of-the-art of traditional one-class classification, and that improves upon that of one-class classification baselines employed in the few-shot setting.

Download Full-text

Authenticity Detection of Black Rice by Near-Infrared Spectroscopy and Support Vector Data Description

International Journal of Analytical Chemistry ◽

10.1155/2018/8032831 ◽

2018 ◽

Vol 2018 ◽

pp. 1-8 ◽

Cited By ~ 5

Author(s):

Hui Chen ◽

Chao Tan ◽

Zan Lin

Keyword(s):

Near Infrared ◽

Nearest Neighbor ◽

Principal Component ◽

Support Vector ◽

Support Vector Data Description ◽

Vector Data ◽

K Nearest Neighbor ◽

Black Rice ◽

Target Class ◽

Data Description

Black rice is an important rice species in Southeast Asia. It is a common phenomenon to pass low-priced black rice off as high-priced ones for economic benefit, especially in some remote towns. There is increasing need for the development of fast, easy-to-use, and low-cost analytical methods for authenticity detection. The feasibility to utilize near-infrared (NIR) spectroscopy and support vector data description (SVDD) for such a goal is explored. Principal component analysis (PCA) is used for exploratory analysis and feature extraction. Another two data description methods, i.e., k-nearest neighbor data description (KNNDD) and GAUSS method, are used as the reference. A total of 142 samples from three brands were collected for spectral analysis. Each time, the samples of a brand serve as the target class whereas other samples serve as the outlier class. Based on both the first two principal components (PCs) and original variables, three types of data descriptions were constructed. On average, the optimized SVDD model achieves acceptable performance, i.e., a specificity of 100% and a sensitivity of 94.2% on the independent test set with tight boundary. It indicates that SVDD combined with NIR is feasible and effective for authenticity detection of black rice.

Download Full-text

A Support Vector Data Description Approach to NLOS Identification in UWB Positioning

Mathematical Problems in Engineering ◽

10.1155/2014/963418 ◽

2014 ◽

Vol 2014 ◽

pp. 1-6 ◽

Cited By ~ 3

Author(s):

Shiwei Tian ◽

Luwen Zhao ◽

Guangxia Li

Keyword(s):

Identification Problem ◽

Classification Problem ◽

Line Of Sight ◽

Support Vector ◽

Delay Spread ◽

Support Vector Data Description ◽

Vector Data ◽

Data Description ◽

One Class Classification ◽

Mean Excess Delay

Non-line-of-sight (NLOS) propagation is one of the most important challenges in radio positioning, and, in recent years, significant attention has been drawn to the identification and mitigation of NLOS signals. This paper focuses on the identification of NLOS signals. The authors consider the NLOS identification problem as a one-class classification problem and apply the support vector data description (SVDD), providing accurate data descriptions utilizing kernel techniques, to perform NLOS identification in ultrawide bandwidth (UWB) positioning. Our work is based on the fact that some features extracted from the received signal waveforms, such as the kurtosis, the mean excess delay spread, and the root mean square delay spread, are different between line-of-sight (LOS) and NLOS signals. Numerical simulations are performed to demonstrate the performance, using a dataset derived from a measurement campaign.

Download Full-text

A pruned support vector data description-based outlier detection method: Applied to robust process monitoring

Transactions of the Institute of Measurement and Control ◽

10.1177/0142331220905951 ◽

2020 ◽

Vol 42 (11) ◽

pp. 2113-2126 ◽

Cited By ~ 2

Author(s):

Ping Yuan ◽

Zhizhong Mao ◽

Biao Wang

Keyword(s):

Process Monitoring ◽

Support Vector ◽

Support Vector Data Description ◽

Data Sets ◽

Vector Data ◽

Training Set ◽

Data Set ◽

Data Description ◽

One Class Classifier ◽

Comparative Results

Support vector data description (SVDD) is a boundary-based one-class classifier that has been widely used for process monitoring during recent years. However, in some applications where databases are often contaminated by outliers, the performance of SVDD would become deteriorated, leading to low detection rate. To this end, this paper proposes a pruned SVDD model in order to improve its robustness. In contrast to other robust SVDD models that are developed from the algorithmic level, we prune the basic SVDD from a data level. The rationale is to exclude outlier examples from the final training set as many as possible. Specifically, three different SVDD models are constructed successively with different training sets. The first model is used to extract target points by means of rejecting more suspect outlier examples. The second model is constructed using those extracted target points, and is used to recover some false outlier examples labeled by the first model. We build the third (final) model with the final training set consisting of target examples by the first model and false outlier examples by the second model. We validate our proposed method on 20 benchmark data sets and TE data set. Comparative results show that our pruned model could improve the robustness of SVDD more efficiently.

Download Full-text

SVDD-Based Pattern Denoising

Neural Computation ◽

10.1162/neco.2007.19.7.1919 ◽

2007 ◽

Vol 19 (7) ◽

pp. 1919-1938 ◽

Cited By ~ 36

Author(s):

Jooyoung Park ◽

Daesung Kang ◽

Jongho Kim ◽

James T. Kwok ◽

Ivor W. Tsang

Keyword(s):

Test Pattern ◽

Main Idea ◽

Feature Space ◽

Training Data ◽

Support Vector ◽

Support Vector Data Description ◽

Data Sets ◽

Decision Boundary ◽

Vector Data ◽

Real World Data

The support vector data description (SVDD) is one of the best-known one-class support vector learning methods, in which one tries the strategy of using balls defined on the feature space in order to distinguish a set of normal data from all other possible abnormal objects. The major concern of this letter is to extend the main idea of SVDD to pattern denoising. Combining the geodesic projection to the spherical decision boundary resulting from the SVDD, together with solving the preimage problem, we propose a new method for pattern denoising. We first solve SVDD for the training data and then for each noisy test pattern, obtain its denoised feature by moving its feature vector along the geodesic on the manifold to the nearest decision boundary of the SVDD ball. Finally we find the location of the denoised pattern by obtaining the pre-image of the denoised feature. The applicability of the proposed method is illustrated by a number of toy and real-world data sets.

Download Full-text

KERNEL WHITENING FOR ONE-CLASS CLASSIFICATION

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s021800140300240x ◽

2003 ◽

Vol 17 (03) ◽

pp. 333-347 ◽

Cited By ~ 37

Author(s):

DAVID M. J. TAX ◽

PIOTR JUSZCZAK

Keyword(s):

Feature Space ◽

Support Vector ◽

Support Vector Data Description ◽

Good Representation ◽

Vector Data ◽

Data Description ◽

Unit Variance ◽

Good Distinction ◽

One Class Classification ◽

Target Data

In one-class classification one tries to describe a class of target data and to distinguish it from all other possible outlier objects. Obvious applications are areas where outliers are very diverse or very difficult or expensive to measure, such as in machine diagnostics or in medical applications. In order to have a good distinction between the target objects and the outliers, good representation of the data is essential. The performance of many one-class classifiers critically depends on the scaling of the data and is often harmed by data distributions in (nonlinear) subspaces. This paper presents a simple preprocessing method which actively tries to map the data to a spherical symmetric cluster and is almost insensitive to data distributed in subspaces. It uses techniques from Kernel PCA to rescale the data in a kernel feature space to unit variance. This transformed data can now be described very well by the Support Vector Data Description, which basically fits a hypersphere around the data. The paper presents the methods and some preliminary experimental results.

Download Full-text

One Class Classification for Anomaly Detection: Support Vector Data Description Revisited

Advances in Data Mining. Applications and Theoretical Aspects - Lecture Notes in Computer Science ◽

10.1007/978-3-642-23184-1_3 ◽

2011 ◽

pp. 25-39 ◽

Cited By ~ 14

Author(s):

Eric J. Pauwels ◽

Onkar Ambekar

Keyword(s):

Anomaly Detection ◽

Support Vector ◽

Support Vector Data Description ◽

Vector Data ◽

Data Description ◽

One Class Classification

Download Full-text

Compressor Surge Detection Based on Support Vector Data Description

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.152-154.1545 ◽

2012 ◽

Vol 152-154 ◽

pp. 1545-1549

Author(s):

Chang Zheng Li ◽

Yong Lei

Keyword(s):

Support Vector Machine ◽

Active Control ◽

Axial Flow ◽

Support Vector ◽

Support Vector Data Description ◽

Vector Data ◽

Data Description ◽

Compressor Surge ◽

Aerodynamic Instability ◽

One Class Classification

Axial flow compressors work as an indispensable device in industry fields. Surge is a phenomenon of aerodynamic instability, which characterized by disruption of flow. When a compressor works in surge state, the vibration is so intense that it may causes accidents. Detecting surge timely and accurately not only insure safety of compressors but also is a key of active control of aerodynamic instability. Support vector data description (SVDD) is a one-class classification method developed based on the theory of support vector machine (SVM). In this paper, we introduce SVDD into the field of compressor surge detection. It demonstrates that SVDD method can give a warning far ahead of surge.

Download Full-text

Helicopter main reduction planetary gear fault diagnosis method based on SVDD

International Journal of Applied Electromagnetics and Mechanics ◽

10.3233/jae-209316 ◽

2020 ◽

Vol 64 (1-4) ◽

pp. 137-145

Author(s):

Yubin Xia ◽

Dakai Liang ◽

Guo Zheng ◽

Jingling Wang ◽

Jie Zeng

Keyword(s):

Fault Diagnosis ◽

Planetary Gear ◽

Gaussian Kernel ◽

Support Vector ◽

Support Vector Data Description ◽

Vector Data ◽

Energy Characteristics ◽

Gear Fault ◽

Channel Information ◽

Diagnosis Method

Aiming at the irregularity of the fault characteristics of the helicopter main reducer planetary gear, a fault diagnosis method based on support vector data description (SVDD) is proposed. The working condition of the helicopter is complex and changeable, and the fault characteristics of the planetary gear also show irregularity with the change of working conditions. It is impossible to diagnose the fault by the regularity of a single fault feature; so a method of SVDD based on Gaussian kernel function is used. By connecting the energy characteristics and fault characteristics of the helicopter main reducer running state signal and performing vector quantization, the planetary gear of the helicopter main reducer is characterized, and simultaneously couple the multi-channel information, which can accurately characterize the operational state of the planetary gear’s state.

Download Full-text

MK-FSVM-SVDD: A Multiple Kernel-based Fuzzy SVM Model for Predicting DNA-binding Proteins via Support Vector Data Description

Current Bioinformatics ◽

10.2174/1574893615999200607173829 ◽

2020 ◽

Vol 15 ◽

Author(s):

Yi Zou ◽

Hongjie Wu ◽

Xiaoyi Guo ◽

Li Peng ◽

Yijie Ding ◽

...

Keyword(s):

Dna Binding ◽

Binding Proteins ◽

Detection Efficiency ◽

Dna Binding Proteins ◽

Support Vector ◽

Support Vector Data Description ◽

Vector Data ◽

Data Description ◽

Multiple Kernel ◽

Svm Model

Background: Detecting DNA-binding proetins (DBPs) based on biological and chemical methods is time consuming and expensive. Objective: In recent years, the rise of computational biology methods based on Machine Learning (ML) has greatly improved the detection efficiency of DBPs. Method: In this study, Multiple Kernel-based Fuzzy SVM Model with Support Vector Data Description (MK-FSVM-SVDD) is proposed to predict DBPs. Firstly, sex features are extracted from protein sequence. Secondly, multiple kernels are constructed via these sequence feature. Than, multiple kernels are integrated by Centered Kernel Alignment-based Multiple Kernel Learning (CKA-MKL). Next, fuzzy membership scores of training samples are calculated with Support Vector Data Description (SVDD). FSVM is trained and employed to detect new DBPs. Results: Our model is test on several benchmark datasets. Compared with other methods, MK-FSVM-SVDD achieves best Matthew's Correlation Coefficient (MCC) on PDB186 (0.7250) and PDB2272 (0.5476). Conclusion: We can conclude that MK-FSVM-SVDD is more suitable than common SVM, as the classifier for DNA-binding proteins identification.

Download Full-text