Active Learning Based Support Vector Data Description for Large Data Set Novelty Detection

Author(s):  
Lili Yin ◽  
Huangang Wang ◽  
Wenhui Fan ◽  
Qingkai Wang
2014 ◽  
Vol 2014 ◽  
pp. 1-9 ◽  
Author(s):  
Yi-Hung Liu ◽  
Yung Ting ◽  
Shian-Shing Shyu ◽  
Chang-Kuo Chen ◽  
Chung-Lin Lee ◽  
...  

Face detection is a crucial prestage for face recognition and is often treated as a binary (face and nonface) classification problem. While this strategy is simple to implement, face detection accuracy would drop when nonface training patterns are undersampled. To avoid these problems, we propose in this paper a one-class learning-based face detector called support vector data description (SVDD) committee, which consists of several SVDD members, each of which is trained on a subset of face patterns. Nonfaces are not required in the training of the SVDD committee. Therefore, the face detection accuracy of SVDD committee is independent of the nonface training patterns. Moreover, the proposed SVDD committee is also able to improve generalization ability of the original SVDD when the face data set has a multicluster distribution. Experiments carried out on the extended MIT face data set show that the proposed SVDD committee can achieve better face detection accuracy than the widely used SVM face detector and performs better than other one-class classifiers, including the original SVDD and the kernel principal component analysis (Kernel PCA).


2020 ◽  
Vol 53 (7-8) ◽  
pp. 1049-1058
Author(s):  
Jiong Zhang ◽  
Yue Wang ◽  
Qian Li ◽  
Biao Wang

As increasing data-driven control strategies are applied in electric arc furnace systems, the problem of novelty detection has drawn more attentions than before. The presence of outliers should be the main obstacle in practical applications for these advanced control techniques. To this end, this paper proposes a dynamically selective support vector data description model to discover novelties in electric arc furnace. In this model, support vector data description plays the role of base detector. Artificial outliers are generated with two objectives, one is to assist the dynamic selection, and the other is to optimize two parameters of support vector data description. Then clustering technique is used to determine the validation set for each test point. Finally, a probabilistic method is used to compute the competence of base detectors. In contrast to other novelty ensembles that have parallel structures, our ensemble model has a dynamic selection mechanism that could facilitate the mining of the potential of base detectors. Three synthetic and three real-world datasets are used to validate the effectiveness of the proposed detection model. Experimental results have approved our method by comparing it with several competitors.


2020 ◽  
Vol 42 (11) ◽  
pp. 2113-2126 ◽  
Author(s):  
Ping Yuan ◽  
Zhizhong Mao ◽  
Biao Wang

Support vector data description (SVDD) is a boundary-based one-class classifier that has been widely used for process monitoring during recent years. However, in some applications where databases are often contaminated by outliers, the performance of SVDD would become deteriorated, leading to low detection rate. To this end, this paper proposes a pruned SVDD model in order to improve its robustness. In contrast to other robust SVDD models that are developed from the algorithmic level, we prune the basic SVDD from a data level. The rationale is to exclude outlier examples from the final training set as many as possible. Specifically, three different SVDD models are constructed successively with different training sets. The first model is used to extract target points by means of rejecting more suspect outlier examples. The second model is constructed using those extracted target points, and is used to recover some false outlier examples labeled by the first model. We build the third (final) model with the final training set consisting of target examples by the first model and false outlier examples by the second model. We validate our proposed method on 20 benchmark data sets and TE data set. Comparative results show that our pruned model could improve the robustness of SVDD more efficiently.


2014 ◽  
Vol 19 (5) ◽  
pp. 1171-1186 ◽  
Author(s):  
Wenjun Hu ◽  
Shitong Wang ◽  
Fu-lai Chung ◽  
Yong Liu ◽  
Wenhao Ying

Sign in / Sign up

Export Citation Format

Share Document