On the parameter optimization of Support Vector Machines for binary classification

Summary Classifying biological data is a common task in the biomedical context. Predicting the class of new, unknown information allows researchers to gain insight and make decisions based on the available data. Also, using classification methods often implies choosing the best parameters to obtain optimal class separation, and the number of parameters might be large in biological datasets.Support Vector Machines provide a well-established and powerful classification method to analyse data and find the minimal-risk separation between different classes. Finding that separation strongly depends on the available feature set and the tuning of hyper-parameters. Techniques for feature selection and SVM parameters optimization are known to improve classification accuracy, and its literature is extensive.In this paper we review the strategies that are used to improve the classification performance of SVMs and perform our own experimentation to study the influence of features and hyper-parameters in the optimization process, using several known kernels.

Download Full-text

A comparison study: Support vector machines for binary classification in machine learning

2011 4th International Conference on Biomedical Engineering and Informatics (BMEI) ◽

10.1109/bmei.2011.6098517 ◽

2011 ◽

Cited By ~ 4

Author(s):

Wencai Zeng ◽

Jiong Jia ◽

Zhonglong Zheng ◽

Chenmao Xie ◽

Li Guo

Keyword(s):

Machine Learning ◽

Support Vector Machines ◽

Binary Classification ◽

Support Vector ◽

Comparison Study ◽

Vector Machines ◽

Study Support

Download Full-text

A note on supervised classification and Nash-equilibrium problems

RAIRO - Operations Research ◽

10.1051/ro/2016024 ◽

2017 ◽

Vol 51 (2) ◽

pp. 329-341

Author(s):

Nicolas Couellan

Keyword(s):

Nash Equilibrium ◽

Support Vector Machines ◽

Supervised Classification ◽

Dual Space ◽

Equilibrium Problems ◽

Support Vector ◽

Generalized Nash Equilibrium ◽

Class Separation ◽

Vector Machines ◽

Generalized Nash Equilibrium Problems

In this note, we investigate connections between supervised classification and (Generalized) Nash equilibrium problems (NEP & GNEP). For the specific case of support vector machines (SVM), we exploit the geometric properties of class separation in the dual space to formulate a non-cooperative game. NEP and Generalized NEP formulations are proposed for both binary and multi-class SVM problems.

Download Full-text

Gaussian Processes for Classification: Mean-Field Algorithms

Neural Computation ◽

10.1162/089976600300014881 ◽

2000 ◽

Vol 12 (11) ◽

pp. 2655-2684 ◽

Cited By ~ 91

Author(s):

Manfred Opper ◽

Ole Winther

Keyword(s):

Support Vector Machines ◽

Gaussian Processes ◽

Disordered Systems ◽

Binary Classification ◽

Computational Cost ◽

Mean Field ◽

Strong Support ◽

Support Vector ◽

Vector Machines ◽

Leave One Out

We derive a mean-field algorithm for binary classification with gaussian processes that is based on the TAP approach originally proposed in statistical physics of disordered systems. The theory also yields an approximate leave-one-out estimator for the generalization error, which is computed with no extra computational cost. We show that from the TAP approach, it is possible to derive both a simpler “naive” mean-field theory and support vector machines (SVMs) as limiting cases. For both mean-field algorithms and support vector machines, simulation results for three small benchmark data sets are presented. They show that one may get state-of-the-art performance by using the leave-one-out estimator for model selection and the built-in leave-one-out estimators are extremely precise when compared to the exact leave-one-out estimate. The second result is taken as strong support for the internal consistency of the mean-field approach.

Download Full-text

Handling binary classification problems with a priority class by using Support Vector Machines

Applied Soft Computing ◽

10.1016/j.asoc.2017.08.023 ◽

2017 ◽

Vol 61 ◽

pp. 661-669 ◽

Cited By ~ 10

Author(s):

L. Gonzalez-Abril ◽

C. Angulo ◽

H. Nuñez ◽

Y. Leal

Keyword(s):

Support Vector Machines ◽

Binary Classification ◽

Support Vector ◽

Classification Problems ◽

Priority Class ◽

Vector Machines ◽

A Priority

Download Full-text

Bankruptcy Prediction of Engineering Companies in the EU Using Classification Methods

Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis ◽

10.11118/actaun201866051347 ◽

2018 ◽

Vol 66 (5) ◽

pp. 1347-1356 ◽

Cited By ~ 1

Author(s):

Michaela Staňková ◽

David Hampel

Keyword(s):

Logistic Regression ◽

Support Vector Machines ◽

Binary Classification ◽

Classification Tree ◽

Classification Trees ◽

Bankruptcy Prediction ◽

Support Vector ◽

Type I ◽

Vector Machines ◽

The Eu

This article focuses on the problem of binary classification of 902 small- and medium‑sized engineering companies active in the EU, together with additional 51 companies which went bankrupt in 2014. For classification purposes, the basic statistical method of logistic regression has been selected, together with a representative of machine learning (support vector machines and classification trees method) to construct models for bankruptcy prediction. Different settings have been tested for each method. Furthermore, the models were estimated based on complete data and also using identified artificial factors. To evaluate the quality of prediction we observe not only the total accuracy with the type I and II errors but also the area under ROC curve criterion. The results clearly show that increasing distance to bankruptcy decreases the predictive ability of all models. The classification tree method leads us to rather simple models. The best classification results were achieved through logistic regression based on artificial factors. Moreover, this procedure provides good and stable results regardless of other settings. Artificial factors also seem to be a suitable variable for support vector machines models, but classification trees achieved better results using original data.

Download Full-text

Influence of Dataset Character on Classification Performance of Support Vector Machines for Grain Analysis

Artificial Intelligence and Applications ◽

10.2316/p.2010.674-071 ◽

2010 ◽

Author(s):

K. Anding ◽

G. Linβ ◽

P. Brückner

Keyword(s):

Support Vector Machines ◽

Classification Performance ◽

Support Vector ◽

Vector Machines

Download Full-text

PSO Parameters Optimization Based Support Vector Machines for Hyperspectral Classification

2009 First International Conference on Information Science and Engineering ◽

10.1109/icise.2009.859 ◽

2009 ◽

Cited By ~ 1

Author(s):

Sheng Ding ◽

Shunxin Li

Keyword(s):

Support Vector Machines ◽

Parameters Optimization ◽

Support Vector ◽

Vector Machines ◽

Hyperspectral Classification

Download Full-text

Comparing performance of interval neutrosophic sets and neural networks with support vector machines for binary classification problems

2008 2nd IEEE International Conference on Digital Ecosystems and Technologies ◽

10.1109/dest.2008.4635138 ◽

2008 ◽

Cited By ~ 1

Author(s):

Pawalai Kraipeerapun ◽

Chun Che Fung

Keyword(s):

Neural Networks ◽

Support Vector Machines ◽

Binary Classification ◽

Support Vector ◽

Classification Problems ◽

Neutrosophic Sets ◽

Vector Machines ◽

Interval Neutrosophic Sets

Download Full-text

Principal weighted support vector machines for sufficient dimension reduction in binary classification

Biometrika ◽

10.1093/biomet/asw057 ◽

2017 ◽

pp. asw057

Author(s):

Seung Jun Shin ◽

Yichao Wu ◽

Hao Helen Zhang ◽

Yufeng Liu

Keyword(s):

Support Vector Machines ◽

Dimension Reduction ◽

Binary Classification ◽

Support Vector ◽

Sufficient Dimension Reduction ◽

Vector Machines

Download Full-text

Rescale-Invariant SVM for Binary Classification

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2017/348 ◽

2017 ◽

Cited By ~ 1

Author(s):

Mojtaba Montazery ◽

Nic Wilson

Keyword(s):

Machine Learning ◽

Decision Making ◽

Support Vector Machines ◽

Binary Classification ◽

Experimental Results ◽

Support Vector ◽

Computation Method ◽

Learning Methods ◽

Machine Learning Methods ◽

Vector Machines

Support Vector Machines (SVM) are among the most well-known machine learning methods, with broad use in different scientific areas. However, one necessary pre-processing phase for SVM is normalization (scaling) of features, since SVM is not invariant to the scales of the features’ spaces, i.e., different ways of scaling may lead to different results. We define a more robust decision-making approach for binary classification, in which one sample strongly belongs to a class if it belongs to that class for all possible rescalings of features. We derive a way of characterising the approach for binary SVM that allows determining when an instance strongly belongs to a class and when the classification is invariant to rescaling. The characterisation leads to a computation method to determine whether one sample is strongly positive, strongly negative or neither. Our experimental results back up the intuition that being strongly positive suggests stronger confidence that an instance really is positive.

Download Full-text