Enhancing the Performance of SVM on Skewed Data Sets by Exciting Support Vectors

Author(s):  
José Hernández Santiago ◽  
Jair Cervantes ◽  
Asdrúbal López-Chau ◽  
Farid García Lamont
2020 ◽  
Vol 57 (4) ◽  
pp. 444-464
Author(s):  
Gauss M. Cordeiro ◽  
Thiago G. Ramires ◽  
Edwin M. M. Ortega ◽  
Rodrigo R. Pescim

We define the extended beta family of distributions to generalize the beta generator pioneered by Eugene et al. [10]. This paper is cited in at least 970 scientific articles and extends more than fifty well-known distributions. Any continuous distribution can be generalized by means of this family. The proposed family can present greater flexibility to model skewed data. Some of its mathematical properties are investigated and maximum likelihood is adopted to estimate its parameters. Further, for different parameter settings and sample sizes, some simulations are conducted. The superiority of the proposed family is illustrated by means of two real data sets.


2017 ◽  
Vol 228 ◽  
pp. 187-197 ◽  
Author(s):  
Jair Cervantes ◽  
Farid Garcia-Lamont ◽  
Lisbeth Rodriguez ◽  
Asdrúbal López ◽  
José Ruiz Castilla ◽  
...  

Author(s):  
JIE JI ◽  
QIANGFU ZHAO

This paper proposes a hybrid learning method to speed up the classification procedure of Support Vector Machines (SVM). Comparing most algorithms trying to decrease the support vectors in an SVM classifier, we focus on reducing the data points that need SVM for classification, and reduce the number of support vectors for each SVM classification. The system uses a Nearest Neighbor Classifier (NNC) to treat data points attentively. In the training phase, the NNC selects data near partial decision boundary, and then trains sub SVM for each Voronoi pair. For classification, most non-boundary data points are classified by NNC directly, while remaining boundary data points are passed to a corresponding local expert SVM. We also propose a data selection method for training reliable expert SVM. Experimental results on several generated and public machine learning data sets show that the proposed method significantly accelerates the testing speed.


2021 ◽  
Vol 8 (4) ◽  
pp. 747-760
Author(s):  
A. El Ouissari ◽  
◽  
K. El Moutaouakil ◽  

In this work, we propose a deep prediction diabetes system based on a new version of the support vector machine optimization model. First, we determine three types of patients (noisy, cord, and interior) basing on specific parameters. Second, we equilibrate the clinical data sets by suppressing noisy and cord patients. Third, we determine the support vectors by solving an optimization program with a reasonable size. Our system is performed on the well-known diabetes dataset PIMA. The experimental results show that the proposed method improves the prediction accuracy and the proposed system significantly outperforms all other versions of SVM as well as literature methods of classification.


Sign in / Sign up

Export Citation Format

Share Document