BVDT: A Boosted Vector Decision Tree Algorithm for Multi-Class Classification Problems

In this paper, we propose a powerful weak learner (Vector Decision Tree (VDT)) and a new Boosted Vector Decision Tree (BVDT) algorithm framework for the task of multi-class classification. Unlike the traditional scalar valued boosting algorithms, the BVDT algorithm directly maps the feature space to the decision space in the multi-class setting, which facilitates convenient implementations of the multi-class classification algorithms using diverse loss functions. By viewing the explicit hard threshold on the leaf node value applied in the LogitBoost as a constraint optimization problem, we further develop two new variants of the BVDT algorithm: the [Formula: see text]-BVDT and the [Formula: see text]-BVDT. The performance of the proposed algorithm is evaluated on different datasets and compared with three state-of-the-art boosting algorithms, [Formula: see text]-Nearest Neighbor (KNN) and Support Vector Machine (SVM). The results show that the performance of the proposed algorithm ranks first in all but one dataset and reduces the test error rate by 4% up to 58% with respect to the state-of-the-art boosting algorithms based on the scalar-valued weak learner. Furthermore, we present a case study on the Abalone dataset by designing a new loss function that combines the negative log-likelihood loss function of classification problem and square loss function of regression problem.

Download Full-text

Multiclass Boosting with Adaptive Group-BasedkNN and Its Application in Text Categorization

Mathematical Problems in Engineering ◽

10.1155/2012/793490 ◽

2012 ◽

Vol 2012 ◽

pp. 1-24 ◽

Cited By ~ 6

Author(s):

Lei La ◽

Qiao Guo ◽

Dequan Yang ◽

Qimin Cao

Keyword(s):

Chinese Text ◽

Text Categorization ◽

Nearest Neighbor ◽

Classification Problem ◽

Support Vector ◽

Classification Problems ◽

Adaboost Algorithm ◽

Novel Method ◽

Categorization System ◽

Multi Class Classification

AdaBoost is an excellent committee-based tool for classification. However, its effectiveness and efficiency in multiclass categorization face the challenges from methods based on support vector machine (SVM), neural networks (NN), naïve Bayes, andk-nearest neighbor (kNN). This paper uses a novel multi-class AdaBoost algorithm to avoid reducing the multi-class classification problem to multiple two-class classification problems. This novel method is more effective. In addition, it keeps the accuracy advantage of existing AdaBoost. An adaptive group-basedkNN method is proposed in this paper to build more accurate weak classifiers and in this way control the number of basis classifiers in an acceptable range. To further enhance the performance, weak classifiers are combined into a strong classifier through a double iterative weighted way and construct an adaptive group-basedkNN boosting algorithm (AGkNN-AdaBoost). We implement AGkNN-AdaBoost in a Chinese text categorization system. Experimental results showed that the classification algorithm proposed in this paper has better performance both in precision and recall than many other text categorization methods including traditional AdaBoost. In addition, the processing speed is significantly enhanced than original AdaBoost and many other classic categorization algorithms.

Download Full-text

IMPROVEMENT OF THE PERFORMANCE OF FINGERPRINT VERIFICATION USING A COMBINATORIAL APPROACH

Biomedical Engineering Applications Basis and Communications ◽

10.4015/s1016237218500199 ◽

2018 ◽

Vol 30 (03) ◽

pp. 1850019

Author(s):

Fatemeh Alimardani ◽

Reza Boostani

Keyword(s):

Nearest Neighbor ◽

State Of The Art ◽

Recognition Rate ◽

Acceptance Rate ◽

Feature Reduction ◽

Support Vector ◽

Fingerprint Verification ◽

Linear Discriminant ◽

Comparative Results ◽

Verification Systems

Fingerprint verification systems have attracted much attention in secure organizations; however, conventional methods still suffer from unconvincing recognition rate for noisy fingerprint images. To design a robust verification system, in this paper, wavelet and contourlet transforms (CTS) were suggested as efficient feature extraction techniques to elicit a coverall set of descriptive features to characterize fingerprint images. Contourlet coefficients capture the smooth contours of fingerprints while wavelet coefficients reveal its rough details. Due to the high dimensionality of the elicited features, across group variance (AGV), greedy overall relevancy (GOR) and Davis–Bouldin fast feature reduction (DB-FFR) methods were adopted to remove the redundant features. These features were applied to three different classifiers including Boosting Direct Linear Discriminant Analysis (BDLDA), Support Vector Machine (SVM) and Modified Nearest Neighbor (MNN). The proposed method along with state-of-the-art methods were evaluated, over the FVC2004 dataset, in terms of genuine acceptance rate (GAR), false acceptance rate (FAR) and equal error rate (EER). The features selected by AGV were the most significant ones and provided 95.12% GAR. Applying the selected features, by the GOR method, to the modified nearest neighbor, resulted in average EER of [Formula: see text]%, which outperformed the compared methods. The comparative results imply the statistical superiority ([Formula: see text]) of the proposed approach compared to the counterparts.

Download Full-text

Age Classification Based on Feature Fusion

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.519-520.644 ◽

2014 ◽

Vol 519-520 ◽

pp. 644-650

Author(s):

Mian Shui Yu ◽

Yu Xie ◽

Xiao Meng Xie

Keyword(s):

Feature Fusion ◽

Principal Component ◽

Directional Pattern ◽

Gabor Wavelet ◽

Support Vector ◽

Classification Problems ◽

Pca Method ◽

Multi Class Classification ◽

Global And Local ◽

Fusion Theory

Age classification based on facial images is attracting wide attention with its broad application to human-computer interaction (HCI). Since human senescence is a tremendously complex process, age classification is still a highly challenging issue. In our study, Local Directional Pattern (LDP) and Gabor wavelet transform were used to extract global and local facial features, respectively, that were fused based on information fusion theory. The Principal Component Analysis (PCA) method was used for dimensionality reduction of the fused features, to obtain a lower-dimensional age characteristic vector. A Support Vector Machine (SVM) multi-class classifier with Error Correcting Output Codes (ECOC) was proposed in the paper. This was aimed at multi-class classification problems, such as age classification. Experiments on a public FG-NET age database proved the efficiency of our method.

Download Full-text

A Comparison of the Analysis of Methods for Feature Extraction and Classification by Wavelet Transform in SSVEP BCIs

10.21203/rs.3.rs-82008/v1 ◽

2020 ◽

Author(s):

Hoda Heidari ◽

Zahra Einalou ◽

Mehrdad Dadgostar ◽

Hamidreza Hosseinzadeh

Keyword(s):

Feature Extraction ◽

Feature Selection ◽

Wavelet Transform ◽

Decision Tree ◽

Nearest Neighbor ◽

Support Vector ◽

K Nearest Neighbor ◽

Iir Filters ◽

Wide Range ◽

New Feature

Abstract Most of the studies in the field of Brain-Computer Interface (BCI) based on electroencephalography have a wide range of applications. Extracting Steady State Visual Evoked Potential (SSVEP) is regarded as one of the most useful tools in BCI systems. In this study, different methods such as feature extraction with different spectral methods (Shannon entropy, skewness, kurtosis, mean, variance) (bank of filters, narrow-bank IIR filters, and wavelet transform magnitude), feature selection performed by various methods (decision tree, principle component analysis (PCA), t-test, Wilcoxon, Receiver operating characteristic (ROC)), and classification step applying k nearest neighbor (k-NN), perceptron, support vector machines (SVM), Bayesian, multiple layer perceptron (MLP) were compared from the whole stream of signal processing. Through combining such methods, the effective overview of the study indicated the accuracy of classical methods. In addition, the present study relied on a rather new feature selection described by decision tree and PCA, which is used for the BCI-SSVEP systems. Finally, the obtained accuracies were calculated based on the four recorded frequencies representing four directions including right, left, up, and down.

Download Full-text

An Improved Skewness Decision Tree SVM Algorithm for the Classification of Steel Cord Conveyor Belt Defects

Applied Sciences ◽

10.3390/app8122574 ◽

2018 ◽

Vol 8 (12) ◽

pp. 2574 ◽

Cited By ~ 1

Author(s):

Qinghua Mao ◽

Hongwei Ma ◽

Xuhui Zhang ◽

Guangming Zhang

Keyword(s):

Decision Tree ◽

Classification Accuracy ◽

Pso Algorithm ◽

Conveyor Belt ◽

Kernel Functions ◽

Polynomial Kernel ◽

Support Vector ◽

Data Sets ◽

Classification Problems ◽

Steel Cord

Skewness Decision Tree Support Vector Machine (SDTSVM) algorithm is widely known as a supervised learning model for multi-class classification problems. However, the classification accuracy of the SDTSVM algorithm depends on the perfect selection of its parameters and the classification order. Therefore, an improved SDTSVM (ISDTSVM) algorithm is proposed in order to improve the classification accuracy of steel cord conveyor belt defects. In the proposed model, the classification order is determined by the sum of the Euclidean distances between multi-class sample centers and the parameters are optimized by the inertia weight Particle Swarm Optimization (PSO) algorithm. In order to verify the effectiveness of the ISDTSVM algorithm with different feature space, experiments were conducted on multiple UCI (University of California Irvine) data sets and steel cord conveyor belt defects using the proposed ISDTSVM algorithm and the conventional SDTSVM algorithm respectively. The average classification accuracies of five-fold cross-validation were obtained, based on two kinds of kernel functions respectively. For the Vowel, Zoo, and Wine data sets of the UCI data sets, as well as the steel cord conveyor belt defects, the ISDTSVM algorithm improved the classification accuracy by 3%, 3%, 1% and 4% respectively, compared to the SDTSVM algorithm. The classification accuracy of the radial basis function kernel were higher than the polynomial kernel. The results indicated that the proposed ISDTSVM algorithm improved the classification accuracy significantly, compared to the conventional SDTSVM algorithm.

Download Full-text

A novel hybrid intelligent method based on C4.5 decision tree classifier and one-against-all approach for multi-class classification problems

Expert Systems with Applications ◽

10.1016/j.eswa.2007.11.051 ◽

2009 ◽

Vol 36 (2) ◽

pp. 1587-1592 ◽

Cited By ~ 137

Author(s):

Kemal Polat ◽

Salih Güneş

Keyword(s):

Decision Tree ◽

Classification Problems ◽

Decision Tree Classifier ◽

Tree Classifier ◽

C4.5 Decision Tree ◽

Multi Class Classification ◽

Hybrid Intelligent Method

Download Full-text

Classification of exhaled air IR spectra using combination support vector machine, decision tree, and k-nearest neighbor

Fourth International Conference on Terahertz and Microwave Radiation: Generation, Detection, and Applications ◽

10.1117/12.2581563 ◽

2020 ◽

Author(s):

Viktor V. Nikolaev ◽

Dmitry D. Kuzmin ◽

Viacheslav V. Zasedatel

Keyword(s):

Support Vector Machine ◽

Decision Tree ◽

Ir Spectra ◽

Nearest Neighbor ◽

Support Vector ◽

K Nearest Neighbor ◽

Exhaled Air

Download Full-text

A novel Bagged Naïve Bayes-Decision Tree approach for multi-class classification problems

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-169937 ◽

2019 ◽

Vol 36 (3) ◽

pp. 2261-2271 ◽

Cited By ~ 6

Author(s):

Namrata Singh ◽

Pradeep Singh

Keyword(s):

Decision Tree ◽

Naive Bayes ◽

Naïve Bayes ◽

Classification Problems ◽

Multi Class Classification ◽

Tree Approach

Download Full-text

Piecewise Combination of Hyper-Sphere Support Vector Machine for Multi-Class Classification Problems

10.23940/ijpe.19.06.p12.16111619 ◽

2019 ◽

Author(s):

Shuang Liu

Keyword(s):

Support Vector Machine ◽

Support Vector ◽

Classification Problems ◽

Multi Class Classification

Download Full-text

IDENTIFIKASI JENIS IKAN MENGGUNAKAN MODEL HYBRID DEEP LEARNING DAN ALGORITMA KLASIFIKASI

Sebatik ◽

10.46984/sebatik.v24i2.1057 ◽

2020 ◽

Vol 24 (2) ◽

Author(s):

Anifuddin Azis

Keyword(s):

Neural Networks ◽

Support Vector Machine ◽

Logistic Regression ◽

Deep Learning ◽

Random Forest ◽

Decision Tree ◽

Nearest Neighbor ◽

Support Vector ◽

K Nearest Neighbor ◽

Data Output

Indonesia merupakan negara dengan keanekaragaman hayati terbesar kedua di dunia setelah Brazil. Indonesia memiliki sekitar 25.000 spesies tumbuhan dan 400.000 jenis hewan dan ikan. Diperkirakan 8.500 spesies ikan hidup di perairan Indonesia atau merupakan 45% dari jumlah spesies yang ada di dunia, dengan sekitar 7.000an adalah spesies ikan laut. Untuk menentukan berapa jumlah spesies tersebut dibutuhkan suatu keahlian di bidang taksonomi. Dalam pelaksanaannya mengidentifikasi suatu jenis ikan bukanlah hal yang mudah karena memerlukan suatu metode dan peralatan tertentu, juga pustaka mengenai taksonomi. Pemrosesan video atau citra pada data ekosistem perairan yang dilakukan secara otomatis mulai dikembangkan. Dalam pengembangannya, proses deteksi dan identifikasi spesies ikan menjadi suatu tantangan dibandingkan dengan deteksi dan identifikasi pada objek yang lain. Metode deep learning yang berhasil dalam melakukan klasifikasi objek pada citra mampu untuk menganalisa data secara langsung tanpa adanya ekstraksi fitur pada data secara khusus. Sistem tersebut memiliki parameter atau bobot yang berfungsi sebagai ektraksi fitur maupun sebagai pengklasifikasi. Data yang diproses menghasilkan output yang diharapkan semirip mungkin dengan data output yang sesungguhnya. CNN merupakan arsitektur deep learning yang mampu mereduksi dimensi pada data tanpa menghilangkan ciri atau fitur pada data tersebut. Pada penelitian ini akan dikembangkan model hybrid CNN (Convolutional Neural Networks) untuk mengekstraksi fitur dan beberapa algoritma klasifikasi untuk mengidentifikasi spesies ikan. Algoritma klasifikasi yang digunakan pada penelitian ini adalah : Logistic Regression (LR), Support Vector Machine (SVM), Decision Tree, K-Nearest Neighbor (KNN), Random Forest, Backpropagation.

Download Full-text