Klasifikasi Anjing dan Kucing menggunakan Algoritma Linear Discriminant Analysis dan Support Vector Machine

2019 ◽  
Vol 11 (1) ◽  
pp. 46-51
Author(s):  
Alethea Suryadibrata ◽  
Suryadi Darmawan Salim

One of the factors driving technological development is the increase in computers ability to complete various jobs. One of them is doing image processing, which is widely used in our daily life, such as the use of fingerprints, face/iris recognition barcodes, medical needs, and various other uses. Classification is one of the applications of image processing that is used the most. One algorithm that can be used for the development of image classification systems is Linear Discriminant Analysis (LDA) and Support Vector Machine (SVM). LDA is a feature extraction algorithm to find a subspace that separates classes well. SVM is a classification algorithm, based on the idea of finding a hyperplane that best divides a dataset into classes. In this study, LDA and SVM algorithms were tested on the dog and cat classification system, with the highest F- score calculation results being 0.69 with 200 training data and 50 testing data for cats and 0.64 with 200 training data and 30 testing data for dogs.

Biometrika ◽  
2020 ◽  
Vol 107 (2) ◽  
pp. 311-330 ◽  
Author(s):  
Timothy I Cannings ◽  
Yingying Fan ◽  
Richard J Samworth

Summary We study the effect of imperfect training data labels on the performance of classification methods. In a general setting, where the probability that an observation in the training dataset is mislabelled may depend on both the feature vector and the true label, we bound the excess risk of an arbitrary classifier trained with imperfect labels in terms of its excess risk for predicting a noisy label. This reveals conditions under which a classifier trained with imperfect labels remains consistent for classifying uncorrupted test data points. Furthermore, under stronger conditions, we derive detailed asymptotic properties for the popular $k$-nearest neighbour, support vector machine and linear discriminant analysis classifiers. One consequence of these results is that the $k$-nearest neighbour and support vector machine classifiers are robust to imperfect training labels, in the sense that the rate of convergence of the excess risk of these classifiers remains unchanged; in fact, our theoretical and empirical results even show that in some cases, imperfect labels may improve the performance of these methods. The linear discriminant analysis classifier is shown to be typically inconsistent in the presence of label noise unless the prior probabilities of the classes are equal. Our theoretical results are supported by a simulation study.


2021 ◽  
Vol 2021 ◽  
pp. 1-11
Author(s):  
Heping Li ◽  
Yu Ren ◽  
Fan Yu ◽  
Dongliang Song ◽  
Lizhe Zhu ◽  
...  

To facilitate the enhanced reliability of Raman-based tumor detection and analytical methodologies, an ex vivo Raman spectral investigation was conducted to identify distinct compositional information of healthy (H), ductal carcinoma in situ (DCIS), and invasive ductal carcinoma (IDC). Then, principal component analysis-linear discriminant analysis (PCA-LDA) and principal component analysis-support vector machine (PCA-SVM) models were constructed for distinguishing spectral features among different tissue groups. Spectral analysis highlighted differences in levels of unsaturated and saturated lipids, carotenoids, protein, and nucleic acid between healthy and cancerous tissue and variations in the levels of nucleic acid, protein, and phenylalanine between DCIS and IDC. Both classification models were principal component analysis-linear discriminant analysis to be extremely efficient on discriminating tissue pathological types with 99% accuracy for PCA-LDA and 100%, 100%, and 96.7% for PCA-SVM analysis based on linear kernel, polynomial kernel, and radial basis function (RBF), respectively, while PCA-SVM algorithm greatly simplified the complexity of calculation without sacrificing performance. The present study demonstrates that Raman spectroscopy combined with multivariate analysis technology has considerable potential for improving the efficiency and performance of breast cancer diagnosis.


2014 ◽  
pp. 61-67
Author(s):  
A. Amari ◽  
N. El Bari ◽  
B. Bouchikhi

An electronic nose based system, which employs an array of six inexpensive commercial gas sensors based on tin dioxide (Figaro Engineering Inc., Japan), has been used to analyse the freshness states of anchovies. Fresh anchovies were stored in a refrigerator at 4 ± 1°C over a period of 15 days. Electronic nose measurements need no sample preparation and the results indicated that the spoilage process of anchovies could be followed by using this technique. Conductance responses of volatile compounds produced during storage of anchovy were monitored and the result were analysed by multivariate analysis methods. In this paper principal component analysis (PCA) and linear discriminant analysis (LDA) were used to investigate whether the electronic nose was able to distinguishing among different freshness states (fresh, moderated and non-fresh samples). The loadings analysis was used to identify the sensors responsible for discrimination in the current pattern file. Therefore, the support vector machines (SVM) method was applied to the new subset, with only the selected sensors, to confirm that a subset of a few sensors can be chosen to explain all the variance. The results obtained prove that the electronic nose can discriminate successfully different freshness state using LDA analysis. Some sensors have the highest influence in the current pattern file for electronic nose. Support vector machine (SVM) model, applied to the new subset of sensors show the good performance.


Sign in / Sign up

Export Citation Format

Share Document