Two-parameter KNN algorithm and its application in recognition of brand rice

Author(s):  
Zhu Siyu ◽  
He Chongnan ◽  
Song Mingjuan ◽  
Li Linna

In response to the frequent counterfeiting of Wuchang rice in the market, an effective method for identifying brand rice is proposed. Taking near-infrared spectroscopy data from a total of 373 grains of rice from four origins (Wuchang, Shangzhi, Yanshou, and Fangzheng) as the observations, kernel principal component analysis (KPCA) was employed to reduce the dimensionality, and Fisher discriminant analysis (FDA) and the k-nearest neighbor algorithm (KNN) were each used to identify brand rice. Both recognition methods perform very well, with KNN performing somewhat better. However, the shortcomings of KNN are obvious: it examines only one dimension, and its examination of samples is not fine-grained enough. To further improve recognition accuracy, a fuzzy k-nearest neighbor set is defined and fuzzy probability theory is employed to obtain a new recognition method, the Two-Parameter KNN discrimination method. Compared with the KNN algorithm, this method adds an examination dimension. It examines not only the proportion of samples from each pattern class in the k-nearest neighbor set, but also the degree of similarity between the center of each pattern class and the sample to be identified. The recognition process is therefore more fine-grained and the recognition accuracy is higher. In the identification of brand rice, the discriminant accuracy of the Two-Parameter KNN algorithm is significantly higher than that of FDA and of the KNN algorithm.
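
The two quantities examined by the method (class proportion in the k-nearest neighbor set, and similarity to each class center) can be illustrated with a minimal sketch. The abstract does not give the fuzzy-probability formula, so everything specific here is an assumption made for illustration: the function name `two_parameter_knn`, the use of the centroid of each class's members inside the neighbor set as the "center", the similarity `1 / (1 + distance)`, and the product used to combine the two parameters. It should not be read as the authors' actual rule.

```python
import math
from collections import defaultdict

def two_parameter_knn(train, query, k=5):
    """Illustrative only. train is a list of (feature_vector, label) pairs.

    Returns (predicted_label, scores), where each class score combines
    (1) its share of the k nearest neighbours and
    (2) a similarity between the query and that class's centre.
    """
    # k nearest neighbours by Euclidean distance
    neighbours = sorted(train, key=lambda s: math.dist(s[0], query))[:k]
    by_class = defaultdict(list)
    for x, label in neighbours:
        by_class[label].append(x)

    scores = {}
    for label, members in by_class.items():
        proportion = len(members) / k                      # parameter 1
        centre = [sum(col) / len(members) for col in zip(*members)]
        similarity = 1.0 / (1.0 + math.dist(centre, query))  # parameter 2 (assumed form)
        scores[label] = proportion * similarity            # assumed combination
    return max(scores, key=scores.get), scores

# Hypothetical data: one very close "B" sample and two distant "A" samples.
train = [((0.1, 0.0), "B"), ((2.0, 0.0), "A"), ((2.0, 0.2), "A"), ((5.0, 5.0), "B")]
label, scores = two_parameter_knn(train, (0.0, 0.0), k=3)
print(label, scores)
```

With this toy data a plain majority vote over the three nearest neighbors would choose "A" (two votes to one), whereas the combined score favors "B" because its center lies much closer to the query.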

2020 ◽  
Vol 2 (2) ◽  
pp. 29-38
Author(s):  
Abdur Rohman Harits Martawireja ◽  
Hilman Mujahid Purnama ◽  
Atika Nur Rahmawati

Human face recognition is an important research field, and many recent applications use it, both commercially and in law enforcement. Face recognition is a biometric-based system that identifies a person from their facial features with high accuracy. It can be applied to security systems. Many methods can be used in face recognition applications for system security; this article discusses two of them, Two-Dimensional Principal Component Analysis and Kernel Fisher Discriminant Analysis, with K-Nearest Neighbor as the classification method. Both methods are tested using cross-validation. Results from previous research show that a face recognition system using Two-Dimensional Principal Component Analysis achieves an accuracy of 88.73% with 5-fold cross-validation and 89.25% with 2-fold validation, while Kernel Fisher Discriminant with 2-fold cross-validation achieves an average accuracy of 83.10%.
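
The k-fold cross-validation used to evaluate both methods can be sketched as follows. This is a generic illustration, not the evaluation code of the study: the function names and the interleaved fold assignment are assumptions, and `fit_and_score` is a hypothetical callback standing in for "train a classifier on the training indices and return its accuracy on the test indices".

```python
def k_fold_splits(n_samples, n_folds):
    """Yield (train_indices, test_indices) once per fold.

    Samples are assigned to folds in an interleaved fashion; real
    experiments would usually shuffle (and often stratify) first.
    """
    folds = [list(range(start, n_samples, n_folds)) for start in range(n_folds)]
    for i, test in enumerate(folds):
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test

def cross_validated_accuracy(n_samples, n_folds, fit_and_score):
    """Average the accuracy returned by fit_and_score(train, test) over all folds."""
    scores = [fit_and_score(train, test)
              for train, test in k_fold_splits(n_samples, n_folds)]
    return sum(scores) / len(scores)
```

With `n_folds=5` each sample is used for testing exactly once and for training four times; with `n_folds=2` the data are simply split in half and the roles swapped.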


Sensors ◽  
2018 ◽  
Vol 18 (11) ◽  
pp. 3691 ◽  
Author(s):  
Fadilla Zennifa ◽  
Sho Ageno ◽  
Shota Hatano ◽  
Keiji Iramina

Engagement is described as a state in which an individual involved in an activity can ignore other influences. The engagement level is important for obtaining good performance, especially under study conditions. Numerous methods using electroencephalography (EEG), electrocardiography (ECG), and near-infrared spectroscopy (NIRS) for the recognition of engagement have been proposed. However, the results were either unsatisfactory or required many channels. In this study, we introduce the implementation of a low-density hybrid system for engagement recognition. We used a two-electrode wireless EEG, a wireless ECG, and a two-channel wireless NIRS to measure engagement during cognitive tasks. We used electrooculograms (EOG) and eye tracking to record eye movements for data labeling. We calculated the recognition accuracy using a combination of correlation-based feature selection and the k-nearest neighbor algorithm. Following that, we performed a comparative study against stand-alone systems. The results show that the hybrid system had an acceptable accuracy for practical use (71.65 ± 0.16%). In comparison, the accuracy was 65.73 ± 0.17% for a pure EEG system, 67.44 ± 0.19% for pure ECG, and 66.83 ± 0.17% for pure NIRS. Overall, our results demonstrate that the proposed method can be used to improve performance in engagement recognition.
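
Correlation-based feature selection scores a candidate feature subset by rewarding correlation with the class and penalizing redundancy among the features. The sketch below uses the merit heuristic commonly associated with CFS; the abstract does not state which variant or search strategy the authors used, so treating this formula as their criterion is an assumption, and the function name is hypothetical.

```python
import math

def cfs_merit(n_features, mean_feature_class_corr, mean_feature_feature_corr):
    """Merit of a feature subset under the usual CFS heuristic:

        merit = k * r_cf / sqrt(k + k * (k - 1) * r_ff)

    where k is the subset size, r_cf the mean feature-class correlation
    and r_ff the mean feature-feature correlation.
    """
    k = n_features
    return (k * mean_feature_class_corr) / math.sqrt(
        k + k * (k - 1) * mean_feature_feature_corr
    )
```

Under this heuristic, adding features that are fully redundant with one another (`r_ff = 1`) leaves the merit unchanged, while adding uncorrelated features with the same class correlation raises it.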


2011 ◽  
Vol 317-319 ◽  
pp. 150-153
Author(s):  
Wan Li Feng ◽  
Shang Bing Gao

In this paper, a reformative scatter difference discriminant criterion (SDDC) with fuzzy set theory is studied. Using the difference between the between-class and within-class scatter as the discriminant criterion effectively overcomes the singularity of the within-class scatter matrix caused by the small sample size problem in classical Fisher discriminant analysis. However, the conventional SDDC assumes the same level of relevance of each sample to its corresponding class. A fuzzy maximum scatter difference analysis (FMSDA) algorithm is therefore proposed, in which the fuzzy k-nearest neighbor (FKNN) is used to obtain the distribution information of the original samples. This information is used to redefine the corresponding scatter matrices, which differ from those of the conventional SDDC and are effective for extracting discriminative features from overlapping (outlier) samples. Experiments conducted on the FERET face database demonstrate the effectiveness of the proposed method.
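
The idea of graded class relevance behind FKNN can be sketched with distance-weighted memberships. This is a simplified illustration that assumes crisp training labels and inverse-distance weights with fuzzifier `m`; the function name is hypothetical, and the step in which FMSDA uses such memberships to redefine the scatter matrices is not shown because the abstract does not specify it.

```python
import math
from collections import defaultdict

def fuzzy_knn_memberships(train, query, k=3, m=2.0):
    """Return {label: membership} for the query; memberships sum to 1.

    train is a list of (feature_vector, label) pairs. Each of the k nearest
    neighbours contributes a weight 1 / d**(2 / (m - 1)) to its own class.
    """
    neighbours = sorted(train, key=lambda s: math.dist(s[0], query))[:k]
    weights = defaultdict(float)
    for x, label in neighbours:
        d = max(math.dist(x, query), 1e-12)   # guard against zero distance
        weights[label] += 1.0 / d ** (2.0 / (m - 1.0))
    total = sum(weights.values())
    return {label: w / total for label, w in weights.items()}
```

For example, a query with one neighbor of class "A" at distance 1 and two neighbors of class "B" at distance 2 receives memberships of 2/3 for "A" and 1/3 for "B" when `m = 2`, although a majority vote would pick "B".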


2013 ◽  
Vol 303-306 ◽  
pp. 815-818
Author(s):  
Ning Suo ◽  
Hui Lin Wang

This paper presents a novel approach to railway tunnel deformation data analysis in a Safety Monitoring Information System. The proposed work introduces a nonlinear machine learning method, Kernel Principal Component Analysis (KPCA), and a K nearest neighbor (KNN) classifier for railway tunnel deformation data analysis. KPCA is first applied to 1-dimensional signals derived from a sequence of silhouette images to reduce their dimensionality. KNN classification is then performed for railway tunnel deformation data analysis. The experimental results show that the KNN-based railway tunnel deformation data analysis algorithm is better than that based on KPCA.
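
The KPCA step can be sketched in a few lines. This is a generic illustration, not the authors' pipeline: the RBF kernel, the `gamma` value, the function names, and the use of power iteration to obtain only the leading component are all assumptions made to keep the example small and dependency-free.

```python
import math

def rbf_kernel_matrix(X, gamma=1.0):
    """Pairwise RBF kernel values exp(-gamma * ||a - b||^2)."""
    return [[math.exp(-gamma * math.dist(a, b) ** 2) for b in X] for a in X]

def centre_kernel(K):
    """Double-centre K so the implicit feature vectors have zero mean."""
    n = len(K)
    row_mean = [sum(row) / n for row in K]
    grand_mean = sum(row_mean) / n
    # K is symmetric, so column means equal row means.
    return [[K[i][j] - row_mean[i] - row_mean[j] + grand_mean
             for j in range(n)] for i in range(n)]

def first_kernel_pc_scores(X, gamma=1.0, iters=200):
    """Scores of the training points on the leading kernel principal component."""
    Kc = centre_kernel(rbf_kernel_matrix(X, gamma))
    n = len(Kc)
    v = [math.sin(i + 1.0) for i in range(n)]    # fixed, non-degenerate start vector
    eigenvalue = 0.0
    for _ in range(iters):                        # power iteration on the centred kernel
        w = [sum(Kc[i][j] * v[j] for j in range(n)) for i in range(n)]
        eigenvalue = math.sqrt(sum(x * x for x in w))
        if eigenvalue == 0.0:
            break
        v = [x / eigenvalue for x in w]
    # Projection of sample i onto the unit-norm component is sqrt(lambda) * v_i.
    return [math.sqrt(eigenvalue) * x for x in v]
```

The resulting low-dimensional scores would then be passed to a KNN classifier; projecting unseen samples requires centering their kernel values against the training set, which is omitted here.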


2018 ◽  
Vol 2018 ◽  
pp. 1-8 ◽  
Author(s):  
Hui Chen ◽  
Chao Tan ◽  
Zan Lin

Black rice is an important rice species in Southeast Asia. It is common to pass low-priced black rice off as high-priced rice for economic benefit, especially in some remote towns. There is an increasing need for fast, easy-to-use, and low-cost analytical methods for authenticity detection. The feasibility of using near-infrared (NIR) spectroscopy and support vector data description (SVDD) for this goal is explored. Principal component analysis (PCA) is used for exploratory analysis and feature extraction. Two other data description methods, k-nearest neighbor data description (KNNDD) and the GAUSS method, are used as references. A total of 142 samples from three brands were collected for spectral analysis. Each time, the samples of one brand serve as the target class whereas the other samples serve as the outlier class. Based on both the first two principal components (PCs) and the original variables, three types of data descriptions were constructed. On average, the optimized SVDD model achieves acceptable performance, i.e., a specificity of 100% and a sensitivity of 94.2% on the independent test set with a tight boundary. This indicates that SVDD combined with NIR is feasible and effective for authenticity detection of black rice.
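
The sensitivity and specificity reported for the target/outlier setting can be computed as in the sketch below. It assumes the usual one-class convention in which the target class is the positive class; the label strings and the function name are hypothetical, and the function expects at least one target and one outlier sample.

```python
def sensitivity_specificity(y_true, y_pred, target="target"):
    """Return (sensitivity, specificity) with `target` as the positive class.

    sensitivity = accepted targets / all targets
    specificity = rejected outliers / all outliers
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == target and p == target for t, p in pairs)
    fn = sum(t == target and p != target for t, p in pairs)
    tn = sum(t != target and p != target for t, p in pairs)
    fp = sum(t != target and p == target for t, p in pairs)
    return tp / (tp + fn), tn / (tn + fp)
```

A specificity of 100% therefore means that no outlier sample was accepted as the target brand, while a sensitivity below 100% means that some genuine target samples were rejected.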


Author(s):  
Amanah Saeroni ◽  
Memi Nor Hayati ◽  
Rito Goejantoro

Classification is a technique for building a model from data whose group membership is already known. The resulting model is then used to classify new objects. The K-Nearest Neighbor (K-NN) algorithm is a method for classifying new objects based on their K nearest neighbors. Fisher discriminant analysis is a multivariate technique for separating objects into different groups by forming a discriminant function that allocates new objects to groups. This research aims to classify customer premium payment status using the K-NN method and Fisher discriminant analysis, and to compare the classification accuracy of the two methods on insurance customer premium payment status. The data used are the insurance customer data of PT. Prudential Life Samarinda in 2019, with current or non-current premium payment status and four independent variables: age, duration of premium payment, income, and premium payment amount. The comparison of accuracy between the two analyses shows that the K-NN method has a higher level of accuracy than Fisher discriminant analysis for the classification of insurance customer premium payment status. The misclassification rate measured by the APER (Apparent Error Rate) is 15% for the K-NN method and 30% for Fisher discriminant analysis.
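
The APER is the share of misclassified observations in the confusion matrix. The sketch below shows the calculation; the function name is an assumption, and the counts in the usage note are hypothetical numbers chosen only to reproduce a 15% rate, not the study's data.

```python
def apparent_error_rate(confusion):
    """APER = misclassified / total, for a square confusion matrix.

    confusion[i][j] is the number of observations from actual class i
    that were assigned to class j.
    """
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    return (total - correct) / total
```

For instance, a hypothetical two-class matrix `[[45, 5], [10, 40]]` has 15 misclassified observations out of 100, giving an APER of 0.15.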


2018 ◽  
Vol 16 (2) ◽  
pp. e0203 ◽  
Author(s):  
Xuping Feng ◽  
Haijun Yin ◽  
Chu Zhang ◽  
Cheng Peng ◽  
Yong He

The applicability of near infrared (NIR) spectroscopy combined with chemometrics was examined to develop fast, low-cost and non-destructive spectroscopic methods for classification of transgenic maize plants. The transgenic maize plants containing both cry1Ab/cry2Aj-G10evo proteins and their non-transgenic parent were measured in the NIR diffuse reflectance mode with the spectral range of 700–1900 nm. Three variable selection algorithms, including weighted regression coefficients, principal component analysis loadings and second derivatives, were used to extract the sensitive wavelengths that contributed the most discrimination information for these genotypes. Five classification methods, including K-nearest neighbor, Soft Independent Modeling of Class Analogy, Naive Bayes Classifier, Extreme Learning Machine (ELM) and Radial Basis Function Neural Network, were used to build discrimination models based on the preprocessed full spectra and sensitive wavelengths. The results demonstrated that ELM had the best performance of all methods, even though the model’s recognition ability decreased as the variables in the training of neural networks were reduced by using only the sensitive wavelengths. The ELM model calculated on the calibration set showed classification rates of 100% based on the full spectrum and 90.83% based on sensitive wavelengths. NIR spectroscopy combined with chemometrics offers a powerful tool for evaluating large numbers of samples from maize hybrid performance trials and breeding programs.


2021 ◽  
Vol 11 (20) ◽  
pp. 9389
Author(s):  
Zhenbao Li ◽  
Wanlu Jiang ◽  
Sheng Zhang ◽  
Decai Xue ◽  
Shuqing Zhang

Hydraulic pumps are commonly used; however, it is difficult to predict their remaining useful life (RUL) effectively. A new method based on kernel principal component analysis (KPCA) and the just-in-time learning (JITL) method was proposed to solve this problem. First, as the research object, the non-substitute time tac-tail life experiment pressure signals of gear pumps were collected. Following the removal and denoising of the DC component of the pressure signals by the wavelet packet method, multiple characteristic indices were extracted. Subsequently, the KPCA method was used to calculate the weighted fusion of the selected feature indices. Then the state evaluation indices were extracted to characterize the performance degradation of the gear pumps. Finally, an RUL prediction method based on the k-vector nearest neighbor (k-VNN) and JITL methods was proposed. The k-VNN method refers to both the Euclidean distance and angle relationship between two vectors as the basis for modeling. The prediction results verified the feasibility and effectiveness of the proposed method. Compared to the traditional JITL RUL prediction method based on the k-nearest neighbor algorithm, the proposed prediction model of the RUL of a gear pump presents a higher prediction accuracy. The method proposed in this paper is expected to be applied to the RUL prediction and condition monitoring and has broad application prospects and wide applicability.
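
The idea of selecting neighbors by both Euclidean distance and angle can be sketched as follows. The abstract does not state how k-VNN combines the two quantities, so the weighted sum below, the weight `alpha`, the `exp(-distance)` term, and the function names are all assumptions for illustration only. Vectors are assumed to be non-zero.

```python
import math

def cosine(a, b):
    """Cosine of the angle between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.hypot(*a) * math.hypot(*b))

def vector_similarity(a, b, alpha=0.5):
    """Assumed combination: alpha * exp(-distance) + (1 - alpha) * cos(angle)."""
    return alpha * math.exp(-math.dist(a, b)) + (1.0 - alpha) * cosine(a, b)

def k_vector_nearest(samples, query, k=2, alpha=0.5):
    """Return the k samples most similar to the query under vector_similarity."""
    ranked = sorted(samples, key=lambda s: vector_similarity(s, query, alpha),
                    reverse=True)
    return ranked[:k]

# (2, 0) and (1, 1) are equally far from the query (1, 0),
# but (2, 0) points in the same direction and is ranked first.
print(k_vector_nearest([(1.0, 1.0), (-1.0, 0.0), (2.0, 0.0)], (1.0, 0.0), k=2))
```

In a just-in-time learning setting, the selected neighbors would then be used to fit a local model for each query; that step is not shown.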


2020 ◽  
Vol 2020 ◽  
pp. 1-8
Author(s):  
Yinglin Yang ◽  
Xin Zhang ◽  
Jianwei Yin ◽  
Xiangyang Yu

The classification of plastic waste before recycling is of great significance for effective recycling. To achieve rapid, nondestructive, and on-site detection, a portable near-infrared spectrometer was used in this study to obtain the diffuse reflectance spectra of both standard and commercial plastics made of ABS, PC, PE, PET, PP, PS, and PVC. After applying a series of pretreatments, principal component analysis (PCA) was used to analyze the clustering trend. K-nearest neighbor (KNN), support vector machine (SVM), and back propagation neural network (BPNN) classification models were developed and evaluated. The results showed that, after pretreatment, the different plastics could be well separated in the space of the first three principal components, and that the classification models achieved excellent classification results and high generalization capability. This study indicated that the portable NIR spectrometer, integrated with chemometrics, could achieve excellent performance and has great potential in the field of commercial plastic identification.


2012 ◽  
Vol 503-504 ◽  
pp. 1601-1604 ◽  
Author(s):  
Jing Ming Ning ◽  
Sheng Peng Wang ◽  
Zheng Zhu Zhang ◽  
Xiao Chun Wan

Near-infrared (NIR) spectroscopy, combined with pattern recognition, was applied in this study for the rapid identification of black tea from different origins. The K-Nearest Neighbor recognition method was used to establish a tea origin recognition model, which involved optimizing the number of principal component factors (PCs) and the identification rate using a cross-validation method. The experimental results showed that, after standard normal variate spectral preprocessing, an optimized model was obtained when the number of PCs was three, with the cross-validation recognition rate and the predicted recognition rate reaching 98.1% and 93.3%, respectively.
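
Standard normal variate preprocessing centers and scales each spectrum by its own mean and standard deviation. The sketch below shows that transformation on a single spectrum; the function name is an assumption, and the sample standard deviation (n - 1 denominator) is used, which is the common convention but is not stated in the abstract.

```python
import statistics

def snv(spectrum):
    """Standard normal variate: (value - mean) / standard deviation, per spectrum."""
    mean = statistics.fmean(spectrum)
    sd = statistics.stdev(spectrum)   # sample standard deviation of this spectrum
    return [(value - mean) / sd for value in spectrum]
```

Because each spectrum is normalized independently, two spectra that differ only by a constant offset and a multiplicative factor, such as `[1, 2, 3]` and `[10, 20, 30]`, map to the same corrected spectrum.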

