Application of Computational Intelligence Methods for the Automated Identification of Paper-Ink Samples Based on LIBS

Krzysztof Rzecki; Tomasz Sośnicki; Mateusz Baran; Michał Niedźwiecki; Małgorzata Król; Tomasz Łojewski; U Acharya; Özal Yildirim; Paweł Pławiak

doi:10.3390/s18113670

Application of Computational Intelligence Methods for the Automated Identification of Paper-Ink Samples Based on LIBS

Sensors ◽

10.3390/s18113670 ◽

2018 ◽

Vol 18 (11) ◽

pp. 3670 ◽

Cited By ~ 18

Author(s):

Krzysztof Rzecki ◽

Tomasz Sośnicki ◽

Mateusz Baran ◽

Michał Niedźwiecki ◽

Małgorzata Król ◽

...

Keyword(s):

Neural Network ◽

Random Forest ◽

Computational Intelligence ◽

Probabilistic Neural Network ◽

Spectral Lines ◽

Support Vector ◽

Generalized Regression Neural Network ◽

K Nearest Neighbor ◽

Computational Intelligence Methods

Laser-induced breakdown spectroscopy (LIBS) is an important analysis technique with applications in many industrial branches and fields of scientific research. Nowadays, the advantages of LIBS are impaired by the main drawback in the interpretation of obtained spectra and identification of observed spectral lines. This procedure is highly time-consuming since it is essentially based on the comparison of lines present in the spectrum with the literature database. This paper proposes the use of various computational intelligence methods to develop a reliable and fast classification of quasi-destructively acquired LIBS spectra into a set of predefined classes. We focus on a specific problem of classification of paper-ink samples into 30 separate, predefined classes. For each of 30 classes (10 pens of each of 5 ink types combined with 10 sheets of 5 paper types plus empty pages), 100 LIBS spectra are collected. Four variants of preprocessing, seven classifiers (decision trees, random forest, k-nearest neighbor, support vector machine, probabilistic neural network, multi-layer perceptron, and generalized regression neural network), 5-fold stratified cross-validation, and a test on an independent set (for methods evaluation) scenarios are employed. Our developed system yielded an accuracy of 99.08%, obtained using the random forest classifier. Our results clearly demonstrates that machine learning methods can be used to identify the paper-ink samples based on LIBS reliably at a faster rate.

Download Full-text

Application of Computational Intelligence Methods for the Automated Identification of Paper-Ink Samples Based on LIBS

10.20944/preprints201808.0402.v1 ◽

2018 ◽

Author(s):

Krzysztof Rzecki ◽

Tomasz Sośnicki ◽

Mateusz Baran ◽

Michał Niedźwiecki ◽

Małgorzata Król ◽

...

Keyword(s):

Neural Network ◽

Random Forest ◽

Computational Intelligence ◽

Probabilistic Neural Network ◽

Independent Set ◽

Support Vector ◽

Generalized Regression Neural Network ◽

Breakdown Spectroscopy ◽

Computational Intelligence Methods

Laser-induced breakdown spectroscopy (LIBS) is an important analysis technique with applications in many industrial branches and fields of scientific research. Nowadays, the advantages of LIBS are impaired by the main drawback in the analysis of collected data. This procedure is essentially based on the comparison of lines present in the spectrum with a literature database. This paper proposes the use of various computational intelligence methods to develop a reliable and fast classification of non-destructively acquired LIBS spectra into a set of predefined classes. We focus on a specific problem of classification of paper-ink samples into 30 separate, predefined classes. For each of 30 classes (10 pens of each of 5 ink types combined with 10 sheets of 5 paper types plus empty pages) 100 LIBS spectra are collected. Four variants of preprocessing, seven classifiers (Decision trees, Random forest, k-Nearest Neighbour, Support Vector Machine, Probabilistic Neural Network, Multi-Layer Perceptron, and Generalized Regression Neural Network), 5-fold stratified cross-validation and test on an independent set (for methods evaluation) scenarios are employed. Our developed system yielded an accuracy of 99.08% with average classification time of about 0.12 s is obtained using the random forest classifier. Our results clearly demonstrates that machine learning methods can be used to identify the paper-ink samples based on LIBS reliably at a faster rate.

Download Full-text

Recognition and Classification of Incipient Cable Failures Based on Variational Mode Decomposition and a Convolutional Neural Network

Energies ◽

10.3390/en12102005 ◽

2019 ◽

Vol 12 (10) ◽

pp. 2005 ◽

Cited By ~ 5

Author(s):

Jiaying Deng ◽

Wenhai Zhang ◽

Xiaomei Yang

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Bp Neural Network ◽

Nearest Neighbor ◽

Feature Vector ◽

Support Vector ◽

Variational Mode Decomposition ◽

K Nearest Neighbor ◽

Mode Decomposition

To avoid power supply hazards caused by cable failures, this paper presents an approach of incipient cable failure recognition and classification based on variational mode decomposition (VMD) and a convolutional neural network (CNN). By using VMD, the original current signal is decomposed into seven modes with different center frequencies. Then, 42 features are extracted for the seven modes and used to construct a feature vector as input of the CNN to classify incipient cable failure through deep learning. Compared with using the original signals directly as the CNN input, the proposed approach is more efficient and robust. Experiments on different classifiers, namely, the decision tree (DT), K-nearest neighbor (KNN), BP neural network (BP) and support vector machine (SVM), and show that the CNN outperforms the other classifiers in terms of accuracy.

Download Full-text

Image Classification of Tourist Attractions with K-Nearest Neighbor, Logistic Regression, Random Forest, and Support Vector Machine

International Journal on Advanced Science Engineering and Information Technology ◽

10.18517/ijaseit.10.6.9098 ◽

2020 ◽

Vol 10 (6) ◽

pp. 2207

Author(s):

Herry Sujaini

Keyword(s):

Support Vector Machine ◽

Logistic Regression ◽

Random Forest ◽

Image Classification ◽

Nearest Neighbor ◽

Support Vector ◽

K Nearest Neighbor ◽

Tourist Attractions

Download Full-text

Perbandingan Algoritma Machine Learning dalam Menilai Sebuah Lokasi Toko Ritel

Jurnal Teknik Informatika dan Sistem Informasi ◽

10.28932/jutisi.v7i1.3182 ◽

2021 ◽

Vol 7 (1) ◽

Author(s):

Kristiawan Kristiawan ◽

Andreas Widjaja

Keyword(s):

Neural Network ◽

Machine Learning ◽

Logistic Regression ◽

Random Forest ◽

Pearson Correlation ◽

Recursive Feature Elimination ◽

Support Vector ◽

Learning Technology ◽

K Nearest Neighbor ◽

Store Location

Abstract — The application of machine learning technology in various industrial fields is currently developing rapidly, including in the retail industry. This study aims to find the most accurate algorithmic model so that it can be used to help retailers choose a store location more precisely. By using several methods such as Pearson Correlation, Chi-Square Features, Recursive Feature Elimination and Tree-based to select features (predictive variables). These features are then used to train and build models using 6 different classification algorithms such as Logistic Regression, K Nearest Neighbor (KNN), Decision Tree, Random Forest, Support Vector Machine (SVM) and Neural Network to classify whether a location is recommended or not as a new store location. Keywords— Application of Machine Learning, Pearson Correlation, Random Forest, Neural Network, Logistic Regression.

Download Full-text

Combination of support vector machine, artificial neural network and random forest for improving the classification of convective and stratiform rain using spectral features of SEVIRI data

Atmospheric Research ◽

10.1016/j.atmosres.2017.12.006 ◽

2018 ◽

Vol 203 ◽

pp. 118-129 ◽

Cited By ~ 14

Author(s):

Mourad Lazri ◽

Soltane Ameur

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Support Vector Machine ◽

Random Forest ◽

Support Vector ◽

Spectral Features ◽

Stratiform Rain ◽

Artificial Neural

Download Full-text

A Comparative Assessment of Artificial Neural Network, Generalized Regression Neural Network, Least-Square Support Vector Regression, and K-Nearest Neighbor Regression for Monthly Streamflow Forecasting in Linear and Nonlinear Conditions

Water Resources Management ◽

10.1007/s11269-017-1807-2 ◽

2017 ◽

Vol 32 (1) ◽

pp. 243-258 ◽

Cited By ~ 19

Author(s):

Fereshteh Modaresi ◽

Shahab Araghinejad ◽

Kumars Ebrahimi

Keyword(s):

Neural Network ◽

Nearest Neighbor ◽

Comparative Assessment ◽

Least Square ◽

Support Vector ◽

Generalized Regression Neural Network ◽

Streamflow Forecasting ◽

K Nearest Neighbor ◽

Monthly Streamflow ◽

Monthly Streamflow Forecasting

Download Full-text

Review on Techniques for Plant Leaf Classification and Recognition

Computers ◽

10.3390/computers8040077 ◽

2019 ◽

Vol 8 (4) ◽

pp. 77 ◽

Cited By ~ 8

Author(s):

Muhammad Azfar Firdaus Azlah ◽

Lee Suan Chua ◽

Fakhrul Razan Rahmad ◽

Farah Izana Abdullah ◽

Sharifah Rafidah Wan Alwi

Keyword(s):

Neural Network ◽

Machine Learning ◽

Nearest Neighbor ◽

Learning Algorithm ◽

Probabilistic Neural Network ◽

Machine Learning Algorithms ◽

Support Vector ◽

Plant Systematics ◽

K Nearest Neighbor ◽

Plant Leaf

Plant systematics can be classified and recognized based on their reproductive system (flowers) and leaf morphology. Neural networks is one of the most popular machine learning algorithms for plant leaf classification. The commonly used neutral networks are artificial neural network (ANN), probabilistic neural network (PNN), convolutional neural network (CNN), k-nearest neighbor (KNN) and support vector machine (SVM), even some studies used combined techniques for accuracy improvement. The utilization of several varying preprocessing techniques, and characteristic parameters in feature extraction appeared to improve the performance of plant leaf classification. The findings of previous studies are critically compared in terms of their accuracy based on the applied neural network techniques. This paper aims to review and analyze the implementation and performance of various methodologies on plant classification. Each technique has its advantages and limitations in leaf pattern recognition. The quality of leaf images plays an important role, and therefore, a reliable source of leaf database must be used to establish the machine learning algorithm prior to leaf recognition and validation.

Download Full-text

An Optimized Recursive General Regression Neural Network Oracle for the Prediction and Diagnosis of Diabetes

Global Journal of Computer Science and Technology ◽

10.34257/gjcstdvol19is2pg1 ◽

2018 ◽

pp. 1-11 ◽

Cited By ~ 1

Author(s):

Dana Bani-Hani ◽

Pruthak Patel ◽

Tasneem Alshaikh

Keyword(s):

Neural Network ◽

Performance Metrics ◽

Prediction Models ◽

Probabilistic Neural Network ◽

General Regression Neural Network ◽

Medical Decision ◽

Support Vector ◽

Pima Indians ◽

K Nearest Neighbor ◽

General Regression

Diabetes is a serious, chronic disease that has been seeing a rise in the number of cases and prevalence over the past few decades. It can lead to serious complications and can increase the overall risk of dying prematurely. Data-oriented prediction models have become effective tools that help medical decision-making and diagnoses in which the use of machine learning in medicine has increased substantially. This research introduces the Recursive General Regression Neural Network Oracle (RGRNN Oracle) and is applied on the Pima Indians Diabetes dataset for the prediction and diagnosis of diabetes. The R-GRNN Oracle (Bani-Hani, 2017) is an enhancement to the GRNN Oracle developed by Masters et al. in 1998, in which the recursive model is created of two oracles: one within the other. Several classifiers, along with the R-GRNN Oracle and the GRNN Oracle, are applied to the dataset, they are: Support Vector Machine (SVM), Multilayer Perceptron (MLP), Probabilistic Neural Network (PNN), Gaussian Naïve Bayes (GNB), K-Nearest Neighbor (KNN), and Random Forest (RF). Genetic Algorithm (GA) was used for feature selection as well as the hyperparameter optimization of SVM and MLP, and Grid Search (GS) was used to optimize the hyperparameters of KNN and RF. The performance metrics accuracy, AUC, sensitivity, and specificity were recorded for each classifier.

Download Full-text

Selected model fusion: an approach for improving the accuracy of monthly streamflow forecasting

Journal of Hydroinformatics ◽

10.2166/hydro.2018.098 ◽

2018 ◽

Vol 20 (4) ◽

pp. 917-933 ◽

Cited By ~ 3

Author(s):

Fereshteh Modaresi ◽

Shahab Araghinejad ◽

Kumars Ebrahimi

Keyword(s):

Neural Network ◽

Nearest Neighbor ◽

Least Square ◽

Support Vector ◽

Generalized Regression Neural Network ◽

Streamflow Forecasting ◽

K Nearest Neighbor ◽

Model Fusion ◽

Monthly Streamflow ◽

Monthly Streamflow Forecasting

Abstract Monthly streamflow forecasting plays an important role in water resources management, especially for dam operation. In this paper, an approach of model fusion technique named selected model fusion (SMF) is applied and assessed under two strategies of model selection in order to improve the accuracy of streamflow forecasting. The two strategies of SMF are: fusion of the outputs of best individual forecasting models (IFMs) selected by dendrogram analysis (S1), and fusion of the best outputs of all IFMs resulting from an ordered selection algorithm (S2). In both strategies, five data-driven models including: artificial neural network, generalized regression neural network, least square-support vector regression, K-nearest neighbor regression, and multiple linear regression with optimized structure are performed as IFMs. The SMF strategies are applied for forecasting the monthly inflow to Karkheh reservoir, Iran, owning various patterns between predictor and predicted variables in different months. Results show that applying SMF approach based on both strategies results in more accurate forecasts in comparison with fusion of all IFMs outputs (S3), as the benchmark. However, comparison of the two SMF strategies reveals that the implementation of strategy (S2) considerably improves the accuracy of forecasts than strategy (S1) as well as the best IFM results (S4) in all months.

Download Full-text

An Improved Coronary Heart Disease Predictive System Using Random Forest

Asian Journal of Research in Computer Science ◽

10.9734/ajrcos/2021/v11i130253 ◽

2021 ◽

pp. 17-27

Author(s):

Abdulraheem Abdul ◽

Rafiu M. Isiaka ◽

Ronke S. Babatunde ◽

Jumoke F. Ajao

Keyword(s):

Neural Network ◽

Coronary Heart Disease ◽

Feature Selection ◽

Heart Disease ◽

Random Forest ◽

Cross Validation ◽

Nearest Neighbor ◽

Support Vector ◽

Disease Prediction ◽

K Nearest Neighbor

Aims: This work aim is to develop an enhanced predictive system for Coronary Heart Disease (CHD). Study Design: Synthetic Minority Oversampling Technique and Random Forest. Methodology: The Framingham heart disease dataset was used, which was collected from a study in Framingham, Massachusetts, the data was cleaned, normalized, rebalanced. Classifiers such as random forest, artificial neural network, naïve bayes, logistic regression, k-nearest neighbor and support vector machine were used for classification. Results: Random Forest outperformed other classifiers with an accuracy of 98%, a sensitivity of 99% and a precision of 95.8%. Feature selection was employed for better classification, but no significant improvement was recorded on the performance of the classifier with feature selection. Train test split also performed better that cross validation. Conclusion: Random Forest is recommended for research in Coronary Heart Disease prediction domain.

Download Full-text