Multi-Layer Perceptron Neural Network Model Development for Chili Pepper Disease Diagnosis Using Filter and Wrapper Feature Selection Methods

This study develops a model for diagnosing chili pepper diseases by combining filter and wrapper feature selection methods with a Multi-Layer Perceptron Neural Network (MLPNN). The data used to develop the model cover 1) types, 2) causative agents, 3) areas of infection, 4) growth stages of infection, 5) conditions, 6) symptoms, and 7) 14 types of chili pepper diseases. Three feature selection techniques were applied to these data: information gain, gain ratio, and a wrapper method. The selected features were then used to train the MLPNN diagnosis model. In an evaluation by 10-fold cross-validation, the model built with the wrapper method and MLPNN performed best, with an accuracy of 98.91%, a precision of 98.92%, and a recall of 98.89%. These results indicate that the developed model is applicable to chili pepper disease diagnosis.
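The two filter criteria named in the abstract, information gain and gain ratio, can be illustrated with a minimal sketch for categorical features. The function names and the toy records below are hypothetical and are not taken from the study; the abstract does not describe the authors' implementation.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(feature_values, labels):
    """Reduction in label entropy after splitting on a categorical feature."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


def gain_ratio(feature_values, labels):
    """Information gain normalised by the entropy of the feature itself."""
    split_info = entropy(feature_values)
    if split_info == 0:
        return 0.0  # a constant feature carries no information
    return information_gain(feature_values, labels) / split_info


# Hypothetical toy records (assumed for illustration, not data from the study)
symptom = ["spots", "spots", "wilt", "wilt"]
stage = ["seedling", "fruiting", "seedling", "fruiting"]
disease = ["anthracnose", "anthracnose", "fusarium", "fusarium"]

scores = {
    "symptom": information_gain(symptom, disease),
    "stage": information_gain(stage, disease),
}
ranked = sorted(scores, key=scores.get, reverse=True)
```

In this toy case `symptom` separates the two diseases perfectly and ranks first, while `stage` contributes no information. A filter method would keep the top-ranked features before any classifier is trained.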


Thai Water Buffalo Disease Analysis with the Application of Feature Selection Technique and Multi-Layer Perceptron Neural Network

Engineering, Technology & Applied Science Research ◽

10.48084/etasr.4049 ◽

2021 ◽

Vol 11 (2) ◽

pp. 6907-6911

Author(s):

S. Nuanmeesri ◽

W. Sriurai

Keyword(s):

Neural Network ◽

Feature Selection ◽

Water Buffalo ◽

Information Gain ◽

Sampling Technique ◽

Multi Layer Perceptron ◽

Analysis Model ◽

Feature Selection Technique ◽

Selection Technique ◽

Correlation Based Feature Selection

This research develops an analysis model for diseases in water buffalo by combining feature selection with a Multi-Layer Perceptron neural network. The data were collected from books and documents on diseases in water buffalo and from the official website of the Department of Livestock Development. They describe the characteristics of six diseases in water buffalo: Anthrax disease, Hemorrhagic Septicemia, Brucellosis, Foot and Mouth disease, Parasitic disease, and Mastitis. Because the amount of collected data was limited, the Synthetic Minority Over-sampling Technique (SMOTE) was applied to balance the dataset. Two feature selection techniques, Correlation-based Feature Selection and Information Gain, were then applied to the balanced dataset to select the disease characteristics. The selected features were used to train the Multi-Layer Perceptron neural network. In an evaluation by 10-fold cross-validation, the model built with Correlation-based Feature Selection and the Multi-Layer Perceptron neural network performed best, with an accuracy of 99.71%, a precision of 99.70%, and a recall of 99.72%. This implies that the analysis model can be applied effectively.
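The core idea of the Synthetic Minority Over-sampling Technique is to create new minority-class points by interpolating between an existing sample and one of its nearest minority-class neighbours. The sketch below illustrates only that idea under simplifying assumptions: the function name `smote_sketch`, its parameters, and the toy points are hypothetical, and this is not the implementation used in the study.

```python
import random


def smote_sketch(minority, n_new, k=2, seed=0):
    """Create n_new synthetic points by interpolating between a minority
    sample and one of its k nearest minority neighbours."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Rank the other minority samples by squared Euclidean distance to base
        others = [p for p in minority if p is not base]
        others.sort(key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append(tuple(a + gap * (b - a) for a, b in zip(base, neighbour)))
    return synthetic


# Hypothetical numeric minority-class samples (assumed for illustration)
minority = [(1.0, 1.0), (1.2, 0.9), (0.8, 1.1)]
new_points = smote_sketch(minority, n_new=4)
```

Each synthetic point lies on a line segment between two real minority samples, so the new data stay inside the region the minority class already occupies. The sketch assumes numeric features and at least two minority samples.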


Forecasting mergers and acquisitions failure based on partial-sigmoid neural network and feature selection

PLoS ONE ◽

10.1371/journal.pone.0259575 ◽

2021 ◽

Vol 16 (11) ◽

pp. e0259575

Author(s):

Wenbin Bi ◽

Qiusheng Zhang

Keyword(s):

Neural Network ◽

Feature Selection ◽

Mergers And Acquisitions ◽

Information Gain ◽

Capital Expenditure ◽

Feature Selection Method ◽

Activation Function ◽

Gradient Boosting ◽

Selection Methods ◽

Chi2 Test

Traditional forecasting methods for mergers and acquisitions (M&A) data have two limitations that significantly reduce forecasting accuracy: (1) the data are imbalanced, that is, failed M&A cases are far fewer than successful ones (82%/18% of our sample), and (2) both the bidder and the target of the merger have numerous descriptive features, making it difficult to choose which ones to use for forecasting. This study proposes a neural network that uses a partial-sigmoid function as the activation function of the output layer (the partial-sigmoid neural network, PSNN) and compares three feature selection methods: the chi-square (chi2) test, information gain, and gradient boosting decision tree (GBDT). Experimental results show that the PSNN (improvements of up to 0.37 in precision, 0.49 in recall, 0.41 in G-Mean, and 0.23 in F1-measure) and feature selection (improvements of 1.83%-13.16% in accuracy) effectively mitigate the adverse effects of these two limitations on forecasting. Scholars who have studied the forecasting of merger failure have overlooked three important features: assets of the previous year, market value, and capital expenditure. The chi2 test is the best of the three feature selection methods.
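A chi-square feature score of the kind compared in this abstract can be sketched as Pearson's statistic on a contingency table of feature value against outcome. The function name and the counts below are hypothetical and are not drawn from the paper's M&A sample.

```python
def chi2_statistic(table):
    """Pearson chi-square statistic for an r x c contingency table of counts.

    Assumes every row total and column total is non-zero.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if the feature and the outcome were independent
            expected = row_totals[i] * col_totals[j] / grand_total
            stat += (observed - expected) ** 2 / expected
    return stat


# Hypothetical counts: rows = feature present/absent, columns = success/failure
informative = [[40, 10], [10, 40]]
uninformative = [[25, 25], [25, 25]]

informative_score = chi2_statistic(informative)      # 36.0
uninformative_score = chi2_statistic(uninformative)  # 0.0
```

A larger statistic means the feature's distribution differs more strongly between outcomes, so features would be ranked by this score and the highest-scoring ones kept.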


Deep Neural Network Feature Selection Approaches for Data-Driven Prognostic Model of Aircraft Engines

Aerospace ◽

10.3390/aerospace7090132 ◽

2020 ◽

Vol 7 (9) ◽

pp. 132

Author(s):

Phattara Khumprom ◽

David Grewell ◽

Nita Yodo

Keyword(s):

Neural Network ◽

Feature Selection ◽

Gas Turbine ◽

Deep Neural Network ◽

Data Driven ◽

Turbine Engines ◽

Gas Turbine Engines ◽

Aircraft Engines ◽

Selection Methods ◽

Wrapper Feature Selection

Predicting the Remaining Useful Life (RUL) of systems plays an important role in many fields of reliability engineering analysis, including aircraft engines. RUL prediction is a critical part of Prognostics and Health Management (PHM), the reliability science that aims to increase system reliability and, in turn, reduce maintenance cost. Among the PHM models proposed in the past few years, data-driven deployments have increased significantly. While more complex data-driven models are often associated with higher accuracy, there is a corresponding need to reduce model complexity. One possible way to do so is to apply feature (attribute or variable) selection and dimensionality reduction methods before model training. In this work, the effectiveness of multiple filter and wrapper feature selection methods (correlation analysis, relief forward/backward selection, and others), along with Principal Component Analysis (PCA) as a dimensionality reduction method, was investigated. A basic deep learning algorithm, the Feedforward Artificial Neural Network (FFNN), was used as the benchmark modeling algorithm. All of these approaches can also be applied to the prognostics of aircraft gas turbine engines. In this paper, aircraft gas turbine engine data from the NASA Ames prognostics data repository were used to test the effectiveness of the filter and wrapper feature selection methods, not only for the vanilla FFNN model but also for a Deep Neural Network (DNN) model. The findings show that applying feature selection methods helps to improve overall model accuracy and significantly reduces the complexity of the models.
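A wrapper method scores candidate feature subsets by training and evaluating a model on each of them. The sketch below shows greedy forward selection under simplifying assumptions: it wraps a 1-nearest-neighbour classifier scored by leave-one-out accuracy instead of the FFNN used in the paper, and the function names and toy data are hypothetical.

```python
def loo_accuracy(rows, labels, features):
    """Leave-one-out accuracy of a 1-nearest-neighbour classifier
    restricted to the given feature indices."""
    correct = 0
    for i, row in enumerate(rows):
        best_dist, best_label = None, None
        for j, other in enumerate(rows):
            if i == j:
                continue  # never compare a sample with itself
            dist = sum((row[f] - other[f]) ** 2 for f in features)
            if best_dist is None or dist < best_dist:
                best_dist, best_label = dist, labels[j]
        correct += best_label == labels[i]
    return correct / len(rows)


def forward_selection(rows, labels, score=loo_accuracy):
    """Greedy wrapper: repeatedly add the feature that most improves the score."""
    remaining = list(range(len(rows[0])))
    selected, best_score = [], 0.0
    while remaining:
        candidates = [(score(rows, labels, selected + [f]), f) for f in remaining]
        top_score, top_feature = max(candidates)
        if top_score <= best_score:
            break  # no candidate improves on the current subset
        selected.append(top_feature)
        remaining.remove(top_feature)
        best_score = top_score
    return selected, best_score


# Hypothetical toy data: feature 0 separates the classes, feature 1 is noise
rows = [(0.1, 5.0), (0.2, 1.0), (0.15, 3.0), (0.9, 5.1), (1.0, 1.1), (0.95, 3.1)]
labels = [0, 0, 0, 1, 1, 1]
selected, best = forward_selection(rows, labels)
```

Here the search keeps only feature 0, because adding the noisy feature does not raise the score. Backward selection follows the same pattern in reverse, starting from all features and removing one at a time.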


Evaluating the Effectiveness of Wrapper Feature Selection Methods with Artificial Neural Network Classifier for Diabetes Prediction

Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering - Testbeds and Research Infrastructures for the Development of Networks and Communications ◽

10.1007/978-3-030-43215-7_1 ◽

2020 ◽

pp. 3-17 ◽

Cited By ~ 1

Author(s):

M. A. Fahmiin ◽

T. H. Lim

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Feature Selection ◽

Neural Network Classifier ◽

Selection Methods ◽

Artificial Neural Network Classifier ◽

Diabetes Prediction ◽

Artificial Neural ◽

Wrapper Feature Selection


Research on Pepper Disease Diagnosis Model Based on Genetic Neural network

MATEC Web of Conferences ◽

10.1051/matecconf/201712802010 ◽

2017 ◽

Vol 128 ◽

pp. 02010

Author(s):

Qingfeng Wei ◽

Sufen Sun ◽

ChangShou Luo

Keyword(s):

Neural Network ◽

Disease Diagnosis ◽

Model Based ◽

Genetic Neural Network ◽

Diagnosis Model


Search-Based Wrapper Feature Selection Methods in Software Defect Prediction: An Empirical Analysis

Intelligent Algorithms in Software Engineering - Advances in Intelligent Systems and Computing ◽

10.1007/978-3-030-51965-0_43 ◽

2020 ◽

pp. 492-503 ◽

Cited By ~ 1

Author(s):

Abdullateef O. Balogun ◽

Shuib Basri ◽

Said A. Jadid ◽

Saipunidzam Mahamad ◽

Malek A. Al-momani ◽

...

Keyword(s):

Feature Selection ◽

Empirical Analysis ◽

Defect Prediction ◽

Software Defect Prediction ◽

Selection Methods ◽

Software Defect ◽

Wrapper Feature Selection


Fault diagnosis model for photovoltaic array using a dual-channels convolutional neural network with a feature selection structure

Energy Conversion and Management ◽

10.1016/j.enconman.2021.114777 ◽

2021 ◽

Vol 248 ◽

pp. 114777

Author(s):

Xiaoyang Lu ◽

Peijie Lin ◽

Shuying Cheng ◽

Gengfa Fang ◽

Xiangjian He ◽

...

Keyword(s):

Neural Network ◽

Feature Selection ◽

Fault Diagnosis ◽

Convolutional Neural Network ◽

Photovoltaic Array ◽

Dual Channels ◽

Diagnosis Model


To use or not to use: Feature selection for sentiment analysis of highly imbalanced data

Natural Language Engineering ◽

10.1017/s1351324917000298 ◽

2017 ◽

Vol 24 (1) ◽

pp. 3-37 ◽

Cited By ~ 5

Author(s):

SANDRA KÜBLER ◽

CAN LIU ◽

ZEESHAN ALI SAYYED

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Sentiment Analysis ◽

Information Gain ◽

Binary Classification ◽

Small Subset ◽

Large Set ◽

Learning Approaches ◽

Selection Methods ◽

Data Set

We investigate feature selection methods for machine learning approaches in sentiment analysis. More specifically, we use data from the cooking platform Epicurious and attempt to predict ratings for recipes based on user reviews. In machine learning approaches to such tasks, it is common to use word or part-of-speech n-grams. This results in a large set of features, out of which only a small subset may be good indicators for the sentiment. One of the questions we investigate concerns the extension of feature selection methods from a binary classification setting to a multi-class problem. We show that an inherently multi-class approach, multi-class information gain, outperforms ensembles of binary methods. We also investigate how to mitigate the effects of extreme skewing in our data set by making our features more robust and by using review and recipe sampling. We show that over-sampling is the best method for boosting performance on the minority classes, but it also results in a severe drop in overall accuracy of at least 6 percentage points.
