feature selection technique Latest Research Papers

Deep-Learning-Based Cancer Profiles Classification Using Gene Expression Data Profile

The quantity of data required for a valid analysis grows exponentially as the dimensionality of a machine learning problem increases. In a single experiment, microarray-based gene expression profiling measures gene expression levels and patterns in various cell types or tissues. The advent of DNA microarray technology has enabled simultaneous monitoring of hundreds of gene expressions on a single chip, advancing cancer categorization. The most challenging aspect of categorization is making sense of many data points from many sources. The proposed approach uses microarray data to train deep learning algorithms on extracted features and then uses the Latent Feature Selection Technique to reduce classification time and increase accuracy. Feature selection techniques pick the important genes before the microarray data are classified for cancer prediction and diagnosis. These methods improve classification accuracy by removing redundant and irrelevant information. The Artificial Bee Colony (ABC) technique of feature selection was proposed in this research using bone marrow PC gene expression data. The ABC algorithm, based on swarm intelligence, has been proposed for gene identification. Here ABC performs feature selection by generating feature subsets, each produced by the onlooker bees (the "spectators") and evaluated with a classifier, which makes this a wrapper-based feature selection system. This method’s main goal is to choose the fewest genes that are critical to PC performance while also increasing prediction accuracy. Convolutional Neural Networks were used to classify tumors without labelling them. Lung, kidney, and brain cancer datasets were used in the procedure’s training and testing stages. Under k-fold cross-validation, the Convolutional Neural Network achieves an accuracy of 96.43%. The suggested research includes techniques for preprocessing and modifying gene expression data to enhance future cancer detection accuracy.
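The wrapper idea in the abstract above, where each candidate gene subset is scored by how well a classifier trained on it performs under k-fold cross-validation, can be sketched as follows. This is a minimal illustration and not the authors' method: a nearest-centroid classifier stands in for the CNN, the names `k_fold_indices`, `nearest_centroid_predict`, and `subset_fitness` and the toy data are hypothetical, and the ABC search that would propose the subsets is omitted.

```python
from statistics import mean


def k_fold_indices(n_samples, k):
    """Split range(n_samples) into k contiguous folds of near-equal size."""
    sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    folds, start = [], 0
    for size in sizes:
        folds.append(list(range(start, start + size)))
        start += size
    return folds


def nearest_centroid_predict(train_X, train_y, x):
    """Predict the label whose class centroid is closest to x (stand-in classifier)."""
    centroids = {}
    for label in set(train_y):
        rows = [r for r, y in zip(train_X, train_y) if y == label]
        centroids[label] = [mean(col) for col in zip(*rows)]
    return min(
        centroids,
        key=lambda c: sum((a - b) ** 2 for a, b in zip(x, centroids[c])),
    )


def subset_fitness(X, y, subset, k=3):
    """Wrapper fitness: mean k-fold accuracy using only the columns in `subset`."""
    Xs = [[row[j] for j in subset] for row in X]
    accuracies = []
    for fold in k_fold_indices(len(Xs), k):
        held_out = set(fold)
        train_X = [Xs[i] for i in range(len(Xs)) if i not in held_out]
        train_y = [y[i] for i in range(len(Xs)) if i not in held_out]
        correct = sum(
            nearest_centroid_predict(train_X, train_y, Xs[i]) == y[i] for i in fold
        )
        accuracies.append(correct / len(fold))
    return mean(accuracies)


# Toy data (hypothetical): column 0 separates the classes, column 1 is noise.
X = [[1.0, 5.0], [9.0, 5.1], [1.2, 4.9], [8.8, 5.0], [0.9, 5.2], [9.1, 4.8]]
y = [0, 1, 0, 1, 0, 1]

print(subset_fitness(X, y, [0]), subset_fitness(X, y, [1]))
```

A search procedure such as ABC would call `subset_fitness` on each subset it generates and keep the subsets with the highest score, optionally penalizing subset size to favor fewer genes.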

Performance analysis of machine learning models for intrusion detection system using Gini Impurity-based Weighted Random Forest (GIWRF) feature selection technique

Cybersecurity ◽ 10.1186/s42400-021-00103-8 ◽ 2022 ◽ Vol 5 (1)

Author(s): Raisa Abedin Disha ◽ Sajjad Waheed

Keyword(s): Machine Learning ◽ Feature Selection ◽ Random Forest ◽ Performance Analysis ◽ Intrusion Detection ◽ Intrusion Detection System ◽ Detection System ◽ Experimental Result ◽ Feature Selection Technique ◽ Selection Technique

Abstract: To protect networks, resources, and sensitive data, the intrusion detection system (IDS) has become a fundamental component of organizations that prevents cybercriminal activities. Several approaches have been introduced and implemented so far to thwart malicious activities. Given the effectiveness of machine learning (ML) methods, the proposed approach applied several ML models to intrusion detection. To evaluate the performance of the models, the UNSW-NB 15 and Network TON_IoT datasets were used for offline analysis. Both datasets are newer than the NSL-KDD dataset and better represent modern-day attacks. The performance analysis was carried out by training and testing the Decision Tree (DT), Gradient Boosting Tree (GBT), Multilayer Perceptron (MLP), AdaBoost, Long Short-Term Memory (LSTM), and Gated Recurrent Unit (GRU) models on the binary classification task. Because the performance of an IDS deteriorates with a high-dimensional feature vector, an optimum set of features was selected with a Gini Impurity-based Weighted Random Forest (GIWRF) model as the embedded feature selection technique. This technique employed Gini impurity as the splitting criterion of the trees and adjusted the weights for the two classes of the imbalanced data so that the learning algorithm takes the class distribution into account. Based on the importance score, 20 features were selected from UNSW-NB 15 and 10 features from the Network TON_IoT dataset. The experimental results revealed that, with the feature selection technique, DT performed better than the other trained models in this experiment. Moreover, the proposed GIWRF-DT outperformed other existing methods surveyed in the literature in terms of the F1 score.
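To make the two ingredients named in the abstract concrete (Gini impurity as the splitting criterion, and class weights for imbalanced data), here is a small sketch of a class-weighted Gini impurity and the impurity decrease of one candidate split. It is an illustration under assumptions, not the GIWRF implementation: the weighting rule `n / (n_classes * count)` is a common "balanced" heuristic chosen for the example, the abstract does not state which rule the authors used, and all function names are hypothetical.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Assumed heuristic: weight each class by n / (n_classes * class_count)."""
    counts = Counter(labels)
    n, k = len(labels), len(counts)
    return {c: n / (k * counts[c]) for c in counts}


def weighted_gini(labels, weights):
    """Gini impurity 1 - sum(p_c^2), with p_c computed from weighted class mass."""
    mass = Counter()
    for label in labels:
        mass[label] += weights[label]
    total = sum(mass.values())
    if total == 0:
        return 0.0
    return 1.0 - sum((m / total) ** 2 for m in mass.values())


def impurity_decrease(labels, feature_values, threshold, weights):
    """Weighted Gini decrease when splitting on feature_value <= threshold."""
    left = [y for y, v in zip(labels, feature_values) if v <= threshold]
    right = [y for y, v in zip(labels, feature_values) if v > threshold]

    def node_mass(ys):
        return sum(weights[y] for y in ys)

    total = node_mass(labels)
    return (
        weighted_gini(labels, weights)
        - node_mass(left) / total * weighted_gini(left, weights)
        - node_mass(right) / total * weighted_gini(right, weights)
    )


# Toy imbalanced data (hypothetical): 8 normal records (0), 2 attacks (1).
labels = [0] * 8 + [1] * 2
feature = [0.1] * 8 + [0.9] * 2

weights = balanced_class_weights(labels)
uniform = {0: 1.0, 1: 1.0}
print(weighted_gini(labels, uniform), weighted_gini(labels, weights))
print(impurity_decrease(labels, feature, 0.5, weights))
```

In a random forest, summing such impurity decreases per feature over all splits and trees gives the importance scores from which the top-ranked features (20 and 10 in the abstract) would be taken.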

A Classification Model for Multispectral Forest Datatype with the help of a Decision Tree and Wrapper Based Forward Feature Selection Technique

Lecture Notes in Networks and Systems - Advances in Distributed Computing and Machine Learning ◽ 10.1007/978-981-16-4807-6_42 ◽ 2022 ◽ pp. 444-456

Author(s): Madhusmita Sahu ◽ Rasmita Dash

Keyword(s): Feature Selection ◽ Decision Tree ◽ Classification Model ◽ Feature Selection Technique ◽ Selection Technique

Product Review Based Customer Sentiment Analysis using an Ensemble of mRMR and Forest Optimization Algorithm (FOA)

International Journal of Applied Metaheuristic Computing ◽ 10.4018/ijamc.2022010107 ◽ 2022 ◽ Vol 13 (1) ◽ pp. 0-0

Keyword(s): Feature Selection ◽ Sentiment Analysis ◽ Optimization Algorithm ◽ Nearest Neighbor ◽ Hybrid Approach ◽ Support Vector ◽ K Nearest Neighbor ◽ Feature Selection Technique ◽ Feature Selection Problem

This research presents an approach to the feature selection problem for sentiment classification using an ensemble-based classifier. It includes a hybrid feature selection approach that combines the minimum redundancy and maximum relevance (mRMR) technique with the Forest Optimization Algorithm (FOA), i.e., mRMR-FOA. Before FOA was applied to sentiment analysis, it was used as a feature selection technique on 10 different classification datasets publicly available in the UCI machine learning repository. Classifiers such as k-Nearest Neighbor (k-NN), Support Vector Machine (SVM), and Naïve Bayes (NB) were used in the ensemble-based algorithm on the available datasets. The mRMR-FOA uses Blitzer’s dataset (a survey of customer reviews of electronic products) to select the significant features. Sentiment classification was observed to improve by 12 to 18%. The results are further enhanced by the ensemble of k-NN, NB, and SVM, which reaches an accuracy of 88.47% on the sentiment classification task.
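The mRMR part of the hybrid can be illustrated with a short greedy selection over discrete features. This is a sketch under assumptions rather than the paper's procedure: it uses the common "difference" form of mRMR (relevance minus mean redundancy, both measured by mutual information), the function names and the toy data are hypothetical, and the FOA stage and the k-NN/NB/SVM ensemble are not shown.

```python
from collections import Counter
from math import log2


def mutual_information(xs, ys):
    """Mutual information (in bits) between two discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    return sum(
        (c / n) * log2((c / n) / ((px[x] / n) * (py[y] / n)))
        for (x, y), c in pxy.items()
    )


def mrmr(columns, target, n_select):
    """Greedy mRMR: maximize relevance to target minus mean redundancy
    with the features already selected."""
    relevance = [mutual_information(col, target) for col in columns]
    # Start from the single most relevant feature.
    selected = [max(range(len(columns)), key=lambda j: relevance[j])]
    while len(selected) < n_select:

        def score(j):
            redundancy = sum(
                mutual_information(columns[j], columns[s]) for s in selected
            ) / len(selected)
            return relevance[j] - redundancy

        candidates = [j for j in range(len(columns)) if j not in selected]
        selected.append(max(candidates, key=score))
    return selected


# Toy data (hypothetical): the target depends on two independent binary
# indicators a and b; column 1 is an exact duplicate of column 0.
a = [0, 0, 1, 1, 0, 0, 1, 1]
b = [0, 1, 0, 1, 0, 1, 0, 1]
target = [2 * ai + bi for ai, bi in zip(a, b)]
columns = [a, list(a), b]

print(mrmr(columns, target, n_select=2))
```

On this toy input the duplicate column is as relevant as the original but fully redundant with it, so the second pick is the independent column `b`; this is the behavior that distinguishes mRMR from ranking by relevance alone.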

A Hybrid Sentiment Analysis Approach Using Black Widow Optimization Based Feature Selection

International Journal of Information Retrieval Research ◽ 10.4018/ijirr.289955 ◽ 2022 ◽ Vol 12 (1) ◽ pp. 0-0

Keyword(s): Feature Selection ◽ Sentiment Analysis ◽ Computation Time ◽ Online Reviews ◽ Reduction Technique ◽ Feature Reduction ◽ Analysis Approach ◽ Feature Selection Technique ◽ Set Size ◽ Feature Selection Techniques

This paper proposes a novel hybrid framework with a Black Widow Optimization (BWO) based feature reduction technique that combines the merits of machine learning and lexicon-based approaches to attain better scalability and accuracy. The scalability problem arises from noisy, irrelevant, and unique features among those extracted by the proposed approach, which can be eliminated by adopting an effective feature reduction technique. In the proposed BWO approach, the feature-set size is reduced by up to 43% without changing the accuracy (90%). The proposed feature selection technique outperforms the commonly used PSO- and GA-based feature selection techniques, with a reduced computation time of 21 sec. Moreover, the sentiment analysis approach is evaluated using performance metrics such as precision, recall, F-measure, and computation time. Many organizations can use these online reviews to make well-informed decisions about users’ interests and preferences, to enhance customer satisfaction and product quality, and to find aspects of their products to improve, thereby generating more profit.

Hybrid binary Butterfly Optimization algorithm and Simulated Annealing for Feature Selection Problem

International Journal of Applied Metaheuristic Computing ◽ 10.4018/ijamc.2022010104 ◽ 2022 ◽ Vol 13 (1) ◽ pp. 0-0

Keyword(s): Feature Selection ◽ Simulated Annealing ◽ Optimization Algorithm ◽ Optimization Problems ◽ Hybrid Approach ◽ Feature Subset ◽ Feature Selection Technique ◽ Feature Selection Problem ◽ Optimal Feature Subset ◽ Optimal Feature

Feature selection is performed to eliminate irrelevant features to reduce computational overheads. Metaheuristic algorithms have become popular for the task of feature selection due to their effectiveness and flexibility. Hybridization of two or more such metaheuristics has become popular in solving optimization problems. In this paper, we propose a hybrid wrapper feature selection technique based on binary butterfly optimization algorithm (bBOA) and Simulated Annealing (SA). The SA is combined with the bBOA in a pipeline fashion such that the best solution obtained by the bBOA is passed on to the SA for further improvement. The SA solution improves the best solution obtained so far by searching in its neighborhood. Thus the SA tries to enhance the exploitation property of the bBOA. The proposed method is tested on twenty datasets from the UCI repository and the results are compared with five popular algorithms for feature selection. The results confirm the effectiveness of the hybrid approach in improving the classification accuracy and selecting the optimal feature subset.
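The second stage of the pipeline described above, where simulated annealing refines a feature subset handed over by another optimizer, can be sketched as a bit-flip neighborhood search over a binary feature mask. This is a minimal illustration under assumptions: the bBOA stage is not implemented (a fixed starting mask stands in for its best solution), the geometric cooling schedule and its parameters are arbitrary choices, and `toy_fitness` is a hypothetical stand-in for classifier accuracy with a size penalty.

```python
import math
import random


def simulated_annealing(initial_mask, fitness, n_iter=200, t0=1.0, cooling=0.95, seed=0):
    """Refine a binary feature mask by flipping one bit at a time.

    Worse neighbors are accepted with probability exp(delta / T), which
    shrinks as the temperature T cools; the best mask seen is returned.
    """
    rng = random.Random(seed)
    current = list(initial_mask)
    current_fit = fitness(current)
    best, best_fit = list(current), current_fit
    temperature = t0
    for _ in range(n_iter):
        neighbor = list(current)
        flip = rng.randrange(len(neighbor))
        neighbor[flip] = 1 - neighbor[flip]
        neighbor_fit = fitness(neighbor)
        delta = neighbor_fit - current_fit
        if delta >= 0 or rng.random() < math.exp(delta / temperature):
            current, current_fit = neighbor, neighbor_fit
            if current_fit > best_fit:
                best, best_fit = list(current), current_fit
        temperature = max(temperature * cooling, 1e-12)
    return best, best_fit


# Hypothetical fitness: reward selecting truly relevant features and
# penalize subset size (a proxy for accuracy minus a size term).
RELEVANT = [1, 0, 1, 0, 0, 1, 0, 0]


def toy_fitness(mask):
    hits = sum(1 for m, r in zip(mask, RELEVANT) if m and r)
    return hits - 0.1 * sum(mask)


start = [1, 1, 1, 1, 0, 0, 0, 0]  # stand-in for the mask produced by bBOA
best_mask, best_score = simulated_annealing(start, toy_fitness)
print(best_mask, best_score)
```

Because the function returns the best mask encountered, including the starting one, the refinement can never make the handed-over solution worse, which matches the role the abstract assigns to SA as an exploitation step.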

A two-phase feature selection technique using mutual information and XGB-RFE for credit card fraud detection

International Journal of Advanced Technology and Engineering Exploration ◽ 10.19101/ijatee.2021.874615 ◽ 2021 ◽ Vol 8 (85)

Author(s): C. Victoria Priscilla ◽ D. Padma Prabha

Keyword(s): Feature Selection ◽ Mutual Information ◽ Credit Card ◽ Fraud Detection ◽ Two Phase ◽ Feature Selection Technique ◽ Credit Card Fraud ◽ Selection Technique

Design and Application of Radial Basis Function Neural Networks Pattern Classifier with Feature Selection Technique Based on Random Forest for Partial Discharge Pattern Classification

Journal of Korean institute of intelligent systems ◽ 10.5391/jkiis.2021.31.6.526 ◽ 2021 ◽ Vol 31 (6) ◽ pp. 526-532

Author(s): Seok-Beom Roh ◽ Sung-Kwun Oh ◽ Young-Eil Kim

Keyword(s): Neural Networks ◽ Feature Selection ◽ Random Forest ◽ Radial Basis Function ◽ Pattern Classification ◽ Basis Function ◽ Partial Discharge ◽ Discharge Pattern ◽ Feature Selection Technique ◽ Selection Technique

Feature Entropy Estimation (FEE) for Malicious IoT Traffic and Detection Using Machine Learning

Mobile Information Systems ◽ 10.1155/2021/8091363 ◽ 2021 ◽ Vol 2021 ◽ pp. 1-13

Author(s): Tarun Dhar Diwan ◽ Siddartha Choubey ◽ H. S. Hota ◽ S. B. Goyal ◽ Sajjad Shaukat Jamal ◽ ...

Keyword(s): Machine Learning ◽ Feature Selection ◽ Low Cost ◽ Pearson Correlation ◽ Low Complexity ◽ Computational Time ◽ Chi Square ◽ Feature Selection Technique ◽ Detection Techniques ◽ Entropy Estimation

Identifying anomalous and malicious traffic in Internet of Things (IoT) networks is essential for IoT security. Tracking and blocking unwanted traffic flows in an IoT network calls for a framework that identifies attacks more accurately, more quickly, and with less complexity. Many machine learning (ML) algorithms have proved efficient at detecting intrusions in IoT networks. However, these ML algorithms suffer from many misclassification problems caused by inappropriate and irrelevant features. In this paper, an in-depth study is presented to address such issues. We present lightweight, low-cost, feature-selection-based IoT intrusion detection techniques that combine low complexity and low computational time with high accuracy. A novel feature selection technique is proposed that integrates rank-based chi-square, Pearson correlation, and score correlation to extract the relevant features from all features available in the dataset. Then, feature entropy estimation is applied to validate the relationship among the extracted features in order to identify malicious traffic in IoT networks. Finally, an extreme gradient ensemble boosting approach is used to classify the features into the relevant attack types. The simulation is performed on three datasets, i.e., NSL-KDD, UNSW-NB15, and CICIDS2017, and results are presented on different test sets. On the NSL-KDD dataset, the observed accuracy was approx. 97.48%. Similarly, the accuracy on UNSW-NB15 and CICIDS2017 was approx. 99.96% and 99.93%, respectively. In addition, a comparison with existing state-of-the-art techniques is presented.
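As a rough illustration of rank-based filtering with chi-square and Pearson correlation, the sketch below scores each feature with both statistics and orders the features by their average rank. The averaging rule is an assumption made for the example, since the abstract does not say how the individual rankings are integrated; the "score correlation" and entropy estimation steps are not shown, and all names and data are hypothetical.

```python
from collections import Counter
from math import sqrt


def chi_square(xs, ys):
    """Chi-square statistic of the contingency table of two discrete sequences."""
    n = len(xs)
    cx, cy, cxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    stat = 0.0
    for x in cx:
        for y in cy:
            expected = cx[x] * cy[y] / n
            stat += (cxy[(x, y)] - expected) ** 2 / expected
    return stat


def pearson(xs, ys):
    """Pearson correlation coefficient; 0.0 if either sequence is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy) if sx and sy else 0.0


def ranks(scores):
    """Rank 0 goes to the highest score."""
    order = sorted(range(len(scores)), key=lambda j: -scores[j])
    result = [0] * len(scores)
    for position, j in enumerate(order):
        result[j] = position
    return result


def combined_ranking(columns, labels):
    """Order feature indices by the mean of their chi-square and |Pearson| ranks."""
    chi = [chi_square(col, labels) for col in columns]
    cor = [abs(pearson(col, labels)) for col in columns]
    average = [(a + b) / 2 for a, b in zip(ranks(chi), ranks(cor))]
    return sorted(range(len(columns)), key=lambda j: average[j])


# Toy data (hypothetical): binary traffic labels and three binary features.
labels = [0, 0, 0, 0, 1, 1, 1, 1]
noise = [0, 1, 0, 1, 0, 1, 0, 1]    # independent of the label
perfect = list(labels)              # identical to the label
partial = [0, 0, 0, 1, 0, 1, 1, 1]  # agrees with the label on 6 of 8 records

print(combined_ranking([noise, perfect, partial], labels))
```

A filter of this kind would keep the top-ranked indices and pass only those columns on to the classifier.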

A Novel Feature Selection Technique to Better Predict Climate Change Stage of Change

Sustainability ◽ 10.3390/su14010040 ◽ 2021 ◽ Vol 14 (1) ◽ pp. 40

Author(s): Hamed Naseri ◽ E. Owen D. Waygood ◽ Bobin Wang ◽ Zachary Patterson ◽ Ricardo A. Daziano

Keyword(s): Climate Change ◽ Feature Selection ◽ Environmental Concern ◽ Stage Of Change ◽ Principal Component ◽ Selection Methods ◽ Feature Selection Technique ◽ Selection Technique ◽ Testing Data ◽ New Feature

Indications of people’s environmental concern are linked to transport decisions and can provide great support for policymaking on climate change. This study aims to better predict an individual’s climate change stage of change (CC-SoC) based on different features of transport-related behavior, General Ecological Behavior, New Environmental Paradigm, and socio-demographic characteristics. Together, these sources result in over 100 possible features that indicate someone’s level of environmental concern. Such a large number of features may create several analytical problems, such as overfitting, accuracy reduction, and high computational costs. To address this, a new feature selection technique, named the Coyote Optimization Algorithm-Quadratic Discriminant Analysis (COA-QDA), is first proposed to find the optimal features to predict CC-SoC with the highest accuracy. Different conventional feature selection methods (Lasso, Elastic Net, Random Forest Feature Selection, Extra Trees, and Principal Component Analysis Feature Selection) are employed for comparison with COA-QDA. Afterward, eight classification techniques are applied to solve the prediction problem. Finally, a sensitivity analysis is performed to determine the most important features affecting the prediction of CC-SoC. The results indicate that COA-QDA outperforms the conventional feature selection methods, increasing average accuracy on the testing data by 0.7 to 5.6%. Logistic Regression surpasses the other classifiers with the highest prediction accuracy.


Deep-Learning-Based Cancer Profiles Classification Using Gene Expression Data Profile
