The Impact of Feature Selection on Defect Prediction Performance: An Empirical Comparison

Software testing using software defect prediction aims to detect as many defects as possible in software before the software release. This plays an important role in ensuring quality and reliability. Software defect prediction can be modeled as a classification problem that classifies software modules into two classes: defective and non-defective; and classification algorithms are used for this process. This study investigated the impact of feature selection methods on classification via clustering techniques for software defect prediction. Three clustering techniques were selected; Farthest First Clusterer, K-Means and Make-Density Clusterer, and three feature selection methods: Chi-Square, Clustering Variation, and Information Gain were used on software defect datasets from NASA repository. The best software defect prediction model was farthest-first using information gain feature selection method with an accuracy of 78.69%, precision value of 0.804 and recall value of 0.788. The experimental results showed that the use of clustering techniques as a classifier gave a good predictive performance and feature selection methods further enhanced their performance. This indicates that classification via clustering techniques can give competitive results against standard classification methods with the advantage of not having to train any model using labeled dataset; as it can be used on the unlabeled datasets.Keywords: Classification, Clustering, Feature Selection, Software Defect PredictionVol. 26, No 1, June, 2019

Download Full-text

Impact of Feature Selection Methods on the Predictive Performance of Software Defect Prediction Models: An Extensive Empirical Study

Symmetry ◽

10.3390/sym12071147 ◽

2020 ◽

Vol 12 (7) ◽

pp. 1147 ◽

Cited By ~ 2

Author(s):

Abdullateef O. Balogun ◽

Shuib Basri ◽

Saipunidzam Mahamad ◽

Said J. Abdulkadir ◽

Malek A. Almomani ◽

...

Keyword(s):

Feature Selection ◽

Empirical Study ◽

Prediction Models ◽

Empirical Studies ◽

Experimental Results ◽

Defect Prediction ◽

Software Defect Prediction ◽

Search Methods ◽

Software Defect ◽

The Impact

Feature selection (FS) is a feasible solution for mitigating high dimensionality problem, and many FS methods have been proposed in the context of software defect prediction (SDP). Moreover, many empirical studies on the impact and effectiveness of FS methods on SDP models often lead to contradictory experimental results and inconsistent findings. These contradictions can be attributed to relative study limitations such as small datasets, limited FS search methods, and unsuitable prediction models in the respective scope of studies. It is hence critical to conduct an extensive empirical study to address these contradictions to guide researchers and buttress the scientific tenacity of experimental conclusions. In this study, we investigated the impact of 46 FS methods using Naïve Bayes and Decision Tree classifiers over 25 software defect datasets from 4 software repositories (NASA, PROMISE, ReLink, and AEEEM). The ensuing prediction models were evaluated based on accuracy and AUC values. Scott–KnottESD and the novel Double Scott–KnottESD rank statistical methods were used for statistical ranking of the studied FS methods. The experimental results showed that there is no one best FS method as their respective performances depends on the choice of classifiers, performance evaluation metrics, and dataset. However, we recommend the use of statistical-based, probability-based, and classifier-based filter feature ranking (FFR) methods, respectively, in SDP. For filter subset selection (FSS) methods, correlation-based feature selection (CFS) with metaheuristic search methods is recommended. For wrapper feature selection (WFS) methods, the IWSS-based WFS method is recommended as it outperforms the conventional SFS and LHS-based WFS methods.

Download Full-text

An Empirical Study on The Impact of The Interaction between Feature Selection and Sampling in Defect Prediction

2020 7th International Conference on Dependable Systems and Their Applications (DSA) ◽

10.1109/dsa51864.2020.00025 ◽

2020 ◽

Author(s):

Shuyue Fan ◽

Chun Liu ◽

Zheng Li

Keyword(s):

Feature Selection ◽

Empirical Study ◽

Defect Prediction ◽

The Impact

Download Full-text

Inflated prediction accuracy of neuropsychiatric biomarkers caused by data leakage in feature selection

Scientific Reports ◽

10.1038/s41598-021-87157-3 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Miseon Shim ◽

Seung-Hwan Lee ◽

Han-Jeong Hwang

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Feature Selection Method ◽

Selection Method ◽

Prediction Performance ◽

Machine Learning Techniques ◽

Learning Techniques ◽

Eeg Data ◽

Neuropsychiatric Diseases ◽

The Impact

AbstractIn recent years, machine learning techniques have been frequently applied to uncovering neuropsychiatric biomarkers with the aim of accurately diagnosing neuropsychiatric diseases and predicting treatment prognosis. However, many studies did not perform cross validation (CV) when using machine learning techniques, or others performed CV in an incorrect manner, leading to significantly biased results due to overfitting problem. The aim of this study is to investigate the impact of CV on the prediction performance of neuropsychiatric biomarkers, in particular, for feature selection performed with high-dimensional features. To this end, we evaluated prediction performances using both simulation data and actual electroencephalography (EEG) data. The overall prediction accuracies of the feature selection method performed outside of CV were considerably higher than those of the feature selection method performed within CV for both the simulation and actual EEG data. The differences between the prediction accuracies of the two feature selection approaches can be thought of as the amount of overfitting due to selection bias. Our results indicate the importance of correctly using CV to avoid biased results of prediction performance of neuropsychiatric biomarkers.

Download Full-text

An Empirical Study of Software Metrics Diversity for Cross-Project Defect Prediction

Mathematical Problems in Engineering ◽

10.1155/2021/3135702 ◽

2021 ◽

Vol 2021 ◽

pp. 1-11

Author(s):

Yiwen Zhong ◽

Kun Song ◽

ShengKai Lv ◽

Peng He

Keyword(s):

Software Metrics ◽

Historical Data ◽

Prediction Performance ◽

Defect Prediction ◽

Improvement Rate ◽

Modeling Techniques ◽

Structural Metrics ◽

The Impact ◽

F Measure ◽

Cross Project

Cross-project defect prediction (CPDP) is a mainstream method estimating the most defect-prone components of software with limited historical data. Several studies investigate how software metrics are used and how modeling techniques influence prediction performance. However, the software’s metrics diversity impact on the predictor remains unclear. Thus, this paper aims to assess the impact of various metric sets on CPDP and investigate the feasibility of CPDP with hybrid metrics. Based on four software metrics types, we investigate the impact of various metric sets on CPDP in terms of F-measure and statistical methods. Then, we validate the dominant performance of CPDP with hybrid metrics. Finally, we further verify the CPDP-OSS feasibility built with three types of metrics (orient-object, semantic, and structural metrics) and challenge them against two current models. The experimental results suggest that the impact of different metric sets on the performance of CPDP is significantly distinct, with semantic and structural metrics performing better. Additionally, trials indicate that it is helpful for CPDP to increase the software’s metrics diversity appropriately, as the CPDP-OSS improvement is up to 53.8%. Finally, compared with two baseline methods, TCA+ and TDSelector, the optimized CPDP model is viable in practice, and the improvement rate is up to 50.6% and 25.7%, respectively.

Download Full-text

RFC: A feature selection algorithm for software defect prediction

Journal of Systems Engineering and Electronics ◽

10.23919/jsee.2021.000032 ◽

2021 ◽

Vol 32 (2) ◽

pp. 389-398

Author(s):

Xu Xiaolong ◽

Chen Wen ◽

Wang Xinheng

Keyword(s):

Feature Selection ◽

Defect Prediction ◽

Software Defect Prediction ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

Software Defect

Download Full-text

A comprehensive investigation of the impact of feature selection techniques on crashing fault residence prediction models

Information and Software Technology ◽

10.1016/j.infsof.2021.106652 ◽

2021 ◽

pp. 106652

Author(s):

Kunsong Zhao ◽

Zhou Xu ◽

Meng Yan ◽

Tao Zhang ◽

Dan Yang ◽

...

Keyword(s):

Feature Selection ◽

Prediction Models ◽

Comprehensive Investigation ◽

The Impact ◽

Feature Selection Techniques

Download Full-text

A Novel Feature Selection Approach based on Binary Particle Swarm Optimization and Ensemble Learning for Heterogeneous Defect Prediction

2021 3rd Asia Pacific Information Technology Conference ◽

10.1145/3449365.3449384 ◽

2021 ◽

Author(s):

Ruchika Malhotra ◽

Anmol Budhiraja ◽

Abhinav Kumar Singh ◽

Ishani Ghoshal

Keyword(s):

Feature Selection ◽

Particle Swarm Optimization ◽

Ensemble Learning ◽

Particle Swarm ◽

Defect Prediction ◽

Binary Particle Swarm Optimization ◽

Swarm Optimization ◽

Selection Approach ◽

Feature Selection Approach ◽

Heterogeneous Defect Prediction

Download Full-text

Radiomics side experiments and DAFIT approach in identifying pulmonary hypertension using Cardiac MRI derived radiomics based machine learning models

Scientific Reports ◽

10.1038/s41598-021-92155-6 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Sarv Priya ◽

Tanya Aggarwal ◽

Caitlin Ward ◽

Girish Bathla ◽

Mathews Jacob ◽

...

Keyword(s):

Machine Learning ◽

Pulmonary Hypertension ◽

Feature Selection ◽

Subgroup Analysis ◽

Cardiac Mri ◽

Intraclass Correlation ◽

Poor Performance ◽

Superior Performance ◽

Feature Filtering ◽

The Impact

AbstractSide experiments are performed on radiomics models to improve their reproducibility. We measure the impact of myocardial masks, radiomic side experiments and data augmentation for information transfer (DAFIT) approach to differentiate patients with and without pulmonary hypertension (PH) using cardiac MRI (CMRI) derived radiomics. Feature extraction was performed from the left ventricle (LV) and right ventricle (RV) myocardial masks using CMRI in 82 patients (42 PH and 40 controls). Various side study experiments were evaluated: Original data without and with intraclass correlation (ICC) feature-filtering and DAFIT approach (without and with ICC feature-filtering). Multiple machine learning and feature selection strategies were evaluated. Primary analysis included all PH patients with subgroup analysis including PH patients with preserved LVEF (≥ 50%). For both primary and subgroup analysis, DAFIT approach without feature-filtering was the highest performer (AUC 0.957–0.958). ICC approaches showed poor performance compared to DAFIT approach. The performance of combined LV and RV masks was superior to individual masks alone. There was variation in top performing models across all approaches (AUC 0.862–0.958). DAFIT approach with features from combined LV and RV masks provide superior performance with poor performance of feature filtering approaches. Model performance varies based upon the feature selection and model combination.

Download Full-text

The Impact of Feature Selection on Defect Prediction Performance: An Empirical Comparison

What is the Impact of Imbalance on Software Defect Prediction Performance?

Impact of feature selection on classification via clustering techniques in software defect prediction

Impact of Feature Selection Methods on the Predictive Performance of Software Defect Prediction Models: An Extensive Empirical Study

An Empirical Study on The Impact of The Interaction between Feature Selection and Sampling in Defect Prediction

Inflated prediction accuracy of neuropsychiatric biomarkers caused by data leakage in feature selection

An Empirical Study of Software Metrics Diversity for Cross-Project Defect Prediction

RFC: A feature selection algorithm for software defect prediction

A comprehensive investigation of the impact of feature selection techniques on crashing fault residence prediction models

A Novel Feature Selection Approach based on Binary Particle Swarm Optimization and Ensemble Learning for Heterogeneous Defect Prediction

Radiomics side experiments and DAFIT approach in identifying pulmonary hypertension using Cardiac MRI derived radiomics based machine learning models

Export Citation Format