Software Fault Prediction Using Filtering Feature Selection in Cluster-Based Classification

Improving software product quality before release through periodic testing is one of the most expensive activities in software projects. Because the resources available for testing modules are limited, it is important to identify fault-prone modules and concentrate testing resources on them. Software fault predictors based on machine learning algorithms are effective tools for identifying fault-prone modules. Extensive studies in this field seek the relationship between the features of software modules and their fault-proneness. Some of the features used by predictive algorithms are ineffective and reduce prediction accuracy, so feature selection methods are widely used to improve the performance of fault prediction models. In this study, we propose a feature selection method that combines several filter feature selection methods into a fused weighted filter method. The proposed method improves both the convergence rate of feature selection and the prediction accuracy. Results on ten datasets from NASA and PROMISE indicate that the proposed method improves the accuracy and convergence of software fault prediction.
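The abstract does not say which filters are fused or how they are weighted, so the following is only a minimal sketch of the general idea: score every feature with several filter methods, rescale each filter's scores to a common range, and rank features by a weighted sum. The two filters (absolute Pearson correlation and a Fisher score), the equal weights, the function names, and the toy data are all assumptions made for illustration, not the authors' method.

```python
from math import sqrt
from statistics import mean, pvariance


def pearson_abs(xs, ys):
    """Absolute Pearson correlation between one feature column and the labels."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return abs(cov / (sx * sy)) if sx and sy else 0.0


def fisher_score(xs, ys):
    """Squared gap between class means divided by within-class variance (binary labels)."""
    pos = [x for x, y in zip(xs, ys) if y == 1]
    neg = [x for x, y in zip(xs, ys) if y == 0]
    if not pos or not neg:
        return 0.0
    within = pvariance(pos) + pvariance(neg)
    return (mean(pos) - mean(neg)) ** 2 / within if within else 0.0


def min_max(scores):
    """Rescale scores to [0, 1] so that different filters are comparable."""
    lo, hi = min(scores), max(scores)
    return [(s - lo) / (hi - lo) if hi > lo else 0.0 for s in scores]


def fused_ranking(columns, labels, filters, weights):
    """Rank feature indices by a weighted sum of normalized filter scores."""
    fused = [0.0] * len(columns)
    for filt, w in zip(filters, weights):
        normalized = min_max([filt(col, labels) for col in columns])
        fused = [f + w * s for f, s in zip(fused, normalized)]
    return sorted(range(len(columns)), key=lambda i: fused[i], reverse=True)


# Toy data (hypothetical): three module metrics as columns, 1 = faulty module.
columns = [
    [1, 2, 8, 9, 10, 1],  # tracks the label closely
    [5, 5, 5, 5, 5, 5],   # constant, carries no information
    [3, 1, 4, 1, 5, 9],   # mostly noise
]
labels = [0, 0, 1, 1, 1, 0]
ranking = fused_ranking(columns, labels, [pearson_abs, fisher_score], [0.5, 0.5])
print(ranking)  # prints [0, 2, 1]
```

Normalizing before summing keeps a filter with a large numeric range from dominating the fused score; in practice the weights would have to be tuned or learned rather than fixed at 0.5 each.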


Combining feature selection, feature learning and ensemble learning for software fault prediction

2019 11th International Conference on Knowledge and Systems Engineering (KSE) ◽

10.1109/kse.2019.8919292 ◽

2019 ◽

Author(s):

Hung Duy Tran ◽

LE Thi My Hanh ◽

Nguyen Thanh Binh

Keyword(s):

Feature Selection ◽

Ensemble Learning ◽

Feature Learning ◽

Fault Prediction ◽

Software Fault Prediction ◽

Software Fault


A Hybrid Feature Selection Method for Software Fault Prediction

IEICE Transactions on Information and Systems ◽

10.1587/transinf.2019edp7033 ◽

2019 ◽

Vol E102.D (10) ◽

pp. 1966-1975

Author(s):

Yiheng JIAN ◽

Xiao YU ◽

Zhou XU ◽

Ziyi MA

Keyword(s):

Feature Selection ◽

Feature Selection Method ◽

Selection Method ◽

Fault Prediction ◽

Software Fault Prediction ◽

Software Fault


Genetic Evolutionary Learning Fitness Function (GELFF) for Feature Diagnosis to Software Fault Prediction

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.k1233.09811s19 ◽

2019 ◽

Vol 8 (11S) ◽

pp. 1151-1161

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Fitness Function ◽

Fault Prediction ◽

Evolutionary Learning ◽

Software Fault Prediction ◽

Underlying Process ◽

Learning Function ◽

Wide Range ◽

Software Fault

Nowadays, proper feature selection for fault prediction is a very perplexing task. Improper feature selection may lead to bad results. To avoid this, there is a need to find the aridity of software faults. This is achieved by finding the fitness of the evolutionary algorithmic function. In this paper, we finalize the genetic evolutionary nature of our feature set with the help of a fitness function. Feature selection is the objective of the prediction model, which creates the underlying process of generalized data. For a wide range of data, such as fault datasets, a better objective function is obtained through feature selection, ranking, elimination and construction. In this paper, we focus on finding the fitness of the machine learning function used to diagnose faults in software for better classification.
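The abstract gives no definition of the fitness function, so the sketch below only illustrates how a genetic search over feature subsets is commonly driven by one. Everything in it is an assumption: the fitness (training accuracy of a nearest-centroid classifier minus a small per-feature penalty), the operators (truncation selection, one-point crossover, single bit-flip mutation), the names, and the toy data. It is not the GELFF formulation.

```python
import random


def centroid_accuracy(rows, labels, mask):
    """Training accuracy of a nearest-centroid classifier on the masked features."""
    idx = [i for i, bit in enumerate(mask) if bit]
    if not idx:
        return 0.0
    cents = {}
    for c in set(labels):
        members = [r for r, y in zip(rows, labels) if y == c]
        cents[c] = [sum(r[i] for r in members) / len(members) for i in idx]
    hits = 0
    for r, y in zip(rows, labels):
        pred = min(cents, key=lambda c: sum((r[i] - m) ** 2 for i, m in zip(idx, cents[c])))
        hits += pred == y
    return hits / len(rows)


def fitness(rows, labels, mask, penalty=0.01):
    """Reward accuracy and lightly penalize the number of selected features."""
    return centroid_accuracy(rows, labels, mask) - penalty * sum(mask)


def evolve(rows, labels, generations=30, pop_size=12, seed=0):
    """Search for a 0/1 feature mask with high fitness (needs at least two features)."""
    rng = random.Random(seed)
    n = len(rows[0])
    pop = [[rng.randint(0, 1) for _ in range(n)] for _ in range(pop_size)]

    def score(mask):
        return fitness(rows, labels, mask)

    for _ in range(generations):
        pop.sort(key=score, reverse=True)
        parents = pop[: pop_size // 2]        # truncation selection, keeps the elite
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n)         # one-point crossover
            child = a[:cut] + b[cut:]
            child[rng.randrange(n)] ^= 1      # single bit-flip mutation
            children.append(child)
        pop = parents + children
    return max(pop, key=score)


# Toy data (hypothetical): only the first metric separates faulty from clean modules.
rows = [
    [1.0, 5.0, 0.2, 3.0], [1.2, 1.0, 0.9, 3.1], [0.9, 4.0, 0.4, 2.9], [1.1, 2.0, 0.7, 3.0],
    [3.0, 5.0, 0.3, 3.0], [3.2, 1.0, 0.8, 2.9], [2.9, 4.0, 0.5, 3.1], [3.1, 2.0, 0.6, 3.0],
]
labels = [0, 0, 0, 0, 1, 1, 1, 1]
best = evolve(rows, labels)
print(best, fitness(rows, labels, best))
```

The size penalty is one simple way to make the search prefer small subsets; scoring on held-out data instead of training data would give a more honest fitness.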


A Hybrid Feature Selection Model For Software Fault Prediction

International Journal on Computational Science & Applications ◽

10.5121/ijcsa.2012.2203 ◽

2012 ◽

Vol 2 (2) ◽

pp. 25-35 ◽

Cited By ~ 3

Author(s):

C Akalya devi

Keyword(s):

Feature Selection ◽

Selection Model ◽

Fault Prediction ◽

Software Fault Prediction ◽

Software Fault


Iterated feature selection algorithms with layered recurrent neural network for software fault prediction

Expert Systems with Applications ◽

10.1016/j.eswa.2018.12.033 ◽

2019 ◽

Vol 122 ◽

pp. 27-42 ◽

Cited By ~ 22

Author(s):

Hamza Turabieh ◽

Majdi Mafarja ◽

Xiaodong Li

Keyword(s):

Neural Network ◽

Feature Selection ◽

Recurrent Neural Network ◽

Fault Prediction ◽

Software Fault Prediction ◽

Software Fault ◽

Selection Algorithms


Empirical studies on feature selection for software fault prediction

Proceedings of the 5th Asia-Pacific Symposium on Internetware - Internetware '13 ◽

10.1145/2532443.2532461 ◽

2013 ◽

Cited By ~ 6

Author(s):

Jiaqiang Chen ◽

Shulong Liu ◽

Xiang Chen ◽

Qing Gu ◽

Daoxu Chen

Keyword(s):

Feature Selection ◽

Empirical Studies ◽

Fault Prediction ◽

Software Fault Prediction ◽

Software Fault ◽

Selection For


Efficient Multi-Swarm Binary Harris Hawks Optimization as a Feature Selection Approach for Software Fault Prediction

2020 11th International Conference on Information and Communication Systems (ICICS) ◽

10.1109/icics49469.2020.239557 ◽

2020 ◽

Author(s):

Thaer Thaher ◽

Nabil Arman

Keyword(s):

Feature Selection ◽

Fault Prediction ◽

Software Fault Prediction ◽

Selection Approach ◽

Software Fault ◽

Feature Selection Approach


Enhanced Binary Moth Flame Optimization as a Feature Selection Algorithm to Predict Software Fault Prediction

IEEE Access ◽

10.1109/access.2020.2964321 ◽

2020 ◽

Vol 8 ◽

pp. 8041-8055 ◽

Cited By ~ 4

Author(s):

Iyad Tumar ◽

Yousef Hassouneh ◽

Hamza Turabieh ◽

Thaer Thaher

Keyword(s):

Feature Selection ◽

Fault Prediction ◽

Selection Algorithm ◽

Feature Selection Algorithm ◽

Software Fault Prediction ◽

Software Fault


Rough Noise-Filtered Easy Ensemble for Software Fault Prediction

10.20944/preprints201805.0248.v1 ◽

2018 ◽

Author(s):

Saman Riaz ◽

Ali Arshad ◽

Licheng Jiao

Keyword(s):

Feature Selection ◽

Rough Set ◽

Rough Set Theory ◽

Predictive Accuracy ◽

Class Imbalance ◽

Fault Prediction ◽

Software Fault Prediction ◽

Software Fault ◽

The Impact ◽

Noisy Examples

Software fault prediction is an important research topic in software quality assurance. Data-driven approaches provide robust mechanisms for software fault prediction. However, the prediction performance of a model depends heavily on the quality of the dataset. Many software datasets suffer from class imbalance. Under-sampling is a popular data pre-processing method for dealing with class imbalance, and Easy Ensemble (EE) is a robust approach that achieves a high classification rate and addresses the bias towards majority-class samples. However, class imbalance is not the only issue that harms classifier performance. Noisy examples and irrelevant features may further reduce predictive accuracy. In this paper, we propose a two-stage data pre-processing approach that incorporates feature selection and a new Rough set Easy Ensemble scheme. In the feature selection stage, we eliminate irrelevant features with a feature ranking algorithm. In the second stage, the new Rough set Easy Ensemble applies a Rough K nearest neighbor rule filter (RK) before executing Easy Ensemble (EE); we call it RKEE for short. RK can remove noisy examples from both the minority and majority classes. Experimental evaluation on real-world software projects, such as the NASA and Eclipse datasets, demonstrates the effectiveness of the proposed approach. Furthermore, this paper comprehensively investigates the factors that influence our approach, such as the impact of Rough set theory on the noise filter and the relationship between model performance and imbalance ratio. Comprehensive experiments indicate that the proposed approach shows outstanding and significant performance in terms of area under the curve (AUC).
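The second stage described above (filter noisy examples, then under-sample into balanced subsets) can be sketched roughly as follows. This is a simplification under stated assumptions: a plain k-nearest-neighbour vote stands in for the paper's Rough K nearest neighbor rule filter, and only the balanced-subset sampling of Easy Ensemble is shown, without the classifier that would be trained on each subset. Function names, parameters, and the toy data are hypothetical.

```python
import random


def knn_noise_filter(rows, labels, k=3):
    """Keep an example only if at least half of its k nearest neighbours share its label."""
    kept = []
    for i, (r, y) in enumerate(zip(rows, labels)):
        others = [j for j in range(len(rows)) if j != i]
        others.sort(key=lambda j: sum((a - b) ** 2 for a, b in zip(r, rows[j])))
        votes = [labels[j] for j in others[:k]]
        if votes.count(y) * 2 >= len(votes):
            kept.append(i)
    return [rows[i] for i in kept], [labels[i] for i in kept]


def balanced_subsets(rows, labels, n_subsets=5, seed=0):
    """Pair all minority examples with equally sized random draws from the majority class."""
    rng = random.Random(seed)
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    minority, majority = (pos, neg) if len(pos) <= len(neg) else (neg, pos)
    subsets = []
    for _ in range(n_subsets):
        chosen = minority + rng.sample(majority, len(minority))
        subsets.append(([rows[i] for i in chosen], [labels[i] for i in chosen]))
    return subsets


# Toy data (hypothetical): six clean modules, three faulty ones, one mislabelled point.
rows = [
    (0, 0), (0, 1), (1, 0), (1, 1), (0.5, 0.5), (0.2, 0.8),  # class 0 cluster
    (5, 5), (5, 6), (6, 5),                                  # class 1 cluster
    (0.4, 0.4),                                              # labelled 1 inside the class 0 cluster
]
labels = [0, 0, 0, 0, 0, 0, 1, 1, 1, 1]

clean_rows, clean_labels = knn_noise_filter(rows, labels, k=3)
subsets = balanced_subsets(clean_rows, clean_labels, n_subsets=5)
print(len(clean_rows), [len(sub_labels) for _, sub_labels in subsets])
```

Filtering before sampling matters here because a mislabelled minority example would otherwise be copied into every balanced subset, while a mislabelled majority example would only appear in some of them.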
