scholarly journals Software Fault Prediction Using Filtering Feature Selection in Cluster-Based Classification

2018 ◽  
Vol 4 (1) ◽  
pp. 59 ◽  
Author(s):  
Fachrul Pralienka Bani Muhamad ◽  
Daniel Oranova Siahaan ◽  
Chastine Fatichah
Author(s):  
Fatemeh Alighardashi ◽  
Mohammad Ali Zare Chahooki

Improving the software product quality before releasing by periodic tests is one of the most expensive activities in software projects. Due to limited resources to modules test in software projects, it is important to identify fault-prone modules and use the test sources for fault prediction in these modules. Software fault predictors based on machine learning algorithms, are effective tools for identifying fault-prone modules. Extensive studies are being done in this field to find the connection between features of software modules, and their fault-prone. Some of features in predictive algorithms are ineffective and reduce the accuracy of prediction process. So, feature selection methods to increase performance of prediction models in fault-prone modules are widely used. In this study, we proposed a feature selection method for effective selection of features, by using combination of filter feature selection methods. In the proposed filter method, the combination of several filter feature selection methods presented as fused weighed filter method. Then, the proposed method caused convergence rate of feature selection as well as the accuracy improvement. The obtained results on NASA and PROMISE with ten datasets, indicates the effectiveness of proposed method in improvement of accuracy and convergence of software fault prediction.


Nowadays, proper feature selection f+orFault prediction is very perplexing task. Improper feature selection may lead to bad result. To avoid this, there is a need to find the aridity of software fault. This is achieved by finding the fitness of the evolutionaryAlgorithmic function. In this paper, we finalize the Genetic evolutionarynature of our Feature set with the help of Fitness Function. Feature Selection is the objective of the prediction model tocreate the underlying process of generalized data. The wide range of data like fault dataset, need the better objective function is obtained by feature selection, ranking, elimination and construction. In this paper, we focus on finding the fitness of the machine learning function which is used in the diagnostics of fault in the software for the better classification.


Author(s):  
Saman Riaz ◽  
Ali Arshad ◽  
Licheng Jiao

Software fault prediction is the very consequent research topic for software quality assurance. Data driven approaches provide robust mechanisms to deal with software fault prediction. However, the prediction performance of the model highly depends on the quality of dataset. Many software datasets suffers from the problem of class imbalance. In this regard, under-sampling is a popular data pre-processing method in dealing with class imbalance problem, Easy Ensemble (EE) present a robust approach to achieve a high classification rate and address the biasness towards majority class samples. However, imbalance class is not the only issue that harms performance of classifiers. Some noisy examples and irrelevant features may additionally reduce the rate of predictive accuracy of the classifier. In this paper, we proposed two-stage data pre-processing which incorporates feature selection and a new Rough set Easy Ensemble scheme. In feature selection stage, we eliminate the irrelevant features by feature ranking algorithm. In the second stage of a new Rough set Easy Ensemble by incorporating Rough K nearest neighbor rule filter (RK) afore executing Easy Ensemble (EE), named RKEE for short. RK can remove noisy examples from both minority and majority class. Experimental evaluation on real-world software projects, such as NASA and Eclipse dataset, is performed in order to demonstrate the effectiveness of our proposed approach. Furthermore, this paper comprehensively investigates the influencing factor in our approach. Such as, the impact of Rough set theory on noise-filter, the relationship between model performance and imbalance ratio etc. comprehensive experiments indicate that the proposed approach shows outstanding performance with significance in terms of area-under-the-curve (AUC).


Sign in / Sign up

Export Citation Format

Share Document