Using Clustering Techniques Goes with Genetic Algorithm to Improve Software Defect Prediction

Software defect prediction supports software testing by detecting as many defects as possible before release, which plays an important role in ensuring quality and reliability. It can be modeled as a classification problem in which software modules are assigned to one of two classes, defective or non-defective, using classification algorithms. This study investigated the impact of feature selection methods on classification via clustering techniques for software defect prediction. Three clustering techniques (Farthest First Clusterer, K-Means, and Make-Density Clusterer) and three feature selection methods (Chi-Square, Clustering Variation, and Information Gain) were applied to software defect datasets from the NASA repository. The best software defect prediction model was Farthest First with Information Gain feature selection, with an accuracy of 78.69%, a precision of 0.804, and a recall of 0.788. The experimental results showed that clustering techniques used as classifiers gave good predictive performance and that feature selection further enhanced it. This indicates that classification via clustering can give competitive results against standard classification methods, with the advantage that no model has to be trained on a labeled dataset, so it can be used on unlabeled datasets.

Keywords: Classification, Clustering, Feature Selection, Software Defect Prediction

Vol. 26, No. 1, June 2019
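The idea of classification via clustering described in this abstract can be illustrated with a minimal sketch. It is not the study's implementation: the toy metric vectors, the deterministic choice of the first k points as initial centroids, and the function names `kmeans`, `cluster_to_class`, and `predict` are all assumptions made for illustration. A few known labels are used here only to name each cluster after the fact, not to fit the clusters.

```python
import math
from collections import Counter

def kmeans(points, k, iters=20):
    """Plain k-means; the first k points seed the centroids (assumed for reproducibility)."""
    centroids = [list(p) for p in points[:k]]
    assign = [0] * len(points)
    for _ in range(iters):
        # Assign every point to its nearest centroid.
        assign = [min(range(k), key=lambda c: math.dist(p, centroids[c]))
                  for p in points]
        # Move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assign) if a == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return centroids, assign

def cluster_to_class(assign, labels):
    """Name each cluster after the majority label among its members."""
    votes = {}
    for a, y in zip(assign, labels):
        votes.setdefault(a, Counter())[y] += 1
    return {a: cnt.most_common(1)[0][0] for a, cnt in votes.items()}

def predict(point, centroids, mapping):
    """Classify a module by the class of its nearest cluster."""
    nearest = min(range(len(centroids)),
                  key=lambda c: math.dist(point, centroids[c]))
    return mapping[nearest]

# Hypothetical two-metric module vectors; 1 = defective, 0 = non-defective.
modules = [(1, 1), (10, 10), (1, 2), (2, 1), (9, 10), (10, 9)]
labels = [0, 1, 0, 0, 1, 1]

centroids, assign = kmeans(modules, k=2)
mapping = cluster_to_class(assign, labels)
prediction = predict((8, 9), centroids, mapping)
```

The clusters are formed without looking at `labels`, which is the property the abstract highlights; the labels only decide which cluster is called "defective".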

Neural Network based Software Defect Prediction using Genetic Algorithm and Particle Swarm Optimization

2019 1st International Conference on Advances in Science, Engineering and Robotics Technology (ICASERT) ◽

10.1109/icasert.2019.8934642 ◽

2019 ◽

Cited By ~ 2

Author(s):

Safial Islam Ayon

Keyword(s):

Neural Network ◽

Genetic Algorithm ◽

Particle Swarm Optimization ◽

Particle Swarm ◽

Defect Prediction ◽

Software Defect Prediction ◽

Swarm Optimization ◽

Software Defect

Genetic algorithm-based oversampling approach to prune the class imbalance issue in software defect prediction

Soft Computing ◽

10.1007/s00500-021-06112-6 ◽

2021 ◽

Author(s):

C. Arun ◽

C. Lakshmi

Keyword(s):

Genetic Algorithm ◽

Class Imbalance ◽

Defect Prediction ◽

Software Defect Prediction ◽

Software Defect

Genetic Algorithm-based Transfer Learning for Cross-Company Software Defect Prediction

International Journal of Hybrid Information Technology ◽

10.14257/ijhit.2017.10.3.05 ◽

2017 ◽

Vol 10 (3) ◽

pp. 45-56 ◽

Cited By ~ 2

Author(s):

Shengbing Ren ◽

Zhen Zhang ◽

Yuan Liu ◽

Ruliang Xie

Keyword(s):

Genetic Algorithm ◽

Transfer Learning ◽

Defect Prediction ◽

Software Defect Prediction ◽

Software Defect

Neural Network Parameter Optimization Based on Genetic Algorithm for Software Defect Prediction

Advanced Science Letters ◽

10.1166/asl.2014.5641 ◽

2014 ◽

Vol 20 (10) ◽

pp. 1951-1955 ◽

Cited By ~ 3

Author(s):

Romi Satria Wahono ◽

Nanna Suryana Herman ◽

Sabrina Ahmad

Keyword(s):

Neural Network ◽

Genetic Algorithm ◽

Parameter Optimization ◽

Defect Prediction ◽

Software Defect Prediction ◽

Network Parameter ◽

Software Defect

Towards Predicting Software Defects with Clustering Techniques

International Journal of Artificial Intelligence & Applications ◽

10.5121/ijaia.2021.12103 ◽

2021 ◽

Vol 12 (1) ◽

pp. 39-54

Author(s):

Waheeda Almayyan

Keyword(s):

Feature Selection ◽

Predictive Model ◽

Machine Learning Techniques ◽

Defect Prediction ◽

Grey Wolf Optimizer ◽

Software Defect Prediction ◽

Software Defects ◽

Particle Swarm Optimizer ◽

Clustering Techniques ◽

Software Defect

The purpose of software defect prediction is to improve the quality of a software project by building a predictive model that decides whether a software module is fault-prone. In recent years, much research has applied machine learning techniques to this topic. Our aim was to evaluate the performance of clustering techniques combined with feature selection schemes for software defect prediction. We analysed the National Aeronautics and Space Administration (NASA) benchmark datasets using three clustering algorithms: (1) Farthest First, (2) X-Means, and (3) self-organizing map (SOM). To evaluate different feature selection algorithms, this article presents a comparative analysis of software defect prediction based on Bat, Cuckoo, Grey Wolf Optimizer (GWO), and particle swarm optimizer (PSO). The results obtained with the proposed clustering models enabled us to build an efficient predictive model with a satisfactory detection rate and an acceptable number of features.
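As a rough illustration of the Farthest First clustering named in this abstract, the sketch below uses the standard farthest-first traversal: each new center is the point farthest from all centers chosen so far. The toy data, the fixed starting index, and the function names are assumptions; the article's own setup and its feature selection step are not reproduced here.

```python
import math

def farthest_first_centers(points, k, start=0):
    """Pick k centers; `start` is a fixed seed index (an assumption of this sketch)."""
    centers = [points[start]]
    # Distance from each point to its nearest chosen center so far.
    nearest = [math.dist(p, centers[0]) for p in points]
    while len(centers) < k:
        idx = max(range(len(points)), key=lambda i: nearest[i])
        centers.append(points[idx])
        nearest = [min(d, math.dist(p, points[idx]))
                   for p, d in zip(points, nearest)]
    return centers

def assign_clusters(points, centers):
    """Label each point with the index of its nearest center."""
    return [min(range(len(centers)), key=lambda c: math.dist(p, centers[c]))
            for p in points]

# Hypothetical two-metric module vectors.
modules = [(1, 1), (2, 1), (1, 2), (10, 10), (9, 10), (10, 9)]
centers = farthest_first_centers(modules, k=2)
clusters = assign_clusters(modules, centers)
```

Unlike k-means, the centers are actual data points and are chosen in a single pass, so no iterative refinement is needed.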
