A Machine Learning Model Comparison and Selection Framework for Software Defect Prediction Using VIKOR

Software defect prediction is an important activity in software firms. It supports the production of quality software through reliable defect prediction, defect elimination, and identification of defect-prone modules. Researchers have proposed various software defect prediction approaches. However, these conventional approaches suffer from low classification accuracy and are time-consuming and laborious. This paper develops a novel multi-model ensemble machine learning approach for software defect prediction. The ensemble technique can reduce inconsistency between training and test datasets and eliminate bias in the training and testing phases of the model, thereby overcoming the drawbacks of existing software defect prediction techniques. To address these shortcomings, the paper proposes a new ensemble machine learning model using k-Nearest Neighbour (kNN), Generalized Linear Model with Elastic Net Regularization (GLMNet), and Linear Discriminant Analysis (LDA), with Random Forest as base learner. Experiments were conducted with the proposed model on the CM1, JM1, KC3, and PC3 datasets from the NASA PROMISE repository using RStudio. The ensemble technique achieved 87.69% on CM1, 81.11% on JM1, 90.70% on PC3, and 94.74% on KC3. The performance of the proposed system was also compared with that of existing techniques in the literature in terms of AUC: the ensemble technique achieved 87%, better than the seven other state-of-the-art techniques considered. On average, the proposed model achieved an overall prediction accuracy of 88.56% across all datasets used in the experiments. The results demonstrate that the ensemble model effectively predicted defects in the PROMISE datasets, which are notorious for their noisy features and high dimensionality. This suggests that ensemble machine learning is a promising direction for software defect prediction.
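The reported overall accuracy can be checked against the four per-dataset figures. The short Python sketch below assumes the per-dataset percentages are accuracies and that the overall figure is their unweighted mean; the abstract does not state either point explicitly, and the variable names are illustrative only.

```python
from statistics import mean

# Per-dataset figures quoted in the abstract (assumed here to be accuracy, in percent).
accuracy_percent = {"CM1": 87.69, "JM1": 81.11, "PC3": 90.70, "KC3": 94.74}

# Unweighted mean across the four datasets, rounded to two decimals.
overall = round(mean(accuracy_percent.values()), 2)
print(f"Overall accuracy: {overall}%")
```

Under those assumptions the mean comes to 88.56, which matches the overall prediction accuracy quoted in the abstract.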

An Improved Approach to Software Defect Prediction using a Hybrid Machine Learning Model

2018 20th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC), 2018. DOI: 10.1109/synasc.2018.00074

Author(s): Diana-Lucia Miholca

Keyword(s): Machine Learning, Learning Model, Defect Prediction, Software Defect Prediction, Software Defect, Machine Learning Model, Hybrid Machine

Class Imbalance Issue in Software Defect Prediction Models by various Machine Learning Techniques: An Empirical Study

2021. DOI: 10.1109/icscc51209.2021.9528170

Author(s): Sushant Kumar Pandey, Anil Kumar Tripathi

Keyword(s): Machine Learning, Empirical Study, Prediction Models, Class Imbalance, Machine Learning Techniques, Defect Prediction, Software Defect Prediction, Software Defect, Learning Techniques, Defect Prediction Models

SDP-ML: An Automated Approach of Software Defect Prediction employing Machine Learning Techniques

2021. DOI: 10.1109/icecit54077.2021.9641218

Author(s): Md Nasir Uddin, Bixin Li, Md Naim Mondol, Md Mostafizur Rahman, Md Suman Mia, ...

Keyword(s): Machine Learning, Machine Learning Techniques, Defect Prediction, Software Defect Prediction, Software Defect, Learning Techniques

Fraud Detection with Machine Learning - Model Comparison

International Journal of Business Intelligence and Data Mining, 2023, Vol 1 (1), pp. 1. DOI: 10.1504/ijbidm.2023.10044239

Author(s): Guilherme Ferreira Pelucio Salome, João Luiz Chela, João Carlos Pacheco Junior

Keyword(s): Machine Learning, Model Comparison, Fraud Detection, Learning Model, Machine Learning Model

Software Defect Prediction based on Machine Learning Algorithms

2019 IEEE 5th International Conference on Computer and Communications (ICCC), 2019. DOI: 10.1109/iccc47050.2019.9064412

Author(s): Zhang Tian, Jing Xiang, Sun Zhenxiao, Zhang Yi, Yan Yunqiang

Keyword(s): Machine Learning, Learning Algorithms, Machine Learning Algorithms, Defect Prediction, Software Defect Prediction, Software Defect

Machine Learning Model Comparison for Leak Detection in Noisy Industrial Pipelines

2020 9th International Conference on Modern Circuits and Systems Technologies (MOCAST), 2020. DOI: 10.1109/mocast49295.2020.9200261

Author(s): Dimitrios Kampelopoulos, George N. Papastavrou, George P. Kousiopoulos, Nikolaos Karagiorgos, Sotirios K. Goudos, ...

Keyword(s): Machine Learning, Model Comparison, Leak Detection, Learning Model, Machine Learning Model

Software defect prediction: A multi-criteria decision-making approach

Nigerian Journal of Technological Research, 2020, Vol 15 (1), pp. 35-42. DOI: 10.4314/njtr.v15i1.7

Author(s): A.O. Balogun, A.O. Bajeh, H.A. Mojeed, A.G. Akintola

Keyword(s): Machine Learning, Software Testing, Evaluation Metrics, Defect Prediction, Software Systems, Software Defect Prediction, Learning Models, Decision Method, Software Defect, Machine Learning Models

Software system failures linked to software testing are common because modern software systems are large and complex. Software testing, an integral part of the software development life cycle (SDLC), consumes both human and capital resources. Software defect prediction (SDP) mechanisms are therefore deployed to strengthen the testing phase of the SDLC by predicting defect-prone modules or components in software systems. Machine learning models have been used to develop SDP models with considerable success. Moreover, some studies have reported that combining machine learning models into an ensemble yields better prediction accuracy than single SDP models. However, the performance of machine learning models can vary across predictive evaluation metrics, so more studies are needed to establish the effectiveness of ensemble SDP models over single SDP models. This study proposes using Multi-Criteria Decision Method (MCDM) techniques to rank machine learning models. Two MCDM techniques, Analytic Network Process (ANP) and Preference Ranking Organization Method for Enrichment Evaluation (PROMETHEE), are applied to 9 machine learning models with 11 performance evaluation metrics and 11 software defect datasets. The experimental results showed that ensemble SDP models are the most appropriate SDP models, as Boosted SMO and Boosted PART ranked highest under each of the MCDM techniques. The results also supported the position that accuracy should not be the only performance evaluation metric for SDP models. In conclusion, performance metrics other than predictive accuracy should be considered when ranking and evaluating machine learning models.

Keywords: Ensemble; Multi-Criteria Decision Method; Software Defect Prediction
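To illustrate how an MCDM technique can rank models on several metrics at once, here is a minimal Python sketch of PROMETHEE II net-flow ranking. It is not the authors' setup: the model names, scores, equal weights, and the choice of the "usual" preference function (full preference whenever one model strictly beats another on a criterion) are all assumptions made for illustration, and ANP is not shown.

```python
def promethee_ii(scores, weights):
    """Rank alternatives by PROMETHEE II net outranking flow.

    scores:  {alternative: [criterion values]}, higher is better on every criterion
    weights: criterion weights that sum to 1
    """
    names = list(scores)
    n = len(names)

    def preference(a, b):
        # Usual criterion (an assumption): a criterion's full weight counts
        # toward a whenever a strictly beats b on that criterion.
        return sum(w for w, x, y in zip(weights, scores[a], scores[b]) if x > y)

    net_flow = {}
    for a in names:
        leaving = sum(preference(a, b) for b in names if b != a) / (n - 1)
        entering = sum(preference(b, a) for b in names if b != a) / (n - 1)
        net_flow[a] = leaving - entering

    # Highest net flow first.
    return sorted(net_flow.items(), key=lambda item: item[1], reverse=True)


# Hypothetical scores per model: [accuracy, AUC, F-measure]. Not taken from the paper.
scores = {
    "model_a": [0.85, 0.70, 0.60],
    "model_b": [0.80, 0.78, 0.66],
    "model_c": [0.78, 0.75, 0.58],
}
weights = [1 / 3, 1 / 3, 1 / 3]  # equal weights, also an assumption

ranking = promethee_ii(scores, weights)
for name, flow in ranking:
    print(f"{name}: net flow {flow:+.3f}")
```

With these made-up numbers, model_a has the highest accuracy but ranks second behind model_b, which wins on the other two criteria. That is the kind of reordering the abstract has in mind when it cautions against ranking on accuracy alone.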

The Effects of Parameter Tuning on Machine Learning Performance in a Software Defect Prediction Context

2015. DOI: 10.33915/etd.6457

Author(s): Benjamin N. Province

Keyword(s): Machine Learning, Parameter Tuning, Learning Performance, Defect Prediction, Software Defect Prediction, Software Defect
