A Novel Effort Measure Method for Effort-Aware Just-in-Time Software Defect Prediction

Just-in-time software defect prediction (JIT-SDP) is a fine-grained software defect prediction technology, which aims to identify the defective code changes in software systems. Effort-aware software defect prediction is a software defect prediction technology that takes into consideration the cost of code inspection, which can find more defective code changes in limited test resources. The traditional effort-aware defect prediction model mainly measures the effort based on the number of lines of code (LOC) and rarely considers additional factors. This paper proposes a novel effort measure method called Multi-Metric Joint Calculation (MMJC). When measuring the effort, MMJC takes into account not only LOC, but also the distribution of modified code across different files (Entropy), the number of developers that changed the files (NDEV) and the developer experience (EXP). In the simulation experiment, MMJC is combined with Linear Regression, Decision Tree, Random Forest, LightGBM, Support Vector Machine and Neural Network, respectively, to build the software defect prediction model. Several comparative experiments are conducted between the models based on MMJC and baseline models. The results show that indicators ACC and [Formula: see text] of the models based on MMJC are improved by 35.3% and 15.9% on average in the three verification scenarios, respectively, compared with the baseline models.

Download Full-text

Research of Software Defect Prediction Model Based on ACO-SVM

Chinese Journal of Computers ◽

10.3724/sp.j.1016.2011.01148 ◽

2011 ◽

Vol 34 (6) ◽

pp. 1148-1154 ◽

Cited By ~ 13

Author(s):

Hui-Yan JIANG ◽

Mao ZONG ◽

Xiang-Ying LIU

Keyword(s):

Prediction Model ◽

Defect Prediction ◽

Software Defect Prediction ◽

Model Based ◽

Software Defect

Download Full-text

Support Vector based Oversampling Technique for Handling Class Imbalance in Software Defect Prediction

2021 11th International Conference on Cloud Computing, Data Science & Engineering (Confluence) ◽

10.1109/confluence51648.2021.9377068 ◽

2021 ◽

Author(s):

Ruchika Malhotra ◽

Vaibhav Agrawal ◽

Vedansh Pal ◽

Tushar Agarwal

Keyword(s):

Class Imbalance ◽

Defect Prediction ◽

Support Vector ◽

Software Defect Prediction ◽

Software Defect

Download Full-text

An investigation of cross-project learning in online just-in-time software defect prediction

Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering ◽

10.1145/3377811.3380403 ◽

2020 ◽

Cited By ~ 1

Author(s):

Sadia Tabassum ◽

Leandro L. Minku ◽

Danyi Feng ◽

George G. Cabral ◽

Liyan Song

Keyword(s):

Defect Prediction ◽

Just In Time ◽

Software Defect Prediction ◽

Project Learning ◽

Software Defect ◽

Cross Project

Download Full-text

Establishing a software defect prediction model via effective dimension reduction

Information Sciences ◽

10.1016/j.ins.2018.10.056 ◽

2019 ◽

Vol 477 ◽

pp. 399-409 ◽

Cited By ~ 7

Author(s):

Hua Wei ◽

Changzhen Hu ◽

Shiyou Chen ◽

Yuan Xue ◽

Quanxin Zhang

Keyword(s):

Prediction Model ◽

Dimension Reduction ◽

Defect Prediction ◽

Software Defect Prediction ◽

Effective Dimension ◽

Software Defect ◽

Effective Dimension Reduction

Download Full-text

MULTI: Multi-objective effort-aware just-in-time software defect prediction

Information and Software Technology ◽

10.1016/j.infsof.2017.08.004 ◽

2018 ◽

Vol 93 ◽

pp. 1-13 ◽

Cited By ~ 33

Author(s):

Xiang Chen ◽

Yingquan Zhao ◽

Qiuping Wang ◽

Zhidan Yuan

Keyword(s):

Defect Prediction ◽

Just In Time ◽

Software Defect Prediction ◽

Multi Objective ◽

Software Defect

Download Full-text

A Comparison of Semi-Supervised Classification Approaches for Software Defect Prediction

Journal of Intelligent Systems ◽

10.1515/jisys-2013-0030 ◽

2014 ◽

Vol 23 (1) ◽

pp. 75-82 ◽

Cited By ~ 12

Author(s):

Cagatay Catal

Keyword(s):

Supervised Classification ◽

Defect Prediction ◽

Support Vector ◽

Software Defect Prediction ◽

Classification Methods ◽

Data Set ◽

Software Defect ◽

Data Points ◽

Supervised Classification Methods ◽

Prediction Approach

AbstractPredicting the defect-prone modules when the previous defect labels of modules are limited is a challenging problem encountered in the software industry. Supervised classification approaches cannot build high-performance prediction models with few defect data, leading to the need for new methods, techniques, and tools. One solution is to combine labeled data points with unlabeled data points during learning phase. Semi-supervised classification methods use not only labeled data points but also unlabeled ones to improve the generalization capability. In this study, we evaluated four semi-supervised classification methods for semi-supervised defect prediction. Low-density separation (LDS), support vector machine (SVM), expectation-maximization (EM-SEMI), and class mass normalization (CMN) methods have been investigated on NASA data sets, which are CM1, KC1, KC2, and PC1. Experimental results showed that SVM and LDS algorithms outperform CMN and EM-SEMI algorithms. In addition, LDS algorithm performs much better than SVM when the data set is large. In this study, the LDS-based prediction approach is suggested for software defect prediction when there are limited fault data.

Download Full-text

Research of Software Defect Prediction Model Based on Gray Theory

2009 International Conference on Management and Service Science ◽

10.1109/icmss.2009.5301677 ◽

2009 ◽

Cited By ~ 1

Author(s):

Zhuo-yuan Xiang ◽

Zhitao Tang

Keyword(s):

Prediction Model ◽

Defect Prediction ◽

Software Defect Prediction ◽

Model Based ◽

Software Defect ◽

Gray Theory

Download Full-text

An Empirical Study of Model-Agnostic Interpretation Technique for Just-in-Time Software Defect Prediction

Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering - Collaborative Computing: Networking, Applications and Worksharing ◽

10.1007/978-3-030-92635-9_25 ◽

2021 ◽

pp. 420-438

Author(s):

Xingguang Yang ◽

Huiqun Yu ◽

Guisheng Fan ◽

Zijie Huang ◽

Kang Yang ◽

...

Keyword(s):

Empirical Study ◽

Defect Prediction ◽

Just In Time ◽

Software Defect Prediction ◽

Software Defect

Download Full-text

Software Defect Prediction Using Hybrid Distribution Base Balance Instance Selection and Radial Basis Function Classifier

International Journal of System Dynamics Applications ◽

10.4018/ijsda.2019070103 ◽

2019 ◽

Vol 8 (3) ◽

pp. 53-75 ◽

Cited By ~ 1

Author(s):

Mrutyunjaya Panda

Keyword(s):

Prediction Model ◽

Radial Basis Function ◽

Basis Function ◽

Rapid Development ◽

Defect Prediction ◽

Instance Selection ◽

Software Defect Prediction ◽

Software Defect ◽

Radial Basis ◽

Base Balance

Software is an important part of human life and with the rapid development of software engineering the demands for software to be reliable with low defects is increasingly pressing. The building of a software defect prediction model is proposed in this article by using various software metrics with publicly available historical software defect datasets collected from several projects. Such a prediction model can enable the software engineers to take proactive actions in enhancing software quality from the early stages of the software development cycle. This article introduces a hybrid classification method (DBBRBF) by combining distribution base balance (DBB) based instance selection and radial basis function (RBF) neural network classifier to obtain the best prediction compared to the existing research. The experimental results with post-hoc statistical significance tests shows the effectiveness of the proposed approach.

Download Full-text