Defect Prediction Models
Recently Published Documents

TOTAL DOCUMENTS: 66 (FIVE YEARS: 29)
H-INDEX: 15 (FIVE YEARS: 4)

2022 ◽  
Vol 31 (1) ◽  
pp. 1-26
Author(s):  
Davide Falessi ◽  
Aalok Ahluwalia ◽  
Massimiliano Di Penta

Defect prediction models can be beneficial for prioritizing testing, analysis, or code review activities, and they have been the subject of substantial effort in academia as well as some applications in industrial contexts. A necessary precondition when creating a defect prediction model is the availability of defect data from the history of projects. If this data is noisy, the resulting defect prediction model may be unreliable. One cause of noise in defect datasets is the presence of "dormant defects," i.e., defects discovered several releases after their introduction. A dormant defect can cause a class to be labeled as defect-free while it is not; such a class is said to be "snoring." In this article, we investigate the impact of snoring on classifiers' accuracy and the effectiveness of a possible countermeasure, i.e., dropping too-recent data from the training set. We analyze the accuracy of 15 machine learning defect prediction classifiers on data from more than 4,000 defects and 600 releases of 19 open-source projects from the Apache ecosystem. Our results show that, on average across projects, (i) the presence of dormant defects decreases the recall of defect prediction classifiers, and (ii) removing from the training set the classes that in the last release are labeled as not defective significantly improves the accuracy of the classifiers. In summary, this article provides insights on how to create defect datasets that mitigate the negative effect of dormant defects on defect prediction.
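The countermeasure evaluated above can be sketched as follows. This is an illustrative sketch only, not the authors' implementation; the function name and the flat record layout (one row per class per release) are hypothetical:

```python
def drop_snoring_candidates(training_set, last_release):
    """Drop classes labeled not-defective in the most recent release of
    the training window. Their labels are the least trustworthy: a
    dormant defect may simply not have surfaced yet ("snoring")."""
    return [row for row in training_set
            if not (row["release"] == last_release
                    and not row["defective"])]

# Hypothetical training data: one record per class per release.
training = [
    {"class": "A", "release": 1, "defective": True},
    {"class": "B", "release": 2, "defective": False},  # possibly snoring
    {"class": "C", "release": 2, "defective": True},
]
cleaned = drop_snoring_candidates(training, last_release=2)
# Class B is dropped; confirmed-defective class C in release 2 is kept.
```

Note that only *not-defective* labels from the last release are dropped: a class labeled defective is confirmed by the bug report itself, whereas a defect-free label can still be overturned by a dormant defect.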


2022 ◽  
Vol 12 (1) ◽  
pp. 493
Author(s):  
Mahesha Pandit ◽  
Deepali Gupta ◽  
Divya Anand ◽  
Nitin Goyal ◽  
Hani Moaiteq Aljahdali ◽  
...  

Using artificial intelligence (AI) based software defect prediction (SDP) techniques in the software development process helps isolate defective software modules, count the number of software defects, and identify risky code changes. However, software development teams are often unaware of SDP and do not have easy access to relevant models and techniques. The major reason for this problem appears to be the fragmentation of SDP research and SDP practice. To unify SDP research and practice, this article introduces a cloud-based, global, unified AI framework for SDP called DePaaS (Defects Prediction as a Service). The article describes the usage context, use cases, and detailed architecture of DePaaS and presents the first response of industry practitioners to it. In a first-of-its-kind survey, the article captures practitioners' belief in SDP and in the ability of DePaaS to solve some of the known challenges in the field of software defect prediction. The article also provides a novel process for SDP, a detailed description of the structure and behaviour of the DePaaS architecture components, the six best SDP models offered by DePaaS, a description of the algorithms that recommend SDP models, feature sets, and tunable parameters, and a rich set of challenges involved in building, using, and sustaining DePaaS. With the contributions of this article, SDP research and practice could be unified, enabling the construction and use of more pragmatic defect prediction models and leading to an increase in the efficiency of software testing.


Author(s):  
Elisa Verna ◽  
Gianfranco Genta ◽  
Maurizio Galetto ◽  
Fiorenzo Franceschini

Abstract Typically, monitoring the quality characteristics of highly personalized products is a difficult task due to the lack of experimental data. This is the typical case of processes where the production volume continues to shrink due to the growing complexity and customization of products, thus requiring low-volume production. This paper presents a novel approach to statistically monitoring the defects-per-unit (DPU) of assembled products based on the use of defect prediction models. The innovative aspect of this DPU-chart is that, unlike conventional SPC charts, which require preliminary experimental data to estimate the control limits (phase I), it is constructed using a predictive model based on a priori knowledge of the DPU. This defect prediction model is based on the structural complexity of the assembled product. By avoiding phase I, the novel approach may be of interest to researchers and practitioners for speeding up the chart's construction phase, especially in low-volume production. The description of the method is supported by a real industrial case study in the electromechanical field.
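The core idea of deriving control limits from a predicted DPU, rather than from phase-I data, can be illustrated with textbook-style u-chart limits. This sketch assumes Poisson-distributed defect counts and standard three-sigma limits, which is the conventional SPC formulation; the paper's complexity-based prediction model and exact limit derivation may differ:

```python
import math

def dpu_chart_limits(predicted_dpu, sample_size=1):
    """Three-sigma control limits for a defects-per-unit chart,
    computed from an a-priori predicted DPU instead of phase-I data.
    Assumes defect counts are Poisson, so the per-unit variance equals
    the DPU divided by the sample size (standard u-chart form)."""
    center = predicted_dpu
    sigma = math.sqrt(predicted_dpu / sample_size)
    ucl = center + 3 * sigma
    lcl = max(0.0, center - 3 * sigma)  # a defect rate cannot go below 0
    return lcl, center, ucl

# Hypothetical values: a model predicts 0.25 DPU; samples of 4 units.
lcl, cl, ucl = dpu_chart_limits(predicted_dpu=0.25, sample_size=4)
# → LCL 0.0, CL 0.25, UCL 1.0
```

Because the limits come entirely from the predicted DPU, the chart can be in place from the very first assembled unit, which is precisely the advantage in low-volume production.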


2021 ◽  
Vol 24 (68) ◽  
pp. 72-88
Author(s):  
Mohammad Alshayeb ◽  
Mashaan A. Alshammari

The ongoing development of computer systems requires massive software projects. Running the components of these huge projects for testing purposes can be costly; therefore, parameter estimation can be used instead. Software defect prediction models are crucial for software quality assurance. This study investigates the impact of dataset size and feature selection algorithms on software defect prediction models. We use two approaches to build software defect prediction models: a statistical approach and a machine learning approach with support vector machines (SVMs). The fault prediction model was built on four datasets of different sizes, and four feature selection algorithms were used. We found that applying the SVM defect prediction model to datasets with a reduced set of metrics as features may enhance the accuracy of the fault prediction model; it also directs testing effort toward the most influential set of metrics. We also found that the running time of the SVM fault prediction model does not scale consistently with dataset size, so having fewer metrics does not guarantee a shorter execution time. From the experiments, we found that dataset size has a direct influence on the SVM fault prediction model; however, the reduced datasets performed the same as, or slightly worse than, the original datasets.
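The metric-reduction step the study relies on can be sketched with a simple filter-style selector: rank metric columns by their absolute correlation with the defect label and keep the top k. This stands in for the four (unnamed here) selection algorithms compared in the study and is not the authors' implementation; the data is hypothetical:

```python
import math

def pearson(x, y):
    """Plain Pearson correlation, stdlib only."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(vx * vy)

def select_top_metrics(rows, labels, k):
    """Filter-style feature selection: score each metric column by
    |correlation| with the defect label, keep the k best columns."""
    n_features = len(rows[0])
    scores = [(abs(pearson([r[j] for r in rows], labels)), j)
              for j in range(n_features)]
    scores.sort(reverse=True)
    return sorted(j for _, j in scores[:k])

# Hypothetical metrics: column 0 tracks the label, column 1 is noise.
rows = [(1, 5), (2, 3), (3, 8), (4, 1)]
labels = [0, 0, 1, 1]
kept = select_top_metrics(rows, labels, k=1)  # keeps column 0
```

The reduced column set would then be fed to the SVM (e.g., scikit-learn's `sklearn.svm.SVC`) in place of the full metric suite.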


2021 ◽  
Author(s):  
Elisa Verna ◽  
Gianfranco Genta ◽  
Maurizio Galetto ◽  
Fiorenzo Franceschini

Abstract Typically, monitoring the quality characteristics of highly personalized products is a difficult task due to the lack of experimental data. This is the typical case of processes where the production volume continues to shrink due to the growing complexity and customization of products, thus requiring low-volume production. This paper presents a novel approach to statistically monitoring the Defects Per Unit (DPU) of assembled products based on the use of defect prediction models. Unlike traditional control charts, which require preliminary experimental data to estimate the control limits (phase I), the proposed DPU-chart is constructed using a predictive model based on a priori knowledge of the DPU. This defect prediction model is built on the structural complexity of the assembled product. The novel approach may be of interest to researchers and practitioners for speeding up the construction of the chart, especially in low-volume production, where the amount of data is limited. The description of the method is supported by a real industrial case study in the electromechanical field.


Author(s):  
Elisa Verna ◽  
Gianfranco Genta ◽  
Maurizio Galetto ◽  
Fiorenzo Franceschini

Abstract Designing appropriate quality inspections in manufacturing processes has always been a challenge for maintaining competitiveness in the market. Recent studies have focused on the design of appropriate in-process inspection strategies for assembly processes based on probabilistic models. Despite this general interest, a practical tool for assessing the adequacy of alternative inspection strategies is still lacking. This paper proposes a general framework to assess the effectiveness and cost of inspection strategies. In detail, defect probabilities obtained from prediction models are combined with inspection variables to define a pair of indicators for developing an inspection strategy map. Such a map acts as an analysis tool, enabling positioning assessment and benchmarking of the strategies adopted by manufacturing companies, and also as a design tool for achieving desired targets. The approach can assist designers of manufacturing processes, particularly low-volume productions, in the early stages of inspection planning.
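The indicator pair behind such a map can be sketched as an (effectiveness, cost) point per strategy. Everything here is an illustrative assumption, not the paper's definitions: `p` is a workstation's defect probability from a prediction model, `e` the probability its inspection catches a present defect, and `c` its inspection cost.

```python
def strategy_indicators(stations):
    """One point on an inspection strategy map: expected number of
    escaping (undetected) defects per unit, and total inspection cost.
    A defect at a station escapes when it occurs (p) and the local
    inspection misses it (1 - e)."""
    escaping = sum(s["p"] * (1 - s["e"]) for s in stations)
    cost = sum(s["c"] for s in stations)
    return escaping, cost

# Hypothetical two-station assembly strategy.
stations = [
    {"p": 0.02, "e": 0.9, "c": 1.5},
    {"p": 0.05, "e": 0.8, "c": 0.7},
]
d_un, c_tot = strategy_indicators(stations)
```

Plotting each candidate strategy at its (escaping defects, cost) coordinates gives the map's positioning and benchmarking use: strategies closer to the origin dominate on both axes.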


2021 ◽  
Vol 10 (2) ◽  
pp. 1063-1070
Author(s):  
Ruchika Malhotra ◽  
Anjali Sharma

In prediction modeling, the choice of features selected from the original feature set is crucial for accuracy and model interpretability. Feature ranking techniques rank features by their importance, but there is no consensus on the number of features at which to cut off. It therefore becomes important to identify a threshold value or range at which redundant features can be removed. In this work, an empirical study is conducted to identify a threshold benchmark for feature ranking algorithms. Experiments are conducted on the Apache Click dataset with six popular ranker techniques and six machine learning techniques, to deduce a relationship between the total number of input features (N) and the threshold range. The area-under-the-curve analysis shows that approximately 33-50% of the features are necessary and sufficient to yield a reasonable performance measure, with a variance of 2%, in defect prediction models. Further, we find that log2(N) as the ranker threshold value represents the lower limit of the range.
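The reported relationship can be turned into a simple rule of thumb: after ranking, keep somewhere between log2(N) features (the lower limit found above) and about half of them (the upper end of the 33-50% range). The function below is an illustrative reading of that result, not the authors' procedure:

```python
import math

def ranker_threshold_range(n_features):
    """Suggested cut-off range for a ranked feature list of size N:
    lower limit log2(N), upper limit ~50% of N (the top of the
    33-50% band reported in the study)."""
    lower = max(1, round(math.log2(n_features)))
    upper = n_features // 2
    return lower, upper

# E.g., with 64 ranked features, keep between 6 and 32 of them.
lo, hi = ranker_threshold_range(64)
```

A practitioner would then sweep cut-offs within this range (rather than over all N values) when tuning a defect prediction model.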

