Impact of restricted forward greedy feature selection technique on bug prediction

Mapping Intimacies ◽

10.7287/peerj.preprints.1411 ◽

2015 ◽

Author(s):

Muthukumaran Kasinathan ◽

Lalita Bhanu Murthy Neti

Keyword(s):

Feature Selection ◽

Prediction Models ◽

Source Code ◽

Feature Selection Technique ◽

Code Metrics ◽

Change Metrics ◽

Misclassification Rates ◽

The Individual ◽

The Impact ◽

Source Code Metrics

Several change metrics and source code metrics have been introduced and shown to be effective in bug prediction. Researchers have performed comparative studies of bug prediction models built using the individual metrics as well as combinations of these metrics. In this paper, we investigate the impact of feature selection on bug prediction models by analyzing the misclassification rates of these models with and without feature selection in place. We conduct our experiments on five open source projects, considering numerous change metrics and source code metrics. The study also aims to identify a reliable subset of metrics that is common to all projects.


Impact of restricted forward greedy feature selection technique on bug prediction

10.7287/peerj.preprints.1411v1 ◽

2015 ◽

Author(s):

Muthukumaran Kasinathan ◽

Lalita Bhanu Murthy Neti

Keyword(s):

Feature Selection ◽

Prediction Models ◽

Source Code ◽

Feature Selection Technique ◽

Code Metrics ◽

Change Metrics ◽

Misclassification Rates ◽

The Individual ◽

The Impact ◽

Source Code Metrics


A public unified bug dataset for java and its assessment regarding metrics and bug prediction

Software Quality Journal ◽

10.1007/s11219-020-09515-0 ◽

2020 ◽

Vol 28 (4) ◽

pp. 1447-1506 ◽

Cited By ~ 1

Author(s):

Rudolf Ferenc ◽

Zoltán Tóth ◽

Gergely Ladányi ◽

István Siket ◽

Tibor Gyimóthy

Keyword(s):

Prediction Models ◽

Source Code ◽

Decision Tree Algorithm ◽

Large Dataset ◽

Code Analysis ◽

Project Learning ◽

Code Metrics ◽

Public Datasets ◽

Source Code Metrics ◽

Cross Project

Abstract: Bug datasets have been created and used by many researchers to build and validate novel bug prediction models. In this work, our aim is to collect existing public source code metric-based bug datasets and unify their contents. Furthermore, we wish to assess the plethora of collected metrics and the capabilities of the unified bug dataset in bug prediction. We considered 5 public datasets, downloaded the corresponding source code for each system in the datasets, and performed source code analysis to obtain a common set of source code metrics. This way, we produced a unified bug dataset at class and file level as well. We investigated the divergence of metric definitions and values of the different bug datasets. Finally, we used a decision tree algorithm to show the capabilities of the dataset in bug prediction. We found that there are statistically significant differences in the values of the original and the newly calculated metrics; furthermore, notations and definitions can severely differ. We compared the bug prediction capabilities of the original and the extended metric suites (within-project learning). Afterwards, we merged all classes (and files) into one large dataset which consists of 47,618 elements (43,744 for files) and we evaluated the bug prediction model built on this large dataset as well. Finally, we also investigated cross-project capabilities of the bug prediction models and datasets. We made the unified dataset publicly available for everyone. By using a public unified dataset as an input for different bug prediction related investigations, researchers can make their studies reproducible, and thus able to be validated and verified.


A comprehensive investigation of the impact of feature selection techniques on crashing fault residence prediction models

Information and Software Technology ◽

10.1016/j.infsof.2021.106652 ◽

2021 ◽

pp. 106652

Author(s):

Kunsong Zhao ◽

Zhou Xu ◽

Meng Yan ◽

Tao Zhang ◽

Dan Yang ◽

...

Keyword(s):

Feature Selection ◽

Prediction Models ◽

Comprehensive Investigation ◽

The Impact ◽

Feature Selection Techniques


Source Code Metrics to Predict the Properties of FPGA/VHDL-Based Synthesized Products

2018 6th International Conference in Software Engineering Research and Innovation (CONISOFT) ◽

10.1109/conisoft.2018.8645854 ◽

2018 ◽

Author(s):

Oscar E. Perez-Cham ◽

Carlos Soubervielle-Montalvo ◽

Alberto S. Nunez-Varela ◽

Cesar Puente ◽

Luis J. Ontanon-Garcia

Keyword(s):

Source Code ◽

Code Metrics ◽

Source Code Metrics


A Study of the Relationships between Source Code Metrics and Attractiveness in Free Software Projects

2010 Brazilian Symposium on Software Engineering ◽

10.1109/sbes.2010.27 ◽

2010 ◽

Cited By ~ 18

Author(s):

Paulo Meirelles ◽

Carlos Santos Jr. ◽

Joao Miranda ◽

Fabio Kon ◽

Antonio Terceiro ◽

...

Keyword(s):

Source Code ◽

Free Software ◽

Software Projects ◽

Code Metrics ◽

Source Code Metrics


An Empirical Analysis on Effectiveness of Source Code Metrics for Aging Related Bug Prediction

10.18293/jvlc2019-n2-022 ◽

2019 ◽

Vol 2019 (2) ◽

pp. 117-126

Author(s):

Chinmay Hota ◽

Lov Kumar ◽

Lalita Bhanu Murthy Neti

Keyword(s):

Empirical Analysis ◽

Source Code ◽

Code Metrics ◽

Source Code Metrics


An efficient Software Source Code Metrics for Implementing for Software quality analysis

International Journal of Emerging Trends in Engineering Research ◽

10.30534/ijeter/2019/01792019 ◽

2019 ◽

pp. 216-222

Author(s):

Varun K L Srivastava ◽

Keyword(s):

Software Quality ◽

Source Code ◽

Quality Analysis ◽

Code Metrics ◽

Source Code Metrics ◽

Efficient Software


Analyzing fault prediction usefulness from cost perspective using source code metrics

2017 Tenth International Conference on Contemporary Computing (IC3) ◽

10.1109/ic3.2017.8284297 ◽

2017 ◽

Cited By ~ 1

Author(s):

Lov Kumar ◽

Ashish Sureka

Keyword(s):

Source Code ◽

Fault Prediction ◽

Code Metrics ◽

Source Code Metrics


Data stream mining for predicting software build outcomes using source code metrics

Information and Software Technology ◽

10.1016/j.infsof.2013.09.001 ◽

2014 ◽

Vol 56 (2) ◽

pp. 183-198 ◽

Cited By ~ 16

Author(s):

Jacqui Finlay ◽

Russel Pears ◽

Andy M. Connor

Keyword(s):

Data Stream ◽

Source Code ◽

Data Stream Mining ◽

Stream Mining ◽

Code Metrics ◽

Source Code Metrics


Can Software Project Maturity Be Accurately Predicted Using Internal Source Code Metrics?

Machine Learning and Data Mining in Pattern Recognition - Lecture Notes in Computer Science ◽

10.1007/978-3-319-41920-6_59 ◽

2016 ◽

pp. 774-789 ◽

Cited By ~ 1

Author(s):

Mark Grechanik ◽

Nitin Prabhu ◽

Daniel Graham ◽

Denys Poshyvanyk ◽

Mohak Shah

Keyword(s):

Source Code ◽

Software Project ◽

Internal Source ◽

Code Metrics ◽

Source Code Metrics
