Machine Learning Techniques to Predict Software Defect

Over the past ten years, many researchers have proposed predicting software defects using metrics based on measurable aspects of source code entities (e.g. methods, classes, files, or modules) and on the social structure of the software project. However, models built on these metrics have not achieved very high sensitivity, specificity, or accuracy. In this chapter, we propose the use of machine learning techniques to predict software defects. The effectiveness of these techniques is demonstrated on ten datasets taken from the literature. In our experiments, PNN outperformed all other techniques in terms of accuracy and sensitivity on all the software defect datasets, followed by CART and the Group Method of Data Handling. We also performed feature selection with a t-statistic based approach, identifying feature subsets across the different folds for a given technique, followed by feature subset selection. Using only the most important variables, we invoked the classifiers again and observed that PNN outperformed the other classifiers in terms of sensitivity and accuracy. Moreover, the set of 'if-then' rules yielded by J48 and CART can be used as an expert system for predicting software defects.
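
The abstract does not say which form of the t-statistic was used, so the following is only a minimal sketch of what t-statistic based feature ranking could look like, assuming Welch's two-sample t-statistic between the defective and non-defective classes. The function names and the toy data are hypothetical and are not taken from the chapter.

```python
from math import sqrt
from statistics import mean, variance


def t_statistic(a, b):
    """Welch's two-sample t-statistic for two lists of feature values."""
    se = sqrt(variance(a) / len(a) + variance(b) / len(b))
    return (mean(a) - mean(b)) / se if se > 0 else 0.0


def rank_features(rows, labels):
    """Return feature indices sorted by decreasing |t| between the two classes."""
    scores = []
    for j in range(len(rows[0])):
        defective = [r[j] for r, y in zip(rows, labels) if y == 1]
        clean = [r[j] for r, y in zip(rows, labels) if y == 0]
        scores.append((abs(t_statistic(defective, clean)), j))
    return [j for _, j in sorted(scores, reverse=True)]


# Toy data (hypothetical): column 0 separates the classes, column 1 does not.
rows = [[10.0, 1.0], [12.0, 2.0], [11.0, 1.5],
        [2.0, 1.2], [3.0, 1.8], [1.0, 1.4]]
labels = [1, 1, 1, 0, 0, 0]  # 1 = defective, 0 = non-defective
top = rank_features(rows, labels)
```

In a cross-validated setting, a ranking like this would be computed on the training part of each fold, and the features that rank highly across folds would be kept before the classifiers are retrained.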

Robust multiobjective evolutionary feature subset selection algorithm for binary classification using machine learning techniques

Neurocomputing, 2017, Vol 241, pp. 128-146. DOI: 10.1016/j.neucom.2017.02.033. Cited By ~ 14

Author(s): Ayça Deniz, Hakan Ezgi Kiziloz, Tansel Dokeroglu, Ahmet Cosar

Keyword(s): Machine Learning, Binary Classification, Subset Selection, Feature Subset Selection, Machine Learning Techniques, Feature Subset, Selection Algorithm, Learning Techniques

A Literature Review Study of Software Defect Prediction using Machine Learning Techniques

International Journal of Emerging Research in Management and Technology, 2018, Vol 6 (6), pp. 300. DOI: 10.23956/ijermt.v6i6.286. Cited By ~ 3

Author(s): Feidu Akmel, Ermiyas Birihanu, Bahir Siraj

Keyword(s): Machine Learning, Software Metrics, Quality Standard, Machine Learning Techniques, Software Systems, Health Care Insurance, Software Defect, Learning Techniques, Software Product

Software systems are software products or applications that support business domains such as manufacturing, aviation, health care, insurance, and so on. Software quality is a measure of how well software is designed and how well it conforms to that design. Some of the attributes considered for software quality are correctness, product quality, scalability, completeness, and absence of bugs. However, the quality standard used by one organization differs from that used by another; for this reason, it is better to apply software metrics to measure the quality of software. Attributes gathered from source code through software metrics can serve as input to a software defect predictor. Software defects are errors introduced by software developers and stakeholders. Finally, in this study we examine the application of machine learning to software defect prediction, as gathered from previous research works.

Class Imbalance Issue in Software Defect Prediction Models by various Machine Learning Techniques: An Empirical Study

2021. DOI: 10.1109/icscc51209.2021.9528170

Author(s): Sushant Kumar Pandey, Anil Kumar Tripathi

Keyword(s): Machine Learning, Empirical Study, Prediction Models, Class Imbalance, Machine Learning Techniques, Defect Prediction, Software Defect Prediction, Software Defect, Learning Techniques, Defect Prediction Models

SDP-ML: An Automated Approach of Software Defect Prediction employing Machine Learning Techniques

2021. DOI: 10.1109/icecit54077.2021.9641218

Author(s): Md Nasir Uddin, Bixin Li, Md Naim Mondol, Md Mostafizur Rahman, Md Suman Mia, ...

Keyword(s): Machine Learning, Machine Learning Techniques, Defect Prediction, Software Defect Prediction, Software Defect, Learning Techniques

Improved Intrusion Detection Algorithm based on TLBO and GA Algorithms

The International Arab Journal of Information Technology, 2021, Vol 18 (2). DOI: 10.34028/iajit/18/2/5

Keyword(s): Machine Learning, Intrusion Detection, Optimization Algorithm, Feature Subset Selection, Supervised Machine Learning, Machine Learning Techniques, Support Vector, Feature Subset, Teaching Learning Based Optimization, Teaching Learning

Optimization algorithms are widely used for intrusion detection. This is attributable to the increasing number of audit data features and the decreasing performance of human-based smart Intrusion Detection Systems (IDS) in terms of classification accuracy and training time. In this paper, an improved method for binary-classification intrusion detection was presented and discussed in detail. The proposed method combined the New Teaching-Learning-Based Optimization algorithm (NTLBO), used for feature selection and weighting, with supervised machine learning techniques, namely Support Vector Machine (SVM), Extreme Learning Machine (ELM), and Logistic Regression (LR), for Feature Subset Selection (FSS). Selecting the smallest number of features without affecting result accuracy in FSS was treated as a multi-objective optimization problem. NTLBO was proposed in this paper as an FSS mechanism; its algorithm-specific parameter-less concept (which requires no parameter tuning during optimization) was explored. The experiments were performed on prominent intrusion detection datasets (KDDCUP'99 and CICIDS 2017), where significant improvements were observed with the proposed NTLBO algorithm compared to the classical Teaching-Learning-Based Optimization (TLBO) algorithm; NTLBO presented better results than TLBO and many existing works. The results showed that NTLBO reached 100% accuracy on the KDDCUP'99 dataset and 97% on the CICIDS dataset.
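
To illustrate what treating FSS as a multi-objective problem means, the sketch below compares candidate feature subsets by Pareto dominance on two objectives: fewer features and higher accuracy. This is not the NTLBO algorithm itself, only the objective framing such a search could optimize; the candidate values are made up for illustration.

```python
def dominates(a, b):
    """a and b are (n_features, accuracy) pairs.

    a dominates b if it is no worse on both objectives
    (fewer or equal features, higher or equal accuracy)
    and strictly better on at least one.
    """
    no_worse = a[0] <= b[0] and a[1] >= b[1]
    strictly_better = a[0] < b[0] or a[1] > b[1]
    return no_worse and strictly_better


def pareto_front(candidates):
    """Keep only candidates that no other candidate dominates."""
    return [c for c in candidates
            if not any(dominates(other, c) for other in candidates)]


# Hypothetical (n_features, accuracy) results for five feature subsets.
candidates = [(41, 0.97), (12, 0.97), (8, 0.95), (20, 0.96), (5, 0.90)]
front = pareto_front(candidates)
```

Here the 41-feature and 20-feature subsets are dropped because the 12-feature subset is at least as accurate with fewer features; the remaining subsets represent different trade-offs between size and accuracy.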

Design and Development of an Efficient Network Intrusion Detection System Using Machine Learning Techniques

Wireless Communications and Mobile Computing, 2021, Vol 2021, pp. 1-35. DOI: 10.1155/2021/9974270

Author(s): Thomas Rincy N, Roopam Gupta

Keyword(s): Machine Learning, Performance Metrics, Detection System, Machine Learning Algorithms, Feature Subset Selection, Hybrid Network, Machine Learning Techniques, Feature Subset, Network Connection, Network Intrusion

Today's internet is made up of nearly half a million different networks. In any network connection, identifying attacks by type is a difficult task, as different attacks may involve various connections, and their number may vary from a few to hundreds of network connections. To solve this problem, a novel hybrid network IDS called NID-Shield is proposed that classifies the dataset according to different attack types. Furthermore, the attack names found within the attack types are classified individually, which helps considerably in predicting the vulnerability of individual attacks in various networks. The hybrid NID-Shield NIDS applies an efficient feature subset selection technique called CAPPER together with distinct machine learning methods. The UNSW-NB15 and NSL-KDD datasets are used for evaluation. Machine learning algorithms are trained on the reduced, accurate, high-merit feature subsets obtained from CAPPER and then assessed by cross-validation on the reduced attributes. Various performance metrics show that the hybrid NID-Shield NIDS with the CAPPER approach achieves a good accuracy rate and a low false positive rate (FPR) on the UNSW-NB15 and NSL-KDD datasets, and performs well when compared with various approaches in the existing literature.
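
For reference, the accuracy rate and FPR mentioned above can be derived from confusion-matrix counts. The following is a minimal sketch for the binary case (1 = attack, 0 = normal); the labels and predictions are hypothetical and do not come from the paper's experiments.

```python
def confusion_counts(y_true, y_pred):
    """Count true/false positives and negatives for binary labels."""
    tp = fp = tn = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == 1 and pred == 1:
            tp += 1
        elif truth == 0 and pred == 1:
            fp += 1
        elif truth == 0 and pred == 0:
            tn += 1
        else:
            fn += 1
    return tp, fp, tn, fn


def accuracy_and_fpr(y_true, y_pred):
    """Accuracy = (TP + TN) / N; FPR = FP / (FP + TN)."""
    tp, fp, tn, fn = confusion_counts(y_true, y_pred)
    accuracy = (tp + tn) / len(y_true)
    fpr = fp / (fp + tn) if (fp + tn) else 0.0
    return accuracy, fpr


# Hypothetical labels: 3 attacks and 5 normal connections.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 0, 1, 0]
accuracy, fpr = accuracy_and_fpr(y_true, y_pred)
```

With cross-validation, these metrics would be computed on each held-out fold and then averaged.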

Software Defect Prediction Using Supervised Machine Learning Techniques: A Systematic Literature Review

Intelligent Automation & Soft Computing, 2021, Vol 29 (2), pp. 403-421. DOI: 10.32604/iasc.2021.017562

Author(s): Faseeha Matloob, Shabib Aftab, Munir Ahmad, Muhammad Adnan Khan, Areej Fatima, ...

Keyword(s): Machine Learning, Literature Review, Systematic Literature Review, Supervised Machine Learning, Machine Learning Techniques, Defect Prediction, Software Defect Prediction, Software Defect, Learning Techniques

Software Defect Prediction Using Machine Learning Techniques

2020 4th International Conference on Trends in Electronics and Informatics (ICOEI)(48184), 2020. DOI: 10.1109/icoei48184.2020.9142909

Author(s): C. Lakshmi Prabha, N. Shivakumar

Keyword(s): Machine Learning, Machine Learning Techniques, Defect Prediction, Software Defect Prediction, Software Defect, Learning Techniques

Improved prediction of software defects using ensemble machine learning techniques

Neural Computing and Applications, 2021. DOI: 10.1007/s00521-021-05811-3

Author(s): Sweta Mehta, K. Sridhar Patnaik

Keyword(s): Machine Learning, Machine Learning Techniques, Software Defects, Ensemble Machine Learning, Learning Techniques

Software Defect Estimation using Machine Learning Algorithms

International Journal of Recent Technology and Engineering - 2, 2021, Vol 10 (1), pp. 204-208. DOI: 10.35940/ijrte.a5898.0510121

Author(s): Mandi Akif Hussain*, Revoori Veeharika Reddy, Kedharnath Nagella, Vidya S.

Keyword(s): Machine Learning, Software Engineering, Computer Science, Machine Learning Algorithms, Machine Learning Techniques, Data Sets, System Software, Software Defect, Learning Techniques, And Training

Software engineering is a branch of computer science concerned with building system software and adapting it to the requirements of the user. We selected seven distinct machine learning algorithms and test them using datasets acquired from the NASA public PROMISE repositories. The results of our project enable users of this software to identify defects and to select the most efficient of the given algorithms for their further tasks.
