Software fault prediction using machine learning techniques with metric thresholds

BACKGROUND: Fault data is vital to predicting the fault-proneness in large systems. Predicting faulty classes helps in allocating the appropriate testing resources for future releases. However, current fault data face challenges such as unlabeled instances and data imbalance. These challenges degrade the performance of the prediction models. Data imbalance happens because the majority of classes are labeled as not faulty whereas the minority of classes are labeled as faulty. AIM: The research proposes to improve fault prediction using software metrics in combination with threshold values. Statistical techniques are proposed to improve the quality of the datasets and therefore the quality of the fault prediction. METHOD: Threshold values of object-oriented metrics are used to label classes as faulty to improve the fault prediction models The resulting datasets are used to build prediction models using five machine learning techniques. The use of threshold values is validated on ten large object-oriented systems. RESULTS: The models are built for the datasets with and without the use of thresholds. The combination of thresholds with machine learning has improved the fault prediction models significantly for the five classifiers. CONCLUSION: Threshold values can be used to label software classes as fault-prone and can be used to improve machine learners in predicting the fault-prone classes.

Download Full-text

A Literature Review Study of Software Defect Prediction using Machine Learning Techniques

International Journal of Emerging Research in Management and Technology ◽

10.23956/ijermt.v6i6.286 ◽

2018 ◽

Vol 6 (6) ◽

pp. 300 ◽

Cited By ~ 3

Author(s):

Feidu Akmel ◽

Ermiyas Birihanu ◽

Bahir Siraj

Keyword(s):

Machine Learning ◽

Software Metrics ◽

Quality Standard ◽

Machine Learning Techniques ◽

Software Systems ◽

Health Care Insurance ◽

Software Defect ◽

Learning Techniques ◽

Software Product

Software systems are any software product or applications that support business domains such as Manufacturing,Aviation, Health care, insurance and so on.Software quality is a means of measuring how software is designed and how well the software conforms to that design. Some of the variables that we are looking for software quality are Correctness, Product quality, Scalability, Completeness and Absence of bugs, However the quality standard that was used from one organization is different from other for this reason it is better to apply the software metrics to measure the quality of software. Attributes that we gathered from source code through software metrics can be an input for software defect predictor. Software defect are an error that are introduced by software developer and stakeholders. Finally, in this study we discovered the application of machine learning on software defect that we gathered from the previous research works.

Download Full-text

Prediction of Quality of Service Parameters Using Aggregate Software Metrics and Machine Learning Techniques*

2018 15th IEEE India Council International Conference (INDICON) ◽

10.1109/indicon45594.2018.8986987 ◽

2018 ◽

Author(s):

Manish K Tripathi ◽

Divyanshu Chaubisa ◽

Lov Kumar ◽

Lalita Bhanu Murthy Neti

Keyword(s):

Machine Learning ◽

Quality Of Service ◽

Software Metrics ◽

Machine Learning Techniques ◽

Learning Techniques ◽

Quality Of Service Parameters

Download Full-text

Overview of Machine Learning Approaches

Advances in Systems Analysis, Software Engineering, and High Performance Computing - Enhancing Software Fault Prediction With Machine Learning ◽

10.4018/978-1-5225-3185-2.ch003 ◽

2017 ◽

pp. 19-33

Keyword(s):

Machine Learning ◽

Software Quality ◽

Fault Prediction ◽

Machine Learning Techniques ◽

Case Based Reasoning ◽

Learning Approaches ◽

Software Module ◽

Learning Techniques ◽

Case Based

This chapter enlists and presents an overview of various machine learning approaches. It also explains the machine learning techniques used in the area of software engineering domain especially case-based reasoning method. Case-based reasoning is used to predict software quality of the system by examining a software module and predicting whether it is faulty or non-faulty. In this chapter an attempt has been made to propose a model with the help of previous data which is used for prediction. In this chapter, how machine learning technique such as case-based reasoning has been used for error estimation or fault prediction. Apart from case-based reasoning, some other types of learning methods have been discussed in detail.

Download Full-text

Important Issues in Software Fault Prediction

Advances in Systems Analysis, Software Engineering, and High Performance Computing - Handbook of Research on Emerging Advancements and Technologies in Software Engineering ◽

10.4018/978-1-4666-6026-7.ch023 ◽

2014 ◽

pp. 510-539 ◽

Cited By ~ 1

Author(s):

Golnoush Abaei ◽

Ali Selamat

Keyword(s):

Software Quality ◽

Software Metrics ◽

Prediction Models ◽

Research Field ◽

Verification And Validation ◽

Fault Prediction ◽

Machine Learning Techniques ◽

Software Fault Prediction ◽

Learning Techniques ◽

Software Fault

Quality assurance tasks such as testing, verification and validation, fault tolerance, and fault prediction play a major role in software engineering activities. Fault prediction approaches are used when a software company needs to deliver a finished product while it has limited time and budget for testing it. In such cases, identifying and testing parts of the system that are more defect prone is reasonable. In fact, prediction models are mainly used for improving software quality and exploiting available resources. Software fault prediction is studied in this chapter based on different criteria that matters in this research field. Usually, there are certain issues that need to be taken care of such as different machine-learning techniques, artificial intelligence classifiers, variety of software metrics, distinctive performance evaluation metrics, and some statistical analysis. In this chapter, the authors present a roadmap for those researchers who are interested in working in this area. They illustrate problems along with objectives related to each mentioned criterion, which could assist researchers to build the finest software fault prediction model.

Download Full-text

Important Issues in Software Fault Prediction

Computer Systems and Software Engineering ◽

10.4018/978-1-5225-3923-0.ch007 ◽

2017 ◽

pp. 162-190

Author(s):

Golnoush Abaei ◽

Ali Selamat

Keyword(s):

Software Quality ◽

Software Metrics ◽

Prediction Models ◽

Research Field ◽

Verification And Validation ◽

Fault Prediction ◽

Machine Learning Techniques ◽

Software Fault Prediction ◽

Learning Techniques ◽

Software Fault

Download Full-text

Lessons Learned from the Assessment of Software Defect Prediction on WLCG Software: A Study with Unlabelled Datasets and Machine Learning Techniques

EPJ Web of Conferences ◽

10.1051/epjconf/202024505041 ◽

2020 ◽

Vol 245 ◽

pp. 05041

Author(s):

Elisabetta Ronchieri ◽

Marco Canaparo ◽

Mauro Belgiovine ◽

Davide Salomoni ◽

Barbara Martelli

Keyword(s):

Machine Learning ◽

Software Metrics ◽

Prediction Models ◽

Lessons Learned ◽

Machine Learning Techniques ◽

Defect Prediction ◽

Software Defect Prediction ◽

Software Defect ◽

Learning Techniques ◽

Software Modules

Software defect prediction is an activity that aims at narrowing down the most likely defect-prone software modules and helping developers and testers to prioritize inspection and testing. This activity can be addressed by using Machine Learning techniques applied to software metrics datasets that are usually unlabelled, i.e. they lack modules classification in terms of defectiveness. To overcome this limitation, in addition to the usual data pre-processing operations to manage mission values and/or to remove inconsistencies, researches have to adopt an approach to label their unlabelled software datasets. The extraction of defectiveness data to label all the instances of the datasets is an extremely time and effort consuming operation. In literature, many studies have introduced approaches to build a defect prediction models on unlabelled datasets. In this paper, we describe the analysis of new unlabelled datasets from WLCG software, coming from HEP-related experiments and middleware, by using Machine Learning techniques. We have experimented new approaches to label the various modules due to the heterogeneity of software metrics distribution. We discuss a number of lessons learned from conducting these activities, what has worked, what has not and how our research can be improved.

Download Full-text

A Software Fault Prediction on Inter and Intra Release Prediction Scenarios

International Journal of Open Source Software and Processes ◽

10.4018/ijossp.287611 ◽

2021 ◽

Vol 12 (4) ◽

pp. 0-0

Keyword(s):

Machine Learning ◽

Research Work ◽

Fault Prediction ◽

Machine Learning Techniques ◽

Software Fault Prediction ◽

Machine Learning Methods ◽

Learning Techniques ◽

Software Modules ◽

Software Fault

Software quality engineering applied numerous techniques for assuring the quality of software, namely testing, verification, validation, fault tolerance, and fault prediction of the software. The machine learning techniques facilitate the identification of software modules as faulty or non-faulty. In most of the research, these approaches predict the fault-prone module in the same release of the software. Although, the model is found to be more efficient and validated when training and tested data are taken from previous and subsequent releases of the software respectively. The contribution of this paper is to predict the faults in two scenarios i.e. inter and intra release prediction. The comparison of both intra and inter-release fault prediction by computing various performance matrices using machine learning methods shows that intra-release prediction is having better accuracy compared to inter-releases prediction across all the releases. Also, but both the scenarios achieve good results in comparison to existing research work.

Download Full-text

Assessment of defect prediction models using machine learning techniques for object-oriented systems

2016 5th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO) ◽

10.1109/icrito.2016.7785021 ◽

2016 ◽

Cited By ~ 1

Author(s):

Ruchika Malhotra ◽

Shivani Shukla ◽

Geet Sawhney

Keyword(s):

Machine Learning ◽

Prediction Models ◽

Object Oriented ◽

Machine Learning Techniques ◽

Defect Prediction ◽

Learning Techniques ◽

Defect Prediction Models ◽

Object Oriented Systems

Download Full-text

A Review of Statistical and Machine Learning Techniques for Microvascular Complications in Type 2 Diabetes

Current Diabetes Reviews ◽

10.2174/1573399816666200511003357 ◽

2020 ◽

Vol 16 ◽

Author(s):

Nitigya Sambyal ◽

Poonam Saini ◽

Rupali Syal

Keyword(s):

Machine Learning ◽

Prediction Models ◽

Clinical Medicine ◽

Microvascular Complications ◽

Descriptive Analysis ◽

Machine Learning Techniques ◽

World Health ◽

Public Health Issue ◽

Learning Techniques ◽

Health Organization

Background and Introduction: Diabetes mellitus is a metabolic disorder that has emerged as a serious public health issue worldwide. According to the World Health Organization (WHO), without interventions, the number of diabetic incidences is expected to be at least 629 million by 2045. Uncontrolled diabetes gradually leads to progressive damage to eyes, heart, kidneys, blood vessels and nerves. Method: The paper presents a critical review of existing statistical and Artificial Intelligence (AI) based machine learning techniques with respect to DM complications namely retinopathy, neuropathy and nephropathy. The statistical and machine learning analytic techniques are used to structure the subsequent content review. Result: It has been inferred that statistical analysis can help only in inferential and descriptive analysis whereas, AI based machine learning models can even provide actionable prediction models for faster and accurate diagnose of complications associated with DM. Conclusion: The integration of AI based analytics techniques like machine learning and deep learning in clinical medicine will result in improved disease management through faster disease detection and cost reduction for disease treatment.

Download Full-text

Machine learning techniques based on security management in smart cities using robots

Work ◽

10.3233/wor-203423 ◽

2021 ◽

pp. 1-12

Author(s):

Zhang Mengqi ◽

Wang Xi ◽

V.E. Sathishkumar ◽

V. Sivakumar

Keyword(s):

Machine Learning ◽

Information And Communication Technologies ◽

Smart City ◽

Smart Cities ◽

Security Management ◽

Machine Learning Techniques ◽

Quality Of Services ◽

Learning Techniques ◽

Learning Concept

BACKGROUND: Nowadays, the growth of smart cities is enhanced gradually, which collects a lot of information and communication technologies that are used to maximize the quality of services. Even though the intelligent city concept provides a lot of valuable services, security management is still one of the major issues due to shared threats and activities. For overcoming the above problems, smart cities’ security factors should be analyzed continuously to eliminate the unwanted activities that used to enhance the quality of the services. OBJECTIVES: To address the discussed problem, active machine learning techniques are used to predict the quality of services in the smart city manages security-related issues. In this work, a deep reinforcement learning concept is used to learn the features of smart cities; the learning concept understands the entire activities of the smart city. During this energetic city, information is gathered with the help of security robots called cobalt robots. The smart cities related to new incoming features are examined through the use of a modular neural network. RESULTS: The system successfully predicts the unwanted activity in intelligent cities by dividing the collected data into a smaller subset, which reduces the complexity and improves the overall security management process. The efficiency of the system is evaluated using experimental analysis. CONCLUSION: This exploratory study is conducted on the 200 obstacles are placed in the smart city, and the introduced DRL with MDNN approach attains maximum results on security maintains.

Download Full-text