A Literature Review Study of Software Defect Prediction using Machine Learning Techniques

Software systems are software products or applications that support business domains such as manufacturing, aviation, health care, and insurance. Software quality is a measure of how well software is designed and how well it conforms to that design. Attributes considered for software quality include correctness, product quality, scalability, completeness, and absence of bugs. However, quality standards differ from one organization to another, so it is better to apply software metrics to measure the quality of software. Attributes gathered from source code through software metrics can serve as input for a software defect predictor. Software defects are errors introduced by software developers and stakeholders. Finally, in this study we review the applications of machine learning to software defects gathered from previous research works.

Prediction of Quality of Service Parameters Using Aggregate Software Metrics and Machine Learning Techniques

2018 15th IEEE India Council International Conference (INDICON) ◽

10.1109/indicon45594.2018.8986987 ◽

2018 ◽

Author(s):

Manish K Tripathi ◽

Divyanshu Chaubisa ◽

Lov Kumar ◽

Lalita Bhanu Murthy Neti

Keyword(s):

Machine Learning ◽

Quality Of Service ◽

Software Metrics ◽

Machine Learning Techniques ◽

Learning Techniques ◽

Quality Of Service Parameters

Software fault prediction using machine learning techniques with metric thresholds

International Journal of Knowledge-based and Intelligent Engineering Systems ◽

10.3233/kes-210061 ◽

2021 ◽

Vol 25 (2) ◽

pp. 159-172

Author(s):

Raed Shatnawi

Keyword(s):

Machine Learning ◽

Software Metrics ◽

Prediction Models ◽

Object Oriented ◽

Fault Prediction ◽

Machine Learning Techniques ◽

Threshold Values ◽

Data Imbalance ◽

Learning Techniques

BACKGROUND: Fault data is vital to predicting fault-proneness in large systems. Predicting faulty classes helps in allocating the appropriate testing resources for future releases. However, current fault data face challenges such as unlabeled instances and data imbalance. These challenges degrade the performance of the prediction models. Data imbalance happens because the majority of classes are labeled as not faulty whereas the minority of classes are labeled as faulty. AIM: The research proposes to improve fault prediction using software metrics in combination with threshold values. Statistical techniques are proposed to improve the quality of the datasets and therefore the quality of the fault prediction. METHOD: Threshold values of object-oriented metrics are used to label classes as faulty to improve the fault prediction models. The resulting datasets are used to build prediction models using five machine learning techniques. The use of threshold values is validated on ten large object-oriented systems. RESULTS: The models are built for the datasets with and without the use of thresholds. The combination of thresholds with machine learning has improved the fault prediction models significantly for the five classifiers. CONCLUSION: Threshold values can be used to label software classes as fault-prone and can be used to improve machine learners in predicting the fault-prone classes.
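The idea of labeling classes by metric thresholds can be illustrated with a minimal sketch. The abstract does not state which metrics, threshold values, or labeling rule the paper uses, so everything below is an assumption made for illustration: the metric names (`wmc`, `cbo`, `rfc`), the values in `THRESHOLDS`, the `min_exceeded` rule, and the sample classes are all hypothetical.

```python
# Hypothetical threshold values for three object-oriented metrics.
# These numbers are illustrative only and are not taken from the paper.
THRESHOLDS = {"wmc": 20, "cbo": 9, "rfc": 40}


def label_fault_prone(metrics, thresholds=THRESHOLDS, min_exceeded=1):
    """Return True when at least `min_exceeded` metrics exceed their threshold."""
    exceeded = sum(
        1 for name, limit in thresholds.items() if metrics.get(name, 0) > limit
    )
    return exceeded >= min_exceeded


def imbalance_ratio(labels):
    """Ratio of non-faulty to faulty instances (inf when nothing is faulty)."""
    faulty = sum(1 for label in labels if label)
    clean = len(labels) - faulty
    return clean / faulty if faulty else float("inf")


# Made-up sample classes with made-up metric values.
classes = [
    {"name": "OrderService", "wmc": 35, "cbo": 12, "rfc": 60},
    {"name": "Money", "wmc": 5, "cbo": 2, "rfc": 8},
    {"name": "Parser", "wmc": 18, "cbo": 10, "rfc": 25},
    {"name": "Config", "wmc": 3, "cbo": 1, "rfc": 4},
]

labels = [label_fault_prone(c) for c in classes]
for c, is_faulty in zip(classes, labels):
    print(c["name"], "fault-prone" if is_faulty else "not fault-prone")
print("imbalance ratio (clean:faulty):", imbalance_ratio(labels))
```

In a setup like this, the resulting labels would then be passed to a classifier as training targets. Raising `min_exceeded` makes the rule stricter and typically increases the imbalance between the two groups.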

A Study on Software Metrics based Software Defect Prediction using Data Mining and Machine Learning Techniques

International Journal of Database Theory and Application ◽

10.14257/ijdta.2015.8.3.15 ◽

2015 ◽

Vol 8 (3) ◽

pp. 179-190 ◽

Cited By ~ 15

Author(s):

Manjula C.M. Prasad ◽

Lilly Florence ◽

Arti Arya

Keyword(s):

Machine Learning ◽

Data Mining ◽

Software Metrics ◽

Machine Learning Techniques ◽

Defect Prediction ◽

Software Defect Prediction ◽

Software Defect ◽

Learning Techniques ◽

Using Data

Lessons Learned from the Assessment of Software Defect Prediction on WLCG Software: A Study with Unlabelled Datasets and Machine Learning Techniques

EPJ Web of Conferences ◽

10.1051/epjconf/202024505041 ◽

2020 ◽

Vol 245 ◽

pp. 05041

Author(s):

Elisabetta Ronchieri ◽

Marco Canaparo ◽

Mauro Belgiovine ◽

Davide Salomoni ◽

Barbara Martelli

Keyword(s):

Machine Learning ◽

Software Metrics ◽

Prediction Models ◽

Lessons Learned ◽

Machine Learning Techniques ◽

Defect Prediction ◽

Software Defect Prediction ◽

Software Defect ◽

Learning Techniques ◽

Software Modules

Software defect prediction is an activity that aims at narrowing down the most likely defect-prone software modules and helping developers and testers to prioritize inspection and testing. This activity can be addressed by applying Machine Learning techniques to software metrics datasets, which are usually unlabelled, i.e. they lack a classification of modules in terms of defectiveness. To overcome this limitation, in addition to the usual data pre-processing operations to manage missing values and/or to remove inconsistencies, researchers have to adopt an approach to label their unlabelled software datasets. The extraction of defectiveness data to label all the instances of the datasets is an extremely time-consuming and effort-intensive operation. In the literature, many studies have introduced approaches to build defect prediction models on unlabelled datasets. In this paper, we describe the analysis of new unlabelled datasets from WLCG software, coming from HEP-related experiments and middleware, by using Machine Learning techniques. We have experimented with new approaches to label the various modules due to the heterogeneity of the software metrics distribution. We discuss a number of lessons learned from conducting these activities: what has worked, what has not, and how our research can be improved.

Machine learning techniques based on security management in smart cities using robots

Work ◽

10.3233/wor-203423 ◽

2021 ◽

pp. 1-12

Author(s):

Zhang Mengqi ◽

Wang Xi ◽

V.E. Sathishkumar ◽

V. Sivakumar

Keyword(s):

Machine Learning ◽

Information And Communication Technologies ◽

Smart City ◽

Smart Cities ◽

Security Management ◽

Machine Learning Techniques ◽

Quality Of Services ◽

Learning Techniques ◽

Learning Concept

BACKGROUND: Smart cities are growing steadily and bring together many information and communication technologies to maximize the quality of services. Although the intelligent city concept provides many valuable services, security management remains a major issue because of shared threats and activities. To overcome these problems, the security factors of smart cities should be analyzed continuously to eliminate unwanted activities and thereby enhance the quality of the services. OBJECTIVES: To address this problem, active machine learning techniques are used to predict the quality of services in the smart city and to manage security-related issues. In this work, a deep reinforcement learning (DRL) concept is used to learn the features of smart cities; the learning concept captures the full range of activities in the smart city. Throughout the city, information is gathered with the help of security robots called cobalt robots. New incoming features related to the smart city are examined using a modular neural network. RESULTS: The system successfully predicts unwanted activity in intelligent cities by dividing the collected data into smaller subsets, which reduces complexity and improves the overall security management process. The efficiency of the system is evaluated using experimental analysis. CONCLUSION: This exploratory study is conducted with 200 obstacles placed in the smart city, and the introduced DRL with MDNN approach attains maximum results on security maintenance.

Class Imbalance Issue in Software Defect Prediction Models by various Machine Learning Techniques: An Empirical Study

10.1109/icscc51209.2021.9528170 ◽

2021 ◽

Author(s):

Sushant Kumar Pandey ◽

Anil Kumar Tripathi

Keyword(s):

Machine Learning ◽

Empirical Study ◽

Prediction Models ◽

Class Imbalance ◽

Machine Learning Techniques ◽

Defect Prediction ◽

Software Defect Prediction ◽

Software Defect ◽

Learning Techniques ◽

Defect Prediction Models

SDP-ML: An Automated Approach of Software Defect Prediction employing Machine Learning Techniques

10.1109/icecit54077.2021.9641218 ◽

2021 ◽

Author(s):

Md Nasir Uddin ◽

Bixin Li ◽

Md Naim Mondol ◽

Md Mostafizur Rahman ◽

Md Suman Mia ◽

...

Keyword(s):

Machine Learning ◽

Machine Learning Techniques ◽

Defect Prediction ◽

Software Defect Prediction ◽

Software Defect ◽

Learning Techniques

Machine Learning for Neurodegenerative Disorder Diagnosis — Survey of Practices and Launch of Benchmark Dataset

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213018500112 ◽

2018 ◽

Vol 27 (03) ◽

pp. 1850011 ◽

Cited By ~ 5

Author(s):

Athanasios Tagaris ◽

Dimitrios Kollias ◽

Andreas Stafylopatis ◽

Georgios Tagaris ◽

Stefanos Kollias

Keyword(s):

Machine Learning ◽

Neurodegenerative Disorder ◽

Developed Countries ◽

Machine Learning Techniques ◽

Network Architectures ◽

Imaging Data ◽

Learning Techniques ◽

Medical Examinations

Neurodegenerative disorders, such as Alzheimer’s and Parkinson’s, constitute a major factor in long-term disability and are becoming more and more a serious concern in developed countries. As there are, at present, no effective therapies, early diagnosis along with avoidance of misdiagnosis seem to be critical in ensuring a good quality of life for patients. In this sense, the adoption of computer-aided-diagnosis tools can offer significant assistance to clinicians. In the present paper, we provide in the first place a comprehensive recording of medical examinations relevant to those disorders. Then, a review is conducted concerning the use of Machine Learning techniques in supporting diagnosis of neurodegenerative diseases, with reference to at times used medical datasets. Special attention has been given to the field of Deep Learning. In addition to that, we communicate the launch of a newly created dataset for Parkinson’s disease, containing epidemiological, clinical and imaging data, which will be publicly available to researchers for benchmarking purposes. To assess the potential of the new dataset, an experimental study in Parkinson’s diagnosis is carried out, based on state-of-the-art Deep Neural Network architectures and yielding very promising accuracy results.

Framework for the Classification of Emotions in People With Visual Disabilities Through Brain Signals

Frontiers in Neuroinformatics ◽

10.3389/fninf.2021.642766 ◽

2021 ◽

Vol 15 ◽

Author(s):

Jesús Leonardo López-Hernández ◽

Israel González-Carrasco ◽

José Luis López-Cuadrado ◽

Belén Ruiz-Mezcua

Keyword(s):

Machine Learning ◽

Positive Emotions ◽

Machine Learning Techniques ◽

Visual Disabilities ◽

Brain Signals ◽

Learning Techniques ◽

Sensory Disabilities ◽

Recognition Of Emotions

Nowadays, the recognition of emotions in people with sensory disabilities still represents a challenge due to the difficulty of generalizing and modeling the set of brain signals. In recent years, the technology that has been used to study a person’s behavior and emotions based on brain signals is the brain–computer interface (BCI). Although previous works have already proposed the classification of emotions in people with sensory disabilities using machine learning techniques, a model of recognition of emotions in people with visual disabilities has not yet been evaluated. Consequently, in this work, the authors present a twofold framework focused on people with visual disabilities. Firstly, auditory stimuli have been used, and a component of acquisition and extraction of brain signals has been defined. Secondly, analysis techniques for the modeling of emotions have been developed, and machine learning models for the classification of emotions have been defined. Based on the results, the algorithm with the best performance in the validation is random forest (RF), with an accuracy of 85% and 88% in the classification of negative and positive emotions, respectively. According to the results, the framework is able to classify positive and negative emotions, but the experimentation performed also shows that the framework performance depends on the number of features in the dataset and that the quality of the electroencephalogram (EEG) signals is a determining factor.
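Reporting separate accuracy figures for negative and positive emotions implies a per-class evaluation. The abstract does not say how these figures were computed; the sketch below assumes they are the fraction of each class's samples that were classified correctly (per-class recall). The function name `per_class_accuracy` and the toy labels are hypothetical and do not reproduce the paper's data or results.

```python
def per_class_accuracy(y_true, y_pred):
    """Fraction of correctly classified samples, computed separately per true class."""
    totals = {}
    correct = {}
    for truth, predicted in zip(y_true, y_pred):
        totals[truth] = totals.get(truth, 0) + 1
        if truth == predicted:
            correct[truth] = correct.get(truth, 0) + 1
    return {label: correct.get(label, 0) / count for label, count in totals.items()}


# Toy labels, invented for illustration only.
y_true = ["neg", "neg", "neg", "neg", "pos", "pos", "pos", "pos"]
y_pred = ["neg", "neg", "neg", "pos", "pos", "pos", "pos", "pos"]

scores = per_class_accuracy(y_true, y_pred)
for label, score in scores.items():
    print(f"{label}: {score:.2f}")
```

Reporting the score per class rather than as one overall number makes it visible when a classifier does well on one emotion category and poorly on the other.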
