Comparative Analysis of Random Forests with Statistical and Machine Learning Methods in Predicting Fault-Prone Classes

Author(s):  
Ruchika Malhotra ◽  
Arvinder Kaur ◽  
Yogesh Singh

There are available metrics for predicting fault prone classes, which may help software organizations for planning and performing testing activities. This may be possible due to proper allocation of resources on fault prone parts of the design and code of the software. Hence, importance and usefulness of such metrics is understandable, but empirical validation of these metrics is always a great challenge. Random Forest (RF) algorithm has been successfully applied for solving regression and classification problems in many applications. In this work, the authors predict faulty classes/modules using object oriented metrics and static code metrics. This chapter evaluates the capability of RF algorithm and compares its performance with nine statistical and machine learning methods in predicting fault prone software classes. The authors applied RF on six case studies based on open source, commercial software and NASA data sets. The results indicate that the prediction performance of RF is generally better than statistical and machine learning models. Further, the classification of faulty classes/modules using the RF method is better than the other methods in most of the data sets.

Author(s):  
Matheus del Valle ◽  
Kleber Stancari ◽  
Pedro Arthur Augusto de Castro ◽  
Moises Oliveira dos Santos ◽  
Denise Maria Zezell

Author(s):  
Anna Evgenievna Kharitonova ◽  
Alina Alekseevna Sundupey ◽  
Svetlana Skachkova

The article provides a comparative analysis of the results of the Russian Agricultural Census of 2006 and 2016. As a result, there is a decrease in the number of agricultural producers, a decrease in the size of agricultural land and equipment in organizations. Against this background, one can see an increase in the concentration of production in both crop and livestock production. Machine learning models have been built to classify subsidy organizations using Python libraries. The accuracy of the constructed models was up to 86 %, which proves the possibility of their use. In the future, the use of machine learning methods will reduce the number of Russian agricultural census indicators and classify organizations with high accuracy according to qualitative characteristics.


ACS Omega ◽  
2018 ◽  
Vol 3 (11) ◽  
pp. 15837-15849 ◽  
Author(s):  
Yang Li ◽  
Yujia Tian ◽  
Zijian Qin ◽  
Aixia Yan

PLoS ONE ◽  
2016 ◽  
Vol 11 (12) ◽  
pp. e0166898 ◽  
Author(s):  
Monique A. Ladds ◽  
Adam P. Thompson ◽  
David J. Slip ◽  
David P. Hocking ◽  
Robert G. Harcourt

2019 ◽  
Vol 23 (1) ◽  
pp. 125-142
Author(s):  
Helle Hein ◽  
Ljubov Jaanuska

In this paper, the Haar wavelet discrete transform, the artificial neural networks (ANNs), and the random forests (RFs) are applied to predict the location and severity of a crack in an Euler–Bernoulli cantilever subjected to the transverse free vibration. An extensive investigation into two data collection sets and machine learning methods showed that the depth of a crack is more difficult to predict than its location. The data set of eight natural frequency parameters produces more accurate predictions on the crack depth; meanwhile, the data set of eight Haar wavelet coefficients produces more precise predictions on the crack location. Furthermore, the analysis of the results showed that the ensemble of 50 ANN trained by Bayesian regularization and Levenberg–Marquardt algorithms slightly outperforms RF.


Sign in / Sign up

Export Citation Format

Share Document