Prediction of cohesion and friction angle from well-logging data using decision tree and random forest

Abstract Advances in computational algorithms and the availability of large datasets with clinically relevant characteristics provide an opportunity to develop machine learning prediction models to aid in diagnosis, prognosis, and treatment of older adults. Some studies have employed machine learning methods for prediction modeling, but skepticism of these methods remains due to lack of reproducibility and difficulty understanding the complex algorithms behind models. We aim to provide an overview of two common machine learning methods: decision tree and random forest. We focus on these methods because they provide a high degree of interpretability. We discuss the underlying algorithms of decision tree and random forest methods and present a tutorial for developing prediction models for serious fall injury using data from the Lifestyle Interventions and Independence for Elders (LIFE) study. Decision tree is a machine learning method that produces a model resembling a flow chart. Random forest consists of a collection of many decision trees whose results are aggregated. In the tutorial example, we discuss evaluation metrics and interpretation for these models. Illustrated in data from the LIFE study, prediction models for serious fall injury were moderate at best (area under the receiver operating curve of 0.54 for decision tree and 0.66 for random forest). Machine learning methods may offer improved performance compared to traditional models for modeling outcomes in aging, but their use should be justified and output should be carefully described. Models should be assessed by clinical experts to ensure compatibility with clinical practice.

Download Full-text

Robot Perceptual Classification Method Based on Mixed Features of Decision Tree and Random Forest

2021 IEEE 2nd International Conference on Big Data, Artificial Intelligence and Internet of Things Engineering (ICBAIE) ◽

10.1109/icbaie52039.2021.9389973 ◽

2021 ◽

Author(s):

Yifan Song ◽

Jiankai Zuo ◽

Jiehong Wu ◽

Zeyuan Liu ◽

Ziheng Li

Keyword(s):

Random Forest ◽

Decision Tree ◽

Classification Method ◽

Perceptual Classification ◽

Mixed Features

Download Full-text

Development of a Safety Management System Tracking the Weight of Heavy Objects Carried by Construction Workers Using FSR Sensors

Applied Sciences ◽

10.3390/app11041378 ◽

2021 ◽

Vol 11 (4) ◽

pp. 1378

Author(s):

Seung Hyun Lee ◽

Jaeho Son

Keyword(s):

Random Forest ◽

Decision Tree ◽

Safety Management ◽

Tracking System ◽

Construction Workers ◽

Gradient Boosting ◽

Prototype System ◽

Construction Site ◽

Average Accuracy ◽

Smart Safety

It has been pointed out that the act of carrying a heavy object that exceeds a certain weight by a worker at a construction site is a major factor that puts physical burden on the worker’s musculoskeletal system. However, due to the nature of the construction site, where there are a large number of workers simultaneously working in an irregular space, it is difficult to figure out the weight of the object carried by the worker in real time or keep track of the worker who carries the excess weight. This paper proposes a prototype system to track the weight of heavy objects carried by construction workers by developing smart safety shoes with FSR (Force Sensitive Resistor) sensors. The system consists of smart safety shoes with sensors attached, a mobile device for collecting initial sensing data, and a web-based server computer for storing, preprocessing and analyzing such data. The effectiveness and accuracy of the weight tracking system was verified through the experiments where a weight was lifted by each experimenter from +0 kg to +20 kg in 5 kg increments. The results of the experiment were analyzed by a newly developed machine learning based model, which adopts effective classification algorithms such as decision tree, random forest, gradient boosting algorithm (GBM), and light GBM. The average accuracy classifying the weight by each classification algorithm showed similar, but high accuracy in the following order: random forest (90.9%), light GBM (90.5%), decision tree (90.3%), and GBM (89%). Overall, the proposed weight tracking system has a significant 90.2% average accuracy in classifying how much weight each experimenter carries.

Download Full-text

Classification Models Using Decision Tree, Random Forest, and Moving Average Analysis

New Frontiers in Nanochemistry ◽

10.1201/9780429022951-6 ◽

2020 ◽

pp. 91-115

Author(s):

Rohit Dutt ◽

Harish Dureja ◽

A. K. Madan

Keyword(s):

Random Forest ◽

Decision Tree ◽

Moving Average ◽

Classification Models ◽

Average Analysis

Download Full-text

Modified Decision Tree Technique for Ransomware Detection at Runtime through API Calls

Scientific Programming ◽

10.1155/2020/8845833 ◽

2020 ◽

Vol 2020 ◽

pp. 1-10

Author(s):

Faizan Ullah ◽

Qaisar Javaid ◽

Abdu Salam ◽

Masood Ahmad ◽

Nadeem Sarwar ◽

...

Keyword(s):

Machine Learning ◽

Random Forest ◽

Decision Tree ◽

Feature Vector ◽

Machine Learning Algorithms ◽

The Novel ◽

Proposed Model ◽

Testing Accuracy ◽

Financial Losses

Ransomware (RW) is a distinctive variety of malware that encrypts the files or locks the user’s system by keeping and taking their files hostage, which leads to huge financial losses to users. In this article, we propose a new model that extracts the novel features from the RW dataset and performs classification of the RW and benign files. The proposed model can detect a large number of RW from various families at runtime and scan the network, registry activities, and file system throughout the execution. API-call series was reutilized to represent the behavior-based features of RW. The technique extracts fourteen-feature vector at runtime and analyzes it by applying online machine learning algorithms to predict the RW. To validate the effectiveness and scalability, we test 78550 recent malign and benign RW and compare with the random forest and AdaBoost, and the testing accuracy is extended at 99.56%.

Download Full-text

Deteksi Gempa Berdasarkan Data Twitter Menggunakan Decision Tree, Random Forest, dan SVM

Jurnal Teknik ITS ◽

10.12962/j23373539.v6i1.22037 ◽

2017 ◽

Vol 6 (1) ◽

Author(s):

Rendra Dwi Lingga P. ◽

Chastine Fatichah ◽

Diana Purwitasari

Keyword(s):

Random Forest ◽

Decision Tree

Download Full-text

A DECISION TREE-BASED CLASSIFICATION FRAMEWORK FOR USED OIL ANALYSIS APPLYING RANDOM FOREST FEATURE SELECTION

Journal of Applied Science, Engineering and Technology for Development ◽

10.33803/jasetd.2017.3-1.7 ◽

2018 ◽

Keyword(s):

Decision Making ◽

Feature Selection ◽

Random Forest ◽

Decision Tree ◽

Condition Monitoring ◽

Critical Parameters ◽

Oil Analysis ◽

Used Oil ◽

Maintenance Decision

Lubricant condition monitoring (LCM), part of condition monitoring techniques under Condition Based Maintenance, monitors the condition and state of the lubricant which reveal the condition and state of the equipment. LCM has proved and evidenced to represent a key concept driving maintenance decision making involving sizeable number of parameter (variables) tests requiring classification and interpretation based on the lubricant’s condition. Reduction of the variables to a manageable and admissible level and utilization for prediction is key to ensuring optimization of equipment performance and lubricant condition. This study advances a methodology on feature selection and predictive modelling of in-service oil analysis data to assist in maintenance decision making of critical equipment. Proposed methodology includes data pre-processing involving cleaning, expert assessment and standardization due to the different measurement scales. Limits provided by the Original Equipment Manufacturers (OEM) are used by the analysts to manually classify and indicate samples with significant lubricant deterioration. In the last part of the methodology, Random Forest (RF) is used as a feature selection tool and a Decision Tree-based (DT) classification of the in-service oil samples. A case study of a thermal power plant is advanced, to which the framework is applied. The selection of admissible variables using Random Forest exposes critical used oil analysis (UOA) variables indicative of lubricant/machine degradation, while DT model, besides predicting the classification of samples, offers visual interpretability of parametric impact to the classification outcome. The model evaluation returned acceptable predictive, while the framework renders speedy classification with insights for maintenance decision making, thus ensuring timely interventions. Moreover, the framework highlights critical and relevant oil analysis parameters that are indicative of lubricant degradation; hence, by addressing such critical parameters, organizations can better enhance the reliability of their critical operable equipment.

Download Full-text

EVALUATING EFFECTIVENESS OF ENSEMBLE CLASSIFIERS WHEN DETECTING FUZZERS ATTACKS ON THE UNSW-NB15 DATASET

Journal of Computer Science and Cybernetics ◽

10.15625/1813-9663/36/2/14786 ◽

2020 ◽

Vol 36 (2) ◽

pp. 173-185

Author(s):

Hoang Ngoc Thanh ◽

Tran Van Lang

Keyword(s):

Random Forest ◽

Decision Tree ◽

Cyber Security ◽

Experimental Results ◽

Ensemble Classifiers ◽

Research Results ◽

Ensemble Techniques ◽

F Measure ◽

Classification Quality

The UNSW-NB15 dataset was created by the Australian Cyber Security Centre in 2015 by using the IXIA tool to extract normal behaviors and modern attacks, it includes normal data and 9 types of attacks with 49 features. Previous research results show that the detection of Fuzzers attacks in this dataset gives the lowest classification quality. This paper analyzes and evaluates the performance of using known ensemble techniques such as Bagging, AdaBoost, Stacking, Decorate, Random Forest and Voting to detect FUZZERS attacks on UNSW-NB15 dataset to create models. The experimental results show that the AdaBoost technique with the component classifiers using decision tree for the best classification quality with F-Measure is 96.76% compared to 94.16%, which is the best result obtained by using single classifiers and 96.36% by using the Random Forest technique.

Download Full-text

Analysis and Prediction Blood Pressure and Disease by Applying Decision Tree, Naïve Base and Random Forest algorithms

Indian Journal of Public Health Research & Development ◽

10.37506/ijphrd.v11i4.9150 ◽

2020 ◽

Keyword(s):

Blood Pressure ◽

Random Forest ◽

Decision Tree

Download Full-text