Machine Learning Predictive Models for Pile Drivability: An Evaluation of Random Forest Regression and Multivariate Adaptive Regression Splines

Author(s):  
Wengang Zhang ◽  
Chongzhi Wu
2021 ◽  
Vol 2021 (1) ◽  
pp. 1044-1053
Author(s):  
Nuri Taufiq ◽  
Siti Mariyah

Metode yang digunakan untuk pemeringkatan status sosial ekonomi rumah tangga Basis Data Terpadu adalah dengan memprediksi nilai pengeluaran rumah tangga dengan metode Proxy Mean Testing (PMT). Secara umum metode ini merupakan model prediksi dengan menggunakan teknik regresi. Pilihan model statistik yang digunakan adalah forward-stepwise. Dalam praktiknya diasumsikan bahwa variabel prediktor yang digunakan dalam PMT memiliki korelasi linier dengan variabel pengeluaran. Penelitian ini mencoba menerapkan pendekatan machine learning sebagai alternatif metode prediksi selain model forward-stepwise. Model dibangun menggunakan beberapa algoritma machine learning seperti Multivariate Adaptive Regression Splines (MARS), K-Nearest Neighbors, Decision Tree, dan Bagging. Hasil pemodelan menunjukkan bahwa model machine learning menghasilkan nilai rata-rata inclusion error (IE) lebih rendah dibandingkan nilai rata-rata exclusion error (EE). Model machine learning bekerja efektif dalam mengurangi IE namun belum cukup sensitif dalam mengurangi EE. Nilai rata-rata IE model machine learning sebesar 0,21 sedangkan nilai rata-rata IE model PMT sebesar 0,29.


Author(s):  
Mohammed Okoe Alhassan ◽  
Michael Boakye Osei

Soft-computing techniques for fire safety parameter predictions in flammability studies are essential for describing a material fire behaviour. This study proposed, two novel Artificial Intelligence developed models, Multivariate Adaptive Regression Splines (MARS) and Random Forest (RF) methods, to model and predict peak heat release rate (pHRR) of Polymethyl methacrylate (PMMA) from Microscale Combustion Calorimetry (MCC) experiment. From the statistical analysis, MARS presented the highest coefficient of determination (R2) values of (0.9998) and (0.9996) for training and testing respectively, with low MAD, MAPE and RMSE values. Comparatively, MARS outperformed RF in the predictions of pHRR, through its model algorithms that generated optimized equations for pHRR predictions, covering all non-linearity points of the experimental data. Amongst the input variables (sample mass, THR, HRC, pTemp and pTime), heating rate (β), highly influenced pHRR outcome predictions from MARS and RF models. However, to validate the performance and applicability of the proposed models. Results of MARS and RF were benchmarked with that from Artificial Neural Network (ANN) methods. The MARS and RF models observed the least error deviation when compared with pHRR results for PMMA from the ANN models. This study therefore, recommends the adoption of MARS and RF in the predictions of flammability characteristics of polymeric materials.


2016 ◽  
Vol 20 (7) ◽  
pp. 2611-2628 ◽  
Author(s):  
Julie E. Shortridge ◽  
Seth D. Guikema ◽  
Benjamin F. Zaitchik

Abstract. In the past decade, machine learning methods for empirical rainfall–runoff modeling have seen extensive development and been proposed as a useful complement to physical hydrologic models, particularly in basins where data to support process-based models are limited. However, the majority of research has focused on a small number of methods, such as artificial neural networks, despite the development of multiple other approaches for non-parametric regression in recent years. Furthermore, this work has often evaluated model performance based on predictive accuracy alone, while not considering broader objectives, such as model interpretability and uncertainty, that are important if such methods are to be used for planning and management decisions. In this paper, we use multiple regression and machine learning approaches (including generalized additive models, multivariate adaptive regression splines, artificial neural networks, random forests, and M5 cubist models) to simulate monthly streamflow in five highly seasonal rivers in the highlands of Ethiopia and compare their performance in terms of predictive accuracy, error structure and bias, model interpretability, and uncertainty when faced with extreme climate conditions. While the relative predictive performance of models differed across basins, data-driven approaches were able to achieve reduced errors when compared to physical models developed for the region. Methods such as random forests and generalized additive models may have advantages in terms of visualization and interpretation of model structure, which can be useful in providing insights into physical watershed function. However, the uncertainty associated with model predictions under extreme climate conditions should be carefully evaluated, since certain models (especially generalized additive models and multivariate adaptive regression splines) become highly variable when faced with high temperatures.


Sign in / Sign up

Export Citation Format

Share Document