The influence of neighborhood-level urban morphology on PM2.5 variation based on random forest regression

A Random Forest Regression Model Predicting the Winners of Summer Olympic Events

Proceedings of the 2020 2nd International Conference on Big Data Engineering ◽

10.1145/3404512.3404513 ◽

2020 ◽

Author(s):

Mengjie Jia ◽

Yue Zhao ◽

Furong Chang ◽

Bofeng Zhang ◽

Kenji Yoshigoe

Keyword(s):

Random Forest ◽

Regression Model ◽

Random Forest Regression

Get full-text (via PubEx)

A novel framework for designing a multi-DoF prosthetic wrist control using machine learning

Scientific Reports ◽

10.1038/s41598-021-94449-1 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Chinmay P. Swami ◽

Nicholas Lenhard ◽

Jiyeon Kang

Keyword(s):

Machine Learning ◽

Random Forest ◽

Upper Limb ◽

Daily Living ◽

Machine Learning Algorithms ◽

Data Sets ◽

Random Forest Regression ◽

Prosthetic Devices ◽

Upper Limb Function ◽

The Neural Network

AbstractProsthetic arms can significantly increase the upper limb function of individuals with upper limb loss, however despite the development of various multi-DoF prosthetic arms the rate of prosthesis abandonment is still high. One of the major challenges is to design a multi-DoF controller that has high precision, robustness, and intuitiveness for daily use. The present study demonstrates a novel framework for developing a controller leveraging machine learning algorithms and movement synergies to implement natural control of a 2-DoF prosthetic wrist for activities of daily living (ADL). The data was collected during ADL tasks of ten individuals with a wrist brace emulating the absence of wrist function. Using this data, the neural network classifies the movement and then random forest regression computes the desired velocity of the prosthetic wrist. The models were trained/tested with ADLs where their robustness was tested using cross-validation and holdout data sets. The proposed framework demonstrated high accuracy (F-1 score of 99% for the classifier and Pearson’s correlation of 0.98 for the regression). Additionally, the interpretable nature of random forest regression was used to verify the targeted movement synergies. The present work provides a novel and effective framework to develop an intuitive control for multi-DoF prosthetic devices.

Get full-text (via PubEx)

Real-Time Moisture Ratio Study of Drying Date Fruit Chips Based on On-Line Image Attributes Using kNN and Random Forest Regression Methods

Measurement ◽

10.1016/j.measurement.2020.108899 ◽

2020 ◽

pp. 108899

Author(s):

Madi Keramat-Jahromi ◽

Seyed Saeid Mohtasebi ◽

Hossein Mousazadeh ◽

Mahdi Ghasemi-Varnamkhasri ◽

Maryam Rahimi-Movassagh

Keyword(s):

Random Forest ◽

Real Time ◽

Moisture Ratio ◽

Random Forest Regression ◽

Regression Methods ◽

Line Image ◽

Date Fruit ◽

On Line

Get full-text (via PubEx)

Random forest regression results in accurate assessment of potato nitrogen status based on multispectral data from different platforms and the critical concentration approach

Field Crops Research ◽

10.1016/j.fcr.2021.108158 ◽

2021 ◽

Vol 268 ◽

pp. 108158

Author(s):

Junxiang Peng ◽

Kiril Manevski ◽

Kirsten Kørup ◽

René Larsen ◽

Mathias Neumann Andersen

Keyword(s):

Random Forest ◽

Critical Concentration ◽

Accurate Assessment ◽

Nitrogen Status ◽

Random Forest Regression ◽

Multispectral Data

Get full-text (via PubEx)

Application of random forest regression to the calculation of gas-phase chemistry within the GEOS-Chem chemistry model v10

Geoscientific Model Development ◽

10.5194/gmd-12-1209-2019 ◽

2019 ◽

Vol 12 (3) ◽

pp. 1209-1225 ◽

Cited By ~ 15

Author(s):

Christoph A. Keller ◽

Mat J. Evans

Keyword(s):

Machine Learning ◽

Random Forest ◽

Gas Phase ◽

Atmospheric Chemistry ◽

Random Forest Regression ◽

Data Set ◽

Gas Phase Chemistry ◽

Chemical Conditions ◽

Phase Chemistry ◽

The Impact

Abstract. Atmospheric chemistry models are a central tool to study the impact of chemical constituents on the environment, vegetation and human health. These models are numerically intense, and previous attempts to reduce the numerical cost of chemistry solvers have not delivered transformative change. We show here the potential of a machine learning (in this case random forest regression) replacement for the gas-phase chemistry in atmospheric chemistry transport models. Our training data consist of 1 month (July 2013) of output of chemical conditions together with the model physical state, produced from the GEOS-Chem chemistry model v10. From this data set we train random forest regression models to predict the concentration of each transported species after the integrator, based on the physical and chemical conditions before the integrator. The choice of prediction type has a strong impact on the skill of the regression model. We find best results from predicting the change in concentration for long-lived species and the absolute concentration for short-lived species. We also find improvements from a simple implementation of chemical families (NOx = NO + NO2). We then implement the trained random forest predictors back into GEOS-Chem to replace the numerical integrator. The machine-learning-driven GEOS-Chem model compares well to the standard simulation. For ozone (O3), errors from using the random forests (compared to the reference simulation) grow slowly and after 5 days the normalized mean bias (NMB), root mean square error (RMSE) and R2 are 4.2 %, 35 % and 0.9, respectively; after 30 days the errors increase to 13 %, 67 % and 0.75, respectively. The biases become largest in remote areas such as the tropical Pacific where errors in the chemistry can accumulate with little balancing influence from emissions or deposition. Over polluted regions the model error is less than 10 % and has significant fidelity in following the time series of the full model. Modelled NOx shows similar features, with the most significant errors occurring in remote locations far from recent emissions. For other species such as inorganic bromine species and short-lived nitrogen species, errors become large, with NMB, RMSE and R2 reaching >2100 % >400 % and <0.1, respectively. This proof-of-concept implementation takes 1.8 times more time than the direct integration of the differential equations, but optimization and software engineering should allow substantial increases in speed. We discuss potential improvements in the implementation, some of its advantages from both a software and hardware perspective, its limitations, and its applicability to operational air quality activities.

Get full-text (via PubEx)

Hand detection in American Sign Language depth data using domain-driven random forest regression

2015 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG) ◽

10.1109/fg.2015.7163135 ◽

2015 ◽

Cited By ~ 1

Author(s):

Zahoor Zafrulla ◽

Himanshu Sahni ◽

Abdelkareem Bedri ◽

Pavleen Thukral ◽

Thad Starner

Keyword(s):

Random Forest ◽

American Sign Language ◽

Sign Language ◽

American Sign ◽

Hand Detection ◽

Random Forest Regression ◽

Depth Data

Get full-text (via PubEx)

Lower Limb Continuous Motion Estimation Based on Binary Cuckoo Search and Random Forest Regression

10.23919/ccc52363.2021.9550023 ◽

2021 ◽

Author(s):

Fei Wang ◽

Zhibo Fan ◽

Chengpeng Li ◽

Jian Lu

Keyword(s):

Random Forest ◽

Motion Estimation ◽

Lower Limb ◽

Cuckoo Search ◽

Random Forest Regression ◽

Continuous Motion

Get full-text (via PubEx)

Early Classification Method for US Corn and Soybean by Incorporating MODIS-Estimated Phenological Data and Historical Classification Maps in Random-Forest Regression Algorithm

Photogrammetric Engineering & Remote Sensing ◽

10.14358/pers.21-00003r2 ◽

2021 ◽

Vol 87 (10) ◽

pp. 747-758

Author(s):

Toshihiro Sakamoto

Keyword(s):

Random Forest ◽

Classification Accuracy ◽

Classification Method ◽

Estimation Accuracy ◽

Random Forest Regression ◽

Crop Phenology ◽

Phenological Data ◽

Mixed Pixel ◽

Crop Classification ◽

Emergence Date

An early crop classification method is functionally required in a near-real-time crop-yield prediction system, especially for upland crops. This study proposes methods to estimate the mixed-pixel ratio of corn, soybean, and other classes within a low-resolution MODIS pixel by coupling MODIS-derived crop phenology information and the past Cropland Data Layer in a random-forest regression algorithm. Verification of the classification accuracy was conducted for the Midwestern United States. The following conclusions are drawn: The use of the random-forest algorithm is effective in estimating the mixed-pixel ratio, which leads to stable classification accuracy; the fusion of historical data and MODIS-derived crop phenology information provides much better crop classification accuracy than when these are used individually; and the input of a longer MODIS data period can improve classification accuracy, especially after day of year 279, because of improved estimation accuracy for the soybean emergence date.

Get full-text (via PubEx)

Empirical methods for the estimation of Southern Ocean CO<sub>2</sub>: support vector and random forest regression

Biogeosciences ◽

10.5194/bg-14-5551-2017 ◽

2017 ◽

Vol 14 (23) ◽

pp. 5551-5569 ◽

Cited By ~ 10

Author(s):

Luke Gregor ◽

Schalk Kok ◽

Pedro M. S. Monteiro

Keyword(s):

Random Forest ◽

Southern Ocean ◽

Synthetic Data ◽

Good Effect ◽

Support Vector ◽

Co2 Uptake ◽

Empirical Methods ◽

Random Forest Regression ◽

Proxy Variables ◽

The Impact

Abstract. The Southern Ocean accounts for 40 % of oceanic CO2 uptake, but the estimates are bound by large uncertainties due to a paucity in observations. Gap-filling empirical methods have been used to good effect to approximate pCO2 from satellite observable variables in other parts of the ocean, but many of these methods are not in agreement in the Southern Ocean. In this study we propose two additional methods that perform well in the Southern Ocean: support vector regression (SVR) and random forest regression (RFR). The methods are used to estimate ΔpCO2 in the Southern Ocean based on SOCAT v3, achieving similar trends to the SOM-FFN method by Landschützer et al. (2014). Results show that the SOM-FFN and RFR approaches have RMSEs of similar magnitude (14.84 and 16.45 µatm, where 1 atm  =  101 325 Pa) where the SVR method has a larger RMSE (24.40 µatm). However, the larger errors for SVR and RFR are, in part, due to an increase in coastal observations from SOCAT v2 to v3, where the SOM-FFN method used v2 data. The success of both SOM-FFN and RFR depends on the ability to adapt to different modes of variability. The SOM-FFN achieves this by having independent regression models for each cluster, while this flexibility is intrinsic to the RFR method. Analyses of the estimates shows that the SVR and RFR's respective sensitivity and robustness to outliers define the outcome significantly. Further analyses on the methods were performed by using a synthetic dataset to assess the following: which method (RFR or SVR) has the best performance? What is the effect of using time, latitude and longitude as proxy variables on ΔpCO2? What is the impact of the sampling bias in the SOCAT v3 dataset on the estimates? We find that while RFR is indeed better than SVR, the ensemble of the two methods outperforms either one, due to complementary strengths and weaknesses of the methods. Results also show that for the RFR and SVR implementations, it is better to include coordinates as proxy variables as RMSE scores are lowered and the phasing of the seasonal cycle is more accurate. Lastly, we show that there is only a weak bias due to undersampling. The synthetic data provide a useful framework to test methods in regions of sparse data coverage and show potential as a useful tool to evaluate methods in future studies.

Get full-text (via PubEx)

Application of visible and near-infrared spectroscopy for evaluation of ewes milk with different feeds

Animal Production Science ◽

10.1071/an17240 ◽

2019 ◽

Vol 59 (6) ◽

pp. 1190 ◽

Cited By ~ 2

Author(s):

A. Bahri ◽

S. Nawar ◽

H. Selmi ◽

M. Amraoui ◽

H. Rouissi ◽

...

Keyword(s):

Principal Component Analysis ◽

Infrared Spectroscopy ◽

Random Forest ◽

Near Infrared Spectroscopy ◽

Partial Least Squares Regression ◽

Near Infrared ◽

Principal Component ◽

Least Squares Regression ◽

Random Forest Regression ◽

Milk Samples

Rapid measurement optical techniques have the advantage over traditional methods of being faster and non-destructive. In this work visible and near-infrared spectroscopy (vis-NIRS) was used to investigate differences between measured values of key milk properties (e.g. fat, protein and lactose) in 30 samples of ewes milk according to three feed systems; faba beans, field peas and control diet. A mobile fibre-optic vis-NIR spectrophotometer (350–2500 nm) was used to collect reflectance spectra from milk samples. Principal component analysis was used to explore differences between milk samples according to the feed supplied, and a partial least-squares regression and random forest regression were adopted to develop calibration models for the prediction of milk properties. Results of the principal component analysis showed clear separation between the three groups of milk samples according to the diet of the ewes throughout the lactation period. Milk fat, protein and lactose were predicted with good accuracy by means of partial least-squares regression (R2 = 0.70–0.83 and ratio of prediction deviation, which is the ratio of standard deviation to root mean square error of prediction = 1.85–2.44). However, the best prediction results were obtained with random forest regression models (R2 = 0.86–0.90; ratio of prediction deviation = 2.73–3.26). The adoption of the vis-NIRS coupled with multivariate modelling tools can be recommended for exploring to differences between milk samples according to different feed systems, and to predict key milk properties, based particularly on the random forest regression modelling technique.

Get full-text (via PubEx)