scholarly journals Atomic partial charge predictions for furanoses by random forest regression with atom type symmetry function

RSC Advances ◽  
2020 ◽  
Vol 10 (2) ◽  
pp. 666-673 ◽  
Author(s):  
Xiaocong Wang ◽  
Jun Gao

Atom type symmetry function that utilizes atom types defined in traditional force fields demonstrated improvements for describing structures of furanoses, and the capability of predicting their conformational adaptive charges with random forest regression models.

Genes ◽  
2018 ◽  
Vol 9 (2) ◽  
pp. 104 ◽  
Author(s):  
Aeriel Belk ◽  
Zhenjiang Zech Xu ◽  
David O. Carter ◽  
Aaron Lynne ◽  
Sibyl Bucheli ◽  
...  

2020 ◽  
Author(s):  
Peijia Liu ◽  
Dong Yang ◽  
Shaomin Li ◽  
Yutian Chong ◽  
Wentao Hu ◽  
...  

Abstract Background The utilization of estimating-GFR equations is critical for kidney disease in the clinic. However, the performance of the Chronic Kidney Disease Epidemiology Collaboration (CKD-EPI) equation has not improved substantially in the past eight years. Here we hypothesized that random forest regression(RF) method could go beyond revised linear regression, which is used to build the CKD-EPI equationMethods 1732 participants were enrolled in this study totally (1333 in development data set from Tianhe District and 399 in external data set Luogang District). Recursive feature elimination (RFE) is applied to the development data to select important variables and build random forest models. Then same variables were used to develop the estimated GFR equation with linear regression as a comparison. The performances of these equations are measured by bias, 30% accuracy , precision and root mean square error(RMSE).Results Of all the variables, creatinine, cystatin C, weight, body mass index (BMI), age, uric acid(UA), blood urea nitrogen(BUN), hematocrit(HCT) and apolipoprotein B(APOB) were selected by RFE method. The results revealed that the overall performance of random forest regression models ascended the revised regression models based on the same variables. In the 9-variable model, RF model was better than revised linear regression in term of bias, precision ,30%accuracy and RMSE(0.78 vs 2.98, 16.90 vs 23.62, 0.84 vs 0.80, 16.88 vs 18.70, all P<0.01 ). In the 4-variable model, random forest regression model showed an improvement in precision and RMSE compared with revised regression model. (20.82 vs 25.25, P<0.01, 19.08 vs 20.60, P<0.001). Bias and 30%accurancy were preferable, but the results were not statistically significant (0.34 vs 2.07, P=0.10, 0.8 vs 0.78, P=0.19, respectively).Conclusions The performances of random forest regression models are better than revised linear regression models when it comes to GFR estimation.


2016 ◽  
Vol 82 (3) ◽  
pp. 189-197 ◽  
Author(s):  
John W. Coulston ◽  
Christine E. Blinn ◽  
Valerie A. Thomas ◽  
Randolph H. Wynne

2020 ◽  
Author(s):  
Peijia Liu ◽  
Dong Yang ◽  
Shaomin Li ◽  
Yutian Chong ◽  
Ming Li ◽  
...  

Abstract Background The utilization of estimating-GFR equations is critical for kidney disease in the clinic. However, the performance of the Chronic Kidney Disease Epidemiology Collaboration (CKD-EPI) equation has not improved substantially in the past eight years. Here we hypothesized that random forest regression(RF) method could go beyond revised linear regression, which is used to build the CKD-EPI equation Methods 1732 participants were enrolled in this study totally (1333 in development data set from Tianhe District and 399 in external data set Luogang District). Recursive feature elimination (RFE) is applied to the development data to select important variables and build random forest models. Then same variables were used to develop the estimated GFR equation with linear regression as a comparison. The performances of these equations are measured by bias, 30% accuracy, precision and root mean square error(RMSE). Results Of all the variables, creatinine, cystatin C, weight, body mass index (BMI), age, uric acid(UA), blood urea nitrogen(BUN), hematocrit(HCT) and apolipoprotein B(APOB) were selected by RFE method. The results revealed that the overall performance of random forest regression models ascended the revised regression models based on the same variables. In the 9-variable model, RF model was better than revised linear regression in term of bias, precision ,30%accuracy and RMSE(0.78 vs 2.98, 16.90 vs 23.62, 0.84 vs 0.80, 16.88 vs 18.70, all P < 0.01 ). In the 4-variable model, random forest regression model showed an improvement in precision and RMSE compared with revised regression model. (20.82 vs 25.25, P < 0.01, 19.08 vs 20.60, P < 0.001). Bias and 30%accurancy were preferable, but the results were not statistically significant (0.34 vs 2.07, P = 0.10, 0.8 vs 0.78, P = 0.19, respectively). Conclusions The performances of random forest regression models are better than revised linear regression models when it comes to GFR estimation.


2021 ◽  
Vol 13 (16) ◽  
pp. 3123
Author(s):  
Chunzhu Wei ◽  
Qianying Zhao ◽  
Yang Lu ◽  
Dongjie Fu

Pearl River Delta (PRD), as one of the most densely populated regions in the world, is facing both natural changes (e.g., sea level rise) and human-induced changes (e.g., dredging for navigation and land reclamation). Bathymetric information is thus important for the protection and management of the estuarine environment, but little effort has been made to comprehensively evaluate the performance of different methods and datasets. In this study, two linear regression models—the linear band model and the log-transformed band ratio model, and two non-linear regression models—the support vector regression model and the random forest regression model—were applied to Landsat 8 (L8) and Sentinel-2 (S2) imagery for bathymetry mapping in 2019 and 2020. Results suggested that a priori area clustering based on spectral features using the K-means algorithm improved estimation accuracy. The random forest regression model performed best, and the three-band combinations outperformed two-band combinations in all models. When the non-linear models were applied with three-band combination (red, green, blue) to L8 and S2 imagery, the Root Mean Square Error (Mean Absolute Error) decreased by 23.10% (35.53%), and the coefficient of determination (Kling-Gupta efficiency) increased by 0.08 (0.09) on average, compared to those using the linear regression models. Despite the differences in spatial resolution and band wavelength, L8 and S2 performed similarly in bathymetry estimation. This study quantified the relative performance of different models and may shed light on the potential combination of multiple data sources for more timely and accurate bathymetry mapping.


2020 ◽  
Author(s):  
Nusrat Rouf ◽  
Majid Bashir Malik ◽  
Tasleem Arif

Abstract Introduction: Advancement in information technology, be it hardware, software or communication technology, over few decades has rapidly impacted almost every field of study. Machine learning tools and techniques are nowadays applied to every field. It has opened the ways for interdisciplinary research by promising effective analyzation and decision-making strategies. COVID-19 has badly affected more than 200 countries within a short span of time. It has drastically affected both daily activities as well as economic activities. Herd behavior of investors has triggered panic selling. As a result, stock markets around the world have plunged down.Methods: In this paper, we analyze the impact of COVID-19 on NSE (National Stock Exchange) index Nifty50. We employ Pearson Correlation and investigate the impact of total confirmed cases and daily cases on Nifty50 closing price. We use various machine learning regression models for predictive analysis viz, linear regression with polynomial terms (quadratic, cubic), Decision Tree Regression and Random Forest Regression. Model performance is measured using MSE (Mean Square Error), RMSE (Root Mean Square Error) and R2 (R Squared) evaluators. Results: Correlation analysis reveals that total confirmed cases and daily cases in both India and the World have negative correlation with Nifty50 closing prices. Moreover, Nifty50 closing prices are more negatively correlated with total confirmed and daily cases in India. Predictive analysis shows that the Random Forest Regression model outperforms all other models. Conclusion: We analyze and predict the impact of COVID-19 on closing price of Nifty50 index. We employ Pearson Correlation and investigate the impact of COVID-19 on Nifty50 closing prices. We use various machine learning regression models to predict the closing price of Nifty50 index. Results reveal that the market volatility is directly proportional to increase in number of COVID-19 cases. Random Forest Regression model has comparatively shown better RMSE and R2 values.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Chinmay P. Swami ◽  
Nicholas Lenhard ◽  
Jiyeon Kang

AbstractProsthetic arms can significantly increase the upper limb function of individuals with upper limb loss, however despite the development of various multi-DoF prosthetic arms the rate of prosthesis abandonment is still high. One of the major challenges is to design a multi-DoF controller that has high precision, robustness, and intuitiveness for daily use. The present study demonstrates a novel framework for developing a controller leveraging machine learning algorithms and movement synergies to implement natural control of a 2-DoF prosthetic wrist for activities of daily living (ADL). The data was collected during ADL tasks of ten individuals with a wrist brace emulating the absence of wrist function. Using this data, the neural network classifies the movement and then random forest regression computes the desired velocity of the prosthetic wrist. The models were trained/tested with ADLs where their robustness was tested using cross-validation and holdout data sets. The proposed framework demonstrated high accuracy (F-1 score of 99% for the classifier and Pearson’s correlation of 0.98 for the regression). Additionally, the interpretable nature of random forest regression was used to verify the targeted movement synergies. The present work provides a novel and effective framework to develop an intuitive control for multi-DoF prosthetic devices.


Measurement ◽  
2020 ◽  
pp. 108899
Author(s):  
Madi Keramat-Jahromi ◽  
Seyed Saeid Mohtasebi ◽  
Hossein Mousazadeh ◽  
Mahdi Ghasemi-Varnamkhasri ◽  
Maryam Rahimi-Movassagh

Sign in / Sign up

Export Citation Format

Share Document