Rule Generation of Cataract Patient Data Using Random Forest Algorithm

Prediction of Prognosis and Survival of Patients with Gastric Cancer by Weighted Improved Random Forest Model

Archives of Medical Science ◽

10.5114/aoms/135594 ◽

2021 ◽

Author(s):

Cheng Xu ◽

Jing Wang ◽

TianLong Zheng ◽

Yue Cao ◽

Fan Ye

Keyword(s):

Gastric Cancer ◽

Random Forest ◽

Cancer Patient ◽

Gastric Cancer Patient ◽

Patient Data ◽

Random Forest Model ◽

Average Increase ◽

Random Forest Algorithm ◽

Generalization Ability ◽

Forest Model

Introduction: Predicting the survival status of patients based on their prognosis is important because it can help physicians evaluate treatment decisions. Random Forest is a strong machine learning algorithm even without modification. We propose a new Random Forest weighting method, apply it to gastric cancer patient data from the Surveillance, Epidemiology, and End Results (SEER) program, and then evaluate the generalization ability of this weighted Random Forest algorithm on 10 public medical datasets. Furthermore, for the same weighting mode, we explore the difference between using out-of-bag (OOB) data and the full training set as the weighting basis.

Material and methods: The experiment included 110,697 cases of gastric cancer patients diagnosed between 1975 and 2016, obtained from the SEER database. In addition, 10 public medical datasets were used to evaluate the generalization ability of the weighted Random Forest algorithm.

Results: On the SEER gastric cancer patient data, the weighted Random Forest algorithm improves accuracy by 0.79% compared with the original Random Forest. In AUC, macro-averaging increased by 2.32% and micro-averaging increased by 0.51% on average. Among the 10 public datasets, the Random Forest weighted by accuracy has the best performance on 6 datasets, with an average increase of 1.44% in accuracy and an average increase of 1.2% in AUC.

Conclusions: Compared with the original Random Forest, the weighted Random Forest model shows a significant improvement in performance, and using all training data as the weighting basis works better than using OOB data.
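The abstract does not spell out the weighting formula, so the following is only a minimal sketch of the general idea of weighted voting in a forest. It assumes, hypothetically, that each tree's weight is its accuracy on some weighting basis (either the full training set or that tree's OOB samples). The function names `tree_weights` and `weighted_vote` and the toy data are illustrative and are not taken from the paper.

```python
from collections import defaultdict


def tree_weights(tree_preds, labels):
    """Assumed scheme: weight each tree by its accuracy on the weighting basis.

    tree_preds holds one list of predictions per tree, aligned with labels.
    """
    weights = []
    for preds in tree_preds:
        correct = sum(p == y for p, y in zip(preds, labels))
        weights.append(correct / len(labels))
    return weights


def weighted_vote(sample_preds, weights):
    """Return the class whose voting trees have the largest total weight."""
    totals = defaultdict(float)
    for pred, w in zip(sample_preds, weights):
        totals[pred] += w
    return max(totals, key=totals.get)


# Toy weighting basis: four labelled samples, three trees.
basis_labels = [1, 0, 1, 1]
basis_preds = [
    [1, 0, 1, 1],  # tree 0 is right on 4 of 4 samples
    [0, 0, 0, 1],  # tree 1 is right on 2 of 4 samples
    [0, 1, 0, 1],  # tree 2 is right on 1 of 4 samples
]
weights = tree_weights(basis_preds, basis_labels)

# New sample: tree 0 votes class 1, trees 1 and 2 vote class 0.
print(weighted_vote([1, 0, 0], weights))
```

In this toy case an unweighted majority vote would pick class 0, while the weighted vote follows the single most accurate tree. To mimic an OOB weighting basis, each tree would be scored only on the samples left out of its own bootstrap draw, which this sketch does not model.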


Prediction of Breast Cancer using Decision tree and Random Forest Algorithm

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v6i2.226229 ◽

2018 ◽

Vol 6 (2) ◽

pp. 226-229

Author(s):

N. Sridevi ◽

S. Anitha

Keyword(s):

Breast Cancer ◽

Random Forest ◽

Decision Tree ◽

Random Forest Algorithm


METHOD SUGGESTING CITY WALKING ROUTES FOR PEDESTRIANS USING AN EXAMPLE OF SAINT-PETERSBURG

Informatization and communication ◽

10.34219/2078-8320-2019-10-3-71-76 ◽

2019 ◽

pp. 71-76

Author(s):

A.E. Semenov

Keyword(s):

Random Forest ◽

Develop Model ◽

Random Forest Algorithm ◽

Saint Petersburg ◽

Pedestrian Navigation ◽

Factors Influencing ◽

The City

A method of pedestrian navigation in cities was investigated, using Saint-Petersburg as an example. The factors that influence people when they choose a walking route were determined. Based on these factors, corresponding data were collected and used to develop a model that determines the attractiveness of a street in the city using the Random Forest algorithm. The results show that routes provided by the method are 14% more attractive and just 6% longer than the shortest ones.


MetalExplorer, a Bioinformatics Tool for the Improved Prediction of Eight Types of Metal-Binding Sites Using a Random Forest Algorithm with Two-Step Feature Selection

Current Bioinformatics ◽

10.2174/2468422806666160618091522 ◽

2017 ◽

Vol 12 (6) ◽

Cited By ~ 6

Author(s):

Jiangning Song ◽

Chen Li ◽

Cheng Zheng ◽

Jerico Revote ◽

Ziding Zhang ◽

...

Keyword(s):

Feature Selection ◽

Random Forest ◽

Metal Binding ◽

Binding Sites ◽

Random Forest Algorithm ◽

Bioinformatics Tool ◽

Metal Binding Sites


Prediction model based on the Laplacian eigenmap method combined with a random forest algorithm for rainstorm satellite images during the first annual rainy season in South China

Natural Hazards ◽

10.1007/s11069-021-04585-0 ◽

2021 ◽

Author(s):

Xiao-yan Huang ◽

Li He ◽

Hua-sheng Zhao ◽

Ying Huang ◽

Yu-shuang Wu

Keyword(s):

Random Forest ◽

Prediction Model ◽

South China ◽

Rainy Season ◽

Satellite Images ◽

Random Forest Algorithm ◽

Model Based


Classification and photometric redshift estimation of quasars in photometric surveys

Proceedings of the International Astronomical Union ◽

10.1017/s1743921320001829 ◽

2020 ◽

Vol 15 (S359) ◽

pp. 40-41

Author(s):

L. M. Izuti Nakazono ◽

C. Mendes de Oliveira ◽

N. S. T. Hirata ◽

S. Jeram ◽

A. Gonzalez ◽

...

Keyword(s):

Machine Learning ◽

Random Forest ◽

Nearest Neighbour ◽

Random Forest Algorithm ◽

Photometric Redshift ◽

Using Data

Abstract: We present a machine learning methodology to separate quasars from galaxies and stars using data from S-PLUS in the Stripe-82 region. In terms of quasar classification, we achieved 95.49% for precision and 95.26% for recall using a Random Forest algorithm. For photometric redshift estimation, we obtained a precision of 6% using k-Nearest Neighbour.
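For readers unfamiliar with the two classification metrics quoted above, here is a small sketch of how precision and recall for one class are computed from true and predicted labels. The helper name `precision_recall` and the toy labels are hypothetical and are not from the paper; the paper's actual evaluation pipeline is not described in the abstract.

```python
def precision_recall(y_true, y_pred, positive):
    """Compute precision and recall for the class given as `positive`."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Toy labels for a three-class problem (quasar, star, galaxy).
y_true = ["qso", "qso", "star", "galaxy", "qso", "star"]
y_pred = ["qso", "star", "qso", "galaxy", "qso", "star"]

precision, recall = precision_recall(y_true, y_pred, positive="qso")
print(f"precision={precision:.3f} recall={recall:.3f}")
```

Precision answers "of the objects labelled quasar, how many really are quasars", while recall answers "of the true quasars, how many were found".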


Mapping maize crop coefficient Kc using random forest algorithm based on leaf area index and UAV-based multispectral vegetation indices

Agricultural Water Management ◽

10.1016/j.agwat.2021.106906 ◽

2021 ◽

Vol 252 ◽

pp. 106906

Author(s):

Guomin Shao ◽

Wenting Han ◽

Huihui Zhang ◽

Shouyang Liu ◽

Yi Wang ◽

...

Keyword(s):

Random Forest ◽

Leaf Area Index ◽

Leaf Area ◽

Vegetation Indices ◽

Crop Coefficient ◽

Random Forest Algorithm ◽

Maize Crop ◽

Area Index


Identifying different types of urban land use dynamics using Point-of-interest (POI) and Random Forest algorithm: The case of Huizhou, China

Cities ◽

10.1016/j.cities.2021.103202 ◽

2021 ◽

Vol 114 ◽

pp. 103202

Author(s):

Rong Wu ◽

Jieyu Wang ◽

Dachuan Zhang ◽

Shaojian Wang

Keyword(s):

Land Use ◽

Random Forest ◽

Urban Land ◽

Urban Land Use ◽

Random Forest Algorithm ◽

Point Of Interest ◽

Land Use Dynamics ◽

Different Types


Image Classification of Rice Leaf Diseases Using Random Forest Algorithm

2021 Joint International Conference on Digital Arts, Media and Technology with ECTI Northern Section Conference on Electrical, Electronics, Computer and Telecommunication Engineering ◽

10.1109/ectidamtncon51128.2021.9425696 ◽

2021 ◽

Author(s):

Panuwat Mekha ◽

Nutnicha Teeyasuksaet

Keyword(s):

Random Forest ◽

Image Classification ◽

Random Forest Algorithm ◽

Rice Leaf


Random forest classification for predicting lifespan-extending chemical compounds

Scientific Reports ◽

10.1038/s41598-021-93070-6 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Sofia Kapsiani ◽

Brendan J. Howlin

Keyword(s):

Caenorhabditis Elegans ◽

Random Forest ◽

Molecular Descriptors ◽

Area Under The Curve ◽

Chemical Compounds ◽

Research Area ◽

Importance Measure ◽

Random Forest Algorithm ◽

Molecular Fingerprints ◽

Age Related

Abstract: Ageing is a major risk factor for many conditions including cancer, cardiovascular and neurodegenerative diseases. Pharmaceutical interventions that slow down ageing and delay the onset of age-related diseases are a growing research area. The aim of this study was to build a machine learning model based on the data of the DrugAge database to predict whether a chemical compound will extend the lifespan of Caenorhabditis elegans. Five predictive models were built using the random forest algorithm with molecular fingerprints and/or molecular descriptors as features. The best performing classifier, built using molecular descriptors, achieved an area under the curve (AUC) score of 0.815 for classifying the compounds in the test set. The features of the model were ranked using the Gini importance measure of the random forest algorithm. The top 30 features included descriptors related to atom and bond counts, topological and partial charge properties. The model was applied to predict the class of compounds in an external database consisting of 1738 small molecules. The chemical compounds of the screening database with a predictive probability of ≥ 0.80 for increasing the lifespan of Caenorhabditis elegans were broadly separated into (1) flavonoids, (2) fatty acids and conjugates, and (3) organooxygen compounds.
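As a rough illustration of two steps mentioned in this abstract, the sketch below computes an AUC score from predicted probabilities and then keeps only the items at or above a 0.80 probability cut-off. It uses the pairwise definition of AUC (the share of positive/negative pairs ranked correctly, with ties counted as half). The function name `auc_pairwise` and the toy scores are assumptions for illustration and do not reproduce the study's data or code.

```python
def auc_pairwise(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5  # a tie counts as half a correct ranking
    return wins / (len(pos) * len(neg))


# Toy test set: 1 = extends lifespan, 0 = does not (hypothetical values).
labels = [1, 1, 0, 1, 0]
scores = [0.9, 0.8, 0.7, 0.4, 0.2]
auc = auc_pairwise(labels, scores)

# Screening step: keep indices with predicted probability >= 0.80.
selected = [i for i, s in enumerate(scores) if s >= 0.80]
print(round(auc, 3), selected)
```

The pairwise loop is quadratic in the number of samples, which is fine for a small illustration; for large test sets a rank-based computation would be the usual choice.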
