Prediction of Breast Cancer using Decision tree and Random Forest Algorithm

One of the most dreadful disease is breast cancer and it has a potential cause for death in women. Every year, death rate increases drastically due to breast cancer. An effective way to classify data is through classification or data mining. This becomes very handy, especially in the medical field where diagnosis and analysis are done through these techniques. Wisconsin Breast cancer dataset is used to perform a comparison between SVM, Logistic Regression, Naïve Bayes and Random Forest. Evaluating the correctness in classifying data based on accuracy and time consumption is used to determine the efficiency of the algorithms, which is the main objective. Based on the result of performed experiments, the Random Forest algorithm shows the highest accuracy (99.76%) with the least error rate. ANACONDA Data Science Platform is used to execute all the experiments in a simulated environment.

Download Full-text

Sentiment Analysis of Social Media Users Using Naïve Bayes, Decision Tree, Random Forest Algorithm: A Case Study of Draft Law on the Elimination of Sexual Violence (RUU PKS)

2019 International Conference on Sustainable Engineering and Creative Computing (ICSECC) ◽

10.1109/icsecc.2019.8907228 ◽

2019 ◽

Author(s):

Khalisa Virra ◽

Rachmadita Andreswari ◽

Muhammad Azani Hasibuan

Keyword(s):

Social Media ◽

Random Forest ◽

Decision Tree ◽

Sexual Violence ◽

Sentiment Analysis ◽

Naive Bayes ◽

Naïve Bayes ◽

Random Forest Algorithm

Download Full-text

Sentiment Analysis on Youtube Social Media Using Decision Tree and Random Forest Algorithm: A Case Study

2020 International Conference on Data Science and Its Applications (ICoDSA) ◽

10.1109/icodsa50139.2020.9213078 ◽

2020 ◽

Author(s):

Mohammad Aufar ◽

Rachmadita Andreswari ◽

Dita Pramesti

Keyword(s):

Social Media ◽

Random Forest ◽

Decision Tree ◽

Sentiment Analysis ◽

Random Forest Algorithm

Download Full-text

Usage of Data Mining Techniques in Predicting the Heart Diseases Decision Tree & Random Forest Algorithm

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.h7168.129219 ◽

2019 ◽

Vol 9 (2) ◽

pp. 963-967

Keyword(s):

Data Mining ◽

Heart Disease ◽

Random Forest ◽

Early Diagnosis ◽

Decision Tree ◽

Heart Diseases ◽

Classification Algorithms ◽

Random Forest Algorithm ◽

Medical Field ◽

Data Mining Techniques

Nowadays, heart disease is the main cause of several deaths among all other diseases. Due to the lack of resources in the medical field, the prediction of heart diseases becomes a major problem. For early diagnosis and treatment, some classification algorithms such as Decision Tree and Random Forest Algorithm are used. The data mining techniques compare the accuracy of the algorithm and predict heart diseases. The main aim of this paper is to predict heart disease based on the dataset values. In this paper we are comparing the accuracy of above two algorithms. To implement these methods the following steps are used. In first phase, a dataset of 13 attributes is collected and it was applied on classification techniques using the Decision tree and Random Forest Algorithms. Finally, the accuracy is collected for both the algorithms. In this paper we observed that random forest is generating better results than decision tree in prediction of heart diseases.

Download Full-text

Sentiment Analysis of Social Media Twitter with Case of Anti-LGBT Campaign in Indonesia using Naïve Bayes, Decision Tree, and Random Forest Algorithm

Procedia Computer Science ◽

10.1016/j.procs.2019.11.181 ◽

2019 ◽

Vol 161 ◽

pp. 765-772 ◽

Cited By ~ 6

Author(s):

Veny Amilia Fitri ◽

Rachmadita Andreswari ◽

Muhammad Azani Hasibuan

Keyword(s):

Social Media ◽

Random Forest ◽

Decision Tree ◽

Sentiment Analysis ◽

Naive Bayes ◽

Naïve Bayes ◽

Random Forest Algorithm

Download Full-text

Accident Prediction Accuracy Assessment for Highway-Rail Grade Crossings Using Random Forest Algorithm Compared with Decision Tree

Reliability Engineering & System Safety ◽

10.1016/j.ress.2020.106931 ◽

2020 ◽

Vol 200 ◽

pp. 106931 ◽

Cited By ~ 10

Author(s):

Xiaoyi Zhou ◽

Pan Lu ◽

Zijian Zheng ◽

Denver Tolliver ◽

Amin Keramati

Keyword(s):

Random Forest ◽

Decision Tree ◽

Prediction Accuracy ◽

Accuracy Assessment ◽

Random Forest Algorithm ◽

Accident Prediction ◽

Grade Crossings

Download Full-text

Breast Cancer Risk Prediction using XGBoost and Random Forest Algorithm

2020 11th International Conference on Computing, Communication and Networking Technologies (ICCCNT) ◽

10.1109/icccnt49239.2020.9225451 ◽

2020 ◽

Author(s):

Sajib Kabiraj ◽

M. Raihan ◽

Nasif Alvi ◽

Marina Afrin ◽

Laboni Akter ◽

...

Keyword(s):

Breast Cancer ◽

Random Forest ◽

Breast Cancer Risk ◽

Cancer Risk ◽

Risk Prediction ◽

Random Forest Algorithm

Download Full-text

CLASSIFICATION OF ORCHARD CROP USING SENTINEL-1A SYNTHETIC APERTURE RADAR DATA

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xlii-5-335-2018 ◽

2018 ◽

Vol XLII-5 ◽

pp. 335-338 ◽

Cited By ~ 1

Author(s):

H. Sahu ◽

D. Haldar ◽

A. Danodia ◽

S. Kumar

Keyword(s):

Maximum Likelihood ◽

Random Forest ◽

Decision Tree ◽

Decision Tree Algorithm ◽

Random Forest Algorithm ◽

Synthetic Aperture Radar Data ◽

Tree Algorithm ◽

Maximum Likelihood Classifier ◽

Sar Data

Abstract. A study was conducted in Saharanpur District of Uttar Pradesh to asses the potential of Sentinel-1A SAR Data in orchard crop classification. The objective of the study was to evaluate three different classifiers that are maximum likelihood classifier, decision tree algorithm and random forest algorithm in Sentinel-1A SAR Data. An attempt is made to study Sentinel-1A SAR Data to classify orchard crop using this approach. Here the rule-based classifiers such as decision tree algorithm and random forest algorithm are compared with conventional maximum likelihood classifier. Statistical analysis of the classification show that the distribution of the crop, forest orchard, settlement and waterbody was 17.47%, 0.47%, 28.3%, 28.3% and 25.5% respectively in all the classification algorithm but root mean square error for maximum likelihood classifier (1.278) is more than decision tree algorithm (1.196) and random forest algorithm (1.193). Out of three, a percentage correct prediction is highest in case of decision tree algorithm (73.4) than random forest algorithm (72.5) and least for maximum likelihood classifier (66.8) in December 2017. The accuracy for orchard class is 0.81 for maximum likelihood classifier, 0.80 for decision tree algorithm and 0.78 for random forest algorithm. Thus Sentinel-1A SAR Data was effectively utilized for the classification of orchard crops.

Download Full-text

Genre e-sport gaming tournament classification using machine learning technique based on decision tree, Naïve Bayes, and random forest algorithm

IOP Conference Series Materials Science and Engineering ◽

10.1088/1757-899x/1088/1/012037 ◽

2021 ◽

Vol 1088 (1) ◽

pp. 012037

Author(s):

Arif Rinaldi Dikananda ◽

Irfan Ali ◽

Fathurrohman ◽

Rizki Ade Rinaldi ◽

Iin

Keyword(s):

Machine Learning ◽

Random Forest ◽

Decision Tree ◽

Naive Bayes ◽

Naïve Bayes ◽

Random Forest Algorithm ◽

Machine Learning Technique ◽

Learning Technique

Download Full-text

Research on Random Forest Algorithm Based on Big Data in Parallel Load Forecasting

MATEC Web of Conferences ◽

10.1051/matecconf/201822801020 ◽

2018 ◽

Vol 228 ◽

pp. 01020 ◽

Cited By ~ 1

Author(s):

Qingqing Liu

Keyword(s):

Random Forest ◽

Decision Tree ◽

Prediction Accuracy ◽

Load Forecasting ◽

Large Data ◽

Prototype System ◽

Data Sets ◽

Random Forest Algorithm ◽

Cluster Management ◽

Forecasting Method

The paper propose a parallel load forecasting method based on random forest algorithm, through the analysis of historical load, temperature, wind speed and other data, the algorithm can shorten the load forecasting time and improve the processing capability of large data. This paper also designs and implements parallel load forecasting prototype system based on power user side large data of a Hadoop, including data cluster management, data management, prediction classification algorithm library and other functions. The experimental results show that the accuracy of parallel stochastic forest algorithm is obviously higher than decision tree, and the prediction accuracy on the different data sets is generally higher than decision tree, and it can better analyze and process large data.

Download Full-text