scholarly journals Predictive Analysis for the Detection of Diabetes Mellitus (DM) based on Machine Learning Classification Algorithm

IJARCCE ◽  
2021 ◽  
Vol 10 (12) ◽  
Author(s):  
Dillip Narayan Sahu ◽  
Vijay Pal Singh
Author(s):  
Tyler F. Rooks ◽  
Andrea S. Dargie ◽  
Valeta Carol Chancey

Abstract A shortcoming of using environmental sensors for the surveillance of potentially concussive events is substantial uncertainty regarding whether the event was caused by head acceleration (“head impacts”) or sensor motion (with no head acceleration). The goal of the present study is to develop a machine learning model to classify environmental sensor data obtained in the field and evaluate the performance of the model against the performance of the proprietary classification algorithm used by the environmental sensor. Data were collected from Soldiers attending sparring sessions conducted under a U.S. Army Combatives School course. Data from one sparring session were used to train a decision tree classification algorithm to identify good and bad signals. Data from the remaining sparring sessions were kept as an external validation set. The performance of the proprietary algorithm used by the sensor was also compared to the trained algorithm performance. The trained decision tree was able to correctly classify 95% of events for internal cross-validation and 88% of events for the external validation set. Comparatively, the proprietary algorithm was only able to correctly classify 61% of the events. In general, the trained algorithm was better able to predict when a signal was good or bad compared to the proprietary algorithm. The present study shows it is possible to train a decision tree algorithm using environmental sensor data collected in the field.


2021 ◽  
Vol 11 (2) ◽  
pp. 642-650
Author(s):  
C.S. Anita ◽  
P. Nagarajan ◽  
G. Aditya Sairam ◽  
P. Ganesh ◽  
G. Deepakkumar

With the pandemic situation, there is a strong rise in the number of online jobs posted on the internet in various job portals. But some of the jobs being posted online are actually fake jobs which lead to a theft of personal information and vital information. Thus, these fake jobs can be precisely detected and classified from a pool of job posts of both fake and real jobs by using advanced deep learning as well as machine learning classification algorithms. In this paper, machine learning and deep learning algorithms are used so as to detect fake jobs and to differentiate them from real jobs. The data analysis part and data cleaning part are also proposed in this paper, so that the classification algorithm applied is highly precise and accurate. It has to be noted that the data cleaning step is a very important step in machine learning project because it actually determines the accuracy of the machine learning as well as deep learning algorithms. Hence a great importance is emphasized on data cleaning and pre-processing step in this paper. The classification and detection of fake jobs can be done with high accuracy and high precision. Hence the machine learning and deep learning algorithms have to be applied on cleaned and pre-processed data in order to achieve a better accuracy. Further, deep learning neural networks are used so as to achieve higher accuracy. Finally all these classification models are compared with each other to find the classification algorithm with highest accuracy and precision.


2020 ◽  
Vol 16 (1) ◽  
pp. 59-64
Author(s):  
Jaja Miharja ◽  
Jordy Lasmana Putra ◽  
Nur Hadianto

Analysis of hotel review sentiment is very helpful to be used as a benchmark or reference for making hotel business decisions today. However, all the review information obtained must be processed first by using an algorithm. The purpose of this study is to compare the Classification Algorithm of Machine Learning to obtain information that has a better level of accuracy in the analysis of hotel reviews. The algorithm that will be used is k-NN (k-Nearest Neighbor) and NB (Naive Bayes). After doing the calculation, the following accuracy level is obtained: k-NN of 60,50% with an AUC value of 0.632 and NB of 85,25% with an AUC value of 0.658. These results can be determined by the right algorithm to assist in making accurate decisions by business people in the analysis of hotel reviews using the NB Algorithm.


Author(s):  
Ankit Singh

Cardiovascular Disease is the leading cause of death (Approximately, 17 million people every year) in the all the area of the world. Prediction of heart disease is the critical challenge in the area of the clinical data analysis. The objective of paper is to build the model for predicting the Heart Disease using various machine learning classification algorithm. Classification is a powerful machine learning technique that is commonly used for prediction. Some of the classification algorithm are Logistic Regression, Support Vector Machine, Naïve Bayes, Decision Tree, Random Forest Classifier, KNN. This paper investigate which algorithm is used for the improving the accuracy in the prediction of heart disease. And, a comparative analysis on the accuracy and mean squared error is to done for predicting the best model. The result of the study indicates that KNN algorithm is effective in predicting the model with the accuracy of the 85.71% and having a very low mean squared error.


2021 ◽  
Vol 2087 (1) ◽  
pp. 012095
Author(s):  
Zhangchi Ying ◽  
Yuteng Huang ◽  
Ke Chen ◽  
Tianqi Yu

Abstract Aiming at the low cleaning rate of the traditional multi-source heterogeneous power grid big data cleaning model, a multi-source heterogeneous power grid big data cleaning model based on machine learning classification algorithm is designed. By capturing high-quality multi-source heterogeneous power grid big data, weight labeling of data source importance measurement, data attributes and tuples, and constructing Tan network based on the idea of machine learning classification algorithm, the data probability value is finally used to complete the classification and cleaning of inaccurate data. Experiments show that the model based on machine learning classification algorithm can effectively improve the imprecise data cleaning rate compared with the traditional model to solve multi-source heterogeneous imprecise data cleaning.


Sign in / Sign up

Export Citation Format

Share Document