Experimental and Comparison Based Study on Diabetes Prediction Using Artificial Neural Network

Author(s):  
Nitesh Pradhan ◽  
VijayPal Singh Dhaka ◽  
Satish Chandra Kulhari

Background: Diabetes is spreading in the entire world. In a survey, it is observed that every generation from child to old age people are suffering from diabetes. If diabetes is not identified in time, it may lead to deadliest disease. Prediction of diabetes is of the utmost challenging task by machines. In the human body, diabetes is one of the perilous maladies that creates depended disease such as kidney disease, heart attack, blindness etc. Thus it is very important to diagnose diabetes in time. Objective: Our target is to develop a system using Artificial Neural Network(ANN), with the ability to predict whether a patient suffers from diabetes or not. Method: This paper illustrates various machine learning techniques in form of literature review; such as Support Vector Machine, Naïve Bayes, K Nearest Neighbor, Decision Tree, Random Forest Etc. We applied ANN to predict diabetes. In this paper, the architecture of ANN consists of four hidden layers each of six neurons and one output layer with one neuron. Optimizer used for the architecture is ‘Adam’. Results: We have Pima Indian diabetes dataset of sufficient number of patients with nine different symptoms with respect to the patients and nine different features in connection with the mathematical computation/prediction. Hence we bifurcate the dataset into training and testing set in majority and minority ratio of 80:20 respectively. It facilitates us the majority patient’s data to be used as training set and minority data to be used as testing set. We train our network for multiple epoch with different activation function. We used four hidden layers with six neurons in each hidden layer and one output layer. On the hidden layer, we used multiple activation functions such as sigmoid, ReLU etc. and obtained beat accuracy (88.71%) in 600 epochs with ReLU activation function. On the output layer, we used only sigmoid activation function because we have only two classes in our dataset. Conclusion: Diabetes prediction by machine is a challenging task. So many machine learning algorithms exist to predict the diabetes such as Naïve Bayes, decision tree, K nearest neighbor, support vector machine etc. This paper presents a novel approach to predict whether a patient has diabetes or not based on Pima Indian diabetes dataset. In this paper, we used artificial neural network to train out network and it is observed that artificial neural network approach performs better than all other classifiers

Author(s):  
S. Vijaya Rani ◽  
G. N. K. Suresh Babu

The illegal hackers  penetrate the servers and networks of corporate and financial institutions to gain money and extract vital information. The hacking varies from one computing system to many system. They gain access by sending malicious packets in the network through virus, worms, Trojan horses etc. The hackers scan a network through various tools and collect information of network and host. Hence it is very much essential to detect the attacks as they enter into a network. The methods  available for intrusion detection are Naive Bayes, Decision tree, Support Vector Machine, K-Nearest Neighbor, Artificial Neural Networks. A neural network consists of processing units in complex manner and able to store information and make it functional for use. It acts like human brain and takes knowledge from the environment through training and learning process. Many algorithms are available for learning process This work carry out research on analysis of malicious packets and predicting the error rate in detection of injured packets through artificial neural network algorithms.


Author(s):  
Edy Fradinata ◽  
Sakesun Suthummanon ◽  
Wannarat Suntiamorntut

This paper presents architecture of backpropagation Artificial Neural Network (ANN) and Support Vector Regression (SVR) models in supervised learning process for cement demand dataset. This study aims to identify the effectiveness of each parameter of mean square error (MSE) indicators for time series dataset. The study varies different random sample in each demand parameter in the network of ANN and support vector function as well. The variations of percent datasets from activation function, learning rate of sigmoid and purelin, hidden layer, neurons, and training function should be applied for ANN. Furthermore, SVR is varied in kernel function, lost function and insensitivity to obtain the best result from its simulation. The best results of this study for ANN activation function is Sigmoid. The amount of data input is 100% or 96 of data, 150 learning rates, one hidden layer, trinlm training function, 15 neurons and 3 total layers. The best results for SVR are six variables that run in optimal condition, kernel function is linear, loss function is ౬-insensitive, and insensitivity was 1. The better results for both methods are six variables. The contribution of this study is to obtain the optimal parameters for specific variables of ANN and SVR.


Author(s):  
Maria Morgan ◽  
Carla Blank ◽  
Raed Seetan

<p>This paper investigates the capability of six existing classification algorithms (Artificial Neural Network, Naïve Bayes, k-Nearest Neighbor, Support Vector Machine, Decision Tree and Random Forest) in classifying and predicting diseases in soybean and mushroom datasets using datasets with numerical or categorical attributes. While many similar studies have been conducted on datasets of images to predict plant diseases, the main objective of this study is to suggest classification methods that can be used for disease classification and prediction in datasets that contain raw measurements instead of images. A fungus and a plant dataset, which had many differences, were chosen so that the findings in this paper could be applied to future research for disease prediction and classification in a variety of datasets which contain raw measurements. A key difference between the two datasets, other than one being a fungus and one being a plant, is that the mushroom dataset is balanced and only contained two classes while the soybean dataset is imbalanced and contained eighteen classes. All six algorithms performed well on the mushroom dataset, while the Artificial Neural Network and k-Nearest Neighbor algorithms performed best on the soybean dataset. The findings of this paper can be applied to future research on disease classification and prediction in a variety of dataset types such as fungi, plants, humans, and animals.</p>


Author(s):  
Julio Narabel ◽  
Setia Budi

In the fitness industry, the number of members is a major factor for the sustainability of its business. The ability of managers and trainers to detect members who represent traits to quit membership is critical. Four supervised learning classification methods like Support Vector Machine, Random Forest, K-Nearest Neighbor, and Artificial Neural Network were used to generate early detection using two variants of datasets that have different amounts of data. Classification results are separated into three different zones, which are Green Zone, Yellow Zone, and Red Zone. Artificial Neural Network methods using backpropagation training give 99.90% of accuracy on a dataset which has more amount of data. The evaluation has been done using the confusion matrix and AUC-ROC curves.


PeerJ ◽  
2021 ◽  
Vol 9 ◽  
pp. e11529
Author(s):  
Adel M. Al-Saif ◽  
Mahmoud Abdel-Sattar ◽  
Abdulwahed M. Aboukarima ◽  
Dalia H. Eshra

In the fresh fruit industry, identification of fruit cultivars and fruit quality is of vital importance. In the current study, nine peach cultivars (Dixon, Early Grande, Flordaprince, Flordastar, Flordaglo, Florda 834, TropicSnow, Desertred, and Swelling) were evaluated for differences in skin color, firmness, and size. Additionally, a multilayer perceptron (MLP) artificial neural network was applied for identification of the cultivars according to these attributes. The MLP was trained with an input layer including six input nodes, a single hidden layer with six hidden nodes, and an output layer with nine output nodes. A hyperbolic tangent activation function was used in the hidden layer and the cross entropy error was given because the softmax activation function was functional to the output layer. Results showed that the cross entropy error was 0.165. The peach identification process was significantly affected by the following variables in order of contribution (normalized importance): polar diameter (100%), L∗ (89.0), b∗ (88.0%), a∗ (78.5%), firmness (71.3%), and cross diameter (37.5.3%). The MLP was found to be a viable method of peach cultivar identification and classification because few identifying attributes were required and an overall classification accuracy of 100% was achieved in the testing phase. Measurements and quantitative discrimination of peach properties are provided in this research; these data may help enhance the processing efficiency and quality of processed peaches.


2018 ◽  
Vol 204 ◽  
pp. 02018
Author(s):  
Aisyah Larasati ◽  
Anik Dwiastutik ◽  
Darin Ramadhanti ◽  
Aal Mahardika

This study aims to explore the effect of kurtosis level of the data in the output layer on the accuracy of artificial neural network predictive models. The artificial neural network predictive models are comprised of one node in the output layer and six nodes in the input layer. The number of hidden layer is automatically built by the program. Data are generated using simulation approach. The results show that the kurtosis level of the node in the output layer is significantly affect the accuracy of the artificial neural network predictive model. Platycurtic and leptocurtic data has significantly higher misclassification rates than mesocurtic data. However, the misclassification rates between platycurtic and leptocurtic is not significantly different. Thus, data distribution with kurtosis nearly to zero results in a better ANN predictive model.


2004 ◽  
Vol 67 (8) ◽  
pp. 1604-1609 ◽  
Author(s):  
UBONRATANA SIRIPATRAWAN ◽  
JOHN E. LINZ ◽  
BRUCE R. HARTE

An electronic sensor array with 12 nonspecific metal oxide sensors was evaluated for its ability to monitor volatile compounds in super broth alone and in super broth inoculated with Escherichia coli (ATCC 25922) at 37°C for 2 to 12 h. Using discriminant function analysis, it was possible to differentiate super broth alone from that containing E. coli when cell numbers were 105 CFU or more. There was a good agreement between the volatile profiles from the electronic sensor array and a gas chromatography–mass spectrometer method. The potential to predict the number of E. coli and the concentration of specific metabolic compounds was investigated using an artificial neural network (ANN). The artificial neural network was composed of an input layer, one hidden layer, and an output layer, with a hyperbolic tangent sigmoidal transfer function in the hidden layer and a linear transfer function in the output layer. Good prediction was found as measured by a regression coefficient (R2 = 0.999) between actual and predicted data.


Sign in / Sign up

Export Citation Format

Share Document