A Benchmark Feature Selection Framework for Non Communicable Disease Prediction Model

2015 ◽  
Vol 21 (10) ◽  
pp. 3409-3416
Author(s):  
Daniel Hartono Sutanto ◽  
Mohd. Khanapi Abd Ghani

2015 ◽  
Vol 77 (18) ◽  
Author(s):  
Mohd. Khanapi Abd. Ghani ◽  
Daniel Hartono Sutanto

Over recent years, Non-communicable Disease (NCDs) is the high mortality rate in worldwide likely diabetes mellitus, cardiovascular diseases, liver and cancers. NCDs prediction model have problems such as redundant data, missing data, imbalance dataset and irrelevant attribute. This paper proposes a novel NCDs prediction model to improve accuracy. Our model comprisesk-means as clustering technique, Weight by SVM as feature selection technique and Support Vector Machine as classifier technique. The result shows that k-means + weight SVM + SVM improved the classification accuracy on most of all NCDs dataset (accuracy; AUC), likely Pima Indian Dataset (99.52; 0.999), Breast Cancer Diagnosis Dataset (98.85; 1.000), Breast Cancer Biopsy Dataset (97.71; 0.998), Colon Cancer (99.41; 1.000), ECG (98.33; 1.000), Liver Disorder (99.13; 0.998).The significant different performed by k-means + weight by SVM + SVM. In the time to come, we are expecting to better accuracy rate with another classifier such as Neural Network.



Author(s):  
Fábio Pittoli ◽  
Henrique Damasceno Vianna ◽  
Jorge Luis Victória Barbosa

Patients with chronic diseases should be made aware of their planned treatments as well as being kept informed of the progress of those treatments. The Chronic Prediction model was designed not only to educate patients and assist them with some chronic non-communicable disease, but to control the risk factors that affect their diseases. The model utilizes Bayesian networks to map three things: to identify the cause and effect relationships among existing risk factors; to provide treatment recommendations about these risk factors and; to aid caregivers in the treatment of the patients.



2021 ◽  
Vol 26 (6) ◽  
pp. 541-547
Author(s):  
Wiharto ◽  
Esti Suryani ◽  
Sigit Setyawan

Coronary heart disease is a non-communicable disease with high mortality. A good action to anticipate this is to do prevention, namely by carrying out a healthy lifestyle and routine early examinations. Early detection of coronary heart disease requires a number of examinations, such as demographics, ECG, laboratory, symptoms, and even angiography. The number of inspection parameters in the context of early detection will have an impact on the time and costs that must be incurred. Selection of the right and important inspection parameters will save time and costs. This study proposes an intelligence system model for the detection of coronary heart disease by using a minimal examination attribute, with performance in the good category. This research method is divided into a number of stages, namely data normalization, feature selection, classification, and performance analysis. Feature selection uses a Two-tier feature selection framework consisting of correlation-based filters and wrappers. The system model is tested using a number of datasets, and classification algorithms. The test results show that the proposed two-tier feature selection framework is able to reduce the highest attribute of 73.51% in the z-Alizadeh Sani dataset. The performance of the system using the bagging-PART algorithm is able to provide the best performance with parameters area under the curve (AUC) 95.4%, sensitivity 95.9% while accuracy is 94.1%. Referring to the AUC value, the proposed system model is included in the good category.



2021 ◽  
Vol 10 (1) ◽  
pp. 46
Author(s):  
Maria Yousef ◽  
Prof. Khaled Batiha

These days, heart disease comes to be one of the major health problems which have affected the lives of people in the whole world. Moreover, death due to heart disease is increasing day by day. So the heart disease prediction systems play an important role in the prevention of heart problems. Where these prediction systems assist doctors in making the right decision to diagnose heart disease easily. The existing prediction systems suffering from the high dimensionality problem of selected features that increase the prediction time and decrease the performance accuracy of the prediction due to many redundant or irrelevant features. Therefore, this paper aims to provide a solution of the dimensionality problem by proposing a new mixed model for heart disease prediction based on (Naïve Bayes method, and machine learning classifiers).In this study, we proposed a new heart disease prediction model (NB-SKDR) based on the Naïve Bayes algorithm (NB) and several machine learning techniques including Support Vector Machine, K-Nearest Neighbors, Decision Tree, and Random Forest. This prediction model consists of three main phases which include: preprocessing, feature selection, and classification. The main objective of this proposed model is to improve the performance of the prediction system and finding the best subset of features. This proposed approach uses the Naïve Bayes technique based on the Bayes theorem to select the best subset of features for the next classification phase, also to handle the high dimensionality problem by avoiding unnecessary features and select only the important ones in an attempt to improve the efficiency and accuracy of classifiers. This method is able to reduce the number of features from 13 to 6 which are (age, gender, blood pressure, fasting blood sugar, cholesterol, exercise induce engine) by determining the dependency between a set of attributes. The dependent attributes are the attributes in which an attribute depends on the other attribute in deciding the value of the class attribute. The dependency between attributes is measured by the conditional probability, which can be easily computed by Bayes theorem. Moreover, in the classification phase, the proposed system uses different classification algorithms such as (DT Decision Tree, RF Random Forest, SVM Support Vector machine, KNN Nearest Neighbors) as a classifiers for predicting whether a patient has heart disease or not. The model is trained and evaluated using the Cleveland Heart Disease database, which contains 13 features and 303 samples.Different algorithms use different rules for producing different representations of knowledge. So, the selection of algorithms to build our model is based on their performance. In this work, we applied and compared several classification algorithms which are (DT, SVM, RF, and KNN) to identify the best-suited algorithm to achieve high accuracy in the prediction of heart disease. After combining the Naive Bayes method with each one of these previous classifiers the performance of these combines algorithms is evaluated by different performance metrics such as (Specificity, Sensitivity, and Accuracy). Where the experimental results show that out of these four classification models, the combination between the Naive Bayes feature selection approach and the SVM RBF classifier can predict heart disease with the highest accuracy of 98%. Finally, the proposed approach is compared with another two systems which developed based on two different approaches in the feature selection step. The first system, based on the Genetic Algorithm (GA) technique, and the second uses the Principal Component Analysis (PCA) technique. Consequently, the comparison proved that the Naive Bayes selection approach of the proposed system is better than the GA and PCA approach in terms of prediction accuracy.   





2021 ◽  
Vol 14 (1) ◽  
Author(s):  
Rita Suhuyini Salifu ◽  
Khumbulani W. Hlongwana

Abstract Objectives To explore the mechanisms of collaboration between the stakeholders, including National Tuberculosis Control Program (NTP) and the Non-Communicable Disease Control and Prevention Program (NCDCP) at the national, regional, and local (health facility) levels of the health care system in Ghana. This is one of the objectives in a study on the “Barriers and Facilitators to the Implementation of the Collaborative Framework for the Care and Control of Tuberculosis and Diabetes in Ghana” Results The data analysis revealed 4 key themes. These were (1) Increased support for communicable diseases (CDs) compared to stagnant support for non-communicable diseases (NCDs), (2) Donor support, (3) Poor collaboration between NTP and NCDCP, and (4) Low Tuberculosis-Diabetes Mellitus (TB-DM) case detection.



Sign in / Sign up

Export Citation Format

Share Document