Machine learning versus logistic regression methods for 2-year mortality prognostication in a small, heterogeneous glioma database

Abstract Background: Machine learning (ML) is the application of specialized algorithms to datasets for trend delineation, categorization, or prediction. ML techniques have traditionally been applied to large, high-dimensional databases. Gliomas are a heterogeneous group of primary brain tumors, traditionally graded using histopathological features. Recently, the World Health Organization proposed a novel grading system for gliomas that incorporates molecular characteristics. We aimed to study whether ML could achieve accurate prognostication of 2-year mortality in a small, high-dimensional database of glioma patients. Methods: We applied three machine learning techniques, artificial neural networks (ANN), decision trees (DT), and support vector machines (SVM), together with classical logistic regression (LR), to a dataset of 76 glioma patients of all grades. We compared the effect of applying the algorithms to the raw database versus a database in which only statistically significant features were included as algorithmic inputs (feature selection). Results: Raw input consisted of 21 variables and achieved performance (accuracy/AUC) of 70.7%/0.70 for ANN, 68%/0.72 for SVM, 66.7%/0.64 for LR, and 65%/0.70 for DT. Feature-selected input consisted of 14 variables and achieved performance of 73.4%/0.75 for ANN, 73.3%/0.74 for SVM, 69.3%/0.73 for LR, and 65.2%/0.63 for DT. Conclusions: We demonstrate that these techniques can also be applied to small yet high-dimensional datasets. Our ML techniques achieved reasonable performance compared with similar studies in the literature. Although local databases may be small compared with larger cancer repositories, ML techniques can still be applied to their analysis, though traditional statistical methods are of similar benefit.
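The results above are reported as accuracy/AUC pairs. As a rough illustration of how these two metrics differ, the following minimal sketch computes both from predicted scores in plain Python. The labels, scores, and the 0.5 threshold are hypothetical values chosen for illustration; they are not data or settings from the study.

```python
def accuracy(y_true, y_score, threshold=0.5):
    """Fraction of cases whose thresholded score matches the true label."""
    preds = [1 if s >= threshold else 0 for s in y_score]
    return sum(p == t for p, t in zip(preds, y_true)) / len(y_true)


def auc(y_true, y_score):
    """AUC as the probability that a random positive case is scored
    higher than a random negative case (ties count as one half)."""
    pos = [s for s, t in zip(y_score, y_true) if t == 1]
    neg = [s for s, t in zip(y_score, y_true) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical labels (1 = died within 2 years) and model scores.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_score = [0.9, 0.7, 0.4, 0.6, 0.3, 0.2, 0.8, 0.5]

acc_value = accuracy(y_true, y_score)
auc_value = auc(y_true, y_score)
print(f"accuracy={acc_value:.3f}, AUC={auc_value:.3f}")
```

Accuracy depends on the chosen threshold, whereas AUC summarizes ranking quality across all thresholds, which is one reason a model can rank well on one metric and poorly on the other.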

Preparedness and Mitigation by projecting the risk against COVID-19 transmission using Machine Learning Techniques

10.1101/2020.04.26.20080655 ◽ 2020

Author(s): Akshay Kumar ◽ Farhan Mohammad Khan ◽ Rajiv Gupta ◽ Harish Puppala

Keyword(s): Machine Learning ◽ Gaussian Process Regression ◽ Machine Learning Techniques ◽ World Health ◽ Support Vector ◽ Learning Tools ◽ Learning Techniques ◽ Health Organization ◽ Respiratory Coronavirus ◽ Criticality Index

Abstract The outbreak of COVID-19 was first identified in China; it later spread to various parts of the globe and was declared a pandemic by the World Health Organization (WHO). This person-to-person transmissible pneumonia, caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and also known as COVID-19, has sparked a global warning. Thermal screening, quarantining, and later lockdown were methods employed by various nations to contain the spread of the virus. Although such containment plans help mitigate the effect of COVID-19, projecting the rise in cases and preparing to face the crisis would help minimize the effect further. In this scenario, this study uses machine learning tools to forecast the possible rise in the number of cases from data on daily new cases. To capture the uncertainty, three different techniques, (i) the decision tree algorithm, (ii) the support vector machine algorithm, and (iii) Gaussian process regression, are used to project the data and capture the possible deviation. Based on the projection of new cases, factors such as recovered cases, deceased cases, medical facilities, population density, number of tests conducted, and service facilities are considered to define the criticality index (CI). The CI is used to classify all the districts of the country into regions of high, moderate, and low risk. An online dashboard was created that updates the data on a daily basis for the next four weeks. The prospective suggestions of this study would aid in planning strategies for applying a lockdown or any other plan in any country, which can use other parameters to define the CI.

A Review of Statistical and Machine Learning Techniques for Microvascular Complications in Type 2 Diabetes

Current Diabetes Reviews ◽ 10.2174/1573399816666200511003357 ◽ 2020 ◽ Vol 16

Author(s): Nitigya Sambyal ◽ Poonam Saini ◽ Rupali Syal

Keyword(s): Machine Learning ◽ Prediction Models ◽ Clinical Medicine ◽ Microvascular Complications ◽ Descriptive Analysis ◽ Machine Learning Techniques ◽ World Health ◽ Public Health Issue ◽ Learning Techniques ◽ Health Organization

Background and Introduction: Diabetes mellitus (DM) is a metabolic disorder that has emerged as a serious public health issue worldwide. According to the World Health Organization (WHO), without interventions, the number of diabetes cases is expected to reach at least 629 million by 2045. Uncontrolled diabetes gradually leads to progressive damage to the eyes, heart, kidneys, blood vessels, and nerves. Method: The paper presents a critical review of existing statistical and Artificial Intelligence (AI) based machine learning techniques with respect to DM complications, namely retinopathy, neuropathy, and nephropathy. The statistical and machine learning analytic techniques are used to structure the subsequent content review. Result: It has been inferred that statistical analysis can help only with inferential and descriptive analysis, whereas AI-based machine learning models can also provide actionable prediction models for faster and more accurate diagnosis of complications associated with DM. Conclusion: The integration of AI-based analytics techniques such as machine learning and deep learning into clinical medicine will result in improved disease management through faster disease detection and reduced treatment costs.

Comparison of Machine Learning algorithm for COVID-19 Death Risk Prediction

10.21203/rs.3.rs-196077/v1 ◽ 2021

Author(s): Praveeen Anandhanathan ◽ Priyanka Gopalan

Keyword(s): Machine Learning ◽ Learning Algorithm ◽ Machine Learning Techniques ◽ Support Vector ◽ Nearest Neighbour ◽ Decision Tree Algorithm ◽ The Past ◽ Random Forest Method ◽ Learning Techniques ◽ The World

Abstract Coronavirus disease (COVID-19) is spreading across the world. Since it first appeared in Wuhan, China, in December 2019, it has become a serious issue across the globe. There are no accurate resources to predict and detect the disease, so knowledge of past patients' records could guide clinicians in fighting the pandemic. Machine learning techniques can therefore be applied to predict health status from symptoms. We analyse only the symptoms that occur in every patient. These predictions can make it easier for clinicians to treat patients. For the prediction of many diseases, techniques such as SVM (Support Vector Machine), Fuzzy k-Means Clustering, the Decision Tree algorithm, the Random Forest method, ANN (Artificial Neural Network), KNN (k-Nearest Neighbour), Naïve Bayes, and the Linear Regression model are already in use. As this disease has not been faced before, it is not known which technique will give the maximum accuracy. We therefore aim to provide an efficient result by comparing all of these algorithms in RStudio.

Identification of Pneumonia Disease Applying an Intelligent Computational Framework Based on Deep Learning and Machine Learning Techniques

Mobile Information Systems ◽ 10.1155/2021/9989237 ◽ 2021 ◽ Vol 2021 ◽ pp. 1-20

Author(s): Yar Muhammad ◽ Mohammad Dahman Alshehri ◽ Wael Mohammed Alenazy ◽ Truong Vinh Hoang ◽ Ryan Alturki

Keyword(s): Machine Learning ◽ Medical Image Analysis ◽ Machine Learning Techniques ◽ World Health ◽ X Ray ◽ Expert Radiologist ◽ Learning Techniques ◽ Effective Manner ◽ The World ◽ Chest X Ray

Pneumonia is a very common and potentially fatal disease that needs to be identified at an early stage in order to prevent further harm to the patient and help save his or her life. Various techniques are used for the diagnosis of pneumonia, including chest X-ray, CT scan, blood culture, sputum culture, fluid sample, bronchoscopy, and pulse oximetry. Medical image analysis plays a vital role in the diagnosis of various diseases such as MERS, COVID-19, and pneumonia, and is considered one of the most promising research areas. Analyzing chest X-ray images accurately requires an expert radiologist with expertise and experience in the relevant domain. According to a World Health Organization (WHO) report, about two-thirds of people in the world still do not have access to a radiologist to diagnose their disease. This study proposes a deep learning (DL) framework to diagnose pneumonia efficiently and effectively. Various Deep Convolutional Neural Network (DCNN) transfer learning techniques, such as AlexNet, SqueezeNet, VGG16, VGG19, and Inception-V3, are utilized for extracting useful features from the chest X-ray images. Several machine learning (ML) classifiers are also utilized. The proposed system has been trained and tested on a dataset of chest X-ray and CT images. To examine the stability and effectiveness of the proposed system, different performance measures have been utilized. The proposed system is intended to support medical doctors in diagnosing pneumonia accurately and efficiently.

The feasibility of creating a checklist for the assessment of the methodological quality both of randomised and non-randomised studies of health care interventions (Preprint)

10.2196/preprints.12559 ◽ 2018

Author(s): Roberto Acuña

Keyword(s): Machine Learning ◽ Suicidal Ideation ◽ Suicide Ideation ◽ Machine Learning Techniques ◽ World Health ◽ Global Public Health ◽ Systematic Analysis ◽ Middle Income ◽ Learning Techniques ◽ Health Organization

BACKGROUND According to the World Health Organization (WHO), close to 800,000 people worldwide die by suicide each year, and many more attempt it. Consequently, the WHO recognizes suicide as a global public health priority, which affects not only rich countries but poor and middle-income countries as well. OBJECTIVE The aim of this study is to evaluate several supervised classifiers for detecting messages with suicidal ideation in order to determine whether these systems can be used in automatic suicide prevention systems. METHODS We used machine learning techniques to make a systematic analysis of 28 supervised classifier algorithms with default parameters. The Life Corpus, used in this research, is a bilingual corpus (English and Spanish) oriented to suicide. The corpus was constructed by two annotation experts, who retrieved texts from several social networks. The corpus quality was measured using mutual annotation agreement. RESULTS The different experiments determined that the classifier with the best performance was KStar, with the corpus version POS-SYNSETS-NUM; the cycle with two classes, Urgent and No Risk, achieved the best results, with a PRC-Area of 0.81036 and an F-measure of 0.7148. CONCLUSIONS The present research fulfilled the objective of discovering which characteristics are the most suitable for the automatic classification of messages with suicidal ideation, using the Life Corpus. The results of this evaluation demonstrate that the Life Corpus and machine learning techniques could be suitable for detecting suicide ideation messages.
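For readers unfamiliar with the F-measure reported above, the following minimal sketch shows how it is derived from precision and recall for the positive class. The counts are hypothetical and chosen only to make the arithmetic easy to follow; they are not taken from the Life Corpus experiments.

```python
def precision_recall_f1(tp, fp, fn):
    """Precision, recall, and their harmonic mean (F1) for one class.

    tp: messages correctly flagged, fp: messages wrongly flagged,
    fn: messages that should have been flagged but were missed.
    """
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical counts for an "Urgent" class (illustration only).
precision, recall, f1 = precision_recall_f1(tp=30, fp=10, fn=20)
print(f"precision={precision:.3f}, recall={recall:.3f}, F1={f1:.3f}")
```

Because F1 is a harmonic mean, it stays low whenever either precision or recall is low, which makes it a stricter summary than plain accuracy when the positive class is rare.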

An Efficient Model for Predicting Student Dropout using Data Mining and Machine Learning Techniques

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽ 10.35940/ijitee.i1155.0789s219 ◽ 2019 ◽ Vol 8 (9S2) ◽ pp. 750-752

Keyword(s): Machine Learning ◽ Similarity Measures ◽ Machine Learning Techniques ◽ World Health ◽ Frequent Patterns ◽ Rule Mining ◽ Student Dropout ◽ Learning Techniques ◽ Using Data ◽ Health Organization

Education is an important resource that has to be given to all children. One of the biggest assets of the future generation can be said to be the education that is given to children. Most children are not able to continue their education for many reasons. The prediction of student dropout plays a very important role in identifying the students who are on the verge of dropping out of their education. By predicting this, we can try to solve their problems and help them continue their education. In this paper, we propose a model for predicting whether students will drop out, using several machine learning techniques. We make use of decision trees, which make a decision using several factors. The choice of prediction is crucial, and several data attributes are used for it, such as correlations, similarity measures, frequent patterns, and association rule mining. The proposed work is evaluated using various parameters and is shown to work efficiently in predicting dropout students compared with the alternatives.

Machine Learning Based Coronary Artery Disease Prediction

Journal of Computational and Theoretical Nanoscience ◽ 10.1166/jctn.2020.9007 ◽ 2020 ◽ Vol 17 (9) ◽ pp. 3999-4002

Author(s): A. C. Bhavani ◽ K. Aditya Shastry ◽ K. Deepika ◽ Nithya N. Shanbag ◽ G. C. Akshatha

Keyword(s): Machine Learning ◽ Coronary Artery ◽ Medical Diagnosis ◽ Performance Metrics ◽ Naïve Bayes ◽ Machine Learning Techniques ◽ World Health ◽ Support Vector ◽ Health Organization

The World Health Organization (WHO) has estimated that around 12 million people across the globe die each year from cardiovascular diseases. The dangers associated with cardiovascular disease can be identified effectively using machine learning techniques. According to one survey, around 30% of patients experience no symptoms during a heart attack, yet the bloodstream contains distinctive indications of the attack for days. The medical diagnosis of a patient remains a complex task due to several factors. Accurate diagnosis of a patient's heart disease is critical, as it can help save millions of human lives. In this regard, the automation of medical diagnosis is significant. The goal of this work is to develop a system that predicts coronary artery disease in a patient with high accuracy using machine learning (ML) techniques. Several algorithms, namely Naïve Bayes (NB), Support Vector Machine (SVM), and Decision Tree (DT) classifiers, were implemented for predicting the disease. Extensive experiments demonstrated that Naïve Bayes outperformed DT and SVM with regard to the accuracy, precision, F-measure, recall, and receiver operating characteristic (ROC) performance metrics.

Alzheimer's Disease Early Detection Using Machine Learning Techniques

10.21203/rs.3.rs-624520/v1 ◽ 2021

Author(s): Roobaea Alroobaea ◽ Seifeddine Mechti ◽ Mariem Haoues ◽ Saeed Rubaiee ◽ Anas Ahmed ◽ ...

Keyword(s): Machine Learning ◽ Alzheimer's Disease ◽ Support Vector Machine ◽ Logistic Regression ◽ Random Forest ◽ Machine Learning Techniques ◽ Support Vector ◽ Disease Detection ◽ Learning Techniques

Abstract Alzheimer's disease is the main cause of dementia, which frequently affects older adults. The disease is costly, especially in terms of treatment. In addition, Alzheimer's is one of the causes of death among older adults. Early detection of Alzheimer's helps medical staff diagnose the disease, which can decrease the risk of death. This has made early detection of Alzheimer's disease a crucial problem in the healthcare industry. The objective of this research study is to introduce a computer-aided diagnosis system for Alzheimer's disease detection using machine learning techniques. We employed data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) and the Open Access Series of Imaging Studies (OASIS) brain datasets. Common supervised machine learning techniques were applied for automatic Alzheimer's disease detection, such as logistic regression, support vector machine, random forest, and linear discriminant analysis. The best accuracy values on the ADNI dataset were 99.43% and 99.10%, obtained by logistic regression and support vector machine, respectively; on the OASIS dataset, the best values were 84.33% and 83.92%, obtained by logistic regression and random forest, respectively.

Discrimination of SARS-Cov 2 and arboviruses (DENV, ZIKV and CHIKV) clinical features using machine learning techniques: a fast and inexpensive clinical screening for countries simultaneously affected by both diseases

10.1101/2021.01.28.21250714 ◽ 2021

Author(s): João Daniel S. Castro

Keyword(s): Machine Learning ◽ Support Vector Machine ◽ Learning Algorithm ◽ Machine Learning Techniques ◽ Support Vector ◽ Machine Learning Algorithm ◽ Clinical Screening ◽ Learning Techniques ◽ The World ◽ Area Under Roc Curve

Abstract SARS-CoV-2 (Covid-19) has spread rapidly throughout the world, especially in tropical countries already affected by outbreaks of arboviruses such as Dengue, Zika, and Chikungunya, and may lead to a collapse of health systems in these locations. The present work therefore aims to develop a methodology that uses a machine learning algorithm (Support Vector Machine) for the prediction and discrimination of patients affected by Covid-19 and arboviruses (DENV, ZIKV, and CHIKV). Clinical data from 204 patients, covering both Covid-19 and arbovirus cases and obtained from 23 scientific articles and 1 dataset, were used. The developed model was able to predict 93.1% of Covid-19 cases and 82.1% of arbovirus cases, with an accuracy of 89.1% and an area under the ROC curve of 95.6%, proving effective for the prediction and possible screening of these patients, especially those affected by Covid-19, and allowing early isolation.
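The abstract above reports separate detection rates for each disease group alongside a single overall accuracy. The following minimal sketch illustrates how per-class rates and overall accuracy relate, using plain Python. The labels and counts are hypothetical and are not the study's data.

```python
from collections import Counter


def per_class_recall(pairs):
    """Share of each true class that was predicted correctly.

    pairs: list of (true_label, predicted_label) tuples.
    """
    totals = Counter(true for true, _ in pairs)
    hits = Counter(true for true, pred in pairs if true == pred)
    return {label: hits[label] / totals[label] for label in totals}


def overall_accuracy(pairs):
    """Share of all cases predicted correctly, regardless of class."""
    return sum(true == pred for true, pred in pairs) / len(pairs)


# Hypothetical predictions: 100 Covid-19 cases and 50 arbovirus cases.
pairs = ([("covid", "covid")] * 90 + [("covid", "arbo")] * 10
         + [("arbo", "arbo")] * 40 + [("arbo", "covid")] * 10)

recalls = per_class_recall(pairs)
acc = overall_accuracy(pairs)
print(recalls, round(acc, 4))
```

Overall accuracy is the class-size-weighted average of the per-class rates, so it sits closer to the rate of the larger group; this is why both figures are worth reporting when groups are unbalanced.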

Cardiac Disease Prediction using Supervised Machine Learning Techniques.

Journal of Physics Conference Series ◽ 10.1088/1742-6596/2161/1/012013 ◽ 2022 ◽ Vol 2161 (1) ◽ pp. 012013

Author(s): Chiradeep Gupta ◽ Athina Saha ◽ N V Subba Reddy ◽ U Dinesh Acharya

Keyword(s): Machine Learning ◽ Logistic Regression ◽ Cardiac Disease ◽ Performance Metrics ◽ Confusion Matrix ◽ Supervised Machine Learning ◽ Machine Learning Techniques ◽ Support Vector ◽ Ensemble Techniques ◽ Learning Techniques

Abstract Diagnosis of cardiac disease needs to be accurate, precise, and reliable. The number of deaths due to cardiac attacks is increasing exponentially day by day. Practical approaches for earlier diagnosis of cardiac or heart disease are therefore pursued to achieve prompt management of the disease. Various supervised machine learning techniques, such as K-Nearest Neighbour, Decision Tree, Logistic Regression, Naïve Bayes, and the Support Vector Machine (SVM) model, are used for predicting cardiac disease using a dataset collected from the repository of the University of California, Irvine (UCI). The results show that Logistic Regression was better than all other supervised classifiers in terms of the performance metrics. The model is also less risky, since its number of false negatives is low compared with the other models according to their confusion matrices. In addition, ensemble techniques can be used to improve the accuracy of the classifier. Jupyter Notebook is the best tool for the Python implementation, as it supports many libraries and header files for accurate and precise work.
