Use of Machine Learning to Investigate the Quantitative Checklist for Autism in Toddlers (Q-CHAT) towards Early Autism Screening

In the past two decades, several screening instruments were developed to detect toddlers who may be autistic both in clinical and unselected samples. Among others, the Quantitative CHecklist for Autism in Toddlers (Q-CHAT) is a quantitative and normally distributed measure of autistic traits that demonstrates good psychometric properties in different settings and cultures. Recently, machine learning (ML) has been applied to behavioral science to improve the classification performance of autism screening and diagnostic tools, but mainly in children, adolescents, and adults. In this study, we used ML to investigate the accuracy and reliability of the Q-CHAT in discriminating young autistic children from those without. Five different ML algorithms (random forest (RF), naïve Bayes (NB), support vector machine (SVM), logistic regression (LR), and K-nearest neighbors (KNN)) were applied to investigate the complete set of Q-CHAT items. Our results showed that ML achieved an overall accuracy of 90%, and the SVM was the most effective, being able to classify autism with 95% accuracy. Furthermore, using the SVM–recursive feature elimination (RFE) approach, we selected a subset of 14 items ensuring 91% accuracy, while 83% accuracy was obtained from the 3 best discriminating items in common to ours and the previously reported Q-CHAT-10. This evidence confirms the high performance and cross-cultural validity of the Q-CHAT, and supports the application of ML to create shorter and faster versions of the instrument, maintaining high classification accuracy, to be used as a quick, easy, and high-performance tool in primary-care settings.

Download Full-text

Use of Machine Learning to Investigate The Quantitative Checklist For Autism in Toddlers (Q-CHAT) Towards Early Autism Screening

10.21203/rs.3.rs-40334/v1 ◽

2020 ◽

Author(s):

Gennaro Tartarisco ◽

Giovanni Cicceri ◽

Davide Di Pietro ◽

Stefania Aiello ◽

Elisa Leonardi ◽

...

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

High Performance ◽

Autistic Traits ◽

Classification Performance ◽

Recursive Feature Elimination ◽

Diagnostic Tools ◽

Support Vector ◽

Behavioural Science ◽

Autism Screening

Abstract Background: In the past two decades, several screening instruments have been developed to detect toddlers who may be autistic, both in clinical and unselected samples. Among others, the Quantitative CHecklist for Autism in Toddlers (Q−CHAT) is a quantitative and normally distributed measure of autistic traits which demonstrated good psychometric properties in different settings and cultures. Recently machine learning (ML) has been applied to behavioural science to improve classification performance of autism screening and diagnostic tools, but mainly in children, adolescents and adults. Methods: In this study, we used machine learning (ML) to investigate the accuracy and reliability of the Q−CHAT in discriminating young autistic children from those without. Three different ML algorithms (Random Forest, Naive Bayes and Support Vector Machine) were applied to investigate the complete set of Q-CHAT items and the best predictive items. Results: Our results showed that the three selected models outperformed the classical statistical methods of predictive validity and among the three ML classifiers, the Support Vector Machine was the most effective, being able to classify autism with 95% accuracy. Furthermore, using the Support Vector Machine-Recursive Feature Elimination approach we were able to select a subset of 14 items ensuring an accuracy of 93%, while an accuracy of 83% was obtained from the best 3 discriminating items in common to our and the previous reported Q-CHAT-10. Limitations: Further data collection is needed.Conclusions: This evidence confirms the high performance and cross-cultural validity of the Q-CHAT and supports the application of ML to create shorter and faster versions of the instrument maintaining high classification accuracy, to be used as a quick, easy and high-performance tool in primary care settings.

Download Full-text

Quantitative Assessment of Ataxia in Multiple Sclerosis Patients using Spatiotemporal Parameters: A Relief-Based Machine Learning Analysis

Physikalische Medizin Rehabilitationsmedizin Kurortmedizin ◽

10.1055/a-1512-4858 ◽

2021 ◽

Author(s):

Furkan Bilek ◽

Ferhat Balgetir ◽

Caner Feyzi Demir ◽

Gökhan Alkan ◽

Seda Arslan-Tuncer

Keyword(s):

Machine Learning ◽

Multiple Sclerosis ◽

Sensitivity And Specificity ◽

Classification Performance ◽

Support Vector ◽

Learning Methods ◽

K Nearest Neighbors ◽

Machine Learning Methods ◽

Spatiotemporal Parameters ◽

Multiple Sclerosis Patients

Abstract Background and Objective Multiple sclerosis (MS) is a chronic, progressive, and autoimmune disease of the central nervous system (CNS) characterized by inflammation, demyelination, and axonal injury. In patients with newly diagnosed MS (ndMS), ataxia can present either as mild or severe and can be difficult to diagnose in the absence of clinical disability. Such difficulties can be eliminated by using decision support systems supported by machine learning methods. The present study aimed to achieve early diagnosis of ataxia in ndMS patients by using machine learning methods with spatiotemporal parameters. Materials and Methods The prospective study included 32 ndMS patients with an Expanded Disability Status Scale (EDSS) score of≤2.0 and 32 healthy volunteers. A total of 14 parameters were elicited by using a Win-Track platform. The ndMS patients were differentiated from healthy individuals using multiple classifiers including Artificial Neural Network (ANN), Support Vector Machine (SVM), the k-nearest neighbors (K-NN) algorithm, and Decision Tree Learning (DTL). To improve the performance of the classification, a Relief-based feature selection algorithm was applied to select the subset that best represented the whole dataset. Performance evaluation was achieved based on several criteria such as Accuracy (ACC), Sensitivity (SN), Specificity (SP), and Precision (PREC). Results ANN had a higher classification performance compared to other classifiers, whereby it provided an accuracy, sensitivity, and specificity of 89, 87.8, 90.3% with the use of all parameters and provided the values of 93.7, 96.6%, and 91.1% with the use of parameters selected by the Relief algorithm, respectively. Significance To our knowledge, this is the first study of its kind in the literature to investigate the diagnosis of ataxia in ndMS patients by using machine learning methods with spatiotemporal parameters. The proposed method, i. e. Relief-based ANN method, successfully diagnosed ataxia by using a lower number of parameters compared to the numbers of parameters reported in clinical studies, thereby reducing the costs and increasing the performance of the diagnosis. The method also provided higher rates of accuracy, sensitivity, and specificity in the diagnosis of ataxia in ndMS patients compared to other methods. Taken together, these findings indicate that the proposed method could be helpful in the diagnosis of ataxia in minimally impaired ndMS patients and could be a pathfinder for future studies.

Download Full-text

MLoF: Machine Learning Accelerators for the Low-Cost FPGA Platforms

Applied Sciences ◽

10.3390/app12010089 ◽

2021 ◽

Vol 12 (1) ◽

pp. 89

Author(s):

Ruiqi Chen ◽

Tianyu Wu ◽

Yuchen Zheng ◽

Ming Ling

Keyword(s):

Machine Learning ◽

High Performance ◽

Low Cost ◽

Raspberry Pi ◽

Future Research ◽

Support Vector ◽

K Nearest Neighbors ◽

Field Programmable ◽

Ip Cores ◽

Cost Efficient

In Internet of Things (IoT) scenarios, it is challenging to deploy Machine Learning (ML) algorithms on low-cost Field Programmable Gate Arrays (FPGAs) in a real-time, cost-efficient, and high-performance way. This paper introduces Machine Learning on FPGA (MLoF), a series of ML IP cores implemented on the low-cost FPGA platforms, aiming at helping more IoT developers to achieve comprehensive performance in various tasks. With Verilog, we deploy and accelerate Artificial Neural Networks (ANNs), Decision Trees (DTs), K-Nearest Neighbors (k-NNs), and Support Vector Machines (SVMs) on 10 different FPGA development boards from seven producers. Additionally, we analyze and evaluate our design with six datasets, and compare the best-performing FPGAs with traditional SoC-based systems including NVIDIA Jetson Nano, Raspberry Pi 3B+, and STM32L476 Nucle. The results show that Lattice’s ICE40UP5 achieves the best overall performance with low power consumption, on which MLoF averagely reduces power by 891% and increases performance by 9 times. Moreover, its cost, power, Latency Production (CPLP) outperforms SoC-based systems by 25 times, which demonstrates the significance of MLoF in endpoint deployment of ML algorithms. Furthermore, we make all of the code open-source in order to promote future research.

Download Full-text

Diabetes Prediction Using Machine Learning Techniques

Journal of Intelligent Systems with Applications ◽

10.54856/10.54856/jiswa.202112183 ◽

2021 ◽

pp. 150-152

Author(s):

Seyma Kiziltas Koc ◽

Mustafa Yeniad

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

High Performance ◽

Nearest Neighbor ◽

Classification Performance ◽

Machine Learning Techniques ◽

Support Vector ◽

Classification Algorithms ◽

K Nearest Neighbor ◽

Machine Learning Classification

Technologies which are used in the healthcare industry are changing rapidly because the technology is evolving to improve people's lifestyles constantly. For instance, different technological devices are used for the diagnosis and treatment of diseases. It has been revealed that diagnosis of disease can be made by computer systems with developing technology.Machine learning algorithms are frequently used tools because of their high performance in the field of health as well as many field. The aim of this study is to investigate different machine learning classification algorithms that can be used in the diagnosis of diabetes and to make comparative analyzes according to the metrics in the literature. In the study, seven classification algorithms were used in the literature. These algorithms are Logistic Regression, K-Nearest Neighbor, Multilayer Perceptron, Random Forest, Decision Trees, Support Vector Machine and Naive Bayes. Firstly, classification performance of algorithms are compared. These comparisons are based on accuracy, sensitivity, precision, and F1-score. The results obtained showed that support vector machine algorithm had the highest accuracy with 78.65%.

Download Full-text

Predicting 30-Day Mortality after an Acute Coronary Syndrome (ACS) using Machine Learning Methods for Feature Selection, Classification and Visualisation

Sains Malaysiana ◽

10.17576/jsm-2021-5003-17 ◽

2021 ◽

Vol 50 (3) ◽

pp. 753-768

Author(s):

NANYONGA AZIIDA ◽

SORAYYA MALEK ◽

FIRDAUS AZIZ ◽

KHAIRUL SHAFIQ IBRAHIM ◽

SAZZLI KASIM

Keyword(s):

Machine Learning ◽

Cardiovascular Disease ◽

Feature Selection ◽

High Performance ◽

Feature Selection Method ◽

Mortality Prediction ◽

Recursive Feature Elimination ◽

Support Vector ◽

Related Factors ◽

Coronary Syndrome

Hybrid combinations of feature selection, classification and visualisation using machine learning (ML) methods have the potential for enhanced understanding and 30-day mortality prediction of patients with cardiovascular disease using population-specific data. Identifying a feature selection method with a classifier algorithm that produces high performance in mortality studies is essential and has not been reported before. Feature selection methods such as Boruta, Random Forest (RF), Elastic Net (EN), Recursive Feature Elimination (RFE), learning vector quantization (LVQ), Genetic Algorithm (GA), Cluster Dendrogram (CD), Support Vector Machine (SVM) and Logistic Regression (LR) were combined with RF, SVM, LR, and EN classifiers for 30-day mortality prediction. ML models were constructed using 302 patients and 54 input variables from the Malaysian National Cardiovascular Disease Database. Validation of the best ML model was performed against Thrombolysis in Myocardial Infarction (TIMI) using an additional dataset of 102 patients. The Self-Organising Feature Map (SOM) was used to visualise mortality-related factors post-ACS. The performance of MLmodels using the area under the curve (AUC) ranged from 0.48 to 0.80. The best-performing model (AUC = 0.80) was a hybrid combination of the RF variable importance method, the sequential backward selection and the RF classifier using five predictors (age, triglyceride, creatinine, troponin, and total cholesterol). Comparison with TIMI using an additional dataset resulted in the best ML model outperforming the TIMI score (AUC = 0.75 vs. AUC = 0.60). The findings of this study will provide a basis for developing an online ML-based population-specific risk scoring calculator.

Download Full-text

Machine Learning-based Web Application for Early Diagnosis of Diabetes

Journal of Applied and Emerging Sciences ◽

10.36785/buitems.jaes.418 ◽

2020 ◽

pp. 133-142

Keyword(s):

Machine Learning ◽

Early Diagnosis ◽

Web Application ◽

Metabolic Diseases ◽

Early Stage ◽

Classification Performance ◽

Support Vector ◽

Common Disease ◽

Diabetes Diagnosis ◽

K Nearest Neighbors

Diabetes has become a chronic disease that seriously threatens human health. It is a group of metabolic diseases characterized by hyperglycemia and there is no role of the age factor involved. The long-term of diabetes disease causes chronic damage and dysfunction of various tissues, especially the eyes, kidneys, heart, blood vessels, and nerves. Most of the time people are not sure about this common disease at the early stage and unluckily the patient moves to a critical situation to meet with major disease due to the continuous effect of diabetes. This research is conducted to build the machine learning-based web application platform for the early diagnosis of the disease, freely accessible anywhere anytime. We used the benchmark dataset named PIDD (Prima Indian Diabetes Dataset) and performed the comparative analysis among the Naïve Bayes, Logistic Regression, K-Nearest Neighbors, Decision Trees, Random Forest and Support Vector Machines. Based on the classification performance, we found that SVM performed the best among the pool of mentioned algorithms and, therefore, adopted for the development of the intelligent web application for the diabetes diagnosis.

Download Full-text

Machine learning-based prediction of motor status in glioma patients using diffusion MRI metrics along the corticospinal tract

10.1101/2021.08.24.21262484 ◽

2021 ◽

Author(s):

Boshra Shams ◽

Ziqian Wang ◽

Timo Roine ◽

Baran Aydogan ◽

Peter Vajkoczy ◽

...

Keyword(s):

Machine Learning ◽

Diffusion Mri ◽

High Performance ◽

Clinical Utility ◽

Tumor Location ◽

Recursive Feature Elimination ◽

Support Vector ◽

Who Grade ◽

Corticospinal Tracts ◽

Resting Motor Threshold

AbstractAlong tract statistics enables white matter characterization using various diffusion MRI (dMRI) metrics. Here, we applied a machine learning (ML) method to assess the clinical utility of dMRI metrics along corticospinal tracts (CST), investigating whether motor glioma patients can be classified with respect to their motor status. The ML-based analysis included developing models based on support vector machine (SVM) using histogram-based measures of dMRI-based tract profiles (e.g., mean, standard deviation, kurtosis and skewness), following a recursive feature elimination (RFE) method based on SVM (SVM-RFE). Our model achieved high performance (74% sensitivity, 75% specificity, 74% overall accuracy and 77% AUC). Incorporating the patients’ demographics and clinical features such as age, tumor WHO grade, tumor location, gender and resting motor threshold (RMT) into our designed models demonstrated that these features were not as effective as microstructural measures. The results revealed that ADC, FA and RD contributed more than other features to the model.

Download Full-text

Diabetes Prediction Using Machine Learning Techniques

Journal of Intelligent Systems with Applications ◽

10.54856/jiswa.202112183 ◽

2021 ◽

pp. 150-152

Author(s):

Seyma Kiziltas Koc ◽

Mustafa Yeniad

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

High Performance ◽

Nearest Neighbor ◽

Classification Performance ◽

Machine Learning Techniques ◽

Support Vector ◽

Classification Algorithms ◽

K Nearest Neighbor ◽

Machine Learning Classification

Download Full-text

Binary Spectrum Feature for Improved Classiﬁer Performance

10.36227/techrxiv.12993122 ◽

2020 ◽

Author(s):

Nalika Ulapane ◽

Karthick Thiyagarajan ◽

sarath kodagoda

Keyword(s):

Machine Learning ◽

Classification Performance ◽

Feature Reduction ◽

Sensor Data ◽

Machine Learning Techniques ◽

Support Vector ◽

Svm Classifier ◽

Monitoring Task ◽

Classifier Performance ◽

Spectrum Feature

<div>Classiﬁcation has become a vital task in modern machine learning and Artiﬁcial Intelligence applications, including smart sensing. Numerous machine learning techniques are available to perform classiﬁcation. Similarly, numerous practices, such as feature selection (i.e., selection of a subset of descriptor variables that optimally describe the output), are available to improve classiﬁer performance. In this paper, we consider the case of a given supervised learning classiﬁcation task that has to be performed making use of continuous-valued features. It is assumed that an optimal subset of features has already been selected. Therefore, no further feature reduction, or feature addition, is to be carried out. Then, we attempt to improve the classiﬁcation performance by passing the given feature set through a transformation that produces a new feature set which we have named the “Binary Spectrum”. Via a case study example done on some Pulsed Eddy Current sensor data captured from an infrastructure monitoring task, we demonstrate how the classiﬁcation accuracy of a Support Vector Machine (SVM) classiﬁer increases through the use of this Binary Spectrum feature, indicating the feature transformation’s potential for broader usage.</div><div><br></div>

Download Full-text

Phybrata Sensors and Machine Learning for Enhanced Neurophysiological Diagnosis and Treatment

Sensors ◽

10.3390/s21217417 ◽

2021 ◽

Vol 21 (21) ◽

pp. 7417

Author(s):

Alex J. Hope ◽

Utkarsh Vashisth ◽

Matthew J. Parker ◽

Andreas B. Ralston ◽

Joshua M. Roper ◽

...

Keyword(s):

Machine Learning ◽

Time Series ◽

Random Forest ◽

Binary Classification ◽

Classification Performance ◽

Support Vector ◽

Use Case ◽

Signal Features ◽

Test Population

Concussion injuries remain a significant public health challenge. A significant unmet clinical need remains for tools that allow related physiological impairments and longer-term health risks to be identified earlier, better quantified, and more easily monitored over time. We address this challenge by combining a head-mounted wearable inertial motion unit (IMU)-based physiological vibration acceleration (“phybrata”) sensor and several candidate machine learning (ML) models. The performance of this solution is assessed for both binary classification of concussion patients and multiclass predictions of specific concussion-related neurophysiological impairments. Results are compared with previously reported approaches to ML-based concussion diagnostics. Using phybrata data from a previously reported concussion study population, four different machine learning models (Support Vector Machine, Random Forest Classifier, Extreme Gradient Boost, and Convolutional Neural Network) are first investigated for binary classification of the test population as healthy vs. concussion (Use Case 1). Results are compared for two different data preprocessing pipelines, Time-Series Averaging (TSA) and Non-Time-Series Feature Extraction (NTS). Next, the three best-performing NTS models are compared in terms of their multiclass prediction performance for specific concussion-related impairments: vestibular, neurological, both (Use Case 2). For Use Case 1, the NTS model approach outperformed the TSA approach, with the two best algorithms achieving an F1 score of 0.94. For Use Case 2, the NTS Random Forest model achieved the best performance in the testing set, with an F1 score of 0.90, and identified a wider range of relevant phybrata signal features that contributed to impairment classification compared with manual feature inspection and statistical data analysis. The overall classification performance achieved in the present work exceeds previously reported approaches to ML-based concussion diagnostics using other data sources and ML models. This study also demonstrates the first combination of a wearable IMU-based sensor and ML model that enables both binary classification of concussion patients and multiclass predictions of specific concussion-related neurophysiological impairments.

Download Full-text