scholarly journals Machine Learning Models for Diagnosis of Cushing’s Syndrome Using Retrospective Data

Author(s):  
Senol Isci ◽  
Derya Sema Yaman Kalender ◽  
Firat Bayraktar ◽  
Alper Yaman

ABSTRACTAccurate classification of Cushing’s Syndrome (CS) plays a critical role in providing early and correct diagnosis of CS that may facilitate treatment and improve patient outcomes. Diagnosis of CS is a complex process, which requires careful and concurrent interpretation of signs and symptoms, multiple biochemical test results, and findings of medical imaging by physicians with a high degree of specialty and knowledge to make correct judgments. In this article, we explore the state of the art machine learning algorithms to demonstrate their potential as a clinical decision support system to analyze and classify CS in order to facilitate the diagnosis, prognosis, and treatment of CS. Prominent algorithms are compared using nested cross-validation and various class comparison strategies including multiclass, one vs. all, and one vs. one binary classification. Our findings show that Random Forest (RF) algorithm is most suitable for the classification of CS. We demonstrate that the proposed approach can classify CS subjects with an average accuracy of 92% and an average F1 score of 91.5%, depending on the class comparison strategy and selected features. RF-based one vs. all binary classification model achieves sensitivity of 97.6%, precision of 91.1%, and specificity of 87.1% to discriminate CS from non-CS on the test dataset. RF-based multiclass classification model achieves average per class sensitivity of 91.8%, average per class specificity of 97.1%, and average per class precision of 92.1% to classify different subtypes of CS on the test dataset. Clinical performance evaluation suggests that the developed models can help improve physician’s judgment in diagnosing CS.

2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Imogen Schofield ◽  
David C. Brodbelt ◽  
Noel Kennedy ◽  
Stijn J. M. Niessen ◽  
David B. Church ◽  
...  

AbstractCushing’s syndrome is an endocrine disease in dogs that negatively impacts upon the quality-of-life of affected animals. Cushing’s syndrome can be a challenging diagnosis to confirm, therefore new methods to aid diagnosis are warranted. Four machine-learning algorithms were applied to predict a future diagnosis of Cushing's syndrome, using structured clinical data from the VetCompass programme in the UK. Dogs suspected of having Cushing's syndrome were included in the analysis and classified based on their final reported diagnosis within their clinical records. Demographic and clinical features available at the point of first suspicion by the attending veterinarian were included within the models. The machine-learning methods were able to classify the recorded Cushing’s syndrome diagnoses, with good predictive performance. The LASSO penalised regression model indicated the best overall performance when applied to the test set with an AUROC = 0.85 (95% CI 0.80–0.89), sensitivity = 0.71, specificity = 0.82, PPV = 0.75 and NPV = 0.78. The findings of our study indicate that machine-learning methods could predict the future diagnosis of a practicing veterinarian. New approaches using these methods could support clinical decision-making and contribute to improved diagnosis of Cushing’s syndrome in dogs.


Author(s):  
Z. Neili ◽  
M. Fezari ◽  
A. Redjati

The acquisition of Breath sounds (BS) signals from a human respiratory system with an electronic stethoscope, provide and offer prominent information which helps the doctors to diagnosis and classification of pulmonary diseases. Unfortunately, this BS signals with other biological signals have a non-stationary nature according to the variation of the lung volume, and this nature makes it difficult to analyze and classify between several diseases. In this study, we were focused on comparing the ability of the extreme learning machine (ELM) and k-nearest neighbour (K-nn) machine learning algorithms in the classification of adventitious and normal breath sounds. To do so, the empirical mode decomposition (EMD) was used in this work to analyze BS, this method is rarely used in the breath sounds analysis. After the EMD decomposition of the signals into Intrinsic Mode Functions (IMFs), the Hjorth descriptors (Activity) and Permutation Entropy (PE) features were extracted from each IMFs and combined for classification stage. The study has found that the combination of features (activity and PE) yielded an accuracy of 90.71%, 95% using ELM and K-nn respectively in binary classification (normal and abnormal breath sounds), and 83.57%, 86.42% in multiclass classification (five classes).


2021 ◽  
Vol 12 ◽  
Author(s):  
Shengqi Yang ◽  
Ran Li ◽  
Jiliang Chen ◽  
Zhen Li ◽  
Zhangqin Huang ◽  
...  

Ca2+ sparks are the elementary Ca2+ release events in cardiomyocytes, altered properties of which lead to impaired Ca2+ handling and finally contribute to cardiac pathology under various diseases. Despite increasing use of machine-learning algorithms in deciphering the content of biological and medical data, Ca2+ spark images and data are yet to be deeply learnt and analyzed. In the present study, we developed a deep residual convolutional neural network method to detect Ca2+ sparks. Compared to traditional detection methods with arbitrarily defined thresholds to distinguish signals from noises, our new method detected more Ca2+ sparks with lower amplitudes but similar spatiotemporal distributions, thereby indicating that our new algorithm detected many very weak events that are usually omitted when using traditional detection methods. Furthermore, we proposed an event-based logistic regression and binary classification model to classify single cardiomyocytes using Ca2+ spark characteristics, which to date have generally been used only for simple statistical analyses and comparison between normal and diseased groups. Using this new detection algorithm and classification model, we succeeded in distinguishing wild type (WT) vs RyR2-R2474S± cardiomyocytes with 100% accuracy, and vehicle vs isoprenaline-insulted WT cardiomyocytes with 95.6% accuracy. The model can be extended to judge whether a small number of cardiomyocytes (and so the whole heart) are under a specific cardiac disease. Thus, this study provides a novel and powerful approach for the research and application of calcium signaling in cardiac diseases.


2019 ◽  
Vol 2 ◽  
pp. 1-8
Author(s):  
Lukas Gokl ◽  
Marvin Mc Cutchan ◽  
Bartosz Mazurkiewicz ◽  
Paolo Fogliaroni ◽  
Ioannis Giannopoulos

Abstract. Location Based Services (LBS) are definitely very helpful for people that interact within an unfamiliar environment, but also for those that already possess a certain level of familiarity with it. In order to avoid overwhelming familiar users with unnecessary information, the level of details offered by the LBS shall be adapted to the level of familiarity with the environment: providing more details to unfamiliar users and a lighter amount of information (that would be superfluous, if not even misleading) to the users that are more familiar with the current environment. Currently, the information exchange between the service and its users is not taking into account familiarity. Within this work, we investigate the potential of machine learning for a binary classification of environment familiarity (i.e., familiar vs unfamiliar) with the surrounding environment. For this purpose, a 3D virtual environment based on a part of Vienna, Austria was designed using datasets from the municipal government. During a navigation experiment with 22 participants we collected ground truth data in order to train four machine learning algorithms. The captured data included motion and orientation of the users as well as visual interaction with the surrounding buildings during navigation. This work demonstrates the potential of machine learning for predicting the state of familiarity as an enabling step for the implementation of LBS better tailored to the user.


2021 ◽  
Vol 12 (3) ◽  
pp. 1550-1556
Author(s):  
Ravi Kumar Y B Et.al

The current research work encompasses the assessment of similarity based facial features of images with erected method so as to determines the genealogical similarity. It is based on the principle of grouping the closer features, as compared to those which are away from the predefined threshold for a better ascertainment of the extracted features. The system developed is trained using deep learning-oriented architecture incorporating these closer features for a binary classification of the subjects considered into genealogic non-genealogic. The genealogic set of data is further used to calculate the percentage of similarity with erected methods. The present work considered XX datasets from XXXX source for the assessment of facial similarities. The results portrayed an accuracy of 96.3% for genealogic data, the salient among them being those of father-daughter (98.1%), father-son(98.3%), mother-daughter(96.6%), mother-son(96.1%) genealogy in case of the datasets from “kinface W-I”. Extending this work onto “kinface W-II” set of data, the results were promising with father-daughter(98.5%), father-son(96.7%), mother-daughter(93.4%) and mother-son(98.9%) genealogy. Such an approach could be further extended to larger database so as to assess the genealogical similarity with the aid of machine-learning algorithms.


2019 ◽  
Vol 143 (8) ◽  
pp. 990-998 ◽  
Author(s):  
Min Yu ◽  
Lindsay A. L. Bazydlo ◽  
David E. Bruns ◽  
James H. Harrison

Context.— Turnaround time and productivity of clinical mass spectrometric (MS) testing are hampered by time-consuming manual review of the analytical quality of MS data before release of patient results. Objective.— To determine whether a classification model created by using standard machine learning algorithms can verify analytically acceptable MS results and thereby reduce manual review requirements. Design.— We obtained retrospective data from gas chromatography–MS analyses of 11-nor-9-carboxy-delta-9-tetrahydrocannabinol (THC-COOH) in 1267 urine samples. The data for each sample had been labeled previously as either analytically unacceptable or acceptable by manual review. The dataset was randomly split into training and test sets (848 and 419 samples, respectively), maintaining equal proportions of acceptable (90%) and unacceptable (10%) results in each set. We used stratified 10-fold cross-validation in assessing the abilities of 6 supervised machine learning algorithms to distinguish unacceptable from acceptable assay results in the training dataset. The classifier with the highest recall was used to build a final model, and its performance was evaluated against the test dataset. Results.— In comparison testing of the 6 classifiers, a model based on the Support Vector Machines algorithm yielded the highest recall and acceptable precision. After optimization, this model correctly identified all unacceptable results in the test dataset (100% recall) with a precision of 81%. Conclusions.— Automated data review identified all analytically unacceptable assays in the test dataset, while reducing the manual review requirement by about 87%. This automation strategy can focus manual review only on assays likely to be problematic, allowing improved throughput and turnaround time without reducing quality.


Author(s):  
Isak Taksa ◽  
Sarah Zelikovitz ◽  
Amanda Spink

Background knowledge has been actively investigated as a possible means to improve performance of machine learning algorithms. Research has shown that background knowledge plays an especially critical role in three atypical text categorization tasks: short-text classification, limited labeled data, and non-topical classification. This chapter explores the use of machine learning for non-hierarchical classification of search queries, and presents an approach to background knowledge discovery by using information retrieval techniques. Two different sets of background knowledge that were obtained from the World Wide Web, one in 2006 and one in 2009, are used with the proposed approach to classify a commercial corpus of web query data by the age of the user. In the process, various classification scenarios are generated and executed, providing insight into choice, significance and range of tuning parameters, and exploring impact of the dynamic web on classification results.


Sign in / Sign up

Export Citation Format

Share Document