Immune-inflammatory, coagulation, adhesion, and imaging biomarkers combined in machine learning models improve the prediction of death 1 year after ischemic stroke

Author(s):  
Ana Lucia Cruz Fürstenberger Lehmann ◽  
Daniela Frizon Alfieri ◽  
Maria Caroline Martins de Araújo ◽  
Emanuelle Roberto Trevisani ◽  
Maisa Rocha Nagao ◽  
...  
Stroke ◽  
2017 ◽  
Vol 48 (suppl_1) ◽  
Author(s):  
Charles Esenwa ◽  
Jorge Luna ◽  
Benjamin Kummer ◽  
Hojjat Salmasian ◽  
David Vawdrey ◽  
...  

Introduction: Retrospective identification of patients hospitalized with new diagnosis of acute ischemic stroke is important for administrative quality assurance, post-discharge clinical management, and stroke research. The benefit of using administrative claims data is its widespread availability, but the disadvantage is in the inability to accurately and consistently identify the clinical diagnosis of interest. Hypothesis: We hypothesized that decision tree and logistic regression models could be applied to administrative claims data coded using International Classification of Diseases, version 10 (ICD-10) to create algorithms that could accurately identify patients with acute ischemic stroke. Methods: We used hospital records from our institution to develop a gold standard list of 243 patients, continuously hospitalized with a new diagnosis of stroke from 10/1/2015 to 3/31/2016. We used 1,393 neurological patients without a diagnosis of stroke as negative controls. This list was used to train and test two machine learning methods of diagnosis and procedure codes analysis, for the purpose of ischemic stroke identification: one using classification and regression tree (CART) and another using regularized logistic regression. We trained the models using 75% of the data and performed the evaluation using the remaining 25%. Results: The CART model had a κ=0.78, sensitivity of 96%, specificity of 90%, and a positive predictive value of 99%. The regularized logistic regression model had a κ=0.73, sensitivity of 97%, specificity of 81%, and a positive predictive value of 98%. Conclusion: Both the decision tree and logistic regression machine based learning models showed very high accuracy in identifying patients with a new diagnosis of ischemic stroke, using ICD-10 code claims data, when compared to our gold standard. Applying these machine learning models to identify patients with ischemic stroke has widespread applications, especially in this period where national billing data has transitioned from ICD-9 to ICD-10 codes.


PLoS ONE ◽  
2020 ◽  
Vol 15 (11) ◽  
pp. e0241917
Author(s):  
Malte Grosser ◽  
Susanne Gellißen ◽  
Patrick Borchert ◽  
Jan Sedlacik ◽  
Jawed Nawabi ◽  
...  

Background An accurate prediction of tissue outcome in acute ischemic stroke patients is of high interest for treatment decision making. To date, various machine learning models have been proposed that combine multi-parametric imaging data for this purpose. However, most of these machine learning models were trained using voxel information extracted from the whole brain, without taking differences in susceptibility to ischemia into account that exist between brain regions. The aim of this study was to develop and evaluate a local tissue outcome prediction approach, which makes predictions using locally trained machine learning models and thus accounts for regional differences. Material and methods Multi-parametric MRI data from 99 acute ischemic stroke patients were used for the development and evaluation of the local tissue outcome prediction approach. Diffusion (ADC) and perfusion parameter maps (CBF, CBV, MTT, Tmax) and corresponding follow-up lesion masks for each patient were registered to the MNI brain atlas. Logistic regression (LR) and random forest (RF) models were trained employing a local approach, which makes predictions using models individually trained for each specific voxel position using the corresponding local data. A global approach, which uses a single model trained using all voxels of the brain, was used for comparison. Tissue outcome predictions resulting from the global and local RF and LR models, as well as a combined (hybrid) approach were quantitatively evaluated and compared using the area under the receiver operating characteristic curve (ROC AUC), the Dice coefficient, and the sensitivity and specificity metrics. Results Statistical analysis revealed the highest ROC AUC and Dice values for the hybrid approach. With 0.872 (ROC AUC; LR) and 0.353 (Dice; RF), these values were significantly higher (p < 0.01) than the values of the two other approaches. In addition, the local approach achieved the highest sensitivity of 0.448 (LR). Overall, the hybrid approach was only outperformed in sensitivity (LR) by the local approach and in specificity by both other approaches. However, in these cases the effect sizes were comparatively small. Conclusion The results of this study suggest that using locally trained machine learning models can lead to better lesion outcome prediction results compared to a single global machine learning model trained using all voxel information independent of the location in the brain.


2020 ◽  
Vol 2 (1) ◽  
pp. 3-6
Author(s):  
Eric Holloway

Imagination Sampling is the usage of a person as an oracle for generating or improving machine learning models. Previous work demonstrated a general system for using Imagination Sampling for obtaining multibox models. Here, the possibility of importing such models as the starting point for further automatic enhancement is explored.


2021 ◽  
Author(s):  
Norberto Sánchez-Cruz ◽  
Jose L. Medina-Franco

<p>Epigenetic targets are a significant focus for drug discovery research, as demonstrated by the eight approved epigenetic drugs for treatment of cancer and the increasing availability of chemogenomic data related to epigenetics. This data represents a large amount of structure-activity relationships that has not been exploited thus far for the development of predictive models to support medicinal chemistry efforts. Herein, we report the first large-scale study of 26318 compounds with a quantitative measure of biological activity for 55 protein targets with epigenetic activity. Through a systematic comparison of machine learning models trained on molecular fingerprints of different design, we built predictive models with high accuracy for the epigenetic target profiling of small molecules. The models were thoroughly validated showing mean precisions up to 0.952 for the epigenetic target prediction task. Our results indicate that the herein reported models have considerable potential to identify small molecules with epigenetic activity. Therefore, our results were implemented as freely accessible and easy-to-use web application.</p>


Sign in / Sign up

Export Citation Format

Share Document