P5705Machine learning for phenotyping and risk prediction in cardiovascular diseases: a systematic review

2019 ◽  
Vol 40 (Supplement_1) ◽  
Author(s):  
A Banerjee ◽  
S Chen ◽  
G Fatemifar ◽  
H Hemingway ◽  
T Lumbers ◽  
...  

Abstract Introduction Heart failure (HF), acute coronary syndromes (ACS) and atrial fibrillation (AF) are among the commonest cardiovascular diseases (CVD), frequently co-exist and share pathophysiology. Definitions of diagnosis and prognosis are suboptimal. Machine learning (ML) is increasingly used in subtype definition and risk prediction, but the design, methods and results of studies have not been appraised. Purpose To conduct a systematic review of ML for discovery of new subtypes and risk prediction in HF, ACS and AF. Methods PubMed, MEDLINE, and Web of Science databases were searched (January 2000-August 2018) for English language publications with agreed search terms pertaining to machine learning, clustering, CVD, subtype and risk prediction. The baseline characteristics of the study population, the method of ML, covariates and results were extracted for each study. Results Of 5012 identified studies, 43 met inclusion criteria. Of the 33 studies of unsupervised ML for disease clustering (mean n=2354; min 117, max 44886), there were 22 in HF, 9 in ACS and 2 in AF. 22/33 studies involved <1000 individuals and 24 were based in North America. Across diseases, 27 studies were in outpatients, and 5 used trial data. The mean number of covariates used was 26; most commonly demographic and symptom variables. The ML methods used were partitional (n=12), hierarchical (n=4), self-organising map (n=1) and hidden Markov model (n=1). Most studies used only one ML method (n=25). Only 15 studies validated or replicated findings. 20/33 studies found 2 or 3 disease clusters, Most studies found 2–3 clusters (20/33) and most clusters were based on physical or physiological characteristics (30/33). Of the 10 studies of supervised ML for risk prediction (mean n=43003; min 228, max 378256), 4 were in HF, 5 in ACS and 1 in AF. 2/11 studies involved <1000 individuals and most were from North America (n=6). All studies had an observational design, used at least 2 ML methods and validated or replicated findings. The setting was varied: primary care (n=2), emergency department (n=2), inpatient (n=4) and mixed (n=2). The mean number of covariates was 102. The commonest ML methods were neural networks (n=5), random forest (n=4) and support vector machine (n=4). All studies showed positive finding, i.e. ML approaches improved risk prediction. Conclusions Studies to-date of ML in HF, ACS and AF have focused on North America (68.2%), and 50% included less than 1000 individuals. Moreover, there is heterogeneity in clinical setting, study designs for data collection and ML methods used. Comparison between methods of ML and validation are common to studies of risk prediction but not disease clustering. There is likely to be a publication bias of ML studies in HF, AF and ACS. ML may improve data-driven characterisation of CVD but consensus guidelines for reporting of research using ML are urgently needed to ensure the internal and external validity and applicability of study findings. Acknowledgement/Funding Innovative Medicines Initiative (European Union)

BMC Medicine ◽  
2021 ◽  
Vol 19 (1) ◽  
Author(s):  
Amitava Banerjee ◽  
Suliang Chen ◽  
Ghazaleh Fatemifar ◽  
Mohamad Zeina ◽  
R. Thomas Lumbers ◽  
...  

Abstract Background Machine learning (ML) is increasingly used in research for subtype definition and risk prediction, particularly in cardiovascular diseases. No existing ML models are routinely used for cardiovascular disease management, and their phase of clinical utility is unknown, partly due to a lack of clear criteria. We evaluated ML for subtype definition and risk prediction in heart failure (HF), acute coronary syndromes (ACS) and atrial fibrillation (AF). Methods For ML studies of subtype definition and risk prediction, we conducted a systematic review in HF, ACS and AF, using PubMed, MEDLINE and Web of Science from January 2000 until December 2019. By adapting published criteria for diagnostic and prognostic studies, we developed a seven-domain, ML-specific checklist. Results Of 5918 studies identified, 97 were included. Across studies for subtype definition (n = 40) and risk prediction (n = 57), there was variation in data source, population size (median 606 and median 6769), clinical setting (outpatient, inpatient, different departments), number of covariates (median 19 and median 48) and ML methods. All studies were single disease, most were North American (n = 61/97) and only 14 studies combined definition and risk prediction. Subtype definition and risk prediction studies respectively had limitations in development (e.g. 15.0% and 78.9% of studies related to patient benefit; 15.0% and 15.8% had low patient selection bias), validation (12.5% and 5.3% externally validated) and impact (32.5% and 91.2% improved outcome prediction; no effectiveness or cost-effectiveness evaluations). Conclusions Studies of ML in HF, ACS and AF are limited by number and type of included covariates, ML methods, population size, country, clinical setting and focus on single diseases, not overlap or multimorbidity. Clinical utility and implementation rely on improvements in development, validation and impact, facilitated by simple checklists. We provide clear steps prior to safe implementation of machine learning in clinical practice for cardiovascular diseases and other disease areas.


Author(s):  
Mirza Rizwan Sajid ◽  
Bader A. Almehmadi ◽  
Waqas Sami ◽  
Mansour K. Alzahrani ◽  
Noryanti Muhammad ◽  
...  

Criticism of the implementation of existing risk prediction models (RPMs) for cardiovascular diseases (CVDs) in new populations motivates researchers to develop regional models. The predominant usage of laboratory features in these RPMs is also causing reproducibility issues in low–middle-income countries (LMICs). Further, conventional logistic regression analysis (LRA) does not consider non-linear associations and interaction terms in developing these RPMs, which might oversimplify the phenomenon. This study aims to develop alternative machine learning (ML)-based RPMs that may perform better at predicting CVD status using nonlaboratory features in comparison to conventional RPMs. The data was based on a case–control study conducted at the Punjab Institute of Cardiology, Pakistan. Data from 460 subjects, aged between 30 and 76 years, with (1:1) gender-based matching, was collected. We tested various ML models to identify the best model/models considering LRA as a baseline RPM. An artificial neural network and a linear support vector machine outperformed the conventional RPM in the majority of performance matrices. The predictive accuracies of the best performed ML-based RPMs were between 80.86 and 81.09% and were found to be higher than 79.56% for the baseline RPM. The discriminating capabilities of the ML-based RPMs were also comparable to baseline RPMs. Further, ML-based RPMs identified substantially different orders of features as compared to baseline RPM. This study concludes that nonlaboratory feature-based RPMs can be a good choice for early risk assessment of CVDs in LMICs. ML-based RPMs can identify better order of features as compared to the conventional approach, which subsequently provided models with improved prognostic capabilities.


Author(s):  
Nghia H Nguyen ◽  
Dominic Picetti ◽  
Parambir S Dulai ◽  
Vipul Jairath ◽  
William J Sandborn ◽  
...  

Abstract Background and Aims There is increasing interest in machine learning-based prediction models in inflammatory bowel diseases (IBD). We synthesized and critically appraised studies comparing machine learning vs. traditional statistical models, using routinely available clinical data for risk prediction in IBD. Methods Through a systematic review till January 1, 2021, we identified cohort studies that derived and/or validated machine learning models, based on routinely collected clinical data in patients with IBD, to predict the risk of harboring or developing adverse clinical outcomes, and reported its predictive performance against a traditional statistical model for the same outcome. We appraised the risk of bias in these studies using the Prediction model Risk of Bias ASsessment (PROBAST) tool. Results We included 13 studies on machine learning-based prediction models in IBD encompassing themes of predicting treatment response to biologics and thiopurines, predicting longitudinal disease activity and complications and outcomes in patients with acute severe ulcerative colitis. The most common machine learnings models used were tree-based algorithms, which are classification approaches achieved through supervised learning. Machine learning models outperformed traditional statistical models in risk prediction. However, most models were at high risk of bias, and only one was externally validated. Conclusions Machine learning-based prediction models based on routinely collected data generally perform better than traditional statistical models in risk prediction in IBD, though frequently have high risk of bias. Future studies examining these approaches are warranted, with special focus on external validation and clinical applicability.


Sensors ◽  
2020 ◽  
Vol 20 (21) ◽  
pp. 6100
Author(s):  
Vibhuti Gupta ◽  
Thomas M. Braun ◽  
Mosharaf Chowdhury ◽  
Muneesh Tewari ◽  
Sung Won Choi

Machine learning techniques are widely used nowadays in the healthcare domain for the diagnosis, prognosis, and treatment of diseases. These techniques have applications in the field of hematopoietic cell transplantation (HCT), which is a potentially curative therapy for hematological malignancies. Herein, a systematic review of the application of machine learning (ML) techniques in the HCT setting was conducted. We examined the type of data streams included, specific ML techniques used, and type of clinical outcomes measured. A systematic review of English articles using PubMed, Scopus, Web of Science, and IEEE Xplore databases was performed. Search terms included “hematopoietic cell transplantation (HCT),” “autologous HCT,” “allogeneic HCT,” “machine learning,” and “artificial intelligence.” Only full-text studies reported between January 2015 and July 2020 were included. Data were extracted by two authors using predefined data fields. Following PRISMA guidelines, a total of 242 studies were identified, of which 27 studies met the inclusion criteria. These studies were sub-categorized into three broad topics and the type of ML techniques used included ensemble learning (63%), regression (44%), Bayesian learning (30%), and support vector machine (30%). The majority of studies examined models to predict HCT outcomes (e.g., survival, relapse, graft-versus-host disease). Clinical and genetic data were the most commonly used predictors in the modeling process. Overall, this review provided a systematic review of ML techniques applied in the context of HCT. The evidence is not sufficiently robust to determine the optimal ML technique to use in the HCT setting and/or what minimal data variables are required.


2020 ◽  
Vol 10 (15) ◽  
pp. 5135
Author(s):  
Nuria Caballé-Cervigón ◽  
José L. Castillo-Sequera ◽  
Juan A. Gómez-Pulido ◽  
José M. Gómez-Pulido ◽  
María L. Polo-Luque

Human healthcare is one of the most important topics for society. It tries to find the correct effective and robust disease detection as soon as possible to patients receipt the appropriate cares. Because this detection is often a difficult task, it becomes necessary medicine field searches support from other fields such as statistics and computer science. These disciplines are facing the challenge of exploring new techniques, going beyond the traditional ones. The large number of techniques that are emerging makes it necessary to provide a comprehensive overview that avoids very particular aspects. To this end, we propose a systematic review dealing with the Machine Learning applied to the diagnosis of human diseases. This review focuses on modern techniques related to the development of Machine Learning applied to diagnosis of human diseases in the medical field, in order to discover interesting patterns, making non-trivial predictions and useful in decision-making. In this way, this work can help researchers to discover and, if necessary, determine the applicability of the machine learning techniques in their particular specialties. We provide some examples of the algorithms used in medicine, analysing some trends that are focused on the goal searched, the algorithm used, and the area of applications. We detail the advantages and disadvantages of each technique to help choose the most appropriate in each real-life situation, as several authors have reported. The authors searched Scopus, Journal Citation Reports (JCR), Google Scholar, and MedLine databases from the last decades (from 1980s approximately) up to the present, with English language restrictions, for studies according to the objectives mentioned above. Based on a protocol for data extraction defined and evaluated by all authors using PRISMA methodology, 141 papers were included in this advanced review.


2021 ◽  
Vol 13 (1) ◽  
Author(s):  
Sergio Grueso ◽  
Raquel Viejo-Sobera

Abstract Background An increase in lifespan in our society is a double-edged sword that entails a growing number of patients with neurocognitive disorders, Alzheimer’s disease being the most prevalent. Advances in medical imaging and computational power enable new methods for the early detection of neurocognitive disorders with the goal of preventing or reducing cognitive decline. Computer-aided image analysis and early detection of changes in cognition is a promising approach for patients with mild cognitive impairment, sometimes a prodromal stage of Alzheimer’s disease dementia. Methods We conducted a systematic review following PRISMA guidelines of studies where machine learning was applied to neuroimaging data in order to predict whether patients with mild cognitive impairment might develop Alzheimer’s disease dementia or remain stable. After removing duplicates, we screened 452 studies and selected 116 for qualitative analysis. Results Most studies used magnetic resonance image (MRI) and positron emission tomography (PET) data but also magnetoencephalography. The datasets were mainly extracted from the Alzheimer’s disease neuroimaging initiative (ADNI) database with some exceptions. Regarding the algorithms used, the most common was support vector machine with a mean accuracy of 75.4%, but convolutional neural networks achieved a higher mean accuracy of 78.5%. Studies combining MRI and PET achieved overall better classification accuracy than studies that only used one neuroimaging technique. In general, the more complex models such as those based on deep learning, combined with multimodal and multidimensional data (neuroimaging, clinical, cognitive, genetic, and behavioral) achieved the best performance. Conclusions Although the performance of the different methods still has room for improvement, the results are promising and this methodology has a great potential as a support tool for clinicians and healthcare professionals.


2021 ◽  
Vol 3 (Supplement_3) ◽  
pp. iii17-iii17
Author(s):  
Waverly Rose Brim ◽  
Leon Jekel ◽  
Gabriel Cassinelli Petersen ◽  
Harry Subramanian ◽  
Tal Zeevi ◽  
...  

Abstract Purpose Medical staging, surgical planning, and therapeutic decisions are significantly different for brain metastases versus gliomas. Machine learning (ML) algorithms have been developed to differentiate these pathologies. We performed a systematic review to characterize ML methods and to evaluate their accuracy. Methods Studies on the application of machine learning in neuro-oncology were searched in Ovid Embase, Ovid MEDLINE, Cochrane trials (CENTRAL) and Web of science core-collection. A search strategy was designed in compliance with a clinical librarian and confirmed by a second librarian. The search strategy comprised of controlled vocabulary including artificial intelligence, machine learning, deep learning, magnetic resonance imaging, and glioma. The initial search was performed in October 2020 and then updated in February 2021. Candidate articles were screened in Covidence by at least two reviewers each. A bias analysis was conducted in agreement with TRIPOD, a bias assessment tool similar to CLAIM. Results Twenty-nine articles were used for data extraction. Four articles specified model development for solitary brain metastases. Classical ML (cML) algorithms represented 85% of models used, while deep learning (DL) accounted for 15%. cML algorithms performed with an average accuracy, sensitivity, and specificity of 82%, 78%, 88%, respectively; DL performed 84%, 79%, 81%. The support vector machine (SVM) algorithm was the most common used cML model in the literature and convolutional neural networks (CNN) were standard for DL models. We also found T1, T1 post-gadolinium and T2 sequences were most commonly used for feature extraction. Preliminary TRIPOD analysis yielded an average score of 14.25 (range 8–18). Conclusion ML algorithms that can accurately classify glioma from brain metastases have been developed. SVM and CNN are leading approaches with high accuracy. Standardized algorithm performance reporting is a clear limitation to be addressed in future studies.


2021 ◽  
Vol 17 (2) ◽  
pp. 57-67
Author(s):  
Rida Elyamani ◽  
Abdelmajid Soulaymani ◽  
Hind Hami

OBJECTIVE: To provide a systematic review of studies on cardiovascular diseases (CVD) and their risk factors in the Moroccan population. METHODS: A systematic analysis was performed based on PRISMA guidelines by retrieving data bases (Medline, Embase, and other) using technical keywords in addition to manual research on official websites. Published studies in the English or French language, conducted in Morocco or concerning the Moroccan population within the last two decades, were identified. RESULTS: This is the first systematic review of CVD in Morocco. Data from 159 studies were retrieved and analyzed. Most studies were written in the English language (75.89%) and published between 2010 and 2019 (85.47%). The mortality rate caused by CVD in Morocco has reached 38%, with ischemic heart disease and stroke as the main events causing death (31.0% and 22.5% respectively). The risk factors present in the population studied were headed by tobacco smoking (45- 50%), followed by physical inactivity (21.1%), elevated rate of hypertension (25.3%), and depression (5.47%). Impacted by a high rate of illiteracy and poverty and an unprepared health care system in Morocco, these numbers are expected to increase over the next decade. CONCLUSIONS: Based on these alarming incidences, investment in scientific research and epidemiological studies should be increased to determine the needs of the local population. The available evidence shows that the risk of cardiovascular disease and the associated mortality is very high in Morocco and will rise in the next years prospectively, which calls for urgent multi-sectorial approaches and treatment strategies.


2021 ◽  
Author(s):  
Sergio Grueso ◽  
Raquel Viejo-Sobera

Abstract Background: Increase in life-span in our society is a double-edged sword that entails a growing number of patients with neurocognitive disorders, Alzheimer’s disease being the most prevalent. Advances in medical imaging and computational power, enable new methods for early detection of neurocognitive disorders with the goal of preventing or reducing cognitive decline. Computer-aided image analysis and early detection of changes in cognition is a promising approach for patients with mild cognitive impairment, sometimes a prodromal stage of Alzheimer’s disease.Methods: We conducted a systematic review following PRISMA guidelines of studies where Machine Learning was applied to neuroimaging data in order to predict the progression from Mild Cognitive Impairment to Alzheimer’s disease. After removing duplicates, we screened 159 studies and selected 47 for a qualitative analysis. Results: Most studies used Magnetic Resonance Image and Positron Emission Tomography data but also Magnetoencephalography. The datasets were mainly extracted from the Alzheimer’s disease Neuroimage Initiative (ADNI) database with some exceptions. Regarding the algorithms used, the most common were support vector machines, but more complex models such as Deep Learning, combined with multimodal and multidimensional data (neuroimaging, clinical, cognitive, biological, and behavioral) achieved the best performance. Conclusions: Although performance of the different models still has room for improvement, the results are promising and this methodology has a great potential as a support tool for clinicians and healthcare professionals.


BMJ Open ◽  
2019 ◽  
Vol 9 (4) ◽  
pp. e027049 ◽  
Author(s):  
F Fatoye ◽  
P Smith ◽  
T Gebrye ◽  
G Yeowell

ObjectivesThis study examined patient adherence and persistence to oral bisphosphonates for the treatment of osteoporosis in real-world settings.MethodsA systematic review was completed according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses. Medical Literature Analysis and Retrieval System Online (MEDLINE), Cumulative Index to Nursing and Allied Health Literature (CINAHL), Allied and Complementary Medicine Database (AMED), Database of Abstracts of Reviews of Effects (DARE), Health Technology Assessment (HTA) and National Health Service Economic Evaluation Database NHS EED) databases were searched for studies published in English language up to April 2018. Prospective and retrospective observational studies that used prescription claim databases or hospital medical records to examine patient adherence and persistence to oral bisphosphonate treatment among adults with osteoporosis were included. The Newcastle–Ottawa quality assessment scale (NOS) was used to assess the quality of included studies.ResultsThe search yielded 540 published studies, of which 89 were deemed relevant and were included in this review. The mean age of patients included within the studies ranged between 53 to 80.8 years, and the follow-up varied from 3 months to 14 years. The mean persistence of oral bisphosphonates for 6 months, 1 year and 2 years ranged from 34.8% to 71.3%, 17.7% to 74.8% and 12.9% to 72.0%, respectively. The mean medication possession ratio ranged from 28.2% to 84.5%, 23% to 50%, 27.2% to 46% over 1 year, 2 years and 3 years, respectively. All studies included scored between 6 to 8 out of 9 on the NOS. The determinants of adherence and persistence to oral bisphosphonates included geographic residence, marital status, tobacco use, educational status, income, hospitalisation, medication type and dosing frequency.ConclusionsWhile a number of studies reported high levels of persistence and adherence, the findings of this review suggest that patient persistence and adherence with oral bisphosphonates medications was poor and reduced notably over time. Overall, adherence was suboptimal. To maximise adherence and persistence to oral bisphosphonates, it is important to consider possible determinants, including characteristics of the patients.


Sign in / Sign up

Export Citation Format

Share Document