scholarly journals Predicting Cardiovascular Risk in Athletes: Resampling Improves Classification Performance

Author(s):  
Davide Barbieri ◽  
Nitesh Chawla ◽  
Luciana Zaccagni ◽  
Tonći Grgurinović ◽  
Jelena Šarac ◽  
...  

Cardiovascular diseases are the main cause of death worldwide. The aim of the present study is to verify the performances of a data mining methodology in the evaluation of cardiovascular risk in athletes, and whether the results may be used to support clinical decision making. Anthropometric (height and weight), demographic (age and sex) and biomedical (blood pressure and pulse rate) data of 26,002 athletes were collected in 2012 during routine sport medical examinations, which included electrocardiography at rest. Subjects were involved in competitive sport practice, for which medical clearance was needed. Outcomes were negative for the largest majority, as expected in an active population. Resampling was applied to balance positive/negative class ratio. A decision tree and logistic regression were used to classify individuals as either at risk or not. The receiver operating characteristic curve was used to assess classification performances. Data mining and resampling improved cardiovascular risk assessment in terms of increased area under the curve. The proposed methodology can be effectively applied to biomedical data in order to optimize clinical decision making, and—at the same time—minimize the amount of unnecessary examinations.

Author(s):  
Gebeyehu Belay Gebremeskel ◽  
Chai Yi ◽  
Zhongshi He ◽  
Dawit Haile

Purpose – Among the growing number of data mining (DM) techniques, outlier detection has gained importance in many applications and also attracted much attention in recent times. In the past, outlier detection researched papers appeared in a safety care that can view as searching for the needles in the haystack. However, outliers are not always erroneous. Therefore, the purpose of this paper is to investigate the role of outliers in healthcare services in general and patient safety care, in particular. Design/methodology/approach – It is a combined DM (clustering and the nearest neighbor) technique for outliers’ detection, which provides a clear understanding and meaningful insights to visualize the data behaviors for healthcare safety. The outcomes or the knowledge implicit is vitally essential to a proper clinical decision-making process. The method is important to the semantic, and the novel tactic of patients’ events and situations prove that play a significant role in the process of patient care safety and medications. Findings – The outcomes of the paper is discussing a novel and integrated methodology, which can be inferring for different biological data analysis. It is discussed as integrated DM techniques to optimize its performance in the field of health and medical science. It is an integrated method of outliers detection that can be extending for searching valuable information and knowledge implicit based on selected patient factors. Based on these facts, outliers are detected as clusters and point events, and novel ideas proposed to empower clinical services in consideration of customers’ satisfactions. It is also essential to be a baseline for further healthcare strategic development and research works. Research limitations/implications – This paper mainly focussed on outliers detections. Outlier isolation that are essential to investigate the reason how it happened and communications how to mitigate it did not touch. Therefore, the research can be extended more about the hierarchy of patient problems. Originality/value – DM is a dynamic and successful gateway for discovering useful knowledge for enhancing healthcare performances and patient safety. Clinical data based outlier detection is a basic task to achieve healthcare strategy. Therefore, in this paper, the authors focussed on combined DM techniques for a deep analysis of clinical data, which provide an optimal level of clinical decision-making processes. Proper clinical decisions can obtain in terms of attributes selections that important to know the influential factors or parameters of healthcare services. Therefore, using integrated clustering and nearest neighbors techniques give more acceptable searched such complex data outliers, which could be fundamental to further analysis of healthcare and patient safety situational analysis.


2021 ◽  
Author(s):  
Xudong Zhang ◽  
Jin-Cheng Wang ◽  
Baoqiang Wu ◽  
Tao Li ◽  
Lei Jin ◽  
...  

Abstract Background: Gallbladder polyps (GBPs) assessment seeks to identify early-stage gallbladder carcinoma (GBC). Many studies have analyzed the risk factors for malignant GBPs, and we try to establish a more accurate predictive model for potential neoplastic polyps in patients with GBPs.Methods: This retrospective study developed a nomogram-based model in a training cohort of 233 GBP patients. Clinical information, ultrasonographic findings, and blood tests were retrospectively analyzed. Spearman correlation and logistic regression analysis were used to identify independent predictors and establish a nomogram model. An internal validation was conducted in 225 consecutive patients. Performance of models was evaluated through the receiver operating characteristic curve (ROC) and decision curve analysis (DCA). Results: Age, cholelithiasis, CEA, polyp size and sessile were confirmed as independent predictors for neoplastic potential of GBPs in the training group. Compared with other proposed prediction methods, the established nomogram model presented good discrimination ability in the training cohort (area under the curve [AUC]: 0.845) and the validation cohort (AUC: 0.836). DCA demonstrated the most clinical benefits can be provided by the nomogram. Conclusions: Our developed preoperative nomogram model can successfully evaluate the neoplastic potential of GBPs based on simple clinical variables, that maybe useful for clinical decision-making.


2007 ◽  
Vol 9 (5) ◽  
pp. 339-341 ◽  
Author(s):  
Fernando Rodríguez-Artalejo ◽  
José R. Banegas

Author(s):  
Hakimeh Ameri ◽  
Somayeh Alizadeh ◽  
Elham Akhond Zadeh Noughabi

Data mining techniques are increasingly used in clinical decision making and help the physicians to make more accurate and effective decisions. In this chapter, a classification of data mining applications in clinical decision making is presented through a systematic review. The applications of data mining techniques in clinical decision making are divided into two main categories: diagnosis and treatment. Early prediction of medical conditions, detecting multi-morbidity and complications of diseases, identifying and predicting the chronic diseases and medical imaging are the subcategories which are defined in the diagnosis part. The Treatment category is composed of treatment effectiveness and predicting the average length of stay in hospital. The majority of the reviewed articles are related to diagnosis and there is only one article which discusses the determination of drug dosage in successful treatment. The classification model is the most commonly practical model in the clinical decision making.


2021 ◽  
Vol 28 (Supplement_1) ◽  
Author(s):  
L Dinc Asarcikli ◽  
M Kis ◽  
T Guvenc ◽  
V Tosun ◽  
B Acar ◽  
...  

Abstract Funding Acknowledgements Type of funding sources: None. OnBehalf CVSCORE-TR study group Background Friedewald equation (LDL-Cf) is known to produce inaccurate estimations of low-density lipoprotein cholesterol (LDL-C) when triglycerides are high (>400 mg/dl) or LDL-C is low (<70 mg/dl). Martin/Hopkins (LDL-Cmh) and Sampson (LDL-Cs) equations were developed to overcome these limitations, but few data are available whether these equations offer incremental usefulness over LDL-Cf. Purpose   In this pragmatic study, we aimed to evaluate the agreement between LDL-C calculated using LDL-Cmh, LDL-Cs and LDL-Cf equations and to understand whether using LDL-Cmh or LDL-Cs instead of LDL-Cf leads to significant changes on the clinical decision-making  Methods 4196 cardiology outpatient cases that were included in a multicenter registry database were analyzed. Each case was assigned into a cardiovascular risk class using web-based SCORE (Systematic COronary Risk Evaluation) algorithm calibrated for high-risk European countries, and relevant European guidelines were used to assess LDL-C targets. LDL-Cf, LDL-Cs and LDL-Cmh were calculated as previously described.  Results Compared to LDL-Cmh and LDL-Cs, LDL-Cf was able to correctly identify 96.9%-98.08% of cases as within or out of LDL-C target, respectively, while 1.95%-2.8% of cases were falsely identified as within LDL-C target. Kappa coefficients for agreement between LDL-Cf vs. LDL-Cmh and LDL-Cf vs. LDL-Cs were 0.868 and 0.918 (p < 0.001 for both). For patients not on anticholesterolemic drugs, decision to initiate treatment would be different in 1.2%-1.8% of cases if LDL-Cs or LDL-Cmh were used, respectively. For those already on anticholesterolemic drugs, decisions regarding to treatment intensification would be different in 1.5%-2.4% of cases if LDL-Cs or LDL-Cmh were used. Conclusions Friedewald equation had an excellent degree of agreement with the novel Martin/Hopkins and Sampson formulas in most cardiology outpatients, especially those within the lower end of the cardiovascular risk spectrum. In selected patients, especially those with high or very high risk in whom LDL-Cf < 70 mg/dl or those with a TG > 400 mg/dl, agreement was far worse and thus novel equations might have an incremental usefulness for clinical decision making. Table 1 Reference Comparison Correct estimation Underestimation Overestimation Kappa (p value) All patients that were not on cholesterol-lowering treatment LDL-Cmh LDL-Cf 2785 (98.1%) 51 (1.8%) 3 (0.1%) 0.962 (<0.001) LDL-Cs LDL-Cf 2804 (98.8%) 35 (1.2%) 0 (0.0%) 0.975 (<0.001) Agreement for the indication of cholesterol-lowering treatment for patients not already on cholesterol-lowering drugs. Leftmost column shows the reference method, and the second row shows equation which is compared to the reference method. Abstract Figure


Author(s):  
Alberto Santos ◽  
Ana R. Colaço ◽  
Annelaura B. Nielsen ◽  
Lili Niu ◽  
Philipp E. Geyer ◽  
...  

SummaryThe promise of precision medicine is to deliver personalized treatment based on the unique physiology of each patient. This concept was fueled by the genomic revolution, but it is now evident that integrating other types of omics data, like proteomics, into the clinical decision-making process will be essential to accomplish precision medicine goals. However, quantity and diversity of biomedical data, and the spread of clinically relevant knowledge across myriad biomedical databases and publications makes this exceptionally difficult. To address this, we developed the Clinical Knowledge Graph (CKG), an open source platform currently comprised of more than 16 million nodes and 220 million relationships to represent relevant experimental data, public databases and the literature. The CKG also incorporates the latest statistical and machine learning algorithms, drastically accelerating analysis and interpretation of typical proteomics workflows. We use several biomarker studies to illustrate how the CKG may support, enrich and accelerate clinical decision-making.Graphical Abstract


Sign in / Sign up

Export Citation Format

Share Document