scholarly journals Machine learning identifies two autophagy-related genes as markers of recurrence in colorectal cancer

2020 ◽  
Vol 48 (10) ◽  
pp. 030006052095880
Author(s):  
Jianping Wu ◽  
Sulai Liu ◽  
Xiaoming Chen ◽  
Hongfei Xu ◽  
Yaoping Tang

Objective Colorectal cancer (CRC) is the most common cancer worldwide. Patient outcomes following recurrence of CRC are very poor. Therefore, identifying the risk of CRC recurrence at an early stage would improve patient care. Accumulating evidence shows that autophagy plays an active role in tumorigenesis, recurrence, and metastasis. Methods We used machine learning algorithms and two regression models, univariable Cox proportion and least absolute shrinkage and selection operator (LASSO), to identify 26 autophagy-related genes (ARGs) related to CRC recurrence. Results By functional annotation, these ARGs were shown to be enriched in necroptosis and apoptosis pathways. Protein–protein interactions identified SQSTM1, CASP8, HSP80AB1, FADD, and MAPK9 as core genes in CRC autophagy. Of 26 ARGs, BAX and PARP1 were regarded as having the most significant predictive ability of CRC recurrence, with prediction accuracy of 71.1%. Conclusion These results shed light on prediction of CRC recurrence by ARGs. Stratification of patients into recurrence risk groups by testing ARGs would be a valuable tool for early detection of CRC recurrence.

2020 ◽  
Vol 38 (15_suppl) ◽  
pp. 10076-10076
Author(s):  
Michael Moore ◽  
Isabel D Friesner ◽  
Emanuelle M. Rizk ◽  
Megan Trager ◽  
Julide T. Celebi ◽  
...  

10076 Background: Patients diagnosed with early stage melanoma are at risk of recurrence and death. Adjuvant therapy decreases risk but incurs toxicity and expense. While tumor-infiltrating lymphocytes (TILs) improve prognosis, studies have shown conflicting results due, at least in part, to inter-observer variability. Thus, TILs are not included in standard American Joint Committee on Cancer (AJCC) staging. Here, we quantitatively analyze TILs in hematoxylin and eosin (H&E) melanoma images using two machine learning algorithms. Methods: H&E images were evaluated by two methods for patients with resectable stage I-III melanoma from Columbia (N = 81) and validated using samples from Geisinger and Moffitt (N = 128). For both methods, H&E images were manually annotated using open source software, QuPath, to specify tumor regions. For Method A, images were divided into patches and, for each patch, a probability was generated to detect lymphocytes. Patches above a set threshold were considered to be “TIL positive”. Ratio of TIL positive patches to total patches was assessed for every image. For Method B, a classifier was manually trained in QuPath and then applied on each image to determine the ratio of the areas of all immune cells to all tumor cells as previously published. Cutoff values to define high and low risk groups were established based on a test set and then validated in an independent cohort. Results: Both methods distinguished patients with visceral recurrence from those without for the Columbia training set (Method A p = .0015, Method B p = .043). Using Method A, Kaplan-Meier curve at the selected cutoff also correlated significantly with disease specific survival (DSS) for Columbia (p = .022) and was validated in the Geisinger/Moffitt (p = .046) cohort. Cox analysis using Method A showed that TIL status predicted DSS in the validation set (p = .047) and added significantly to depth and ulceration (HR = 3.43, CI: 1.047-11.257, p = .042). Conclusions: Both open source machine learning algorithms find significantly higher TILs in patients who do not develop metastasis. Notably, Method A may add to standard predictors, such as depth and ulceration. These results demonstrate the promise of computational algorithms to enhance visual grading, and suggest that digital TIL evaluation may add to current AJCC staging. [Table: see text]


2021 ◽  
Vol 28 (1) ◽  
pp. e100251
Author(s):  
Ian Scott ◽  
Stacey Carter ◽  
Enrico Coiera

Machine learning algorithms are being used to screen and diagnose disease, prognosticate and predict therapeutic responses. Hundreds of new algorithms are being developed, but whether they improve clinical decision making and patient outcomes remains uncertain. If clinicians are to use algorithms, they need to be reassured that key issues relating to their validity, utility, feasibility, safety and ethical use have been addressed. We propose a checklist of 10 questions that clinicians can ask of those advocating for the use of a particular algorithm, but which do not expect clinicians, as non-experts, to demonstrate mastery over what can be highly complex statistical and computational concepts. The questions are: (1) What is the purpose and context of the algorithm? (2) How good were the data used to train the algorithm? (3) Were there sufficient data to train the algorithm? (4) How well does the algorithm perform? (5) Is the algorithm transferable to new clinical settings? (6) Are the outputs of the algorithm clinically intelligible? (7) How will this algorithm fit into and complement current workflows? (8) Has use of the algorithm been shown to improve patient care and outcomes? (9) Could the algorithm cause patient harm? and (10) Does use of the algorithm raise ethical, legal or social concerns? We provide examples where an algorithm may raise concerns and apply the checklist to a recent review of diagnostic imaging applications. This checklist aims to assist clinicians in assessing algorithm readiness for routine care and identify situations where further refinement and evaluation is required prior to large-scale use.


Author(s):  
Adwait Patil

Abstract: Alzheimer’s disease is one of the neurodegenerative disorders. It initially starts with innocuous symptoms but gradually becomes severe. This disease is so dangerous because there is no treatment, the disease is detected but typically at a later stage. So it is important to detect Alzheimer at an early stage to counter the disease and for a probable recovery for the patient. There are various approaches currently used to detect symptoms of Alzheimer’s disease (AD) at an early stage. The fuzzy system approach is not widely used as it heavily depends on expert knowledge but is quite efficient in detecting AD as it provides a mathematical foundation for interpreting the human cognitive processes. Another more accurate and widely accepted approach is the machine learning detection of AD stages which uses machine learning algorithms like Support Vector Machines (SVMs) , Decision Tree , Random Forests to detect the stage depending on the data provided. The final approach is the Deep Learning approach using multi-modal data that combines image , genetic data and patient data using deep models and then uses the concatenated data to detect the AD stage more efficiently; this method is obscure as it requires huge volumes of data. This paper elaborates on all the three approaches and provides a comparative study about them and which method is more efficient for AD detection. Keywords: Alzheimer’s Disease (AD), Fuzzy System , Machine Learning , Deep Learning , Multimodal data


Energies ◽  
2022 ◽  
Vol 15 (2) ◽  
pp. 582
Author(s):  
Holger Behrends ◽  
Dietmar Millinger ◽  
Werner Weihs-Sedivy ◽  
Anže Javornik ◽  
Gerold Roolfs ◽  
...  

Faults and unintended conditions in grid-connected photovoltaic systems often cause a change of the residual current. This article describes a novel machine learning based approach to detecting anomalies in the residual current of a photovoltaic system. It can be used to detect faults or critical states at an early stage and extends conventional threshold-based detection methods. For this study, a power-hardware-in-the-loop approach was carried out, in which typical faults have been injected under ideal and realistic operating conditions. The investigation shows that faults in a photovoltaic converter system cause a unique behaviour of the residual current and fault patterns can be detected and identified by using pattern recognition and variational autoencoder machine learning algorithms. In this context, it was found that the residual current is not only affected by malfunctions of the system, but also by volatile external influences. One of the main challenges here is to separate the regular residual currents caused by the interferences from those caused by faults. Compared to conventional methods, which respond to absolute changes in residual current, the two machine learning models detect faults that do not affect the absolute value of the residual current.


2020 ◽  
Author(s):  
F. P. Chmiel ◽  
M. Azor ◽  
F. Borca ◽  
M. J. Boniface ◽  
D. K. Burns ◽  
...  

ABSTRACTShort-term reattendances to emergency departments are a key quality of care indicator. Identifying patients at increased risk of early reattendance can help reduce the number of patients with missed or undertreated illness or injury, and could support appropriate discharges with focused interventions. In this manuscript we present a retrospective, single-centre study where we create and evaluate a machine-learnt classifier trained to identify patients at risk of reattendance within 72 hours of discharge from an emergency department. On a patient hold-out test set, our highest performing classifier obtained an AUROC of 0.748 and an average precision of 0.250; demonstrating that machine-learning algorithms can be used to classify patients, with moderate performance, into low and high-risk groups for reattendance. In parallel to our predictive model we train an explanation model, capable of explaining predictions at an attendance level, which can be used to help inform the design of interventional strategies.


2019 ◽  
Vol 20 (3) ◽  
pp. 177-184 ◽  
Author(s):  
Nantao Zheng ◽  
Kairou Wang ◽  
Weihua Zhan ◽  
Lei Deng

Background:Targeting critical viral-host Protein-Protein Interactions (PPIs) has enormous application prospects for therapeutics. Using experimental methods to evaluate all possible virus-host PPIs is labor-intensive and time-consuming. Recent growth in computational identification of virus-host PPIs provides new opportunities for gaining biological insights, including applications in disease control. We provide an overview of recent computational approaches for studying virus-host PPI interactions.Methods:In this review, a variety of computational methods for virus-host PPIs prediction have been surveyed. These methods are categorized based on the features they utilize and different machine learning algorithms including classical and novel methods.Results:We describe the pivotal and representative features extracted from relevant sources of biological data, mainly include sequence signatures, known domain interactions, protein motifs and protein structure information. We focus on state-of-the-art machine learning algorithms that are used to build binary prediction models for the classification of virus-host protein pairs and discuss their abilities, weakness and future directions.Conclusion:The findings of this review confirm the importance of computational methods for finding the potential protein-protein interactions between virus and host. Although there has been significant progress in the prediction of virus-host PPIs in recent years, there is a lot of room for improvement in virus-host PPI prediction.


BMC Cancer ◽  
2019 ◽  
Vol 19 (1) ◽  
Author(s):  
Nathan Wan ◽  
David Weinberg ◽  
Tzu-Yu Liu ◽  
Katherine Niehaus ◽  
Eric A. Ariazi ◽  
...  

2020 ◽  
pp. 104225872094520
Author(s):  
Ivo Blohm ◽  
Torben Antretter ◽  
Charlotta Sirén ◽  
Dietmar Grichnik ◽  
Joakim Wincent

Investors increasingly use machine learning (ML) algorithms to support their early stage investment decisions. However, it remains unclear if algorithms can make better investment decisions and if so, why. Building on behavioral decision theory, our study compares the investment returns of an algorithm with those of 255 business angels (BAs) investing via an angel investment platform. We explore the influence of human biases and experience on BAs’ returns and find that investors only outperformed the algorithm when they had extensive investment experience and managed to suppress their cognitive biases. These results offer novel insights into the role of cognitive limitations, experience, and the use of algorithms in early stage investing.


2020 ◽  
Vol 38 (15_suppl) ◽  
pp. e17554-e17554
Author(s):  
Ioana Danciu ◽  
Samantha Erwin ◽  
Greeshma Agasthya ◽  
Tate Janet ◽  
Benjamin McMahon ◽  
...  

e17554 Background: The ability to understand and predict at the time of diagnosis the trajectories of prostate cancer patients is critical for deciding the appropriate treatment plan. Evidence-based approaches for outcome prediction include predictive machine learning algorithms that harness health record data. Methods: All our analyses used the Veterans Affairs Clinical Data Warehouse (CDW). We included all individuals with a non-metastatic (early stage) prostate cancer diagnosis between 2002 and 2017 as documented in the CDW cancer registry (N = 111351). Our predictors were demographics (age at diagnosis, race), disease staging parameters abstracted at diagnosis ( Stage grouping AJCC, Gleason score, SEER summary stage) and prostate specific antigen (PSA) laboratory values in the last 5 years prior to diagnosis (last value, the value before last, average, minimum, maximum, rate of the change of the last 2 PSAs and density). The predicted outcome was disease progression at 2 years (N = 3469) and 5 years (N = 6325) defined as metastasis - taking either Abiraterone, Sipuleucel-T, Enzalutamide or Radium 223, registry cancer related death or PSA > 50. We used 4 different machine learning classifiers to train prediction models: random forest, k-nearest neighbor, decision trees, and xgboost all with hyper parameter optimization. For testing, we used two approaches: (1) 20% sample held out at the beginning of the study, and (2) stratified test/train split on the remaining data. Results: The table below shows the performance of the best classifier, xgboost. The top five predictors of disease progression were the last PSA, Gleason Score, maximum PSA, age at diagnosis, and SEER summary stage. The last PSA had a significantly higher contribution than the other predictors. More than one PSA value is important for prediction, emphasizing the need for investigating the PSA trajectory in the period before diagnosis. The models are overall very robust going from outcome at 2 years compared to 5 years. Conclusions: A machine learning based xgboost classifier can be integrated in clinical decision support at diagnosis, to robustly predict disease progression at 2 and 5 years. [Table: see text]


Sign in / Sign up

Export Citation Format

Share Document