scholarly journals Personally Tailored Survival Prediction of Patients With Follicular Lymphoma Using Machine Learning Transcriptome-Based Models

2022 ◽  
Vol 11 ◽  
Author(s):  
Adrián Mosquera Orgueira ◽  
Miguel Cid López ◽  
Andrés Peleteiro Raíndo ◽  
Aitor Abuín Blanco ◽  
Jose Ángel Díaz Arias ◽  
...  

Follicular Lymphoma (FL) has a 10-year mortality rate of 20%, and this is mostly related to lymphoma progression and transformation to higher grades. In the era of personalized medicine it has become increasingly important to provide patients with an optimal prediction about their expected outcomes. The objective of this work was to apply machine learning (ML) tools on gene expression data in order to create individualized predictions about survival in patients with FL. Using data from two different studies, we were able to create a model which achieved good prediction accuracies in both cohorts (c-indexes of 0.793 and 0.662 in the training and test sets). Integration of this model with m7-FLIPI and age rendered high prediction accuracies in the test set (cox c-index 0.79), and a simplified approach identified 4 groups with remarkably different outcomes in terms of survival. Importantly, one of the groups comprised 27.35% of patients and had a median survival of 4.64 years. In summary, we have created a gene expression-based individualized predictor of overall survival in FL that can improve the predictions of the m7-FLIPI score.

Leukemia ◽  
2021 ◽  
Author(s):  
Adrián Mosquera Orgueira ◽  
Marta Sonia González Pérez ◽  
José Ángel Díaz Arias ◽  
Beatriz Antelo Rodríguez ◽  
Natalia Alonso Vence ◽  
...  

Blood ◽  
2007 ◽  
Vol 110 (11) ◽  
pp. 2610-2610
Author(s):  
Cheryl J. Foster ◽  
Tara Baetz ◽  
Roland Somogyi ◽  
Larry D. Greller ◽  
Roger Sidhu ◽  
...  

Abstract Background: Follicular lymphoma (FL) is the second most common type of non-Hodgkin lymphoma in the Western world. It is generally an indolent disease, however some patients experience rapid clinical progression. Identification of this subset of patients at the time of initial diagnosis would allow for more informed decisions to be made regarding clinical management. Methods: We investigated whether a multi-dimensional profiling approach using gene expression microarrays, a tissue microarray (TMA) and baseline clinical parameters, would allow for survival prediction for FL patients. Sixty-seven cases of FL were identified, of which high quality gene expression data were obtained with a minimum of 5 years of follow-up for 41 patients. Expression data were subjected to Predictive Interaction Analysis (PIA) to identify pairs of interacting genes that predict poor outcome, defined as death within five years of diagnosis. A TMA of all 67 patients was subjected to immunohistochemistry for markers routinely used in lymphoma diagnosis, and for numerous proteins whose relevance to oncogenesis is well-established, including p53, bcl-2, bcl-6, MUM1, p16 and p65. Results: Gene expression analysis revealed numerous genes that are highly predictive of clinical outcome. Many of these genes are known to be involved in pathways that regulate apoptosis, cell survival, proliferation and hematological function. The highly predictive single genes included BMX, NOTCH2, TFF3, BIRC4 and RIPK5, which have established roles in promoting or antagonizing apoptosis. The PIA approach further identified numerous pairs of genes that together possess greater predictive power than their individual constituent genes. Subsequent Kaplan-Meier analysis indicated that segregation of the cases according to the top performing gene pair, LOXL3 and NTS, produced two groups of cases with significantly different survival. This gene pair was able to further differentiate patient outcomes following stratification of the cases according to the Follicular Lymphoma International Prognostic Index (FLIPI), indicating its utility in providing supplementary information to the FLIPI. Upon analysis of the TMA results, detectable expression of p53 in lymphoma cells, along with clinical involvement of multiple nodal sites and B symptoms emerged as significant predictors of overall survival. Further IHC results examining expression of proteins identified as highly predictive at the transcript level will be presented. Conclusions: Our results support the utility of our profiling approach for the identification of candidate biomarkers in follicular lymphoma. Queen’s University and Biosystemix Ltd are co-owners of the intellectual property and are respectively the assignees of a provisional patent application filed at the US PTO in Sept. 2007. Roland Somogyi PhD and Larry D. Greller PhD as founding directors retain ownership positions in Biosystemix Ltd, a privately held company incorporated in Canada. Queen’s University, Biosystemix, and all the co-authors believe and agree to the best of our respective knowledge that there are no conflicts of interest in how the study was initiated, conducted, analyzed, and reported.


2021 ◽  
Vol 11 ◽  
Author(s):  
Adrián Mosquera Orgueira ◽  
Andrés Peleteiro Raíndo ◽  
Miguel Cid López ◽  
José Ángel Díaz Arias ◽  
Marta Sonia González Pérez ◽  
...  

Acute Myeloid Leukemia (AML) is a heterogeneous neoplasm characterized by cytogenetic and molecular alterations that drive patient prognosis. Currently established risk stratification guidelines show a moderate predictive accuracy, and newer tools that integrate multiple molecular variables have proven to provide better results. In this report, we aimed to create a new machine learning model of AML survival using gene expression data. We used gene expression data from two publicly available cohorts in order to create and validate a random forest predictor of survival, which we named ST-123. The most important variables in the model were age and the expression of KDM5B and LAPTM4B, two genes previously associated with the biology and prognostication of myeloid neoplasms. This classifier achieved high concordance indexes in the training and validation sets (0.7228 and 0.6988, respectively), and predictions were particularly accurate in patients at the highest risk of death. Additionally, ST-123 provided significant prognostic improvements in patients with high-risk mutations. Our results indicate that survival of patients with AML can be predicted to a great extent by applying machine learning tools to transcriptomic data, and that such predictions are particularly precise among patients with high-risk mutations.


Cell Cycle ◽  
2018 ◽  
Vol 17 (4) ◽  
pp. 486-491 ◽  
Author(s):  
Nicolas Borisov ◽  
Victor Tkachev ◽  
Maria Suntsova ◽  
Olga Kovalchuk ◽  
Alex Zhavoronkov ◽  
...  

2016 ◽  
Vol 24 (1) ◽  
pp. 54-65 ◽  
Author(s):  
Stefano Parodi ◽  
Chiara Manneschi ◽  
Damiano Verda ◽  
Enrico Ferrari ◽  
Marco Muselli

This study evaluates the performance of a set of machine learning techniques in predicting the prognosis of Hodgkin’s lymphoma using clinical factors and gene expression data. Analysed samples from 130 Hodgkin’s lymphoma patients included a small set of clinical variables and more than 54,000 gene features. Machine learning classifiers included three black-box algorithms ( k-nearest neighbour, Artificial Neural Network, and Support Vector Machine) and two methods based on intelligible rules (Decision Tree and the innovative Logic Learning Machine method). Support Vector Machine clearly outperformed any of the other methods. Among the two rule-based algorithms, Logic Learning Machine performed better and identified a set of simple intelligible rules based on a combination of clinical variables and gene expressions. Decision Tree identified a non-coding gene ( XIST) involved in the early phases of X chromosome inactivation that was overexpressed in females and in non-relapsed patients. XIST expression might be responsible for the better prognosis of female Hodgkin’s lymphoma patients.


2015 ◽  
Vol 2015 ◽  
pp. 1-7 ◽  
Author(s):  
Shuhei Kaneko ◽  
Akihiro Hirakawa ◽  
Chikuma Hamada

In the past decade, researchers in oncology have sought to develop survival prediction models using gene expression data. The least absolute shrinkage and selection operator (lasso) has been widely used to select genes that truly correlated with a patient’s survival. The lasso selects genes for prediction by shrinking a large number of coefficients of the candidate genes towards zero based on a tuning parameter that is often determined by a cross-validation (CV). However, this method can pass over (or fail to identify) true positive genes (i.e., it identifies false negatives) in certain instances, because the lasso tends to favor the development of a simple prediction model. Here, we attempt to monitor the identification of false negatives by developing a method for estimating the number of true positive (TP) genes for a series of values of a tuning parameter that assumes a mixture distribution for the lasso estimates. Using our developed method, we performed a simulation study to examine its precision in estimating the number of TP genes. Additionally, we applied our method to a real gene expression dataset and found that it was able to identify genes correlated with survival that a CV method was unable to detect.


Sign in / Sign up

Export Citation Format

Share Document