scholarly journals Deep learning transcriptomic model for prediction of pan-drug chemotherapeutic sensitivity

2022 ◽  
pp. 1-16
Author(s):  
Eddie Guo ◽  
Pouria Torabi ◽  
Daiva E. Nielsen ◽  
Matthew Pietrosanu

The emergence of precision oncology approaches has begun to inform clinical decision-making in diagnostic, prognostic, and treatment contexts. High-throughput technology has enabled machine learning algorithms to use the molecular characteristics of tumors to generate personalized therapies. However, precision oncology studies have yet to develop a predictive biomarker incorporating pan-cancer gene expression profiles to stratify tumors into similar drug sensitivity profiles. Here we show that a neural network with ten hidden layers accurately classifies pancancer cell lines into two distinct chemotherapeutic response groups based on a pan-drug dataset with 89.0% accuracy (AUC = 0.904). Using unsupervised clustering algorithms, we found a cohort of cell line gene expression data from the Genomics of Drug Sensitivity in Cancer could be clustered into two response groups with significant differences in pan-drug chemotherapeutic sensitivity. After applying the Boruta feature selection algorithm to this dataset, a deep learning model was developed to predict chemotherapeutic response groups. The model’s high classification efficacy validates our hypothesis that cell lines with similar gene expression profiles present similar pan-drug chemotherapeutic sensitivity. This finding provides evidence for the potential use of similar combinatorial biomarkers to select potent candidate drugs that maximize therapeutic response and minimize the cytotoxic burden. Future investigations should aim to recursively subcluster cell lines within the response clusters defined in this study to provide a higher resolution of potential patient response to chemotherapeutics.

F1000Research ◽  
2016 ◽  
Vol 5 ◽  
pp. 2333 ◽  
Author(s):  
Zhaleh Safikhani ◽  
Petr Smirnov ◽  
Mark Freeman ◽  
Nehme El-Hachem ◽  
Adrian She ◽  
...  

In 2013, we published a comparative analysis mutation and gene expression profiles and drug sensitivity measurements for 15 drugs characterized in the 471 cancer cell lines screened in the Genomics of Drug Sensitivity in Cancer (GDSC) and Cancer Cell Line Encyclopedia (CCLE). While we found good concordance in gene expression profiles, there was substantial inconsistency in the drug responses reported by the GDSC and CCLE projects. We received extensive feedback on the comparisons that we performed. This feedback, along with the release of new data, prompted us to revisit our initial analysis. Here we present a new analysis using these expanded data in which we address the most significant suggestions for improvements on our published analysis — that targeted therapies and broad cytotoxic drugs should have been treated differently in assessing consistency, that consistency of both molecular profiles and drug sensitivity measurements should both be compared across cell lines, and that the software analysis tools we provided should have been easier to run, particularly as the GDSC and CCLE released additional data.             Our re-analysis supports our previous finding that gene expression data are significantly more consistent than drug sensitivity measurements. The use of new statistics to assess data consistency allowed us to identify two broad effect drugs and three targeted drugs with moderate to good consistency in drug sensitivity data between GDSC and CCLE. For three other targeted drugs, there were not enough sensitive cell lines to assess the consistency of the pharmacological profiles. We found evidence of inconsistencies in pharmacological phenotypes for the remaining eight drugs.             Overall, our findings suggest that the drug sensitivity data in GDSC and CCLE continue to present challenges for robust biomarker discovery. This re-analysis provides additional support for the argument that experimental standardization and validation of pharmacogenomic response will be necessary to advance the broad use of large pharmacogenomic screens.


F1000Research ◽  
2017 ◽  
Vol 5 ◽  
pp. 2333 ◽  
Author(s):  
Zhaleh Safikhani ◽  
Petr Smirnov ◽  
Mark Freeman ◽  
Nehme El-Hachem ◽  
Adrian She ◽  
...  

In 2013, we published a comparative analysis of mutation and gene expression profiles and drug sensitivity measurements for 15 drugs characterized in the 471 cancer cell lines screened in the Genomics of Drug Sensitivity in Cancer (GDSC) and Cancer Cell Line Encyclopedia (CCLE). While we found good concordance in gene expression profiles, there was substantial inconsistency in the drug responses reported by the GDSC and CCLE projects. We received extensive feedback on the comparisons that we performed. This feedback, along with the release of new data, prompted us to revisit our initial analysis. We present a new analysis using these expanded data, where we address the most significant suggestions for improvements on our published analysis — that targeted therapies and broad cytotoxic drugs should have been treated differently in assessing consistency, that consistency of both molecular profiles and drug sensitivity measurements should be compared across cell lines, and that the software analysis tools provided should have been easier to run, particularly as the GDSC and CCLE released additional data. Our re-analysis supports our previous finding that gene expression data are significantly more consistent than drug sensitivity measurements. Using new statistics to assess data consistency allowed identification of two broad effect drugs and three targeted drugs with moderate to good consistency in drug sensitivity data between GDSC and CCLE. For three other targeted drugs, there were not enough sensitive cell lines to assess the consistency of the pharmacological profiles. We found evidence of inconsistencies in pharmacological phenotypes for the remaining eight drugs. Overall, our findings suggest that the drug sensitivity data in GDSC and CCLE continue to present challenges for robust biomarker discovery. This re-analysis provides additional support for the argument that experimental standardization and validation of pharmacogenomic response will be necessary to advance the broad use of large pharmacogenomic screens.


2015 ◽  
Author(s):  
Zhaleh Safikhani ◽  
Mark Freeman ◽  
Petr Smirnov ◽  
Nehme El-Hachem ◽  
Adrian She ◽  
...  

Background: In 2012, two large pharmacogenomic studies, the Genomics of Drug Sensitivity in Cancer (GDSC) and Cancer Cell Line Encyclopedia (CCLE), were published, each reported gene expression data and measures of drug response for a large number of drugs and hundreds of cell lines. In 2013, we published a comparative analysis that reported gene expression profiles for the 471 cell lines profiled in both studies and dose response measurements for the 15 drugs characterized in the common cell lines by both studies. While we found good concordance in gene expression profiles, there was substantial inconsistency in the drug responses reported by the GDSC and CCLE projects. Our paper was widely discussed and we received extensive feedback on the comparisons that we performed. This feedback, along with the release of new data, prompted us to revisit our initial analysis. Here we present a new analysis using these expanded data in which we address the most significant suggestions for improvements on our published analysis: that drugs with different response characteristics should have been treated differently, that targeted therapies and broad cytotoxic drugs should have been treated differently in assessing consistency, that consistency of both molecular profiles and drug sensitivity measurements should both be compared across cell lines to accurately assess differences in the studies, that we missed some biomarkers that are consistent between studies, and that the software analysis tools we provided with our analysis should have been easier to run, particularly as the GDSC and CCLE released additional data. Methods: For each drug, we used published sensitivity data from the GDSC and CCLE to separately estimate drug dose-response curves. We then used two statistics, the area between drug dose-response curves (ABC) and the Matthews correlation coefficient (MCC), to robustly estimate the consistency of continuous and discrete drug sensitivity measures, respectively. We also used recently released RNA-seq data together with previously published gene expression microarray data to assess inter-platform reproducibility of cell line gene expression profiles. Results: This re-analysis supports our previous finding that gene expression data are significantly more consistent than drug sensitivity measurements. The use of new statistics to assess data consistency allowed us to identify two broad effect drugs -- 17-AAG and PD-0332901 -- and three targeted drugs -- PLX4720, nilotinib and crizotinib -- with moderate to good consistency in drug sensitivity data between GDSC and CCLE. Not enough sensitive cell lines were screened in both studies to robustly assess consistency for three other targeted drugs, PHA-665752, erlotinib, and sorafenib. Concurring with our published results, we found evidence of inconsistencies in pharmacological phenotypes for the remaining eight drugs. Further, to discover "consistency" between studies required the use of multiple statistics and the selection of specific measures on a case-by-case basis. Conclusion: Our results reaffirm our initial findings of an inconsistency in drug sensitivity measures for eight of fifteen drugs screened both in GDSC and CCLE, irrespective of which statistical metric was used to assess correlation. Taken together, our findings suggest that the phenotypic data on drug response in the GDSC and CCLE continue to present challenges for robust biomarker discovery. This re-analysis provides additional support for the argument that experimental standardization and validation of pharmacogenomic response will be necessary to advance the broad use of large pharmacogenomic screens.


F1000Research ◽  
2017 ◽  
Vol 5 ◽  
pp. 2333 ◽  
Author(s):  
Zhaleh Safikhani ◽  
Petr Smirnov ◽  
Mark Freeman ◽  
Nehme El-Hachem ◽  
Adrian She ◽  
...  

In 2013, we published a comparative analysis of mutation and gene expression profiles and drug sensitivity measurements for 15 drugs characterized in the 471 cancer cell lines screened in the Genomics of Drug Sensitivity in Cancer (GDSC) and Cancer Cell Line Encyclopedia (CCLE). While we found good concordance in gene expression profiles, there was substantial inconsistency in the drug responses reported by the GDSC and CCLE projects. We received extensive feedback on the comparisons that we performed. This feedback, along with the release of new data, prompted us to revisit our initial analysis. We present a new analysis using these expanded data, where we address the most significant suggestions for improvements on our published analysis — that targeted therapies and broad cytotoxic drugs should have been treated differently in assessing consistency, that consistency of both molecular profiles and drug sensitivity measurements should be compared across cell lines, and that the software analysis tools provided should have been easier to run, particularly as the GDSC and CCLE released additional data. Our re-analysis supports our previous finding that gene expression data are significantly more consistent than drug sensitivity measurements. Using new statistics to assess data consistency allowed identification of two broad effect drugs and three targeted drugs with moderate to good consistency in drug sensitivity data between GDSC and CCLE. For three other targeted drugs, there were not enough sensitive cell lines to assess the consistency of the pharmacological profiles. We found evidence of inconsistencies in pharmacological phenotypes for the remaining eight drugs. Overall, our findings suggest that the drug sensitivity data in GDSC and CCLE continue to present challenges for robust biomarker discovery. This re-analysis provides additional support for the argument that experimental standardization and validation of pharmacogenomic response will be necessary to advance the broad use of large pharmacogenomic screens.


Oncogene ◽  
2002 ◽  
Vol 21 (42) ◽  
pp. 6549-6556 ◽  
Author(s):  
Jiafu Ji ◽  
Xin Chen ◽  
Suet Yi Leung ◽  
Jen-Tsan A Chi ◽  
Kent Man Chu ◽  
...  

2006 ◽  
Vol 2 ◽  
pp. S552-S552
Author(s):  
Boe-Hyun Kim ◽  
Jae-Il Kim ◽  
Eun-Kyoung Choi ◽  
Richard I. Carp ◽  
Yong-Sun Kim

Oncogene ◽  
1999 ◽  
Vol 18 (17) ◽  
pp. 2711-2717 ◽  
Author(s):  
Chang Hun Rhee ◽  
Kenneth Hess ◽  
James Jabbur ◽  
Maribelis Ruiz ◽  
Yu Yang ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document