Revealing new therapeutic opportunities through drug target prediction: a class imbalance-tolerant machine learning approach

Siqi Liang; Haiyuan Yu

doi:10.1093/bioinformatics/btaa495

Revealing new therapeutic opportunities through drug target prediction: a class imbalance-tolerant machine learning approach

Bioinformatics ◽

10.1093/bioinformatics/btaa495 ◽

2020 ◽

Vol 36 (16) ◽

pp. 4490-4497

Author(s):

Siqi Liang ◽

Haiyuan Yu

Keyword(s):

Machine Learning ◽

Drug Target ◽

Drug Targets ◽

Class Imbalance ◽

Target Prediction ◽

Drug Repurposing ◽

New Drugs ◽

Supplementary Information ◽

Training Scheme ◽

Drug Target Prediction

Abstract Motivation In silico drug target prediction provides valuable information for drug repurposing, understanding of side effects as well as expansion of the druggable genome. In particular, discovery of actionable drug targets is critical to developing targeted therapies for diseases. Results Here, we develop a robust method for drug target prediction by leveraging a class imbalance-tolerant machine learning framework with a novel training scheme. We incorporate novel features, including drug–gene phenotype similarity and gene expression profile similarity that capture information orthogonal to other features. We show that our classifier achieves robust performance and is able to predict gene targets for new drugs as well as drugs that potentially target unexplored genes. By providing newly predicted drug–target associations, we uncover novel opportunities of drug repurposing that may benefit cancer treatment through action on either known drug targets or currently undrugged genes. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

Revealing new therapeutic opportunities through drug target prediction via class imbalance-tolerant machine learning

10.1101/572420 ◽

2019 ◽

Author(s):

Siqi Liang ◽

Haiyuan Yu

Keyword(s):

Machine Learning ◽

Drug Target ◽

Drug Targets ◽

Class Imbalance ◽

Target Prediction ◽

Drug Repurposing ◽

New Drugs ◽

Learning Framework ◽

Training Scheme ◽

Drug Target Prediction

AbstractIn silicodrug target prediction provides valuable information for drug repurposing, understanding of side effects as well as expansion of the druggable genome. In particular, discovery of actionable drug targets is critical to developing targeted therapies for diseases. Here, we develop a robust method for drug target prediction by leveraging a class imbalance-tolerant machine learning framework with a novel training scheme. We incorporate novel features, including drug-gene phenotype similarity and gene expression profile similarity, that capture information orthogonal to other features. We show that our classifier achieves robust performance and is able to predict gene targets for new drugs as well as drugs that target unexplored genes. By providing newly predicted drug-target associations, we uncover novel opportunities of drug repurposing that may benefit cancer treatment through action on either known drug targets or currently undrugged genes.

Download Full-text

Drug Target Prediction Based on the Herbs Components: The Study on the Multitargets Pharmacological Mechanism of Qishenkeli Acting on the Coronary Heart Disease

Evidence-based Complementary and Alternative Medicine ◽

10.1155/2012/698531 ◽

2012 ◽

Vol 2012 ◽

pp. 1-10 ◽

Cited By ~ 20

Author(s):

Yong Wang ◽

Zhongyang Liu ◽

Chun Li ◽

Dong Li ◽

Yulin Ouyang ◽

...

Keyword(s):

Coronary Heart Disease ◽

Heart Disease ◽

Angiotensin Ii ◽

Drug Target ◽

Drug Targets ◽

Target Prediction ◽

Coronary Artery Ligation ◽

Potential Drug ◽

Drug Target Prediction ◽

Potential Drug Targets

In this paper, we present a case study of Qishenkeli (QSKL) to research TCM’s underlying molecular mechanism, based on drug target prediction and analyses of TCM chemical components and following experimental validation. First, after determining the compositive compounds of QSKL, we use drugCIPHER-CS to predict their potential drug targets. These potential targets are significantly enriched with known cardiovascular disease-related drug targets. Then we find these potential drug targets are significantly enriched in the biological processes of neuroactive ligand-receptor interaction, aminoacyl-tRNA biosynthesis, calcium signaling pathway, glycine, serine and threonine metabolism, and renin-angiotensin system (RAAS), and so on. Then, animal model of coronary heart disease (CHD) induced by left anterior descending coronary artery ligation is applied to validate predicted pathway. RAAS pathway is selected as an example, and the results show that QSKL has effect on both rennin and angiotensin II receptor (AT1R), which eventually down regulates the angiotensin II (AngII). Bioinformatics combing with experiment verification can provide a credible and objective method to understand the complicated multitargets mechanism for Chinese herbal formula.

Download Full-text

Interpretable Drug Target Predictions using Self-Expressiveness

10.1101/2021.03.01.433365 ◽

2021 ◽

Author(s):

Diego Galeano ◽

Santiago Noto ◽

Ruben Jimenez ◽

Alberto Paccanaro

Keyword(s):

Drug Target ◽

Drug Targets ◽

Prediction Models ◽

Matrix Completion ◽

Closed Form Solution ◽

Target Prediction ◽

Form Solution ◽

Protein Targets ◽

Drug Target Prediction ◽

Similarity Matrices

AbstractThe identification of missing drug targets is critical for the development of treatments and for the molecular elucidation of drug side effects. Drug targets have been predicted by exploiting molecular, biological or pharmacological features of drugs and protein targets. Yet, developing integrative and interpretable machine learning models for predicting drug targets remains a challenging task. We present Inception, an integrative and interpretable matrix completion model for predicting drug targets. Inception is a self-expressive model that learns two similarity matrices: one for drugs and another for protein targets. These learned similarity matrices are key for our models’ interpretability: they can explain how a predicted drug-target interaction can be explain in terms of a linear combination of chemical, biological and pharmacological similarities. We develop a novel objective function with efficient closed-form solution. To demonstrate the ability of Inception at recovering missing drug-target interactions (DTIs), we perform cross-validation experiments with stringent controls of data imbalance, chemical similarities between drugs and sequence similarities between targets. We also assess the performance of our model using a simulated prospective approach. Having trained our model with DTIs from a snapshot 2011 of the DrugBank database, we test whether we could predict DTIs from a 2020 snapshot of DrugBank. Inception outperforms two state-of-the-art drug target prediction models in all the scenarios. This suggests that Inception could be useful for predicting missing drug target interactions while providing interpretable predictions.

Download Full-text

Large-scale comparison of machine learning methods for drug target prediction on ChEMBL

Chemical Science ◽

10.1039/c8sc00148k ◽

2018 ◽

Vol 9 (24) ◽

pp. 5441-5451 ◽

Cited By ~ 109

Author(s):

Andreas Mayr ◽

Günter Klambauer ◽

Thomas Unterthiner ◽

Marvin Steijaert ◽

Jörg K. Wegner ◽

...

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Comparative Study ◽

Drug Target ◽

Large Scale ◽

State Of The Art ◽

Target Prediction ◽

Prediction Methods ◽

Machine Learning Methods ◽

Drug Target Prediction

The to date largest comparative study of nine state-of-the-art drug target prediction methods finds that deep learning outperforms all other competitors. The results are based on a benchmark of 1300 assays and half a million compounds.

Download Full-text

DTI-Voodoo: machine learning over interaction networks and ontology-based background knowledge predicts drug–target interactions

10.1101/2021.04.28.441733 ◽

2021 ◽

Author(s):

Tilman Hinnerichs ◽

Robert Hoehndorf

Keyword(s):

Drug Target ◽

Drug Targets ◽

Interaction Network ◽

Drug Repurposing ◽

Computational Method ◽

Interaction Networks ◽

Supplementary Information ◽

Prediction Methods ◽

Link Type ◽

Molecular Features

AbstractMotivationIn silico drug–target interaction (DTI) prediction is important for drug discovery and drug repurposing. Approaches to predict DTIs can proceed indirectly, top-down, using phenotypic effects of drugs to identify potential drug targets, or they can be direct, bottom-up and use molecular information to directly predict binding potentials. Both approaches can be combined with information about interaction networks.ResultsWe developed DTI-Voodoo as a computational method that combines molecular features and ontology-encoded phenotypic effects of drugs with protein–protein interaction networks, and uses a graph convolutional neural network to predict DTIs. We demonstrate that drug effect features can exploit information in the interaction network whereas molecular features do not. DTI-Voodoo is designed to predict candidate drugs for a given protein; we use this formulation to show that common DTI datasets contain intrinsic biases with major affects on performance evaluation and comparison of DTI prediction methods. Using a modified evaluation scheme, we demonstrate that DTI-Voodoo improves significantly over state of the art DTI prediction methods.AvailabilityDTI-Voodoo source code and data necessary to reproduce results are freely available at https://github.com/THinnerichs/DTI-VOODOO.Supplementary informationSupplementary data are available at https://github.com/ THinnerichs/DTI-VOODOO.

Download Full-text

Breaking the paradigm: Dr Insight empowers signature-free, enhanced drug repurposing

Bioinformatics ◽

10.1093/bioinformatics/btz006 ◽

2019 ◽

Vol 35 (16) ◽

pp. 2818-2826 ◽

Cited By ~ 9

Author(s):

Jinyan Chan ◽

Xuan Wang ◽

Jacob A Turner ◽

Nicole E Baldwin ◽

Jinghua Gu

Keyword(s):

Drug Target ◽

Drug Targets ◽

Effective Means ◽

Drug Repurposing ◽

Superior Performance ◽

Supplementary Information ◽

Breast Cancer Dataset ◽

Specific Drug ◽

Cancer Dataset ◽

Disease Specific

Abstract Motivation Transcriptome-based computational drug repurposing has attracted considerable interest by bringing about faster and more cost-effective drug discovery. Nevertheless, key limitations of the current drug connectivity-mapping paradigm have been long overlooked, including the lack of effective means to determine optimal query gene signatures. Results The novel approach Dr Insight implements a frame-breaking statistical model for the ‘hand-shake’ between disease and drug data. The genome-wide screening of concordantly expressed genes (CEGs) eliminates the need for subjective selection of query signatures, added to eliciting better proxy for potential disease-specific drug targets. Extensive comparisons on simulated and real cancer datasets have validated the superior performance of Dr Insight over several popular drug-repurposing methods to detect known cancer drugs and drug–target interactions. A proof-of-concept trial using the TCGA breast cancer dataset demonstrates the application of Dr Insight for a comprehensive analysis, from redirection of drug therapies, to a systematic construction of disease-specific drug-target networks. Availability and implementation Dr Insight R package is available at https://cran.r-project.org/web/packages/DrInsight/index.html. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

TarDict: A RandomForestClassifier based software predicts drug-target interaction using SMILES

10.36462/h.bioinfo.202101 ◽

2021 ◽

pp. bi202101

Author(s):

Peter Habib ◽

Alsamman Alsamman ◽

Sameh Hassanein ◽

Aladdin Hamwieh

Keyword(s):

Machine Learning ◽

Computational Model ◽

Drug Target ◽

Drug Targets ◽

Cost Effective ◽

New Drugs ◽

Drug Target Discovery ◽

Target Proteins ◽

Biological Targets ◽

Testing Dataset

The future of therapeutics depends on understanding the interaction between the chemical structure of the drug and the target protein that contributes to the etiology of the disease in order to improve drug discovery. Predicting the target of unknown drugs being investigated from already identified drug data is very important not only for understanding different processes of drug and molecular interactions but also for the development of new drugs. Using machine learning and published drug information we design an easy-to-use tool that predicts biological target proteins for medical drugs. TarDict is based on a chemical-simplified line-entry molecular input system called SMILES. It receives SMILES entries and returns a list of possible similar drugs as well as possible drug-targets. TarDict uses 20442 drug entries that have well-known biological targets to construct a prognostic computational model capable of predicting novel drug targets with an accuracy of 95%. We developed a machine learning approach to recommend target proteins to approved drug targets. We have shown that the proposed method is highly predictive on a testing dataset consisting of 4088 targets and 102 manually entered drugs. The proposed computational model is an efficient and cost-effective tool for drug target discovery and prioritization. Such novel tool could be used to enhance drug design, predict potential target and identify combination therapy crossroads.

Download Full-text

Faculty Opinions recommendation of Drug target prediction and repositioning using an integrated network-based approach.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.718011274.793509126 ◽

2015 ◽

Author(s):

Jürgen Bajorath

Keyword(s):

Drug Target ◽

Target Prediction ◽

Integrated Network ◽

Drug Target Prediction

Download Full-text

Synteny Approach of Drug Target Prediction among Unique Hypothetical Proteins of Streptococcus Gordonii Causing Infective Endocarditis

Science Technology and Arts Research Journal ◽

10.4314/star.v2i4.7 ◽

2014 ◽

Vol 2 (4) ◽

pp. 34

Author(s):

S Telkar ◽

HSS Kumar ◽

R Mahmood

Keyword(s):

Infective Endocarditis ◽

Drug Target ◽

Target Prediction ◽

Hypothetical Proteins ◽

Streptococcus Gordonii ◽

Drug Target Prediction

Download Full-text

Drug–target prediction utilizing heterogeneous bio-linked network embeddings

Briefings in Bioinformatics ◽

10.1093/bib/bbz147 ◽

2019 ◽

Cited By ~ 1

Author(s):

Nansu Zong ◽

Rachael Sze Nga Wong ◽

Yue Yu ◽

Andrew Wen ◽

Ming Huang ◽

...

Keyword(s):

Drug Target ◽

Target Prediction ◽

Machine Learning Algorithms ◽

Association Mining ◽

Drug Target Prediction ◽

Specific Prediction ◽

Series Of Experiments ◽

Inference Methods ◽

Novel Drug ◽

Prediction Strategy

Abstract To enable modularization for network-based prediction, we conducted a review of known methods conducting the various subtasks corresponding to the creation of a drug–target prediction framework and associated benchmarking to determine the highest-performing approaches. Accordingly, our contributions are as follows: (i) from a network perspective, we benchmarked the association-mining performance of 32 distinct subnetwork permutations, arranging based on a comprehensive heterogeneous biomedical network derived from 12 repositories; (ii) from a methodological perspective, we identified the best prediction strategy based on a review of combinations of the components with off-the-shelf classification, inference methods and graph embedding methods. Our benchmarking strategy consisted of two series of experiments, totaling six distinct tasks from the two perspectives, to determine the best prediction. We demonstrated that the proposed method outperformed the existing network-based methods as well as how combinatorial networks and methodologies can influence the prediction. In addition, we conducted disease-specific prediction tasks for 20 distinct diseases and showed the reliability of the strategy in predicting 75 novel drug–target associations as shown by a validation utilizing DrugBank 5.1.0. In particular, we revealed a connection of the network topology with the biological explanations for predicting the diseases, ‘Asthma’ ‘Hypertension’, and ‘Dementia’. The results of our benchmarking produced knowledge on a network-based prediction framework with the modularization of the feature selection and association prediction, which can be easily adapted and extended to other feature sources or machine learning algorithms as well as a performed baseline to comprehensively evaluate the utility of incorporating varying data sources.

Download Full-text