A Machine Learning Approach for Drug‐target Interaction Prediction using Wrapper Feature Selection and Class Balancing

Background: The identification of drug-target interactions is a crucial issue in drug discovery. In recent years, researchers have put great effort into drug-target interaction prediction and have developed databases, software and computational methods. Results: In this paper, we review the recent advances in machine learning-based drug-target interaction prediction. First, we briefly introduce the available datasets and summarize the drug and target features that can be extracted from different data. Since drug-drug similarity and target-target similarity are important for many machine learning prediction models, we introduce how to calculate similarities based on data or features. Different machine learning-based drug-target interaction prediction methods can be built by using different features or information. Thus, we summarize, analyze and compare different machine learning-based prediction methods. Conclusion: This study provides a guide to the development of computational methods for drug-target interaction prediction.
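
The abstract above mentions calculating drug-drug similarities from features. As one illustration, and not necessarily a measure used in the reviewed work, the Tanimoto (Jaccard) coefficient over binary fingerprints can be sketched as follows. The fingerprints, drug names and the helper names `tanimoto` and `similarity_matrix` are hypothetical, and each fingerprint is assumed to be stored as the set of its on-bit indices.

```python
from typing import Dict, List, Set


def tanimoto(a: Set[int], b: Set[int]) -> float:
    """Tanimoto (Jaccard) similarity between two sets of on-bit indices."""
    if not a and not b:
        # Convention chosen for this sketch: two empty fingerprints score 0.
        return 0.0
    shared = len(a & b)
    return shared / (len(a) + len(b) - shared)


def similarity_matrix(fps: Dict[str, Set[int]]) -> List[List[float]]:
    """Pairwise similarity matrix, rows and columns in insertion order."""
    names = list(fps)
    return [[tanimoto(fps[x], fps[y]) for y in names] for x in names]


# Toy fingerprints (hypothetical values, for illustration only).
fingerprints = {
    "drug_a": {1, 4, 7, 9},
    "drug_b": {1, 4, 8},
    "drug_c": {2, 3},
}
matrix = similarity_matrix(fingerprints)
```

The resulting matrix is symmetric with ones on the diagonal, which is the form that similarity-based prediction models typically take as input.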


Data Classification Using Feature Selection and kNN Machine Learning Approach

2015 International Conference on Computational Intelligence and Communication Networks (CICN) ◽

10.1109/cicn.2015.165 ◽

2015 ◽

Cited By ~ 12

Author(s):

Shemim Begum ◽

Debasis Chakraborty ◽

Ram Sarkar

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Data Classification ◽

Learning Approach ◽

Machine Learning Approach


Machine Learning Approach for Gesture Recognition Based on Automatic Feature Selection

Motion in Games - Lecture Notes in Computer Science ◽

10.1007/978-3-642-34710-8_34 ◽

2012 ◽

pp. 366-369

Author(s):

Xiubo Liang ◽

Franck Multon ◽

Weidong Geng

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Gesture Recognition ◽

Learning Approach ◽

Machine Learning Approach ◽

Automatic Feature Selection


Indoor Localization for IoT Using Adaptive Feature Selection: A Cascaded Machine Learning Approach

IEEE Antennas and Wireless Propagation Letters ◽

10.1109/lawp.2019.2915047 ◽

2019 ◽

Vol 18 (11) ◽

pp. 2306-2310 ◽

Cited By ~ 9

Author(s):

Mohamed Ibrahim AlHajri ◽

Nazar T. Ali ◽

Raed M. Shubair

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Indoor Localization ◽

Learning Approach ◽

Machine Learning Approach ◽

Adaptive Feature Selection


A machine learning approach for feature selection traffic classification using security analysis

The Journal of Supercomputing ◽

10.1007/s11227-018-2263-3 ◽

2018 ◽

Vol 74 (10) ◽

pp. 4867-4892 ◽

Cited By ~ 25

Author(s):

Muhammad Shafiq ◽

Xiangzhan Yu ◽

Ali Kashif Bashir ◽

Hassan Nazeer Chaudhry ◽

Dawei Wang

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Security Analysis ◽

Learning Approach ◽

Traffic Classification ◽

Machine Learning Approach


AN ANALYSIS ON FEATURE SELECTION METHODS, CLUSTERING AND CLASSIFICATION USED IN HEART DISEASE PREDICTION – A MACHINE LEARNING APPROACH

Journal of Critical Reviews ◽

10.31838/jcr.07.06.27 ◽

2020 ◽

Vol 7 (06) ◽

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Heart Disease ◽

Learning Approach ◽

Disease Prediction ◽

Selection Methods ◽

Machine Learning Approach ◽

Clustering And Classification


Machine learning prediction of oncology drug targets based on protein and network properties

10.21203/rs.2.15798/v1 ◽

2019 ◽

Author(s):

Zoltan Dezso ◽

Michele Ceccarelli

Keyword(s):

Machine Learning ◽

Clinical Trial ◽

Drug Target ◽

Drug Targets ◽

Validation Dataset ◽

Learning Approach ◽

Biological Functions ◽

Machine Learning Approach ◽

Network Properties ◽

Trial Drug

Abstract. Background: The selection and prioritization of drug targets is a central problem in drug discovery. Computational approaches can leverage the growing volume of large-scale human genomics and proteomics data for in-silico target identification, reducing the cost and time needed. Results: We developed a machine learning approach that assigns proteins a druggability score in order to identify novel targets. Our model incorporated 70 protein features, including properties derived from the sequence, features characterizing protein function, and network properties derived from the protein-protein interaction network. The advantage of this approach is that it is unbiased: even less-studied proteins with limited functional information can score well, because most of the features are independent of the accumulated literature. We built models on a training set consisting of targets with approved drugs and a negative set of non-drug targets. The machine learning techniques help to identify the most important combination of features differentiating validated targets from non-targets. We validated our predictions on an independent set of clinical trial drug targets, achieving high accuracy, with an AUC of 0.89. Our most predictive features included biological function of proteins, network centrality measures, protein essentiality, tissue specificity, localization and solvent accessibility. Our predictions, based on a small set of 102 validated oncology targets, recovered the majority of known drug targets and identified a novel set of proteins as drug target candidates. Conclusions: We developed a machine learning approach to prioritize proteins according to their similarity to approved drug targets. We have shown that the proposed method is highly predictive on a validation dataset consisting of 277 targets of clinical trial drugs, confirming that our computational approach is an efficient and cost-effective tool for drug target discovery and prioritization. Our predictions were based on oncology targets and cancer-relevant biological functions, resulting in significantly higher scores for targets of oncology clinical trial drugs than for targets of trial drugs for other indications. Our approach can be used to make indication-specific drug-target predictions by combining generic druggability features with indication-specific biological functions.
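
The abstract reports performance as an AUC on an independent validation set. As a minimal sketch of what that metric measures, and not the authors' evaluation code, the AUC can be computed as the probability that a randomly chosen positive (a target) receives a higher score than a randomly chosen negative (a non-target), with ties counted as one half. The function name `roc_auc`, the labels and the scores below are hypothetical toy values.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the fraction of positive/negative pairs ranked correctly."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half a win
    return wins / (len(pos) * len(neg))


# Toy data: 1 = known target, 0 = non-target (hypothetical scores).
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.6, 0.4, 0.7, 0.3, 0.2]
auc = roc_auc(labels, scores)  # 7 of the 9 pairs are ranked correctly
```

This pairwise form is quadratic in the number of examples; it is written for clarity, and a rank-based implementation would be preferable on large validation sets.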


Improving the Intrusion Detection using Discriminative Machine Learning Approach and Improve the Time Complexity by Data Mining Feature Selection Methods

International Journal of Computer Applications ◽

10.5120/13209-0587 ◽

2013 ◽

Vol 76 (1) ◽

pp. 5-11 ◽

Cited By ~ 14

Author(s):

Karan Bajaj ◽

Amit Arora

Keyword(s):

Machine Learning ◽

Data Mining ◽

Feature Selection ◽

Intrusion Detection ◽

Time Complexity ◽

Learning Approach ◽

Selection Methods ◽

Machine Learning Approach


Machine learning prediction of oncology drug targets based on protein and network properties

10.21203/rs.2.15798/v2 ◽

2019 ◽

Author(s):

Zoltan Dezso ◽

Michele Ceccarelli

Keyword(s):

Machine Learning ◽

Clinical Trial ◽

Drug Target ◽

Drug Targets ◽

Validation Dataset ◽

Learning Approach ◽

Biological Functions ◽

Machine Learning Approach ◽

Network Properties ◽

Trial Drug

Abstract. Background: The selection and prioritization of drug targets is a central problem in drug discovery. Computational approaches can leverage the growing volume of large-scale human genomics and proteomics data for in-silico target identification, reducing the cost and time needed. Results: We developed a machine learning approach that assigns proteins a druggability score in order to identify novel targets. Our model incorporated 70 protein features, including properties derived from the sequence, features characterizing protein function, and network properties derived from the protein-protein interaction network. The advantage of this approach is that it is unbiased: even less-studied proteins with limited functional information can score well, because most of the features are independent of the accumulated literature. We built models on a training set consisting of targets with approved drugs and a negative set of non-drug targets. The machine learning techniques help to identify the most important combination of features differentiating validated targets from non-targets. We validated our predictions on an independent set of clinical trial drug targets, achieving high accuracy, with an AUC of 0.89. Our most predictive features included biological function of proteins, network centrality measures, protein essentiality, tissue specificity, localization and solvent accessibility. Our predictions, based on a small set of 102 validated oncology targets, recovered the majority of known drug targets and identified a novel set of proteins as drug target candidates. Conclusions: We developed a machine learning approach to prioritize proteins according to their similarity to approved drug targets. We have shown that the proposed method is highly predictive on a validation dataset consisting of 277 targets of clinical trial drugs, confirming that our computational approach is an efficient and cost-effective tool for drug target discovery and prioritization. Our predictions were based on oncology targets and cancer-relevant biological functions, resulting in significantly higher scores for targets of oncology clinical trial drugs than for targets of trial drugs for other indications. Our approach can be used to make indication-specific drug-target predictions by combining generic druggability features with indication-specific biological functions.
