Ranked MSD: A New Feature Ranking and Feature Selection Approach for Biomarker Identification

Author(s):  
Ghanshyam Verma ◽  
Alokkumar Jha ◽  
Dietrich Rebholz-Schuhmann ◽  
Michael G. Madden
2020 ◽  
Vol 21 (21) ◽  
pp. 7891
Author(s):  
Chi-Wei Chen ◽  
Lan-Ying Huang ◽  
Chia-Feng Liao ◽  
Kai-Po Chang ◽  
Yen-Wei Chu

Protein phosphorylation is one of the most important post-translational modifications, and many biological processes are related to phosphorylation, such as DNA repair, transcriptional regulation and signal transduction and, therefore, abnormal regulation of phosphorylation usually causes diseases. If we can accurately predict human phosphorylation sites, this could help to solve human diseases. Therefore, we developed a kinase-specific phosphorylation prediction system, GasPhos, and proposed a new feature selection approach, called Gas, based on the ant colony system and a genetic algorithm and used performance evaluation strategies focused on different kinases to choose the best learning model. Gas uses the mean decrease Gini index (MDGI) as a heuristic value for path selection and adopts binary transformation strategies and new state transition rules. GasPhos can predict phosphorylation sites for six kinases and showed better performance than other phosphorylation prediction tools. The disease-related phosphorylated proteins that were predicted with GasPhos are also discussed. Finally, Gas can be applied to other issues that require feature selection, which could help to improve prediction performance.


Author(s):  
Dzi Lam Tran Tuan ◽  
Thongchai Surinwarangkoon ◽  
Kittikhun Meethongjan ◽  
Vinh Truong Hoang

2009 ◽  
Vol 29 (7) ◽  
pp. 1755-1757
Author(s):  
Zhong-yang XIONG ◽  
Jian JIANG ◽  
Yu-fang ZHANG

Author(s):  
Lungan Zhang ◽  
Liangxiao Jiang ◽  
Chaoqun Li

Handling text data is a challenge for machine learning because text data is high dimensional in many cases. Feature selection has been approved to be an effective approach to handle high-dimensional data. Feature selection approaches can be broadly divided into two categories: filter approaches and wrapper approaches. Generally, wrapper approaches have superior accuracy compared to filters, but filters always run faster than wrapper approaches. In order to integrate the advantages of filter approaches and wrapper approaches, we propose a gain ratio-based hybrid feature selection approach to naive Bayes text classifiers. The hybrid feature selection approach uses base classifiers to evaluate feature subsets like wrapper approaches, but it need not repeatedly search feature subsets and build base classifiers. The experimental results on large suite of benchmark text datasets show that the proposed hybrid feature selection approach significantly improves the classification accuracy of the original naive Bayes text classifiers while does not incur the high time complexity that characterizes wrapper approaches.


Sign in / Sign up

Export Citation Format

Share Document