O-glycosylation site prediction for Homo sapiens by combining properties and sequence features with support vector machine

Author(s):  
Yan Zhu ◽  
Shuwan Yin ◽  
Jia Zheng ◽  
Yixia Shi ◽  
Cangzhi Jia

O-glycosylation is a protein posttranslational modification important in regulating almost all cells. It is related to a large number of physiological and pathological phenomena. Recognizing O-glycosylation sites is the key to further investigating the molecular mechanism of protein posttranslational modification. This study aimed to collect a reliable dataset on Homo sapiens and develop an O-glycosylation predictor for Homo sapiens, named Captor, through multiple features. A random undersampling method and a synthetic minority oversampling technique were employed to deal with imbalanced data. In addition, the Kruskal–Wallis (K–W) test was adopted to optimize feature vectors and improve the performance of the model. A support vector machine, due to its optimal performance, was used to train and optimize the final prediction model after a comprehensive comparison of various classifiers in traditional machine learning methods and deep learning. On the independent test set, Captor outperformed the existing O-glycosylation tool, suggesting that Captor could provide more instructive guidance for further experimental research on O-glycosylation. The source code and datasets are available at https://github.com/YanZhu06/Captor/ .

2018 ◽  
Vol 12 (3) ◽  
pp. 341-347 ◽  
Author(s):  
Feng Wang ◽  
Shaojiang Liu ◽  
Weichuan Ni ◽  
Zhiming Xu ◽  
Zemin Qiu ◽  
...  

2014 ◽  
Vol 47 (9) ◽  
pp. 3158-3167 ◽  
Author(s):  
Yuan-Hai Shao ◽  
Wei-Jie Chen ◽  
Jing-Jing Zhang ◽  
Zhen Wang ◽  
Nai-Yang Deng

2020 ◽  
Vol 122 ◽  
pp. 289-307 ◽  
Author(s):  
Xinmin Tao ◽  
Qing Li ◽  
Chao Ren ◽  
Wenjie Guo ◽  
Qing He ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document