A comparison of two machine learning methods for protein secondary structure prediction

Protein secondary structure prediction (SSP) has a variety of applications; however, its accuracy has improved relatively little for years. Aiming to move all related fields forward, we sought a fundamental advance in SSP. Many admirable efforts have been made to improve the machine learning algorithms used for SSP; this work instead took a step back and manipulated the input features. We propose a secondary structure element-based position-specific scoring matrix (SSE-PSSM), from which a new set of machine learning features can be derived. The feasibility of this new PSSM was evaluated by rigorous independent tests in which the training and testing datasets shared <25% sequence identity. In all experiments, the proposed PSSM outperformed the traditional amino acid PSSM. The new PSSM can be easily combined with the amino acid PSSM, and the resulting improvement in accuracy was remarkable. Preliminary tests combining the SSE-PSSM with well-known SSP methods showed average improvements of 2.0% and 5.2% in three- and eight-state SSP accuracy, respectively. If this PSSM can be integrated into state-of-the-art SSP methods, the overall accuracy of SSP may break through its current limit and eventually benefit all research and applications in which secondary structure prediction plays a vital role. To facilitate the application and integration of the SSE-PSSM with modern SSP methods, we have established a web server and standalone programs for generating the SSE-PSSM, available at http://10.life.nctu.edu.tw/SSE-PSSM.
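The abstract says the SSE-PSSM can be combined with the amino acid PSSM as input features but does not say how. A minimal sketch of one plausible approach follows: concatenate the two matrices per residue, then build sliding-window feature vectors. The function name `window_features`, the window size, the zero padding at the termini, and the number of SSE-PSSM columns are all illustrative assumptions, not details taken from the paper.

```python
def window_features(aa_pssm, sse_pssm, window=15):
    """Build sliding-window features from two per-residue score matrices.

    aa_pssm:  list of rows, one per residue (e.g. 20 amino acid scores).
    sse_pssm: list of rows, one per residue (column count is an assumption).
    Returns one flat feature vector per residue; positions that fall
    outside the sequence are filled with zeros.
    """
    if len(aa_pssm) != len(sse_pssm):
        raise ValueError("both matrices must cover the same residues")
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd number")

    # Per-residue concatenation of the two matrices.
    combined = [list(a) + list(s) for a, s in zip(aa_pssm, sse_pssm)]
    if not combined:
        return []

    width = len(combined[0])
    half = window // 2
    padding = [[0.0] * width for _ in range(half)]
    padded = padding + combined + padding

    features = []
    for i in range(len(combined)):
        vector = []
        # Rows i .. i+window-1 of the padded matrix are centred on residue i.
        for row in padded[i:i + window]:
            vector.extend(row)
        features.append(vector)
    return features
```

Each returned vector has `window * (aa_columns + sse_columns)` entries and could be passed to any per-residue classifier.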


Review of Advances in Machine Learning Based Protein Secondary Structure Prediction

2019 15th International Conference on Electronics, Computer and Computation (ICECCO) ◽ 10.1109/icecco48375.2019.9043234 ◽ 2019

Author(s): Muhammad Yusuf Muhammad ◽ Rajesh Prasad ◽ Mathias Fonkam ◽ Hadiza Ali Umar

Keyword(s): Machine Learning ◽ Secondary Structure ◽ Structure Prediction ◽ Secondary Structure Prediction ◽ Protein Secondary Structure ◽ Protein Secondary Structure Prediction


The Future of Protein Secondary Structure Prediction Was Invented by Oleg Ptitsyn

Biomolecules ◽ 10.3390/biom10060910 ◽ 2020 ◽ Vol 10 (6) ◽ pp. 910

Author(s): Daniel Rademaker ◽ Jarek van Dijk ◽ Willem Titulaer ◽ Joanna Lange ◽ Gert Vriend ◽ ...

Keyword(s): Secondary Structure ◽ Structure Prediction ◽ Secondary Structure Prediction ◽ Protein Structures ◽ Protein Secondary Structure ◽ Prediction Method ◽ Research Field ◽ Protein Secondary Structure Prediction ◽ Learning Methods ◽ Average Percentage

When Oleg Ptitsyn and his group published the first secondary structure prediction for a protein sequence, they started a research field that is still active today. Oleg Ptitsyn combined fundamental rules of physics with human understanding of protein structures. Most followers in this field, however, use machine learning methods and aim at the highest (average) percentage of correctly predicted residues in a set of proteins that were not used to train the prediction method. We show that a single method is unlikely to predict the secondary structure of all protein sequences, with the exception, perhaps, of future deep learning methods based on very large neural networks, and we suggest that some concepts pioneered by Oleg Ptitsyn and his group in the 1970s are likely today's best way forward in the protein secondary structure prediction field.
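The "(average) percentage of correctly predicted residues" mentioned above can be illustrated with a short sketch. It assumes predictions and observed structures are given as equal-length strings of state letters (for example H, E, C) and that the average is taken per protein; the function names are hypothetical, and the abstract does not state which averaging convention is used.

```python
def percent_correct(predicted, observed):
    """Percentage of residues whose predicted state matches the observed one."""
    if len(predicted) != len(observed):
        raise ValueError("prediction and observation must have equal length")
    if not observed:
        raise ValueError("empty sequence")
    matches = sum(p == o for p, o in zip(predicted, observed))
    return 100.0 * matches / len(observed)


def average_percent_correct(pairs):
    """Return (mean score, per-protein scores) for (predicted, observed) pairs.

    Assumption: each protein contributes equally to the mean,
    regardless of its length.
    """
    scores = [percent_correct(p, o) for p, o in pairs]
    if not scores:
        raise ValueError("no proteins given")
    return sum(scores) / len(scores), scores
```

Returning the per-protein scores alongside the mean makes it easy to see how widely accuracy varies between proteins, which a single average hides.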
