OPUS-TASS: a protein backbone torsion angles and secondary structure predictor based on ensemble neural networks

Gang Xu; Qinghua Wang; Jianpeng Ma

doi:10.1093/bioinformatics/btaa629

OPUS-TASS: a protein backbone torsion angles and secondary structure predictor based on ensemble neural networks

Bioinformatics ◽

10.1093/bioinformatics/btaa629 ◽

2020 ◽

Vol 36 (20) ◽

pp. 5021-5026 ◽

Cited By ~ 3

Author(s):

Gang Xu ◽

Qinghua Wang ◽

Jianpeng Ma

Keyword(s):

Neural Networks ◽

Secondary Structure ◽

Structure Prediction ◽

Secondary Structure Prediction ◽

Supplementary Information ◽

Learning Approaches ◽

Protein Backbone ◽

Torsion Angles ◽

Secondary Structure Predictions ◽

The Mean

Abstract Motivation Predictions of protein backbone torsion angles (ϕ and ψ) and secondary structure from sequence are crucial subproblems in protein structure prediction. With the development of deep learning approaches, their accuracies have been significantly improved. To capture the long-range interactions, most studies integrate bidirectional recurrent neural networks into their models. In this study, we introduce and modify a recently proposed architecture named Transformer to capture the interactions between the two residues theoretically with arbitrary distance. Moreover, we take advantage of multitask learning to improve the generalization of neural network by introducing related tasks into the training process. Similar to many previous studies, OPUS-TASS uses an ensemble of models and achieves better results. Results OPUS-TASS uses the same training and validation sets as SPOT-1D. We compare the performance of OPUS-TASS and SPOT-1D on TEST2016 (1213 proteins) and TEST2018 (250 proteins) proposed in the SPOT-1D paper, CASP12 (55 proteins), CASP13 (32 proteins) and CASP-FM (56 proteins) proposed in the SAINT paper, and a recently released PDB structure collection from CAMEO (93 proteins) named as CAMEO93. On these six test sets, OPUS-TASS achieves consistent improvements in both backbone torsion angles prediction and secondary structure prediction. On CAMEO93, SPOT-1D achieves the mean absolute errors of 16.89 and 23.02 for ϕ and ψ predictions, respectively, and the accuracies for 3- and 8-state secondary structure predictions are 87.72 and 77.15%, respectively. In comparison, OPUS-TASS achieves 16.56 and 22.56 for ϕ and ψ predictions, and 89.06 and 78.87% for 3- and 8-state secondary structure predictions, respectively. In particular, after using our torsion angles refinement method OPUS-Refine as the post-processing procedure for OPUS-TASS, the mean absolute errors for final ϕ and ψ predictions are further decreased to 16.28 and 21.98, respectively. Availability and implementation The training and the inference codes of OPUS-TASS and its data are available at https://github.com/thuxugang/opus_tass. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

A Combination of Support Vector Machines and Bidirectional Recurrent Neural Networks for Protein Secondary Structure Prediction

AI*IA 2003: Advances in Artificial Intelligence - Lecture Notes in Computer Science ◽

10.1007/978-3-540-39853-0_12 ◽

2003 ◽

pp. 142-153 ◽

Cited By ~ 4

Author(s):

Alessio Ceroni ◽

Paolo Frasconi ◽

Andrea Passerini ◽

Alessandro Vullo

Keyword(s):

Neural Networks ◽

Support Vector Machines ◽

Secondary Structure ◽

Recurrent Neural Networks ◽

Structure Prediction ◽

Secondary Structure Prediction ◽

Protein Secondary Structure ◽

Support Vector ◽

Protein Secondary Structure Prediction ◽

Vector Machines

Download Full-text

Protein secondary structure prediction using three neural networks and a segmental semi Markov model

Mathematical Biosciences ◽

10.1016/j.mbs.2008.11.001 ◽

2009 ◽

Vol 217 (2) ◽

pp. 145-150 ◽

Cited By ~ 21

Author(s):

Seyed Amir Malekpour ◽

Sima Naghizadeh ◽

Hamid Pezeshk ◽

Mehdi Sadeghi ◽

Changiz Eslahchi

Keyword(s):

Neural Networks ◽

Secondary Structure ◽

Markov Model ◽

Structure Prediction ◽

Secondary Structure Prediction ◽

Protein Secondary Structure ◽

Protein Secondary Structure Prediction

Download Full-text

Combining Deep Neural Networks for Protein Secondary Structure Prediction

IEEE Access ◽

10.1109/access.2020.2992084 ◽

2020 ◽

Vol 8 ◽

pp. 84362-84370

Author(s):

Shusen Zhou ◽

Hailin Zou ◽

Chanjuan Liu ◽

Mujun Zang ◽

Tong Liu

Keyword(s):

Neural Networks ◽

Secondary Structure ◽

Structure Prediction ◽

Deep Neural Networks ◽

Secondary Structure Prediction ◽

Protein Secondary Structure ◽

Protein Secondary Structure Prediction

Download Full-text

A simple and fast secondary structure prediction method using hidden neural networks

Bioinformatics ◽

10.1093/bioinformatics/bth487 ◽

2004 ◽

Vol 21 (2) ◽

pp. 152-159 ◽

Cited By ~ 201

Author(s):

K. Lin ◽

V. A. Simossis ◽

W. R. Taylor ◽

J. Heringa

Keyword(s):

Neural Networks ◽

Secondary Structure ◽

Structure Prediction ◽

Secondary Structure Prediction ◽

Prediction Method ◽

Structure Prediction Method ◽

Secondary Structure Prediction Method

Download Full-text

QuanTest2: benchmarking multiple sequence alignments using secondary structure prediction

Bioinformatics ◽

10.1093/bioinformatics/btz552 ◽

2019 ◽

Cited By ~ 3

Author(s):

Fabian Sievers ◽

Desmond G Higgins

Keyword(s):

Secondary Structure ◽

Structure Prediction ◽

Secondary Structure Prediction ◽

Reference Sequence ◽

Supplementary Information ◽

Sequence Alignments ◽

Multiple Sequence ◽

Multiple Sequence Alignments ◽

Reference Sequences ◽

Selection Of

Abstract Motivation Secondary structure prediction accuracy (SSPA) in the QuanTest benchmark can be used to measure accuracy of a multiple sequence alignment. SSPA correlates well with the sum-of-pairs score, if the results are averaged over many alignments but not on an alignment-by-alignment basis. This is due to a sub-optimal selection of reference and non-reference sequences in QuanTest. Results We develop an improved strategy for selecting reference and non-reference sequences for a new benchmark, QuanTest2. In QuanTest2, SSPA and SP correlate better on an alignment-by-alignment basis than in QuanTest. Guide-trees for QuanTest2 are more balanced with respect to reference sequences than in QuanTest. QuanTest2 scores correlate well with other well-established benchmarks. Availability and implementation QuanTest2 is available at http://bioinf.ucd.ie/quantest2.tar, comprises of reference and non-reference sequence sets and a scoring script. Supplementary information Supplementary data are available at Bioinformatics online

Download Full-text

The importance of larger data sets for protein secondary structure prediction with neural networks

Protein Science ◽

10.1002/pro.5560050422 ◽

1996 ◽

Vol 5 (4) ◽

pp. 768-774 ◽

Cited By ~ 32

Author(s):

John-Marc Chandonia ◽

Martin Karplus

Keyword(s):

Neural Networks ◽

Secondary Structure ◽

Structure Prediction ◽

Secondary Structure Prediction ◽

Protein Secondary Structure ◽

Data Sets ◽

Protein Secondary Structure Prediction

Download Full-text

Protein secondary structure prediction using neural networks

10.1117/12.541411 ◽

2004 ◽

Author(s):

Preeti Singh ◽

Yan-Qing Zhang

Keyword(s):

Neural Networks ◽

Secondary Structure ◽

Structure Prediction ◽

Secondary Structure Prediction ◽

Protein Secondary Structure ◽

Protein Secondary Structure Prediction

Download Full-text

Feed-forward neural networks for secondary structure prediction

Journal of Molecular Graphics ◽

10.1016/0263-7855(95)00016-y ◽

1995 ◽

Vol 13 (3) ◽

pp. 175-183 ◽

Cited By ~ 3

Author(s):

T.W. Barlow

Keyword(s):

Neural Networks ◽

Secondary Structure ◽

Structure Prediction ◽

Secondary Structure Prediction ◽

Feed Forward ◽

Feed Forward Neural Networks

Download Full-text

Protein secondary structure prediction using modular reciprocal bidirectional recurrent neural networks

Computer Methods and Programs in Biomedicine ◽

10.1016/j.cmpb.2010.04.005 ◽

2010 ◽

Vol 100 (3) ◽

pp. 237-247 ◽

Cited By ~ 16

Author(s):

Sepideh Babaei ◽

Amir Geranmayeh ◽

Seyyed Ali Seyyedsalehi

Keyword(s):

Neural Networks ◽

Secondary Structure ◽

Recurrent Neural Networks ◽

Structure Prediction ◽

Secondary Structure Prediction ◽

Protein Secondary Structure ◽

Protein Secondary Structure Prediction

Download Full-text

PROTEIN SECONDARY STRUCTURE PREDICTION METHODS BASED ONRBF NEURAL NETWORKS

Computational Methods ◽

10.1007/978-1-4020-3953-9_4 ◽

2007 ◽

pp. 1037-1043 ◽

Cited By ~ 1

Author(s):

N. Jing ◽

B. Xia ◽

C.G. Zhou ◽

Y. Wang

Keyword(s):

Neural Networks ◽

Secondary Structure ◽

Structure Prediction ◽

Secondary Structure Prediction ◽

Protein Secondary Structure ◽

Prediction Methods ◽

Protein Secondary Structure Prediction

Download Full-text