FoldHSphere: deep hyperspherical embeddings for protein fold recognition

Abstract

Background: Current state-of-the-art deep learning approaches for protein fold recognition learn protein embeddings that improve prediction performance at the fold level. However, there is still a performance gap between the fold level and the (relatively easier) family level, suggesting that it might be possible to learn an embedding space that better represents the protein folds.

Results: In this paper, we propose the FoldHSphere method to learn a better fold embedding space through a two-stage training procedure. We first obtain prototype vectors for each fold class that are maximally separated in hyperspherical space. We then train a neural network by minimizing the angular large margin cosine loss to learn protein embeddings clustered around the corresponding hyperspherical fold prototypes. Our network architectures, ResCNN-GRU and ResCNN-BGRU, process the input protein sequences by applying several residual-convolutional blocks followed by a gated recurrent unit-based recurrent layer. Evaluation results on the LINDAHL dataset indicate that the use of our hyperspherical embeddings effectively bridges the performance gap between the family and fold levels. Furthermore, our FoldHSpherePro ensemble method yields an accuracy of 81.3% at the fold level, outperforming all the state-of-the-art methods.

Conclusions: Our methodology is efficient in learning discriminative and fold-representative embeddings for the protein domains. The proposed hyperspherical embeddings are effective at identifying the protein fold class by pairwise comparison, even when amino acid sequence similarities are low.
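The second training stage described above, pulling embeddings toward fixed fold prototypes with a large margin cosine loss, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the `scale` and `margin` values, and the toy orthogonal prototypes are assumptions chosen for illustration, and the loss follows the commonly used large margin cosine formulation, in which the margin is subtracted from the cosine of the true class before a scaled softmax.

```python
import numpy as np


def l2_normalize(x, axis=-1, eps=1e-12):
    """Project vectors onto the unit hypersphere."""
    norm = np.linalg.norm(x, axis=axis, keepdims=True)
    return x / np.maximum(norm, eps)


def lmcl_loss(embeddings, prototypes, labels, scale=30.0, margin=0.35):
    """Large margin cosine loss against fixed class prototypes.

    embeddings: (batch, dim) array of network outputs
    prototypes: (classes, dim) array, one fixed vector per class
    labels:     (batch,) integer class indices
    scale, margin: illustrative values (assumptions, not from the paper)
    """
    labels = np.asarray(labels)
    cos = l2_normalize(embeddings) @ l2_normalize(prototypes).T
    logits = scale * cos
    rows = np.arange(labels.shape[0])
    # Subtract the margin only from the cosine of the true class.
    logits[rows, labels] = scale * (cos[rows, labels] - margin)
    # Numerically stable log-softmax.
    logits = logits - logits.max(axis=1, keepdims=True)
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return float(-log_prob[rows, labels].mean())


# Toy setup: three classes with orthogonal prototypes (an assumption;
# the paper optimizes prototypes for maximal separation instead).
prototypes = np.eye(3)
embeddings = np.eye(3)  # each embedding sits exactly on one prototype

loss_aligned = lmcl_loss(embeddings, prototypes, [0, 1, 2])
loss_misaligned = lmcl_loss(embeddings, prototypes, [1, 2, 0])
loss_no_margin = lmcl_loss(embeddings, prototypes, [0, 1, 2], margin=0.0)
```

In this toy setup the loss is near zero when each embedding coincides with its own prototype and large when it coincides with another class's prototype. Setting the margin to zero reduces the loss to a scaled cosine softmax, so the margin is what forces a tighter angular gap around each prototype.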


ASFold-DNN: Protein Fold Recognition based on Evolutionary Features with Variable Parameters using Full Connected Neural Network

IEEE/ACM Transactions on Computational Biology and Bioinformatics ◽

10.1109/tcbb.2021.3089168 ◽

2021 ◽

pp. 1-1

Author(s):

Xinyi Qin ◽

Lu Zhang ◽

in Liu ◽

Ziwei Xu ◽

Guangzhong Liu

Keyword(s):

Neural Network ◽

Fold Recognition ◽

Protein Fold ◽

Variable Parameters ◽

Protein Fold Recognition ◽

Evolutionary Features


A machine learning information retrieval approach to protein fold recognition

Bioinformatics ◽

10.1093/bioinformatics/btl102 ◽

2006 ◽

Vol 22 (12) ◽

pp. 1456-1463 ◽

Cited By ~ 122

Author(s):

J. Cheng ◽

P. Baldi

Keyword(s):

Machine Learning ◽

Information Retrieval ◽

Fold Recognition ◽

Protein Fold ◽

Protein Fold Recognition


Protein fold recognition using geometric kernel data fusion

Bioinformatics ◽

10.1093/bioinformatics/btu118 ◽

2014 ◽

Vol 30 (13) ◽

pp. 1850-1857 ◽

Cited By ~ 20

Author(s):

Pooya Zakeri ◽

Ben Jeuris ◽

Raf Vandebril ◽

Yves Moreau

Keyword(s):

Data Fusion ◽

Fold Recognition ◽

Protein Fold ◽

Protein Fold Recognition


Application of Classifier Fusion for Protein Fold Recognition

2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery ◽

10.1109/fskd.2009.840 ◽

2009 ◽

Cited By ~ 4

Author(s):

Sahar Jazebi ◽

Amir Tohidi ◽

Masoud Rahgozar

Keyword(s):

Fold Recognition ◽

Classifier Fusion ◽

Protein Fold ◽

Protein Fold Recognition


Ensemble of classifiers for protein fold recognition

Neurocomputing ◽

10.1016/j.neucom.2005.08.006 ◽

2006 ◽

Vol 69 (7-9) ◽

pp. 850-853 ◽

Cited By ~ 19

Author(s):

Loris Nanni

Keyword(s):

Fold Recognition ◽

Protein Fold ◽

Ensemble Of Classifiers ◽

Protein Fold Recognition


A Survey of Graphical Page Object Detection with Deep Neural Networks

10.20944/preprints202104.0739.v1 ◽

2021 ◽

Author(s):

Jwalin Bhatt ◽

Khurram Azeem Hashmi ◽

Muhammad Zeshan Afzal ◽

Didier Stricker

Keyword(s):

Deep Learning ◽

Object Detection ◽

Conceptual Understanding ◽

Deep Neural Networks ◽

State Of The Art ◽

Learning Approaches ◽

Document Images ◽

Essential Information ◽

Current State ◽

High Level

In any document, graphical elements like tables, figures, and formulas contain essential information. The processing and interpretation of such information require specialized algorithms. Off-the-shelf OCR components cannot process this information reliably. Therefore, an essential step in document analysis pipelines is to detect these graphical components. This detection leads to a high-level conceptual understanding of the documents that makes their digitization viable. Since the advent of deep learning, the performance of deep learning-based object detection has improved manyfold. In this work, we outline and summarize the deep learning approaches for detecting graphical page objects in document images. We discuss the most relevant deep learning-based approaches and the state of the art in graphical page object detection in document images. This work provides a comprehensive understanding of the current state of the art and related challenges. Furthermore, we discuss leading datasets along with their quantitative evaluation. Finally, we briefly discuss promising directions for further improvement.
