A Parallel-Processor Implementation of an Algorithm to Delineate Distantly Related Protein Sequences using Conserved Motifs and Neural Networks

Author(s):  
Gerhard Vogt ◽  
Dmitrij Frishman ◽  
Patrick Argos
2005 ◽  
Vol 18 (5-6) ◽  
pp. 835-842 ◽  
Author(s):  
Derong Liu ◽  
Xiaoxu Xiong ◽  
Zeng-Guang Hou ◽  
Bhaskar DasGupta

2021 ◽  
Author(s):  
Guanwen Feng ◽  
Hang Yao ◽  
Chaoneng Li ◽  
Ruyi Liu ◽  
Rungen Huang ◽  
...  

Cancer remains one of the most threatening diseases, which kills millions of lives every year. As a promising perspective for cancer treatments, anticancer peptides (ACPs) overcome a lot of disadvantages of traditional treatments. However, it is time-consuming and expensive to identify ACPs through conventional experiments. Hence, it is urgent and necessary to develop highly effective approaches to accurately identify ACPs in large amounts of protein sequences. In this work, we proposed a novel and effective method named ME-ACP which employed multi-view neural networks with ensemble model to identify ACPs. Firstly, we employed residue level and peptide level features preliminarily with ensemble models based on lightGBMs. Then, the outputs of lightGBM classifiers were fed into a hybrid deep neural network (HDNN) to identify ACPs. The experiments on independent test datasets demonstrated that ME-ACP achieved competitive performance on common evaluation metrics.


2019 ◽  
Vol 36 (1) ◽  
pp. 272-279 ◽  
Author(s):  
Hannah F Löchel ◽  
Dominic Eger ◽  
Theodor Sperlea ◽  
Dominik Heider

AbstractMotivationClassification of protein sequences is one big task in bioinformatics and has many applications. Different machine learning methods exist and are applied on these problems, such as support vector machines (SVM), random forests (RF) and neural networks (NN). All of these methods have in common that protein sequences have to be made machine-readable and comparable in the first step, for which different encodings exist. These encodings are typically based on physical or chemical properties of the sequence. However, due to the outstanding performance of deep neural networks (DNN) on image recognition, we used frequency matrix chaos game representation (FCGR) for encoding of protein sequences into images. In this study, we compare the performance of SVMs, RFs and DNNs, trained on FCGR encoded protein sequences. While the original chaos game representation (CGR) has been used mainly for genome sequence encoding and classification, we modified it to work also for protein sequences, resulting in n-flakes representation, an image with several icosagons.ResultsWe could show that all applied machine learning techniques (RF, SVM and DNN) show promising results compared to the state-of-the-art methods on our benchmark datasets, with DNNs outperforming the other methods and that FCGR is a promising new encoding method for protein sequences.Availability and implementationhttps://cran.r-project.org/.Supplementary informationSupplementary data are available at Bioinformatics online.


2005 ◽  
Vol 21 (14) ◽  
pp. 3166-3167 ◽  
Author(s):  
O. Pible ◽  
G. Imbert ◽  
J.-L. Pellequer

Sign in / Sign up

Export Citation Format

Share Document