Prediction of protein function using a deep convolutional neural network ensemble

10.7287/peerj.preprints.2778 ◽

2017 ◽

Author(s):

Evangelia I Zacharaki

Keyword(s):

Neural Network ◽

Amino Acid ◽

Convolutional Neural Network ◽

Protein Function ◽

Protein Structures ◽

Function Prediction ◽

Deep Convolutional Neural Network ◽

Supervised Machine Learning ◽

Support Vector ◽

Feature Maps

Background. The availability of large databases containing high resolution three-dimensional (3D) models of proteins in conjunction with functional annotation allows the exploitation of advanced supervised machine learning techniques for automatic protein function prediction. Methods. In this work, novel shape features are extracted representing protein structure in the form of local (per amino acid) distribution of angles and amino acid distances, respectively. Each of the multi-channel feature maps is introduced into a deep convolutional neural network (CNN) for function prediction and the outputs are fused through Support Vector Machines (SVM) or a correlation-based k-nearest neighbor classifier. Two different architectures are investigated employing either one CNN per multi-channel feature set, or one CNN per image channel. Results. Cross validation experiments on enzymes (n = 44,661) from the PDB database achieved 90.1% correct classification demonstrating the effectiveness of the proposed method for automatic function annotation of protein structures. Discussion. The automatic prediction of protein function can provide quick annotations on extensive datasets opening the path for relevant applications, such as pharmacological target identification.

Prediction of protein function using a deep convolutional neural network ensemble

10.7287/peerj.preprints.2778v1 ◽

2017 ◽

Author(s):

Evangelia I Zacharaki

Keyword(s):

Neural Network ◽

Amino Acid ◽

Convolutional Neural Network ◽

Protein Function ◽

Protein Structures ◽

Function Prediction ◽

Deep Convolutional Neural Network ◽

Supervised Machine Learning ◽

Support Vector ◽

Feature Maps

Background. The availability of large databases containing high resolution three-dimensional (3D) models of proteins in conjunction with functional annotation allows the exploitation of advanced supervised machine learning techniques for automatic protein function prediction. Methods. In this work, novel shape features are extracted representing protein structure in the form of local (per amino acid) distribution of angles and amino acid distances, respectively. Each of the multi-channel feature maps is introduced into a deep convolutional neural network (CNN) for function prediction and the outputs are fused through Support Vector Machines (SVM) or a correlation-based k-nearest neighbor classifier. Two different architectures are investigated employing either one CNN per multi-channel feature set, or one CNN per image channel. Results. Cross validation experiments on enzymes (n = 44,661) from the PDB database achieved 90.1% correct classification demonstrating the effectiveness of the proposed method for automatic function annotation of protein structures. Discussion. The automatic prediction of protein function can provide quick annotations on extensive datasets opening the path for relevant applications, such as pharmacological target identification.

MDPFP-FCNN: Multidomain Protein Function Prediction Using Fuzzy Convolutional Neural Network

International Journal of Intelligent Engineering and Systems ◽

10.22266/ijies2021.1231.57 ◽

2021 ◽

Vol 14 (6) ◽

pp. 642-655

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Protein Function ◽

Protein Function Prediction ◽

Function Prediction ◽

Multidomain Protein

Pseudo Amino Acid Feature-based Protein Function Prediction using Support Vector Machine and K-Nearest Neighbors

International Journal of Advanced Computer Science and Applications ◽

10.14569/ijacsa.2020.0110922 ◽

2020 ◽

Vol 11 (9) ◽

Author(s):

Anjna Jayant Deen ◽

Manasi Gyanchandani

Keyword(s):

Support Vector Machine ◽

Amino Acid ◽

Protein Function ◽

Protein Function Prediction ◽

Nearest Neighbors ◽

Function Prediction ◽

Support Vector ◽

K Nearest Neighbors ◽

Feature Based ◽

Amino Acid Feature

ProLanGO: Protein Function Prediction Using Neural Machine Translation Based on a Recurrent Neural Network

Molecules ◽

10.3390/molecules22101732 ◽

2017 ◽

Vol 22 (10) ◽

pp. 1732 ◽

Cited By ~ 84

Author(s):

Renzhi Cao ◽

Colton Freitas ◽

Leong Chan ◽

Miao Sun ◽

Haiqing Jiang ◽

...

Keyword(s):

Neural Network ◽

Machine Translation ◽

Recurrent Neural Network ◽

Protein Function ◽

Protein Function Prediction ◽

Function Prediction ◽

Neural Machine Translation

Sars-Cov-2 Spike protein function prediction using a convolutional neural network ensemble

Design Engineering ◽

10.17762/de.vi.4293 ◽

2021 ◽

pp. 7831-7845

Author(s):

Raghad Monther Eid, Eman K. Elsayed, Fatma T. Ghanam

Keyword(s):

Neural Network ◽

Amino Acid ◽

Protein Function ◽

Protein Function Prediction ◽

Small Error ◽

Amino Acid Sequences ◽

Spike Protein ◽

Neural Network Ensemble ◽

Classification Problems ◽

Past Experiences

Introduction: SARS-CoV-2 has become a worldwide pandemic that affects all aspects of life; therefore, numerous organizations and open exploration foundations focus their efforts on research for viable therapeutics. Given past experiences and involvement in SARS, the essential focus has been the Spike protein, considered as the perfect objective for COVID-19 immunotherapies. Most of the vaccines being developed target the spike proteins because this protein covers the virus and helps it invade human cells. Methods: Applications of deep neural network is a quickly expanding field now reaching many areas including proteomics. Results: To be precise, convolutional neural networks have been used for identifying the functional role of amino acid sequences, because of its ability to give nearly accurate results for multi-label classification problems. Here we present a modified convolutional deep learning model that can identify if a given amino acid sequence is a spike protein or not based on the length of the sequence and the function of the protein, that will be done with a short execution time and a relatively small error rate. Conclusion: CNN is an efficient tool at supervised multilabel classification problems

Identifying short disorder-to-order binding regions in disordered proteins with a deep convolutional neural network method

Journal of Bioinformatics and Computational Biology ◽

10.1142/s0219720019500045 ◽

2019 ◽

Vol 17 (01) ◽

pp. 1950004 ◽

Cited By ~ 2

Author(s):

Chun Fang ◽

Yoshitaka Moriwaki ◽

Aikui Tian ◽

Caihong Li ◽

Kentaro Shimizu

Keyword(s):

Neural Network ◽

Amino Acid ◽

Convolutional Neural Network ◽

Intrinsically Disordered Proteins ◽

Protein Sequences ◽

Interaction Network ◽

Deep Convolutional Neural Network ◽

Disordered Proteins ◽

Related Factors ◽

Intrinsically Disordered

Molecular recognition features (MoRFs) are key functional regions of intrinsically disordered proteins (IDPs), which play important roles in the molecular interaction network of cells and are implicated in many serious human diseases. Identifying MoRFs is essential for both functional studies of IDPs and drug design. This study adopts the cutting-edge machine learning method of artificial intelligence to develop a powerful model for improving MoRFs prediction. We proposed a method, named as en_DCNNMoRF (ensemble deep convolutional neural network-based MoRF predictor). It combines the outcomes of two independent deep convolutional neural network (DCNN) classifiers that take advantage of different features. The first, DCNNMoRF1, employs position-specific scoring matrix (PSSM) and 22 types of amino acid-related factors to describe protein sequences. The second, DCNNMoRF2, employs PSSM and 13 types of amino acid indexes to describe protein sequences. For both single classifiers, DCNN with a novel two-dimensional attention mechanism was adopted, and an average strategy was added to further process the output probabilities of each DCNN model. Finally, en_DCNNMoRF combined the two models by averaging their final scores. When compared with other well-known tools applied to the same datasets, the accuracy of the novel proposed method was comparable with that of state-of-the-art methods. The related web server can be accessed freely via http://vivace.bi.a.u-tokyo.ac.jp:8008/fang/en_MoRFs.php .

A Deep Neural Network Based Hierarchical Multi-Label Classifier for Protein Function Prediction

2019 International Conference on Computer, Information and Telecommunication Systems (CITS) ◽

10.1109/cits.2019.8862034 ◽

2019 ◽

Author(s):

Xin Yuan ◽

Weite Li ◽

Kui Lin ◽

Jinglu Hu

Keyword(s):

Neural Network ◽

Protein Function ◽

Deep Neural Network ◽

Protein Function Prediction ◽

Function Prediction

An Intelligent Diagnosis Method of Brain MRI Tumor Segmentation Using Deep Convolutional Neural Network and SVM Algorithm

Computational and Mathematical Methods in Medicine ◽

10.1155/2020/6789306 ◽

2020 ◽

Vol 2020 ◽

pp. 1-10

Author(s):

Wentao Wu ◽

Daning Li ◽

Jiaoyang Du ◽

Xiangyu Gao ◽

Wen Gu ◽

...

Keyword(s):

Neural Network ◽

Support Vector Machine ◽

Brain Tumor ◽

Convolutional Neural Network ◽

Deep Convolutional Neural Network ◽

Support Vector ◽

Tumor Segmentation ◽

Brain Segmentation ◽

Brain Tumor Segmentation ◽

Segmentation Methods

Among the currently proposed brain segmentation methods, brain tumor segmentation methods based on traditional image processing and machine learning are not ideal enough. Therefore, deep learning-based brain segmentation methods are widely used. In the brain tumor segmentation method based on deep learning, the convolutional network model has a good brain segmentation effect. The deep convolutional network model has the problems of a large number of parameters and large loss of information in the encoding and decoding process. This paper proposes a deep convolutional neural network fusion support vector machine algorithm (DCNN-F-SVM). The proposed brain tumor segmentation model is mainly divided into three stages. In the first stage, a deep convolutional neural network is trained to learn the mapping from image space to tumor marker space. In the second stage, the predicted labels obtained from the deep convolutional neural network training are input into the integrated support vector machine classifier together with the test images. In the third stage, a deep convolutional neural network and an integrated support vector machine are connected in series to train a deep classifier. Run each model on the BraTS dataset and the self-made dataset to segment brain tumors. The segmentation results show that the performance of the proposed model is significantly better than the deep convolutional neural network and the integrated SVM classifier.

Impact of Feature Selection Algorithm on Speech Emotion Recognition Using Deep Convolutional Neural Network

Sensors ◽

10.3390/s20216008 ◽

2020 ◽

Vol 20 (21) ◽

pp. 6008 ◽

Cited By ~ 1

Author(s):

Misbah Farooq ◽

Fawad Hussain ◽

Naveed Khan Baloch ◽

Fawad Riasat Raja ◽

Heejung Yu ◽

...

Keyword(s):

Neural Network ◽

Feature Selection ◽

Convolutional Neural Network ◽

Emotion Recognition ◽

Deep Convolutional Neural Network ◽

Speech Emotion Recognition ◽

Support Vector ◽

Emotional Speech ◽

Human Machine Interaction ◽

Speaker Independent

Speech emotion recognition (SER) plays a significant role in human–machine interaction. Emotion recognition from speech and its precise classification is a challenging task because a machine is unable to understand its context. For an accurate emotion classification, emotionally relevant features must be extracted from the speech data. Traditionally, handcrafted features were used for emotional classification from speech signals; however, they are not efficient enough to accurately depict the emotional states of the speaker. In this study, the benefits of a deep convolutional neural network (DCNN) for SER are explored. For this purpose, a pretrained network is used to extract features from state-of-the-art speech emotional datasets. Subsequently, a correlation-based feature selection technique is applied to the extracted features to select the most appropriate and discriminative features for SER. For the classification of emotions, we utilize support vector machines, random forests, the k-nearest neighbors algorithm, and neural network classifiers. Experiments are performed for speaker-dependent and speaker-independent SER using four publicly available datasets: the Berlin Dataset of Emotional Speech (Emo-DB), Surrey Audio Visual Expressed Emotion (SAVEE), Interactive Emotional Dyadic Motion Capture (IEMOCAP), and the Ryerson Audio Visual Dataset of Emotional Speech and Song (RAVDESS). Our proposed method achieves an accuracy of 95.10% for Emo-DB, 82.10% for SAVEE, 83.80% for IEMOCAP, and 81.30% for RAVDESS, for speaker-dependent SER experiments. Moreover, our method yields the best results for speaker-independent SER with existing handcrafted features-based SER approaches.