Implementation of Protein Sequence Classification for Globin family using Ensemble Learnin

International Journal of Emerging Trends in Engineering Research ◽

10.30534/ijeter/2021/18942021 ◽

2021 ◽

Vol 9 (4) ◽

pp. 441-445

Keyword(s):

Feature Extraction ◽

Drug Discovery ◽

Protein Sequence ◽

Learning Algorithm ◽

Protein Sequences ◽

Ensemble Classifier ◽

Important Task ◽

Sequence Classification ◽

Feature Vectors ◽

Protein Sequence Classification

Feature Extraction from protein sequence is a very important task in bioinformatics. The main focus of that work is protein sequences classification that can be used to improve drug discovery and identification of diseases for treating patients in the early stages of diagnosis. In this paper, we proposed a method which is used for feature extraction i.e. converting the protein sequence of hemoglobin in to feature vectors. The feature vectors are then given to the ensemble classifier as an input which uses various classifier to provide better result/performance as compared to any constituent learning algorithm alone.

Download Full-text

Protein Sequence Classification with Improved Extreme Learning Machine Algorithms

BioMed Research International ◽

10.1155/2014/103054 ◽

2014 ◽

Vol 2014 ◽

pp. 1-12 ◽

Author(s):

Jiuwen Cao ◽

Lianglin Xiong

Keyword(s):

Extreme Learning Machine ◽

Protein Sequence ◽

Protein Sequences ◽

Activation Function ◽

Majority Voting ◽

Training Algorithms ◽

Sequence Classification ◽

Protein Sequence Classification ◽

Learning Machine ◽

Majority Voting Method

Precisely classifying a protein sequence from a large biological protein sequences database plays an important role for developing competitive pharmacological products. Comparing the unseen sequence with all the identified protein sequences and returning the category index with the highest similarity scored protein, conventional methods are usually time-consuming. Therefore, it is urgent and necessary to build an efficient protein sequence classification system. In this paper, we study the performance of protein sequence classification using SLFNs. The recent efficient extreme learning machine (ELM) and its invariants are utilized as the training algorithms. The optimal pruned ELM is first employed for protein sequence classification in this paper. To further enhance the performance, the ensemble based SLFNs structure is constructed where multiple SLFNs with the same number of hidden nodes and the same activation function are used as ensembles. For each ensemble, the same training algorithm is adopted. The final category index is derived using the majority voting method. Two approaches, namely, the basic ELM and the OP-ELM, are adopted for the ensemble based SLFNs. The performance is analyzed and compared with several existing methods using datasets obtained from the Protein Information Resource center. The experimental results show the priority of the proposed algorithms.

Download Full-text

A Novel Technique of Feature Extraction with Dual Similarity Measures for Protein Sequence Classification

Procedia Computer Science ◽

10.1016/j.procs.2015.04.217 ◽

2015 ◽

Vol 48 ◽

pp. 795-801 ◽

Author(s):

Neha Bharill ◽

Aruna Tiwari ◽

Anshul Rawat

Keyword(s):

Feature Extraction ◽

Protein Sequence ◽

Similarity Measures ◽

Sequence Classification ◽

Novel Technique ◽

Protein Sequence Classification

Download Full-text

Protein sequence classification using extreme learning machine

Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005. ◽

10.1109/ijcnn.2005.1556080 ◽

2006 ◽

Author(s):

Dianhui Wang ◽

Guang-Bin Huang

Keyword(s):

Extreme Learning Machine ◽

Protein Sequence ◽

Sequence Classification ◽

Protein Sequence Classification ◽

Learning Machine

Download Full-text

A Brief Review of Data Mining Application Involving Protein Sequence Classification

Advances in Computing and Information Technology - Advances in Intelligent Systems and Computing ◽

10.1007/978-3-642-31552-7_48 ◽

2013 ◽

pp. 469-477 ◽

Author(s):

Suprativ Saha ◽

Rituparna Chaki

Keyword(s):

Data Mining ◽

Protein Sequence ◽

Sequence Classification ◽

Protein Sequence Classification ◽

Data Mining Application

Download Full-text

An Approach to Find Proper Execution Parameters of n-Gram Encoding Method Based on Protein Sequence Classification

Communications in Computer and Information Science - Advances in Computing and Data Sciences ◽

10.1007/978-981-13-9942-8_28 ◽

2019 ◽

pp. 294-303 ◽

Author(s):

Suprativ Saha ◽

Tanmay Bhattacharya

Keyword(s):

Protein Sequence ◽

Sequence Classification ◽

Protein Sequence Classification ◽

N Gram ◽

Encoding Method

Download Full-text

Application of Data Mining in Protein Sequence Classification

International Journal of Database Management Systems ◽

10.5121/ijdms.2012.4508 ◽

2012 ◽

Vol 4 (5) ◽

pp. 103-118 ◽

Author(s):

Suprativ Saha

Keyword(s):

Data Mining ◽

Protein Sequence ◽

Sequence Classification ◽

Protein Sequence Classification

Download Full-text

Efficient use of unlabeled data for protein sequence classification: a comparative study

BMC Bioinformatics ◽

10.1186/1471-2105-10-s4-s2 ◽

2009 ◽

Vol 10 (Suppl 4) ◽

pp. S2 ◽

Author(s):

Pavel Kuksa ◽

Pai-Hsi Huang ◽

Vladimir Pavlovic

Keyword(s):

Comparative Study ◽

Protein Sequence ◽

Unlabeled Data ◽

Sequence Classification ◽

Protein Sequence Classification

Download Full-text

Learned Random-Walk Kernels and Empirical-Map Kernels for Protein Sequence Classification

Journal of Computational Biology ◽

10.1089/cmb.2008.0031 ◽

2009 ◽

Vol 16 (3) ◽

pp. 457-474 ◽

Author(s):

Renqiang Min ◽

Anthony Bonner ◽

Jingjing Li ◽

Zhaolei Zhang

Keyword(s):

Random Walk ◽

Protein Sequence ◽

Sequence Classification ◽

Protein Sequence Classification

Download Full-text

Cluster identification and separation in the growing self-organizing map: application in protein sequence classification

Neural Computing and Applications ◽

10.1007/s00521-009-0300-0 ◽

2009 ◽

Vol 19 (4) ◽

pp. 531-542 ◽

Author(s):

Norashikin Ahmad ◽

Damminda Alahakoon ◽

Rowena Chau

Keyword(s):

Protein Sequence ◽

Self Organizing Map ◽

Sequence Classification ◽

Cluster Identification ◽

Protein Sequence Classification ◽

Self Organizing

Download Full-text

Mining for class-specific motifs in protein sequence classification

BMC Bioinformatics ◽

10.1186/1471-2105-14-96 ◽

2013 ◽

Vol 14 (1) ◽

pp. 96 ◽

Author(s):

Satish M Srinivasan ◽

Suleyman Vural ◽

Brian R King ◽

Chittibabu Guda

Keyword(s):

Protein Sequence ◽

Sequence Classification ◽

Protein Sequence Classification

Download Full-text