WITHDRAWN: An insightful recollection for predicting protein subcellular locations in multi-label systems

Background: Revealing the subcellular location of a newly discovered protein can bring insight into their function and guide research at the cellular level. The experimental methods currently used to identify the protein subcellular locations are both time-consuming and expensive. Thus, it is highly desired to develop computational methods for efficiently and effectively identifying the protein subcellular locations. Especially, the rapidly increasing number of protein sequences entering the genome databases has called for the development of automated analysis methods. Methods: In this review, we will describe the recent advances in predicting the protein subcellular locations with machine learning from the following aspects: i) Protein subcellular location benchmark dataset construction, ii) Protein feature representation and feature descriptors, iii) Common machine learning algorithms, iv) Cross-validation test methods and assessment metrics, v) Web servers. Result & Conclusion: Concomitant with a large number of protein sequences generated by highthroughput technologies, four future directions for predicting protein subcellular locations with machine learning should be paid attention. One direction is the selection of novel and effective features (e.g., statistics, physical-chemical, evolutional) from the sequences and structures of proteins. Another is the feature fusion strategy. The third is the design of a powerful predictor and the fourth one is the protein multiple location sites prediction.

Download Full-text

Learning complex subcellular distribution patterns of proteins via analysis of immunohistochemistry images

Bioinformatics ◽

10.1093/bioinformatics/btz844 ◽

2019 ◽

Vol 36 (6) ◽

pp. 1908-1914 ◽

Cited By ~ 4

Author(s):

Ying-Ying Xu ◽

Hong-Bin Shen ◽

Robert F Murphy

Keyword(s):

Subcellular Location ◽

Distribution Patterns ◽

Cell Types ◽

Supplementary Information ◽

Protein Distribution ◽

Protein Subcellular Location ◽

Location Patterns ◽

Location Proteomics ◽

Human Protein Atlas ◽

Protein Subcellular Locations

Abstract Motivation Systematic and comprehensive analysis of protein subcellular location as a critical part of proteomics (‘location proteomics’) has been studied for many years, but annotating protein subcellular locations and understanding variation of the location patterns across various cell types and states is still challenging. Results In this work, we used immunohistochemistry images from the Human Protein Atlas as the source of subcellular location information, and built classification models for the complex protein spatial distribution in normal and cancerous tissues. The models can automatically estimate the fractions of protein in different subcellular locations, and can help to quantify the changes of protein distribution from normal to cancer tissues. In addition, we examined the extent to which different annotated protein pathways and complexes showed similarity in the locations of their member proteins, and then predicted new potential proteins for these networks. Availability and implementation The dataset and code are available at: www.csbio.sjtu.edu.cn/bioinf/complexsubcellularpatterns. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

Predictability of protein subcellular locations by pattern recognition techniques

2010 Annual International Conference of the IEEE Engineering in Medicine and Biology ◽

10.1109/iembs.2010.5626772 ◽

2010 ◽

Cited By ~ 5

Author(s):

J A Jaramillo-Garzón ◽

A Perera-Lluna ◽

C G Castellanos-Domínguez

Keyword(s):

Pattern Recognition ◽

Pattern Recognition Techniques ◽

Protein Subcellular Locations

Download Full-text

ML-RBF: Predict protein subcellular locations in a multi-label system using evolutionary features

Chemometrics and Intelligent Laboratory Systems ◽

10.1016/j.chemolab.2020.104055 ◽

2020 ◽

Vol 203 ◽

pp. 104055

Author(s):

Faisal Javed ◽

Jamal Ahmed ◽

Maqsood Hayat

Keyword(s):

Evolutionary Features ◽

Label System ◽

Protein Subcellular Locations

Download Full-text

acACS: Improving the Prediction Accuracy of Protein Subcellular Locations and Protein Classification by Incorporating the Average Chemical Shifts Composition

The Scientific World JOURNAL ◽

10.1155/2014/864135 ◽

2014 ◽

Vol 2014 ◽

pp. 1-9 ◽

Cited By ~ 5

Author(s):

Guo-Liang Fan ◽

Yan-Ling Liu ◽

Yong-Chun Zuo ◽

Han-Xue Mei ◽

Yi Rang ◽

...

Keyword(s):

Chemical Shift ◽

Prediction Accuracy ◽

Structural Changes ◽

Chemical Shifts ◽

Protein Classification ◽

Structure Information ◽

Local Environments ◽

Protein Subcellular Locations ◽

Auto Covariance ◽

Online Web

The chemical shift is sensitive to changes in the local environments and can report the structural changes. The structure information of a protein can be represented by the average chemical shifts (ACS) composition, which has been broadly applied for enhancing the prediction accuracy in protein subcellular locations and protein classification. However, different kinds of ACS composition can solve different problems. We established an online web server named acACS, which can convert secondary structure into average chemical shift and then compose the vector for representing a protein by using the algorithm of auto covariance. Our solution is easy to use and can meet the needs of users.

Download Full-text