Learning complex subcellular distribution patterns of proteins via analysis of immunohistochemistry images

Ying-Ying Xu; Hong-Bin Shen; Robert F Murphy

doi:10.1093/bioinformatics/btz844

Bioinformatics ◽

10.1093/bioinformatics/btz844 ◽

2019 ◽

Vol 36 (6) ◽

pp. 1908-1914 ◽

Cited By ~ 4

Author(s):

Ying-Ying Xu ◽

Hong-Bin Shen ◽

Robert F Murphy

Keyword(s):

Subcellular Location ◽

Distribution Patterns ◽

Cell Types ◽

Supplementary Information ◽

Protein Distribution ◽

Protein Subcellular Location ◽

Location Patterns ◽

Location Proteomics ◽

Human Protein Atlas ◽

Protein Subcellular Locations

Abstract. Motivation: Systematic and comprehensive analysis of protein subcellular location as a critical part of proteomics (‘location proteomics’) has been studied for many years, but annotating protein subcellular locations and understanding how location patterns vary across cell types and states remain challenging. Results: In this work, we used immunohistochemistry images from the Human Protein Atlas as the source of subcellular location information, and built classification models for the complex protein spatial distribution in normal and cancerous tissues. The models can automatically estimate the fractions of protein in different subcellular locations, and can help to quantify the changes of protein distribution from normal to cancer tissues. In addition, we examined the extent to which different annotated protein pathways and complexes showed similarity in the locations of their member proteins, and then predicted new potential proteins for these networks. Availability and implementation: The dataset and code are available at: www.csbio.sjtu.edu.cn/bioinf/complexsubcellularpatterns. Supplementary information: Supplementary data are available at Bioinformatics online.
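The abstract above does not say how the fractions are computed or how the change between tissues is measured. As a rough illustration only, the sketch below assumes a model has already produced one non-negative score per location, normalizes the scores into fractions, and summarizes the normal-to-cancer change with the total variation distance. The function names, the location labels and the choice of distance are all assumptions made for this example, not the authors' method.

```python
def to_fractions(scores):
    """Normalize non-negative per-location scores so they sum to 1."""
    total = sum(scores.values())
    if total <= 0:
        raise ValueError("scores must have a positive sum")
    return {loc: s / total for loc, s in scores.items()}


def distribution_shift(normal, cancer):
    """Total variation distance between two fraction dictionaries.

    0.0 means identical distributions; 1.0 means no overlap at all.
    """
    locations = set(normal) | set(cancer)
    return 0.5 * sum(
        abs(normal.get(loc, 0.0) - cancer.get(loc, 0.0)) for loc in locations
    )


# Hypothetical scores for one protein in two tissue states.
normal = to_fractions({"nucleus": 3.0, "cytoplasm": 1.0})
cancer = to_fractions({"nucleus": 1.0, "cytoplasm": 3.0})
shift = distribution_shift(normal, cancer)
print(normal, cancer, shift)
```

A single number per protein of this kind would make it possible to rank proteins by how strongly their distribution differs between tissue states, although any real analysis would also need to account for uncertainty in the estimated fractions.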

Interpretation of Protein Subcellular Location Patterns in 3D Images Across Cell Types and Resolutions

Bioinformatics Research and Development - Lecture Notes in Computer Science ◽

10.1007/978-3-540-71233-6_26 ◽

2007 ◽

pp. 328-342 ◽

Cited By ~ 3

Author(s):

Xiang Chen ◽

Robert F. Murphy

Keyword(s):

Subcellular Location ◽

Cell Types ◽

3D Images ◽

Protein Subcellular Location ◽

Location Patterns

Advances in the Prediction of Protein Subcellular Locations with Machine Learning

Current Bioinformatics ◽

10.2174/1574893614666181217145156 ◽

2019 ◽

Vol 14 (5) ◽

pp. 406-421 ◽

Cited By ~ 3

Author(s):

Ting-He Zhang ◽

Shao-Wu Zhang

Keyword(s):

Machine Learning ◽

Feature Fusion ◽

Protein Sequences ◽

Subcellular Location ◽

Automated Analysis ◽

Cellular Level ◽

Machine Learning Algorithms ◽

Feature Representation ◽

Protein Subcellular Location ◽

Protein Subcellular Locations

Background: Revealing the subcellular location of a newly discovered protein can bring insight into its function and guide research at the cellular level. The experimental methods currently used to identify protein subcellular locations are both time-consuming and expensive. Thus, it is highly desirable to develop computational methods for identifying protein subcellular locations efficiently and effectively. In particular, the rapidly increasing number of protein sequences entering the genome databases has called for the development of automated analysis methods. Methods: In this review, we describe recent advances in predicting protein subcellular locations with machine learning from the following aspects: i) Protein subcellular location benchmark dataset construction, ii) Protein feature representation and feature descriptors, iii) Common machine learning algorithms, iv) Cross-validation test methods and assessment metrics, v) Web servers. Result & Conclusion: Given the large number of protein sequences generated by high-throughput technologies, four future directions for predicting protein subcellular locations with machine learning deserve attention. The first is the selection of novel and effective features (e.g., statistical, physicochemical, evolutionary) from the sequences and structures of proteins. The second is the feature fusion strategy. The third is the design of a powerful predictor, and the fourth is the prediction of proteins with multiple location sites.

Searching online journals for fluorescence microscope images depicting protein subcellular location patterns

Proceedings 2nd Annual IEEE International Symposium on Bioinformatics and Bioengineering (BIBE 2001) ◽

10.1109/bibe.2001.974420 ◽

2001 ◽

Cited By ~ 25

Author(s):

R.F. Murphy ◽

M. Velliste ◽

Jie Yao ◽

G. Porreca

Keyword(s):

Subcellular Location ◽

Fluorescence Microscope ◽

Protein Subcellular Location ◽

Online Journals ◽

Microscope Images ◽

Location Patterns

Incorporating label correlations into deep neural networks to classify protein subcellular location patterns in immunohistochemistry images

Proteins Structure Function and Bioinformatics ◽

10.1002/prot.26244 ◽

2021 ◽

Author(s):

Jin‐Xian Hu ◽

Yang Yang ◽

Ying‐Ying Xu ◽

Hong‐Bin Shen

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Subcellular Location ◽

Protein Subcellular Location ◽

Location Patterns ◽

Label Correlations

Automated comparison of protein subcellular location patterns between images of normal and cancerous tissues

2008 5th IEEE International Symposium on Biomedical Imaging: From Nano to Macro ◽

10.1109/isbi.2008.4540993 ◽

2008 ◽

Cited By ~ 8

Author(s):

Estelle Glory ◽

Justin Newberg ◽

Robert F. Murphy

Keyword(s):

Subcellular Location ◽

Protein Subcellular Location ◽

Location Patterns

Automated Classification of Protein Subcellular Location Patterns on Images of Human Reproductive Tissues

Intelligent Science and Intelligent Data Engineering - Lecture Notes in Computer Science ◽

10.1007/978-3-642-36669-7_32 ◽

2013 ◽

pp. 254-262

Author(s):

Fan Yang ◽

Ying-Ying Xu ◽

Hong-Bin Shen

Keyword(s):

Subcellular Location ◽

Automated Classification ◽

Protein Subcellular Location ◽

Reproductive Tissues ◽

Location Patterns ◽

Human Reproductive

A Novel Approximate Inference Approach to Automated Classification of Protein Subcellular Location Patterns in Multi-Cell Images

3rd IEEE International Symposium on Biomedical Imaging: Macro to Nano, 2006. ◽

10.1109/isbi.2006.1624977 ◽

2006 ◽

Cited By ~ 2

Author(s):

Shann-Ching Chen ◽

G.J. Gordon ◽

R.F. Murphy

Keyword(s):

Subcellular Location ◽

Approximate Inference ◽

Automated Classification ◽

Protein Subcellular Location ◽

Location Patterns

Automated Interpretation of Protein Subcellular Location Patterns

International Review of Cytology ◽

10.1016/s0074-7696(06)49004-5 ◽

2006 ◽

pp. 193-227 ◽

Cited By ~ 7

Author(s):

Xiang Chen ◽

Robert F. Murphy

Keyword(s):

Subcellular Location ◽

Protein Subcellular Location ◽

Location Patterns ◽

Automated Interpretation

ImPLoc: a multi-instance deep learning model for the prediction of protein subcellular localization based on immunohistochemistry images

Bioinformatics ◽

10.1093/bioinformatics/btz909 ◽

2019 ◽

Vol 36 (7) ◽

pp. 2244-2250 ◽

Cited By ~ 5

Author(s):

Wei Long ◽

Yang Yang ◽

Hong-Bin Shen

Keyword(s):

Subcellular Localization ◽

Tissue Level ◽

Image Features ◽

Supplementary Information ◽

Protein Distribution ◽

Protein Subcellular Localization ◽

Significance Level ◽

Protein Functions ◽

Human Protein Atlas ◽

Cancer Tissues

Abstract. Motivation: The tissue atlas of the Human Protein Atlas (HPA) houses immunohistochemistry (IHC) images visualizing protein distribution from the tissue level down to the cell level, which provide an important resource for studying the human spatial proteome. In particular, the protein subcellular localization patterns revealed by these images are helpful for understanding protein functions, and differential localization analysis across normal and cancer tissues leads to new cancer biomarkers. However, computational tools for processing images in this database are highly underdeveloped. Recognition of the localization patterns suffers from variation in image quality and the difficulty of detecting microscopic targets. Results: We propose a deep multi-instance multi-label model, ImPLoc, to predict the subcellular locations from IHC images. In this model, we employ a deep convolutional neural network-based feature extractor to represent image features, and design a multi-head self-attention encoder to aggregate multiple feature vectors for subsequent prediction. We construct a benchmark dataset of 1186 proteins including 7855 images from HPA and 6 subcellular locations. The experimental results show that ImPLoc achieves a significant improvement in prediction accuracy compared with current computational methods. We further apply ImPLoc to a test set of 889 proteins with images from both normal and cancer tissues, and obtain 8 differentially localized proteins with a significance level of 0.05. Availability and implementation: https://github.com/yl2019lw/ImPloc. Supplementary information: Supplementary data are available at Bioinformatics online.
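To make the multi-instance, multi-label idea in this abstract concrete, here is a deliberately simplified sketch. It is not the ImPLoc architecture: in place of a CNN feature extractor and a multi-head self-attention encoder, it uses hand-written feature vectors and a single attention-pooling step with one query vector, followed by an independent sigmoid per location. All names, weights and numbers are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(bag, query):
    """Collapse a bag of per-image feature vectors into one vector.

    Each image is scored by its scaled dot product with `query`;
    the softmax of those scores weights the average.
    """
    dim = len(query)
    scores = [
        sum(f * q for f, q in zip(feat, query)) / math.sqrt(dim) for feat in bag
    ]
    weights = softmax(scores)
    pooled = [
        sum(w * feat[j] for w, feat in zip(weights, bag)) for j in range(dim)
    ]
    return pooled, weights


def predict_labels(pooled, label_weights, threshold=0.5):
    """Multi-label output: one independent sigmoid per location."""
    probs = [
        1.0 / (1.0 + math.exp(-sum(w * x for w, x in zip(row, pooled))))
        for row in label_weights
    ]
    return [p >= threshold for p in probs], probs


# One protein = one bag of three image feature vectors (toy values).
bag = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
query = [1.0, 0.0]
pooled, weights = attention_pool(bag, query)
flags, probs = predict_labels(pooled, [[2.0, -1.0], [-1.0, 2.0]])
print(weights, pooled, flags)
```

The point of pooling before prediction is that labels attach to the protein (the bag) rather than to any single image, so low-quality images can receive small weights instead of being classified on their own.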

Location proteomics: determining the optimal grouping of proteins according to their subcellular location patterns as determined from fluorescence microscope images

Conference Record of the Thirty-Eighth Asilomar Conference on Signals, Systems and Computers, 2004. ◽

10.1109/acssc.2004.1399085 ◽

2005 ◽

Author(s):

Xiang Chen ◽

R.F. Murphy

Keyword(s):

Subcellular Location ◽

Fluorescence Microscope ◽

Microscope Images ◽

Location Patterns ◽

Location Proteomics ◽

Optimal Grouping
