PSORTdb--an expanded, auto-updated, user-friendly protein subcellular localization database for Bacteria and Archaea

Knowledge of protein subcellular localization is vitally important for both basic research and drug development. With the avalanche of protein sequences emerging in the post-genomic age, it is highly desired to develop computational tools for timely and effectively identifying their subcellular localization based on the sequence information alone. Recently, a predictor called “pLoc-mPlant” was developed for identifying the subcellular localization of plant proteins. Its performance is overwhelmingly better than that of the other predictors for the same purpose, particularly in dealing with multi-label systems in which some proteins, called “multiplex proteins”, may simultaneously occur in two or more subcellular locations. Although it is indeed a very powerful predictor, more efforts are definitely needed to further improve it. This is because pLoc-mPlant was trained by an extremely skewed dataset in which some subsets (i.e., the protein numbers for some subcellular locations) were more than 10 times larger than the others. Accordingly, it cannot avoid the biased consequence caused by such an uneven training dataset. To overcome such biased consequence, we have developed a new and bias-free predictor called pLoc_bal-mPlant by balancing the training dataset. Cross-validation tests on exactly the same experimentconfirmed dataset have indicated that the proposed new predictor is remarkably superior to pLoc-mPlant, the existing state-of-the-art predictor in identifying the subcellular localization of plant proteins. To maximize the convenience for the majority of experimental scientists, a user-friendly web-server for the new predictor has been established at http://www.jci-bioinfo.cn/pLoc_bal-mPlant/, by which users can easily get their desired results without the need to go through the detailed mathematics.

Download Full-text

Reuse Recipe Document for: A robust fractionation method for protein subcellular localization studies in Escherichia coli

10.23942/biotechniques.1559047365000 ◽

2019 ◽

Keyword(s):

Escherichia Coli ◽

Subcellular Localization ◽

Protein Subcellular Localization ◽

Fractionation Method

Download Full-text

An efficient transient assays system using Agrobacterium-mediated transformation of onion (Allium cepa) epidermal cells

Indian Journal of Genetics and Plant Breeding (The) ◽

10.31742/ijgpb.80.3.17 ◽

2020 ◽

Vol 80 (03) ◽

Author(s):

Yu-Miao Zhang ◽

Jun Wang ◽

Tao Wu

Keyword(s):

Subcellular Localization ◽

Protein Interaction ◽

Protein Interactions ◽

Epidermal Cells ◽

Cyclin Dependent Kinase ◽

Protein Subcellular Localization ◽

Protein Protein Interactions ◽

Efficient System ◽

Protein Protein Interaction ◽

Onion Epidermal Cells

In this study, the Agrobacterium infection medium, infection duration, detergent, and cell density were optimized. The sorghum-based infection medium (SbIM), 10-20 min infection time, addition of 0.01% Silwet L-77, and Agrobacterium optical density at 600 nm (OD600), improved the competence of onion epidermal cells to support Agrobacterium infection at >90% efficiency. Cyclin-dependent kinase D-2 (CDKD-2) and cytochrome c-type biogenesis protein (CYCH), protein-protein interactions were localized. The optimized procedure is a quick and efficient system for examining protein subcellular localization and protein-protein interaction.

Download Full-text

Prediction of Protein Subcellular Localization by Using 𝛌-Order Factor and Principal Component Analysis

Letters in Organic Chemistry ◽

10.2174/1570178614666170227142225 ◽

2017 ◽

Vol 14 (9) ◽

Cited By ~ 1

Author(s):

Shengli Zhang ◽

Jin Jin

Keyword(s):

Principal Component Analysis ◽

Subcellular Localization ◽

Principal Component ◽

Component Analysis ◽

Protein Subcellular Localization ◽

Order Factor

Download Full-text

pLoc_bal-mEuk: Predict Subcellular Localization of Eukaryotic Proteins by General PseAAC and Quasi-balancing Training Dataset

Medicinal Chemistry ◽

10.2174/1573406415666181218102517 ◽

2019 ◽

Vol 15 (5) ◽

pp. 472-485 ◽

Cited By ~ 21

Author(s):

Kuo-Chen Chou ◽

Xiang Cheng ◽

Xuan Xiao

Keyword(s):

Drug Development ◽

Subcellular Localization ◽

Basic Research ◽

The Other ◽

Training Dataset ◽

Sequence Information ◽

Eukaryotic Proteins ◽

Validation Tests ◽

User Friendly ◽

Better Than

Background/Objective: Information of protein subcellular localization is crucially important for both basic research and drug development. With the explosive growth of protein sequences discovered in the post-genomic age, it is highly demanded to develop powerful bioinformatics tools for timely and effectively identifying their subcellular localization purely based on the sequence information alone. Recently, a predictor called “pLoc-mEuk” was developed for identifying the subcellular localization of eukaryotic proteins. Its performance is overwhelmingly better than that of the other predictors for the same purpose, particularly in dealing with multi-label systems where many proteins, called “multiplex proteins”, may simultaneously occur in two or more subcellular locations. Although it is indeed a very powerful predictor, more efforts are definitely needed to further improve it. This is because pLoc-mEuk was trained by an extremely skewed dataset where some subset was about 200 times the size of the other subsets. Accordingly, it cannot avoid the biased consequence caused by such an uneven training dataset. Methods: To alleviate such bias, we have developed a new predictor called pLoc_bal-mEuk by quasi-balancing the training dataset. Cross-validation tests on exactly the same experimentconfirmed dataset have indicated that the proposed new predictor is remarkably superior to pLocmEuk, the existing state-of-the-art predictor in identifying the subcellular localization of eukaryotic proteins. It has not escaped our notice that the quasi-balancing treatment can also be used to deal with many other biological systems. Results: To maximize the convenience for most experimental scientists, a user-friendly web-server for the new predictor has been established at http://www.jci-bioinfo.cn/pLoc_bal-mEuk/. Conclusion: It is anticipated that the pLoc_bal-Euk predictor holds very high potential to become a useful high throughput tool in identifying the subcellular localization of eukaryotic proteins, particularly for finding multi-target drugs that is currently a very hot trend trend in drug development.

Download Full-text

Review of Protein Subcellular Localization Prediction

Current Bioinformatics ◽

10.2174/1574893609666140212000304 ◽

2014 ◽

Vol 9 (3) ◽

pp. 331-342 ◽

Cited By ~ 21

Author(s):

Zhen Wang ◽

Quan Zou ◽

Yi Jiang ◽

Ying Ju ◽

Xiangxiang Zeng

Keyword(s):

Subcellular Localization ◽

Protein Subcellular Localization ◽

Subcellular Localization Prediction ◽

Protein Subcellular Localization Prediction ◽

Localization Prediction

Download Full-text

Integrating Second-order Moving Average and Over-sampling Algorithm to Predict Apoptosis Protein Subcellular Localization

Current Bioinformatics ◽

10.2174/1574893614666190902155811 ◽

2020 ◽

Vol 15 (6) ◽

pp. 517-527

Author(s):

Yunyun Liang ◽

Shengli Zhang

Keyword(s):

Subcellular Localization ◽

Moving Average ◽

Subcellular Location ◽

Second Order ◽

Test Method ◽

Support Vector ◽

Protein Subcellular Localization ◽

Protein Subcellular Location ◽

Apoptosis Protein ◽

Leibler Divergence

Background: Apoptosis proteins have a key role in the development and the homeostasis of the organism, and are very important to understand the mechanism of cell proliferation and death. The function of apoptosis protein is closely related to its subcellular location. Objective: Prediction of apoptosis protein subcellular localization is a meaningful task. Methods: In this study, we predict the apoptosis protein subcellular location by using the PSSMbased second-order moving average descriptor, nonnegative matrix factorization based on Kullback-Leibler divergence and over-sampling algorithms. This model is named by SOMAPKLNMF- OS and constructed on the ZD98, ZW225 and CL317 benchmark datasets. Then, the support vector machine is adopted as the classifier, and the bias-free jackknife test method is used to evaluate the accuracy. Results: Our prediction system achieves the favorable and promising performance of the overall accuracy on the three datasets and also outperforms the other listed models. Conclusion: The results show that our model offers a high throughput tool for the identification of apoptosis protein subcellular localization.

Download Full-text