Whole Proteome Clustering of 2,307 Genomes Reveals Remarkable Conservation of Four Proteins Among Proteobacteria While Revealing Significant Annotation Issues

2018
Author(s): Svetlana Lockwood, Kelly A. Brayton, Jeff A. Daily, Shira L. Broschat

Abstract: To explore the concept of a minimal gene set, we clustered 8.76 M protein sequences deduced from 2,307 completely sequenced Proteobacterial genomes. To our knowledge, this is the first study of this scale. Clustering resulted in 707,311 clusters, of which 224,442 ranged in size from 2 to 2,894 sequences. The resulting clusters allowed us to ask the question: Is a set of proteins conserved across all Proteobacteria? We chose four essential proteins, the chaperonin GroEL, DNA-dependent RNA polymerase subunits beta and beta’ (RpoB/RpoB’), and DNA polymerase I (PolA), representing fundamental cellular functions, and examined their distribution in the clusters. We found these proteins to be remarkably conserved. Although the groEL gene was universally conserved in all the organisms in the study, the protein was not represented in all the deduced proteomes. The genes for RpoB and RpoB’ were missing from two genomes and merged in 88 genomes, and the sequences were sufficiently divergent that they formed separate clusters for 18 RpoB proteins (seven clusters) and 14 RpoB’ proteins (three clusters). For PolA, 52 organisms lacked an identifiable sequence, and seven sequences were sufficiently divergent that they formed five separate clusters. Interestingly, organisms lacking an identifiable PolA and those with divergent RpoB/RpoB’ were almost all endosymbionts. Furthermore, we present a range of examples of annotation issues that caused the deduced proteins to be incorrectly represented in the proteome. These annotation issues represent a significant obstacle for high-throughput analyses.
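The question of whether a protein is represented in every genome can be illustrated with a minimal sketch. It assumes cluster assignments are available as (cluster ID, genome ID) pairs, one per protein sequence; that input layout, the function names, and the toy identifiers are hypothetical and are not taken from the study's pipeline.

```python
from collections import defaultdict


def genomes_per_cluster(assignments):
    """Map each cluster ID to the set of genomes contributing a sequence.

    `assignments` is an iterable of (cluster_id, genome_id) pairs, one per
    protein sequence. This layout is an assumption made for illustration.
    """
    coverage = defaultdict(set)
    for cluster_id, genome_id in assignments:
        coverage[cluster_id].add(genome_id)
    return dict(coverage)


def missing_genomes(coverage, cluster_ids, all_genomes):
    """Return genomes with no sequence in any of the given clusters.

    Several clusters can be passed at once, since divergent sequences of
    the same protein may fall into separate clusters.
    """
    present = set()
    for cid in cluster_ids:
        present |= coverage.get(cid, set())
    return set(all_genomes) - present


# Toy data with hypothetical identifiers (not from the study).
toy = [("c1", "gA"), ("c1", "gB"), ("c2", "gC"), ("c3", "gA")]
cov = genomes_per_cluster(toy)
all_g = {"gA", "gB", "gC", "gD"}

# Treat clusters c1 and c2 as holding the same protein family.
print(sorted(missing_genomes(cov, ["c1", "c2"], all_g)))  # prints ['gD']
```

A genome reported as missing here would still need manual inspection, since the abstract notes that annotation issues can hide a protein whose gene is actually present.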

