scholarly journals A most wanted list of conserved protein families with no known domains

2017 ◽  
Author(s):  
Stacia K. Wyman ◽  
Aram Avila-Herrera ◽  
Stephen Nayfach ◽  
Katherine S. Pollard

AbstractThe number and proportion of genes with no known function are growing rapidly. To quantify this phenomenon and provide criteria for prioritizing genes for functional characterization, we developed a bioinformatics pipeline that identifies robustly defined protein families with no annotated domains, ranks these with respect to phylogenetic breadth, and identifies them in metagenomics data. We applied this approach to 271 965 protein families from the SFams database and discovered many with no functional annotation, including >118 000 families lacking any known protein domain. From these, we prioritized 6 668 conserved protein families with at least three sequences from organisms in at least two distinct classes. These Function Unknown Families (FUnkFams) are present in Tara Oceans Expedition and Human Microbiome Project metagenomes, with distributions associated with sampling environment. Our findings highlight the extent of functional novelty in sequence databases and establish an approach for creating a “most wanted” list of genes to characterize.

2014 ◽  
Vol 7 ◽  
pp. MBI.S18076 ◽  
Author(s):  
Raphael D. Isokpehi ◽  
Udensi K. Udensi ◽  
Shaneka S. Simmons ◽  
Antoinesha L. Hollman ◽  
Antia E. Cain ◽  
...  

The influence of environmental chemicals including arsenic, a type 1 carcinogen, on the composition and function of the human-associated microbiota is of significance in human health and disease. We have developed a suite of bioinformatics and visual analytics methods to evaluate the availability (presence or absence) and abundance of functional annotations in a microbial genome for seven Pfam protein families: As(III)-responsive transcriptional repressor (ArsR), anion-transporting ATPase (ArsA), arsenical pump membrane protein (ArsB), arsenate reductase (ArsC), arsenical resistance operon transacting repressor (ArsD), water/glycerol transport protein (aquaporins), and universal stress protein (USP). These genes encode function for sensing and/or regulating arsenic content in the bacterial cell. The evaluative profiling strategy was applied to 3,274 genomes from which 62 genomes from 18 genera were identified to contain genes for the seven protein families. Our list included 12 genomes in the Human Microbiome Project (HMP) from the following genera: Citrobacter, Escherichia, Lactobacillus, Providencia, Rhodococcus, and Staphylococcus. Gene neighborhood analysis of the arsenic resistance operon in the genome of Bacteroides thetaiotaomicron VPI-5482, a human gut symbiont, revealed the adjacent arrangement of genes for arsenite binding/transfer (ArsD) and cytochrome c biosynthesis (DsbD_2). Visual analytics facilitated evaluation of protein annotations in 367 genomes in the phylum Bacteroidetes identified multiple genomes in which genes for ArsD and DsbD_2 were adjacently arranged. Cytochrome c, produced by a posttranslational process, consists of heme-containing proteins important for cellular energy production and signaling. Further research is desired to elucidate arsenic resistance and arsenic-mediated cellular energy production in the Bacteroidetes.


2021 ◽  
Author(s):  
Joachim Johansen ◽  
Damian R Plichta ◽  
Jakob Nybo Nissen ◽  
Marie Louise Jespersen ◽  
Shiraz A Shah ◽  
...  

Despite the accelerating number of uncultivated virus sequences discovered in metagenomics and their apparent importance for health and disease, the human gut virome and its interactions with bacteria in the gastrointestinal are not well understood. In addition, a paucity of whole-virome datasets from subjects with gastrointestinal diseases is preventing a deeper understanding of the virome role in disease and in gastrointestinal ecology as a whole. By combining a deep-learning based metagenomics binning algorithm with paired metagenome and metavirome datasets we developed the Phages from Metagenomics Binning (PHAMB) approach for binning thousands of viral genomes directly from bulk metagenomics data. Simultaneously our methodology enables clustering of viral genomes into accurate taxonomic viral populations. We applied this methodology on the Human Microbiome Project 2 (HMP2) cohort and recovered 6,077 HQ genomes from 1,024 viral populations and explored viral-host interactions. We show that binning can be advantageously applied to existing and future metagenomes to illuminate viral ecological dynamics with other microbiome constituents.


Pathogens ◽  
2021 ◽  
Vol 10 (2) ◽  
pp. 86
Author(s):  
Erin M. Garcia ◽  
Myrna G. Serrano ◽  
Laahirie Edupuganti ◽  
David J. Edwards ◽  
Gregory A. Buck ◽  
...  

Gardnerella vaginalis has recently been split into 13 distinct species. In this study, we tested the hypotheses that species-specific variations in the vaginolysin (VLY) amino acid sequence could influence the interaction between the toxin and vaginal epithelial cells and that VLY variation may be one factor that distinguishes less virulent or commensal strains from more virulent strains. This was assessed by bioinformatic analyses of publicly available Gardnerella spp. sequences and quantification of cytotoxicity and cytokine production from purified, recombinantly produced versions of VLY. After identifying conserved differences that could distinguish distinct VLY types, we analyzed metagenomic data from a cohort of female subjects from the Vaginal Human Microbiome Project to investigate whether these different VLY types exhibited any significant associations with symptoms or Gardnerella spp.-relative abundance in vaginal swab samples. While Type 1 VLY was most prevalent among the subjects and may be associated with increased reports of symptoms, subjects with Type 2 VLY dominant profiles exhibited increased relative Gardnerella spp. abundance. Our findings suggest that amino acid differences alter the interaction of VLY with vaginal keratinocytes, which may potentiate differences in bacterial vaginosis (BV) immunopathology in vivo.


2018 ◽  
Vol 85 (10) ◽  
Author(s):  
Reed M. Stubbendieck ◽  
Daniel S. May ◽  
Marc G. Chevrette ◽  
Mia I. Temkin ◽  
Evelyn Wendt-Pienkowski ◽  
...  

ABSTRACTResources available in the human nasal cavity are limited. Therefore, to successfully colonize the nasal cavity, bacteria must compete for scarce nutrients. Competition may occur directly through interference (e.g., antibiotics) or indirectly by nutrient sequestration. To investigate the nature of nasal bacterial competition, we performed coculture inhibition assays between nasalActinobacteriaandStaphylococcusspp. We found that isolates of coagulase-negative staphylococci (CoNS) were sensitive to growth inhibition byActinobacteriabut thatStaphylococcus aureusisolates were resistant to inhibition. AmongActinobacteria, we observed thatCorynebacteriumspp. were variable in their ability to inhibit CoNS. We sequenced the genomes of 10Corynebacteriumspecies isolates, including 3Corynebacterium propinquumisolates that strongly inhibited CoNS and 7 otherCorynebacteriumspecies isolates that only weakly inhibited CoNS. Using a comparative genomics approach, we found that theC. propinquumgenomes were enriched in genes for iron acquisition and harbored a biosynthetic gene cluster (BGC) for siderophore production, absent in the noninhibitoryCorynebacteriumspecies genomes. Using a chrome azurol S assay, we confirmed thatC. propinquumproduced siderophores. We demonstrated that iron supplementation rescued CoNS from inhibition byC. propinquum, suggesting that inhibition was due to iron restriction through siderophore production. Through comparative metabolomics and molecular networking, we identified the siderophore produced byC. propinquumas dehydroxynocardamine. Finally, we confirmed that the dehydroxynocardamine BGC is expressedin vivoby analyzing human nasal metatranscriptomes from the NIH Human Microbiome Project. Together, our results suggest that bacteria produce siderophores to compete for limited available iron in the nasal cavity and improve their fitness.IMPORTANCEWithin the nasal cavity, interference competition through antimicrobial production is prevalent. For instance, nasalStaphylococcusspecies strains can inhibit the growth of other bacteria through the production of nonribosomal peptides and ribosomally synthesized and posttranslationally modified peptides. In contrast, bacteria engaging in exploitation competition modify the external environment to prevent competitors from growing, usually by hindering access to or depleting essential nutrients. As the nasal cavity is a nutrient-limited environment, we hypothesized that exploitation competition occurs in this system. We determined thatCorynebacterium propinquumproduces an iron-chelating siderophore, and this iron-sequestering molecule correlates with the ability to inhibit the growth of coagulase-negative staphylococci. Furthermore, we found that the genes required for siderophore production are expressedin vivo. Thus, although siderophore production by bacteria is often considered a virulence trait, our work indicates that bacteria may produce siderophores to compete for limited iron in the human nasal cavity.


2021 ◽  
Author(s):  
Saumya Agrawal ◽  
Tanvir Alam ◽  
Masaru Koido ◽  
Ivan V. Kulakovskiy ◽  
Jessica Severin ◽  
...  

AbstractTranscription of the human genome yields mostly long non-coding RNAs (lncRNAs). Systematic functional annotation of lncRNAs is challenging due to their low expression level, cell type-specific occurrence, poor sequence conservation between orthologs, and lack of information about RNA domains. Currently, 95% of human lncRNAs have no functional characterization. Using chromatin conformation and Cap Analysis of Gene Expression (CAGE) data in 18 human cell types, we systematically located genomic regions in spatial proximity to lncRNA genes and identified functional clusters of interacting protein-coding genes, lncRNAs and enhancers. Using these clusters we provide a cell type-specific functional annotation for 7,651 out of 14,198 (53.88%) lncRNAs. LncRNAs tend to have specialized roles in the cell type in which it is first expressed, and to incorporate more general functions as its expression is acquired by multiple cell types during evolution. By analyzing RNA-binding protein and RNA-chromatin interaction data in the context of the spatial genomic interaction map, we explored mechanisms by which these lncRNAs can act.


2017 ◽  
Author(s):  
Victoria Cepeda ◽  
Bo Liu ◽  
Mathieu Almeida ◽  
Christopher M. Hill ◽  
Sergey Koren ◽  
...  

ABSTRACTMetagenomic studies have primarily relied on de novo approaches for reconstructing genes and genomes from microbial mixtures. While database driven approaches have been employed in certain analyses, they have not been used in the assembly of metagenomes. Here we describe the first effective approach for reference-guided metagenomic assembly of low-abundance bacterial genomes that can complement and improve upon de novo metagenomic assembly methods. When combined with de novo assembly approaches, we show that MetaCompass can generate more complete assemblies than can be obtained by de novo assembly alone, and improve on assemblies from the Human Microbiome Project (over 2,000 samples).


2021 ◽  
Author(s):  
Leyuan Li ◽  
Zhibin Ning ◽  
Xu Zhang ◽  
James Butcher ◽  
Caitlin Simopoulos ◽  
...  

Functional redundancy is a key property of ecosystems and represents the fact that phylogenetically unrelated taxa can play similar functional roles within an ecosystem. The redundancy of potential functions of human microbiome has been recently quantified using metagenomics data. Yet, the redundancy of functions which are actually expressed within the human microbiome remains largely unexplored. Here, we quantify the protein-level functional redundancy in the human gut microbiome using metaproteomics and network approaches. In particular, our ultra-deep metaproteomics approach revealed high protein-level functional redundancy and high nestedness in proteomic content networks - bipartite graphs that connect taxa with their expressed functions. We further examined multiple metaproteomics datasets and showed that various environmental factors, including individuality, biogeography, xenobiotics, and disease, significantly altered the protein-level functional redundancy. Finally, by projecting the bipartite proteomic content networks into unipartite weighted genus networks, functional hub genera across individual microbiomes were discovered, suggesting that there may be a universal principle of functional organization in microbiome assembly.


2018 ◽  
Author(s):  
Maria R. Handrich ◽  
Sriram G. Garg ◽  
Ewen W. Sommerville ◽  
Robert P. Hirt ◽  
Sven B. Gould

AbstractTrichomonas vaginalisis one of the most widespread, sexually transmitted pathogens. The infection involves a morphological switch from a free-swimming pyriform trophozoite to an amoeboid cell upon adhesion to host epithelial cells. While details on how the switch is induced and to what proteins of the host surface the parasite adheres remain poorly characterized, several surface proteins of the parasite itself have been identified as potential candidates. Among those are two expanded protein families that harbor domains that share similarity to functionally investigated surface proteins of prokaryotic oral pathogens; these are the BspA proteins of Bacteroidales and Spirochaetales, and the Pmp proteins of Chlamydiales. We sequenced the transcriptomes of five Trichomonads and screened for the presence of BspA and Pmp domain-containing proteins and tested the ability of individualT. vaginaliscandidates to mediate adhesion. Here we demonstrate that (i) BspA and Pmp domain-containing proteins are specifically expanded inT. vaginalisin comparison to other Trichomonads, and that (ii) individual proteins of both families have the ability to increase adhesion performance in a non-virulentT. vaginalisstrain andTetratrichomonas gallinarum, a parasite usually known to infect birds but not humans. Our results initiate the functional characterization of these two broadly distributed protein families, whose origin we trace back to the origin of Trichomonads themselves.


Sign in / Sign up

Export Citation Format

Share Document