scholarly journals Diversity, evolution, and classification of virophages uncovered through global metagenomics

Microbiome ◽  
2019 ◽  
Vol 7 (1) ◽  
Author(s):  
David Paez-Espino ◽  
Jinglie Zhou ◽  
Simon Roux ◽  
Stephen Nayfach ◽  
Georgios A. Pavlopoulos ◽  
...  

Abstract Background Virophages are small viruses with double-stranded DNA genomes that replicate along with giant viruses and co-infect eukaryotic cells. Due to the paucity of virophage reference genomes, a collective understanding of the global virophage diversity, distribution, and evolution is lacking. Results Here we screened a public collection of over 14,000 metagenomes using the virophage-specific major capsid protein (MCP) as “bait.” We identified 44,221 assembled virophage sequences, of which 328 represent high-quality (complete or near-complete) genomes from diverse habitats including the human gut, plant rhizosphere, and terrestrial subsurface. Comparative genomic analysis confirmed the presence of four core genes in a conserved block. We used these genes to establish a revised virophage classification including 27 clades with consistent genome length, gene content, and habitat distribution. Moreover, for eight high-quality virophage genomes, we computationally predicted putative eukaryotic virus hosts. Conclusion Overall, our approach has increased the number of known virophage genomes by 10-fold and revealed patterns of genome evolution and global virophage distribution. We anticipate that the expanded diversity presented here will provide the backbone for further virophage studies.

2021 ◽  
Author(s):  
Sean Benler ◽  
Natalya Yutin ◽  
Dmitry Antipov ◽  
Mikhail Raykov ◽  
Sergey Shmakov ◽  
...  

Abstract Background: Double-stranded DNA bacteriophages (dsDNA phages) play pivotal roles in structuring human gut microbiomes; yet, the gut virome is far from being fully characterized, and additional groups of phages, including highly abundant ones, continue to be discovered by metagenome mining. A multilevel framework for taxonomic classification of viruses was recently adopted, facilitating the classification of phages into evolutionary informative taxonomic units based on hallmark genes. Together with advanced approaches for sequence assembly and powerful methods of sequence analysis, this revised framework offers the opportunity to discover and classify unknown phage taxa in the human gut.Results: A search of human gut metagenomes for circular contigs encoding phage hallmark genes resulted in the identification of 3,738 apparently complete phage genomes that represent 451 putative genera. Several of these phage genera are only distantly related to previously identified phages and are likely to found new families. Two of the candidate families, “Flandersviridae” and “Quimbyviridae”, include some of the most common and abundant members of the human gut virome that infect Bacteroides, Parabacteroides and Prevotella. The third proposed family, “Gratiaviridae”, consists of less abundant phages that are distantly related to the families Autographiviridae, Drexlerviridae and Chaseviridae. Analysis of CRISPR spacers indicates that phages of all three putative families infect bacteria of the phylum Bacteroidetes. Comparative genomic analysis of the three candidate phage families revealed features without precedent in phage genomes. Some “Quimbyviridae” phages possess Diversity-Generating Retroelements (DGRs) that generate hypervariable target genes nested within defense-related genes, whereas the previously known targets of phage-encoded DGRs are structural genes. Several “Flandersviridae” phages encode enzymes of the isoprenoid pathway, a lipid biosynthesis pathway that so far has not been known to be manipulated by phages. The “Gratiaviridae” phages encode a HipA-family protein kinase and glycosyltransferase, suggesting these phages modify the host cell wall, preventing superinfection by other phages. Hundreds of phages in these three and other families are shown to encode catalases and iron-sequestering enzymes that can be predicted to enhance cellular tolerance to reactive oxygen species.Conclusions: Analysis of phage genomes identified in whole-community human gut metagenomes resulted in the delineation of at least three new candidate families of Caudovirales and revealed diverse putative mechanisms underlying phage-host interactions in the human gut. Addition of these phylogenetically classified, diverse and distinct phages to public databases will facilitate taxonomic decomposition and functional characterization of human gut viromes.


Author(s):  
Sean Benler ◽  
Natalya Yutin ◽  
Dmitry Antipov ◽  
Mikhail Raykov ◽  
Sergey Shmakov ◽  
...  

AbstractBackgroundDouble-stranded DNA bacteriophages (dsDNA phages) play pivotal roles in structuring human gut microbiomes; yet, the gut phageome is far from being fully characterized, and additional groups of phages, including highly abundant ones, continue to be discovered by metagenome mining. A multilevel framework for taxonomic classification of viruses was recently adopted, facilitating the classification of phages into evolutionary informative taxonomic units based on hallmark genes. Together with advanced approaches for sequence assembly and powerful methods of sequence analysis, this revised framework offers the opportunity to discover and classify unknown phage taxa in the human gut.ResultsA search of human gut metagenomes for circular contigs encoding phage hallmark genes resulted in the identification of 3,738 apparently complete phage genomes that represent 451 putative genera. Several of these phage genera are only distantly related to previously identified phages and are likely to found new families. Two of the candidate families, “Flandersviridae” and “Quimbyviridae”, include some of the most common and abundant members of the human gut virome that infect Bacteroides, Parabacteroides and Prevotella. The third proposed family, “Gratiaviridae”, consists of less abundant phages that are distantly related to the families Autographiviridae, Drexlerviridae and Chaseviridae. Analysis of CRISPR spacers indicates that phages of all three putative families infect bacteria of the phylum Bacteroidetes. Comparative genomic analysis of the three candidate phage families revealed features without precedent in phage genomes. Some “Quimbyviridae” phages possess Diversity-Generating Retroelements (DGRs) that generate hypervariable target genes nested within defense-related genes, whereas the previously known targets of phage-encoded DGRs are structural genes. Several “Flandersviridae” phages encode enzymes of the isoprenoid pathway, a lipid biosynthesis pathway that so far has not been known to be manipulated by phages. The “Gratiaviridae” phages encode a HipA-family protein kinase and glycosyltransferase, suggesting these phages modify the host cell wall, preventing superinfection by other phages. Hundreds of phages in these three and other families are shown to encode catalases and iron-sequestering enzymes that can be predicted to enhance cellular tolerance to reactive oxygen species.ConclusionsAnalysis of phage genomes identified in whole-community human gut metagenomes resulted in the delineation of at least three new candidate families of Caudovirales and revealed diverse putative mechanisms underlying phage-host interactions in the human gut. Addition of these phylogenetically classified, diverse and distinct phages to public databases will facilitate taxonomic decomposition and functional characterization of human gut viromes.


Microbiome ◽  
2021 ◽  
Vol 9 (1) ◽  
Author(s):  
Sean Benler ◽  
Natalya Yutin ◽  
Dmitry Antipov ◽  
Mikhail Rayko ◽  
Sergey Shmakov ◽  
...  

Abstract Background Double-stranded DNA bacteriophages (dsDNA phages) play pivotal roles in structuring human gut microbiomes; yet, the gut virome is far from being fully characterized, and additional groups of phages, including highly abundant ones, continue to be discovered by metagenome mining. A multilevel framework for taxonomic classification of viruses was recently adopted, facilitating the classification of phages into evolutionary informative taxonomic units based on hallmark genes. Together with advanced approaches for sequence assembly and powerful methods of sequence analysis, this revised framework offers the opportunity to discover and classify unknown phage taxa in the human gut. Results A search of human gut metagenomes for circular contigs encoding phage hallmark genes resulted in the identification of 3738 apparently complete phage genomes that represent 451 putative genera. Several of these phage genera are only distantly related to previously identified phages and are likely to found new families. Two of the candidate families, “Flandersviridae” and “Quimbyviridae”, include some of the most common and abundant members of the human gut virome that infect Bacteroides, Parabacteroides, and Prevotella. The third proposed family, “Gratiaviridae,” consists of less abundant phages that are distantly related to the families Autographiviridae, Drexlerviridae, and Chaseviridae. Analysis of CRISPR spacers indicates that phages of all three putative families infect bacteria of the phylum Bacteroidetes. Comparative genomic analysis of the three candidate phage families revealed features without precedent in phage genomes. Some “Quimbyviridae” phages possess Diversity-Generating Retroelements (DGRs) that generate hypervariable target genes nested within defense-related genes, whereas the previously known targets of phage-encoded DGRs are structural genes. Several “Flandersviridae” phages encode enzymes of the isoprenoid pathway, a lipid biosynthesis pathway that so far has not been known to be manipulated by phages. The “Gratiaviridae” phages encode a HipA-family protein kinase and glycosyltransferase, suggesting these phages modify the host cell wall, preventing superinfection by other phages. Hundreds of phages in these three and other families are shown to encode catalases and iron-sequestering enzymes that can be predicted to enhance cellular tolerance to reactive oxygen species. Conclusions Analysis of phage genomes identified in whole-community human gut metagenomes resulted in the delineation of at least three new candidate families of Caudovirales and revealed diverse putative mechanisms underlying phage-host interactions in the human gut. Addition of these phylogenetically classified, diverse, and distinct phages to public databases will facilitate taxonomic decomposition and functional characterization of human gut viromes.


2020 ◽  
Author(s):  
Sean Benler ◽  
Natalya Yutin ◽  
Dmitry Antipov ◽  
Mikhail Raykov ◽  
Sergey Shmakov ◽  
...  

Abstract Background: Double-stranded DNA bacteriophages (dsDNA phages) play pivotal roles in structuring human gut microbiomes; yet, the gut phageome is far from being fully characterized, and additional groups of phages, including highly abundant ones, continue to be discovered by metagenome mining. A multilevel framework for taxonomic classification of viruses was recently adopted, facilitating the classification of phages into evolutionary informative taxonomic units based on hallmark genes. Together with advanced approaches for sequence assembly and powerful methods of sequence analysis, this revised framework offers the opportunity to discover and classify unknown phage taxa in the human gut.Results:A search of human gut metagenomes for circular contigs encoding phage hallmark genes resulted in the identification of 3,738 apparently complete phage genomes that represent 451 putative genera. Several of these phage genera are only distantly related to previously identified phages and are likely to found new families. Two of the candidate families, “Flandersviridae” and “Quimbyviridae”, include some of the most common and abundant members of the human gut virome that infect Bacteroides, Parabacteroides and Prevotella. The third proposed family, “Gratiaviridae”, consists of less abundant phages that are distantly related to the families Autographiviridae, Drexlerviridae and Chaseviridae. Analysis of CRISPR spacers indicates that phages of all three putative families infect bacteria of the phylum Bacteroidetes. Comparative genomic analysis of the three candidate phage families revealed features without precedent in phage genomes. Some “Quimbyviridae” phages possess Diversity-Generating Retroelements (DGRs) that generate hypervariable target genes nested within defense-related genes, whereas the previously known targets of phage-encoded DGRs are structural genes. Several “Flandersviridae” phages encode enzymes of the isoprenoid pathway, a lipid biosynthesis pathway that so far has not been known to be manipulated by phages. The “Gratiaviridae” phages encode a HipA-family protein kinase and glycosyltransferase, suggesting these phages modify the host cell wall, preventing superinfection by other phages. Hundreds of phages in these three and other families are shown to encode catalases and iron-sequestering enzymes that can be predicted to enhance cellular tolerance to reactive oxygen species.Conclusions:Analysis of phage genomes identified in whole-community human gut metagenomes resulted in the delineation of at least three new candidate families of Caudovirales and revealed diverse putative mechanisms underlying phage-host interactions in the human gut. Addition of these phylogenetically classified, diverse and distinct phages to public databases will facilitate taxonomic decomposition and functional characterization of human gut viromes.


2018 ◽  
Author(s):  
Benjamin J Tully

AbstractDespite their discovery over 25 years ago, the Marine Group IIEuryarchaea(MGII) have remained a difficult group of organisms to study, lacking cultured isolates and genome references. The MGII have been identified in marine samples from around the world and evidence supports a photoheterotrophic lifestyle combining phototrophy via proteorhodopsins with the remineralization of high molecular weight organic matter. Divided between two clades, the MGII have distinct ecological patterns that are not understood based on the limited number of available genomes. Here, I present the comparative genomic analysis of 250 MGII genomes, providing the most detailed view of these mesophilic archaea to-date. This analysis identified 17 distinct subclades including nine subclades that previously lacked reference genomes. The metabolic potential and distribution of the MGII genera revealed distinct roles in the environment, identifying algal-saccharide-degrading coastal subclades, protein-degrading oligotrophic surface ocean subclades, and mesopelagic subclades lacking proteorhodopsins common in all other subclades. This study redefines the MGII and provides an avenue for understanding the role these organisms play in the cycling of organic matter throughout the water column.


2021 ◽  
Author(s):  
Xinxin Yi ◽  
Jing Liu ◽  
Shengcai Chen ◽  
Hao Wu ◽  
Min Liu ◽  
...  

Cultivated soybean (Glycine max) is an important source for protein and oil. Many elite cultivars with different traits have been developed for different conditions. Each soybean strain has its own genetic diversity, and the availability of more high-quality soybean genomes can enhance comparative genomic analysis for identifying genetic underpinnings for its unique traits. In this study, we constructed a high-quality de novo assembly of an elite soybean cultivar Jidou 17 (JD17) with chromsome contiguity and high accuracy. We annotated 52,840 gene models and reconstructed 74,054 high-quality full-length transcripts. We performed a genome-wide comparative analysis based on the reference genome of JD17 with three published soybeans (WM82, ZH13 and W05) , which identified five large inversions and two large translocations specific to JD17, 20,984 - 46,912 PAVs spanning 13.1 - 46.9 Mb in size, and 5 - 53 large PAV clusters larger than 500kb. 1,695,741 - 3,664,629 SNPs and 446,689 - 800,489 Indels were identified and annotated between JD17 and them. Symbiotic nitrogen fixation (SNF) genes were identified and the effects from these variants were further evaluated. It was found that the coding sequences of 9 nitrogen fixation-related genes were greatly affected. The high-quality genome assembly of JD17 can serve as a valuable reference for soybean functional genomics research.


mSphere ◽  
2019 ◽  
Vol 4 (6) ◽  
Author(s):  
Sophie L. Nixon ◽  
Rebecca A. Daly ◽  
Mikayla A. Borton ◽  
Lindsey M. Solden ◽  
Susan A. Welch ◽  
...  

ABSTRACT Bacteria of the phylum Verrucomicrobia are prevalent and are particularly common in soil and freshwater environments. Their cosmopolitan distribution and reported capacity for polysaccharide degradation suggests members of Verrucomicrobia are important contributors to carbon cycling across Earth’s ecosystems. Despite their prevalence, the Verrucomicrobia are underrepresented in isolate collections and genome databases; consequently, their ecophysiological roles may not be fully realized. Here, we expand genomic sampling of the Verrucomicrobia phylum by describing a novel genus, “Candidatus Marcellius,” belonging to the order Opitutales. “Ca. Marcellius” was recovered from a shale-derived produced fluid metagenome collected 313 days after hydraulic fracturing, the deepest environment from which a member of the Verrucomicrobia has been recovered to date. We uncover genomic attributes that may explain the capacity of this organism to inhabit a shale gas well, including the potential for utilization of organic polymers common in hydraulic fracturing fluids, nitrogen fixation, adaptation to high salinities, and adaptive immunity via CRISPR-Cas. To illuminate the phylogenetic and environmental distribution of these metabolic and adaptive traits across the Verrucomicrobia phylum, we performed a comparative genomic analysis of 31 publicly available, nearly complete Verrucomicrobia genomes. Our genomic findings extend the environmental distribution of the Verrucomicrobia 2.3 kilometers into the terrestrial subsurface. Moreover, we reveal traits widely encoded across members of the Verrucomicrobia, including the capacity to degrade hemicellulose and to adapt to physical and biological environmental perturbations, thereby contributing to the expansive habitat range reported for this phylum. IMPORTANCE The Verrucomicrobia phylum of bacteria is widespread in many different ecosystems; however, its role in microbial communities remains poorly understood. Verrucomicrobia are often low-abundance community members, yet previous research suggests they play a major role in organic carbon degradation. While Verrucomicrobia remain poorly represented in culture collections, numerous genomes have been reconstructed from metagenomic data sets in recent years. The study of genomes from across the phylum allows for an extensive assessment of their potential ecosystem roles. The significance of this work is (i) the recovery of a novel genus of Verrucomicrobia from 2.3 km in the subsurface with the ability to withstand the extreme conditions that characterize this environment, and (ii) the most extensive assessment of ecophysiological traits encoded by Verrucomicrobia genomes to date. We show that members of this phylum are specialist organic polymer degraders that can withstand a wider range of environmental conditions than previously thought.


2017 ◽  
Vol 5 (1) ◽  
Author(s):  
C. Bodi Winn ◽  
J. Dzink-Fox ◽  
Y. Feng ◽  
Z. Shen ◽  
V. Bakthavatchalu ◽  
...  

ABSTRACT In collaboration with the CDC’s Streptococcus Laboratory, we report here the whole-genome sequences of seven Streptococcus agalactiae bacteria isolated from laboratory-reared Long-Evans rats. Four of the S. agalactiae isolates were associated with morbidity accompanied by endocarditis, metritis, and fatal septicemia, providing an opportunity for comparative genomic analysis of this opportunistic pathogen.


2019 ◽  
Author(s):  
Kshitij Tandon ◽  
Pei-Wen Chiang ◽  
Chih-Ying Lu ◽  
Naohisa Wada ◽  
Shan-Hua Yang ◽  
...  

AbstractDominant coral-associated Endozoicomonas bacteria species are hypothesized to play a role in the coral-sulfur cycle by metabolizing Dimethylsulfoniopropionate (DMSP) into Dimethylsulfide (DMS); however, no sequenced genome to date harbors genes for this process. In this study, we assembled high-quality (>95% complete) genomes of strains of a recently added species Endozoicomonas acroporae (Acr-14T, Acr-1 and Acr-5) isolated from the coral Acropora muricata and performed comparative genomic analysis on genus Endozoicomonas. We identified the first DMSP CoA-transferase/lyase—a dddD gene homolog found in all E. acroporae strains—and functionally characterized bacteria capable of metabolizing DMSP into DMS via the DddD cleavage pathway using RT-qPCR and gas chromatography (GC). Furthermore, we demonstrated that E. acroporae strains can use DMSP as the sole carbon source and have genes arranged in an operon-like manner to link DMSP metabolism to the central carbon cycle. This study confirms the role of Endozoicomonas in the coral sulfur cycle.


Author(s):  
Ziyi Liu ◽  
Ruifei Chen ◽  
Poshi Xu ◽  
Zhiqiang Wang ◽  
Ruichao Li

The spread of plasmid-mediated carbapenem-resistant clinical isolates is a serious threat to global health. In this study, an emerging NDM-encoding IncHI5-like plasmid from Klebsiella pneumoniae of infant patient origin was characterized, and the plasmid was compared to the available IncHI5-like plasmids to better understand the genetic composition and evolution of this emerging plasmid. Clinical isolate C39 was identified as K. pneumoniae and belonged to the ST37 and KL15 serotype. Whole genome sequencing (WGS) and analysis revealed that it harbored two plasmids, one of which was a large IncHI5-like plasmid pC39-334kb encoding a wide variety of antimicrobial resistance genes clustered in a single multidrug resistance (MDR) region. The blaNDM-1 gene was located on a ΔISAba125-blaNDM-1-bleMBL-trpF-dsbC structure. Comparative genomic analysis showed that it shared a similar backbone with four IncHI5-like plasmids and the IncHI5 plasmid pNDM-1-EC12, and these six plasmids differed from typical IncHI5 plasmids. The replication genes of IncHI5-like plasmids shared 97.06% (repHI5B) and 97.99% (repFIB-like) nucleotide identity with those of IncHI5 plasmids. Given that pNDM-1-EC12 and all IncHI5-like plasmids are closely related genetically, the occurrence of IncHI5-like plasmid is likely associated with the mutation of the replication genes of pNDM-1-EC12-like IncHI5 plasmids. All available IncHI5-like plasmids harbored 262 core genes encoding replication and maintenance functions and carried distinct MDR regions. Furthermore, 80% of them (4/5) were found in K. pneumoniae from Chinese nosocomial settings. To conclude, this study expands our knowledge of the evolution history of IncHI5-like plasmids, and more attention should be paid to track the evolution pathway of them among clinical, animal, and environmental settings.


Sign in / Sign up

Export Citation Format

Share Document