scholarly journals Metabolic marker gene mining provides insight in global mcrA diversity and, coupled with targeted genome reconstruction, sheds light on metabolic versatility of theMethanomassiliicoccales

2018 ◽  
Author(s):  
Daan R. Speth ◽  
Victoria J. Orphan

AbstractOver the past years, metagenomics has revolutionized our view of microbial diversity. Moreover, extracting near-complete genomes from metagenomes has led to the discovery of known metabolic traits in unsuspected lineages. Genome-resolved metagenomics relies on assembly of the sequencing reads and subsequent binning of assembled contigs, which might be hampered by strain heterogeneity or low abundance of a target organism. Here we present a complementary approach, metagenome marker gene mining, and use it to assess the global diversity of archaeal methane metabolism through the mcrA gene. To this end, we have screened 18,465 metagenomes for the presence of reads matching a database representative of all known mcrA proteins and reconstructed gene sequences from the matching reads. We use our mcrA dataset to assess the environmental distribution of theMethanomassiliicoccalesand reconstruct and analyze a draft genome belonging to the ‘Lake Pavin cluster’, an understudied environmental clade of theMethanomassiliicoccales. Thus, we show that marker gene mining can enhance the discovery power of metagenomics, by identifying novel lineages and aiding selection of targets for in-depth analyses. Marker gene mining is less sensitive to strain heterogeneity and has a lower abundance threshold than genome-resolved metagenomics, as it only requires short contigs and there is no binning step. Additionally, it is computationally cheaper than genome resolved metagenomics, since only a small subset of reads needs to be assembled. It is therefore a suitable approach to extract knowledge from the many publicly available sequencing projects.

PeerJ ◽  
2018 ◽  
Vol 6 ◽  
pp. e5614 ◽  
Author(s):  
Daan R. Speth ◽  
Victoria J. Orphan

Over the past years, metagenomics has revolutionized our view of microbial diversity. Moreover, extracting near-complete genomes from metagenomes has led to the discovery of known metabolic traits in unsuspected lineages. Genome-resolved metagenomics relies on assembly of the sequencing reads and subsequent binning of assembled contigs, which might be hampered by strain heterogeneity or low abundance of a target organism. Here we present a complementary approach, metagenome marker gene mining, and use it to assess the global diversity of archaeal methane metabolism through themcrAgene. To this end, we have screened 18,465 metagenomes for the presence of reads matching a database representative of all known mcrA proteins and reconstructed gene sequences from the matching reads. We use our mcrA dataset to assess the environmental distribution of theMethanomassiliicoccalesand reconstruct and analyze a draft genome belonging to the ‘Lake Pavin cluster’, an uncultivated environmental clade of theMethanomassiliicoccales. Analysis of the ‘Lake Pavin cluster’ draft genome suggests that this organism has a more restricted capacity for hydrogenotrophic methylotrophic methanogenesis than previously studiedMethanomassiliicoccales, with only genes for growth on methanol present. However, the presence of the soluble subunits of methyltetrahydromethanopterin:CoM methyltransferase (mtrAH)provide hypothetical pathways for methanol fermentation, and aceticlastic methanogenesis that await experimental verification. Thus, we show that marker gene mining can enhance the discovery power of metagenomics, by identifying novel lineages and aiding selection of targets for in-depth analyses. Marker gene mining is less sensitive to strain heterogeneity and has a lower abundance threshold than genome-resolved metagenomics, as it only requires short contigs and there is no binning step. Additionally, it is computationally cheaper than genome resolved metagenomics, since only a small subset of reads needs to be assembled. It is therefore a suitable approach to extract knowledge from the many publicly available sequencing projects.


Author(s):  
Bennett J Kapili ◽  
Anne E Dekas

Abstract Motivation Linking microbial community members to their ecological functions is a central goal of environmental microbiology. When assigned taxonomy, amplicon sequences of metabolic marker genes can suggest such links, thereby offering an overview of the phylogenetic structure underpinning particular ecosystem functions. However, inferring microbial taxonomy from metabolic marker gene sequences remains a challenge, particularly for the frequently sequenced nitrogen fixation marker gene, nitrogenase reductase (nifH). Horizontal gene transfer in recent nifH evolutionary history can confound taxonomic inferences drawn from the pairwise identity methods used in existing software. Other methods for inferring taxonomy are not standardized and require manual inspection that is difficult to scale. Results We present Phylogenetic Placement for Inferring Taxonomy (PPIT), an R package that infers microbial taxonomy from nifH amplicons using both phylogenetic and sequence identity approaches. After users place query sequences on a reference nifH gene tree provided by PPIT (n = 6317 full-length nifH sequences), PPIT searches the phylogenetic neighborhood of each query sequence and attempts to infer microbial taxonomy. An inference is drawn only if references in the phylogenetic neighborhood are: (1) taxonomically consistent and (2) share sufficient pairwise identity with the query, thereby avoiding erroneous inferences due to known horizontal gene transfer events. We find that PPIT returns a higher proportion of correct taxonomic inferences than BLAST-based approaches at the cost of fewer total inferences. We demonstrate PPIT on deep-sea sediment and find that Deltaproteobacteria are the most abundant potential diazotrophs. Using this dataset we show that emending PPIT inferences based on visual inspection of query sequence placement can achieve taxonomic inferences for nearly all sequences in a query set. We additionally discuss how users can apply PPIT to the analysis of other marker genes. Availability PPIT is freely available to non-commercial users at https://github.com/bkapili/ppit. Installation includes a vignette that demonstrates package use and reproduces the nifH amplicon analysis discussed here. The raw nifH amplicon sequence data have been deposited in the GenBank, EMBL, and DDBJ databases under BioProject number PRJEB37167. Supplementary information Supplementary data are available at Bioinformatics online.


2016 ◽  
Vol 4 (6) ◽  
Author(s):  
Bo Liu ◽  
Guo-Hong Liu ◽  
Yu-jing Zhu ◽  
Jie-Ping Wang ◽  
Jian-Mei Che ◽  
...  

Here, we report the draft genome sequences of Bacillus drentensis DSM 15600 T and Bacillus novalis DSM 15603 T with 5,305,306 bp and 5,667,584 bp, respectively, which will provide useful information for the functional gene mining and application of these two species. The average DNA G+C contents were 38.91% and 40.01%, respectively.


Author(s):  
Simon Horobin

‘Authorities’ considers how the rules of usage are established. It first describes the role of dictionaries, frequently held to be the ultimate authority. It then discusses linguistic academies, governing bodies that make pronouncements about correct usage. An alternative source to the dictionary is the usage guide, which tends to adopt a more prescriptive approach and which focuses on a small subset of frequently disputed points of usage. Other ways of determining acceptability is to turn to examples of actual usage, logic, or etymology. Despite the many controversies concerning correct usage and how it should be established, each of the authorities discussed would agree that there is a single authoritative form of the language, known as Standard English.


2016 ◽  
Vol 4 (3) ◽  
Author(s):  
Guo-hong Liu ◽  
Bo Liu ◽  
Yu-jing Zhu ◽  
Jie-ping Wang ◽  
Jian-mei Che ◽  
...  

Bacillus mesonae FJAT-13985 T is a Gram-positive, spore-forming, and aerobic bacterium. Here, we report the draft genome sequence of B. mesonae FJAT-13985 T with 5,807,726 bp, which will provide useful information for setting up phylogenomics in the genomic taxonomy of the Bacillus -like bacteria, as well as for the functional gene mining and application of B. mesonae FJAT-13985 T .


Genomics Data ◽  
2016 ◽  
Vol 9 ◽  
pp. 148-153 ◽  
Author(s):  
Radwa A. Hanafy ◽  
M.B. Couger ◽  
Kristina Baker ◽  
Chelsea Murphy ◽  
Shannon D. O'Kane ◽  
...  

2020 ◽  
Vol 33 (3) ◽  
Author(s):  
Mariana Tavares ◽  
Mariya Kozak ◽  
Alexandra Balola ◽  
Isabel Sá-Correia

SUMMARY Burkholderia cepacia (formerly Pseudomonas cepacia) was once thought to be a single bacterial species but has expanded to the Burkholderia cepacia complex (Bcc), comprising 24 closely related opportunistic pathogenic species. These bacteria have a widespread environmental distribution, an extraordinary metabolic versatility, a complex genome with three chromosomes, and a high capacity for rapid mutation and adaptation. Additionally, they present an inherent resistance to antibiotics and antiseptics, as well as the abilities to survive under nutrient-limited conditions and to metabolize the organic matter present in oligotrophic aquatic environments, even using certain antimicrobials as carbon sources. These traits constitute the reason that Bcc bacteria are considered feared contaminants of aqueous pharmaceutical and personal care products and the frequent reason behind nonsterile product recalls. Contamination with Bcc has caused numerous nosocomial outbreaks in health care facilities, presenting a health threat, particularly for patients with cystic fibrosis and chronic granulomatous disease and for immunocompromised individuals. This review addresses the role of Bcc bacteria as a potential public health problem, the mechanisms behind their success as contaminants of pharmaceutical products, particularly in the presence of biocides, the difficulties encountered in their detection, and the preventive measures applied during manufacturing processes to control contamination with these objectionable microorganisms. A summary of Bcc-related outbreaks in different clinical settings, due to contamination of diverse types of pharmaceutical products, is provided.


Sign in / Sign up

Export Citation Format

Share Document