Whole genome sequencing of a novel, dichloromethane-fermenting Peptococcaceae from an enrichment culture

Bacteria capable of dechlorinating the toxic environmental contaminant dichloromethane (DCM, CH2Cl2) are of great interest for potential bioremediation applications. A novel, strictly anaerobic, DCM-fermenting bacterium, "DCMF", was enriched from organochlorine-contaminated groundwater near Botany Bay, Australia. The enrichment culture was maintained in minimal, mineral salt medium amended with dichloromethane as the sole energy source. PacBio whole genome SMRTTM sequencing of DCMF allowed de novo, gap-free assembly despite the presence of cohabiting organisms in the culture. Illumina sequencing reads were utilised to correct minor indels. The single, circularised 6.44 Mb chromosome was annotated with the IMG pipeline and contains 5,773 predicted protein-coding genes. Based on 16S rRNA gene and predicted proteome phylogeny, the organism appears to be a novel member of the Peptococcaceae family. The DCMF genome is large in comparison to known DCM-fermenting bacteria and includes 96 predicted methylamine methyltransferases, which may provide clues to the basis of its DCM metabolism. Full annotation has been provided in a custom genome browser and search tool, in addition to multiple sequence alignments and phylogenetic trees for every predicted protein, available at http://www.slimsuite.unsw.edu.au/research/dcmf/.

Download Full-text

Whole genome sequencing of a novel, dichloromethane-fermenting Peptococcaceae from an enrichment culture

PeerJ ◽

10.7717/peerj.7775 ◽

2019 ◽

Vol 7 ◽

pp. e7775 ◽

Cited By ~ 1

Author(s):

Sophie I. Holland ◽

Richard J. Edwards ◽

Haluk Ertan ◽

Yie Kuan Wong ◽

Tonia L. Russell ◽

...

Keyword(s):

Phylogenetic Trees ◽

De Novo ◽

Enrichment Culture ◽

Rrna Gene ◽

Strictly Anaerobic ◽

Whole Genome ◽

Sequence Alignments ◽

Multiple Sequence ◽

Protein Coding ◽

Fermenting Bacteria

Bacteria capable of dechlorinating the toxic environmental contaminant dichloromethane (DCM, CH2Cl2) are of great interest for potential bioremediation applications. A novel, strictly anaerobic, DCM-fermenting bacterium, “DCMF”, was enriched from organochlorine-contaminated groundwater near Botany Bay, Australia. The enrichment culture was maintained in minimal, mineral salt medium amended with dichloromethane as the sole energy source. PacBio whole genome SMRTTM sequencing of DCMF allowed de novo, gap-free assembly despite the presence of cohabiting organisms in the culture. Illumina sequencing reads were utilised to correct minor indels. The single, circularised 6.44 Mb chromosome was annotated with the IMG pipeline and contains 5,773 predicted protein-coding genes. Based on 16S rRNA gene and predicted proteome phylogeny, the organism appears to be a novel member of the Peptococcaceae family. The DCMF genome is large in comparison to known DCM-fermenting bacteria. It includes an abundance of methyltransferases, which may provide clues to the basis of its DCM metabolism, as well as potential to metabolise additional methylated substrates such as quaternary amines. Full annotation has been provided in a custom genome browser and search tool, in addition to multiple sequence alignments and phylogenetic trees for every predicted protein, http://www.slimsuite.unsw.edu.au/research/dcmf/.

Download Full-text

Whole genome sequencing of a novel, dichloromethane-fermenting Peptococcaceae from an enrichment culture

10.7287/peerj.preprints.27718v1 ◽

2019 ◽

Author(s):

Sophie I Holland ◽

Richard J Edwards ◽

Haluk Ertan ◽

Yie Kuan Wong ◽

Tonia L Russell ◽

...

Keyword(s):

Phylogenetic Trees ◽

De Novo ◽

Enrichment Culture ◽

Rrna Gene ◽

Strictly Anaerobic ◽

Whole Genome ◽

Sequence Alignments ◽

Multiple Sequence ◽

Protein Coding ◽

Fermenting Bacteria

Download Full-text

Morphological redescription and molecular characterization of Trichodina matsu Basson & Van As, 1994 (Ciliophora, Mobilida, Trichodinidae) infecting Tachysurus fulvidraco (Richardson, 1846) from Chongqing, China

Zootaxa ◽

10.11646/zootaxa.4995.2.6 ◽

2021 ◽

Vol 4995 (2) ◽

pp. 334-344

Author(s):

QIAN ZHOU ◽

FAHUI TANG ◽

YUANJUN ZHAO

Keyword(s):

Phylogenetic Analyses ◽

Molecular Data ◽

18S Rrna Gene ◽

Rrna Gene ◽

Similar Species ◽

Sequence Alignments ◽

Multiple Sequence ◽

Multiple Sequence Alignments ◽

Morphological And Molecular Data ◽

Tachysurus Fulvidraco

During a survey of parasitic ciliates in Chongqing, China, Trichodina matsu Basson & Van As, 1994 was isolated from gills of Tachysurus fulvidraco. Furthermore, the 18S rRNA gene and ITS-5.8S rRNA region of T. matsu were sequenced for the first time and applied for the species identification and comparison with similar species in the present study. Based on the morphological and molecular comparisons, the results indicate that T. matsu is an ectoparasite specific for the Siluriformes catfish. Based on the analyses of genetic distance, multiple sequence alignments, and phylogenetic analyses, no obvious differentiation within populations of T. matsu was found. In addition, the ‘Trichodina hyperparasitis’ (KX904933) in GenBank is a misidentification and appears to be conspecific with T. matsu according to the comparison of morphological and molecular data.

Download Full-text

Genomic Analysis of Sarcomyxa edulis Reveals the Basis of Its Medicinal Properties and Evolutionary Relationships

Frontiers in Microbiology ◽

10.3389/fmicb.2021.652324 ◽

2021 ◽

Vol 12 ◽

Author(s):

Fenghua Tian ◽

Changtian Li ◽

Yu Li

Keyword(s):

Single Molecule ◽

De Novo ◽

Genomic Analysis ◽

Single Copy ◽

Whole Genome Sequence ◽

Type I ◽

Whole Genome ◽

Uridine Diphosphate ◽

Protein Coding ◽

Medicinal Value

Yuanmo [Sarcomyxa edulis (Y.C. Dai, Niemelä & G.F. Qin) T. Saito, Tonouchi & T. Harada] is an important edible and medicinal mushroom endemic to Northeastern China. Here we report the de novo sequencing and assembly of the S. edulis genome using single-molecule real-time sequencing technology. The whole genome was approximately 35.65 Mb, with a G + C content of 48.31%. Genome assembly generated 41 contigs with an N50 length of 1,772,559 bp. The genome comprised 9,364 annotated protein-coding genes, many of which encoded enzymes involved in the modification, biosynthesis, and degradation of glycoconjugates and carbohydrates or enzymes predicted to be involved in the biosynthesis of secondary metabolites such as terpene, type I polyketide, siderophore, and fatty acids, which are responsible for the pharmacodynamic activities of S. edulis. We also identified genes encoding 1,3-β-glucan synthase and endo-1,3(4)-β-glucanase, which are involved in polysaccharide and uridine diphosphate glucose biosynthesis. Phylogenetic and comparative analyses of Basidiomycota fungi based on a single-copy orthologous protein indicated that the Sarcomyxa genus is an independent group that evolved from the Pleurotaceae family. The annotated whole-genome sequence of S. edulis can serve as a reference for investigations of bioactive compounds with medicinal value and the development and commercial production of superior S. edulis varieties.

Download Full-text

Draft genome assembly data of Anoxybacillus sp. strain MB8 isolated from Tattapani hot springs, India

10.1101/2021.06.09.447659 ◽

2021 ◽

Author(s):

VISHNU PRASOODANAN P K ◽

Shruti S. Menon ◽

Rituja Saxena ◽

Prashant Waiker ◽

Vineet K Sharma

Keyword(s):

Hot Springs ◽

De Novo ◽

Draft Genome ◽

Gc Content ◽

Central India ◽

Glycoside Hydrolases ◽

Rrna Gene ◽

Aerobic Bacterium ◽

Protein Coding ◽

Protein Coding Genes

Discovery of novel thermophiles has shown promising applications in the field of biotechnology. Due to their thermal stability, they can survive the harsh processes in the industries, which make them important to be characterized and studied. Members of Anoxybacillus are alkaline tolerant thermophiles and have been extensively isolated from manure, dairy-processed plants, and geothermal hot springs. This article reports the assembled data of an aerobic bacterium Anoxybacillus sp. strain MB8, isolated from the Tattapani hot springs in Central India, where the 16S rRNA gene shares an identity of 97% (99% coverage) with Anoxybacillus kamchatkensis strain G10. The de novo assembly and annotation performed on the genome of Anoxybacillus sp. strain MB8 comprises of 2,898,780 bp (in 190 contigs) with a GC content of 41.8% and includes 2,976 protein-coding genes,1 rRNA operon, 73 tRNAs, 1 tm-RNA and 10 CRISPR arrays. The predicted protein-coding genes have been classified into 21 eggNOG categories. The KEGG Automated Annotation Server (KAAS) analysis indicated the presence of assimilatory sulfate reduction pathway, nitrate reducing pathway, and genes for glycoside hydrolases (GHs) and glycoside transferase (GTs). GHs and GTs hold widespread applications, in the baking and food industry for bread manufacturing, and in the paper, detergent and cosmetic industry. Hence, Anoxybacillus sp. strain MB8 holds the potential to be screened and characterized for such commercially relevant enzymes.

Download Full-text

Is cytochrome c oxidase subunit I (COI) the right DNA barcoding marker for the Chaetopteryx villosa group?

ARPHA Conference Abstracts ◽

10.3897/aca.4.e64707 ◽

2021 ◽

Vol 4 ◽

Author(s):

Dalila Destanović ◽

Lejla Ušanović ◽

Lejla Lasić ◽

Jasna Hanjalić ◽

Belma Kalamujić Stroil

Keyword(s):

Phylogenetic Trees ◽

Pairwise Distance ◽

Zoological Museum ◽

Species Determination ◽

Sequence Alignments ◽

Multiple Sequence ◽

Software Analysis ◽

Group Data ◽

Species Specific ◽

The Right

Chaetopteryx villosa (Fabricius, 1798) is a caddisfly species distributed throughout Europe, except in the Balkan and Apennine Peninsula. However, phylogenetically close species belonging to the C. villosa group are widespread throughout entire Europe. Species of this group (C. villosa, C. gessneri, C. fusca, C. sahlbergi, C. atlantica, C. bosniaca, C. vulture, and C. trinacriae) have distinct distributions with some overlaps. Adult forms of these species are morphologically similar, whereas larval morphology is only known for some species. There are also indications of species hybridization (e.g., C. villosa x fusca). Presumably, the molecular approach for the species determination of this group would be highly beneficial. In the BOLD database, there are 154 specimens with COI-5P barcodes of C. villosa species. Out of the remaining species, C. sahlbergi has 27 specimens with a barcode, C. fusca 20, C. gessneri 5, C. bosniaca 5, and C. atlantica 1, whereas sequences from the species C. vulture and C. trinacriae are missing. Therefore, we tested the power of discrimination of the COI-5P marker in the C. villosa group, as the most common barcoding markers for species identification in animals. Only sequences from public records originating from experienced research groups or taxonomists and containing a specimen photograph were taken as input. A total of 75 sequences from the BOLD database were obtained. Out of these sequences, 11 belonged to C. fusca, 5 to C. gessneri, 52 to C. villosa, 5 to C. bosniaca, and 2 to C. sahlbergi. For the generation of overview trees, COI-5P barcodes of Rhyacophila fasciata and Rh. nubila were used as outgroups. All sequences were trimmed at 5’ and 3’ ends, resulting in a final alignment length of 516 base pairs. Multiple sequence alignments and editing were done in the MEGA-X software. Analysis of nucleotide polymorphism was done in DNASP6 software. MEGA-X was used to calculate the pairwise distance and overall mean p-distance, and to construct the overview trees. Analysis of DNA polymorphism revealed 14 haplotypes of C. villosa, 3 haplotypes of C. fusca, 2 haplotypes of C. gessneri, and one for species C. bosniaca and C. sahlbergi. There were no significant interspecific and intraspecific differences among haplotypes based on pairwise distances. The p-distance between one of the haplotypes of C. fusca and C. villosa was 0.000, whereas the p-distance among haplotypes of C. villosa varied from 0.001 to about 0.055. The mean overall p-distance among haplotypes of all species equaled 0.03. No species-specific clusters were observed when phylogenetic trees were constructed except for C. gessneri, regardless of the method used (i.e., NJ, UPGMA, ML, ME, or MP). To minimize the possibility of species misidentification, we used only records submitted by NTNU-Norwegian University of Science and Technology (Norway), SNSB-Zoologische Staatssammlung Muenchen (Germany), Zoologisches Forschungsmuseum Alexander Koenig (Germany), University of Oulu, Zoological Museum (Finland), prof Hans Malicky and prof Mladen Kučinić. No records identified as hybrids were included in the analyses. With the exception of C. gessneri, COI-5P marker failed to separate the species of the C. villosa group. However, it is highly unlikely that poor species determination was the basis for such a result. To enable the comprehensive and unbiased evaluation of the relationships within this group, data coverage in BOLD database for most of the studied species should be enhanced, encompassing different geographical distribution of samples. Further studies are needed to detect the array of molecular markers suitable for the species delineation in a complex group such as C. villosa.

Download Full-text

Diversity of the genus Lactobacillus revealed by comparative genomics of five species

Microbiology ◽

10.1099/mic.0.29140-0 ◽

2006 ◽

Vol 152 (11) ◽

pp. 3185-3196 ◽

Cited By ~ 87

Author(s):

Carlos Canchaya ◽

Marcus J. Claesson ◽

Gerald F. Fitzgerald ◽

Douwe van Sinderen ◽

Paul W. O'Toole

Keyword(s):

Comparative Genomics ◽

Phylogenetic Trees ◽

Rrna Gene ◽

Whole Genome ◽

Genome Sequences ◽

Lactobacillus Salivarius ◽

Core Proteins ◽

Genus Lactobacillus ◽

High Level

The genus Lactobacillus contains over 80 recognized species, and is characterized by a high level of diversity, reflected in its complex phylogeny. The authors' recent determination of the genome sequence of Lactobacillus salivarius means that five complete genomes of Lactobacillus species are available for comparative genomics: L. salivarius, L. plantarum, L. acidophilus, L. johnsonii and L. sakei. This paper now shows that there is no extensive synteny of the genome sequences of these five lactobacilli. Phylogeny based on whole-genome alignments suggested that L. salivarius was closer to L. plantarum than to L. sakei, which was closest to Enterococcus faecalis, in contrast to 16S rRNA gene relatedness. A total of 593 orthologues common to all five species were identified. Species relatedness based on this protein set was largely concordant with genome synteny-based relatedness. A Lactobacillus supertree, combining individual phylogenetic trees from each of 354 core proteins, had four main branches, comprising L. salivarius–L. plantarum; L. sakei; E. faecalis; and L. acidophilus–L. johnsonii. The extreme divergence of the Lactobacillus genomes analysed supports the recognition of new subgeneric divisions.

Download Full-text

Molecular and Symptom Analysis Reveal the Presence of New Phytoplasmas Associated with Sugarcane Grassy Shoot Disease in India

Plant Disease ◽

10.1094/pdis-91-11-1413 ◽

2007 ◽

Vol 91 (11) ◽

pp. 1413-1418 ◽

Cited By ~ 17

Author(s):

Kanchan Nasare ◽

Amit Yadav ◽

Anil K. Singh ◽

K. B. Shivasharanappa ◽

Y. S. Nerkar ◽

...

Keyword(s):

16S Rrna ◽

Sequence Similarity ◽

Pcr Amplification ◽

Saccharum Officinarum ◽

23S Rrna ◽

Rrna Gene ◽

Sequence Alignments ◽

High Sequence Similarity ◽

Multiple Sequence ◽

Very High

A total of 240 sugarcane (Saccharum officinarum) plants showing phenotypic symptoms of sugarcane grassy shoot (SCGS) disease were collected from three states of India, Maharashtra, Karnataka, and Uttar Pradesh. Phytoplasmas were detected in all symptomatic samples by the polymerase chain reaction (PCR) amplification of phytoplasma-specific 16S rRNA gene and 16S-23S rRNA spacer region (SR) sequences. No amplification was observed when DNA from asymptomatic plant samples was used as a template. Sixteen samples were selected on the basis of phenotypic symptoms and geographic location, and cloning and sequencing of the 16S rRNA and spacer regions were performed. Multiple sequence alignments of the 16S rRNA sequences revealed that they share very high sequence similarity with phytoplasmas of rice yellow dwarf, 16SrXI. However, the 16S-23S rRNA SR sequence analysis revealed that while the majority of phytoplasmas shared very high (>99%) sequence similarity with previously reported sugarcane phytoplasmas, two of them, namely BV2 (DQ380342) and VD7 (DQ380343), shared relatively low sequence similarity (79 and 84%, respectively). Therefore, these two phytoplasmas may be previously unreported ones that cause significant yield losses in sugarcane in India.

Download Full-text

Accurate inference of tree topologies from multiple sequence alignments using deep learning

10.1101/559054 ◽

2019 ◽

Cited By ~ 2

Author(s):

Anton Suvorov ◽

Joshua Hochuli ◽

Daniel R. Schrider

Keyword(s):

Deep Learning ◽

Parameter Space ◽

Phylogenetic Trees ◽

Strong Support ◽

Biological Research ◽

Learning Approaches ◽

Sequence Alignments ◽

Traditional Methods ◽

Multiple Sequence ◽

Multiple Sequence Alignments

AbstractReconstructing the phylogenetic relationships between species is one of the most formidable tasks in evolutionary biology. Multiple methods exist to reconstruct phylogenetic trees, each with their own strengths and weaknesses. Both simulation and empirical studies have identified several “zones” of parameter space where accuracy of some methods can plummet, even for four-taxon trees. Further, some methods can have undesirable statistical properties such as statistical inconsistency and/or the tendency to be positively misleading (i.e. assert strong support for the incorrect tree topology). Recently, deep learning techniques have made inroads on a number of both new and longstanding problems in biological research. Here we designed a deep convolutional neural network (CNN) to infer quartet topologies from multiple sequence alignments. This CNN can readily be trained to make inferences using both gapped and ungapped data. We show that our approach is highly accurate, often outperforming traditional methods, and is remarkably robust to bias-inducing regions of parameter space such as the Felsenstein zone and the Farris zone. We also demonstrate that the confidence scores produced by our CNN can more accurately assess support for the chosen topology than bootstrap and posterior probability scores from traditional methods. While numerous practical challenges remain, these findings suggest that deep learning approaches such as ours have the potential to produce more accurate phylogenetic inferences.

Download Full-text

Plastid phylogenomics of the Gynoxoid group (Senecioneae, Asteraceae) highlights the importance of motif-based sequence alignment amid low genetic distances

10.1101/2021.04.23.441144 ◽

2021 ◽

Author(s):

Belen Escobari ◽

Thomas Borsch ◽

Taylor S. Quedensley ◽

Michael Gruenstaeudl

Keyword(s):

Dna Sequence ◽

Phylogenetic Trees ◽

Plastid Genome ◽

Intergenic Spacer ◽

Genetic Distances ◽

Sequence Alignments ◽

Multiple Sequence ◽

Plastid Genomes ◽

Tree Inference ◽

The Impact

ABSTRACTPREMISEThe genus Gynoxys and relatives form a species-rich lineage of Andean shrubs and trees with low genetic distances within the sunflower subtribe Tussilaginineae. Previous molecular phylogenetic investigations of the Tussilaginineae have included few, if any, representatives of this Gynoxoid group or reconstructed ambiguous patterns of relationships for it.METHODSWe sequenced complete plastid genomes of 21 species of the Gynoxoid group and related Tussilaginineae and conducted detailed comparisons of the phylogenetic relationships supported by the gene, intron, and intergenic spacer partitions of these genomes. We also evaluated the impact of manual, motif-based adjustments of automatic DNA sequence alignments on phylogenetic tree inference.RESULTSOur results indicate that the inclusion of all plastid genome partitions is needed to infer fully resolved phylogenetic trees of the Gynoxoid group. Whole plastome-based tree inference suggests that the genera Gynoxys and Nordenstamia are polyphyletic and form the core clade of the Gynoxoid group. This clade is sister to a clade of Aequatorium and Paragynoxys and also includes some but not all representatives of Paracalia.CONCLUSIONSThe concatenation and combined analysis of all plastid genome partitions and the construction of manually curated, motif-based DNA sequence alignments are found to be instrumental in the recovery of strongly supported relationships of the Gynoxoid group. We demonstrate that the correct assessment of homology in genome-level plastid sequence datasets is crucial for subsequent phylogeny reconstruction and that the manual post-processing of multiple sequence alignments improves the reliability of such reconstructions amid low genetic distances between taxa.

Download Full-text