Draft genome assembly data of Anoxybacillus sp. strain MB8 isolated from Tattapani hot springs, India

Mapping Intimacies ◽

10.1101/2021.06.09.447659 ◽

2021 ◽

Author(s):

VISHNU PRASOODANAN P K ◽

Shruti S. Menon ◽

Rituja Saxena ◽

Prashant Waiker ◽

Vineet K Sharma

Keyword(s):

Hot Springs ◽

De Novo ◽

Draft Genome ◽

Gc Content ◽

Central India ◽

Glycoside Hydrolases ◽

Rrna Gene ◽

Aerobic Bacterium ◽

Protein Coding ◽

Protein Coding Genes

Discovery of novel thermophiles has shown promising applications in the field of biotechnology. Due to their thermal stability, they can survive the harsh processes in the industries, which make them important to be characterized and studied. Members of Anoxybacillus are alkaline tolerant thermophiles and have been extensively isolated from manure, dairy-processed plants, and geothermal hot springs. This article reports the assembled data of an aerobic bacterium Anoxybacillus sp. strain MB8, isolated from the Tattapani hot springs in Central India, where the 16S rRNA gene shares an identity of 97% (99% coverage) with Anoxybacillus kamchatkensis strain G10. The de novo assembly and annotation performed on the genome of Anoxybacillus sp. strain MB8 comprises of 2,898,780 bp (in 190 contigs) with a GC content of 41.8% and includes 2,976 protein-coding genes,1 rRNA operon, 73 tRNAs, 1 tm-RNA and 10 CRISPR arrays. The predicted protein-coding genes have been classified into 21 eggNOG categories. The KEGG Automated Annotation Server (KAAS) analysis indicated the presence of assimilatory sulfate reduction pathway, nitrate reducing pathway, and genes for glycoside hydrolases (GHs) and glycoside transferase (GTs). GHs and GTs hold widespread applications, in the baking and food industry for bread manufacturing, and in the paper, detergent and cosmetic industry. Hence, Anoxybacillus sp. strain MB8 holds the potential to be screened and characterized for such commercially relevant enzymes.

Download Full-text

Draft Genome of the Macadamia Husk Spot Pathogen, Pseudocercospora macadamiae

Phytopathology ◽

10.1094/phyto-12-19-0460-a ◽

2020 ◽

Vol 110 (9) ◽

pp. 1503-1506

Author(s):

Olufemi A. Akinsanmi ◽

Lilia C. Carvalhais

Keyword(s):

Plant Disease Resistance ◽

Plant Disease ◽

De Novo ◽

Draft Genome ◽

Gc Content ◽

Disease Development ◽

Closely Related Species ◽

Protein Coding ◽

Protein Coding Genes ◽

The Family

Pseudocercospora macadamiae causes husk spot in macadamia in Australia. Lack of genomic resources for this pathogen has restricted acquiring knowledge on the mechanism of disease development, spread, and its role in fruit abscission. To address this gap, we sequenced the genome of P. macadamiae. The sequence was de novo assembled into a draft genome of 40 Mb, which is comparable to closely related species in the family Mycosphaerellaceae. The draft genome comprises 212 scaffolds, of which 99 scaffolds are over 50 kb. The genome has a 49% GC content and is predicted to contain 15,430 protein-coding genes. This draft genome sequence is the first for P. macadamiae and represents a valuable resource for understanding genome evolution and plant disease resistance.

Download Full-text

Draft Genome Sequence of Bacillus sp. Strain IGA-FME-2, Isolated from the Bulk Soil of Soybean (Glycine max L.) in Northeast China

Microbiology Resource Announcements ◽

10.1128/mra.00004-21 ◽

2021 ◽

Vol 10 (16) ◽

Author(s):

Zhenhua Yu ◽

Sergio de los Santos-Villalobos ◽

Yansheng Li ◽

Jian Jin ◽

Fannie Isela Parra Cota ◽

...

Keyword(s):

Glycine Max ◽

Draft Genome ◽

Gc Content ◽

Single Copy ◽

Bulk Soil ◽

23S Rrna ◽

Protein Coding ◽

Content Type ◽

Protein Coding Genes ◽

Glycine Max L

ABSTRACT Here, we present the draft genome of Bacillus sp. strain IGA-FME-2. This strain was isolated from the bulk soil of soybean (Glycine max L.). Its genome consists of 3,810 protein-coding genes, 44 tRNAs, two 16S rRNAs, and a single copy of 23S rRNA, with a GC content of 46.4%.

Download Full-text

Draft Genome Sequence of Streptomyces cavourensis YBQ59, an Endophytic Producer of Antibiotics Bafilomycin D, Nonactic Acid, Prelactone B, and 5,11-Epoxy-10-Cadinanol

Microbiology Resource Announcements ◽

10.1128/mra.01056-18 ◽

2018 ◽

Vol 7 (11) ◽

Cited By ~ 1

Author(s):

Huy Quang Nguyen ◽

Nguyen Thi-Hanh Vu ◽

Ha Hoang Chu ◽

Son Ky Chu ◽

Ha Hoang ◽

...

Keyword(s):

Genome Sequence ◽

Draft Genome ◽

Gc Content ◽

Gene Clusters ◽

Draft Genome Sequence ◽

Biosynthetic Pathways ◽

Protein Coding ◽

Content Type ◽

Protein Coding Genes

This study reports the draft genome sequence of the endophytic Streptomyces cavourensis strain YBQ59, produces the antibiotics bafilomycin D, nonactic acid, prelactone B, and 5,11-epoxy-10-cadinanol. The draft genome sequence comprises ∼10.2 Mb, with a GC content of 64% and 8,958 predicted protein-coding genes, of which 14 gene clusters were found to associate with antibiotic biosynthetic pathways.

Download Full-text

Draft Genome Sequence of Streptococcus anginosus UMB1296, Isolated from the Female Urinary Tract

Microbiology Resource Announcements ◽

10.1128/mra.00409-20 ◽

2020 ◽

Vol 9 (20) ◽

Author(s):

Sara Temelkova ◽

Taylor Miller-Ensminger ◽

Adelina Voukadinova ◽

Alan J. Wolfe ◽

Catherine Putonti

Keyword(s):

Urinary Tract ◽

Genome Sequence ◽

Draft Genome ◽

Gc Content ◽

Draft Genome Sequence ◽

Protein Coding ◽

Content Type ◽

Streptococcus Anginosus ◽

Protein Coding Genes ◽

Female Urinary Tract

We present the draft genome sequence of a Streptococcus anginosus strain isolated from the female urinary tract. The S. anginosus UMB1296 draft genome has a size of 1,924,009 bp assembled into 35 contigs with a GC content of 38.69%. Genome annotation revealed 1,775 protein-coding genes, including several known virulence factors.

Download Full-text

Draft Genome Sequence of Micromonospora sp . Strain HK10, Isolated from Kaziranga National Park, India

Genome Announcements ◽

10.1128/genomea.00559-15 ◽

2016 ◽

Vol 4 (4) ◽

Cited By ~ 1

Author(s):

Madhumita Talukdar ◽

Dhrubajyoti Das ◽

Chiranjeeta Borah ◽

Hari Prasanna Deka Boruah ◽

Tarun Chandra Bora ◽

...

Keyword(s):

Genome Sequence ◽

National Park ◽

Draft Genome ◽

Gc Content ◽

Soil Samples ◽

Draft Genome Sequence ◽

Full Genome ◽

Protein Coding ◽

Protein Coding Genes ◽

Kaziranga National Park

We report the 6.92-Mbp genome sequence of Micromonospora sp. HK10, isolated from soil samples collected from Kaziranga National Park, Assam, India. The full genome of strain Micromonospora sp . strain HK10 consists of 6,911,179 bp with 73.39% GC content, 6,196 protein-coding genes, and 86 RNAs.

Download Full-text

De Novo Whole-Genome Sequencing of the Wood Rot Fungus Polyporus brumalis, Which Exhibits Potential Terpenoid Metabolism

Genome Announcements ◽

10.1128/genomea.00586-17 ◽

2017 ◽

Vol 5 (28) ◽

Author(s):

Su-Yeon Lee ◽

Ji-eun An ◽

Sun-Hwa Ryu ◽

Myungkil Kim

Keyword(s):

Single Molecule ◽

De Novo ◽

Gene Annotation ◽

Draft Genome ◽

Fungal Growth ◽

Protein Coding ◽

Sequencing Platform ◽

Protein Coding Genes ◽

Polyporus Brumalis ◽

Terpenoid Metabolism

ABSTRACT Polyporus brumalis is able to synthesize several sesquiterpenes during fungal growth. Using a single-molecule real-time sequencing platform, we present the 53-Mb draft genome of P. brumalis, which contains 6,231 protein-coding genes. Gene annotation and isolation support genetic information, which can increase the understanding of sesquiterpene metabolism in P. brumalis.

Download Full-text

Draft Genome Sequence of Geobacillus sp. Strain LEMMJ02, a Thermophile Isolated from Deception Island, an Active Volcano in Antarctica

Microbiology Resource Announcements ◽

10.1128/mra.00920-19 ◽

2019 ◽

Vol 8 (42) ◽

Author(s):

Júnia Schultz ◽

René Kallies ◽

Ulisses Nunes da Rocha ◽

Alexandre Soares Rosado

Keyword(s):

Genome Sequence ◽

Draft Genome ◽

Gc Content ◽

Active Volcano ◽

Draft Genome Sequence ◽

Deception Island ◽

Protein Coding ◽

Content Type ◽

Protein Coding Genes

The thermophilic Geobacillus sp. strain LEMMJ02 was isolated from Fumarole Bay sediment on Deception Island, an active Antarctic volcano. Here, we report the draft genome of LEMMJ02, which consists of 3,160,938 bp with 52.8% GC content and 3,523 protein-coding genes.

Download Full-text

Draft Genome Assembly and Annotation of Red Raspberry Rubus Idaeus

10.1101/546135 ◽

2019 ◽

Cited By ~ 4

Author(s):

Haley Wight ◽

Junhui Zhou ◽

Muzi Li ◽

Sridhar Hannenhalli ◽

Stephen M. Mount ◽

...

Keyword(s):

De Novo ◽

Draft Genome ◽

Rubus Idaeus ◽

Slow Process ◽

Red Raspberry ◽

Protein Coding ◽

Draft Genome Assembly ◽

Protein Coding Genes ◽

A Genome ◽

Exceptional Value

AbstractThe red raspberry, Rubus idaeus, is widely distributed in all temperate regions of Europe, Asia, and North America and is a major commercial fruit valued for its taste, high antioxidant and vitamin content. However, Rubus breeding is a long and slow process hampered by limited genomic and molecular resources. Genomic resources such as a complete genome sequencing and transcriptome will be of exceptional value to improve research and breeding of this high value crop. Using a hybrid sequence assembly approach including data from both long and short sequence reads, we present the first assembly of the Rubus idaeus genome (Joan J. variety). The de novo assembled genome consists of 2,145 scaffolds with a genome completeness of 95.3% and an N50 score of 638 KB. Leveraging a linkage map, we anchored 80.1% of the genome onto seven chromosomes. Using over 1 billion paired-end RNAseq reads, we annotated 35,566 protein coding genes with a transcriptome completeness score of 97.2%. The Rubus idaeus genome provides an important new resource for researchers and breeders.

Download Full-text

First draft genome assembly and identification of SNPs from hilsa shad (Tenualosa ilisha) of the Bay of Bengal

F1000Research ◽

10.12688/f1000research.18325.1 ◽

2019 ◽

Vol 8 ◽

pp. 320 ◽

Cited By ~ 2

Author(s):

Md. Bazlur Rahman Mollah ◽

Mohd Golam Quader Khan ◽

Md Shahidul Islam ◽

Md Samsul Alam

Keyword(s):

Genome Assembly ◽

Bay Of Bengal ◽

Draft Genome ◽

Gc Content ◽

Population Connectivity ◽

Whole Genome ◽

Migratory Fish ◽

Protein Coding ◽

Protein Coding Genes ◽

Tenualosa Ilisha

Background: Hilsa shad (Tenualosa ilisha), a widely distributed migratory fish, contributes substantially to the economy of Bangladesh. The harvest of hilsa from inland waters has been fluctuating due to anthropological and climate change-induced degradation of the riverine habitats. The whole genome sequence of this valuable fish could provide genomic tools for sustainable harvest, conservation and productivity cycle maintenance. Here, we report the first draft genome of T. ilisha from the Bay of Bengal, the largest reservoir of the migratory fish. Methods: A live specimen of T. ilisha was collected from the Bay of Bengal. The whole genome sequencing was performed by the Illumina HiSeqX platform (2 × 150 paired end configuration). We assembled the short reads using SOAPdenovo2 genome assembler and predicted protein coding genes by AUGUSTUS. The completeness of the T. ilisha genome assembly was evaluated by BUSCO (Benchmarking Universal Single Copy Orthologs). We identified single nucleotide polymorphisms (SNPs) by calling them directly from unassembled sequence reads using discoSnp++. Results: We assembled the draft genome of 710.28 Mb having an N50 scaffold length of 64157 bp and GC content of 42.95%. A total of 37,450 protein coding genes were predicted of which 29,339 (78.34%) were annotated with other vertebrate genomes. We also identified 792,939 isolated SNPs with transversion:transition ratio of 1:1.8. The BUSCO evaluation showed 78.1% completeness of this genome. Conclusions: The genomic data generated in this study could be used as a reference to identify genes associated with physiological and ecological adaptations, population connectivity, and migration behaviour of this biologically and economically important anadromous fish species of the Clupeidae family.

Download Full-text

A high-quality chromosomal genome assembly of Diospyros oleifera Cheng

GigaScience ◽

10.1093/gigascience/giz164 ◽

2020 ◽

Vol 9 (1) ◽

Author(s):

Yujing Suo ◽

Peng Sun ◽

Huihui Cheng ◽

Weijuan Han ◽

Songfeng Diao ◽

...

Keyword(s):

Molecular Mechanisms ◽

De Novo ◽

Phylogenetic Analyses ◽

Draft Genome ◽

Diospyros Kaki ◽

High Quality ◽

Phylogenetic Tree Analysis ◽

Protein Coding ◽

Protein Coding Genes ◽

Anthocyanin Pathway

Abstract Background Diospyros oleifera Cheng, of the family Ebenaceae, is an economically important tree. Phylogenetic analyses indicate that D. oleifera is closely related to Diospyros kaki Thunb. and could be used as a model plant for studies of D. kaki. Therefore, development of genomic resources of D. oleifera will facilitate auxiliary assembly of the hexaploid persimmon genome and elucidate the molecular mechanisms of important traits. Findings The D. oleifera genome was assembled with 443.6 Gb of raw reads using the Pacific Bioscience Sequel and Illumina HiSeq X Ten platforms. The final draft genome was ∼812.3 Mb and had a high level of continuity with N50 of 3.36 Mb. Fifteen scaffolds corresponding to the 15 chromosomes were assembled to a final size of 721.5 Mb using 332 scaffolds, accounting for 88.81% of the genome. Repeat sequences accounted for 54.8% of the genome. By de novo sequencing and analysis of homology with other plant species, 30,530 protein-coding genes with an average transcript size of 7,105.40 bp were annotated; of these, 28,580 protein-coding genes (93.61%) had conserved functional motifs or terms. In addition, 171 candidate genes involved in tannin synthesis and deastringency in persimmon were identified; of these chalcone synthase (CHS) genes were expanded in the D. oleifera genome compared with Diospyros lotus, Camellia sinensis, and Vitis vinifera. Moreover, 186 positively selected genes were identified, including chalcone isomerase (CHI) gene, a key enzyme in the flavonoid-anthocyanin pathway. Phylogenetic tree analysis indicated that the split of D. oleifera and D. lotus likely occurred 9.0 million years ago. In addition to the ancient γ event, a second whole-genome duplication event occurred in D. oleifera and D. lotus. Conclusions We generated a high-quality chromosome-level draft genome for D. oleifera, which will facilitate assembly of the hexaploid persimmon genome and further studies of major economic traits in the genus Diospyros.

Download Full-text