scholarly journals De novo genome assemblies of butterflies

GigaScience ◽  
2021 ◽  
Vol 10 (6) ◽  
Author(s):  
Emily A Ellis ◽  
Caroline G Storer ◽  
Akito Y Kawahara

Abstract Background The availability of thousands of genomes has enabled new advancements in biology. However, many genomes have not been investigated for their quality. Here we examine quality trends in a taxonomically diverse and well-known group, butterflies (Papilionoidea), and provide draft, de novo assemblies for all available butterfly genomes. Owing to massive genome sequencing investment and taxonomic curation, this is an excellent group to explore genome quality. Findings We provide de novo assemblies for all 822 available butterfly genomes and interpret their quality in terms of completeness and continuity. We identify the 50 highest quality genomes across butterflies and conclude that the ringlet, Aphantopus hyperantus, has the highest quality genome. Our post-processing of draft genome assemblies identified 118 butterfly genomes that should not be reused owing to contamination or extremely low quality. However, many draft genomes are of high utility, especially because permissibility of low-quality genomes is dependent on the objective of the study. Our assemblies will serve as a key resource for papilionid genomics, especially for researchers without computational resources. Conclusions Quality metrics and assemblies are typically presented with annotated genome accessions but rarely with de novo genomes. We recommend that studies presenting genome sequences provide the assembly and some metrics of quality because quality will significantly affect downstream results. Transparency in quality metrics is needed to improve the field of genome science and encourage data reuse.

2021 ◽  
Author(s):  
Andrea Minio ◽  
Noe Cochetel ◽  
Amanda M Vondras ◽  
Melanie Massonnet ◽  
Dario Cantu

De novo genome assembly is essential for genomic research. High-quality genomes assembled into phased pseudomolecules are challenging to produce and often contain assembly errors caused by repeats, heterozygosity, or the chosen assembly strategy. Although algorithms exist that produce partially phased assemblies, haploid draft assemblies that may lack biological information remain favored because they are easier to generate and use. We developed HaploSync, a suite of tools that produces fully phased, chromosome-scale diploid genome assemblies and performs extensive quality control to limit assembly artifacts. HaploSync uses a genetic map and/or the genome of a closely related species to guide the scaffolding of a diploid assembly into phased pseudomolecules for each chromosome. It compares alternative haplotypes to identify and correct misassemblies independent of a reference, fills assembly gaps with unplaced sequences, and resolves collapsed homozygous regions. In a series of plant, fungal, and animal kingdom case studies, we demonstrate that HaploSync increases the assembly contiguity of phased chromosomes, improves completeness by filling gaps, corrects scaffolding, and correctly phases highly heterozygous, complex regions.


2016 ◽  
Vol 4 (5) ◽  
Author(s):  
Shu Harn Te ◽  
Boon Fei Tan ◽  
Janelle R. Thompson ◽  
Karina Yew-Hoong Gin

Genomes of two filamentous benthic cyanobacteria were obtained from cocultures obtained from two freshwater lakes. The cultures were obtained by first growing cyanobacterial trichome on solid medium, followed by subculturing in freshwater media. Subsequent shotgun sequencing, de novo assembly, and genomic binning yielded almost complete genomes of Oscillatoriales USR 001 and Nostoc sp. MBR 210.


2018 ◽  
Vol 6 (7) ◽  
Author(s):  
Abhishek Somani ◽  
Daniel Smith ◽  
Matthew Hegarty ◽  
Narcis Fernandez-Fuentes ◽  
Sreenivas R. Ravella ◽  
...  

ABSTRACT Non- albicans Candida species are growing in prominence in industrial biotechnology due to their ability to utilize hemicellulose. Here, we present the draft genome sequences of an inhibitor-tolerant Candida tropicalis strain (Y6604) and Candida boidinii NCAIM Y01308 T .


2020 ◽  
Vol 7 (1) ◽  
Author(s):  
Mikhail Rayko ◽  
Aleksey Komissarov ◽  
Jason C. Kwan ◽  
Grace Lim-Fong ◽  
Adelaide C. Rhodes ◽  
...  

Abstract Many animal phyla have no representatives within the catalog of whole metazoan genome sequences. This dataset fills in one gap in the genome knowledge of animal phyla with a draft genome of Bugula neritina (phylum Bryozoa). Interest in this species spans ecology and biomedical sciences because B. neritina is the natural source of bioactive compounds called bryostatins. Here we present a draft assembly of the B. neritina genome obtained from PacBio and Illumina HiSeq data, as well as genes and proteins predicted de novo and verified using transcriptome data, along with the functional annotation. These sequences will permit a better understanding of host-symbiont interactions at the genomic level, and also contribute additional phylogenomic markers to evaluate Lophophorate or Lophotrochozoa phylogenetic relationships. The effort also fits well with plans to ultimately sequence all orders of the Metazoa.


2016 ◽  
Vol 4 (6) ◽  
Author(s):  
Elena Geiser ◽  
Florian Ludwig ◽  
Thiemo Zambanini ◽  
Nick Wierckx ◽  
Lars M. Blank

Some smut fungi of the family Ustilaginaceae produce itaconate from glucose. De novo genome sequencing of nine itaconate-producing Ustilaginaceae revealed genome sizes between 19 and 25 Mbp. Comparison to the itaconate cluster of U. maydis MB215 revealed all essential genes for itaconate production contributing to metabolic engineering for improving itaconate production.


2017 ◽  
Vol 5 (34) ◽  
Author(s):  
Angelina A. Kislichkina ◽  
Aleksandr G. Bogun ◽  
Lidiya A. Kadnikova ◽  
Nadezhda V. Maiskaya ◽  
Viktor I. Solomentsev ◽  
...  

ABSTRACT We here report the draft genome sequences of 8 Yersinia pestis subsp. microtus bv. caucasica strains isolated from the East Caucasian (previous name, Dagestan) mountain focus (no. 39), representing the most ancient branch of the 0.PE2 phylogroup circulating in populations of common voles (Microtus arvalis).


2020 ◽  
Vol 9 (41) ◽  
Author(s):  
Muzi Jin ◽  
B. Byambajav ◽  
Hongyuan Zheng ◽  
Yufei Chen ◽  
B. Natsagdorj ◽  
...  

ABSTRACT Here, we report the draft genome sequences of two Yersinia pestis bv. Antiqua strains, belonging to the 3.ANT phylogroup, that were isolated in Mongolia and were circulating in marmot populations.


F1000Research ◽  
2017 ◽  
Vol 6 ◽  
pp. 1750 ◽  
Author(s):  
Tapan Kumar Mondal ◽  
Hukam Chand Rawal ◽  
Kishor Gaikwad ◽  
Tilak Raj Sharma ◽  
Nagendra Kumar Singh

Oryza coarctata plants, collected from Sundarban delta of West Bengal, India, have been used in the present study to generate draft genome sequences, employing the hybrid genome assembly with Illumina reads and third generation Oxford Nanopore sequencing technology. We report for the first time that more than 85.71 % of the genome coverage and the data have been deposited in NCBI SRA, with BioProject ID PRJNA396417.


2020 ◽  
Vol 9 (45) ◽  
Author(s):  
Hanan R. Shehata ◽  
Richmond A. Chandler ◽  
Steven G. Newmaster

ABSTRACT Here, we report the draft genome sequences of Lactobacillus delbrueckii subsp. bulgaricus strains CBC-LB69 and CBC-LB8. The strains were isolated from naturally processed, homemade dairy foods in Bulgaria. The two genome assemblies each resulted in 39 contigs with total lengths of 1,752,493 and 1,759,908 bp and GC contents of 49.80% and 49.90%, respectively.


2020 ◽  
Vol 9 (25) ◽  
Author(s):  
Djouhar Souak ◽  
Amine M. Boukerb ◽  
Magalie Barreau ◽  
Cecile Duclairoir-Poc ◽  
Marc G. J. Feuilloley

ABSTRACT We report the draft genome sequences of two Micrococcus luteus strains, MFP06 and MFP07, isolated from human skin. The genome assemblies consist of 2,480 and 2,417 kbp with 2,337 and 2,240 coding sequences, respectively. The genomes contain genes potentially involved in osmotic stress tolerance, DNA repair, monoacylglycerol hydrolysis, and beta-lactone synthesis.


Sign in / Sign up

Export Citation Format

Share Document