draft assembly
Recently Published Documents


TOTAL DOCUMENTS

57
(FIVE YEARS 31)

H-INDEX

9
(FIVE YEARS 2)

Author(s):  
Mark A. Weaver ◽  
Nahreen Mirza ◽  
Jennifer R. Mandel ◽  
C. Douglas Boyette ◽  
Shawn P. Brown

We report here the whole-genome sequence and draft assembly for a bioherbicidal strain of Albifimbria verrucaria , CABI-IMI 368023, which was formerly identified as Myrothecium verrucaria . This isolate has been well studied for the biological control of important weeds, including kudzu and giant salvinia.


Life ◽  
2021 ◽  
Vol 11 (12) ◽  
pp. 1377
Author(s):  
Ernesto Parra-Rincón ◽  
Cristian A. Velandia-Huerto ◽  
Adriaan Gittenberger ◽  
Jörg Fallmann ◽  
Thomas Gatter ◽  
...  

Tunicates are the sister group of vertebrates and thus occupy a key position for investigations into vertebrate innovations as well as into the consequences of the vertebrate-specific genome duplications. Nevertheless, tunicate genomes have not been studied extensively in the past, and comparative studies of tunicate genomes have remained scarce. The carpet sea squirt Didemnum vexillum, commonly known as “sea vomit”, is a colonial tunicate considered an invasive species with substantial ecological and economical risk. We report the assembly of the D. vexillum genome using a hybrid approach that combines 28.5 Gb Illumina and 12.35 Gb of PacBio data. The new hybrid scaffolded assembly has a total size of 517.55 Mb that increases contig length about eightfold compared to previous, Illumina-only assembly. As a consequence of an unusually high genetic diversity of the colonies and the moderate length of the PacBio reads, presumably caused by the unusually acidic milieu of the tunic, the assembly is highly fragmented (L50 = 25,284, N50 = 6539). It is sufficient, however, for comprehensive annotations of both protein-coding genes and non-coding RNAs. Despite its shortcomings, the draft assembly of the “sea vomit” genome provides a valuable resource for comparative tunicate genomics and for the study of the specific properties of colonial ascidians.


Author(s):  
Stephen Byrne ◽  
Maximilian Schughart ◽  
James C Carolan ◽  
Michael Gaffney ◽  
Peter J Thorpe ◽  
...  

Abstract The English grain aphid, Sitobion avenae, is a major agricultural pest of wheat, barley and oats, and one of the principal vectors of Barley Yellow Dwarf Virus (BYDV) leading to significant reductions in grain yield, annually. Emerging resistance to and increasing regulation of insecticides has resulted in limited options for their control. Using PacBio HiFi data, we have produced a high quality draft assembly of the S. avenae genome; generating a primary assembly with a total assembly size of 475.7 Mb, and an alternate assembly with a total assembly size of 430.8 Mb. Our primary assembly was highly contiguous with only 326 contigs and a contig N50 of 15.95 Mb. Assembly completeness was estimated at 97.7% using BUSCO analysis and 31,007 and 29,037 protein coding genes were predicted from the primary and alternate assemblies, respectively. This assembly, which is to our knowledge the first for an insecticide resistant clonal lineage of English grain aphid, will provide novel insight into the molecular and mechanistic determinants of resistance and will facilitate future research into mechanisms of viral transmission and aphid behavior.


F1000Research ◽  
2021 ◽  
Vol 10 ◽  
pp. 1230
Author(s):  
Kenichi Okamoto ◽  
Nichole Dopkins ◽  
Elias Kinfu

The common, or spectacled, caiman Caiman crocodilus is an abundant, widely distributed Neotropical crocodilian exhibiting notable morphological and molecular diversification. The species also accounts by far for the largest share of crocodilian hides on the global market, with the C. crocodilus hide trade alone valued at about US$86.5 million per year. We obtained 239,911,946 paired-end reads comprising approximately 72 G bases using IlluminaTM sequencing of tissue sampled from a single Caiman crocodilus individual. These reads were de-novo assembled and progressively aligned against the genomes of increasingly related crocodilians; liftoff was used to annotate the draft C. crocodilus genome assembly based on an Alligator mississipiensis (a confamilial species) annotation. The draft assembly has been submitted to the National Center for Biotechnology Information and both assembly and annotation are currently available for review at doi.org/10.5281/zenodo.4755063


2021 ◽  
Vol 12 ◽  
Author(s):  
Wu Gan ◽  
Chenxi Zhao ◽  
Xinran Liu ◽  
Chao Bian ◽  
Qiong Shi ◽  
...  

Spiny head croaker (Collichthys lucidus), belonging to the family Sciaenidae, is a small economic fish with a main distribution in the coastal waters of Northwestern Pacific. Here, we constructed a nonredundant chromosome-level genome assembly of spiny head croaker and also made genome-wide investigations on genome evolution and gene families related to otolith development. A primary genome assembly of 811.23 Mb, with a contig N50 of 74.92 kb, was generated by a combination of 49.12-Gb Illumina clean reads and 35.24 Gb of PacBio long reads. Contigs of this draft assembly were further anchored into chromosomes by integration with additional 185.33-Gb Hi-C data, resulting in a high-quality chromosome-level genome assembly of 817.24 Mb, with an improved scaffold N50 of 26.58 Mb. Based on our phylogenetic analysis, we observed that C. lucidus is much closer to Larimichthys crocea than Miichthys miiuy. We also predicted that many gene families were significantly expanded (p-value <0.05) in spiny head croaker; among them, some are associated with “calcium signaling pathway” and potential “inner ear functions.” In addition, we identified some otolith-related genes (such as otol1a that encodes Otolin-1a) with critical deletions or mutations, suggesting possible molecular mechanisms for well-developed otoliths in the family Sciaenidae.


2021 ◽  
Author(s):  
Mónica Rojas-Gómez ◽  
Jose Pablo Jiménez-Madrigal ◽  
Maripaz Montero-Vargas ◽  
Randall Loaiza-Montoya ◽  
Max Chavarría ◽  
...  

Abstract Psidium friedrichsthalianum (O. Berg) Nied is a tropical tree species in the Myrtaceae family, natively distributed from southern Mexico, to eastern Venezuela and Ecuador and commonly known as "Cas'', "Costa Rican guava" or “Sour Guava”. The “Cas” produces a fruit with a rather distinctive acidic flavor and has bioactive compounds and biological potential equal or greater than common Guava; is considered an indigenous crop in Costa Rica with characteristics as a functional food untapped. This species has not been completely domesticated, and can be found in home gardens, paddocks, small groups, and, more recently, in small and medium sized plantations. Also, the plantations of this species do not have technical and scientific support or agronomic promotion from industry, nor are there genetic resources or germplasm readily available to farmers. This limits its commercial development and the implementation of selection or genetic improvement programs. In this study, we present the first draft assembly of the Cas genome using PacBio long reads and the Canu assembly pipeline. Our draft assembly has a total length of 417.64 Mb, with 24 440 contigs and a N50 contig size of 21.3 Kb. Structural annotation resulted in 59 036 gene models. Functional annotation was conducted against the non-redundant set of genes from the KEGG database. Of the 52 422 complete genes models, 15.55% (8 153) presented homology with KEGG orthologs. The genes found in our Cas draft assembly were compared to those found in Eucalyptus grandis (rose gum) [erg] in the KEGG repository. According to the KEGG pathway assignments, 33 isoforms were annotated as part of the flavonoid biosynthetic pathway. In addition, 19 isoforms were annotated as part of phenylpropanoid biosynthetic pathway. The results of this study provide an overview of the first draft of the Cas genome assembly using PacBio long reads. This new genomic resource represents the basis for exploring the genetic potential of this crop with characteristics as a functional food.


Author(s):  
Endang Rahmat ◽  
Inkyu Park ◽  
Youngmin Kang

Abstract The new yeast Metschnikowia persimmonesis KCTC 12991BP (KIOM G15050 strain) exhibits strong antimicrobial activity against some pathogens. This activity may be related to the medicinal profile of secondary metabolites that could be found in the genome of this species. Therefore, to explore its future possibility of producing some beneficial activities, including medicinal ability, we report high quality whole-genome assembly of M. persimmonesis produced by PacBio RSII sequencer. The final draft assembly consisted of 16 scaffolds with GC content of 45.90% and comprised a fairly complete set (82.8%) of BUSCO result using Saccharomycetales lineage data set. The total length of the genome was 16.473 Mb, with a scaffold N50 of 1.982 Mb. Annotation of the M. persimmonesis genome revealed presence of 7,029 genes and 6,939 functionally annotated proteins. Based on the analysis of phylogenetic relationship and the average nucleotide identities (ANI), M. persimmonesis was proved to a novel species within the Metschnikowia genus. This finding is expected to significantly contribute to the discovery of high-value natural products from M. persimmonesis as well as for genome biology and evolution comparative analysis within Metschnikowia species.


2021 ◽  
Vol 7 (26) ◽  
pp. eabe8290
Author(s):  
Jennifer M. Polinski ◽  
Aleksey V. Zimin ◽  
K. Fraser Clark ◽  
Andrea B. Kohn ◽  
Norah Sadowski ◽  
...  

The American lobster, Homarus americanus, is integral to marine ecosystems and supports an important commercial fishery. This iconic species also serves as a valuable model for deciphering neural networks controlling rhythmic motor patterns and olfaction. Here, we report a high-quality draft assembly of the H. americanus genome with 25,284 predicted gene models. Analysis of the neural gene complement revealed extraordinary development of the chemosensory machinery, including a profound diversification of ligand-gated ion channels and secretory molecules. The discovery of a novel class of chimeric receptors coupling pattern recognition and neurotransmitter binding suggests a deep integration between the neural and immune systems. A robust repertoire of genes involved in innate immunity, genome stability, cell survival, chemical defense, and cuticle formation represents a diversity of defense mechanisms essential to thrive in the benthic marine environment. Together, these unique evolutionary adaptations contribute to the longevity and ecological success of this long-lived benthic predator.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Lidong Guo ◽  
Mengyang Xu ◽  
Wenchao Wang ◽  
Shengqiang Gu ◽  
Xia Zhao ◽  
...  

Abstract Background Synthetic long reads (SLR) with long-range co-barcoding information are now widely applied in genomics research. Although several tools have been developed for each specific SLR technique, a robust standalone scaffolder with high efficiency is warranted for hybrid genome assembly. Results In this work, we developed a standalone scaffolding tool, SLR-superscaffolder, to link together contigs in draft assemblies using co-barcoding and paired-end read information. Our top-to-bottom scheme first builds a global scaffold graph based on Jaccard Similarity to determine the order and orientation of contigs, and then locally improves the scaffolds with the aid of paired-end information. We also exploited a screening algorithm to reduce the negative effect of misassembled contigs in the input assembly. We applied SLR-superscaffolder to a human single tube long fragment read sequencing dataset and increased the scaffold NG50 of its corresponding draft assembly 1349 fold. Moreover, benchmarking on different input contigs showed that this approach overall outperformed existing SLR scaffolders, providing longer contiguity and fewer misassemblies, especially for short contigs assembled by next-generation sequencing data. The open-source code of SLR-superscaffolder is available at https://github.com/BGI-Qingdao/SLR-superscaffolder. Conclusions SLR-superscaffolder can dramatically improve the contiguity of a draft assembly by integrating a hybrid assembly strategy.


2021 ◽  
Vol 11 (4) ◽  
Author(s):  
Julia Brose ◽  
Kin H Lau ◽  
Thu Thuy Thi Dang ◽  
John P Hamilton ◽  
Lívia do Vale Martins ◽  
...  

Abstract Mitragyna speciosa (kratom) produces numerous compounds with pharmaceutical properties including the production of bioactive monoterpene indole and oxindole alkaloids. Using a linked-read approach, a 1,122,519,462 bp draft assembly of M. speciosa “Rifat” was generated with an N50 scaffold size of 1,020,971 bp and an N50 contig size of 70,448 bp that encodes 55,746 genes. Chromosome counting revealed that “Rifat” is a tetraploid with a base chromosome number of 11, which was further corroborated by orthology and syntenic analysis of the genome. Analysis of genes and clusters involved in specialized metabolism revealed genes putatively involved in alkaloid biosynthesis. Access to the genome of M. speciosa will facilitate an improved understanding of alkaloid biosynthesis and accelerate the production of bioactive alkaloids in heterologous hosts.


Sign in / Sign up

Export Citation Format

Share Document