scholarly journals First draft genome assembly of the Argane tree (Argania spinosa)

F1000Research ◽  
2020 ◽  
Vol 7 ◽  
pp. 1310
Author(s):  
Slimane Khayi ◽  
Nour Elhouda Azza ◽  
Fatima Gaboun ◽  
Stacy Pirro ◽  
Oussama Badad ◽  
...  

Background: The Argane tree ( Argania spinosa L. Skeels) is an endemic tree of mid-western Morocco that plays an important socioeconomic and ecologic role for a dense human population in an arid zone. Several studies confirmed the importance of this species as a food and feed source and as a resource for both pharmaceutical and cosmetic compounds. Unfortunately, the argane tree ecosystem is facing significant threats from environmental changes (global warming, over-population) and over-exploitation. Limited research has been conducted, however, on argane tree genetics and genomics, which hinders its conservation and genetic improvement. Methods: Here, we present a draft genome assembly of A. spinosa. A reliable reference genome of  A. spinosa was created using a hybrid  de novo assembly approach combining short and long sequencing reads. Results: In total, 144 Gb Illumina HiSeq reads and 7.6 Gb PacBio reads were produced and assembled. The final draft genome comprises 75 327 scaffolds totaling 671 Mb with an N50 of 49 916 kb. The draft assembly is close to the genome size estimated by k-mers distribution and covers 89% of complete and 4.3 % of partial Arabidopsis orthologous groups in BUSCO. Conclusion: The A. spinosa genome will be useful for assessing biodiversity leading to efficient conservation of this endangered endemic tree. Furthermore, the genome may enable genome-assisted cultivar breeding, and provide a better understanding of important metabolic pathways and their underlying genes for both cosmetic and pharmacological.

F1000Research ◽  
2018 ◽  
Vol 7 ◽  
pp. 1310 ◽  
Author(s):  
Slimane Khayi ◽  
Nour Elhouda Azza ◽  
Fatima Gaboun ◽  
Stacy Pirro ◽  
Oussama Badad ◽  
...  

Background: The Argane tree (Argania spinosa L. Skeels) is an endemic tree of southwestern Morocco that plays an important socioeconomic and ecologic role for a dense human population in an arid zone. Several studies confirmed the importance of this species as a food and feed source and as a resource for both pharmaceutical and cosmetic compounds. Unfortunately, the argane tree ecosystem is facing significant threats from environmental changes (global warming, over-population) and over-exploitation. Limited research has been conducted, however, on argane tree genetics and genomics, which hinders its conservation and genetic improvement. Methods: Here, we present a draft genome assembly of A. spinosa. A reliable reference genome of A. spinosa was created using a hybrid de novo assembly approach combining short and long sequencing reads. Results: In total, 144 Gb Illumina HiSeq reads and 7.2 Gb PacBio reads were produced and assembled. The final draft genome comprises 75 327 scaffolds totaling 671 Mb with an N50 of 49 916 kb. The draft assembly is close to the genome size estimated by k-mers distribution and covers 89% of complete and 4.3 % of partial Arabidopsis orthologous groups in BUSCO. Conclusion: The A. spinosa genome will be useful for assessing biodiversity leading to efficient conservation of this endangered endemic tree. Furthermore, the genome may enable genome-assisted cultivar breeding, and provide a better understanding of important metabolic pathways and their underlying genes for both cosmetic and pharmacological purposes.


2020 ◽  
Vol 12 (2) ◽  
pp. 3917-3925
Author(s):  
Greer A Dolby ◽  
Matheo Morales ◽  
Timothy H Webster ◽  
Dale F DeNardo ◽  
Melissa A Wilson ◽  
...  

Abstract Toll-like receptors (TLRs) are a complex family of innate immune genes that are well characterized in mammals and birds but less well understood in nonavian sauropsids (reptiles). The advent of highly contiguous draft genomes of nonmodel organisms enables study of such gene families through analysis of synteny and sequence identity. Here, we analyze TLR genes from the genomes of 22 tetrapod species. Findings reveal a TLR8 gene expansion in crocodilians and turtles (TLR8B), and a second duplication (TLR8C) specifically within turtles, followed by pseudogenization of that gene in the nonfreshwater species (desert tortoise and green sea turtle). Additionally, the Mojave desert tortoise (Gopherus agassizii) has a stop codon in TLR8B (TLR8-1) that is polymorphic among conspecifics. Revised orthology further reveals a new TLR homolog, TLR21-like, which is exclusive to lizards, snakes, turtles, and crocodilians. These analyses were made possible by a new draft genome assembly of the desert tortoise (gopAga2.0), which used chromatin-based assembly to yield draft chromosomal scaffolds (L50 = 26 scaffolds, N50 = 28.36 Mb, longest scaffold = 107 Mb) and an enhanced de novo genome annotation with 25,469 genes. Our three-step approach to orthology curation and comparative analysis of TLR genes shows what new insights are possible using genome assemblies with chromosome-scale scaffolds that permit integration of synteny conservation data.


2018 ◽  
Vol 6 (16) ◽  
pp. e00265-18 ◽  
Author(s):  
Stewart T. G. Burgess ◽  
Kathryn Bartley ◽  
Edward J. Marr ◽  
Harry W. Wright ◽  
Robert J. Weaver ◽  
...  

ABSTRACT Sheep scab, caused by infestation with Psoroptes ovis, is highly contagious, results in intense pruritus, and represents a major welfare and economic concern. Here, we report the first draft genome assembly and gene prediction of P. ovis based on PacBio de novo sequencing. The ∼63.2-Mb genome encodes 12,041 protein-coding genes.


2021 ◽  
Author(s):  
Mahiya Farooq ◽  
Mehraj D. Shah ◽  
Bilal A. Padder ◽  
T.A. Sofi ◽  
Khalid k. Masoodi ◽  
...  

Abstract Wilsonomyces carpophilus is a necrotrophic plant pathogenic fungus with a wide host range infecting all stone fruits such as peach, plum, apricot and cherry, and almonds among the nut crops. Necrotrophs are more devastating with a complex pathogenicity mechanism and least known effector repositories. Here, we report a 29.9 megabase draft genome assembly of W. carpophilus. We explored the hybrid technology of Illumina HiSeq and PacBio sequencing technologies to get the unbiased results of sequence reads. We aligned short Illumina reads against the long PacBio reads. A total of 10,901 protein-coding genes were predicted that includes varied set of genes such as HET genes, cytochrome-p450 genes, kinases etc. We mined 2851 simple sequence repeats (SSRs) in the genome assembly. We also predicted the diverse inventory of secretory proteins, transporters, primary and secondary metabolic enzymes. A total of 225 secreted proteins, hydrolases, polysaccharide-degrading enzymes, esterolytic, lipolytic and proteolytic enzymes were the most significant proteins reflecting the necrotrophic lifestyle of the W. carpophilus. We also identified 146 tRNAs and 52 rRNAs in the pathogen genome.


Author(s):  
Luis J Chueca ◽  
Tilman Schell ◽  
Markus Pfenninger

Abstract Among all molluscs, land snails are a scientifically and economically interesting group comprising edible species, alien species and agricultural pests. Yet, despite their high diversity, the number of genome drafts publicly available is still scarce. Here, we present the draft genome assembly of the land snail Candidula unifasciata, a widely distributed species along central Europe, belonging to the Geomitridae family, a highly diversified taxon in the Western-Palearctic region. We performed whole genome sequencing, assembly and annotation of an adult specimen based on PacBio and Oxford Nanopore long read sequences as well as Illumina data. A genome draft of about 1.29 Gb was generated with a N50 length of 246 kb. More than 60% of the assembled genome was identified as repetitive elements. 22,464 protein-coding genes were identified in the genome, of which 62.27% were functionally annotated. This is the first assembled and annotated genome for a geometrid snail and will serve as reference for further evolutionary, genomic and population genetic studies of this important and interesting group.


2020 ◽  
Author(s):  
Jan O. Engler ◽  
Yvonne Lawrie ◽  
Yannick Gansemans ◽  
Filip Van Nieuwerburgh ◽  
Alexander Suh ◽  
...  

AbstractThe Taita White-eye (Zosterops silvanus) is an endangered songbird endemic to the Taita Hills of Southern Kenya, where it is confined to small areas of fragmented forest. With diversification rates exceeding those reported in most other vertebrates, White-eyes are a prime example of a ‘great speciator’. Nevertheless, we still know surprisingly little about the genomic underpinnings leading to this extraordinary fast radiation. Here, we present a draft genome assembly (ZSil_MB_1.0) for the Taita White-eye generated from a blood sample of a wild, female bird captured in the Taita Hills, Kenya. By performing a de novo assembly with linked-reads and annotation of the assembly with the MAKER pipeline, we generated a 1.069 Gb assembly with a scaffold N50 of 1.105 Mb and an L50 of 244. After quality evaluation of the assembly, we identified 92.1% of BUSCOs complete or fragmented, indicating that our de novo assembly is of high quality. This new assembly provides a genomic resource for future studies into the evolutionary and comparative genomics of this rapidly diversifying group of birds.


Genes ◽  
2018 ◽  
Vol 9 (10) ◽  
pp. 485 ◽  
Author(s):  
André Machado ◽  
Ole Tørresen ◽  
Naoki Kabeya ◽  
Alvarina Couto ◽  
Bent Petersen ◽  
...  

Clupeiformes, such as sardines and herrings, represent an important share of worldwide fisheries. Among those, the European sardine (Sardina pilchardus, Walbaum 1792) exhibits significant commercial relevance. While the last decade showed a steady and sharp decline in capture levels, recent advances in culture husbandry represent promising research avenues. Yet, the complete absence of genomic resources from sardine imposes a severe bottleneck to understand its physiological and ecological requirements. We generated 69 Gbp of paired-end reads using Illumina HiSeq X Ten and assembled a draft genome assembly with an N50 scaffold length of 25,579 bp and BUSCO completeness of 82.1% (Actinopterygii). The estimated size of the genome ranges between 655 and 850 Mb. Additionally, we generated a relatively high-level liver transcriptome. To deliver a proof of principle of the value of this dataset, we established the presence and function of enzymes (Elovl2, Elovl5, and Fads2) that have pivotal roles in the biosynthesis of long chain polyunsaturated fatty acids, essential nutrients particularly abundant in oily fish such as sardines. Our study provides the first omics dataset from a valuable economic marine teleost species, the European sardine, representing an essential resource for their effective conservation, management, and sustainable exploitation.


2018 ◽  
Vol 7 (18) ◽  
Author(s):  
Stewart T. G. Burgess ◽  
Kathryn Bartley ◽  
Francesca Nunn ◽  
Harry W. Wright ◽  
Margaret Hughes ◽  
...  

The poultry red mite, Dermanyssus gallinae, is a major worldwide concern in the egg-laying industry. Here, we report the first draft genome assembly and gene prediction of Dermanyssus gallinae, based on combined PacBio and MinION long-read de novo sequencing.


Gigabyte ◽  
2020 ◽  
Vol 2020 ◽  
pp. 1-12
Author(s):  
Weixue Mu ◽  
Jinpu Wei ◽  
Ting Yang ◽  
Yannan Fan ◽  
Le Cheng ◽  
...  

Nyssa yunnanensis is a deciduous tree species in the family Nyssaceae within the order Cornales. As only eight individual trees and two populations have been recorded in China’s Yunnan province, this species has been listed among China’s national Class I protection species since 1999 and also among 120 PSESP (Plant Species with Extremely Small Populations) in the Implementation Plan of Rescuing and Conserving China’s Plant Species with Extremely Small Populations (PSESP) (2011-2-15). Here, we present the draft genome assembly of N. yunnanensis. Using 10X Genomics linked-reads sequencing data, we carried out the de novo assembly and annotation analysis. The N. yunnanensis genome assembly is 1475 Mb in length, containing 288,519 scaffolds with a scaffold N50 length of 985.59 kb. Within the assembled genome, 799.51 Mb was identified as repetitive elements, accounting for 54.24% of the sequenced genome, and a total of 39,803 protein-coding genes were predicted. With the genomic characteristics of N. yunnanensis available, our study might facilitate future conservation biology studies to help protect this extremely threatened tree species.


2018 ◽  
Vol 6 (14) ◽  
Author(s):  
Ellie E. Armstrong ◽  
Stefan Prost ◽  
Damien Ertz ◽  
Martin Westberg ◽  
Andreas Frisch ◽  
...  

ABSTRACT We report here the draft de novo genome assembly, transcriptome assembly, and annotation of the lichen-forming fungus Arthonia radiata (Pers.) Ach., the type species for Arthoniomycetes, a class of lichen-forming, lichenicolous, and saprobic Ascomycota. The genome was assembled using overlapping paired-end and mate pair libraries and sequenced on an Illumina HiSeq 2500 instrument.


Sign in / Sign up

Export Citation Format

Share Document