scholarly journals Chromosomal-level assembly of Juglans sigillata genome using Nanopore, BioNano, and Hi-C analysis

GigaScience ◽  
2020 ◽  
Vol 9 (2) ◽  
Author(s):  
De-Lu Ning ◽  
Tao Wu ◽  
Liang-Jun Xiao ◽  
Ting Ma ◽  
Wen-Liang Fang ◽  
...  

Abstract Background Juglans sigillata, or iron walnut, belonging to the order Juglandales, is an economically important tree species in Asia, especially in the Yunnan province of China. However, little research has been conducted on J. sigillata at the molecular level, which hinders understanding of its evolution, speciation, and synthesis of secondary metabolites, as well as its wide adaptability to its plateau environment. To address these issues, a high-quality reference genome of J. sigillata would be useful. Findings To construct a high-quality reference genome for J. sigillata, we first generated 38.0 Gb short reads and 66.31 Gb long reads using Illumina and Nanopore sequencing platforms, respectively. The sequencing data were assembled into a 536.50-Mb genome assembly with a contig N50 length of 4.31 Mb. Additionally, we applied BioNano technology to identify contacts among contigs, which were then used to assemble contigs into scaffolds, resulting in a genome assembly with scaffold N50 length of 16.43 Mb and contig N50 length of 4.34 Mb. To obtain a chromosome-level genome assembly, we constructed 1 Hi-C library and sequenced 79.97 Gb raw reads using the Illumina HiSeq platform. We anchored ∼93% of the scaffold sequences into 16 chromosomes and evaluated the quality of our assembly using the high contact frequency heat map. Repetitive elements account for 50.06% of the genome, and 30,387 protein-coding genes were predicted from the genome, of which 99.8% have been functionally annotated. The genome-wide phylogenetic tree indicated an estimated divergence time between J. sigillata and Juglans regia of 49 million years ago on the basis of single-copy orthologous genes. Conclusions We provide the first chromosome-level genome for J. sigillata. It will lay a valuable foundation for future research on the genetic improvement of J. sigillata.

2021 ◽  
Author(s):  
Xinxin Yi ◽  
Jing Liu ◽  
Shengcai Chen ◽  
Hao Wu ◽  
Min Liu ◽  
...  

Cultivated soybean (Glycine max) is an important source for protein and oil. Many elite cultivars with different traits have been developed for different conditions. Each soybean strain has its own genetic diversity, and the availability of more high-quality soybean genomes can enhance comparative genomic analysis for identifying genetic underpinnings for its unique traits. In this study, we constructed a high-quality de novo assembly of an elite soybean cultivar Jidou 17 (JD17) with chromsome contiguity and high accuracy. We annotated 52,840 gene models and reconstructed 74,054 high-quality full-length transcripts. We performed a genome-wide comparative analysis based on the reference genome of JD17 with three published soybeans (WM82, ZH13 and W05) , which identified five large inversions and two large translocations specific to JD17, 20,984 - 46,912 PAVs spanning 13.1 - 46.9 Mb in size, and 5 - 53 large PAV clusters larger than 500kb. 1,695,741 - 3,664,629 SNPs and 446,689 - 800,489 Indels were identified and annotated between JD17 and them. Symbiotic nitrogen fixation (SNF) genes were identified and the effects from these variants were further evaluated. It was found that the coding sequences of 9 nitrogen fixation-related genes were greatly affected. The high-quality genome assembly of JD17 can serve as a valuable reference for soybean functional genomics research.


Genes ◽  
2021 ◽  
Vol 12 (12) ◽  
pp. 1873
Author(s):  
Yang Yang ◽  
Lina Wu ◽  
Zhuoying Weng ◽  
Xi Wu ◽  
Xi Wang ◽  
...  

The humpback grouper (Cromileptes altivelis), an Epinephelidae species, is patchily distributed in the reef habitats of Western Pacific water. This grouper possesses a remarkably different body shape and notably low growth rate compared with closely related grouper species. For promoting further research of the grouper, in the present study, a high-quality chromosome-level genome of humpback grouper was assembled using PacBio sequencing and high-throughput chromatin conformation capture (Hi-C) technology. The assembled genome was 1.013 Gb in size with 283 contigs, of which, a total of 143 contigs with 1.011 Gb in size were correctly anchored into 24 chromosomes. Moreover, a total of 26,037 protein-coding genes were predicted, of them, 25,243 (96.95%) genes could be functionally annotated. The high-quality chromosome-level genome assembly will provide pivotal genomic information for future research of the speciation, evolution and molecular-assisted breeding in humpback groupers. In addition, phylogenetic analysis based on shared single-copy orthologues of the grouper species showed that the humpback grouper is included in the Epinephelus genus and clustered with the giant grouper in one clade with a divergence time of 9.86 Myr. In addition, based on the results of collinearity analysis, a gap in chromosome 6 of the humpback grouper was detected; the missed genes were mainly associated with immunity, substance metabolism and the MAPK signal pathway. The loss of the parts of genes involved in these biological processes might affect the disease resistance, stress tolerance and growth traits in humpback groupers. The present research will provide new insight into the evolution and origin of the humpback grouper.


GigaScience ◽  
2019 ◽  
Vol 8 (9) ◽  
Author(s):  
Yongxin Li ◽  
Yandong Ren ◽  
Dongru Zhang ◽  
Hui Jiang ◽  
Zhongkai Wang ◽  
...  

Abstract Background The mustache toad, Vibrissaphora ailaonica, is endemic to China and belongs to the Megophryidae family. Like other mustache toad species, V. ailaonica males temporarily develop keratinized nuptial spines on their upper jaw during each breeding season, which fall off at the end of the breeding season. This feature is likely result of the reversal of sexual dimorphism in body size, with males being larger than females. A high-quality reference genome for the mustache toad would be invaluable to investigate the genetic mechanism underlying these repeatedly developing keratinized spines. Findings To construct the mustache toad genome, we generated 225 Gb of short reads and 277 Gb of long reads using Illumina and Pacific Biosciences (PacBio) sequencing technologies, respectively. Sequencing data were assembled into a 3.53-Gb genome assembly, with a contig N50 length of 821 kb. We also used high-throughput chromosome conformation capture (Hi-C) technology to identify contacts between contigs, then assembled contigs into scaffolds and assembled a genome with 13 chromosomes and a scaffold N50 length of 412.42 Mb. Based on the 26,227 protein-coding genes annotated in the genome, we analyzed phylogenetic relationships between the mustache toad and other chordate species. The mustache toad has a relatively higher evolutionary rate and separated from a common ancestor of the marine toad, bullfrog, and Tibetan frog 206.1 million years ago. Furthermore, we identified 201 expanded gene families in the mustache toad, which were mainly enriched in immune pathway, keratin filament, and metabolic processes. Conclusions Using Illumina, PacBio, and Hi-C technologies, we constructed the first high-quality chromosome-level mustache toad genome. This work not only offers a valuable reference genome for functional studies of mustache toad traits but also provides important chromosomal information for wider genome comparisons.


GigaScience ◽  
2021 ◽  
Vol 10 (1) ◽  
Author(s):  
Monica M Sheffer ◽  
Anica Hoppe ◽  
Henrik Krehenwinkel ◽  
Gabriele Uhl ◽  
Andreas W Kuss ◽  
...  

Abstract Background Argiope bruennichi, the European wasp spider, has been investigated intensively as a focal species for studies on sexual selection, chemical communication, and the dynamics of rapid range expansion at a behavioral and genetic level. However, the lack of a reference genome has limited insights into the genetic basis for these phenomena. Therefore, we assembled a high-quality chromosome-level reference genome of the European wasp spider as a tool for more in-depth future studies. Findings We generated, de novo, a 1.67 Gb genome assembly of A. bruennichi using 21.8× Pacific Biosciences sequencing, polished with 19.8× Illumina paired-end sequencing data, and proximity ligation (Hi-C)-based scaffolding. This resulted in an N50 scaffold size of 124 Mb and an N50 contig size of 288 kb. We found 98.4% of the genome to be contained in 13 scaffolds, fitting the expected number of chromosomes (n = 13). Analyses showed the presence of 91.1% of complete arthropod BUSCOs, indicating a high-quality assembly. Conclusions We present the first chromosome-level genome assembly in the order Araneae. With this genomic resource, we open the door for more precise and informative studies on evolution and adaptation not only in A. bruennichi but also in arachnids overall, shedding light on questions such as the genomic architecture of traits, whole-genome duplication, and the genomic mechanisms behind silk and venom evolution.


2021 ◽  
Author(s):  
Yikun Zhao ◽  
Yuancong Wang ◽  
De Ma ◽  
Guang Feng ◽  
Yongxue Huo ◽  
...  

AbstractThe maize cultivar Dan340 is an excellent backbone inbred line of Luda Red Cob Group with several desirable characters, such as disease resistance, lodging resistance, high combining ability, wide adaptability and so on. In this study, we constructed a high-quality chromosome-level reference genome for Dan340 by combining PacBio long HiFi sequencing, Illumina short reads and chromosomal conformational capture (Hi-C) sequencing reads. The final assembly of Dan340 genome was 2,348.72 Mb, including 2,738 contigs and 2,315 scaffolds with a N50 of 41.49 Mb and 215.35 Mb, respectively. The percent of high quality Illumina reads mapped to the reference genome was up to 97.48%. The assembly of this genome will not only facilitate our understanding about intraspecific genome diversity in maize, but also provides a novel resource for maize breeding.Key findingsThe final assembly of Dan340 genome was 2,348.72 Mb, including 2,738 contigs and 2,315 scaffolds with a N50 of 41.49 Mb and 215.35 Mb, respectively.The percent of reads mapped to the reference genome was up to 97.48%.The results showed that 96.84% of the plant single-copy orthologues were complete. Complete single-copy and multi copy genes accounted for 87.36% and 9.48% of the genes, respectively. Taken together, these results indicated that our Dan340 genome assembly presented high quality and coverage.


Author(s):  
Monica M. Sheffer ◽  
Anica Hoppe ◽  
Henrik Krehenwinkel ◽  
Gabriele Uhl ◽  
Andreas W. Kuss ◽  
...  

AbstractBackgroundArgiope bruennichi, the European wasp spider, has been studied intensively as to sexual selection, chemical communication, and the dynamics of rapid range expansion at a behavioral and genetic level. However, the lack of a reference genome has limited insights into the genetic basis for these phenomena. Therefore, we assembled a high-quality chromosome-level reference genome of the European wasp spider as a tool for more in-depth future studies.FindingsWe generated, de novo, a 1.67Gb genome assembly of A. bruennichi using 21.5X PacBio sequencing, polished with 30X Illumina paired-end sequencing data, and proximity ligation (Hi-C) based scaffolding. This resulted in an N50 scaffold size of 124Mb and an N50 contig size of 288kb. We found 98.4% of the genome to be contained in 13 scaffolds, fitting the expected number of chromosomes (n = 13). Analyses showed the presence of 91.1% of complete arthropod BUSCOs, indicating a high quality of the assembly.ConclusionsWe present the first chromosome-level genome assembly in the class Arachnida. With this genomic resource, we open the door for more precise and informative studies on evolution and adaptation in A. bruennichi, as well as on several interesting topics in Arachnids, such as the genomic architecture of traits, whole genome duplication and the genomic mechanisms behind silk and venom evolution.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Guifang Lin ◽  
Cheng He ◽  
Jun Zheng ◽  
Dal-Hoe Koo ◽  
Ha Le ◽  
...  

Abstract Background The maize inbred line A188 is an attractive model for elucidation of gene function and improvement due to its high embryogenic capacity and many contrasting traits to the first maize reference genome, B73, and other elite lines. The lack of a genome assembly of A188 limits its use as a model for functional studies. Results Here, we present a chromosome-level genome assembly of A188 using long reads and optical maps. Comparison of A188 with B73 using both whole-genome alignments and read depths from sequencing reads identify approximately 1.1 Gb of syntenic sequences as well as extensive structural variation, including a 1.8-Mb duplication containing the Gametophyte factor1 locus for unilateral cross-incompatibility, and six inversions of 0.7 Mb or greater. Increased copy number of carotenoid cleavage dioxygenase 1 (ccd1) in A188 is associated with elevated expression during seed development. High ccd1 expression in seeds together with low expression of yellow endosperm 1 (y1) reduces carotenoid accumulation, accounting for the white seed phenotype of A188. Furthermore, transcriptome and epigenome analyses reveal enhanced expression of defense pathways and altered DNA methylation patterns of the embryonic callus. Conclusions The A188 genome assembly provides a high-resolution sequence for a complex genome species and a foundational resource for analyses of genome variation and gene function in maize. The genome, in comparison to B73, contains extensive intra-species structural variations and other genetic differences. Expression and network analyses identify discrete profiles for embryonic callus and other tissues.


2019 ◽  
Vol 19 (2) ◽  
pp. 485-496 ◽  
Author(s):  
Wenbo Chen ◽  
Xiaowei Yang ◽  
Guillaume Tetreau ◽  
Xiaozhao Song ◽  
Cathy Coutu ◽  
...  

Author(s):  
Yuanchao Liu ◽  
Longhua Huang ◽  
Huiping Hu ◽  
Manjun Cai ◽  
Xiaowei Liang ◽  
...  

Abstract Ganoderma leucocontextum, a newly discovered species of Ganodermataceae in China, has diverse pharmacological activities. G. leucocontextum was widely cultivated in southwest China, but the systematic genetic study has been impeded by the lack of a reference genome. Herein, we present the first whole-genome assembly of G. leucocontextum based on the Illumina and Nanopore platform from high-quality DNA extracted from a monokaryon strain (DH-8). The generated genome was 50.05 Mb in size with a N50 scaffold size of 3.06 Mb, 78,206 coding sequences and 13,390 putative genes. Genome completeness was assessed using the Benchmarking Universal Single-Copy Orthologs (BUSCO) tool, which identified 96.55% of the 280 Fungi BUSCO genes. Furthermore, differences in functional genes of secondary metabolites (terpenoids) were analyzed between G. leucocontextum and G. lucidum. G. leucocontextum has more genes related to terpenoids synthesis compared to G. lucidum, which may be one of the reasons why they exhibit different biological activities. This is the first genome assembly and annotation for G. leucocontextum, which would enrich the toolbox for biological and genetic studies in G. leucocontextum.


Sign in / Sign up

Export Citation Format

Share Document