scholarly journals The chromosome-level Stevia genome provides insights into steviol glycoside biosynthesis

2021 ◽  
Vol 8 (1) ◽  
Author(s):  
Xiaoyang Xu ◽  
Haiyan Yuan ◽  
Xiaqing Yu ◽  
Suzhen Huang ◽  
Yuming Sun ◽  
...  

AbstractStevia (Stevia rebaudiana Bertoni) is well known for its very sweet steviol glycosides (SGs) consisting of a common tetracyclic diterpenoid steviol backbone and a variable glycone. Steviol glycosides are 150–300 times sweeter than sucrose and are used as natural zero-calorie sweeteners. However, the most promising compounds are biosynthesized in small amounts. Based on Illumina, PacBio, and Hi-C sequencing, we constructed a chromosome-level assembly of Stevia covering 1416 Mb with a contig N50 value of 616.85 kb and a scaffold N50 value of 106.55 Mb. More than four-fifths of the Stevia genome consisted of repetitive elements. We annotated 44,143 high-confidence protein-coding genes in the high-quality genome. Genome evolution analysis suggested that Stevia and sunflower diverged ~29.4 million years ago (Mya), shortly after the whole-genome duplication (WGD) event (WGD-2, ~32.1 Mya) that occurred in their common ancestor. Comparative genomic analysis revealed that the expanded genes in Stevia were mainly enriched for biosynthesis of specialized metabolites, especially biosynthesis of terpenoid backbones, and for further oxidation and glycosylation of these compounds. We further identified all candidate genes involved in SG biosynthesis. Collectively, our current findings on the Stevia reference genome will be very helpful for dissecting the evolutionary history of Stevia and for discovering novel genes contributing to SG biosynthesis and other important agronomic traits in future breeding programs.

2020 ◽  
Vol 7 (1) ◽  
Author(s):  
Ying Li ◽  
Gao-Feng Liu ◽  
Li-Ming Ma ◽  
Tong-Kun Liu ◽  
Chang-Wei Zhang ◽  
...  

AbstractNon-heading Chinese cabbage (NHCC) is an important leafy vegetable cultivated worldwide. Here, we report the first high-quality, chromosome-level genome of NHCC001 based on PacBio, Hi-C, and Illumina sequencing data. The assembled NHCC001 genome is 405.33 Mb in size with a contig N50 of 2.83 Mb and a scaffold N50 of 38.13 Mb. Approximately 53% of the assembled genome is composed of repetitive sequences, among which long terminal repeats (LTRs, 20.42% of the genome) are the most abundant. Using Hi-C data, 97.9% (396.83 Mb) of the sequences were assigned to 10 pseudochromosomes. Genome assessment showed that this B. rapa NHCC001 genome assembly is of better quality than other currently available B. rapa assemblies and that it contains 48,158 protein-coding genes, 99.56% of which are annotated in at least one functional database. Comparative genomic analysis confirmed that B. rapa NHCC001 underwent a whole-genome triplication (WGT) event shared with other Brassica species that occurred after the WGD events shared with Arabidopsis. Genes related to ascorbic acid metabolism showed little variation among the three B. rapa subspecies. The numbers of genes involved in glucosinolate biosynthesis and catabolism were higher in NHCC001 than in Chiifu and Z1, due primarily to tandem duplication. The newly assembled genome will provide an important resource for research on B. rapa, especially B. rapa ssp. chinensis.


2020 ◽  
Vol 10 (10) ◽  
pp. 3565-3574
Author(s):  
Jinping Wang ◽  
Shoule Tian ◽  
Xiaoli Sun ◽  
Xinchao Cheng ◽  
Naibin Duan ◽  
...  

The Chinese chestnut (Castanea mollissima Bl.) is a woody nut crop with a high ecological value. Although many cultivars have been selected from natural seedlings, elite lines with comprehensive agronomic traits and characters remain rare. To explore genetic resources with aid of whole genome sequence will play important roles in modern breeding programs for chestnut. In this study, we generated a high-quality C. mollissima genome assembly by combining 90× Pacific Biosciences long read and 170× high-throughput chromosome conformation capture data. The assembly was 688.93 Mb in total, with a contig N50 of 2.83 Mb. Most of the assembled sequences (99.75%) were anchored onto 12 chromosomes, and 97.07% of the assemblies were accurately anchored and oriented. A total of 33,638 protein-coding genes were predicted in the C. mollissima genome. Comparative genomic and transcriptomic analyses provided insights into the genes expressed in specific tissues, as well as those associated with burr development in the Chinese chestnut. This highly contiguous assembly of the C. mollissima genome provides a valuable resource for studies aiming at identifying and characterizing agronomical-important traits, and will aid the design of breeding strategies to develop more focused, faster, and predictable improvement programs.


mSystems ◽  
2020 ◽  
Vol 5 (6) ◽  
Author(s):  
Liangzhi Li ◽  
Zhenghua Liu ◽  
Min Zhang ◽  
Delong Meng ◽  
Xueduan Liu ◽  
...  

ABSTRACT Here, we report three new Acidiphilium genomes, reclassified existing Acidiphilium species, and performed the first comparative genomic analysis on Acidiphilium in an attempt to address the metabolic potential, ecological functions, and evolutionary history of the genus Acidiphilium. In the genomes of Acidiphilium, we found an abundant repertoire of horizontally transferred genes (HTGs) contributing to environmental adaption and metabolic expansion, including genes conferring photosynthesis (puf, puh), CO2 assimilation (rbc), capacity for methane metabolism (mmo, mdh, frm), nitrogen source utilization (nar, cyn, hmp), sulfur compound utilization (sox, psr, sqr), and multiple metal and osmotic stress resistance capacities (czc, cop, ect). Additionally, the predicted donors of horizontal gene transfer were present in a cooccurrence network of Acidiphilium. Genome-scale positive selection analysis revealed that 15 genes contained adaptive mutations, most of which were multifunctional and played critical roles in the survival of extreme conditions. We proposed that Acidiphilium originated in mild conditions and adapted to extreme environments such as acidic mineral sites after the acquisition of many essential functions. IMPORTANCE Extremophiles, organisms that thrive in extreme environments, are key models for research on biological adaption. They can provide hints for the origin and evolution of life, as well as improve the understanding of biogeochemical cycling of elements. Extremely acidophilic bacteria such as Acidiphilium are widespread in acid mine drainage (AMD) systems, but the metabolic potential, ecological functions, and evolutionary history of this genus are still ambiguous. Here, we sequenced the genomes of three new Acidiphilium strains and performed comparative genomic analysis on this extremely acidophilic bacterial genus. We found in the genomes of Acidiphilium an abundant repertoire of horizontally transferred genes (HTGs) contributing to environmental adaption and metabolic ability expansion, as indicated by phylogenetic reconstruction and gene context comparison. This study has advanced our understanding of microbial evolution and biogeochemical cycling in extreme niches.


2020 ◽  
Vol 7 (1) ◽  
Author(s):  
Xian-Gui Yi ◽  
Xia-Qing Yu ◽  
Jie Chen ◽  
Min Zhang ◽  
Shao-Wei Liu ◽  
...  

Abstract Cerasus serrulata is a flowering cherry germplasm resource for ornamental purposes. In this work, we present a de novo chromosome-scale genome assembly of C. serrulata by the use of Nanopore and Hi-C sequencing technologies. The assembled C. serrulata genome is 265.40 Mb across 304 contigs and 67 scaffolds, with a contig N50 of 1.56 Mb and a scaffold N50 of 31.12 Mb. It contains 29,094 coding genes, 27,611 (94.90%) of which are annotated in at least one functional database. Synteny analysis indicated that C. serrulata and C. avium have 333 syntenic blocks composed of 14,072 genes. Blocks on chromosome 01 of C. serrulata are distributed on all chromosomes of C. avium, implying that chromosome 01 is the most ancient or active of the chromosomes. The comparative genomic analysis confirmed that C. serrulata has 740 expanded gene families, 1031 contracted gene families, and 228 rapidly evolving gene families. By the use of 656 single-copy orthologs, a phylogenetic tree composed of 10 species was constructed. The present C. serrulata species diverged from Prunus yedoensis ~17.34 million years ago (Mya), while the divergence of C. serrulata and C. avium was estimated to have occurred ∼21.44 Mya. In addition, a total of 148 MADS-box family gene members were identified in C. serrulata, accompanying the loss of the AGL32 subfamily and the expansion of the SVP subfamily. The MYB and WRKY gene families comprising 372 and 66 genes could be divided into seven and eight subfamilies in C. serrulata, respectively, based on clustering analysis. Nine hundred forty-one plant disease-resistance genes (R-genes) were detected by searching C. serrulata within the PRGdb. This research provides high-quality genomic information about C. serrulata as well as insights into the evolutionary history of Cerasus species.


2014 ◽  
Vol 2014 ◽  
pp. 1-10 ◽  
Author(s):  
Dmitrii E. Polev ◽  
Iuliia K. Karnaukhova ◽  
Larisa L. Krukovskaya ◽  
Andrei P. Kozlov

Human geneLOC100505644 uncharacterized LOC100505644 [Homo sapiens](Entrez Gene ID 100505644) is abundantly expressed in tumors but weakly expressed in few normal tissues. Till now the function of this gene remains unknown. Here we identified the chromosomal borders of the transcribed region and the major splice form of theLOC100505644-specific transcript. We characterised the major regulatory motifs of the gene and its splice sites. Analysis of the secondary structure of the major transcript variant revealed a hairpin-like structure characteristic for precursor microRNAs. Comparative genomic analysis of the locus showed that it originated in primatesde novo. Taken together, our data indicate that human geneLOC100505644encodes some non-protein coding RNA, likely a microRNA. It was assigned a gene symbolELFN1-AS1(ELFN1 antisense RNA 1 (non-protein coding)). This gene combines features of evolutionary novelty and predominant expression in tumors.


2016 ◽  
Vol 371 (1701) ◽  
pp. 20150442 ◽  
Author(s):  
Eugene V. Koonin

The history of life is punctuated by evolutionary transitions which engender emergence of new levels of biological organization that involves selection acting at increasingly complex ensembles of biological entities. Major evolutionary transitions include the origin of prokaryotic and then eukaryotic cells, multicellular organisms and eusocial animals. All or nearly all cellular life forms are hosts to diverse selfish genetic elements with various levels of autonomy including plasmids, transposons and viruses. I present evidence that, at least up to and including the origin of multicellularity, evolutionary transitions are driven by the coevolution of hosts with these genetic parasites along with sharing of ‘public goods’. Selfish elements drive evolutionary transitions at two distinct levels. First, mathematical modelling of evolutionary processes, such as evolution of primitive replicator populations or unicellular organisms, indicates that only increasing organizational complexity, e.g. emergence of multicellular aggregates, can prevent the collapse of the host–parasite system under the pressure of parasites. Second, comparative genomic analysis reveals numerous cases of recruitment of genes with essential functions in cellular life forms, including those that enable evolutionary transitions. This article is part of the themed issue ‘The major synthetic evolutionary transitions’.


Insects ◽  
2021 ◽  
Vol 12 (8) ◽  
pp. 754
Author(s):  
Yupeng Wu ◽  
Hui Fang ◽  
Jiping Wen ◽  
Juping Wang ◽  
Tianwen Cao ◽  
...  

In this study, the complete mitochondrial genomes (mitogenomes) of Hestina persimilis and Hestinalis nama (Nymphalidae: Apaturinae)were acquired. The mitogenomes of H. persimilis and H. nama are 15,252 bp and 15,208 bp in length, respectively. These two mitogenomes have the typical composition, including 37 genes and a control region. The start codons of the protein-coding genes (PCGs) in the two mitogenomes are the typical codon pattern ATN, exceptCGA in the cox1 gene. Twenty-one tRNA genes show a typical clover leaf structure, however, trnS1(AGN) lacks the dihydrouridine (DHU) stem. The secondary structures of rrnL and rrnS of two species were predicted, and there are several new stem loops near the 5’ of rrnL secondary structure. Based on comparative genomic analysis, four similar conservative structures can be found in the control regions of these two mitogenomes. The phylogenetic analyses were performed on mitogenomes of Nymphalidae. The phylogenetic trees show that the relationships among Nymphalidae are generally identical to previous studies, as follows: Libytheinae\Danainae + ((Calinaginae + Satyrinae) + Danainae\Libytheinae + ((Heliconiinae + Limenitidinae) + (Nymphalinae + (Apaturinae + Biblidinae)))). Hestinalisnama isapart fromHestina, andclosely related to Apatura, forming monophyly.


Author(s):  
James Fulton ◽  
Jeremy Brawner ◽  
Jose Huguet-Tapia ◽  
Katherine E Smith ◽  
Randy Fernandez ◽  
...  

Fusarium wilt, caused by Fusarium oxysporum f. sp. niveum (Fon), is a soilborne disease which significantly limits yield in watermelon (Citrullus lanatus) and occasionally causes the loss of an entire year’s harvest. Reference-quality de novo genomic assemblies of pathogenic and non-pathogenic strains were generated using a combination of next-generation and third-generation sequencing technologies. Chromosomal-level genomes were produced with representatives from all Fon races facilitating comparative genomic analysis and the identification of chromosomal structural variation . Syntenic analysis between isolates allowed differentiation of the core and lineage-specific portions of their genomes. This research will support future efforts to refine the scientific understanding of the molecular and genetic factors underpinning the Fon host range, develop diagnostic assays for each of the four races, and decipher the evolutionary history of race 3.


Sign in / Sign up

Export Citation Format

Share Document