scholarly journals Complete Chloroplast Genomes of Fagus sylvatica L. Reveal Sequence Conservation in the Inverted Repeat and the Presence of Allelic Variation in NUPTs

Genes ◽  
2021 ◽  
Vol 12 (9) ◽  
pp. 1357
Author(s):  
Bartosz Ulaszewski ◽  
Joanna Meger ◽  
Bagdevi Mishra ◽  
Marco Thines ◽  
Jarosław Burczyk

Growing amounts of genomic data and more efficient assembly tools advance organelle genomics at an unprecedented scale. Genomic resources are increasingly used for phylogenetic analyses of many plant species, but are less frequently used to investigate within-species variability and phylogeography. In this study, we investigated genetic diversity of Fagus sylvatica, an important broadleaved tree species of European forests, based on complete chloroplast genomes of 18 individuals sampled widely across the species distribution. Our results confirm the hypothesis of a low cpDNA diversity in European beech. The chloroplast genome size was remarkably stable (158,428 ± 37 bp). The polymorphic markers, 12 microsatellites (SSR), four SNPs and one indel, were found only in the single copy regions, while inverted repeat regions were monomorphic both in terms of length and sequence, suggesting highly efficient suppression of mutation. The within-individual analysis of polymorphisms showed >9k of markers which were proportionally present in gene and non-gene areas. However, an investigation of the frequency of alternate alleles revealed that the source of this diversity originated likely from nuclear-encoded plastome remnants (NUPTs). Phylogeographic and Mantel correlation analysis based on the complete chloroplast genomes exhibited clustering of individuals according to geographic distance in the first distance class, suggesting that the novel markers and in particular the cpSSRs could provide a more detailed picture of beech population structure in Central Europe.

Forests ◽  
2021 ◽  
Vol 12 (2) ◽  
pp. 180
Author(s):  
Bagdevi Mishra ◽  
Bartosz Ulaszewski ◽  
Sebastian Ploch ◽  
Jaroslaw Burczyk ◽  
Marco Thines

Chloroplasts are difficult to assemble because of the presence of large inverted repeats. At the same time, correct assemblies are important, as chloroplast loci are frequently used for biogeography and population genetics studies. In an attempt to elucidate the orientation of the single-copy regions and to find suitable loci for chloroplast single nucleotide polymorphism (SNP)-based studies, circular chloroplast sequences for the ultra-centenary reference individual of European Beech (Fagus sylvatica), Bhaga, and an additional Polish individual (named Jamy) was obtained based on hybrid assemblies. The chloroplast genome of Bhaga was 158,458 bp, and that of Jamy was 158,462 bp long. Using long-read mapping on the configuration inferred in this study and the one suggested in a previous study, we found an inverted orientation of the small single-copy region. The chloroplast genome of Bhaga and of the individual from Poland both have only two mismatches as well as three and two indels as compared to the previously published genome, respectively. The low divergence suggests low seed dispersal but high pollen dispersal. However, once chloroplast genomes become available from Pleistocene refugia, where a high degree of variation has been reported, they might prove useful for tracing the migration history of Fagus sylvatica in the Holocene.


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Yan-Yan Guo ◽  
Jia-Xing Yang ◽  
Ming-Zhu Bai ◽  
Guo-Qiang Zhang ◽  
Zhong-Jian Liu

Abstract Background Paphiopedilum is the largest genus of slipper orchids. Previous studies showed that the phylogenetic relationships of this genus are not well resolved, and sparse taxon sampling documented inverted repeat (IR) expansion and small single copy (SSC) contraction of the chloroplast genomes of Paphiopedilum. Results Here, we sequenced, assembled, and annotated 77 plastomes of Paphiopedilum species (size range of 152,130 – 164,092 bp). The phylogeny based on the plastome resolved the relationships of the genus except for the phylogenetic position of two unstable species. We used phylogenetic and comparative genomic approaches to elucidate the plastome evolution of Paphiopedilum. The plastomes of Paphiopedilum have a conserved genome structure and gene content except in the SSC region. The large single copy/inverted repeat (LSC/IR) boundaries are relatively stable, while the boundaries of the inverted repeat and small single copy region (IR/SSC) varied among species. Corresponding to the IR/SSC boundary shifts, the chloroplast genomes of the genus experienced IR expansion and SSC contraction. The IR region incorporated one to six genes of the SSC region. Unexpectedly, great variation in the size, gene order, and gene content of the SSC regions was found, especially in the subg. Parvisepalum. Furthermore, Paphiopedilum provides evidence for the ongoing degradation of the ndh genes in the photoautotrophic plants. The estimated substitution rates of the protein coding genes show accelerated rates of evolution in clpP, psbH, and psbZ. Genes transferred to the IR region due to the boundary shift also have higher substitution rates. Conclusions We found IR expansion and SSC contraction in the chloroplast genomes of Paphiopedilum with dense sampling, and the genus shows variation in the size, gene order, and gene content of the SSC region. This genus provides an ideal system to investigate the dynamics of plastome evolution.


Plants ◽  
2020 ◽  
Vol 9 (1) ◽  
pp. 61 ◽  
Author(s):  
Huyen-Trang Vu ◽  
Ngan Tran ◽  
Thanh-Diem Nguyen ◽  
Quoc-Luan Vu ◽  
My-Huyen Bui ◽  
...  

Paphiopedilum delenatii is a native orchid of Vietnam with highly attractive floral traits. Unfortunately, it is now listed as a critically endangered species with a few hundred individuals remaining in nature. In this study, we performed next-generation sequencing of P. delenatii and assembled its complete chloroplast genome. The whole chloroplast genome of P. delenatii was 160,955 bp in size, 35.6% of which was GC content, and exhibited typical quadripartite structure of plastid genomes with four distinct regions, including the large and small single-copy regions and a pair of inverted repeat regions. There were, in total, 130 genes annotated in the genome: 77 coding genes, 39 tRNA genes, 8 rRNA genes, and 6 pseudogenes. The loss of ndh genes and variation in inverted repeat (IR) boundaries as well as data of simple sequence repeats (SSRs) and divergent hotspots provided useful information for identification applications and phylogenetic studies of Paphiopedilum species. Whole chloroplast genomes could be used as an effective super barcode for species identification or for developing other identification markers, which subsequently serves the conservation of Paphiopedilum species.


2012 ◽  
Vol 5 (5) ◽  
pp. 509-520 ◽  
Author(s):  
Tristan Cordier ◽  
Cécile Robin ◽  
Xavier Capdevielle ◽  
Marie-Laure Desprez-Loustau ◽  
Corinne Vacher

2008 ◽  
Vol 8 (1) ◽  
pp. 36 ◽  
Author(s):  
Rui-Jiang Wang ◽  
Chiao-Lei Cheng ◽  
Ching-Chun Chang ◽  
Chun-Lin Wu ◽  
Tian-Mu Su ◽  
...  

PeerJ ◽  
2018 ◽  
Vol 6 ◽  
pp. e6032 ◽  
Author(s):  
Zhenyu Zhao ◽  
Xin Wang ◽  
Yi Yu ◽  
Subo Yuan ◽  
Dan Jiang ◽  
...  

Dioscorea L., the largest genus of the family Dioscoreaceae with over 600 species, is not only an important food but also a medicinal plant. The identification and classification of Dioscorea L. is a rather difficult task. In this study, we sequenced five Dioscorea chloroplast genomes, and analyzed with four other chloroplast genomes of Dioscorea species from GenBank. The Dioscorea chloroplast genomes displayed the typical quadripartite structure of angiosperms, which consisted of a pair of inverted repeats separated by a large single-copy region, and a small single-copy region. The location and distribution of repeat sequences and microsatellites were determined, and the rapidly evolving chloroplast genome regions (trnK-trnQ, trnS-trnG, trnC-petN, trnE-trnT, petG-trnW-trnP, ndhF, trnL-rpl32, and ycf1) were detected. Phylogenetic relationships of Dioscorea inferred from chloroplast genomes obtained high support even in shortest internodes. Thus, chloroplast genome sequences provide potential molecular markers and genomic resources for phylogeny and species identification.


Plants ◽  
2021 ◽  
Vol 10 (2) ◽  
pp. 397
Author(s):  
Kyoung Su Choi ◽  
Young-Ho Ha ◽  
Hee-Young Gil ◽  
Kyung Choi ◽  
Dong-Kap Kim ◽  
...  

Previous studies on the chloroplast genome in Clematis focused on the chloroplast structure within Anemoneae. The chloroplast genomes of Cleamtis were sequenced to provide information for studies on phylogeny and evolution. Two Korean endemic Clematis chloroplast genomes (Clematis brachyura and C. trichotoma) range from 159,170 to 159,532 bp, containing 134 identical genes. Comparing the coding and non-coding regions among 12 Clematis species revealed divergent sites, with carination occurring in the petD-rpoA region. Comparing other Clematis chloroplast genomes suggested that Clematis has two inversions (trnH-rps16 and rps4), reposition (trnL-ndhC), and inverted repeat (IR) region expansion. For phylogenetic analysis, 71 protein-coding genes were aligned from 36 Ranunculaceae chloroplast genomes. Anemoneae (Anemoclema, Pulsatilla, Anemone, and Clematis) clades were monophyletic and well-supported by the bootstrap value (100%). Based on 70 chloroplast protein-coding genes, we compared nonsynonymous (dN) and synonymous (dS) substitution rates among Clematis, Anemoneae (excluding Clematis), and other Ranunculaceae species. The average synonymoussubstitution rates (dS)of large single copy (LSC), small single copy (SSC), and IR genes in Anemoneae and Clematis were significantly higher than those of other Ranunculaceae species, but not the nonsynonymous substitution rates (dN). This study provides fundamental information on plastid genome evolution in the Ranunculaceae.


Plants ◽  
2020 ◽  
Vol 9 (2) ◽  
pp. 199 ◽  
Author(s):  
Arif Khan ◽  
Sajjad Asaf ◽  
Abdul Latif Khan ◽  
Tariq Shehzad ◽  
Ahmed Al-Rawahi ◽  
...  

Euphorbia is one of the largest genera in the Euphorbiaceae family, comprising 2000 species possessing commercial, medicinal, and ornamental importance. However, there are very little data available on their molecular phylogeny and genomics, and uncertainties still exist at a taxonomic level. Herein, we sequence the complete chloroplast (cp) genomes of two species, E. larica and E. smithii, of the genus Euphorbia through next-generation sequencing and perform a comparative analysis with nine related genomes in the family. The results revealed that the cp genomes had similar quadripartite structure, gene content, and genome organization with previously reported genomes from the same family. The size of cp genomes ranged from 162,172 to 162,358 bp with 132 and 133 genes, 8 rRNAs, 39 tRNA in E. smithii and E. larica, respectively. The numbers of protein-coding genes were 85 and 86, with each containing 19 introns. The four-junction regions were studied and results reveal that rps19 was present at JLB (large single copy region and inverted repeat b junction) in E. larica where its complete presence was located in the IRb (inverted repeat b) region in E. smithii. The sequence comparison revealed that highly divergent regions in rpoC1, rpocB, ycf3, clpP, petD, ycf1, and ndhF of the cp genomes might provide better understanding of phylogenetic inferences in the Euphorbiaceae and order Malpighiales. Phylogenetic analyses of this study illustrate sister clades of E. smithii with E. tricullii and these species form a monophyletic clade with E. larica. The current study might help us to understand the genome architecture, genetic diversity among populations, and evolutionary depiction in the genera.


2021 ◽  
Author(s):  
Mahtab Moghaddam ◽  
Atsushi Ohta ◽  
Motoki Shimizu ◽  
Ryohei Terauchi ◽  
Shahrokh Kazempour-Osaloo

Abstract Plastid genome sequences provide valuable markers for surveying the evolutionary relationships and population genetics of plant species. In the present study, the complete plastid genome of Onobrychis gaubae, endemic to Iran, was sequenced using Illumina paired-end sequencing and was compared with previously known genomes of the IRLC species of legumes. The O. gaubae plastid genome was 123,645 bp in length and included a large single-copy (LSC) region of 81,034 bp, a small single-copy (SSC) region of 13,788 bp and one copy of the inverted repeat (IRb) of 28,823 bp. The genome encoded 110 genes, including 76 protein-coding genes, 30 transfer RNA (tRNA) genes and four ribosome RNA (rRNA) genes and possessed 89 simple sequence repeats (SSRs) and 28 repeated structures with the highest proportion in the LSC. Comparative analysis of the chloroplast genomes across IRLC revealed three hotspot genes (ycf1, ycf2, clpP) which could be used as molecular markers for resolving phylogenetic relationships and species identification. IRLC plastid genomes also showed multiple gene losses and inversions. Phylogenetic analyses revealed that O. gaubae is closely related to Hedysarum. The complete O. gaubae genome is a valuable resource for investigating evolution of Onobrychis species and can be used to identify related species.


PeerJ ◽  
2020 ◽  
Vol 8 ◽  
pp. e9132
Author(s):  
Shuilian He ◽  
Yang Yang ◽  
Ziwei Li ◽  
Xuejiao Wang ◽  
Yanbing Guo ◽  
...  

The horticulturally important genus Zantedeschia (Araceae) comprises eight species of herbaceous perennials. We sequenced, assembled and analyzed the chloroplast (cp) genomes of four species of Zantedeschia (Z. aethiopica, Z. odorata, Z. elliottiana, and Z. rehmannii) to investigate the structure of the cp genome in the genus. According to our results, the cp genome of Zantedeschia ranges in size from 169,065 bp (Z. aethiopica) to 175,906 bp (Z. elliottiana). We identified a total of 112 unique genes, including 78 protein-coding genes, 30 transfer RNA (tRNA) genes and four ribosomal RNA (rRNA) genes. Comparison of our results with cp genomes from other species in the Araceae suggests that the relatively large sizes of the Zantedeschia cp genomes may result from inverted repeats (IR) region expansion. The sampled Zantedeschia species formed a monophylogenetic clade in our phylogenetic analysis. Furthermore, the long single copy (LSC) and short single copy (SSC) regions in Zantedeschia are more divergent than the IR regions in the same genus, and non-coding regions showed generally higher divergence than coding regions. We identified a total of 410 cpSSR sites from the four Zantedeschia species studied. Genetic diversity analyses based on four polymorphic SSR markers from 134 cultivars of Zantedeschia suggested that high genetic diversity (I = 0.934; Ne = 2.371) is present in the Zantedeschia cultivars. High genetic polymorphism from the cpSSR region suggests that cpSSR could be an effective tool for genetic diversity assessment and identification of Zantedeschia varieties.


Sign in / Sign up

Export Citation Format

Share Document