Complete chloroplast genome sequences of Dioscorea: Characterization, genomic resources, and phylogenetic analyses

PeerJ ◽  
2018 ◽  
Vol 6 ◽  
pp. e6032 ◽  
Author(s):  
Zhenyu Zhao ◽  
Xin Wang ◽  
Yi Yu ◽  
Subo Yuan ◽  
Dan Jiang ◽  
...  

Dioscorea L., the largest genus of the family Dioscoreaceae with over 600 species, is an important source of both food and medicinal plants. The identification and classification of Dioscorea L. is a rather difficult task. In this study, we sequenced five Dioscorea chloroplast genomes and analyzed them together with four other chloroplast genomes of Dioscorea species from GenBank. The Dioscorea chloroplast genomes displayed the typical quadripartite structure of angiosperms, which consisted of a pair of inverted repeats separated by a large single-copy region and a small single-copy region. The location and distribution of repeat sequences and microsatellites were determined, and the rapidly evolving chloroplast genome regions (trnK-trnQ, trnS-trnG, trnC-petN, trnE-trnT, petG-trnW-trnP, ndhF, trnL-rpl32, and ycf1) were detected. Phylogenetic relationships of Dioscorea inferred from chloroplast genomes obtained high support even at the shortest internodes. Thus, chloroplast genome sequences provide potential molecular markers and genomic resources for phylogeny and species identification.

2021 ◽  
Vol 12 ◽  
Author(s):  
Yike Luo ◽  
Jian He ◽  
Rudan Lyu ◽  
Jiamin Xiao ◽  
Wenhe Li ◽  
...  

The evening primrose family, Onagraceae, is a well-defined family of the order Myrtales, comprising 22 genera widely distributed from boreal to tropical areas. In this study, we report and characterize the complete chloroplast genome sequences of 13 species in Circaea, Chamaenerion, and Epilobium using a next-generation sequencing method. We also retrieved chloroplast sequences from two other Onagraceae genera to characterize the chloroplast genome of the family. The complete chloroplast genomes of Onagraceae encoded an identical set of 112 genes (excluding duplicates), including 78 protein-coding genes, 30 transfer RNAs, and four ribosomal RNAs. Gene arrangement in the chloroplast genomes is largely conserved across the family. However, a large inverted segment was detected in the large single-copy region of all the samples of Oenothera subsect. Oenothera. Two kinds of inverted repeat (IR) region expansion were found in Oenothera, Chamaenerion, and Epilobium samples. We also compared several features of the chloroplast genomes across the Onagraceae samples, including nucleotide content, codon usage, RNA editing sites, and simple sequence repeats (SSRs). Phylogeny was inferred from the chloroplast genome data using maximum-likelihood (ML) and Bayesian inference methods. The generic relationships of Onagraceae were well resolved by the complete chloroplast genome sequences, showing potential value in inferring phylogeny within the family. Phylogenetic relationships in Oenothera were better resolved than in other densely sampled genera, such as Circaea and Epilobium. Chloroplast genomes of Oenothera subsect. Oenothera, which are biparentally inherited, share a syndrome of characteristics that deviate from the primitive pattern of the family, including a slightly expanded inverted repeat region, intron loss in clpP, and the presence of the inversion.


Molecules ◽  
2019 ◽  
Vol 24 (3) ◽  
pp. 474 ◽  
Author(s):  
Dong-Mei Li ◽  
Chao-Yi Zhao ◽  
Xiao-Fei Liu

Kaempferia galanga and Kaempferia elegans, which belong to the genus Kaempferia of the family Zingiberaceae, are used as a valuable herbal medicine and as an ornamental plant, respectively. Chloroplast genomes have been used for molecular markers, species identification and phylogenetic studies. In this study, the complete chloroplast genome sequences of K. galanga and K. elegans are reported. Results show that the complete chloroplast genome of K. galanga is 163,811 bp long, having a quadripartite structure with a large single copy (LSC) of 88,405 bp and a small single copy (SSC) of 15,812 bp separated by inverted repeats (IRs) of 29,797 bp. Similarly, the complete chloroplast genome of K. elegans is 163,555 bp long, having a quadripartite structure in which IRs of 29,773 bp separate an LSC of 88,020 bp and an SSC of 15,989 bp. A total of 111 genes in K. galanga and 113 genes in K. elegans comprised 79 protein-coding genes and 4 ribosomal RNA (rRNA) genes, as well as 28 and 30 transfer RNA (tRNA) genes in K. galanga and K. elegans, respectively. The gene order, GC content and orientation of the two Kaempferia chloroplast genomes exhibited high similarity. The location and distribution of simple sequence repeats (SSRs) and long repeat sequences were determined. Eight highly variable regions between the two Kaempferia species were identified and 643 mutation events, including 536 single-nucleotide polymorphisms (SNPs) and 107 insertion/deletions (indels), were accurately located. Sequence divergences of the whole chloroplast genomes were calculated among related Zingiberaceae species. The phylogenetic analysis based on SNPs among eleven species strongly supported that K. galanga and K. elegans formed a cluster within Zingiberaceae. This study identified the unique characteristics of the entire K. galanga and K. elegans chloroplast genomes that contribute to our understanding of the chloroplast DNA evolution within Zingiberaceae species. It provides valuable information for phylogenetic analysis and species identification within genus Kaempferia.
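
The region sizes quoted in this abstract can be cross-checked with simple arithmetic: in a quadripartite chloroplast genome the total length should equal LSC + SSC + 2 × IR, since the inverted repeat is present in two copies. The short Python sketch below is only an illustration built from the figures stated above; the function name and dictionary layout are hypothetical and are not part of the published study.

```python
def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length of a quadripartite plastome: LSC + SSC + two IR copies."""
    return lsc + ssc + 2 * ir


# Region sizes (bp) and counts as quoted in the abstract above.
genomes = {
    "K. galanga": {"total": 163_811, "lsc": 88_405, "ssc": 15_812, "ir": 29_797,
                   "protein": 79, "rrna": 4, "trna": 28, "genes": 111},
    "K. elegans": {"total": 163_555, "lsc": 88_020, "ssc": 15_989, "ir": 29_773,
                   "protein": 79, "rrna": 4, "trna": 30, "genes": 113},
}

for name, g in genomes.items():
    length_ok = quadripartite_length(g["lsc"], g["ssc"], g["ir"]) == g["total"]
    genes_ok = g["protein"] + g["rrna"] + g["trna"] == g["genes"]
    print(f"{name}: length consistent={length_ok}, gene count consistent={genes_ok}")

# The mutation events are reported as SNPs plus indels.
snps, indels, mutation_events = 536, 107, 643
print("mutation events consistent:", snps + indels == mutation_events)
```

With the reported numbers, all three checks are internally consistent for both species.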


2018 ◽  
Vol 2018 ◽  
pp. 1-11 ◽  
Author(s):  
Junling Cao ◽  
Dan Jiang ◽  
Zhenyu Zhao ◽  
Subo Yuan ◽  
Yujun Zhang ◽  
...  

Chinese yam has been used both as a food and in traditional herbal medicine. Developing more effective genetic markers in this species is necessary to assess its genetic diversity and perform cultivar identification. In this study, new chloroplast genomic resources were developed using whole chloroplast genomes from six genotypes originating from different geographical locations. The Dioscorea polystachya chloroplast genome is a circular molecule consisting of two single-copy regions separated by a pair of inverted repeats. Comparative analyses of six D. polystachya chloroplast genomes revealed 141 single nucleotide polymorphisms (SNPs). Seventy simple sequence repeats (SSRs) were found in the six genotypes, including 24 polymorphic SSRs. Forty-three common indels and five small inversions were detected. Phylogenetic analysis based on the complete chloroplast genome provided the best resolution among the genotypes. Our evaluation of chloroplast genome resources among these genotypes led us to consider the complete chloroplast genome sequence of D. polystachya as a source of reliable and valuable molecular markers for revealing biogeographical structure and the extent of genetic variation in wild populations and for identifying different cultivars.


2019 ◽  
Vol 20 (23) ◽  
pp. 5940
Author(s):  
Xinbo Pang ◽  
Hongshan Liu ◽  
Suran Wu ◽  
Yangchen Yuan ◽  
Haijun Li ◽  
...  

Species identification of oaks (Quercus) is always a challenge because many species exhibit variable phenotypes that overlap with other species. Oaks are notorious for interspecific hybridization and introgression, and for complex speciation patterns involving incomplete lineage sorting. Therefore, attempts to identify accurate DNA barcodes for Quercus species have been unsuccessful. In this study, we used chloroplast genome sequence data to identify molecular markers for oak species identification. Using next-generation sequencing methods, we sequenced 14 chloroplast genomes of Quercus species and added 10 chloroplast genome sequences from GenBank to develop a DNA barcode for oaks. Chloroplast genome sequence divergence was low. We identified four mutation hotspots as candidate Quercus DNA barcodes; two intergenic regions (matK-trnK-rps16 and trnR-atpA) were located in the large single-copy region, and two coding regions (ndhF and ycf1b) were located in the small single-copy region. The standard plant DNA barcodes (rbcL and matK) had lower variability than the newly identified markers. Our data provide complete chloroplast genome sequences that improve the phylogenetic resolution and species-level discrimination of Quercus. This study demonstrates that the complete chloroplast genome can substantially increase species discriminatory power and resolve phylogenetic relationships in plants.


Author(s):  
Matias Köhler ◽  
Marcelo Reginato ◽  
Tatiana T. Souza-Chies ◽  
Lucas C. Majure

Chloroplast genomes (plastomes) are frequently treated as highly conserved among land plants. However, many lineages of vascular plants have experienced extensive structural rearrangements, including inversions and modifications to the size and content of genes. Cacti are one of these lineages, containing the smallest plastome known for an obligately photosynthetic angiosperm, including the loss of one copy of the inverted repeat (∼25 kb) and the ndh gene suite, but only a few cacti from the subfamily Cactoideae have been sufficiently characterized. Here, we investigated the variation of plastome sequences across the second-major lineage of the Cactaceae, the subfamily Opuntioideae, to address 1) how variable the content and arrangement of chloroplast genome sequences are across the subfamily, and 2) how phylogenetically informative the plastome sequences are for resolving major relationships among the clades of Opuntioideae. Our de novo assembly of the Opuntia quimilo plastome recovered an organelle of 150,347 bp in length with both copies of the inverted repeats and the complete ndh gene suite. An expansion of the large single copy unit and a reduction of the small single copy were observed, including translocations and inversions of genes as well as the putative pseudogenization of numerous loci. Comparative analyses among all clades within Opuntioideae suggested that plastome structure and content vary across taxa of this subfamily, with putative independent losses of the ndh gene suite and pseudogenization of genes across disparate lineages, further demonstrating the dynamic nature of plastomes in Cactaceae. Our plastome dataset was robust in determining relationships among major clades and subclades within Opuntioideae, resolving three tribes with high support: Cylindropuntieae, Tephrocacteae and Opuntieae. A plastome-wide survey for highly informative phylogenetic markers revealed previously unused regions for future use in Sanger-based studies, presenting a valuable dataset with primers designed for continued evolutionary studies across Cactaceae. These results bring new insights into the evolution of plastomes in cacti, suggesting that further analyses should be carried out to address how ecological drivers, physiological constraints and morphological traits of cacti may be related to the common rearrangements in plastomes that have been reported across the family.


Diversity ◽  
2021 ◽  
Vol 13 (9) ◽  
pp. 403
Author(s):  
Umar Rehman ◽  
Nighat Sultana ◽  
Abdullah ◽  
Abbas Jamal ◽  
Maryam Muzaffar ◽  
...  

Family Phyllanthaceae belongs to the eudicot order Malpighiales, and its species are herbs, shrubs, and trees that are mostly distributed in tropical regions. Here, we elucidate the molecular evolution of the chloroplast genome in Phyllanthaceae and identify the polymorphic loci for phylogenetic inference. We de novo assembled the chloroplast genomes of three Phyllanthaceae species, i.e., Phyllanthus emblica, Flueggea virosa, and Leptopus cordifolius, and compared them with six other previously reported genomes. All species comprised two inverted repeat regions (size range 23,921–27,128 bp) that separated large single-copy (83,627–89,932 bp) and small single-copy (17,424–19,441 bp) regions. Chloroplast genomes contained 111–112 unique genes, including 77–78 protein-coding, 30 tRNAs, and 4 rRNAs. The deletion/pseudogenization of rps16 genes was found in only two species. High variability was seen in the number of oligonucleotide repeats, while guanine-cytosine contents, codon usage, amino acid frequency, simple sequence repeats, synonymous and non-synonymous substitutions, and transition and transversion substitutions were similar. The transition substitutions were higher in coding sequences than in non-coding sequences. Phylogenetic analysis revealed the polyphyletic nature of the genus Phyllanthus. The polymorphic protein-coding genes, including rpl22, ycf1, matK, ndhF, and rps15, were also determined, which may be helpful for reconstructing the high-resolution phylogenetic tree of the family Phyllanthaceae. Overall, the study provides insight into the chloroplast genome evolution in Phyllanthaceae.


2019 ◽  
Vol 20 (22) ◽  
pp. 5812
Author(s):  
Liping Nie ◽  
Yingxian Cui ◽  
Liwei Wu ◽  
Jianguo Zhou ◽  
Zhichao Xu ◽  
...  

Macrosolen plants are parasitic shrubs, several of which are important medicinal plants used as folk medicine in some provinces of China. However, reports on Macrosolen are limited. In this study, the complete chloroplast genome sequences of Macrosolen cochinchinensis, Macrosolen tricolor and Macrosolen bibracteolatus are reported. The chloroplast genomes were sequenced on the Illumina HiSeq X platform. The length of the chloroplast genomes ranged from 129,570 bp (M. cochinchinensis) to 126,621 bp (M. tricolor), with a total of 113 genes, including 35 tRNA genes, eight rRNA genes, 68 protein-coding genes, and two pseudogenes (ycf1 and rpl2). The simple sequence repeats mainly comprise A/T mononucleotide repeats. Comparative genome analyses of the three species detected the most divergent regions in the non-coding spacers. Phylogenetic analyses using maximum parsimony and maximum likelihood strongly supported the idea that Loranthaceae and Viscaceae are monophyletic clades. The data obtained in this study are beneficial for further investigations of Macrosolen with respect to evolution and molecular identification.
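
As a rough illustration of what a mononucleotide SSR survey involves, the Python sketch below scans a sequence for runs of a single repeated base. It is a minimal sketch and not the pipeline used in the study: the function name is hypothetical, the minimum run length of 10 is an assumed threshold (the abstract does not state one), and the toy sequence is invented for demonstration.

```python
import re


def find_mono_ssrs(seq: str, min_repeats: int = 10):
    """Return (start, base, length) for each single-base run of at least min_repeats.

    min_repeats=10 is an assumed threshold, not one taken from the abstract.
    """
    # One base followed by at least (min_repeats - 1) copies of the same base.
    pattern = re.compile(r"([ACGT])\1{%d,}" % (min_repeats - 1))
    return [(m.start(), m.group(1), len(m.group(0)))
            for m in pattern.finditer(seq.upper())]


# Toy sequence (invented): a 12-base A run, a 9-base T run, and a 10-base T run.
toy = "GC" + "A" * 12 + "CG" + "T" * 9 + "G" + "T" * 10
ssrs = find_mono_ssrs(toy)
at_share = sum(1 for _, base, _ in ssrs if base in "AT") / len(ssrs)
print(ssrs, at_share)
```

On the toy sequence the 9-base T run falls below the assumed threshold and is skipped, so only the two longer runs are reported.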


Forests ◽  
2021 ◽  
Vol 12 (2) ◽  
pp. 180
Author(s):  
Bagdevi Mishra ◽  
Bartosz Ulaszewski ◽  
Sebastian Ploch ◽  
Jaroslaw Burczyk ◽  
Marco Thines

Chloroplast genomes are difficult to assemble because of the presence of large inverted repeats. At the same time, correct assemblies are important, as chloroplast loci are frequently used for biogeography and population genetics studies. In an attempt to elucidate the orientation of the single-copy regions and to find suitable loci for chloroplast single nucleotide polymorphism (SNP)-based studies, circular chloroplast sequences for the ultra-centenary reference individual of European Beech (Fagus sylvatica), Bhaga, and an additional Polish individual (named Jamy) were obtained based on hybrid assemblies. The chloroplast genome of Bhaga was 158,458 bp long, and that of Jamy was 158,462 bp long. Using long-read mapping on the configuration inferred in this study and the one suggested in a previous study, we found an inverted orientation of the small single-copy region. Compared to the previously published genome, the chloroplast genomes of Bhaga and of the individual from Poland both have only two mismatches, as well as three and two indels, respectively. The low divergence suggests low seed dispersal but high pollen dispersal. However, once chloroplast genomes become available from Pleistocene refugia, where a high degree of variation has been reported, they might prove useful for tracing the migration history of Fagus sylvatica in the Holocene.


Phytotaxa ◽  
2021 ◽  
Vol 500 (3) ◽  
pp. 241-247
Author(s):  
HUI-FENG WANG ◽  
ZHENG-FENG WANG ◽  
QIAO-MEI QIN ◽  
HONG-LIN CAO ◽  
XIAO-MING GUO

Tigridiopalma longmenensis, a new species from Guangdong, China, is described. This species differs from its ally, T. magnifica, by the polychasium consisting of scorpioid cymes, hypanthium with carinas on angles, and longer stamens with a conspicuously white or pink spur at the connective base of the anther. A diagnosis and a distribution map of the two species are also provided. The complete chloroplast genome of T. longmenensis is reported here. Phylogenetic analyses based on complete chloroplast genomes from T. longmenensis and 15 other Melastomataceae species indicated that T. longmenensis is sister to T. magnifica. With the discovery of T. longmenensis, Tigridiopalma is no longer a monotypic genus.


Molecules ◽  
2018 ◽  
Vol 23 (9) ◽  
pp. 2165 ◽  
Author(s):  
Xiao Zhang ◽  
Tao Zhou ◽  
Jia Yang ◽  
Jingjing Sun ◽  
Miaomiao Ju ◽  
...  

Cucurbitaceae is the fourth most important economic plant family with creeping herbaceous species mainly distributed in tropical and subtropical regions. Here, we described and compared the complete chloroplast genome sequences of ten representative species from Cucurbitaceae. The lengths of the ten complete chloroplast genomes ranged from 155,293 bp (C. sativus) to 158,844 bp (M. charantia), and they shared the most common genomic features. A total of 618 repeats of three categories and 813 microsatellites were found. Sequence divergence analysis showed that the coding and IR regions were highly conserved. Three protein-coding genes (accD, clpP, and matK) were under selection, and their encoded proteins often have functions in chloroplast protein synthesis, gene transcription, energy transformation, and plant development. An unconventional translation initiation codon of the psbL gene was found and provided evidence for RNA editing. Applying BI and ML methods, phylogenetic analysis strongly supported the position of Gomphogyne, Hemsleya, and Gynostemma as the relatively original lineage in Cucurbitaceae. This study suggested that the complete chloroplast genome sequences were useful for phylogenetic studies. It may also help determine potential molecular markers and candidate DNA barcodes for future studies and enrich the valuable complete chloroplast genome resources of Cucurbitaceae.

