scholarly journals Comparative analysis of the complete chloroplast genome sequences of six species of Pulsatilla Miller, Ranunculaceae

2019 ◽  
Vol 14 (1) ◽  
Author(s):  
Tingting Zhang ◽  
Yanping Xing ◽  
Liang Xu ◽  
Guihua Bao ◽  
Zhilai Zhan ◽  
...  

Abstract Background Baitouweng is a traditional Chinese medicine with a long history of different applications. Although referred to as a single medicine, Baitouweng is actually comprised of many closely related species. It is therefore critically important to identify the different species that are utilized in these medicinal applications. Knowledge about their phylogenetic relationships can be derived from their chloroplast genomes and may provide additional insights into development of molecular markers. Methods Genomic DNA was extracted from six species of Pulsatilla and then sequenced on an Illumina HiSeq 4000. Sequences were assembled into contigs by SOAPdenovo 2.04, aligned to the reference genome using BLAST, and then manually corrected. Genome annotation was performed by the online DOGMA tool. General characteristics of the cp genomes of the six species were analyzed and compared with closely related species. Additionally, phylogenetic trees were constructed, based on single nucleotide polymorphisms (SNPs) and 51 shared protein-coding gene sequences in the cp genome among all 31 species via maximum likelihood. Results The size of cp genomes of P. chinensis (Bge.) Regel, P. chinensis (Bge.) Regel var. kissii (Mandl) S. H. Li et Y. H. Huang, P. cernua (Thunb.) Bercht. et Opiz f. plumbea J. X. Ji et Y. T. zhao, P. dahurica (Fisch.) Spreng, P. turczaninovii Kryl. et Serg, and P. cernua (Thunb.) Bercht. et Opiz. were 163,851 bp, 163,756 bp, 162,481 bp, 162,450 bp, 162,795 bp, and 162,924 bp, respectively. Each species included two inverted repeat regions, a small single-copy region, and a large single-copy region. A total of 134 genes were annotated, including 90 protein-coding genes, 36 tRNAs, and eight rRNAs across all species. In simple sequence repeat analysis, only P. dahurica was found to contain hexanucleotide repeats. A total of 26, 39, 32, 37, 32 and 43 large repeat sequences were identified in the genic regions of the six Pulsatilla species. Nucleotide diversity analysis revealed that the rpl36 gene and ccsA-ndhD region have the highest Pi value. In addition, two phylogenetic trees of the cp genomes were constructed, which laced all Pulsatilla species into one branch within Ranunculaceae. Conclusions We identified and analyzed the cp genome features of six species of P. Miller, with implications for species identification and phylogenetic analysis.

2021 ◽  
Vol 51 (3) ◽  
pp. 326-331
Author(s):  
Sung-Dug OH ◽  
Seong-Kon LEE ◽  
Doh-Won YUN ◽  
Hyeon-Jin SUN ◽  
Hong-Gyu KANG ◽  
...  

The complete chloroplast genome of Zoysia macrostachya Franch. & Sav. isolated in Korea is 135,902 bp long (GC ratio is 38.4%) and has four subregions; 81,546 bp of large single-copy (36.3%) and 12,586 bp of small single-copy (32.7%) regions are separated by 20,885 bp of inverted repeat (44.1%) regions, including 130 genes (83 protein-coding genes, eight rRNAs, and 39 tRNAs). Thirty-nine single nucleotide polymorphisms and 11 insertions and deletion (INDEL) regions were identified from two Z. macrostachya chloroplast genomes, the smallest among other Zoysia species. Phylogenetic trees show that two Z. macrostachya chloroplast genomes are clustered into a single clade. However, we found some incongruency with regard to the phylogenetic position of the Z. macrostachya clade. Our chloroplast genome provides insights into intraspecific variations and species delimitation issues pertaining to the Zoysia species.


Molecules ◽  
2018 ◽  
Vol 23 (9) ◽  
pp. 2137 ◽  
Author(s):  
Xiang-Xiao Meng ◽  
Yan-Fang Xian ◽  
Li Xiang ◽  
Dong Zhang ◽  
Yu-Hua Shi ◽  
...  

The genus Sanguisorba, which contains about 30 species around the world and seven species in China, is the source of the medicinal plant Sanguisorba officinalis, which is commonly used as a hemostatic agent as well as to treat burns and scalds. Here we report the complete chloroplast (cp) genome sequences of four Sanguisorba species (S. officinalis, S. filiformis, S. stipulata, and S. tenuifolia var. alba). These four Sanguisorba cp genomes exhibit typical quadripartite and circular structures, and are 154,282 to 155,479 bp in length, consisting of large single-copy regions (LSC; 84,405–85,557 bp), small single-copy regions (SSC; 18,550–18,768 bp), and a pair of inverted repeats (IRs; 25,576–25,615 bp). The average GC content was ~37.24%. The four Sanguisorba cp genomes harbored 112 different genes arranged in the same order; these identical sections include 78 protein-coding genes, 30 tRNA genes, and four rRNA genes, if duplicated genes in IR regions are counted only once. A total of 39–53 long repeats and 79–91 simple sequence repeats (SSRs) were identified in the four Sanguisorba cp genomes, which provides opportunities for future studies of the population genetics of Sanguisorba medicinal plants. A phylogenetic analysis using the maximum parsimony (MP) method strongly supports a close relationship between S. officinalis and S. tenuifolia var. alba, followed by S. stipulata, and finally S. filiformis. The availability of these cp genomes provides valuable genetic information for future studies of Sanguisorba identification and provides insights into the evolution of the genus Sanguisorba.


2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Qiu-jie Li ◽  
Na Su ◽  
Ling Zhang ◽  
Ru-chang Tong ◽  
Xiao-hui Zhang ◽  
...  

AbstractPulsatilla (Ranunculaceae) consists of about 40 species, and many of them have horticultural and/or medicinal value. However, it is difficult to recognize and identify wild Pulsatilla species. Universal molecular markers have been used to identify these species, but insufficient phylogenetic signal was available. Here, we compared the complete chloroplast genomes of seven Pulsatilla species. The chloroplast genomes of Pulsatilla were very similar and their length ranges from 161,501 to 162,669 bp. Eight highly variable regions and potential sources of molecular markers such as simple sequence repeats, large repeat sequences, and single nucleotide polymorphisms were identified, which are valuable for studies of infra- and inter-specific genetic diversity. The SNP number differentiating any two Pulsatilla chloroplast genomes ranged from 112 to 1214, and provided sufficient data for species delimitation. Phylogenetic trees based on different data sets were consistent with one another, with the IR, SSC regions and the barcode combination rbcL + matK + trnH-psbA produced slightly different results. Phylogenetic relationships within Pulsatilla were certainly resolved using the complete cp genome sequences. Overall, this study provides plentiful chloroplast genomic resources, which will be helpful to identify members of this taxonomically challenging group in further investigation.


2021 ◽  
Vol 51 (3) ◽  
pp. 337-344
Author(s):  
Yongsung KIM ◽  
Hong XI ◽  
Jongsun PARK

The chloroplast genome of Limonium tetragonum (Thunb.) Bullock, a halophytic species, was sequenced to understand genetic differences based on its geographical distribution. The cp genome of L. tetragonum was 154,689 bp long (GC ratio is 37.0%) and has four subregions: 84,572 bp of large single-copy (35.3%) and 12,813 bp of small singlecopy (31.5%) regions were separated by 28,562 bp of inverted repeat (40.9%) regions. It contained 128 genes (83 proteincoding genes, eight rRNAs, and 37 tRNAs). Thirty-five single-nucleotide polymorphisms and 33 INDEL regions (88 bp in length) were identified. Maximum-likelihood and Bayesian inference phylogenetic trees showed that L. tetragonum formed a sister group with L. aureum, which is incongruent with certain previous studies, including a phylogenetic analysis.


Plants ◽  
2020 ◽  
Vol 9 (8) ◽  
pp. 979
Author(s):  
Millicent Akinyi Oulo ◽  
Jia-Xin Yang ◽  
Xiang Dong ◽  
Vincent Okelo Wanga ◽  
Elijah Mbandi Mkala ◽  
...  

Rhipsalis baccifera is the only cactus that naturally occurs in both the New World and the Old World, and has thus drawn the attention of most researchers. The complete chloroplast (cp) genome of R. baccifera is reported here for the first time. The cp genome of R. baccifera has 122, 333 base pairs (bp), with a large single-copy (LSC) region (81,459 bp), SSC (23,531 bp) and two inverted repeat (IR) regions each 8530 bp. The genome contains 110 genes, with 73 protein-coding genes, 31 tRNAs, 4 rRNAs and 2 pseudogenes. Twelve genes have introns, with loss of introns being observed in, rpoc1clpP and rps12 genes. 49 repeat sequences and 62 simple sequence repeats (SSRs) were found in the genome. Comparative analysis with eight species of the ACPT (Anacampserotaceae, Cactaceae, Portulacaceae, and Talinaceae) clade of the suborder Portulacineae species, showed that R. baccifera genome has higher number of rearrangements, with a 19 gene inversion in its LSC region representing the most significant structural change in terms of its size. Inversion of the SSC region seems common in subfamily Cactoideae, and another 6 kb gene inversion between rbcL- trnM was observed in R. baccifera and Carnegiea gigantea. The IRs of R. baccifera are contracted. The phylogenetic analysis among 36 complete chloroplast genomes of Caryophyllales species and two outgroup species supported monophyly of the families of the ACPT clade. R. baccifera occupied a basal position of the family Cactaceae clade in the tree. A high number of rearrangements in this cp genome suggests a larger number mutation events in the history of evolution of R. baccifera. These results provide important tools for future work on R. baccifera and in the evolutionary studies of the suborder Portulacineae.


Author(s):  
Umar Rehman ◽  
Nighat Sultana ◽  
Abdullah . ◽  
Abbas Jamal ◽  
Maryam Muzaffar ◽  
...  

Family Phyllanthaceae is one of the largest segregates of the eudicot order Malpighiales and its species are herb, shrub, and tree, which are mostly distributed in tropical regions. Certain taxonomic discrepancies exist at genus and family level. Here, we report chloroplast genomes of three Phyllanthaceae species—Phyllanthus emblica, Flueggea virosa, and Leptopus cordifolius— and compare them with six others previously reported Phyllanthaceae chloroplast genomes. The species of Phyllanthaceae displayed quadripartite structure, comprising inverted repeat regions (IRa and IRb) that separate large single copy (LSC) and small single copy (SSC) regions. The length of complete chloroplast genome ranged from 154,707 bp to 161,093 bp; LSC from 83,627 bp to 89,932 bp; IRs from 23,921 bp to 27,128 bp; and SSC from 17,424 bp to 19,441 bp. Chloroplast genomes contained 111 to 112 unique genes, including 77 to 78 protein-coding, 30 transfer RNA (tRNA), and 4 ribosomal RNA (rRNA) that showed similarities in arrangement. The number of protein-coding genes varied due to deletion/pseudogenization of rps16 genes in Baccaurea ramiflora and Leptopus cordifolius. High variability was seen in number of oligonucleotide repeats while analysis of guanine-cytosine (GC) content, codon usage, amino acid frequency, simple sequence repeats analysis, synonymous and non-synonymous substitutions, and transition and transversion substitutions showed similarities in all Phyllanthaceae species. We detected a higher number of transition substitutions in the coding sequences than non-coding sequences. Moreover, the high number of transition substitutions was determined among the distantly related species in comparison to closely related species. Phylogenetic analysis shows the polyphyletic nature of the genus Phyllanthus which requires further verification. We also determined suitable polymorphic coding genes, including rpl22, ycf1, matK, ndhF, and rps15 which may be helpful for the reconstruction of the high-resolution phylogenetic tree of the family Phyllanthaceae using a large number of species in the future. Overall, the current study provides insight into chloroplast genome evolution in Phyllanthaceae.


2020 ◽  
Vol 2020 ◽  
pp. 1-13 ◽  
Author(s):  
Lu Wang ◽  
Na He ◽  
Yao Li ◽  
Yanming Fang ◽  
Feilong Zhang

Chinese lacquer tree (Toxicodendron vernicifluum) is an important commercial arbor species widely cultivated in East Asia for producing highly durable lacquer. Here, we sequenced and analyzed the complete chloroplast (cp) genome of T. vernicifluum and reconstructed the phylogeny of Sapindales based on 52 cp genomes of six families. The plastome of T. vernicifluum is 159,571 bp in length, including a pair of inverted repeats (IRs) of 26,511 bp, separated by a large single-copy (LSC) region of 87,475 bp and a small single-copy (SSC) region of 19,074 bp. A total of 126 genes were identified, of which 81 are protein-coding genes, 37 are transfer RNA genes, and eight are ribosomal RNA genes. Forty-nine mononucleotide microsatellites, one dinucleotide microsatellite, two complex microsatellites, and 49 long repeats were determined. Structural differences such as inversion variation in LSC and gene loss in IR were detected across cp genomes of the six genera in Anacardiaceae. Phylogenetic analyses revealed that the genus Toxicodendron is closely related to Pistacia and Rhus. The phylogenetic relationships of the six families in Sapindales were well resolved. Overall, this study providing complete cp genome resources will be beneficial for determining potential molecular markers and evolutionary patterns of T. vernicifluum and its closely related species.


2020 ◽  
Vol 11 ◽  
Author(s):  
Peninah Cheptoo Rono ◽  
Xiang Dong ◽  
Jia-Xin Yang ◽  
Fredrick Munyao Mutie ◽  
Millicent A. Oulo ◽  
...  

The genus Alchemilla L., known for its medicinal and ornamental value, is widely distributed in the Holarctic regions with a few species found in Asia and Africa. Delimitation of species within Alchemilla is difficult due to hybridization, autonomous apomixes, and polyploidy, necessitating efficient molecular-based characterization. Herein, we report the initial complete chloroplast (cp) genomes of Alchemilla. The cp genomes of two African (Afromilla) species Alchemilla pedata and Alchemilla argyrophylla were sequenced, and phylogenetic and comparative analyses were conducted in the family Rosaceae. The cp genomes mapped a typical circular quadripartite structure of lengths 152,438 and 152,427 base pairs (bp) in A. pedata and A. argyrophylla, respectively. Alchemilla cp genomes were composed of a pair of inverted repeat regions (IRa/IRb) of length 25,923 and 25,915 bp, separating the small single copy (SSC) region of 17,980 and 17,981 bp and a large single copy (LSC) region of 82,612 and 82,616 bp in A. pedata and A. argyrophylla, respectively. The cp genomes encoded 114 unique genes including 88 protein-coding genes, 37 transfer RNA (tRNA) genes, and 4 ribosomal RNA (rRNA) genes. Additionally, 88 and 95 simple sequence repeats (SSRs) and 37 and 40 tandem repeats were identified in A. pedata and A. argyrophylla, respectively. Significantly, the loss of group II intron in atpF gene in Alchemilla species was detected. Phylogenetic analysis based on 26 whole cp genome sequences and 78 protein-coding gene sequences of 27 Rosaceae species revealed a monophyletic clustering of Alchemilla nested within subfamily Rosoideae. Based on a protein-coding region, negative selective pressure (Ka/Ks < 1) was detected with an average Ka/Ks value of 0.1322 in A. argyrophylla and 0.1418 in A. pedata. The availability of complete cp genome in the genus Alchemilla will contribute to species delineation and further phylogenetic and evolutionary studies in the family Rosaceae.


2021 ◽  
Vol 51 (4) ◽  
pp. 353-362
Author(s):  
Mi-Hee KIM ◽  
Suhyeon PARK ◽  
Junho LEE ◽  
Jinwook BAEK ◽  
Jongsun PARK ◽  
...  

The chloroplast genome of Glycyrrhiza uralensis Fisch was sequenced to investigate intraspecific variations on the chloroplast genome. Its length is 127,689 bp long (34.3% GC ratio) with atypical structure of chloroplast genome, which is congruent to those of Glycyrrhiza genus. It includes 110 genes (76 protein-coding genes, four rRNAs, and 30 tRNAs). Intronic region of ndhA presented the highest nucleotide diversity based on the six G. uralenesis chloroplast genomes. A total of 150 single nucleotide polymorphisms and 10 insertion and deletion (INDEL) regions were identified from the six G. uralensis chloroplast genomes. Phylogenetic trees show that the six chloroplast genomes of G. uralensis formed the two clades, requiring additional studies to understand it.


2021 ◽  
Vol 11 ◽  
Author(s):  
Yongtan Li ◽  
Yan Dong ◽  
Yichao Liu ◽  
Xiaoyue Yu ◽  
Minsheng Yang ◽  
...  

In this study, we assembled and annotated the chloroplast (cp) genome of the Euonymus species Euonymus fortunei, Euonymus phellomanus, and Euonymus maackii, and performed a series of analyses to investigate gene structure, GC content, sequence alignment, and nucleic acid diversity, with the objectives of identifying positive selection genes and understanding evolutionary relationships. The results indicated that the Euonymus cp genome was 156,860–157,611bp in length and exhibited a typical circular tetrad structure. Similar to the majority of angiosperm chloroplast genomes, the results yielded a large single-copy region (LSC) (85,826–86,299bp) and a small single-copy region (SSC) (18,319–18,536bp), separated by a pair of sequences (IRA and IRB; 26,341–26,700bp) with the same encoding but in opposite directions. The chloroplast genome was annotated to 130–131 genes, including 85–86 protein coding genes, 37 tRNA genes, and eight rRNA genes, with GC contents of 37.26–37.31%. The GC content was variable among regions and was highest in the inverted repeat (IR) region. The IR boundary of Euonymus happened expanding resulting that the rps19 entered into IR region and doubled completely. Such fluctuations at the border positions might be helpful in determining evolutionary relationships among Euonymus. The simple-sequence repeats (SSRs) of Euonymus species were composed primarily of single nucleotides (A)n and (T)n, and were mostly 10–12bp in length, with an obvious A/T bias. We identified several loci with suitable polymorphism with the potential use as molecular markers for inferring the phylogeny within the genus Euonymus. Signatures of positive selection were seen in rpoB protein encoding genes. Based on data from the whole chloroplast genome, common single copy genes, and the LSC, SSC, and IR regions, we constructed an evolutionary tree of Euonymus and related species, the results of which were consistent with traditional taxonomic classifications. It showed that E. fortunei sister to the Euonymus japonicus, whereby E. maackii appeared as sister to Euonymus hamiltonianus. Our study provides important genetic information to support further investigations into the phylogenetic development and adaptive evolution of Euonymus species.


Sign in / Sign up

Export Citation Format

Share Document