Comparative analysis of four Zantedeschia chloroplast genomes: expansion and contraction of the IR region, phylogenetic analyses and SSR genetic diversity assessment

PeerJ ◽  
2020 ◽  
Vol 8 ◽  
pp. e9132
Author(s):  
Shuilian He ◽  
Yang Yang ◽  
Ziwei Li ◽  
Xuejiao Wang ◽  
Yanbing Guo ◽  
...  

The horticulturally important genus Zantedeschia (Araceae) comprises eight species of herbaceous perennials. We sequenced, assembled and analyzed the chloroplast (cp) genomes of four species of Zantedeschia (Z. aethiopica, Z. odorata, Z. elliottiana, and Z. rehmannii) to investigate the structure of the cp genome in the genus. According to our results, the cp genome of Zantedeschia ranges in size from 169,065 bp (Z. aethiopica) to 175,906 bp (Z. elliottiana). We identified a total of 112 unique genes, including 78 protein-coding genes, 30 transfer RNA (tRNA) genes and four ribosomal RNA (rRNA) genes. Comparison of our results with cp genomes from other species in the Araceae suggests that the relatively large sizes of the Zantedeschia cp genomes may result from expansion of the inverted repeat (IR) regions. The sampled Zantedeschia species formed a monophyletic clade in our phylogenetic analysis. Furthermore, the large single copy (LSC) and small single copy (SSC) regions in Zantedeschia are more divergent than the IR regions, and non-coding regions generally showed higher divergence than coding regions. We identified a total of 410 cpSSR sites in the four Zantedeschia species studied. Genetic diversity analyses based on four polymorphic SSR markers in 134 cultivars of Zantedeschia suggested that the cultivars harbor high genetic diversity (I = 0.934; Ne = 2.371). The high polymorphism of the cpSSR regions suggests that cpSSRs could be an effective tool for genetic diversity assessment and identification of Zantedeschia varieties.
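The abstract reports diversity as Shannon's information index (I) and the effective number of alleles (Ne) without giving formulas. The sketch below assumes the commonly used definitions I = -Σ p ln p and Ne = 1 / Σ p², applied to the allele frequencies of a single locus. The function names and the example frequencies are hypothetical and are not taken from the study.

```python
import math

def shannon_index(freqs):
    """Shannon's information index I = -sum(p * ln p) over alleles with p > 0."""
    return -sum(p * math.log(p) for p in freqs if p > 0)

def effective_alleles(freqs):
    """Effective number of alleles Ne = 1 / sum(p^2)."""
    return 1.0 / sum(p * p for p in freqs)

# Hypothetical allele frequencies at one SSR locus (illustration only).
example_freqs = [0.5, 0.3, 0.2]
locus_I = shannon_index(example_freqs)
locus_Ne = effective_alleles(example_freqs)
print(f"I = {locus_I:.3f}, Ne = {locus_Ne:.3f}")
```

Under these definitions, a locus with k equally frequent alleles gives Ne = k and I = ln k, so both values rise as alleles become more numerous and more evenly distributed.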

Molecules ◽  
2018 ◽  
Vol 23 (9) ◽  
pp. 2137 ◽  
Author(s):  
Xiang-Xiao Meng ◽  
Yan-Fang Xian ◽  
Li Xiang ◽  
Dong Zhang ◽  
Yu-Hua Shi ◽  
...  

The genus Sanguisorba, which contains about 30 species around the world and seven species in China, is the source of the medicinal plant Sanguisorba officinalis, which is commonly used as a hemostatic agent as well as to treat burns and scalds. Here we report the complete chloroplast (cp) genome sequences of four Sanguisorba species (S. officinalis, S. filiformis, S. stipulata, and S. tenuifolia var. alba). These four Sanguisorba cp genomes exhibit typical quadripartite and circular structures, and are 154,282 to 155,479 bp in length, consisting of large single-copy regions (LSC; 84,405–85,557 bp), small single-copy regions (SSC; 18,550–18,768 bp), and a pair of inverted repeats (IRs; 25,576–25,615 bp). The average GC content was ~37.24%. The four Sanguisorba cp genomes harbored 112 different genes arranged in the same order; these comprise 78 protein-coding genes, 30 tRNA genes, and four rRNA genes, if duplicated genes in IR regions are counted only once. A total of 39–53 long repeats and 79–91 simple sequence repeats (SSRs) were identified in the four Sanguisorba cp genomes, providing opportunities for future studies of the population genetics of Sanguisorba medicinal plants. A phylogenetic analysis using the maximum parsimony (MP) method strongly supports a close relationship between S. officinalis and S. tenuifolia var. alba, followed by S. stipulata, and finally S. filiformis. The availability of these cp genomes provides valuable genetic information for future studies of Sanguisorba identification and provides insights into the evolution of the genus Sanguisorba.


Plants ◽  
2020 ◽  
Vol 9 (3) ◽  
pp. 296 ◽  
Author(s):  
Jacinta N. Munyao ◽  
Xiang Dong ◽  
Jia-Xin Yang ◽  
Elijah M. Mbandi ◽  
Vincent O. Wanga ◽  
...  

The genus Chlorophytum includes many economically important species well known for their medicinal, ornamental, and horticultural value. However, to date, few molecular genomic resources have been reported for this genus. Consequently, knowledge of its phylogeny is limited, and the only available chloroplast (cp) genome of Chlorophytum (C. rhizopendulum) does not provide enough information on the genus. In this study, we present genomic resources for C. comosum and C. gallabatense, whose cp genomes were 154,248 and 154,154 base pairs (bp) in length, respectively. Each had a pair of inverted repeats (IRa and IRb) of 26,114 and 26,254 bp, separating the large single-copy (LSC) region of 84,004 and 83,686 bp from the small single-copy (SSC) region of 18,016 and 17,960 bp in C. comosum and C. gallabatense, respectively. There were 112 distinct genes in each cp genome, comprising 78 protein-coding genes, 30 tRNA genes, and four rRNA genes. The comparative analysis with five other selected species displayed a generally high level of sequence resemblance in structural organization, gene content, and arrangement. Additionally, the phylogenetic analysis confirmed the previous phylogeny and produced a phylogenetic tree with similar topology. It showed that the Chlorophytum species (C. comosum, C. gallabatense and C. rhizopendulum) were clustered together in the same clade with a closer relationship than other plants to Anthericum ramosum. This research, therefore, presents valuable records for further molecular evolutionary and phylogenetic studies, which help to fill the gap in genomic resources and resolve the taxonomic complexes of the genus.
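The region sizes reported for the two Chlorophytum genomes can be checked against the total lengths: in a quadripartite genome the total equals LSC + SSC + 2 × IR. The short sketch below runs that check with the figures from the abstract. The function and dictionary names are illustrative only.

```python
def quadripartite_length(lsc, ssc, ir, ir_copies=2):
    """Total genome length from region sizes: LSC + SSC + ir_copies * IR."""
    return lsc + ssc + ir_copies * ir

# Region sizes (bp) as reported in the abstract above.
chlorophytum = {
    "C. comosum": {"lsc": 84004, "ssc": 18016, "ir": 26114, "total": 154248},
    "C. gallabatense": {"lsc": 83686, "ssc": 17960, "ir": 26254, "total": 154154},
}

for species, r in chlorophytum.items():
    computed = quadripartite_length(r["lsc"], r["ssc"], r["ir"])
    # Report whether the sum of the regions matches the stated total.
    print(species, computed, computed == r["total"])
```

The `ir_copies` parameter is included so the same check can be applied to genomes that retain a single IR copy, by passing `ir_copies=1`.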


Plants ◽  
2019 ◽  
Vol 8 (10) ◽  
pp. 410 ◽  
Author(s):  
Xiaolei Yu ◽  
Wei Tan ◽  
Huanyu Zhang ◽  
Han Gao ◽  
Wenxiu Wang ◽  
...  

Ampelopsis humulifolia (A. humulifolia) and Ampelopsis japonica (A. japonica), which belong to the family Vitaceae, are valued as medicinal plants. Chloroplast (cp) genomes are recognized as a reliable source of data for marker selection and phylogenetic studies. Therefore, in this study we report the complete cp genome sequences of two Ampelopsis species. Results showed that the cp genomes of A. humulifolia and A. japonica were 161,724 and 161,430 bp in length, respectively, with 37.3% guanine-cytosine (GC) content. A total of 114 unique genes were identified in each cp genome, comprising 80 protein-coding genes, 30 tRNA genes, and 4 rRNA genes. We identified 95 and 99 simple sequence repeats (SSRs) in A. humulifolia and A. japonica, respectively. The location and distribution of long repeats in the two cp genomes were also identified. A highly divergent region, psbZ (Photosystem II reaction center protein Z)-trnG (tRNA-Glycine), was found and could serve as a potential marker for Vitaceae; the corresponding primers were designed. Additionally, phylogenetic analysis showed that Vitis was closer to Tetrastigma than to Ampelopsis. In general, this study provides valuable genetic resources for DNA barcoding marker identification and phylogenetic analyses of Ampelopsis.


Forests ◽  
2021 ◽  
Vol 12 (5) ◽  
pp. 608
Author(s):  
Sang-Chul Kim ◽  
Jei-Wan Lee ◽  
Byoung-Ki Choi

In the present study, chloroplast genome sequences of four species of Symplocos (S. chinensis for. pilosa, S. prunifolia, S. coreana, and S. tanakana) from South Korea were obtained by Ion Torrent sequencing and compared with the sequences of three previously reported Symplocos chloroplast genomes from different species. The length of the Symplocos chloroplast genome ranged from 156,961 to 157,365 bp. Overall, 132 genes including 87 functional genes, 37 tRNA genes, and eight rRNA genes were identified in all Symplocos chloroplast genomes. The gene order and contents were highly similar across the seven species. The coding regions were more conserved than the non-coding regions, and the large single-copy and small single-copy regions were less conserved than the inverted repeat regions. We identified five new hotspot regions (rbcL, ycf4, psaJ, rpl22, and ycf1) that can be used as barcodes or species-specific Symplocos molecular markers. These four novel chloroplast genomes provide basic information on the plastid genome of Symplocos and enable better taxonomic characterization of this genus.


PeerJ ◽  
2017 ◽  
Vol 5 ◽  
pp. e3919 ◽  
Author(s):  
Hui Cheng ◽  
Jinfeng Li ◽  
Hong Zhang ◽  
Binhua Cai ◽  
Zhihong Gao ◽  
...  

Compared with other members of the family Rosaceae, the chloroplast genomes of Fragaria species exhibit low variation, and this situation has limited phylogenetic analyses; thus, complete chloroplast genome sequencing of Fragaria species is needed. In this study, we sequenced the complete chloroplast genome of F. × ananassa ‘Benihoppe’ using the Illumina HiSeq 2500-PE150 platform and then performed a combination of de novo assembly and reference-guided mapping of contigs to generate complete chloroplast genome sequences. The chloroplast genome exhibits a typical quadripartite structure with a pair of inverted repeats (IRs, 25,936 bp) separated by large (LSC, 85,531 bp) and small (SSC, 18,146 bp) single-copy (SC) regions. The length of the F. × ananassa ‘Benihoppe’ chloroplast genome is 155,549 bp, representing the smallest Fragaria chloroplast genome observed to date. The genome encodes 112 unique genes, comprising 78 protein-coding genes, 30 tRNA genes and four rRNA genes. Comparative analysis of the overall nucleotide sequence identity among ten complete chloroplast genomes confirmed that for both coding and non-coding regions in Rosaceae, SC regions exhibit higher sequence variation than IRs. The Ka/Ks ratio of most genes was less than 1, suggesting that most genes are under purifying selection. Moreover, the mVISTA results also showed a high degree of conservation in genome structure, gene order and gene content in Fragaria, particularly among three octoploid strawberries, namely F. × ananassa ‘Benihoppe’, F. chiloensis (GP33) and F. virginiana (O477). However, when the sequences of the coding and non-coding regions of F. × ananassa ‘Benihoppe’ were compared in detail with those of F. chiloensis (GP33) and F. virginiana (O477), a number of SNPs and InDels were revealed by MEGA 7. Six non-coding regions (trnK-matK, trnS-trnG, atpF-atpH, trnC-petN, trnT-psbD and trnP-psaJ) with a percentage of variable sites greater than 1% and no less than five parsimony-informative sites were identified and may be useful for phylogenetic analysis of the genus Fragaria.
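The two selection criteria named above, the percentage of variable sites and the number of parsimony-informative sites, can be counted directly from an alignment. The sketch below is a minimal illustration and not the MEGA 7 procedure used in the study. It assumes uppercase, equal-length aligned sequences and ignores gaps and ambiguity codes, which is a simplifying assumption. A site counts as parsimony-informative when at least two different bases each occur in at least two sequences. The function name and the toy alignment are hypothetical.

```python
from collections import Counter

def site_stats(alignment):
    """Return (variable sites, parsimony-informative sites, percent variable).

    Assumes equal-length, uppercase aligned sequences; characters other
    than A, C, G, T (gaps, ambiguity codes) are ignored at each site.
    """
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all sequences must have the same aligned length")
    variable = 0
    informative = 0
    for column in zip(*alignment):
        counts = Counter(base for base in column if base in "ACGT")
        if len(counts) > 1:
            variable += 1
        # Informative: at least two bases, each seen in at least two sequences.
        if sum(1 for n in counts.values() if n >= 2) >= 2:
            informative += 1
    return variable, informative, 100.0 * variable / length

# Toy alignment for illustration; not data from the study.
toy_alignment = [
    "ACGTACGTAC",
    "ACGTACGTAC",
    "ACTTACGAAC",
    "ACTTACGAAT",
]
print(site_stats(toy_alignment))
```

In the toy alignment, the last column is variable but not parsimony-informative, because the second base occurs in only one sequence.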


2020 ◽  
Vol 11 ◽  
Author(s):  
Peninah Cheptoo Rono ◽  
Xiang Dong ◽  
Jia-Xin Yang ◽  
Fredrick Munyao Mutie ◽  
Millicent A. Oulo ◽  
...  

The genus Alchemilla L., known for its medicinal and ornamental value, is widely distributed in the Holarctic regions with a few species found in Asia and Africa. Delimitation of species within Alchemilla is difficult due to hybridization, autonomous apomixis, and polyploidy, necessitating efficient molecular-based characterization. Herein, we report the first complete chloroplast (cp) genomes of Alchemilla. The cp genomes of two African (Afromilla) species, Alchemilla pedata and Alchemilla argyrophylla, were sequenced, and phylogenetic and comparative analyses were conducted in the family Rosaceae. The cp genomes showed a typical circular quadripartite structure with lengths of 152,438 and 152,427 base pairs (bp) in A. pedata and A. argyrophylla, respectively. Alchemilla cp genomes were composed of a pair of inverted repeat regions (IRa/IRb) of length 25,923 and 25,915 bp, separating the small single copy (SSC) region of 17,980 and 17,981 bp and a large single copy (LSC) region of 82,612 and 82,616 bp in A. pedata and A. argyrophylla, respectively. The cp genomes encoded 114 unique genes including 88 protein-coding genes, 37 transfer RNA (tRNA) genes, and 4 ribosomal RNA (rRNA) genes. Additionally, 88 and 95 simple sequence repeats (SSRs) and 37 and 40 tandem repeats were identified in A. pedata and A. argyrophylla, respectively. Significantly, the loss of the group II intron in the atpF gene was detected in Alchemilla species. Phylogenetic analysis based on 26 whole cp genome sequences and 78 protein-coding gene sequences of 27 Rosaceae species revealed a monophyletic clustering of Alchemilla nested within subfamily Rosoideae. Based on the protein-coding regions, negative selective pressure (Ka/Ks < 1) was detected, with an average Ka/Ks value of 0.1322 in A. argyrophylla and 0.1418 in A. pedata. The availability of complete cp genomes in the genus Alchemilla will contribute to species delineation and further phylogenetic and evolutionary studies in the family Rosaceae.


2021 ◽  
Author(s):  
Mahtab Moghaddam ◽  
Atsushi Ohta ◽  
Motoki Shimizu ◽  
Ryohei Terauchi ◽  
Shahrokh Kazempour-Osaloo

Plastid genome sequences provide valuable markers for surveying the evolutionary relationships and population genetics of plant species. In the present study, the complete plastid genome of Onobrychis gaubae, endemic to Iran, was sequenced using Illumina paired-end sequencing and was compared with previously known genomes of the IRLC species of legumes. The O. gaubae plastid genome was 123,645 bp in length and included a large single-copy (LSC) region of 81,034 bp, a small single-copy (SSC) region of 13,788 bp and one copy of the inverted repeat (IRb) of 28,823 bp. The genome encoded 110 genes, including 76 protein-coding genes, 30 transfer RNA (tRNA) genes and four ribosomal RNA (rRNA) genes, and possessed 89 simple sequence repeats (SSRs) and 28 repeated structures, with the highest proportion in the LSC. Comparative analysis of the chloroplast genomes across IRLC revealed three hotspot genes (ycf1, ycf2, clpP) which could be used as molecular markers for resolving phylogenetic relationships and species identification. IRLC plastid genomes also showed multiple gene losses and inversions. Phylogenetic analyses revealed that O. gaubae is closely related to Hedysarum. The complete O. gaubae genome is a valuable resource for investigating evolution of Onobrychis species and can be used to identify related species.


2021 ◽  
Vol 11 ◽  
Author(s):  
Yongtan Li ◽  
Yan Dong ◽  
Yichao Liu ◽  
Xiaoyue Yu ◽  
Minsheng Yang ◽  
...  

In this study, we assembled and annotated the chloroplast (cp) genomes of three Euonymus species, Euonymus fortunei, Euonymus phellomanus, and Euonymus maackii, and performed a series of analyses to investigate gene structure, GC content, sequence alignment, and nucleotide diversity, with the objectives of identifying positively selected genes and understanding evolutionary relationships. The results indicated that the Euonymus cp genomes were 156,860–157,611 bp in length and exhibited a typical circular quadripartite structure. Similar to the majority of angiosperm chloroplast genomes, they comprised a large single-copy region (LSC; 85,826–86,299 bp) and a small single-copy region (SSC; 18,319–18,536 bp), separated by a pair of inverted repeats (IRA and IRB; 26,341–26,700 bp) that carry the same sequence in opposite orientations. Each chloroplast genome was annotated with 130–131 genes, including 85–86 protein-coding genes, 37 tRNA genes, and eight rRNA genes, with GC contents of 37.26–37.31%. The GC content varied among regions and was highest in the inverted repeat (IR) region. The IR boundary in Euonymus has expanded, so that rps19 lies entirely within the IR region and is fully duplicated. Such fluctuations at the border positions might be helpful in determining evolutionary relationships among Euonymus. The simple sequence repeats (SSRs) of Euonymus species were composed primarily of the mononucleotide repeats (A)n and (T)n, and were mostly 10–12 bp in length, with an obvious A/T bias. We identified several loci with suitable polymorphism for potential use as molecular markers for inferring the phylogeny within the genus Euonymus. Signatures of positive selection were seen in the protein-coding gene rpoB. Based on data from the whole chloroplast genome, common single-copy genes, and the LSC, SSC, and IR regions, we constructed an evolutionary tree of Euonymus and related species, the results of which were consistent with traditional taxonomic classifications. The tree showed that E. fortunei is sister to Euonymus japonicus, whereas E. maackii is sister to Euonymus hamiltonianus. Our study provides important genetic information to support further investigations into the phylogenetic development and adaptive evolution of Euonymus species.
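The comparison of GC content among regions can be illustrated with a small helper that computes the percentage of G and C bases in each region sequence. This is a minimal sketch, assuming each region is available as a plain string. The region sequences below are toy values, not Euonymus data, and the function name is hypothetical.

```python
def gc_content(seq):
    """Percent G+C among unambiguous bases (A, C, G, T) in a sequence."""
    seq = seq.upper()
    acgt = sum(seq.count(base) for base in "ACGT")
    if acgt == 0:
        raise ValueError("sequence contains no A, C, G or T bases")
    return 100.0 * (seq.count("G") + seq.count("C")) / acgt

# Toy region sequences for illustration only.
toy_regions = {
    "LSC": "ATATTAGCAT",
    "SSC": "AATTATATTC",
    "IR": "GCGCATGCAT",
}

for name, seq in toy_regions.items():
    print(f"{name}: {gc_content(seq):.1f}% GC")

# Region with the highest GC content in the toy data.
richest = max(toy_regions, key=lambda name: gc_content(toy_regions[name]))
print("highest GC:", richest)
```

Ambiguity codes such as N are excluded from the denominator here, which is one of several reasonable conventions.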


2019 ◽  
Vol 20 (16) ◽  
pp. 4040 ◽  
Author(s):  
Yingxian Cui ◽  
Xinlian Chen ◽  
Liping Nie ◽  
Wei Sun ◽  
Haoyu Hu ◽  
...  

Amomum villosum is an important medicinal and edible plant with several pharmacologically active volatile oils. However, distinguishing A. villosum from A. villosum var. xanthioides and A. longiligulare, which exhibit similar morphological characteristics, is difficult. The main goal of this study, therefore, is to mine genetic resources and improve molecular methods that could be used to distinguish these species. A total of eight complete chloroplast (cp) genomes of these Amomum species, collected from the main producing areas in China, were determined to be 163,608–164,069 bp in size. All genomes displayed a typical quadripartite structure with a pair of inverted repeat (IR) regions (29,820–29,959 bp) that separated a large single copy (LSC) region (88,680–88,857 bp) from a small single copy (SSC) region (15,288–15,369 bp). Each genome encodes 113 different genes, comprising 79 protein-coding genes, 30 tRNA genes, and four rRNA genes. More than 150 SSRs were identified in the entire cp genomes of these three species. The Sanger sequencing results based on 32 Amomum samples indicated that five highly divergent regions screened from cp genomes could not be used to distinguish Amomum species. Phylogenetic analysis showed that the cp genomes could not only accurately identify Amomum species, but also provide a solid foundation for the establishment of phylogenetic relationships of Amomum species. The availability of cp genome resources and the comparative analysis are beneficial for species authentication and phylogenetic analysis in Amomum.


Genes ◽  
2022 ◽  
Vol 13 (1) ◽  
pp. 113
Author(s):  
Carla L. Saldaña ◽  
Pedro Rodriguez-Grados ◽  
Julio C. Chávez-Galarza ◽  
Shefferson Feijoo ◽  
Juan Carlos Guerrero-Abad ◽  
...  

Capirona (Calycophyllum spruceanum Benth.) belongs to subfamily Ixoroideae, one of the major lineages in the Rubiaceae family, and is an important timber tree. It originated in the Amazon Basin and is widely distributed in Bolivia, Peru, Colombia, and Brazil. In this study, we obtained the first complete chloroplast (cp) genome of capirona from the department of Madre de Dios located in the Peruvian Amazon. High-quality genomic DNA was used to construct libraries. Paired-end clean reads were obtained from a PE 150 library on the Illumina HiSeq 2500 platform. The complete cp genome of C. spruceanum is 154,480 bp in length with a typical quadripartite structure, containing a large single-copy (LSC) region (84,813 bp) and a small single-copy (SSC) region (18,101 bp), separated by two inverted repeat (IR) regions (25,783 bp). The annotation of the C. spruceanum cp genome predicted 87 protein-coding genes (CDS), 8 ribosomal RNA (rRNA) genes, 37 transfer RNA (tRNA) genes, and one pseudogene. A total of 41 simple sequence repeats (SSRs) in this cp genome were classified as mononucleotides (29), dinucleotides (5), trinucleotides (3), and tetranucleotides (4). Most of these repeats were distributed in the noncoding regions. Whole chloroplast genome comparison with six other Ixoroideae species revealed that the small single-copy and large single-copy regions showed more divergence than the inverted repeat regions. Finally, phylogenetic analyses resolved C. spruceanum as sister to Emmenopterys henryi and confirmed its position within the subfamily Ixoroideae. This study reports for the first time the genome organization, gene content, and structural features of the chloroplast genome of C. spruceanum, providing valuable information for genetic and evolutionary studies in the genus Calycophyllum and beyond.
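The classification of SSRs by motif length (mono- to tetranucleotide) can be sketched with regular expressions. The abstract does not state which tool or thresholds were used, so the minimum repeat counts below are assumed values chosen for illustration, and the function is a simplified stand-in for dedicated SSR software. Motifs that are themselves repeats of a shorter unit (for example "AA" or "ATAT") are skipped so that each repeat is reported once under its shortest motif.

```python
import re

# Minimum number of repeat units per motif length (assumed thresholds,
# not taken from the study).
MIN_REPEATS = {1: 10, 2: 5, 3: 4, 4: 3}

def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (0-based start, motif, repeat count) tuples for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in sorted(min_repeats.items()):
        # A motif of unit_len bases followed by at least min_rep - 1 more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs built from a shorter repeating unit, e.g. "AA", "ATAT".
            if any(
                motif == motif[:k] * (unit_len // k)
                for k in range(1, unit_len)
                if unit_len % k == 0
            ):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // unit_len))
    return sorted(hits)

# Toy sequence for illustration only.
toy_seq = "GG" + "A" * 12 + "CC" + "AT" * 6 + "GG" + "TTC" * 4 + "G"
for start, motif, count in find_ssrs(toy_seq):
    print(start, motif, count)
```

Compound and imperfect repeats, which dedicated SSR tools usually handle, are outside the scope of this sketch.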

