Chloroplast Genome of Phyllanthus Emblica and Leptopus Cordifolius: Comparative Analysis and Phylogenetic within Family Phyllanthaceae

Family Phyllanthaceae is one of the largest segregates of the eudicot order Malpighiales and its species are herb, shrub, and tree, which are mostly distributed in tropical regions. Certain taxonomic discrepancies exist at genus and family level. Here, we report chloroplast genomes of three Phyllanthaceae species—Phyllanthus emblica, Flueggea virosa, and Leptopus cordifolius— and compare them with six others previously reported Phyllanthaceae chloroplast genomes. The species of Phyllanthaceae displayed quadripartite structure, comprising inverted repeat regions (IRa and IRb) that separate large single copy (LSC) and small single copy (SSC) regions. The length of complete chloroplast genome ranged from 154,707 bp to 161,093 bp; LSC from 83,627 bp to 89,932 bp; IRs from 23,921 bp to 27,128 bp; and SSC from 17,424 bp to 19,441 bp. Chloroplast genomes contained 111 to 112 unique genes, including 77 to 78 protein-coding, 30 transfer RNA (tRNA), and 4 ribosomal RNA (rRNA) that showed similarities in arrangement. The number of protein-coding genes varied due to deletion/pseudogenization of rps16 genes in Baccaurea ramiflora and Leptopus cordifolius. High variability was seen in number of oligonucleotide repeats while analysis of guanine-cytosine (GC) content, codon usage, amino acid frequency, simple sequence repeats analysis, synonymous and non-synonymous substitutions, and transition and transversion substitutions showed similarities in all Phyllanthaceae species. We detected a higher number of transition substitutions in the coding sequences than non-coding sequences. Moreover, the high number of transition substitutions was determined among the distantly related species in comparison to closely related species. Phylogenetic analysis shows the polyphyletic nature of the genus Phyllanthus which requires further verification. We also determined suitable polymorphic coding genes, including rpl22, ycf1, matK, ndhF, and rps15 which may be helpful for the reconstruction of the high-resolution phylogenetic tree of the family Phyllanthaceae using a large number of species in the future. Overall, the current study provides insight into chloroplast genome evolution in Phyllanthaceae.

Download Full-text

Comparative Chloroplast Genomics in Phyllanthaceae Species

Diversity ◽

10.3390/d13090403 ◽

2021 ◽

Vol 13 (9) ◽

pp. 403

Author(s):

Umar Rehman ◽

Nighat Sultana ◽

Abdullah ◽

Abbas Jamal ◽

Maryam Muzaffar ◽

...

Keyword(s):

Chloroplast Genome ◽

De Novo ◽

Single Copy ◽

Protein Coding ◽

Coding Sequences ◽

Tropical Regions ◽

Chloroplast Genomes ◽

The Family ◽

Insight Into ◽

Small Single Copy

Family Phyllanthaceae belongs to the eudicot order Malpighiales, and its species are herbs, shrubs, and trees that are mostly distributed in tropical regions. Here, we elucidate the molecular evolution of the chloroplast genome in Phyllanthaceae and identify the polymorphic loci for phylogenetic inference. We de novo assembled the chloroplast genomes of three Phyllanthaceae species, i.e., Phyllanthus emblica, Flueggea virosa, and Leptopus cordifolius, and compared them with six other previously reported genomes. All species comprised two inverted repeat regions (size range 23,921–27,128 bp) that separated large single-copy (83,627–89,932 bp) and small single-copy (17,424–19,441 bp) regions. Chloroplast genomes contained 111–112 unique genes, including 77–78 protein-coding, 30 tRNAs, and 4 rRNAs. The deletion/pseudogenization of rps16 genes was found in only two species. High variability was seen in the number of oligonucleotide repeats, while guanine-cytosine contents, codon usage, amino acid frequency, simple sequence repeats, synonymous and non-synonymous substitutions, and transition and transversion substitutions were similar. The transition substitutions were higher in coding sequences than in non-coding sequences. Phylogenetic analysis revealed the polyphyletic nature of the genus Phyllanthus. The polymorphic protein-coding genes, including rpl22, ycf1, matK, ndhF, and rps15, were also determined, which may be helpful for reconstructing the high-resolution phylogenetic tree of the family Phyllanthaceae. Overall, the study provides insight into the chloroplast genome evolution in Phyllanthaceae.

Download Full-text

Structural and Comparative Analysis of the Complete Chloroplast Genome of Pyrus hopeiensis—“Wild Plants with a Tiny Population”—and Three Other Pyrus Species

International Journal of Molecular Sciences ◽

10.3390/ijms19103262 ◽

2018 ◽

Vol 19 (10) ◽

pp. 3262 ◽

Cited By ~ 9

Author(s):

Yongtan Li ◽

Jun Zhang ◽

Longfei Li ◽

Lijuan Gao ◽

Jintao Xu ◽

...

Keyword(s):

Chloroplast Genome ◽

Population Decline ◽

Gc Content ◽

Single Copy ◽

Wild Plants ◽

Similar Degree ◽

Protein Coding ◽

Chloroplast Genomes ◽

Close Relationship ◽

History Of

Pyrus hopeiensis is a valuable wild resource of Pyrus in the Rosaceae. Due to its limited distribution and population decline, it has been listed as one of the “wild plants with a tiny population” in China. To date, few studies have been conducted on P. hopeiensis. This paper offers a systematic review of P. hopeiensis, providing a basis for the conservation and restoration of P. hopeiensis resources. In this study, the chloroplast genomes of two different genotypes of P. hopeiensis, P. ussuriensis Maxin. cv. Jingbaili, P. communis L. cv. Early Red Comice, and P. betulifolia were sequenced, compared and analyzed. The two P. hopeiensis genotypes showed a typical tetrad chloroplast genome, including a pair of inverted repeats encoding the same but opposite direction sequences, a large single copy (LSC) region, and a small single copy (SSC) region. The length of the chloroplast genome of P. hopeiensis HB-1 was 159,935 bp, 46 bp longer than that of the chloroplast genome of P. hopeiensis HB-2. The lengths of the SSC and IR regions of the two Pyrus genotypes were identical, with the only difference present in the LSC region. The GC content was only 0.02% higher in P. hopeiensis HB-1. The structure and size of the chloroplast genome, the gene species, gene number, and GC content of P. hopeiensis were similar to those of the other three Pyrus species. The IR boundary of the two genotypes of P. hopeiensis showed a similar degree of expansion. To determine the evolutionary history of P. hopeiensis within the genus Pyrus and the Rosaceae, 57 common protein-coding genes from 36 Rosaceae species were analyzed. The phylogenetic tree showed a close relationship between the genera Pyrus and Malus, and the relationship between P. hopeiensis HB-1 and P. hopeiensis HB-2 was the closest.

Download Full-text

Comparative Analyses of Euonymus Chloroplast Genomes: Genetic Structure, Screening for Loci With Suitable Polymorphism, Positive Selection Genes, and Phylogenetic Relationships Within Celastrineae

Frontiers in Plant Science ◽

10.3389/fpls.2020.593984 ◽

2021 ◽

Vol 11 ◽

Author(s):

Yongtan Li ◽

Yan Dong ◽

Yichao Liu ◽

Xiaoyue Yu ◽

Minsheng Yang ◽

...

Keyword(s):

Positive Selection ◽

Chloroplast Genome ◽

Gc Content ◽

Single Copy ◽

Rrna Genes ◽

Evolutionary Relationships ◽

Trna Genes ◽

Protein Coding ◽

Chloroplast Genomes ◽

Cp Genome

In this study, we assembled and annotated the chloroplast (cp) genome of the Euonymus species Euonymus fortunei, Euonymus phellomanus, and Euonymus maackii, and performed a series of analyses to investigate gene structure, GC content, sequence alignment, and nucleic acid diversity, with the objectives of identifying positive selection genes and understanding evolutionary relationships. The results indicated that the Euonymus cp genome was 156,860–157,611bp in length and exhibited a typical circular tetrad structure. Similar to the majority of angiosperm chloroplast genomes, the results yielded a large single-copy region (LSC) (85,826–86,299bp) and a small single-copy region (SSC) (18,319–18,536bp), separated by a pair of sequences (IRA and IRB; 26,341–26,700bp) with the same encoding but in opposite directions. The chloroplast genome was annotated to 130–131 genes, including 85–86 protein coding genes, 37 tRNA genes, and eight rRNA genes, with GC contents of 37.26–37.31%. The GC content was variable among regions and was highest in the inverted repeat (IR) region. The IR boundary of Euonymus happened expanding resulting that the rps19 entered into IR region and doubled completely. Such fluctuations at the border positions might be helpful in determining evolutionary relationships among Euonymus. The simple-sequence repeats (SSRs) of Euonymus species were composed primarily of single nucleotides (A)n and (T)n, and were mostly 10–12bp in length, with an obvious A/T bias. We identified several loci with suitable polymorphism with the potential use as molecular markers for inferring the phylogeny within the genus Euonymus. Signatures of positive selection were seen in rpoB protein encoding genes. Based on data from the whole chloroplast genome, common single copy genes, and the LSC, SSC, and IR regions, we constructed an evolutionary tree of Euonymus and related species, the results of which were consistent with traditional taxonomic classifications. It showed that E. fortunei sister to the Euonymus japonicus, whereby E. maackii appeared as sister to Euonymus hamiltonianus. Our study provides important genetic information to support further investigations into the phylogenetic development and adaptive evolution of Euonymus species.

Download Full-text

Complete Chloroplast Genomes from Sanguisorba: Identity and Variation Among Four Species

Molecules ◽

10.3390/molecules23092137 ◽

2018 ◽

Vol 23 (9) ◽

pp. 2137 ◽

Cited By ~ 6

Author(s):

Xiang-Xiao Meng ◽

Yan-Fang Xian ◽

Li Xiang ◽

Dong Zhang ◽

Yu-Hua Shi ◽

...

Keyword(s):

Gc Content ◽

Single Copy ◽

Rrna Genes ◽

Trna Genes ◽

Protein Coding ◽

Future Studies ◽

Chloroplast Genomes ◽

Close Relationship ◽

Cp Genome ◽

Sanguisorba Officinalis

The genus Sanguisorba, which contains about 30 species around the world and seven species in China, is the source of the medicinal plant Sanguisorba officinalis, which is commonly used as a hemostatic agent as well as to treat burns and scalds. Here we report the complete chloroplast (cp) genome sequences of four Sanguisorba species (S. officinalis, S. filiformis, S. stipulata, and S. tenuifolia var. alba). These four Sanguisorba cp genomes exhibit typical quadripartite and circular structures, and are 154,282 to 155,479 bp in length, consisting of large single-copy regions (LSC; 84,405–85,557 bp), small single-copy regions (SSC; 18,550–18,768 bp), and a pair of inverted repeats (IRs; 25,576–25,615 bp). The average GC content was ~37.24%. The four Sanguisorba cp genomes harbored 112 different genes arranged in the same order; these identical sections include 78 protein-coding genes, 30 tRNA genes, and four rRNA genes, if duplicated genes in IR regions are counted only once. A total of 39–53 long repeats and 79–91 simple sequence repeats (SSRs) were identified in the four Sanguisorba cp genomes, which provides opportunities for future studies of the population genetics of Sanguisorba medicinal plants. A phylogenetic analysis using the maximum parsimony (MP) method strongly supports a close relationship between S. officinalis and S. tenuifolia var. alba, followed by S. stipulata, and finally S. filiformis. The availability of these cp genomes provides valuable genetic information for future studies of Sanguisorba identification and provides insights into the evolution of the genus Sanguisorba.

Download Full-text

The complete chloroplast genome of Saxifraga sinomontana (Saxifragaceae) and comparative analysis with other Saxifragaceae species

Revista Brasileira de Botânica ◽

10.1007/s40415-019-00561-y ◽

2019 ◽

Vol 42 (4) ◽

pp. 601-611 ◽

Cited By ~ 1

Author(s):

Yan Li ◽

Liukun Jia ◽

Zhihua Wang ◽

Rui Xing ◽

Xiaofeng Chi ◽

...

Keyword(s):

Comparative Analysis ◽

Chloroplast Genome ◽

Phylogenetic Relationships ◽

De Novo ◽

Single Copy ◽

Bootstrap Support ◽

Protein Coding ◽

Complete Chloroplast Genome ◽

Protein Coding Genes ◽

Chloroplast Genomes

Abstract Saxifraga sinomontana J.-T. Pan & Gornall belongs to Saxifraga sect. Ciliatae subsect. Hirculoideae, a lineage containing ca. 110 species whose phylogenetic relationships are largely unresolved due to recent rapid radiations. Analyses of complete chloroplast genomes have the potential to significantly improve the resolution of phylogenetic relationships in this young plant lineage. The complete chloroplast genome of S. sinomontana was de novo sequenced, assembled and then compared with that of other six Saxifragaceae species. The S. sinomontana chloroplast genome is 147,240 bp in length with a typical quadripartite structure, including a large single-copy region of 79,310 bp and a small single-copy region of 16,874 bp separated by a pair of inverted repeats (IRs) of 25,528 bp each. The chloroplast genome contains 113 unique genes, including 79 protein-coding genes, four rRNAs and 30 tRNAs, with 18 duplicates in the IRs. The gene content and organization are similar to other Saxifragaceae chloroplast genomes. Sixty-one simple sequence repeats were identified in the S. sinomontana chloroplast genome, mostly represented by mononucleotide repeats of polyadenine or polythymine. Comparative analysis revealed 12 highly divergent regions in the intergenic spacers, as well as coding genes of matK, ndhK, accD, cemA, rpoA, rps19, ndhF, ccsA, ndhD and ycf1. Phylogenetic reconstruction of seven Saxifragaceae species based on 66 protein-coding genes received high bootstrap support values for nearly all identified nodes, suggesting a promising opportunity to resolve infrasectional relationships of the most species-rich section Ciliatae of Saxifraga.

Download Full-text

Complete Chloroplast Genome of Paphiopedilum delenatii and Phylogenetic Relationships among Orchidaceae

Plants ◽

10.3390/plants9010061 ◽

2020 ◽

Vol 9 (1) ◽

pp. 61 ◽

Cited By ~ 5

Author(s):

Huyen-Trang Vu ◽

Ngan Tran ◽

Thanh-Diem Nguyen ◽

Quoc-Luan Vu ◽

My-Huyen Bui ◽

...

Keyword(s):

Chloroplast Genome ◽

Inverted Repeat ◽

Gc Content ◽

Single Copy ◽

Rrna Genes ◽

Trna Genes ◽

Complete Chloroplast Genome ◽

Critically Endangered Species ◽

Plastid Genomes ◽

Chloroplast Genomes

Paphiopedilum delenatii is a native orchid of Vietnam with highly attractive floral traits. Unfortunately, it is now listed as a critically endangered species with a few hundred individuals remaining in nature. In this study, we performed next-generation sequencing of P. delenatii and assembled its complete chloroplast genome. The whole chloroplast genome of P. delenatii was 160,955 bp in size, 35.6% of which was GC content, and exhibited typical quadripartite structure of plastid genomes with four distinct regions, including the large and small single-copy regions and a pair of inverted repeat regions. There were, in total, 130 genes annotated in the genome: 77 coding genes, 39 tRNA genes, 8 rRNA genes, and 6 pseudogenes. The loss of ndh genes and variation in inverted repeat (IR) boundaries as well as data of simple sequence repeats (SSRs) and divergent hotspots provided useful information for identification applications and phylogenetic studies of Paphiopedilum species. Whole chloroplast genomes could be used as an effective super barcode for species identification or for developing other identification markers, which subsequently serves the conservation of Paphiopedilum species.

Download Full-text

The complete structure of the cucumber (Cucumis sativus L.) chloroplast genome: Its composition and comparative analysis

Cellular & Molecular Biology Letters ◽

10.2478/s11658-007-0029-7 ◽

2007 ◽

Vol 12 (4) ◽

Cited By ~ 24

Author(s):

Wojciech Pląder ◽

Yasushi Yukawa ◽

Masahiro Sugiura ◽

Stefan Malepszy

Keyword(s):

Chloroplast Genome ◽

Single Copy ◽

Trna Genes ◽

Inverted Repeat Region ◽

Cucumis Sativus L ◽

Stem Loop ◽

Protein Coding ◽

Chloroplast Genomes ◽

Trna Species ◽

Rrna Species

AbstractThe complete nucleotide sequence of the cucumber (C. sativus L. var. Borszczagowski) chloroplast genome has been determined. The genome is composed of 155,293 bp containing a pair of inverted repeats of 25,191 bp, which are separated by two single-copy regions, a small 18,222-bp one and a large 86,688-bp one. The chloroplast genome of cucumber contains 130 known genes, including 89 protein-coding genes, 8 ribosomal RNA genes (4 rRNA species), and 37 tRNA genes (30 tRNA species), with 18 of them located in the inverted repeat region. Of these genes, 16 contain one intron, and two genes and one ycf contain 2 introns. Twenty-one small inversions that form stem-loop structures, ranging from 18 to 49 bp, have been identified. Eight of them show similarity to those of other species, while eight seem to be cucumber specific. Detailed comparisons of ycf2 and ycf15, and the overall structure to other chloroplast genomes were performed.

Download Full-text

Complete chloroplast genome sequences of Dioscorea: Characterization, genomic resources, and phylogenetic analyses

PeerJ ◽

10.7717/peerj.6032 ◽

2018 ◽

Vol 6 ◽

pp. e6032 ◽

Cited By ~ 12

Author(s):

Zhenyu Zhao ◽

Xin Wang ◽

Yi Yu ◽

Subo Yuan ◽

Dan Jiang ◽

...

Keyword(s):

Chloroplast Genome ◽

Phylogenetic Analyses ◽

Single Copy ◽

Genome Sequences ◽

Complete Chloroplast Genome ◽

Genomic Resources ◽

Chloroplast Genomes ◽

The Family ◽

Small Single Copy

Dioscorea L., the largest genus of the family Dioscoreaceae with over 600 species, is not only an important food but also a medicinal plant. The identification and classification of Dioscorea L. is a rather difficult task. In this study, we sequenced five Dioscorea chloroplast genomes, and analyzed with four other chloroplast genomes of Dioscorea species from GenBank. The Dioscorea chloroplast genomes displayed the typical quadripartite structure of angiosperms, which consisted of a pair of inverted repeats separated by a large single-copy region, and a small single-copy region. The location and distribution of repeat sequences and microsatellites were determined, and the rapidly evolving chloroplast genome regions (trnK-trnQ, trnS-trnG, trnC-petN, trnE-trnT, petG-trnW-trnP, ndhF, trnL-rpl32, and ycf1) were detected. Phylogenetic relationships of Dioscorea inferred from chloroplast genomes obtained high support even in shortest internodes. Thus, chloroplast genome sequences provide potential molecular markers and genomic resources for phylogeny and species identification.

Download Full-text

Characterization of the complete chloroplast genome sequence and phylogenetic analysis of B. oleracea var. italica

10.21203/rs.2.20976/v1 ◽

2020 ◽

Author(s):

Zhenchao Zhang ◽

Zhongliang Dai ◽

Yuemei Yao ◽

Yongfei Pan ◽

Guosheng Sun ◽

...

Keyword(s):

Chloroplast Genome ◽

Genome Sequence ◽

Genomic Structure ◽

Gc Content ◽

Single Copy ◽

Biological Research ◽

Protein Coding ◽

Protein Coding Genes ◽

Cp Genome ◽

Functional Components

Abstract Backgrounds: Broccoli (Brassica. oleracea var. italica L.) is known as one of the most nutritionally rich vegetables, as well as rich in functional components that benefit to health. The main purposes of this research were sequencing, assembling and annotation of chloroplast genome of broccoli based on Illumina HiSeq2500 sequencing platform. Results: The size of the broccoli cp genome is 153,364 bp, including two inverted repeat (IR) regions of 26,197 bp each, separated by a small single copy (SSC) region of 17,834 bp and a large single copy (LSC) region of 83,136 bp. The GC content of the complete genome is 36.36%, while those of SSC, LSC, and IR are 29.1%, 34.15% and 42.35%, respectively. It harbors 134 functional genes, including 87 protein-coding genes, 39 tRNAs and 8 rRNAs, with 31 duplicates in the IRs. The most abundant amino acid in the protein-coding genes is leucine, while the least is cysteine. Codon usage frequency showed bias for A/T-ending codons in the cp genome. In the repeat structure analysis, a total of 34 repeat sequences and 291 simple sequence repeat (SSRs) were detected in the work. Although cp genomic structure and size are highly conserved, the SC-IR boundary regions are variable between the 7 cp genomes. The phylogenetic relationships based on complete cp genome from 9 species suggest that B. oleracea var. italica is closely related to Brassica juncea. Conclusions: The complete cp genome sequence was obtained and annotated for broccoli for the first time. The information acquired from this research will be useful for further species identification, population genetics and biological research of broccoli.

Download Full-text

Complete Chloroplast Genome Sequence of Justicia flava: Genome Comparative Analysis and Phylogenetic Relationships among Acanthaceae

BioMed Research International ◽

10.1155/2019/4370258 ◽

2019 ◽

Vol 2019 ◽

pp. 1-17 ◽

Cited By ~ 4

Author(s):

Samaila S. Yaradua ◽

Dhafer A. Alzahrani ◽

Enas J. Albokhary ◽

Abidina Abba ◽

Abubakar Bello

Keyword(s):

Comparative Analysis ◽

Chloroplast Genome ◽

Phylogenetic Relationships ◽

Inverted Repeat ◽

Gc Content ◽

Single Copy ◽

Protein Coding ◽

Complete Chloroplast Genome ◽

Protein Coding Genes ◽

Cp Genome

The complete chloroplast genome of J. flava, an endangered medicinal plant in Saudi Arabia, was sequenced and compared with cp genome of three Acanthaceae species to characterize the cp genome, identify SSRs, and also detect variation among the cp genomes of the sampled Acanthaceae. NOVOPlasty was used to assemble the complete chloroplast genome from the whole genome data. The cp genome of J. flava was 150, 888bp in length with GC content of 38.2%, and has a quadripartite structure; the genome harbors one pair of inverted repeat (IRa and IRb 25, 500bp each) separated by large single copy (LSC, 82, 995 bp) and small single copy (SSC, 16, 893 bp). There are 132 genes in the genome, which includes 80 protein coding genes, 30 tRNA, and 4 rRNA; 113 are unique while the remaining 19 are duplicated in IR regions. The repeat analysis indicates that the genome contained all types of repeats with palindromic occurring more frequently; the analysis also identified total number of 98 simple sequence repeats (SSR) of which majority are mononucleotides A/T and are found in the intergenic spacer. The comparative analysis with other cp genomes sampled indicated that the inverted repeat regions are conserved than the single copy regions and the noncoding regions show high rate of variation than the coding region. All the genomes have ndhF and ycf1 genes in the border junction of IRb and SSC. Sequence divergence analysis of the protein coding genes showed that seven genes (petB, atpF, psaI, rpl32, rpl16, ycf1, and clpP) are under positive selection. The phylogenetic analysis revealed that Justiceae is sister to Ruellieae. This study reported the first cp genome of the largest genus in Acanthaceae and provided resources for studying genetic diversity of J. flava as well as resolving phylogenetic relationships within the core Acanthaceae.

Download Full-text