Complete chloroplast genomes of Rubus species (Rosaceae) and comparative analysis within the genus

BMC Genomics ◽  
2022 ◽  
Vol 23 (1) ◽  
Author(s):  
Jiaojun Yu ◽  
Jun Fu ◽  
Yuanping Fang ◽  
Jun Xiang ◽  
Hongjin Dong

Abstract. Background: Rubus is the largest genus of the family Rosaceae, and its species are valued as medicinal, edible, and ornamental plants. Here, we sequenced and assembled eight chloroplast (cp) genomes of Rubus from the Dabie Mountains in Central China. The cp genomes of fifty-one Rubus species were comparatively analyzed, including the eight newly sequenced genomes and forty-three previously reported in the GenBank database (NCBI). Results: The eight newly obtained cp genomes had the same quadripartite structure as the other cp genomes in Rubus. The length of the eight plastomes ranged from 155,546 bp to 156,321 bp with similar GC content (37.0 to 37.3%). A total of 133–134 genes were annotated for the Rubus plastomes, comprising 88 or 89 protein-coding genes (PCGs), 37 transfer RNA genes (tRNAs), and eight ribosomal RNA genes (rRNAs). Among them, 16 (or 18) of the genes were duplicated in the IR region. Structural comparative analysis showed that gene content and order were relatively conserved. Nucleotide variability analysis identified nine hotspot regions for genomic divergence and multiple simple sequence repeats (SSRs), which may be used as markers for genetic diversity and phylogenetic analysis. Phylogenetic relationships were highly supported within the family Rosaceae, as evidenced by sub-clade taxa cp genome sequences. Conclusion: Thus, the whole plastome may be used as a super-marker in phylogenetic studies of this genus.
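The GC content figures quoted in these abstracts (for example 37.0 to 37.3% for Rubus) are simple base-composition percentages. The following is a minimal sketch of that calculation, not the authors' pipeline; the function name `gc_content` and the choice to ignore ambiguous bases such as N are assumptions made for illustration.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C among unambiguous A/C/G/T bases.

    Assumption: ambiguous symbols (N, gaps, etc.) are excluded from the
    denominator rather than counted as non-GC.
    """
    seq = seq.upper()
    counted = sum(seq.count(base) for base in "ACGT")
    if counted == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / counted


# Toy sequence (hypothetical, not from any plastome): 4 of 10 bases are G or C.
print(gc_content("ATGCGCATTA"))
```

On a real plastome the same function would be applied to the full assembled sequence read from a FASTA or GenBank record.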

PeerJ ◽  
2020 ◽  
Vol 8 ◽  
pp. e8678 ◽  
Author(s):  
Qing Su ◽  
Luxian Liu ◽  
Mengyu Zhao ◽  
Cancan Zhang ◽  
Dale Zhang ◽  
...  

The D genome progenitor of bread wheat, Aegilops tauschii Cosson (DD, 2n = 2x = 14), which is naturally distributed in Central Eurasia, ranging from northern Syria and Turkey to western China, is considered a potential genetic resource for improving bread wheat. In this study, the chloroplast (cp) genomes of 17 Ae. tauschii accessions were reconstructed. The cp genome sizes ranged from 135,551 bp to 136,009 bp and contained a typical quadripartite structure of angiosperms. Within these genomes, we identified a total of 124 functional genes, including 82 protein-coding genes, 34 transfer RNA genes and eight ribosomal RNA genes, with 17 duplicated genes in the IRs. Although the comparative analysis revealed that the genomic structure (gene order, gene number and IR/SC boundary regions) is conserved, a few variant loci were detected, predominantly in the non-coding regions (intergenic spacer regions). The phylogenetic relationships determined based on the complete genome sequences were consistent with the hypothesis that Ae. tauschii populations in the Yellow River region of China originated in South Asia not Xinjiang province or Iran, which could contribute to more effective utilization of wild germplasm resources. Furthermore, we confirmed that Ae. tauschii was derived from monophyletic speciation rather than hybrid speciation at the cp genome level. We also identified four variable genomic regions, rpl32-trnL-UAG, ccsA-ndhD, rbcL-psaI and rps18-rpl20, showing high levels of nucleotide polymorphisms, which may accordingly prove useful as cpDNA markers in studying the intraspecific genetic structure and diversity of Ae. tauschii.


2019 ◽  
Vol 20 (5) ◽  
pp. 1045 ◽  
Author(s):  
Xiaoqin Li ◽  
Yunjuan Zuo ◽  
Xinxin Zhu ◽  
Shuai Liao ◽  
Jinshuang Ma

Aristolochiaceae, comprising about 600 species, is a unique plant family containing aristolochic acids (AAs). In this study, we sequenced the chloroplast (cp) genomes of seven species of Aristolochia and retrieved eleven published cp genomes for comparative genomic analysis and phylogenetic reconstruction. The results show that the cp genomes had a typical quadripartite structure with conserved genome arrangement and moderate divergence. The cp genomes range from 159,308 bp to 160,520 bp in length and have a similar GC content of 38.5%–38.9%. A total of 113 genes were identified, including 79 protein-coding genes, 30 tRNAs and four rRNAs. Although genomic structure and size were highly conserved, the IR-SC boundary regions were variable between these seven cp genomes. The trnH-GUG genes are one of the major differences between the plastomes of the two subgenera Siphisia and Aristolochia. We analyzed the features of nucleotide substitutions, the distribution of repeat sequences and simple sequence repeats (SSRs), and positive selection in the cp genomes, and identified 16 hotspot regions for genome divergence that could be utilized as potential markers for phylogeny reconstruction. Phylogenetic relationships of the family Aristolochiaceae inferred from the 18 cp genome sequences were consistent and robust, using maximum parsimony (MP), maximum likelihood (ML), and Bayesian inference (BI) methods.


Forests ◽  
2021 ◽  
Vol 12 (7) ◽  
pp. 861
Author(s):  
Huijuan Zhou ◽  
Xiaoxiao Gao ◽  
Keith Woeste ◽  
Peng Zhao ◽  
Shuoxin Zhang

Chloroplast (cp) DNA genomes are traditional workhorses for studying the evolution of species and reconstructing phylogenetic relationships in plants. Species of the genus Castanea (chestnuts and chinquapins) are valued as a source of nuts and timber wherever they grow, and chestnut species hybrids are common. We compared the cp genomes of C. mollissima, C. seguinii, C. henryi, and C. pumila. These cp genomes ranged from 160,805 bp to 161,010 bp in length, comprising a pair of inverted repeat (IR) regions (25,685 to 25,701 bp) separated by a large single-copy (LSC) region (90,440 to 90,560 bp) and a small single-copy (SSC) region (18,970 to 19,049 bp). Each cp genome encoded the same 113 genes: 82–83 protein-coding genes, 30 transfer RNA genes, and four ribosomal RNA genes. There were 18 duplicated genes in the IRs. Comparative analysis of cp genomes revealed that rpl22 was absent in all analyzed species, and the gene ycf1 has been pseudogenized in all Chinese chestnuts except C. pumila. We analyzed the repeats and nucleotide substitutions in these plastomes and detected several highly variable regions. The phylogenetic analyses based on plastomes confirmed the monophyly of Castanea species.


Molecules ◽  
2018 ◽  
Vol 23 (9) ◽  
pp. 2137 ◽  
Author(s):  
Xiang-Xiao Meng ◽  
Yan-Fang Xian ◽  
Li Xiang ◽  
Dong Zhang ◽  
Yu-Hua Shi ◽  
...  

The genus Sanguisorba, which contains about 30 species around the world and seven species in China, is the source of the medicinal plant Sanguisorba officinalis, which is commonly used as a hemostatic agent as well as to treat burns and scalds. Here we report the complete chloroplast (cp) genome sequences of four Sanguisorba species (S. officinalis, S. filiformis, S. stipulata, and S. tenuifolia var. alba). These four Sanguisorba cp genomes exhibit typical quadripartite and circular structures, and are 154,282 to 155,479 bp in length, consisting of large single-copy regions (LSC; 84,405–85,557 bp), small single-copy regions (SSC; 18,550–18,768 bp), and a pair of inverted repeats (IRs; 25,576–25,615 bp). The average GC content was ~37.24%. The four Sanguisorba cp genomes harbored 112 different genes arranged in the same order; these identical sections include 78 protein-coding genes, 30 tRNA genes, and four rRNA genes, if duplicated genes in IR regions are counted only once. A total of 39–53 long repeats and 79–91 simple sequence repeats (SSRs) were identified in the four Sanguisorba cp genomes, which provides opportunities for future studies of the population genetics of Sanguisorba medicinal plants. A phylogenetic analysis using the maximum parsimony (MP) method strongly supports a close relationship between S. officinalis and S. tenuifolia var. alba, followed by S. stipulata, and finally S. filiformis. The availability of these cp genomes provides valuable genetic information for future studies of Sanguisorba identification and provides insights into the evolution of the genus Sanguisorba.


Genome ◽  
2020 ◽  
Vol 63 (7) ◽  
pp. 337-348
Author(s):  
Guanglong Hu ◽  
Lili Cheng ◽  
Wugang Huang ◽  
Qingchang Cao ◽  
Lei Zhou ◽  
...  

Coryloideae is a subfamily in the family Betulaceae consisting of four extant genera: Carpinus, Corylus, Ostrya, and Ostryopsis. We sequenced the plastomes of six species of Corylus and one species of Ostryopsis for comparative and phylogenetic analyses. The plastomes are 159–160 kb long and possess typical quadripartite cp architecture. The plastomes show moderate divergence and conserved arrangement. Five mutational hotspots were identified by comparing the plastomes of seven species of Coryloideae: trnG-atpA, trnF-ndhJ, accD-psaI, ndhF-ccsA, and ycf1. We assembled the most complete phylogenomic tree for the family Betulaceae using 68 plastomes. Our cp genomic sequence phylogenetic analyses placed Carpinus, Ostrya, and Ostryopsis in a clade together and left Corylus in a separate clade. Within the genus Corylus, these analyses indicate the existence of five subclades reflecting the phylogeographical relationships among the species. The data offer significant genetic information for the identification of species of the Coryloideae, taxonomic and phylogenetic studies, and molecular breeding.
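Mutational hotspots of the kind listed above are commonly located by scanning an alignment with a sliding window and looking for peaks in nucleotide diversity. The sketch below illustrates that idea only; the abstracts do not state which tool or settings were used, so the function names, the window size, and the step size are all assumptions.

```python
from itertools import combinations


def nucleotide_diversity(aligned):
    """Mean pairwise proportion of differing sites across aligned sequences.

    Sites where either sequence has a gap or ambiguous base are skipped
    for that pair (an assumption of this sketch).
    """
    total, pairs = 0.0, 0
    for a, b in combinations(aligned, 2):
        compared = diffs = 0
        for x, y in zip(a, b):
            if x in "ACGT" and y in "ACGT":
                compared += 1
                diffs += x != y
        if compared:
            total += diffs / compared
            pairs += 1
    return total / pairs if pairs else 0.0


def sliding_pi(aligned, window, step):
    """Return (window_start, diversity) for each full window along the alignment."""
    length = len(aligned[0])
    if any(len(s) != length for s in aligned):
        raise ValueError("all sequences must have the same aligned length")
    return [
        (start, nucleotide_diversity([s[start:start + window] for s in aligned]))
        for start in range(0, length - window + 1, step)
    ]


# Toy alignment (hypothetical): variation is confined to the second window.
toy = ["AAAAAAAA", "AAAAAAAT", "AAAAAATT"]
for start, pi in sliding_pi(toy, window=4, step=4):
    print(start, round(pi, 4))
```

Windows with the highest values would be the candidate hotspots; on real plastome alignments the window and step are typically hundreds of base pairs rather than four.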


2020 ◽  
Author(s):  
Hukam C. Rawal ◽  
Abhishek Mazumder ◽  
Sangeeta Borchetia ◽  
Biswajit Bera ◽  
S. Soundararajan ◽  
...  

Abstract Tea is an important plantation crop in several Asian and African countries. Based on morphological characteristics, tea is classified botanically into two main types, Assam and China, which are morphologically very distinct. Because the two types cross-pollinate so readily, a third category, the Cambod type, has also been described. Although the general consensus is that tea originated in the region where India, Burma, and China meet, the specific origins of China and Assam tea are not yet clear. In the present study, we attempted to understand the origin of Indian tea through a comparative analysis of different chloroplast (cp) genomes within the genus Camellia. Cp genome-based phylogenetic analysis indicated that the Indian Assam tea TV-1 formed a group distinct from that of China tea, suggesting that TV-1 might have undergone a different domestication and hence have a different origin. Simple sequence repeat (SSR) analysis and codon usage distribution patterns also supported the clustering order in the cp genome-based phylogenetic tree.


2019 ◽  
Vol 2019 ◽  
pp. 1-17 ◽  
Author(s):  
Samaila S. Yaradua ◽  
Dhafer A. Alzahrani ◽  
Enas J. Albokhary ◽  
Abidina Abba ◽  
Abubakar Bello

The complete chloroplast genome of J. flava, an endangered medicinal plant in Saudi Arabia, was sequenced and compared with the cp genomes of three Acanthaceae species to characterize the cp genome, identify SSRs, and detect variation among the cp genomes of the sampled Acanthaceae. NOVOPlasty was used to assemble the complete chloroplast genome from the whole-genome data. The cp genome of J. flava was 150,888 bp in length with a GC content of 38.2% and has a quadripartite structure; the genome harbors one pair of inverted repeats (IRa and IRb, 25,500 bp each) separated by a large single-copy region (LSC, 82,995 bp) and a small single-copy region (SSC, 16,893 bp). There are 132 genes in the genome, which include 80 protein-coding genes, 30 tRNA, and 4 rRNA; 113 are unique while the remaining 19 are duplicated in the IR regions. The repeat analysis indicates that the genome contains all types of repeats, with palindromic repeats occurring most frequently; the analysis also identified a total of 98 simple sequence repeats (SSRs), the majority of which are A/T mononucleotides located in the intergenic spacers. The comparative analysis with the other sampled cp genomes indicated that the inverted repeat regions are more conserved than the single-copy regions and that the noncoding regions show a higher rate of variation than the coding regions. All the genomes have the ndhF and ycf1 genes at the border junction of IRb and SSC. Sequence divergence analysis of the protein-coding genes showed that seven genes (petB, atpF, psaI, rpl32, rpl16, ycf1, and clpP) are under positive selection. The phylogenetic analysis revealed that Justiceae is sister to Ruellieae. This study reported the first cp genome of the largest genus in Acanthaceae and provided resources for studying the genetic diversity of J. flava as well as resolving phylogenetic relationships within the core Acanthaceae.
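In a quadripartite plastome the total length is the LSC plus the SSC plus two copies of the IR. As a quick consistency check, the figures reported for J. flava above can be summed directly; the function name `plastome_length` is a hypothetical helper introduced for this sketch.

```python
def plastome_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length of a quadripartite plastome: LSC + SSC + two IR copies."""
    return lsc + ssc + 2 * ir


# Region lengths (bp) as reported in the abstract for J. flava.
computed = plastome_length(lsc=82_995, ssc=16_893, ir=25_500)
reported_total = 150_888

print(computed, computed == reported_total)
```

The same check is a cheap way to catch transcription errors when region lengths are copied from an annotation into a table.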


Plants ◽  
2019 ◽  
Vol 8 (10) ◽  
pp. 410 ◽  
Author(s):  
Xiaolei Yu ◽  
Wei Tan ◽  
Huanyu Zhang ◽  
Han Gao ◽  
Wenxiu Wang ◽  
...  

Ampelopsis humulifolia (A. humulifolia) and Ampelopsis japonica (A. japonica), which belong to the family Vitaceae, are valued as medicinal plants. Chloroplast (cp) genomes have been recognized as a reliable source of data for marker selection and phylogenetic studies. Therefore, in this study we report the complete cp genome sequences of two Ampelopsis species. Results showed that the cp genomes of A. humulifolia and A. japonica were 161,724 and 161,430 bp in length, respectively, with 37.3% guanine-cytosine (GC) content. A total of 114 unique genes were identified in each cp genome, comprising 80 protein-coding genes, 30 tRNA genes, and 4 rRNA genes. We identified 95 and 99 simple sequence repeats (SSRs) in A. humulifolia and A. japonica, respectively. The location and distribution of long repeats in the two cp genomes were identified. A highly divergent region, psbZ (Photosystem II reaction center protein Z)-trnG (tRNA-Glycine), was found and could be treated as a potential marker for Vitaceae, and the corresponding primers were designed. Additionally, phylogenetic analysis showed that Vitis was closer to Tetrastigma than to Ampelopsis. In general, this study provides valuable genetic resources for DNA barcoding marker identification and phylogenetic analyses of Ampelopsis.
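SSR counts such as the 95 and 99 reported above come from scanning the sequence for short motifs repeated in tandem. The sketch below shows one way to do this with regular-expression backreferences. It is not the tool used in any of these studies, and the minimum repeat thresholds in `MIN_REPEATS` are assumed values chosen for illustration; published counts depend heavily on the thresholds chosen.

```python
import re

# Assumed minimum number of tandem copies for each motif length (1 to 6 bp).
MIN_REPEATS = {1: 10, 2: 5, 3: 4, 4: 3, 5: 3, 6: 3}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, motif, copies) tuples for perfect tandem repeats."""
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in sorted(min_repeats.items()):
        # A motif of unit_len bases followed by at least (min_rep - 1) more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are just a shorter unit repeated (e.g. "AA", "ATAT"),
            # since those runs are reported under the shorter motif.
            if any(
                motif == motif[:k] * (unit_len // k)
                for k in range(1, unit_len)
                if unit_len % k == 0
            ):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // unit_len))
    return sorted(hits)


# Toy sequence (hypothetical): one A-mononucleotide run and one AT-dinucleotide run.
toy = "GGC" + "A" * 12 + "CGC" + "AT" * 6 + "GCC"
print(find_ssrs(toy))
```

Start positions are 0-based offsets into the input string, so they would need to be shifted by one to match 1-based genome coordinates.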


Forests ◽  
2021 ◽  
Vol 12 (6) ◽  
pp. 710
Author(s):  
Heng Liang ◽  
Juan Chen

Zingibereae is a large tribe in the family Zingiberaceae, which contains plants with important medicinal, edible, and ornamental values. Although tribes of Zingiberaceae are well circumscribed, the circumscription of many genera within Zingibereae and the relationships among them remain elusive, especially for the genera Boesenbergia, Curcuma, Kaempferia and Pyrgophyllum. In this study, we investigated the plastome variation in nine species representing five genera of Zingibereae. All plastomes showed a typical quadripartite structure with lengths ranging from 162,042 bp to 163,539 bp and contained 132–134 genes, consisting of 86–88 coding genes, 38 transfer RNA genes and eight ribosomal RNA genes. Moreover, the characteristics of the long repeat sequences and simple sequence repeats (SSRs) were determined. In addition, we conducted phylogenomic analyses of the Zingibereae and related taxa with plastome data from an additional 32 species from GenBank. Our results confirmed that Stahlianthus is closely related to Curcuma, supporting the idea of merging it into Curcuma. Kaempferia, Boesenbergia and Zingiber were confirmed as close relatives and grouped together as the Kaempferia group. Pyrgophyllum is not allied with the Curcuma clade but instead is embedded within the Hedychium clade. Our results demonstrate the power of plastid phylogenomics in improving the phylogenetic relationships within Zingibereae and provide a new insight into plastome evolution in Zingiberaceae.


Author(s):  
Umar Rehman ◽  
Nighat Sultana ◽  
Abdullah ◽  
Abbas Jamal ◽  
Maryam Muzaffar ◽  
...  

The family Phyllanthaceae is one of the largest segregates of the eudicot order Malpighiales, and its species are herbs, shrubs, and trees that are mostly distributed in tropical regions. Certain taxonomic discrepancies exist at the genus and family levels. Here, we report chloroplast genomes of three Phyllanthaceae species (Phyllanthus emblica, Flueggea virosa, and Leptopus cordifolius) and compare them with six other previously reported Phyllanthaceae chloroplast genomes. The species of Phyllanthaceae displayed a quadripartite structure, comprising inverted repeat regions (IRa and IRb) that separate large single-copy (LSC) and small single-copy (SSC) regions. The length of the complete chloroplast genome ranged from 154,707 bp to 161,093 bp; LSC from 83,627 bp to 89,932 bp; IRs from 23,921 bp to 27,128 bp; and SSC from 17,424 bp to 19,441 bp. Chloroplast genomes contained 111 to 112 unique genes, including 77 to 78 protein-coding, 30 transfer RNA (tRNA), and 4 ribosomal RNA (rRNA) genes that showed similarities in arrangement. The number of protein-coding genes varied due to deletion/pseudogenization of rps16 genes in Baccaurea ramiflora and Leptopus cordifolius. High variability was seen in the number of oligonucleotide repeats, while analyses of guanine-cytosine (GC) content, codon usage, amino acid frequency, simple sequence repeats, synonymous and non-synonymous substitutions, and transition and transversion substitutions showed similarities in all Phyllanthaceae species. We detected a higher number of transition substitutions in the coding sequences than in the non-coding sequences. Moreover, a higher number of transition substitutions was found among distantly related species than among closely related species. Phylogenetic analysis shows the polyphyletic nature of the genus Phyllanthus, which requires further verification. We also determined suitable polymorphic coding genes, including rpl22, ycf1, matK, ndhF, and rps15, which may be helpful for reconstructing a high-resolution phylogenetic tree of the family Phyllanthaceae using a large number of species in the future. Overall, the current study provides insight into chloroplast genome evolution in Phyllanthaceae.
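Transition and transversion substitutions, as compared in the abstract above, are distinguished by base chemistry: a transition swaps a purine for a purine (A/G) or a pyrimidine for a pyrimidine (C/T), and any other substitution is a transversion. A minimal sketch of counting them between two aligned sequences follows; the function name `count_ts_tv` and the rule of skipping gaps and ambiguous bases are assumptions of this illustration.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def count_ts_tv(seq_a: str, seq_b: str):
    """Return (transitions, transversions) between two aligned sequences.

    Columns containing a gap or ambiguous base in either sequence are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = 0
    for x, y in zip(seq_a.upper(), seq_b.upper()):
        if x == y or x not in "ACGT" or y not in "ACGT":
            continue
        same_class = ({x, y} <= PURINES) or ({x, y} <= PYRIMIDINES)
        if same_class:
            transitions += 1
        else:
            transversions += 1
    return transitions, transversions


# Toy pair (hypothetical): A>G and C>T are transitions, T>A is a transversion.
print(count_ts_tv("AGCTA", "GGTAA"))
```

Running the same count separately on coding and non-coding alignment blocks would give the kind of comparison the abstract describes.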

