scholarly journals Complete Chloroplast Genome of Argania spinosa: Structural Organization and Phylogenetic Relationships in Sapotaceae

Plants ◽  
2020 ◽  
Vol 9 (10) ◽  
pp. 1354
Author(s):  
Slimane Khayi ◽  
Fatima Gaboun ◽  
Stacy Pirro ◽  
Tatiana Tatusova ◽  
Abdelhamid El Mousadik ◽  
...  

Argania spinosa (Sapotaceae), an important endemic Moroccan oil tree, is a primary source of argan oil, which has numerous dietary and medicinal proprieties. The plant species occupies the mid-western part of Morocco and provides great environmental and socioeconomic benefits. The complete chloroplast (cp) genome of A. spinosa was sequenced, assembled, and analyzed in comparison with those of two Sapotaceae members. The A. spinosa cp genome is 158,848 bp long, with an average GC content of 36.8%. The cp genome exhibits a typical quadripartite and circular structure consisting of a pair of inverted regions (IR) of 25,945 bp in length separating small single-copy (SSC) and large single-copy (LSC) regions of 18,591 and 88,367 bp, respectively. The annotation of A. spinosa cp genome predicted 130 genes, including 85 protein-coding genes (CDS), 8 ribosomal RNA (rRNA) genes, and 37 transfer RNA (tRNA) genes. A total of 44 long repeats and 88 simple sequence repeats (SSR) divided into mononucleotides (76), dinucleotides (7), trinucleotides (3), tetranucleotides (1), and hexanucleotides (1) were identified in the A. spinosa cp genome. Phylogenetic analyses using the maximum likelihood (ML) method were performed based on 69 protein-coding genes from 11 species of Ericales. The results confirmed the close position of A. spinosa to the Sideroxylon genus, supporting the revisiting of its taxonomic status. The complete chloroplast genome sequence will be valuable for further studies on the conservation and breeding of this medicinally and culinary important species and also contribute to clarifying the phylogenetic position of the species within Sapotaceae.

2020 ◽  
Author(s):  
Aziz Ebrahimi ◽  
Jennifer D. Antonides ◽  
Cornelia C. Pinchot ◽  
James M. Slavicek ◽  
Charles E. Flower ◽  
...  

ABSTRACTAmerican elm, Ulmus americana L., was cultivated widely in USA and Canada as a landscape tree, but the genome of this important species is poorly characterized. For the first time, we describe the sequencing and assembly of the chloroplast genomes of two American elm genotypes (RV16 and Am57845). The complete chloroplast genome of U. americana ranged from 158,935-158,993 bp. The genome contains 127 genes, including 85 protein-coding genes, 34 tRNA genes and 8 rRNA genes. Between the two American elm chloroplasts we sequenced, we identified 240 sequence variants (SNPs and indels). To evaluate the phylogeny of American elm, we compared the chloroplast genomes of two American elms along with seven Asian elm species and twelve other chloroplast genomes available through the NCBI database. As expected, Ulmus was closely related to Morus and Cannabis, as all three genera are assigned to the Urticales. Comparison of American elm with Asian elms revealed that trnH was absent from the chloroplast of American elm but not most Asian elms; conversely, petB, petD, psbL, trnK, and rps16 are present in the American elm but absent from all Asian elms. The complete chloroplast genome of U. americana will provide useful genetic resources for characterizing the genetic diversity of U. americana and potentially help to conserve natural populations of American elm.


2021 ◽  
Vol 12 ◽  
Author(s):  
Yifan Yu ◽  
Zhen Ouyang ◽  
Juan Guo ◽  
Wen Zeng ◽  
Yujun Zhao ◽  
...  

Erigeron breviscapus is a famous medicinal plant. However, the limited chloroplast genome information of E. breviscapus, especially for the chloroplast DNA sequence resources, has hindered the study of E. breviscapus chloroplast genome transformation. Here, the complete chloroplast (cp) genome of E. breviscapus was reported. This genome was 152,164bp in length, included 37.2% GC content and was structurally arranged into two 24,699bp inverted repeats (IRs) and two single-copy areas. The sizes of the large single-copy region and the small single-copy region were 84,657 and 18,109bp, respectively. The E. breviscapus cp genome consisted of 127 coding genes, including 83 protein coding genes, 36 transfer RNA (tRNA) genes, and eight ribosomal RNA (rRNA) genes. For those genes, 95 genes were single copy genes and 16 genes were duplicated in two inverted regions with seven tRNAs, four rRNAs, and five protein coding genes. Then, genomic DNA of E. breviscapus was used as a template, and the endogenous 5' and 3' flanking sequences of the trnI gene and trnA gene were selected as homologous recombinant fragments in vector construction and cloned through PCR. The endogenous 5' flanking sequences of the psbA gene and rrn16S gene, the endogenous 3' flanking sequences of the psbA gene, rbcL gene, and rps16 gene and one sequence element from the psbN-psbH chloroplast operon were cloned, and certain chloroplast regulatory elements were identified. Two homologous recombination fragments and all of these elements were constructed into the cloning vector pBluescript SK (+) to yield a series of chloroplast expression vectors, which harbored the reporter gene EGFP and the selectable marker aadA gene. After identification, the chloroplast expression vectors were transformed into Escherichia coli and the function of predicted regulatory elements was confirmed by a spectinomycin resistance test and fluorescence intensity measurement. The results indicated that aadA gene and EGFP gene were efficiently expressed under the regulation of predicted regulatory elements and the chloroplast expression vector had been successfully constructed, thereby providing a solid foundation for establishing subsequent E. breviscapus chloroplast transformation system and genetic improvement of E. breviscapus.


2019 ◽  
Vol 2019 ◽  
pp. 1-17 ◽  
Author(s):  
Samaila S. Yaradua ◽  
Dhafer A. Alzahrani ◽  
Enas J. Albokhary ◽  
Abidina Abba ◽  
Abubakar Bello

The complete chloroplast genome of J. flava, an endangered medicinal plant in Saudi Arabia, was sequenced and compared with cp genome of three Acanthaceae species to characterize the cp genome, identify SSRs, and also detect variation among the cp genomes of the sampled Acanthaceae. NOVOPlasty was used to assemble the complete chloroplast genome from the whole genome data. The cp genome of J. flava was 150, 888bp in length with GC content of 38.2%, and has a quadripartite structure; the genome harbors one pair of inverted repeat (IRa and IRb 25, 500bp each) separated by large single copy (LSC, 82, 995 bp) and small single copy (SSC, 16, 893 bp). There are 132 genes in the genome, which includes 80 protein coding genes, 30 tRNA, and 4 rRNA; 113 are unique while the remaining 19 are duplicated in IR regions. The repeat analysis indicates that the genome contained all types of repeats with palindromic occurring more frequently; the analysis also identified total number of 98 simple sequence repeats (SSR) of which majority are mononucleotides A/T and are found in the intergenic spacer. The comparative analysis with other cp genomes sampled indicated that the inverted repeat regions are conserved than the single copy regions and the noncoding regions show high rate of variation than the coding region. All the genomes have ndhF and ycf1 genes in the border junction of IRb and SSC. Sequence divergence analysis of the protein coding genes showed that seven genes (petB, atpF, psaI, rpl32, rpl16, ycf1, and clpP) are under positive selection. The phylogenetic analysis revealed that Justiceae is sister to Ruellieae. This study reported the first cp genome of the largest genus in Acanthaceae and provided resources for studying genetic diversity of J. flava as well as resolving phylogenetic relationships within the core Acanthaceae.


2021 ◽  
Author(s):  
Weicai Song ◽  
Zimeng Chen ◽  
Qi Feng ◽  
Chuxuan Ji ◽  
Chengbo Wei ◽  
...  

Abstract Background: Litsea, Lauraceae, is a group of evergreen trees or shrubs that widely distributed in tropical and subtropical countries, such as Asia and America. Species in Litsea are spontaneously distributed at a maximum altitude of 2,700 m from sea level. Pants and its extractions from Litsea species cover a wide range of medicinal and industrial values. The aromatic oil extracted from Litsea is of great value with citral as its main component. At present, studies related to gene resources of Litsea are limited in the morphological analysis, while studies at the genetic level are insufficient. We therefore firstly assembled and annotated the complete chloroplast genome of nine species in Litsea, carried out a serious of comparative analysis, and completed the construction of phylogenetic tree within genus Litsea. Results: The genome length ranged from 152,051 to 152,717 bp. A total of 128 genes were identified, including 84 protein-coding genes, 36 rRNA genes and 8 tRNA genes. High consistency of codon bias, repeats, divergent analysis, single nucleotide polymorphisms (SNP) and insertions and deletions (InDels) revealed highly conserved chloroplast phenotypes in species within the genus Litsea. Changes in gene length and the present of pseudogene ycf1Ψ that caused by IR contraction and expansion were reported. The non-coding regions, especially atpF - atpH and ndhC - trnV-UAC presented high gene divergence. PsbJ - psbE regions showed remarkably high nucleotide diversity (Pi) values. Furthermore, we constructed two phylogenetic trees, demonstrating two dominant clades within genus Litsea. And the differences between trees constructed by full chloroplast (cp) genome and protein-coding genes were revealed. Conclusion: Overall, the evolutionary pattern of Litsea species regarding structural features, repeats sequences and variations presented high consistency. Valuable genomic resources and theoretical basis were also provided for further research of taxonomic discrepancies, molecular marker-assisted breeding and phylogenetic relationships of Litsea and other angiosperm species.


2021 ◽  
Vol 11 ◽  
Author(s):  
Yongtan Li ◽  
Yan Dong ◽  
Yichao Liu ◽  
Xiaoyue Yu ◽  
Minsheng Yang ◽  
...  

In this study, we assembled and annotated the chloroplast (cp) genome of the Euonymus species Euonymus fortunei, Euonymus phellomanus, and Euonymus maackii, and performed a series of analyses to investigate gene structure, GC content, sequence alignment, and nucleic acid diversity, with the objectives of identifying positive selection genes and understanding evolutionary relationships. The results indicated that the Euonymus cp genome was 156,860–157,611bp in length and exhibited a typical circular tetrad structure. Similar to the majority of angiosperm chloroplast genomes, the results yielded a large single-copy region (LSC) (85,826–86,299bp) and a small single-copy region (SSC) (18,319–18,536bp), separated by a pair of sequences (IRA and IRB; 26,341–26,700bp) with the same encoding but in opposite directions. The chloroplast genome was annotated to 130–131 genes, including 85–86 protein coding genes, 37 tRNA genes, and eight rRNA genes, with GC contents of 37.26–37.31%. The GC content was variable among regions and was highest in the inverted repeat (IR) region. The IR boundary of Euonymus happened expanding resulting that the rps19 entered into IR region and doubled completely. Such fluctuations at the border positions might be helpful in determining evolutionary relationships among Euonymus. The simple-sequence repeats (SSRs) of Euonymus species were composed primarily of single nucleotides (A)n and (T)n, and were mostly 10–12bp in length, with an obvious A/T bias. We identified several loci with suitable polymorphism with the potential use as molecular markers for inferring the phylogeny within the genus Euonymus. Signatures of positive selection were seen in rpoB protein encoding genes. Based on data from the whole chloroplast genome, common single copy genes, and the LSC, SSC, and IR regions, we constructed an evolutionary tree of Euonymus and related species, the results of which were consistent with traditional taxonomic classifications. It showed that E. fortunei sister to the Euonymus japonicus, whereby E. maackii appeared as sister to Euonymus hamiltonianus. Our study provides important genetic information to support further investigations into the phylogenetic development and adaptive evolution of Euonymus species.


Plants ◽  
2019 ◽  
Vol 8 (4) ◽  
pp. 89 ◽  
Author(s):  
Yuying Huang ◽  
Zerui Yang ◽  
Song Huang ◽  
Wenli An ◽  
Jing Li ◽  
...  

In the last decade, several studies have relied on a small number of plastid genomes to deduce deep phylogenetic relationships in the species-rich Myrtaceae. Nevertheless, the plastome of Rhodomyrtus tomentosa, an important representative plant of the Rhodomyrtus (DC.) genera, has not yet been reported yet. Here, we sequenced and analyzed the complete chloroplast (CP) genome of R. tomentosa, which is a 156,129-bp-long circular molecule with 37.1% GC content. This CP genome displays a typical quadripartite structure with two inverted repeats (IRa and IRb), of 25,824 bp each, that are separated by a small single copy region (SSC, 18,183 bp) and one large single copy region (LSC, 86,298 bp). The CP genome encodes 129 genes, including 84 protein-coding genes, 37 tRNA genes, eight rRNA genes and three pseudogenes (ycf1, rps19, ndhF). A considerable number of protein-coding genes have a universal ATG start codon, except for psbL and ndhD. Premature termination codons (PTCs) were found in one protein-coding gene, namely atpE, which is rarely reported in the CP genome of plants. Phylogenetic analysis revealed that R. tomentosa has a sister relationship with Eugenia uniflora and Psidium guajava. In conclusion, this study identified unique characteristics of the R. tomentosa CP genome providing valuable information for further investigations on species identification and the phylogenetic evolution between R. tomentosa and related species.


Genome ◽  
2020 ◽  
Vol 63 (1) ◽  
pp. 53-60 ◽  
Author(s):  
Liping Nie ◽  
Yingxian Cui ◽  
Xinlian Chen ◽  
Zhichao Xu ◽  
Wei Sun ◽  
...  

Arctium lappa, commonly called burdock, has a long medicinal and edible history. It has recently gained increasing attention because of its economic value. In this study, we obtained the complete chloroplast genome of A. lappa by Illumina Hiseq. The complete chloroplast genome of A. lappa is a typical circular structure with 152 708 bp in length. The GC content in the whole chloroplast genome of A. lappa is 37.7%. A total of 37 tRNA genes, 8 rRNA genes, and 87 protein-coding genes were successfully annotated. And the chloroplast genome contains 113 unique genes, 19 of which are duplicated in the inverted repeat. The distribution of 39 simple sequence repeats was analysed, and most of them are in the large single-copy (LSC) sequence. An inversion comprising 16 genes was found in the LSC region, which is 26 283 bp long. We performed multiple sequence alignments using 72 common protein-coding genes of 29 species and constructed a Maximum Parsimony (MP) tree. The MP phylogenetic result shows that A. lappa grouped together with Carthamus tinctorius, Centaurea diffusa, and Saussurea involucrata. The chloroplast genome of A. lappa is a valuable resource for further studies in Asteraceae.


BMC Genomics ◽  
2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Abdul Latif Khan ◽  
Sajjad Asaf ◽  
Lubna ◽  
Ahmed Al-Rawahi ◽  
Ahmed Al-Harrasi

Abstract Background Salvadora persica L. (Toothbrush tree – Miswak; family-Salvadoraceae) grows in the arid-land ecosystem and possesses economic and medicinal importance. The species, genus and the family have no genomic datasets available specifically on chloroplast (cp) genomics and taxonomic evolution. Herein, we have sequenced the complete chloroplast genome of S. persica for the first time and compared it with 11 related specie’s cp genomes from the order Brassicales. Results The S. persica cp genome was 153,379 bp in length containing a sizeable single-copy region (LSC) of 83,818 bp which separated from the small single-copy region (SSC) of 17,683 bp by two inverted repeats (IRs) each 25,939 bp. Among these genomes, the largest cp genome size (160,600 bp) was found in M. oleifera, while in S. persica it was the smallest (153,379 bp). The cp genome of S. persica encoded 131 genes, including 37 tRNA genes, eight rRNA genes and 86 protein-coding genes. Besides, S. persica contains 27 forward, 36 tandem and 19 palindromic repeats. The S. persica cp genome had 154 SSRs with the highest number in the LSC region. Complete cp genome comparisons showed an overall high degree of sequence resemblance between S. persica and related cp genomes. Some divergence was observed in the intergenic spaces of other species. Phylogenomic analyses of 60 shared genes indicated that S. persica formed a single clade with A. tetracantha with high bootstrap values. The family Salvadoraceae is closely related to Capparaceae and Petadiplandraceae rather than to Bataceae and Koberliniacaea. Conclusion The current genomic datasets provide pivotal genetic resources to determine the phylogenetic relationships, genome evolution and future genetic diversity-related studies of S. persica in complex angiosperm families.


PeerJ ◽  
2016 ◽  
Vol 4 ◽  
pp. e2734 ◽  
Author(s):  
Xin Yao ◽  
Ying-Ying Liu ◽  
Yun-Hong Tan ◽  
Yu Song ◽  
Richard T. Corlett

Complete chloroplast genome sequences have been very useful for understanding phylogenetic relationships in angiosperms at the family level and above, but there are currently large gaps in coverage. We report the chloroplast genome forHelwingia himalaica, the first in the distinctive family Helwingiaceae and only the second genus to be sequenced in the order Aquifoliales. We then combine this with 36 published sequences in the large (c. 35,000 species) subclass Campanulidae in order to investigate relationships at the order and family levels. TheHelwingiagenome consists of 158,362 bp containing a pair of inverted repeat (IR) regions of 25,996 bp separated by a large single-copy (LSC) region and a small single-copy (SSC) region which are 87,810 and 18,560 bp, respectively. There are 142 known genes, including 94 protein-coding genes, eight ribosomal RNA genes, and 40 tRNA genes. The topology of the phylogenetic relationships between Apiales, Asterales, and Dipsacales differed between analyses based on complete genome sequences and on 36 shared protein-coding genes, showing that further studies of campanulid phylogeny are needed.


PLoS ONE ◽  
2021 ◽  
Vol 16 (3) ◽  
pp. e0248556
Author(s):  
Bin Zhu ◽  
Fang Qian ◽  
Yunfeng Hou ◽  
Weicheng Yang ◽  
Mengxian Cai ◽  
...  

Eruca sativa Mill. (Brassicaceae) is an important edible vegetable and a potential medicinal plant due to the antibacterial activity of its seed oil. Here, the complete chloroplast (cp) genome of E. sativa was de novo assembled with a combination of long PacBio reads and short Illumina reads. The E. sativa cp genome had a quadripartite structure that was 153,522 bp in size, consisting of one large single-copy region of 83,320 bp and one small single-copy region of 17,786 bp which were separated by two inverted repeat (IRa and IRb) regions of 26,208 bp. This complete cp genome harbored 113 unique genes: 79 protein-coding genes, 30 tRNA genes, and four rRNA genes. Forty-nine long repetitive sequences and 69 simple sequence repeats were identified in the E. sativa cp genome. A codon usage analysis of the E. sativa cp genome showed a bias toward codons ending in A/T. The E. sativa cp genome was similar in size, gene composition, and linearity of the structural region when compared with other Brassicaceae cp genomes. Moreover, the analysis of the synonymous (Ks) and non-synonymous (Ka) substitution rates demonstrated that protein-coding genes generally underwent purifying selection pressure, expect ycf1, ycf2, and rps12. A phylogenetic analysis determined that E. sativa is evolutionarily close to important Brassica species, indicating that it may be possible to transfer favorable E. sativa alleles into other Brassica species. Our results will be helpful to advance genetic improvement and breeding of E. sativa, and will provide valuable information for utilizing E. sativa as an important resource to improve other Brassica species.


Sign in / Sign up

Export Citation Format

Share Document