scholarly journals Mangrove tree (Avicennia marina): insight into chloroplast genome evolutionary divergence and its comparison with related species from family Acanthaceae

2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Sajjad Asaf ◽  
Abdul Latif Khan ◽  
Muhammad Numan ◽  
Ahmed Al-Harrasi

AbstractAvicennia marina (family Acanthaceae) is a halotolerant woody shrub that grows wildly and cultivated in the coastal regions. Despite its importance, the species suffers from lack of genomic datasets to improve its taxonomy and phylogenetic placement across the related species. Here, we have aimed to sequence the plastid genome of A. marina and its comparison with related species in family Acanthaceae. Detailed next-generation sequencing and analysis showed a complete chloroplast genome of 150,279 bp, comprising 38.6% GC. Genome architecture is quadripartite revealing large single copy (82,522 bp), small single copy (17,523 bp), and pair of inverted repeats (25,117 bp). Furthermore, the genome contains 132 different genes, including 87 protein-coding genes, 8 rRNA, 37 tRNA genes, and 126 simple sequence repeats (122 mononucleotide, 2 dinucleotides, and 2 trinucleotides). Interestingly, about 25 forward, 15 reversed and 14 palindromic repeats were also found in the A. marina. High degree synteny was observed in the pairwise alignment with related genomes. The chloroplast genome comparative assessment showed a high degree of sequence similarity in coding regions and varying divergence in the intergenic spacers among ten Acanthaceae species. The pairwise distance showed that A. marina exhibited the highest divergence (0.084) with Justicia flava and showed lowest divergence with Aphelandra knappiae (0.059). Current genomic datasets are a valuable resource for investigating the population and evolutionary genetics of family Acanthaceae members’ specifically A. marina and related species.

Plants ◽  
2020 ◽  
Vol 9 (8) ◽  
pp. 928
Author(s):  
Lubna ◽  
Sajjad Asaf ◽  
Rahmatullah Jan ◽  
Abdul Latif Khan ◽  
In-Jung Lee

Oxalis corniculata L. (family Oxalidaceae) is a small creeper wood sorrel plant that grows well in moist climates. Despite being medicinally important, little is known about the genomics of this species. Here, we determined the complete chloroplast genome sequence of O. corniculata for the first time and compared it with other members of family Oxalidaceae. The genome was 152,189 bp in size and comprised of a pair of 25,387 bp inverted repeats (IR) that separated a large 83,427 bp single copy region (LSC) and a small 16,990 bp single copy region (SSC). The chloroplast genome of O. corniculata contains 131 genes with 83 protein coding genes, 40 tRNA genes, and 8 rRNA genes. The analysis revealed 46 microsatellites, of which 6 were present in coding sequences (CDS) regions, 34 in the LSC, 8 in the SSC, and 2 in the single IR region. Twelve palindromic repeats, 30 forward repeats, and 32 tandem repeats were also detected. Chloroplast genome comparisons revealed an overall high degree of sequence similarity between O. corniculata and O. drummondii and some divergence in the intergenic spacers of related species in Oxalidaceae. Furthermore, the seven most divergent genes (ccsA, clpP, rps8, rps15, rpl22, matK, and ycf1) among genomes were observed. Phylogenomic characterization on the basis of 60 shared genes revealed that O. corniculata is closely related to O. drummondii. The complete O. corniculata genome sequenced in the present study is a valuable resource for investigating the population and evolutionary genetics of family Oxalidaceae and can be used to identify related species.


Plants ◽  
2020 ◽  
Vol 9 (10) ◽  
pp. 1354
Author(s):  
Slimane Khayi ◽  
Fatima Gaboun ◽  
Stacy Pirro ◽  
Tatiana Tatusova ◽  
Abdelhamid El Mousadik ◽  
...  

Argania spinosa (Sapotaceae), an important endemic Moroccan oil tree, is a primary source of argan oil, which has numerous dietary and medicinal proprieties. The plant species occupies the mid-western part of Morocco and provides great environmental and socioeconomic benefits. The complete chloroplast (cp) genome of A. spinosa was sequenced, assembled, and analyzed in comparison with those of two Sapotaceae members. The A. spinosa cp genome is 158,848 bp long, with an average GC content of 36.8%. The cp genome exhibits a typical quadripartite and circular structure consisting of a pair of inverted regions (IR) of 25,945 bp in length separating small single-copy (SSC) and large single-copy (LSC) regions of 18,591 and 88,367 bp, respectively. The annotation of A. spinosa cp genome predicted 130 genes, including 85 protein-coding genes (CDS), 8 ribosomal RNA (rRNA) genes, and 37 transfer RNA (tRNA) genes. A total of 44 long repeats and 88 simple sequence repeats (SSR) divided into mononucleotides (76), dinucleotides (7), trinucleotides (3), tetranucleotides (1), and hexanucleotides (1) were identified in the A. spinosa cp genome. Phylogenetic analyses using the maximum likelihood (ML) method were performed based on 69 protein-coding genes from 11 species of Ericales. The results confirmed the close position of A. spinosa to the Sideroxylon genus, supporting the revisiting of its taxonomic status. The complete chloroplast genome sequence will be valuable for further studies on the conservation and breeding of this medicinally and culinary important species and also contribute to clarifying the phylogenetic position of the species within Sapotaceae.


Plants ◽  
2020 ◽  
Vol 9 (1) ◽  
pp. 61 ◽  
Author(s):  
Huyen-Trang Vu ◽  
Ngan Tran ◽  
Thanh-Diem Nguyen ◽  
Quoc-Luan Vu ◽  
My-Huyen Bui ◽  
...  

Paphiopedilum delenatii is a native orchid of Vietnam with highly attractive floral traits. Unfortunately, it is now listed as a critically endangered species with a few hundred individuals remaining in nature. In this study, we performed next-generation sequencing of P. delenatii and assembled its complete chloroplast genome. The whole chloroplast genome of P. delenatii was 160,955 bp in size, 35.6% of which was GC content, and exhibited typical quadripartite structure of plastid genomes with four distinct regions, including the large and small single-copy regions and a pair of inverted repeat regions. There were, in total, 130 genes annotated in the genome: 77 coding genes, 39 tRNA genes, 8 rRNA genes, and 6 pseudogenes. The loss of ndh genes and variation in inverted repeat (IR) boundaries as well as data of simple sequence repeats (SSRs) and divergent hotspots provided useful information for identification applications and phylogenetic studies of Paphiopedilum species. Whole chloroplast genomes could be used as an effective super barcode for species identification or for developing other identification markers, which subsequently serves the conservation of Paphiopedilum species.


Author(s):  
Wojciech Pląder ◽  
Yasushi Yukawa ◽  
Masahiro Sugiura ◽  
Stefan Malepszy

AbstractThe complete nucleotide sequence of the cucumber (C. sativus L. var. Borszczagowski) chloroplast genome has been determined. The genome is composed of 155,293 bp containing a pair of inverted repeats of 25,191 bp, which are separated by two single-copy regions, a small 18,222-bp one and a large 86,688-bp one. The chloroplast genome of cucumber contains 130 known genes, including 89 protein-coding genes, 8 ribosomal RNA genes (4 rRNA species), and 37 tRNA genes (30 tRNA species), with 18 of them located in the inverted repeat region. Of these genes, 16 contain one intron, and two genes and one ycf contain 2 introns. Twenty-one small inversions that form stem-loop structures, ranging from 18 to 49 bp, have been identified. Eight of them show similarity to those of other species, while eight seem to be cucumber specific. Detailed comparisons of ycf2 and ycf15, and the overall structure to other chloroplast genomes were performed.


Author(s):  
Inkyu Park ◽  
Wook-Jin Kim ◽  
Sang-Min Yeo ◽  
Goya Choi ◽  
Young-Min Kang ◽  
...  

The genus Fritillaria belongs to the widely distributed family Liliaceae. The bulbs of Fritillaria ussuriensis and Fritillaria cirrhosa are valuable herbaceous medicinal ingredients. However, they are still used indiscriminately in herbal medicine. Identification and molecular phylogenic analysis of Fritillaria species is therefore required. Here, we report the complete chloroplast (cp) genome sequences of F. ussuriensis and F. cirrhosa. The two Fritillaria cp genomes were 151,524 and 151,083 bp in length, respectively, including a pair of inverted repeat regions (52,678 and 52,156 bp) separated by a large single copy region (81,732 and 81,390 bp) and small single copy region (17,114 and 17,537 bp). A total of 111 genes in F. ussuriensis and 112 in F. cirrhosa comprised 77 protein-coding genes in F. ussuriensis and 78 in F. cirrhosa, 30 tRNA genes, and four rRNA genes. The gene order, content, and orientation of the two Fritillaria cp genomes exhibited the general structure of flowering plants, and were similar to those of other Fritillaria species. Comparison of the six Fritillaria species’ cp genomes indicated seven highly divergent regions in intergenic spacers and in the matK, rpoC1, rpoC2, ycf1, ycf2, ndhD, and ndhF coding regions. We established the position of the six species through phylogenic analysis. The complete chloroplast genome sequences of two Fritillaria species will be useful genomics resources for identification of Fritillaria species and for studying the phylogenetic relationship among Fritillaria species within the Liliaceae family.


PeerJ ◽  
2019 ◽  
Vol 7 ◽  
pp. e7210 ◽  
Author(s):  
Wan Zhang ◽  
Yunlin Zhao ◽  
Guiyan Yang ◽  
Jiao Peng ◽  
Shuwen Chen ◽  
...  

Camellia oleiferais one of the four largest woody edible oil plants in the world with high ecological and medicinal values. Due to frequent interspecific hybridization, it was difficult to study its genetics and evolutionary history. This study usedC. oleiferathat was collected on Hainan Island to conduct our research. The unique island environment makes the quality of tea oil higher than that of other species grown in the mainland. Moreover, a long-term geographic isolation might affect gene structure. In order to better understand the molecular biology of this species, protect excellent germplasm resources, and promote the population genetics and phylogenetic studies ofCamelliaplants, high-throughput sequencing technology was used to obtain the chloroplast genome sequence of HainanC. oleifera. The results showed that the whole chloroplast genome ofC. oleiferain Hainan was 156,995 bp in length, with a typical quadripartite structure of a large single copy (LSC) region of 86,648 bp, a small single copy (SSC) region of 18,297 bp, and a pair of inverted repeats (IRs) of 26,025 bp. The whole genome encoded a total of 141 genes (115 different genes), including 88 protein-coding genes, 45 tRNA genes, and eight rRNA genes. Among these genes, nine genes contained one intron, two genes contained two introns, and four overlapping genes were also detected. The total GC content of HainanC. oleifera’s chloroplast genome was 37.29%. The chloroplast genome structure characteristics of HainanC. oleiferawere compared with mainlandC. oleiferaand those of the other eight closely related Theaceae species; it was found that the contractions and expansions of the IR/LSC and IR/SSC regions affected the length of chloroplast genome. The chloroplast genome sequences of these Theaceae species were highly similar. A comparative analysis indicated that the Theaceae species were conserved in structure and evolution. A total of 51 simple sequence repeat (SSR) loci were detected in the chloroplast genome of HainanC. oleifera, and allCamelliaplants did not have pentanucleotide repeats, which could be used as a good marker in phylogenetic studies. We also detected seven long repeats, the base composition of all repeats was biased toward A/T, which was consistent with the codon bias. It was found that HainanC. oleiferahad a similar evolutionary relationship withC. crapnelliana, through the use of codons and phylogenetic analysis. This study can provide an effective genomic resource for the evolutionary history of Theaceae family.


Forests ◽  
2021 ◽  
Vol 12 (5) ◽  
pp. 608
Author(s):  
Sang-Chul Kim ◽  
Jei-Wan Lee ◽  
Byoung-Ki Choi

In the present study, chloroplast genome sequences of four species of Symplocos (S. chinensis for. pilosa, S. prunifolia, S. coreana, and S. tanakana) from South Korea were obtained by Ion Torrent sequencing and compared with the sequences of three previously reported Symplocos chloroplast genomes from different species. The length of the Symplocos chloroplast genome ranged from 156,961 to 157,365 bp. Overall, 132 genes including 87 functional genes, 37 tRNA genes, and eight rRNA genes were identified in all Symplocos chloroplast genomes. The gene order and contents were highly similar across the seven species. The coding regions were more conserved than the non-coding regions, and the large single-copy and small single-copy regions were less conserved than the inverted repeat regions. We identified five new hotspot regions (rbcL, ycf4, psaJ, rpl22, and ycf1) that can be used as barcodes or species-specific Symplocos molecular markers. These four novel chloroplast genomes provide basic information on the plastid genome of Symplocos and enable better taxonomic characterization of this genus.


Author(s):  
Umar Rehman ◽  
Nighat Sultana ◽  
Abdullah . ◽  
Abbas Jamal ◽  
Maryam Muzaffar ◽  
...  

Family Phyllanthaceae is one of the largest segregates of the eudicot order Malpighiales and its species are herb, shrub, and tree, which are mostly distributed in tropical regions. Certain taxonomic discrepancies exist at genus and family level. Here, we report chloroplast genomes of three Phyllanthaceae species—Phyllanthus emblica, Flueggea virosa, and Leptopus cordifolius— and compare them with six others previously reported Phyllanthaceae chloroplast genomes. The species of Phyllanthaceae displayed quadripartite structure, comprising inverted repeat regions (IRa and IRb) that separate large single copy (LSC) and small single copy (SSC) regions. The length of complete chloroplast genome ranged from 154,707 bp to 161,093 bp; LSC from 83,627 bp to 89,932 bp; IRs from 23,921 bp to 27,128 bp; and SSC from 17,424 bp to 19,441 bp. Chloroplast genomes contained 111 to 112 unique genes, including 77 to 78 protein-coding, 30 transfer RNA (tRNA), and 4 ribosomal RNA (rRNA) that showed similarities in arrangement. The number of protein-coding genes varied due to deletion/pseudogenization of rps16 genes in Baccaurea ramiflora and Leptopus cordifolius. High variability was seen in number of oligonucleotide repeats while analysis of guanine-cytosine (GC) content, codon usage, amino acid frequency, simple sequence repeats analysis, synonymous and non-synonymous substitutions, and transition and transversion substitutions showed similarities in all Phyllanthaceae species. We detected a higher number of transition substitutions in the coding sequences than non-coding sequences. Moreover, the high number of transition substitutions was determined among the distantly related species in comparison to closely related species. Phylogenetic analysis shows the polyphyletic nature of the genus Phyllanthus which requires further verification. We also determined suitable polymorphic coding genes, including rpl22, ycf1, matK, ndhF, and rps15 which may be helpful for the reconstruction of the high-resolution phylogenetic tree of the family Phyllanthaceae using a large number of species in the future. Overall, the current study provides insight into chloroplast genome evolution in Phyllanthaceae.


2021 ◽  
Vol 12 ◽  
Author(s):  
Jin Xu ◽  
Chen Liu ◽  
Yun Song ◽  
Mingfu Li

The genus Pennisetum (Poaceae) is both a forage crop and staple food crop in the tropics. In this study, we obtained chloroplast genome sequences of four species of Pennisetum (P. alopecuroides, P. clandestinum, P. glaucum, and P. polystachion) using Illumina sequencing. These chloroplast genomes have circular structures of 136,346–138,119 bp, including a large single-copy region (LSC, 79,380–81,186 bp), a small single-copy region (SSC, 12,212–12,409 bp), and a pair of inverted repeat regions (IRs, 22,284–22,372 bp). The overall GC content of these chloroplast genomes was 38.6–38.7%. The complete chloroplast genomes contained 110 different genes, including 76 protein-coding genes, 30 transfer RNA (tRNA) genes, and four ribosomal RNA (rRNA) genes. Comparative analysis of nucleotide variability identified nine intergenic spacer regions (psbA-matK, matK-rps16, trnN-trnT, trnY-trnD-psbM, petN-trnC, rbcL-psaI, petA-psbJ, psbE-petL, and rpl32-trnL), which may be used as potential DNA barcodes in future species identification and evolutionary analysis of Pennisetum. The phylogenetic analysis revealed a close relationship between P. polystachion and P. glaucum, followed by P. clandestinum and P. alopecuroides. The completed genomes of this study will help facilitate future research on the phylogenetic relationships and evolution of Pennisetum species.


Plants ◽  
2020 ◽  
Vol 9 (10) ◽  
pp. 1332
Author(s):  
Kai Jiang ◽  
Li-Yuan Miao ◽  
Zheng-Wei Wang ◽  
Zi-Yi Ni ◽  
Chao Hu ◽  
...  

Although the medicinal properties of Coelogyne spp. have been previously studied, there is little genomic information providing a valuable tool for the plant taxonomy, conservation, and utilization of this genus. This study used the next-generation MiSeq sequencing platform to characterize the chloroplast (cp) genomes of Coelogyne fimbriata and Coelogyne ovalis. The Maximum Likelihood (ML) and Bayesian (BI) methods were employed to confirm the phylogenetic position of two Coelogyne species based on the whole chloroplast genome sequences. Additionally, we developed eight new primers based on the two cp genomes’ medium variable regions and evaluated the transferability to another 16 Coelogyne species. We constructed phylogenetic trees including 18 Coelogyne species and four outgroup species using the chloroplast fragments with the ML method. Our results showed that the cp genomes of C. fimbriata and C. ovalis contained a small single-copy region (18,839 and 18,851 bp, respectively) and a large single-copy region (87,606 and 87,759 bp, respectively), separated by two same-length inverted-repeat regions (26,675 bp in C. fimbriata and 26,715 bp C. ovalis, respectively). They all contained 86 protein-coding genes, 38 tRNA genes, and eight rRNA genes, revealing strong structure and gene content similarities. The phylogenetic analysis indicated a close relationship between the genera Coelogyne and Pleione. The newly developed primers revealed good transferability among the Coelogyne taxa and provided enough variable sites to distinguish C. fimbriata and C. ovalis. The two complete cp genomes and the eight new primers of Coelogyne provide new genomic data for further studies on phylogenomics, population genetics, and evolutionary history of Coelogyne taxa.


Sign in / Sign up

Export Citation Format

Share Document