scholarly journals Comparative analysis of chloroplast genomes of kenaf cytoplasmic male sterile line and its maintainer line

2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Danfeng Tang ◽  
Fan Wei ◽  
Ruiyang Zhou

AbstractKenaf is a great source of bast fiber and possesses significantly industrial interests. Cytoplasmic male sterility (CMS) is the basis of heterosis utilization in kenaf. Chloroplast, an important organelle for photosynthesis, could be associated with CMS. To understand the phylogenetic position and molecular basis of kenaf CMS from the perspective of chloroplast, the chloroplast (cp) genomes of the CMS line P3A and its maintainer line P3B were characterized and their comparative analysis was also performed. In this study, the chloroplast genomes of P3B and P3A were sequenced with 163,597 bp and 163,360 bp in length, respectively. A total of 131 genes including 85 protein coding genes (PCGs), 38 transfer RNA (tRNA) genes, and 8 ribosome RNA (rRNA) genes were annotated in P3B, while 132 genes containing 83 PCGs, 41 tRNA genes, and 8 rRNA genes were found in P3A. The phylogenetic tree revealed that kenaf was closely related to Hibiscus syriacus and Abelmoschus esculentus. Further analysis of single nucleotide polymorphism (SNP) and insertion and deletion (InDel) showed that compared with P3B, a total of 22 SNPs and 53 InDels were detected in gene coding region, gene intron, and intergenic regions of P3A. Remarkably, a total of 9 SNPs including 6 synonymous SNPs and 3 nonsynonymous SNPs were found in psbK, atpA, rpoC2, atpB, rpl20, clpP, rpoA, and ycf1. The present study provided basic information for further study of kenaf CMS mechsnism.

2020 ◽  
Author(s):  
Yucui Han ◽  
Yujie Gao ◽  
Xiaoguang Zhai ◽  
Hao Zhou ◽  
Qin Ding ◽  
...  

Abstract The sterile line is vital for the heterosis utilization of wheat (Triticum aestivum L.), and the cytoplasmic male sterility (CMS) line has more practical significance in the utilization of heterosis. To reveal the sterile mechanism of wheat K-CMS line K519A from the perspective of the chloroplast genome, the chloroplast genomes of the K519A and its same nuclear genotype maintainer line 519B were sequenced using second-generation high-throughput technology and assembled. Then the chloroplast genomic contents, SSR sequences, long repeat fragments, and qPCR were analyzed. The results showed that the total length of K519A and 519B were 136,996 bp and 136,235 bp, respectively. Both chloroplast genomes encode 126 genes, including 89 protein-coding genes, 8 rRNA genes, and 39 tRNA genes. There were 186 and 188 SSRs in K519A and 519B, respectively. And a developed SSR maker, which from atpF, can be used to distinguish the cytoplasmic type of Aegilops kotschyi and Triticum aestivum, respectively. There were 50 and 52 long repeat fragments, which come from the gene sequences, in K519A and 519B, respectively. A total of 107 mutations between K519A and 519B, which from 45 genes, were identified, of which, 16 genes (matK, rps16, rpoB, rpoC2, atpI, ndhK, atpB, rbcL, psbB, psbH, rpl14, ndhH, ndhF, rpl32, ccsA, ndhA) contained non-synonymous mutations. Further, the qPCR analysis was performed, and the gene expression levels of selected six genes for K519A compared to 519B were mostly downregulated at the binucleate and trinucleate stages of pollen development, thus, these non-synonymous mutant genes might affect the fertility of K519A.


2021 ◽  
Vol 12 ◽  
Author(s):  
Jin Xu ◽  
Chen Liu ◽  
Yun Song ◽  
Mingfu Li

The genus Pennisetum (Poaceae) is both a forage crop and staple food crop in the tropics. In this study, we obtained chloroplast genome sequences of four species of Pennisetum (P. alopecuroides, P. clandestinum, P. glaucum, and P. polystachion) using Illumina sequencing. These chloroplast genomes have circular structures of 136,346–138,119 bp, including a large single-copy region (LSC, 79,380–81,186 bp), a small single-copy region (SSC, 12,212–12,409 bp), and a pair of inverted repeat regions (IRs, 22,284–22,372 bp). The overall GC content of these chloroplast genomes was 38.6–38.7%. The complete chloroplast genomes contained 110 different genes, including 76 protein-coding genes, 30 transfer RNA (tRNA) genes, and four ribosomal RNA (rRNA) genes. Comparative analysis of nucleotide variability identified nine intergenic spacer regions (psbA-matK, matK-rps16, trnN-trnT, trnY-trnD-psbM, petN-trnC, rbcL-psaI, petA-psbJ, psbE-petL, and rpl32-trnL), which may be used as potential DNA barcodes in future species identification and evolutionary analysis of Pennisetum. The phylogenetic analysis revealed a close relationship between P. polystachion and P. glaucum, followed by P. clandestinum and P. alopecuroides. The completed genomes of this study will help facilitate future research on the phylogenetic relationships and evolution of Pennisetum species.


2020 ◽  
Vol 11 ◽  
Author(s):  
Peninah Cheptoo Rono ◽  
Xiang Dong ◽  
Jia-Xin Yang ◽  
Fredrick Munyao Mutie ◽  
Millicent A. Oulo ◽  
...  

The genus Alchemilla L., known for its medicinal and ornamental value, is widely distributed in the Holarctic regions with a few species found in Asia and Africa. Delimitation of species within Alchemilla is difficult due to hybridization, autonomous apomixes, and polyploidy, necessitating efficient molecular-based characterization. Herein, we report the initial complete chloroplast (cp) genomes of Alchemilla. The cp genomes of two African (Afromilla) species Alchemilla pedata and Alchemilla argyrophylla were sequenced, and phylogenetic and comparative analyses were conducted in the family Rosaceae. The cp genomes mapped a typical circular quadripartite structure of lengths 152,438 and 152,427 base pairs (bp) in A. pedata and A. argyrophylla, respectively. Alchemilla cp genomes were composed of a pair of inverted repeat regions (IRa/IRb) of length 25,923 and 25,915 bp, separating the small single copy (SSC) region of 17,980 and 17,981 bp and a large single copy (LSC) region of 82,612 and 82,616 bp in A. pedata and A. argyrophylla, respectively. The cp genomes encoded 114 unique genes including 88 protein-coding genes, 37 transfer RNA (tRNA) genes, and 4 ribosomal RNA (rRNA) genes. Additionally, 88 and 95 simple sequence repeats (SSRs) and 37 and 40 tandem repeats were identified in A. pedata and A. argyrophylla, respectively. Significantly, the loss of group II intron in atpF gene in Alchemilla species was detected. Phylogenetic analysis based on 26 whole cp genome sequences and 78 protein-coding gene sequences of 27 Rosaceae species revealed a monophyletic clustering of Alchemilla nested within subfamily Rosoideae. Based on a protein-coding region, negative selective pressure (Ka/Ks < 1) was detected with an average Ka/Ks value of 0.1322 in A. argyrophylla and 0.1418 in A. pedata. The availability of complete cp genome in the genus Alchemilla will contribute to species delineation and further phylogenetic and evolutionary studies in the family Rosaceae.


Forests ◽  
2021 ◽  
Vol 12 (12) ◽  
pp. 1681
Author(s):  
Xue-Jiao Zhang ◽  
Kang-Jia Liu ◽  
Ya-Chao Wang ◽  
Jian He ◽  
Yuan-Mi Wu ◽  
...  

High genetic diversity and low differentiation present challenges in taxonomy and systematics of Salix. Chloroplast (cp) genome sequencing is efficient for providing new genomic information and elucidating phylogenetic relationships. Salix spathulifolia Seemen, S. cupularis Rehder, and S. annulifera C.Marquand & Airy Shaw are three shrubby willows spread in high-altitude regions in western China. In this study, the integrated circular cp genomes were sequenced and analyzed, and a phylogeny of Salix was constructed on the basis of the cp genomes. The results of chloroplast assembly and annotation information were used to characterize genome feature and interspecific variation. The phylogenetic position of the three willows was evaluated using phylogenetic analysis. Full-length cp genomes were 155,566–155,680 bp with a typical double-stranded circular quadripartite structure, containing one large single-copy region (LSC, 84,431–4552 bp), one small single-copy region (SSC: 16,206–16,221 bp), and two inverted repeats (IR: 27,453–27,461 bp). The cp genomes encoded 130 genes, including 8 rRNA genes, 37 tRNA genes, and 85 protein-coding genes. The guanine-cytosine (GC) content of the overall genome was 36.7%. Comparison among the three willows’ cp genomes revealed high similarity. Phylogenetic analysis indicated that S. spathulifolia was a basal taxon of clade I, while S. annulifera formed a monophyletic group with S. rorida Laksch.; S. cupularis was sister to S. suchowensis W.C. Cheng and S. psammophila Z. Wang & Chang Y. Yang. The complete chloroplast genomes of the three willows provides an additional sequence-based resource for studying the phylogeny and evolutionary history of Salicaceae.


2021 ◽  
Author(s):  
Mahtab Moghaddam ◽  
Atsushi Ohta ◽  
Motoki Shimizu ◽  
Ryohei Terauchi ◽  
Shahrokh Kazempour-Osaloo

Abstract Plastid genome sequences provide valuable markers for surveying the evolutionary relationships and population genetics of plant species. In the present study, the complete plastid genome of Onobrychis gaubae, endemic to Iran, was sequenced using Illumina paired-end sequencing and was compared with previously known genomes of the IRLC species of legumes. The O. gaubae plastid genome was 123,645 bp in length and included a large single-copy (LSC) region of 81,034 bp, a small single-copy (SSC) region of 13,788 bp and one copy of the inverted repeat (IRb) of 28,823 bp. The genome encoded 110 genes, including 76 protein-coding genes, 30 transfer RNA (tRNA) genes and four ribosome RNA (rRNA) genes and possessed 89 simple sequence repeats (SSRs) and 28 repeated structures with the highest proportion in the LSC. Comparative analysis of the chloroplast genomes across IRLC revealed three hotspot genes (ycf1, ycf2, clpP) which could be used as molecular markers for resolving phylogenetic relationships and species identification. IRLC plastid genomes also showed multiple gene losses and inversions. Phylogenetic analyses revealed that O. gaubae is closely related to Hedysarum. The complete O. gaubae genome is a valuable resource for investigating evolution of Onobrychis species and can be used to identify related species.


Plants ◽  
2021 ◽  
Vol 10 (6) ◽  
pp. 1229
Author(s):  
Dhafer A. Alzahrani ◽  
Enas J. Albokhari ◽  
Samaila S. Yaradua ◽  
Abidina Abba

This study presents for the first time the complete chloroplast genomes of four medicinal species in the Capparaceae family belonging to two different genera, Cadaba and Maerua (i.e., C. farinosa, C. glandulosa, M. crassifolia and M. oblongifolia), to investigate their evolutionary process and to infer their phylogenetic positions. The four species are considered important medicinal plants, and are used in the treatment of many diseases. In the genus Cadaba, the chloroplast genome ranges from 156,481 bp to 156,560 bp, while that of Maerua ranges from 155,685 bp to 155,436 bp. The chloroplast genome of C. farinosa, M. crassifolia and M. oblongifolia contains 138 genes, while that of C. glandulosa contains 137 genes, comprising 81 protein-coding genes, 31 tRNA genes and 4 rRNA genes. Out of the total genes, 116–117 are unique, while the remaining 19 are replicated in inverted repeat regions. The psbG gene, which encodes for subunit K of NADH dehydrogenase, is absent in C. glandulosa. A total of 249 microsatellites were found in the chloroplast genome of C. farinosa, 251 in C. glandulosa, 227 in M. crassifolia and 233 in M. oblongifolia, the majority of which are mononucleotides A/T found in the intergenic spacer. Comparative analysis revealed variable hotspot regions (atpF, rpoC2, rps19 and ycf1), which can be used as molecular markers for species authentication and as regions for inferring phylogenetic relationships among them, as well as for evolutionary studies. The monophyly of Capparaceae and other families under Brassicales, as well as the phylogenetic positions of the studied species, are highly supported by all the relationships in the phylogenetic tree. The cp genomes reported in this study will provide resources for studying the genetic diversity of Capparaceae, as well as resolving phylogenetic relationships within the family.


2020 ◽  
Author(s):  
Jingling Li ◽  
Jianmin Tang ◽  
Siyuan Zeng ◽  
Fang Han ◽  
Jing Yuan ◽  
...  

Abstract Background Pilea is a genus of perennial herbs from the family Urticaceae, which are used for courtyard ornamental. For some species, they are used as medicinal plants in traditional Chinese medicine as well. The morphological characteristics of medicinal species from Pilea are similar, and it is difficult to accurately distinguish them based only on morphological characteristics. Besides, the species classification of Pilea are still controversial. The classification of many species are still in an unresolved state. At present, there is no information about the chloroplast genomes of Pilea, which limits our further understanding of this genus. Here, we first reported 4 chloroplast genomes of Pilea taxa (P. mollis, P. glauca, P. peperomioides and P. serpyllacea), and performed comprehensive comparative analysis. Results The four chloroplast genomes have similar structural characteristics and gene order with other angiosperms. These genomes all have a typical quartile structure, which contains 113 unique genes, including 79 protein-coding genes, 4 rRNA genes, and 30 tRNA genes. Besides, we detected SSRs and repeat sequences, and analyzed the expansion/contraction of IR regions. In particular, the comparative analysis showed a rather level of sequence divergence in the non-coding regions, even in the protein-coding regions of the four genome sequences, suggesting a high level of genetic diversity in Pilea. Moreover, we identified eight hypervariable regions, including petN-psbM; psbZ-trnG-GCC; trnT-UGU-trnL-UAA; accD-psbI; ndhF-rpl32; rpl32-trnL-UAG; ndhA-intron and ycf1, are proposed for use as DNA barcode regions. Phylogenetic analysis showed that four Pilea species form a monophyletic cluster with a 100% bootstrap value. Conclusion The results obtained here could provide abundant information for the phylogenetic position of Pilea and further species identification. High levels of sequences divergence promote our understanding of the interspecific diversity of this genus, also provide reference for the rational classification of unsolved species in the future.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Bobby Lim-Ho Kong ◽  
Hyun-Seung Park ◽  
Tai-Wai David Lau ◽  
Zhixiu Lin ◽  
Tae-Jin Yang ◽  
...  

AbstractIlex is a monogeneric plant group (containing approximately 600 species) in the Aquifoliaceae family and one of the most commonly used medicinal herbs. However, its taxonomy and phylogenetic relationships at the species level are debatable. Herein, we obtained the complete chloroplast genomes of all 19 Ilex types that are native to Hong Kong. The genomes are conserved in structure, gene content and arrangement. The chloroplast genomes range in size from 157,119 bp in Ilex graciliflora to 158,020 bp in Ilex kwangtungensis. All these genomes contain 125 genes, of which 88 are protein-coding and 37 are tRNA genes. Four highly varied sequences (rps16-trnQ, rpl32-trnL, ndhD-psaC and ycf1) were found. The number of repeats in the Ilex genomes is mostly conserved, but the number of repeating motifs varies. The phylogenetic relationship among the 19 Ilex genomes, together with eight other available genomes in other studies, was investigated. Most of the species could be correctly assigned to the section or even series level, consistent with previous taxonomy, except Ilex rotunda var. microcarpa, Ilex asprella var. tapuensis and Ilex chapaensis. These species were reclassified; I. rotunda was placed in the section Micrococca, while the other two were grouped with the section Pseudoaquifolium. These studies provide a better understanding of Ilex phylogeny and refine its classification.


PeerJ ◽  
2020 ◽  
Vol 8 ◽  
pp. e8450 ◽  
Author(s):  
Sunan Huang ◽  
Xuejun Ge ◽  
Asunción Cano ◽  
Betty Gaby Millán Salazar ◽  
Yunfei Deng

The genus Dicliptera (Justicieae, Acanthaceae) consists of approximately 150 species distributed throughout the tropical and subtropical regions of the world. Newly obtained chloroplast genomes (cp genomes) are reported for five species of Dilciptera (D. acuminata, D. peruviana, D. montana, D. ruiziana and D. mucronata) in this study. These cp genomes have circular structures of 150,689–150,811 bp and exhibit quadripartite organizations made up of a large single copy region (LSC, 82,796–82,919 bp), a small single copy region (SSC, 17,084–17,092 bp), and a pair of inverted repeat regions (IRs, 25,401–25,408 bp). Guanine-Cytosine (GC) content makes up 37.9%–38.0% of the total content. The complete cp genomes contain 114 unique genes, including 80 protein-coding genes, 30 transfer RNA (tRNA) genes, and four ribosomal RNA (rRNA) genes. Comparative analyses of nucleotide variability (Pi) reveal the five most variable regions (trnY-GUA-trnE-UUC, trnG-GCC, psbZ-trnG-GCC, petN-psbM, and rps4-trnL-UUA), which may be used as molecular markers in future taxonomic identification and phylogenetic analyses of Dicliptera. A total of 55-58 simple sequence repeats (SSRs) and 229 long repeats were identified in the cp genomes of the five Dicliptera species. Phylogenetic analysis identified a close relationship between D. ruiziana and D. montana, followed by D. acuminata, D. peruviana, and D. mucronata. Evolutionary analysis of orthologous protein-coding genes within the family Acanthaceae revealed only one gene, ycf15, to be under positive selection, which may contribute to future studies of its adaptive evolution. The completed genomes are useful for future research on species identification, phylogenetic relationships, and the adaptive evolution of the Dicliptera species.


Molecules ◽  
2018 ◽  
Vol 23 (9) ◽  
pp. 2137 ◽  
Author(s):  
Xiang-Xiao Meng ◽  
Yan-Fang Xian ◽  
Li Xiang ◽  
Dong Zhang ◽  
Yu-Hua Shi ◽  
...  

The genus Sanguisorba, which contains about 30 species around the world and seven species in China, is the source of the medicinal plant Sanguisorba officinalis, which is commonly used as a hemostatic agent as well as to treat burns and scalds. Here we report the complete chloroplast (cp) genome sequences of four Sanguisorba species (S. officinalis, S. filiformis, S. stipulata, and S. tenuifolia var. alba). These four Sanguisorba cp genomes exhibit typical quadripartite and circular structures, and are 154,282 to 155,479 bp in length, consisting of large single-copy regions (LSC; 84,405–85,557 bp), small single-copy regions (SSC; 18,550–18,768 bp), and a pair of inverted repeats (IRs; 25,576–25,615 bp). The average GC content was ~37.24%. The four Sanguisorba cp genomes harbored 112 different genes arranged in the same order; these identical sections include 78 protein-coding genes, 30 tRNA genes, and four rRNA genes, if duplicated genes in IR regions are counted only once. A total of 39–53 long repeats and 79–91 simple sequence repeats (SSRs) were identified in the four Sanguisorba cp genomes, which provides opportunities for future studies of the population genetics of Sanguisorba medicinal plants. A phylogenetic analysis using the maximum parsimony (MP) method strongly supports a close relationship between S. officinalis and S. tenuifolia var. alba, followed by S. stipulata, and finally S. filiformis. The availability of these cp genomes provides valuable genetic information for future studies of Sanguisorba identification and provides insights into the evolution of the genus Sanguisorba.


Sign in / Sign up

Export Citation Format

Share Document