scholarly journals Abundant RNA editing sites of chloroplast protein-coding genes in Ginkgo biloba and an evolutionary pattern analysis

2016 ◽  
Vol 16 (1) ◽  
Author(s):  
Peng He ◽  
Sheng Huang ◽  
Guanghui Xiao ◽  
Yuzhou Zhang ◽  
Jianing Yu
BMC Genomics ◽  
2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Joonhyung Jung ◽  
Changkyun Kim ◽  
Joo-Hwan Kim

Abstract Background Commelinaceae (Commelinales) comprise 41 genera and are widely distributed in both the Old and New Worlds, except in Europe. The relationships among genera in this family have been suggested in several morphological and molecular studies. However, it is difficult to explain their relationships due to high morphological variations and low support values. Currently, many researchers have been using complete chloroplast genome data for inferring the evolution of land plants. In this study, we completed 15 new plastid genome sequences of subfamily Commelinoideae using the Mi-seq platform. We utilized genome data to reveal the structural variations and reconstruct the problematic positions of genera for the first time. Results All examined species of Commelinoideae have three pseudogenes (accD, rpoA, and ycf15), and the former two might be a synapomorphy within Commelinales. Only four species in tribe Commelineae presented IR expansion, which affected duplication of the rpl22 gene. We identified inversions that range from approximately 3 to 15 kb in four taxa (Amischotolype, Belosynapsis, Murdannia, and Streptolirion). The phylogenetic analysis using 77 chloroplast protein-coding genes with maximum parsimony, maximum likelihood, and Bayesian inference suggests that Palisota is most closely related to tribe Commelineae, supported by high support values. This result differs significantly from the current classification of Commelinaceae. Also, we resolved the unclear position of Streptoliriinae and the monophyly of Dichorisandrinae. Among the ten CDS (ndhH, rpoC2, ndhA, rps3, ndhG, ndhD, ccsA, ndhF, matK, and ycf1), which have high nucleotide diversity values (Pi > 0.045) and over 500 bp length, four CDS (ndhH, rpoC2, matK, and ycf1) show that they are congruent with the topology derived from 77 chloroplast protein-coding genes. Conclusions In this study, we provide detailed information on the 15 complete plastid genomes of Commelinoideae taxa. We identified characteristic pseudogenes and nucleotide diversity, which can be used to infer the family evolutionary history. Also, further research is needed to revise the position of Palisota in the current classification of Commelinaceae.


mBio ◽  
2015 ◽  
Vol 6 (6) ◽  
Author(s):  
Vojtěch David ◽  
Pavel Flegontov ◽  
Evgeny Gerasimov ◽  
Goro Tanifuji ◽  
Hassan Hashimi ◽  
...  

ABSTRACT Perkinsela is an enigmatic early-branching kinetoplastid protist that lives as an obligate endosymbiont inside Paramoeba (Amoebozoa). We have sequenced the highly reduced mitochondrial genome of Perkinsela, which possesses only six protein-coding genes (cox1, cox2, cox3, cob, atp6, and rps12), despite the fact that the organelle itself contains more DNA than is present in either the host or endosymbiont nuclear genomes. An in silico analysis of two Perkinsela strains showed that mitochondrial RNA editing and processing machineries typical of kinetoplastid flagellates are generally conserved, and all mitochondrial transcripts undergo U-insertion/deletion editing. Canonical kinetoplastid mitochondrial ribosomes are also present. We have developed software tools for accurate and exhaustive mapping of transcriptome sequencing (RNA-seq) reads with extensive U-insertions/deletions, which allows detailed investigation of RNA editing via deep sequencing. With these methods, we show that up to 50% of reads for a given edited region contain errors of the editing system or, less likely, correspond to alternatively edited transcripts. IMPORTANCE Uridine insertion/deletion-type RNA editing, which occurs in the mitochondrion of kinetoplastid protists, has been well-studied in the model parasite genera Trypanosoma, Leishmania, and Crithidia. Perkinsela provides a unique opportunity to broaden our knowledge of RNA editing machinery from an evolutionary perspective, as it represents the earliest kinetoplastid branch and is an obligatory endosymbiont with extensive reductive trends. Interestingly, up to 50% of mitochondrial transcripts in Perkinsela contain errors. Our study was complemented by use of newly developed software designed for accurate mapping of extensively edited RNA-seq reads obtained by deep sequencing.


BMC Genomics ◽  
2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Shujie Dong ◽  
Zhiqi Ying ◽  
Shuisheng Yu ◽  
Qirui Wang ◽  
Guanghui Liao ◽  
...  

Abstract Background The Stephania tetrandra S. Moore (S. tetrandra) is a medicinal plant belonging to the family Menispermaceae that has high medicinal value and is well worth doing further exploration. The wild resources of S. tetrandra were widely distributed in tropical and subtropical regions of China, generating potential genetic diversity and unique population structures. The geographical origin of S. tetrandra is an important factor influencing its quality and price in the market. In addition, the species relationship within Stephania genus still remains uncertain due to high morphological similarity and low support values of molecular analysis approach. The complete chloroplast (cp) genome data has become a promising strategy to determine geographical origin and understand species evolution for closely related plant species. Herein, we sequenced the complete cp genome of S. tetrandra from Zhejiang Province and conducted a comparative analysis within Stephania plants to reveal the structural variations, informative markers and phylogenetic relationship of Stephania species. Results The cp genome of S. tetrandra voucher ZJ was 157,725 bp, consisting of a large single copy region (89,468 bp), a small single copy region (19,685 bp) and a pair of inverted repeat regions (24,286 bp each). A total of 134 genes were identified in the cp genome of S. tetrandra, including 87 protein-coding genes, 8 rRNA genes, 37 tRNA genes and 2 pseudogene copies (ycf1 and rps19). The gene order and GC content were highly consistent in the Stephania species according to the comparative analysis results, with the highest RSCU value in arginine (1.79) and lowest RSCU value in serine of S. tetrandra, respectively. A total of 90 SSRs have been identified in the cp genome of S. tetrandra, where repeats that consisting of A or T bases were much higher than that of G or C bases. In addition, 92 potential RNA editing sites were identified in 25 protein-coding genes, with the most predicted RNA editing sites in ndhB gene. The variations on length and expansion extent to the junction of ycf1 gene were observed between S. tetrandra vouchers from different regions, indicating potential markers for further geographical origin discrimination. Moreover, the values of transition to transversion ratio (Ts/Tv) in the Stephania species were significantly higher than 1 using Pericampylus glaucus as reference. Comparative analysis of the Stephania cp genomes revealed 5 highly variable regions, including 3 intergenic regions (trnH-psbA, trnD-trnY, trnP) and two protein coding genes (rps16 and ndhA). The identified mutational hotspots of Stephania plants exhibited multiple SNP sites and Gaps, as well as different Ka/Ks ratio values. In addition, five pairs of specific primers targeting the divergence regions were accordingly designed, which could be utilized as potential molecular markers for species identification, population genetic and phylogenetic analysis in Stephania species. Phylogenetic tree analysis based on the conserved chloroplast protein coding genes indicated a sister relationship between S. tetrandra and the monophyletic group of S. japonica and S. kwangsiensis with high support values, suggesting a close genetic relationship within Stephania plants. However, two S. tetrandra vouches from different regions failed to cluster into one clade, confirming the occurrences of genetic diversities and requiring further investigation for geographical tracing strategy. Conclusions Overall, we provided comprehensive and detailed information on the complete chloroplast genome and identified nucleotide diversity hotspots of Stephania species. The obtained genetic resource of S. tetrandra from Zhejiang Province would facilitate future studies in DNA barcode, species discrimination, the intraspecific and interspecific variability and the phylogenetic relationships of Stephania plants.


Author(s):  
Luoyun Wang ◽  
Jing Wang ◽  
Caiyun He ◽  
Jianguo Zhang ◽  
Yanfei Zeng

Hippophae is a tree species with ecological, economic and social benefits. In this study, we assembled and annotated chloroplast genomes of sympatric Hippophae gyantsensis and H. rhamnoides subsp. yunnanensis. Their full-length are 155260 and 156415 bp, respectively. Each of them has 131 genes, comprising 85 protein-coding genes, 8 ribosomal RNA genes and 38 transfer RNA genes. After comparing the chloroplast genomes, we found 1302 base difference loci, and 63.29% are located in the intergenic region or intron sequences and 36.71% are located in the coding sequences. The SSC region has the highest mutation rate, followed by the LSC region; the IR regions have the lowest mutation rate. Among the protein-coding genes, three had a ratio of nonsynonymous to synonymous substitutions (Ka/Ks) >1 (but P values were non-significant) and 66 had Ka/Ks <1 (46 were significant). In general, the chloroplast protein-coding genes may be subject to purification selection. Among H. gyantsensis and H. rhamnoides subsp. yunnanensis chloroplast protein-coding genes, there are 20 and 16 optimal codons, respectively. Most of the optimal codons were ending with A or U, which indicates significant AT preference. It is an important reference for studies on the general characteristics and evolution of the Hippophae chloroplast genome.


2011 ◽  
Vol 53 (12) ◽  
pp. 961-970 ◽  
Author(s):  
Haiyan Chen ◽  
Likun Deng ◽  
Yuan Jiang ◽  
Ping Lu ◽  
Jianing Yu

2021 ◽  
Vol 43 (1) ◽  
Author(s):  
SHANSHAN DONG ◽  
YANG LIU

In contrast to the highly variable mitogenomes of vascular plants, the composition and architecture of mitogenomes within the three bryophyte lineages appear stable and invariant. Currently, complete mitogenomes are available from 113 bryophyte accessions of 71 genera and 28 orders. Liverworts and mosses hold a rich mitochondrial (mt) gene repertoire among land plants with 40–42 protein-coding genes, whereas hornworts maintain the smallest functional gene set among land plants, of only around two dozen protein-coding genes, with the majority of ribosomal genes pseudogenized and all cytochrome c maturase genes lost. The rRNA and tRNA genes are also conserved and rich in mosses and liverworts, whereas subject to patchy losses in hornworts. In contrast to the conserved gene set, intron content varies significantly with only one intron shared among the three bryophyte lineages. Bryophytes hold relatively compact mitogenomes with narrow size fluctuations. Among the three bryophyte lineages, intergenic spacers and repeat content are smallest in mosses, largest in hornworts, and intermediate in liverworts, mirroring their size differences and levels of structural dynamics among the three lineages. Mosses, with the least repeated sequences, show the most static genome structure; whereas hornworts, with a relatively large set of repeated sequences, experience 1–4 rearrangements; liverworts, with intermediate repeat levels, see only one structural variant that requires two inversions to gain collinearity with the mitogenome of other liverworts. Repeat sequences were evoked to explain the mt gene order rearrangements in hornwort and liverwort mitogenomes; with the latter also supported by sequencing read evidence, which suggests that the conserved mitogenome structure observed in bryophyte lineages might be shaped by low repeat recombination level, and/or along with the intensified nucleus’ surveillance. Mitochondrial RNA editing is abundant in hornworts, with medium frequency and high variation in liverwort species, and generally limited in mosses, reflecting the diversity of nuclear encoded PPR proteins that are functionally related to RNA editing processes.


Sign in / Sign up

Export Citation Format

Share Document