Complete plastid genome sequence ofPrimula sinensis(Primulaceae): structure comparison, sequence variation and evidence foraccDtransfer to nucleus

Species-rich genusPrimulaL. is a typical plant group with which to understand genetic variance between species in different levels of relationships. Chloroplast genome sequences are used to be the information resource for quantifying this difference and reconstructing evolutionary history. In this study, we reported the complete chloroplast genome sequence ofPrimula sinensisand compared it with other related species. This genome of chloroplast showed a typical circular quadripartite structure with 150,859 bp in sequence length consisting of 37.2% GC base. Two inverted repeated regions (25,535 bp) were separated by a large single-copy region (82,064 bp) and a small single-copy region (17,725 bp). The genome consists of 112 genes, including 78 protein-coding genes, 30 tRNA genes and four rRNA genes. Among them, seven coding genes, seven tRNA genes and four rRNA genes have two copies due to their locations in the IR regions. TheaccDandinfAgenes lacking intact open reading frames (ORF) were identified as pseudogenes. SSR and sequence variation analyses were also performed on the plastome ofPrimula sinensis, comparing with another available plastome ofP. poissonii. The four most variable regions,rpl36–rps8,rps16–trnQ,trnH–psbAandndhC–trnV, were identified. Phylogenetic relationship estimates using three sub-datasets extracted from a matrix of 57 protein-coding gene sequences showed the identical result that was consistent with previous studies. A transcript found fromP. sinensistranscriptome showed a high similarity to plastidaccDfunctional region and was identified as a putative plastid transit peptide at the N-terminal region. The result strongly suggested that plastidaccDhas been functionally transferred to the nucleus inP. sinensis.

Download Full-text

Chloroplast Genome Analysis of Two Medicinal Coelogyne spp. (Orchidaceae) Shed Light on the Genetic Information, Comparative Genomics, and Species Identification

Plants ◽

10.3390/plants9101332 ◽

2020 ◽

Vol 9 (10) ◽

pp. 1332

Author(s):

Kai Jiang ◽

Li-Yuan Miao ◽

Zheng-Wei Wang ◽

Zi-Yi Ni ◽

Chao Hu ◽

...

Keyword(s):

Chloroplast Genome ◽

Phylogenetic Trees ◽

Single Copy ◽

Rrna Genes ◽

Phylogenetic Position ◽

Trna Genes ◽

Protein Coding ◽

Variable Regions ◽

Sequencing Platform ◽

Close Relationship

Although the medicinal properties of Coelogyne spp. have been previously studied, there is little genomic information providing a valuable tool for the plant taxonomy, conservation, and utilization of this genus. This study used the next-generation MiSeq sequencing platform to characterize the chloroplast (cp) genomes of Coelogyne fimbriata and Coelogyne ovalis. The Maximum Likelihood (ML) and Bayesian (BI) methods were employed to confirm the phylogenetic position of two Coelogyne species based on the whole chloroplast genome sequences. Additionally, we developed eight new primers based on the two cp genomes’ medium variable regions and evaluated the transferability to another 16 Coelogyne species. We constructed phylogenetic trees including 18 Coelogyne species and four outgroup species using the chloroplast fragments with the ML method. Our results showed that the cp genomes of C. fimbriata and C. ovalis contained a small single-copy region (18,839 and 18,851 bp, respectively) and a large single-copy region (87,606 and 87,759 bp, respectively), separated by two same-length inverted-repeat regions (26,675 bp in C. fimbriata and 26,715 bp C. ovalis, respectively). They all contained 86 protein-coding genes, 38 tRNA genes, and eight rRNA genes, revealing strong structure and gene content similarities. The phylogenetic analysis indicated a close relationship between the genera Coelogyne and Pleione. The newly developed primers revealed good transferability among the Coelogyne taxa and provided enough variable sites to distinguish C. fimbriata and C. ovalis. The two complete cp genomes and the eight new primers of Coelogyne provide new genomic data for further studies on phylogenomics, population genetics, and evolutionary history of Coelogyne taxa.

Download Full-text

Comparative analysis of chloroplast genomes for five Dicliptera species (Acanthaceae): molecular structure, phylogenetic relationships, and adaptive evolution

PeerJ ◽

10.7717/peerj.8450 ◽

2020 ◽

Vol 8 ◽

pp. e8450 ◽

Cited By ~ 2

Author(s):

Sunan Huang ◽

Xuejun Ge ◽

Asunción Cano ◽

Betty Gaby Millán Salazar ◽

Yunfei Deng

Keyword(s):

Adaptive Evolution ◽

Phylogenetic Relationships ◽

Single Copy ◽

Rrna Genes ◽

Trna Genes ◽

Evolutionary Analysis ◽

Protein Coding ◽

Variable Regions ◽

Protein Coding Genes ◽

Chloroplast Genomes

The genus Dicliptera (Justicieae, Acanthaceae) consists of approximately 150 species distributed throughout the tropical and subtropical regions of the world. Newly obtained chloroplast genomes (cp genomes) are reported for five species of Dilciptera (D. acuminata, D. peruviana, D. montana, D. ruiziana and D. mucronata) in this study. These cp genomes have circular structures of 150,689–150,811 bp and exhibit quadripartite organizations made up of a large single copy region (LSC, 82,796–82,919 bp), a small single copy region (SSC, 17,084–17,092 bp), and a pair of inverted repeat regions (IRs, 25,401–25,408 bp). Guanine-Cytosine (GC) content makes up 37.9%–38.0% of the total content. The complete cp genomes contain 114 unique genes, including 80 protein-coding genes, 30 transfer RNA (tRNA) genes, and four ribosomal RNA (rRNA) genes. Comparative analyses of nucleotide variability (Pi) reveal the five most variable regions (trnY-GUA-trnE-UUC, trnG-GCC, psbZ-trnG-GCC, petN-psbM, and rps4-trnL-UUA), which may be used as molecular markers in future taxonomic identification and phylogenetic analyses of Dicliptera. A total of 55-58 simple sequence repeats (SSRs) and 229 long repeats were identified in the cp genomes of the five Dicliptera species. Phylogenetic analysis identified a close relationship between D. ruiziana and D. montana, followed by D. acuminata, D. peruviana, and D. mucronata. Evolutionary analysis of orthologous protein-coding genes within the family Acanthaceae revealed only one gene, ycf15, to be under positive selection, which may contribute to future studies of its adaptive evolution. The completed genomes are useful for future research on species identification, phylogenetic relationships, and the adaptive evolution of the Dicliptera species.

Download Full-text

Complete Chloroplast Genome of Argania spinosa: Structural Organization and Phylogenetic Relationships in Sapotaceae

Plants ◽

10.3390/plants9101354 ◽

2020 ◽

Vol 9 (10) ◽

pp. 1354

Author(s):

Slimane Khayi ◽

Fatima Gaboun ◽

Stacy Pirro ◽

Tatiana Tatusova ◽

Abdelhamid El Mousadik ◽

...

Keyword(s):

Chloroplast Genome ◽

Single Copy ◽

Rrna Genes ◽

Trna Genes ◽

Protein Coding ◽

Important Species ◽

Complete Chloroplast Genome ◽

Argania Spinosa ◽

Protein Coding Genes ◽

Cp Genome

Argania spinosa (Sapotaceae), an important endemic Moroccan oil tree, is a primary source of argan oil, which has numerous dietary and medicinal proprieties. The plant species occupies the mid-western part of Morocco and provides great environmental and socioeconomic benefits. The complete chloroplast (cp) genome of A. spinosa was sequenced, assembled, and analyzed in comparison with those of two Sapotaceae members. The A. spinosa cp genome is 158,848 bp long, with an average GC content of 36.8%. The cp genome exhibits a typical quadripartite and circular structure consisting of a pair of inverted regions (IR) of 25,945 bp in length separating small single-copy (SSC) and large single-copy (LSC) regions of 18,591 and 88,367 bp, respectively. The annotation of A. spinosa cp genome predicted 130 genes, including 85 protein-coding genes (CDS), 8 ribosomal RNA (rRNA) genes, and 37 transfer RNA (tRNA) genes. A total of 44 long repeats and 88 simple sequence repeats (SSR) divided into mononucleotides (76), dinucleotides (7), trinucleotides (3), tetranucleotides (1), and hexanucleotides (1) were identified in the A. spinosa cp genome. Phylogenetic analyses using the maximum likelihood (ML) method were performed based on 69 protein-coding genes from 11 species of Ericales. The results confirmed the close position of A. spinosa to the Sideroxylon genus, supporting the revisiting of its taxonomic status. The complete chloroplast genome sequence will be valuable for further studies on the conservation and breeding of this medicinally and culinary important species and also contribute to clarifying the phylogenetic position of the species within Sapotaceae.

Download Full-text

Determination of the evolutionary pressure onCamellia oleiferaon Hainan Island using the complete chloroplast genome sequence

PeerJ ◽

10.7717/peerj.7210 ◽

2019 ◽

Vol 7 ◽

pp. e7210 ◽

Cited By ~ 6

Author(s):

Wan Zhang ◽

Yunlin Zhao ◽

Guiyan Yang ◽

Jiao Peng ◽

Shuwen Chen ◽

...

Keyword(s):

Chloroplast Genome ◽

Genome Sequence ◽

Evolutionary History ◽

Evolutionary Relationship ◽

Hainan Island ◽

Single Copy ◽

Rrna Genes ◽

Trna Genes ◽

Phylogenetic Studies ◽

Chloroplast Genome Sequence

Camellia oleiferais one of the four largest woody edible oil plants in the world with high ecological and medicinal values. Due to frequent interspecific hybridization, it was difficult to study its genetics and evolutionary history. This study usedC. oleiferathat was collected on Hainan Island to conduct our research. The unique island environment makes the quality of tea oil higher than that of other species grown in the mainland. Moreover, a long-term geographic isolation might affect gene structure. In order to better understand the molecular biology of this species, protect excellent germplasm resources, and promote the population genetics and phylogenetic studies ofCamelliaplants, high-throughput sequencing technology was used to obtain the chloroplast genome sequence of HainanC. oleifera. The results showed that the whole chloroplast genome ofC. oleiferain Hainan was 156,995 bp in length, with a typical quadripartite structure of a large single copy (LSC) region of 86,648 bp, a small single copy (SSC) region of 18,297 bp, and a pair of inverted repeats (IRs) of 26,025 bp. The whole genome encoded a total of 141 genes (115 different genes), including 88 protein-coding genes, 45 tRNA genes, and eight rRNA genes. Among these genes, nine genes contained one intron, two genes contained two introns, and four overlapping genes were also detected. The total GC content of HainanC. oleifera’s chloroplast genome was 37.29%. The chloroplast genome structure characteristics of HainanC. oleiferawere compared with mainlandC. oleiferaand those of the other eight closely related Theaceae species; it was found that the contractions and expansions of the IR/LSC and IR/SSC regions affected the length of chloroplast genome. The chloroplast genome sequences of these Theaceae species were highly similar. A comparative analysis indicated that the Theaceae species were conserved in structure and evolution. A total of 51 simple sequence repeat (SSR) loci were detected in the chloroplast genome of HainanC. oleifera, and allCamelliaplants did not have pentanucleotide repeats, which could be used as a good marker in phylogenetic studies. We also detected seven long repeats, the base composition of all repeats was biased toward A/T, which was consistent with the codon bias. It was found that HainanC. oleiferahad a similar evolutionary relationship withC. crapnelliana, through the use of codons and phylogenetic analysis. This study can provide an effective genomic resource for the evolutionary history of Theaceae family.

Download Full-text

Analyzing and Characterizing the Chloroplast Genome of Salix wilsonii

BioMed Research International ◽

10.1155/2019/5190425 ◽

2019 ◽

Vol 2019 ◽

pp. 1-14 ◽

Cited By ~ 3

Author(s):

Yingnan Chen ◽

Nan Hu ◽

Huaitong Wu

Keyword(s):

Chloroplast Genome ◽

Tandem Repeats ◽

Single Copy ◽

Rrna Genes ◽

Whole Genome Sequencing Data ◽

Trna Genes ◽

Sequencing Data ◽

Protein Coding ◽

Organelle Genomes ◽

Additional Sequence

Salix wilsonii is an important ornamental willow tree widely distributed in China. In this study, an integrated circular chloroplast genome was reconstructed for S. wilsonii based on the chloroplast reads screened from the whole-genome sequencing data generated with the PacBio RSII platform. The obtained pseudomolecule was 155,750 bp long and had a typical quadripartite structure, comprising a large single copy region (LSC, 84,638 bp) and a small single copy region (SSC, 16,282 bp) separated by two inverted repeat regions (IR, 27,415 bp). The S. wilsonii chloroplast genome encoded 115 unique genes, including four rRNA genes, 30 tRNA genes, 78 protein-coding genes, and three pseudogenes. Repetitive sequence analysis identified 32 tandem repeats, 22 forward repeats, two reverse repeats, and five palindromic repeats. Additionally, a total of 118 perfect microsatellites were detected, with mononucleotide repeats being the most common (89.83%). By comparing the S. wilsonii chloroplast genome with those of other rosid plant species, significant contractions or expansions were identified at the IR-LSC/SSC borders. Phylogenetic analysis of 17 willow species confirmed that S. wilsonii was most closely related to S. chaenomeloides and revealed the monophyly of the genus Salix. The complete S. wilsonii chloroplast genome provides an additional sequence-based resource for studying the evolution of organelle genomes in woody plants.

Download Full-text

Comparative Analyses of Euonymus Chloroplast Genomes: Genetic Structure, Screening for Loci With Suitable Polymorphism, Positive Selection Genes, and Phylogenetic Relationships Within Celastrineae

Frontiers in Plant Science ◽

10.3389/fpls.2020.593984 ◽

2021 ◽

Vol 11 ◽

Author(s):

Yongtan Li ◽

Yan Dong ◽

Yichao Liu ◽

Xiaoyue Yu ◽

Minsheng Yang ◽

...

Keyword(s):

Positive Selection ◽

Chloroplast Genome ◽

Gc Content ◽

Single Copy ◽

Rrna Genes ◽

Evolutionary Relationships ◽

Trna Genes ◽

Protein Coding ◽

Chloroplast Genomes ◽

Cp Genome

In this study, we assembled and annotated the chloroplast (cp) genome of the Euonymus species Euonymus fortunei, Euonymus phellomanus, and Euonymus maackii, and performed a series of analyses to investigate gene structure, GC content, sequence alignment, and nucleic acid diversity, with the objectives of identifying positive selection genes and understanding evolutionary relationships. The results indicated that the Euonymus cp genome was 156,860–157,611bp in length and exhibited a typical circular tetrad structure. Similar to the majority of angiosperm chloroplast genomes, the results yielded a large single-copy region (LSC) (85,826–86,299bp) and a small single-copy region (SSC) (18,319–18,536bp), separated by a pair of sequences (IRA and IRB; 26,341–26,700bp) with the same encoding but in opposite directions. The chloroplast genome was annotated to 130–131 genes, including 85–86 protein coding genes, 37 tRNA genes, and eight rRNA genes, with GC contents of 37.26–37.31%. The GC content was variable among regions and was highest in the inverted repeat (IR) region. The IR boundary of Euonymus happened expanding resulting that the rps19 entered into IR region and doubled completely. Such fluctuations at the border positions might be helpful in determining evolutionary relationships among Euonymus. The simple-sequence repeats (SSRs) of Euonymus species were composed primarily of single nucleotides (A)n and (T)n, and were mostly 10–12bp in length, with an obvious A/T bias. We identified several loci with suitable polymorphism with the potential use as molecular markers for inferring the phylogeny within the genus Euonymus. Signatures of positive selection were seen in rpoB protein encoding genes. Based on data from the whole chloroplast genome, common single copy genes, and the LSC, SSC, and IR regions, we constructed an evolutionary tree of Euonymus and related species, the results of which were consistent with traditional taxonomic classifications. It showed that E. fortunei sister to the Euonymus japonicus, whereby E. maackii appeared as sister to Euonymus hamiltonianus. Our study provides important genetic information to support further investigations into the phylogenetic development and adaptive evolution of Euonymus species.

Download Full-text

The Chloroplast Phylogenomics and Systematics of Zoysia (Poaceae)

Plants ◽

10.3390/plants10081517 ◽

2021 ◽

Vol 10 (8) ◽

pp. 1517

Author(s):

Se-Hwan Cheon ◽

Min-Ah Woo ◽

Sangjin Jo ◽

Young-Kee Kim ◽

Ki-Joong Kim

Keyword(s):

Northeast Asia ◽

Single Copy ◽

Rrna Genes ◽

Bootstrap Support ◽

Trna Genes ◽

Protein Coding ◽

Tropical Regions ◽

Relationship Of ◽

The Relationship ◽

Simple Sequence

The genus Zoysia Willd. (Chloridoideae) is widely distributed from the temperate regions of Northeast Asia—including China, Japan, and Korea—to the tropical regions of Southeast Asia. Among these, four species—Zoysia japonica Steud., Zoysia sinica Hance, Zoysia tenuifolia Thiele, and Zoysia macrostachya Franch. & Sav.—are naturally distributed in the Korean Peninsula. In this study, we report the complete plastome sequences of these Korean Zoysia species (NCBI acc. nos. MF953592, MF967579~MF967581). The length of Zoysia plastomes ranges from 135,854 to 135,904 bp, and the plastomes have a typical quadripartite structure, which consists of a pair of inverted repeat regions (20,962~20,966 bp) separated by a large (81,348~81,392 bp) and a small (12,582~12,586 bp) single-copy region. In terms of gene order and structure, Zoysia plastomes are similar to the typical plastomes of Poaceae. The plastomes encode 110 genes, of which 76 are protein-coding genes, 30 are tRNA genes, and four are rRNA genes. Fourteen genes contain single introns and one gene has two introns. Three evolutionary hotspot spacer regions—atpB~rbcL, rps16~rps3, and rpl32~trnL-UAG—were recognized among six analyzed Zoysia species. The high divergences in the atpB~rbcL spacer and rpl16~rpl3 region are primarily due to the differences in base substitutions and indels. In contrast, the high divergence between rpl32~trnL-UAG spacers is due to a small inversion with a pair of 22 bp stem and an 11 bp loop. Simple sequence repeats (SSRs) were identified in 59 different locations in Z. japonica, 63 in Z. sinica, 62 in Z. macrostachya, and 63 in Z. tenuifolia plastomes. Phylogenetic analysis showed that the Zoysia (Zoysiinae) forms a monophyletic group, which is sister to Sporobolus (Sporobolinae), with 100% bootstrap support. Within the Zoysia clade, the relationship of (Z. sinica, Z japonica), (Z. tenuifolia, Z. matrella), (Z. macrostachya, Z. macrantha) was suggested.

Download Full-text

Complete Genome Sequence of Cupriavidus necator H16 (DSM 428)

Microbiology Resource Announcements ◽

10.1128/mra.00814-19 ◽

2019 ◽

Vol 8 (37) ◽

Cited By ~ 2

Author(s):

Gareth T. Little ◽

Muhammad Ehsaan ◽

Christian Arenas-López ◽

Kamran Jawed ◽

Klaus Winzer ◽

...

Keyword(s):

Genome Sequence ◽

Complete Genome Sequence ◽

Complete Genome ◽

Cupriavidus Necator ◽

Rrna Genes ◽

Trna Genes ◽

Protein Coding ◽

Content Type ◽

Protein Coding Genes ◽

Cupriavidus Necator H16

The hydrogen-utilizing strain Cupriavidus necator H16 (DSM 428) was sequenced using a combination of PacBio and Illumina sequencing. Annotation of this strain reveals 6,543 protein-coding genes, 263 pseudogenes, 64 tRNA genes, and 15 rRNA genes.

Download Full-text

Comparative genome analysis revealed gene inversions, boundary expansion and contraction, and gene loss in Stemona sessilifolia (Miq.) Miq. chloroplast genome

10.1101/2021.02.15.431246 ◽

2021 ◽

Author(s):

Jingting Liu ◽

Mei Jiang ◽

Haimei Chen ◽

Yu Liu ◽

Chang Liu ◽

...

Keyword(s):

Chloroplast Genome ◽

Gene Loss ◽

Herbal Medicines ◽

Single Copy ◽

Asparagus Officinalis ◽

Accurate Identification ◽

Protein Coding ◽

Variable Regions ◽

Complete Chloroplast Genome ◽

Next Generation Sequencing Technology

AbstractStemona sessilifolia (Miq.) Miq., commonly known as Baibu, is one of the most popular herbal medicines in Asia. In Chinese Pharmacopoeia, Baibu has multiple authentic sources, and there are many homonym herbs sold as Baibu in the herbal medicine market. The existence of the counterfeits of Baibu brings challenges to its identification. To assist the accurate identification of Baibu, we sequenced and analyzed the complete chloroplast genome of Stemona sessilifolia using next-generation sequencing technology. The genome was 154,039 bp in length, possessing a typical quadripartite structure consisting of a pair of inverted repeats (IRs: 27,094 bp) separating by a large single copy (LSC: 81,950 bp) and a small single copy (SSC: 17,901 bp). A total of 112 unique genes were identified, including 80 protein-coding, 28 transfer RNA, and four ribosomal RNA genes. Besides, 45 tandem, 27 forward, 23 palindromic, and 72 simple sequence repeats were detected in the genome by repeat analysis. Compared with its counterfeits (Asparagus officinalis and Carludovica palmate), we found that IR expansion and SSC contraction events of Stemona sessilifolia resulted in two copies of the rpl22 gene in the IR regions and partial duplication of the ndhF gene in the SSC region. Secondly, an approximately 3-kb-long inversion was identified in the LSC region, leading to the petA and cemA gene presented in the complementary strand of the chloroplast DNA molecule. Comparative analysis revealed some highly variable regions, including trnF-GAA_ndhJ, atpB_rbcL, rps15_ycf1, trnG-UCC_trnR-UCU, ndhF_rpl32. Finally, gene loss events were investigated in the context of phylogenetic relationships. In summary, the complete plastome of Stemona sessilifolia will provide valuable information for the molecular identification of Baibu and assist in elucidating the evolution of Stemona sessilifolia.

Download Full-text

Complete Genome Sequence of Thiohalobacter sp. Strain COW1, Isolated from Activated Sludge Treating Coke Oven Wastewater

Microbiology Resource Announcements ◽

10.1128/mra.00013-21 ◽

2021 ◽

Vol 10 (7) ◽

Author(s):

Kentaro Miyazaki ◽

Hikaru Suenaga ◽

Mamoru Oshiki ◽

Shuichi Kawano ◽

Toshikazu Fukushima

Keyword(s):

Activated Sludge ◽

Genome Sequence ◽

Complete Genome Sequence ◽

Complete Genome ◽

Coke Oven ◽

Rrna Genes ◽

Trna Genes ◽

Circular Chromosome ◽

Protein Coding ◽

Degrading Bacterium

ABSTRACT A thiocyanate-degrading bacterium, Thiohalobacter sp. strain COW1, was isolated from activated sludge treating coke oven wastewater, and the complete genome sequence was determined. COW1 contained a single circular chromosome (3.23 Mb; G+C content, 63.4%) in which 2,788 protein-coding genes, 39 tRNA genes, and 3 rRNA genes were identified.

Download Full-text