scholarly journals Two-Step Contractions of Inverted Repeat Region and Psai Gene Duplication from the Plastome of Croton Tiglium (Euphorbiaceae)

Author(s):  
Sangjin Jo ◽  
Ki-Joong Kim

Croton L. (Euphorbiaceae) is a very specious genus and consists of about 1,250 species, mainly distributed in tropical Asia and China. The first complete plastome sequence from the genus, Croton tiglium, is reported in this study (NCBI acc. No. MH394334). The plastome is 150,021 bp in length. The lengths of LSC and SSC are 111,654 bp and 18,167 bp, respectively. However, the length of the IR region is only 10,100 bp and includes only four rrn and four trn genes, and a small part of the ycf1 gene. We propose two-step IR contractions to explain this unique IR region of the C. tiglium plastome. First, the IR contracted from rps19-rpl2 to ycf2-trnL-CAA on the LSC/IRb boundary. Second, the IR contracted from ycf2-trnL-CAA to rrn16-trnV-GAC on the LSC/IRa boundary. In addition, duplicated copies of psaI genes were discovered in the C. tiglium plastome. Both copies were located side by side between accD and ycf4 genes, but one copy was pseudogenized because of a five-basepair (TAGCT) insertion in the middle of the gene following frameshift mutation. The plastome contains 112 genes, of which 78 are protein-coding genes, 30 are tRNA genes, and four are rRNA genes. Sixteen genes contain one intron and two genes have two introns. The infA gene is lost. Twelve large repeats were detected in the plastome. All large repeats are located in the LSC region. Also, 272 simple sequence repeats (SSRs) were identified. The penta-SSRs accounted for 45% of total SSRs, followed by mono- (32%), di- (12%), tetra (6%) and tri-SSRs (5%). Most of them were distributed in the large single copy (LSC) region (85%). In addition, 76% of the SSRs were located in the intergenic spacer (IGS). Phylogenetic analysis suggested that C. tiglium is a sister group of Jatropha curcas with 100% bootstrap support. Seven Euphorbiaceae species formed one clade with 100% bootstrap support.

Author(s):  
Sangjin Jo ◽  
Ki-Joong Kim

Croton L. (Euphorbiaceae) is a very specious genus and consists of about 1,250 species, mainly distributed in tropical Asia and China. The first complete plastome sequence from the genus, Croton tiglium, is reported in this study (NCBI acc. No. MH394334). The plastome is 150,021 bp in length. The lengths of LSC and SSC are 111,654 bp and 18,167 bp, respectively. However, the length of the IR region is only 10,100 bp and includes only four rrn and four trn genes, and a small part of the ycf1 gene. We propose two-step IR contractions to explain this unique IR region of the C. tiglium plastome. First, the IR contracted from rps19-rpl2 to ycf2-trnL-CAA on the LSC/IRb boundary. Second, the IR contracted from ycf2-trnL-CAA to rrn16-trnV-GAC on the LSC/IRa boundary. In addition, duplicated copies of psaI genes were discovered in the C. tiglium plastome. Both copies were located side by side between accD and ycf4 genes, but one copy was pseudogenized because of a five-basepair (TAGCT) insertion in the middle of the gene following frameshift mutation. The plastome contains 112 genes, of which 78 are protein-coding genes, 30 are tRNA genes, and four are rRNA genes. Sixteen genes contain one intron and two genes have two introns. The infA gene is lost. Twelve large repeats were detected in the plastome. All large repeats are located in the LSC region. Also, 272 simple sequence repeats (SSRs) were identified. The penta-SSRs accounted for 45% of total SSRs, followed by mono- (32%), di- (12%), tetra (6%) and tri-SSRs (5%). Most of them were distributed in the large single copy (LSC) region (85%). In addition, 76% of the SSRs were located in the intergenic spacer (IGS). Phylogenetic analysis suggested that C. tiglium is a sister group of Jatropha curcas with 100% bootstrap support. Seven Euphorbiaceae species formed one clade with 100% bootstrap support.


Plants ◽  
2021 ◽  
Vol 10 (8) ◽  
pp. 1517
Author(s):  
Se-Hwan Cheon ◽  
Min-Ah Woo ◽  
Sangjin Jo ◽  
Young-Kee Kim ◽  
Ki-Joong Kim

The genus Zoysia Willd. (Chloridoideae) is widely distributed from the temperate regions of Northeast Asia—including China, Japan, and Korea—to the tropical regions of Southeast Asia. Among these, four species—Zoysia japonica Steud., Zoysia sinica Hance, Zoysia tenuifolia Thiele, and Zoysia macrostachya Franch. & Sav.—are naturally distributed in the Korean Peninsula. In this study, we report the complete plastome sequences of these Korean Zoysia species (NCBI acc. nos. MF953592, MF967579~MF967581). The length of Zoysia plastomes ranges from 135,854 to 135,904 bp, and the plastomes have a typical quadripartite structure, which consists of a pair of inverted repeat regions (20,962~20,966 bp) separated by a large (81,348~81,392 bp) and a small (12,582~12,586 bp) single-copy region. In terms of gene order and structure, Zoysia plastomes are similar to the typical plastomes of Poaceae. The plastomes encode 110 genes, of which 76 are protein-coding genes, 30 are tRNA genes, and four are rRNA genes. Fourteen genes contain single introns and one gene has two introns. Three evolutionary hotspot spacer regions—atpB~rbcL, rps16~rps3, and rpl32~trnL-UAG—were recognized among six analyzed Zoysia species. The high divergences in the atpB~rbcL spacer and rpl16~rpl3 region are primarily due to the differences in base substitutions and indels. In contrast, the high divergence between rpl32~trnL-UAG spacers is due to a small inversion with a pair of 22 bp stem and an 11 bp loop. Simple sequence repeats (SSRs) were identified in 59 different locations in Z. japonica, 63 in Z. sinica, 62 in Z. macrostachya, and 63 in Z. tenuifolia plastomes. Phylogenetic analysis showed that the Zoysia (Zoysiinae) forms a monophyletic group, which is sister to Sporobolus (Sporobolinae), with 100% bootstrap support. Within the Zoysia clade, the relationship of (Z. sinica, Z japonica), (Z. tenuifolia, Z. matrella), (Z. macrostachya, Z. macrantha) was suggested.


PeerJ ◽  
2020 ◽  
Vol 8 ◽  
pp. e8450 ◽  
Author(s):  
Sunan Huang ◽  
Xuejun Ge ◽  
Asunción Cano ◽  
Betty Gaby Millán Salazar ◽  
Yunfei Deng

The genus Dicliptera (Justicieae, Acanthaceae) consists of approximately 150 species distributed throughout the tropical and subtropical regions of the world. Newly obtained chloroplast genomes (cp genomes) are reported for five species of Dilciptera (D. acuminata, D. peruviana, D. montana, D. ruiziana and D. mucronata) in this study. These cp genomes have circular structures of 150,689–150,811 bp and exhibit quadripartite organizations made up of a large single copy region (LSC, 82,796–82,919 bp), a small single copy region (SSC, 17,084–17,092 bp), and a pair of inverted repeat regions (IRs, 25,401–25,408 bp). Guanine-Cytosine (GC) content makes up 37.9%–38.0% of the total content. The complete cp genomes contain 114 unique genes, including 80 protein-coding genes, 30 transfer RNA (tRNA) genes, and four ribosomal RNA (rRNA) genes. Comparative analyses of nucleotide variability (Pi) reveal the five most variable regions (trnY-GUA-trnE-UUC, trnG-GCC, psbZ-trnG-GCC, petN-psbM, and rps4-trnL-UUA), which may be used as molecular markers in future taxonomic identification and phylogenetic analyses of Dicliptera. A total of 55-58 simple sequence repeats (SSRs) and 229 long repeats were identified in the cp genomes of the five Dicliptera species. Phylogenetic analysis identified a close relationship between D. ruiziana and D. montana, followed by D. acuminata, D. peruviana, and D. mucronata. Evolutionary analysis of orthologous protein-coding genes within the family Acanthaceae revealed only one gene, ycf15, to be under positive selection, which may contribute to future studies of its adaptive evolution. The completed genomes are useful for future research on species identification, phylogenetic relationships, and the adaptive evolution of the Dicliptera species.


Molecules ◽  
2018 ◽  
Vol 23 (9) ◽  
pp. 2137 ◽  
Author(s):  
Xiang-Xiao Meng ◽  
Yan-Fang Xian ◽  
Li Xiang ◽  
Dong Zhang ◽  
Yu-Hua Shi ◽  
...  

The genus Sanguisorba, which contains about 30 species around the world and seven species in China, is the source of the medicinal plant Sanguisorba officinalis, which is commonly used as a hemostatic agent as well as to treat burns and scalds. Here we report the complete chloroplast (cp) genome sequences of four Sanguisorba species (S. officinalis, S. filiformis, S. stipulata, and S. tenuifolia var. alba). These four Sanguisorba cp genomes exhibit typical quadripartite and circular structures, and are 154,282 to 155,479 bp in length, consisting of large single-copy regions (LSC; 84,405–85,557 bp), small single-copy regions (SSC; 18,550–18,768 bp), and a pair of inverted repeats (IRs; 25,576–25,615 bp). The average GC content was ~37.24%. The four Sanguisorba cp genomes harbored 112 different genes arranged in the same order; these identical sections include 78 protein-coding genes, 30 tRNA genes, and four rRNA genes, if duplicated genes in IR regions are counted only once. A total of 39–53 long repeats and 79–91 simple sequence repeats (SSRs) were identified in the four Sanguisorba cp genomes, which provides opportunities for future studies of the population genetics of Sanguisorba medicinal plants. A phylogenetic analysis using the maximum parsimony (MP) method strongly supports a close relationship between S. officinalis and S. tenuifolia var. alba, followed by S. stipulata, and finally S. filiformis. The availability of these cp genomes provides valuable genetic information for future studies of Sanguisorba identification and provides insights into the evolution of the genus Sanguisorba.


2021 ◽  
Vol 46 (1) ◽  
pp. 162-174
Author(s):  
Ming-Hui Yan ◽  
Chun-Yang Li ◽  
Peter W. Fritsch ◽  
Jie Cai ◽  
Heng-Chang Wang

Abstract—The phylogenetic relationships among 11 out of the 12 genera of the angiosperm family Styracaceae have been largely resolved with DNA sequence data based on all protein-coding genes of the plastome. The only genus that has not been phylogenomically investigated in the family with molecular data is the monotypic genus Parastyrax, which is extremely rare in the wild and difficult to collect. To complete the sampling of the genera comprising the Styracaceae, examine the plastome composition of Parastyrax, and further explore the phylogenetic relationships of the entire family, we sequenced the whole plastome of P. lacei and incorporated it into the Styracaceae dataset for phylogenetic analysis. Similar to most others in the family, the plastome is 158189 bp in length and contains a large single-copy region of 88085 bp and a small single-copy region of 18540 bp separated by two inverted-repeat regions of 25781 bp each. A total of 113 genes was predicted, including 79 protein-coding genes, 30 tRNA genes, and four rRNA genes. Phylogenetic relationships among all 12 genera of the family were constructed with 79 protein-coding genes. Consistent with a previous study, Styrax, Huodendron, and a clade of Alniphyllum + Bruinsmia were successively sister to the remainder of the family. Parastyrax was strongly supported as sister to an internal clade comprising seven other genera of the family, whereas Halesia and Pterostyrax were both recovered as polyphyletic, as in prior studies. However, when we employed either the whole plastome or the large- or small-single copy regions as datasets, Pterostyrax was resolved as monophyletic with 100% support, consistent with expectations based on morphology and indicating that non-coding regions of the Styracaceae plastome contain informative phylogenetic signal. Conversely Halesia was still resolved as polyphyletic but with novel strong support.


2020 ◽  
Vol 11 ◽  
Author(s):  
Inkyu Park ◽  
Sungyu Yang ◽  
Jun-Ho Song ◽  
Byeong Cheol Moon

The genera Arnebia and Lithospermum (Lithospermeae-Boraginaceae) comprise 25–30 and 50–60 species, respectively. Some of them are economically valuable, as their roots frequently contain a purple-red dye used in the cosmetic industry. Furthermore, dried roots of Arnebia euchroma, A. guttata, and Lithospermum erythrorhizon, which have been designated Lithospermi Radix, are used as traditional Korean herbal medicine. This study is the first report on the floral micromorphology and complete chloroplast (cp) genome sequences of A. guttata (including A. tibetana), A. euchroma, and L. erythrorhizon. We reveal great diversity in floral epidermal cell patterns, gynoecium, and structure of trichomes. The cp genomes were 149,361–150,465 bp in length, with conserved quadripartite structures. In total, 112 genes were identified, including 78 protein-coding regions, 30 tRNA genes, and four rRNA genes. Gene order, content, and orientation were highly conserved and were consistent with the general structure of angiosperm cp genomes. Comparison of the four cp genomes revealed locally divergent regions, mainly within intergenic spacer regions (atpH-atpI, petN-psbM, rbcL-psaI, ycf4-cemA, ndhF-rpl32, and ndhC-trnV-UAC). To facilitate species identification, we developed molecular markers psaA- ycf3 (PSY), trnI-CAU- ycf2 (TCY), and ndhC-trnV-UAC (NCTV) based on divergence hotspots. High-resolution phylogenetic analysis revealed clear clustering and a close relationship of Arnebia to its Lithospermum sister group, which was supported by strong bootstrap values and posterior probabilities. Overall, gynoecium characteristics and genetic distance of cp genomes suggest that A. tibetana, might be recognized as an independent species rather than a synonym of A. guttata. The present morphological and cp genomic results provide useful information for future studies, such as taxonomic, phylogenetic, and evolutionary analysis of Boraginaceae.


Plants ◽  
2020 ◽  
Vol 9 (10) ◽  
pp. 1354
Author(s):  
Slimane Khayi ◽  
Fatima Gaboun ◽  
Stacy Pirro ◽  
Tatiana Tatusova ◽  
Abdelhamid El Mousadik ◽  
...  

Argania spinosa (Sapotaceae), an important endemic Moroccan oil tree, is a primary source of argan oil, which has numerous dietary and medicinal proprieties. The plant species occupies the mid-western part of Morocco and provides great environmental and socioeconomic benefits. The complete chloroplast (cp) genome of A. spinosa was sequenced, assembled, and analyzed in comparison with those of two Sapotaceae members. The A. spinosa cp genome is 158,848 bp long, with an average GC content of 36.8%. The cp genome exhibits a typical quadripartite and circular structure consisting of a pair of inverted regions (IR) of 25,945 bp in length separating small single-copy (SSC) and large single-copy (LSC) regions of 18,591 and 88,367 bp, respectively. The annotation of A. spinosa cp genome predicted 130 genes, including 85 protein-coding genes (CDS), 8 ribosomal RNA (rRNA) genes, and 37 transfer RNA (tRNA) genes. A total of 44 long repeats and 88 simple sequence repeats (SSR) divided into mononucleotides (76), dinucleotides (7), trinucleotides (3), tetranucleotides (1), and hexanucleotides (1) were identified in the A. spinosa cp genome. Phylogenetic analyses using the maximum likelihood (ML) method were performed based on 69 protein-coding genes from 11 species of Ericales. The results confirmed the close position of A. spinosa to the Sideroxylon genus, supporting the revisiting of its taxonomic status. The complete chloroplast genome sequence will be valuable for further studies on the conservation and breeding of this medicinally and culinary important species and also contribute to clarifying the phylogenetic position of the species within Sapotaceae.


Author(s):  
Wojciech Pląder ◽  
Yasushi Yukawa ◽  
Masahiro Sugiura ◽  
Stefan Malepszy

AbstractThe complete nucleotide sequence of the cucumber (C. sativus L. var. Borszczagowski) chloroplast genome has been determined. The genome is composed of 155,293 bp containing a pair of inverted repeats of 25,191 bp, which are separated by two single-copy regions, a small 18,222-bp one and a large 86,688-bp one. The chloroplast genome of cucumber contains 130 known genes, including 89 protein-coding genes, 8 ribosomal RNA genes (4 rRNA species), and 37 tRNA genes (30 tRNA species), with 18 of them located in the inverted repeat region. Of these genes, 16 contain one intron, and two genes and one ycf contain 2 introns. Twenty-one small inversions that form stem-loop structures, ranging from 18 to 49 bp, have been identified. Eight of them show similarity to those of other species, while eight seem to be cucumber specific. Detailed comparisons of ycf2 and ycf15, and the overall structure to other chloroplast genomes were performed.


ZooKeys ◽  
2020 ◽  
Vol 925 ◽  
pp. 73-88
Author(s):  
Chaoyi Hu ◽  
Shuaibin Wang ◽  
Bisheng Huang ◽  
Hegang Liu ◽  
Lei Xu ◽  
...  

Scolopendra mutilans L. Koch, 1878 is an important Chinese animal with thousands of years of medicinal history. However, the genomic information of this species is limited, which hinders its further application. Here, the complete mitochondrial genome (mitogenome) of S. mutilans was sequenced and assembled by next-generation sequencing. The genome is 15,011 bp in length, consisting of 13 protein-coding genes (PCGs), 14 tRNA genes, and two rRNA genes. Most PCGs start with the ATN initiation codon, and all PCGs have the conventional stop codons TAA and TAG. The S. mutilans mitogenome revealed nine simple sequence repeats (SSRs), and an obviously lower GC content compared with other seven centipede mitogenomes previously sequenced. After analysis of homologous regions between the eight centipede mitogenomes, the S. mutilans mitogenome further showed clear genomic rearrangements. The phylogenetic analysis of eight centipedes using 13 conserved PCG genes was finally performed. The phylogenetic reconstructions showed Scutigeromorpha as a separate group, and Scolopendromorpha in a sister-group relationship with Lithobiomorpha and Geophilomorpha. Collectively, the S. mutilans mitogenome provided new genomic resources, which will improve its medicinal research and applications in the future.


ZooKeys ◽  
2021 ◽  
Vol 1070 ◽  
pp. 13-30
Author(s):  
Wanqing Zhao ◽  
Dajun Liu ◽  
Qian Jia ◽  
Xin Wu ◽  
Hufang Zhang

Mitochondrial genomes (mitogenomes) are widely used in research studies on phylogenetic relationships and evolutionary history. Here, we sequenced and analyzed the mitogenome of the scentless plant bug Myrmus lateralis Hsiao, 1964 (Heteroptera, Rhopalidae). The complete 17,309 bp genome encoded 37 genes, including 13 protein-coding genes (PCGs), 22 transfer RNA (tRNA) genes, two ribosomal RNA (rRNA) genes, and a control region. The mitogenome revealed a high A+T content (75.8%), a positive AT-skew (0.092), and a negative GC-skew (–0.165). All 13 PCGs were found to start with ATN codons, except for cox1, in which TTG was the start codon. The Ka/Ks ratios of 13 PCGs were all lower than 1, indicating that purifying selection evolved in these genes. All tRNAs could be folded into the typical cloverleaf secondary structure, except for trnS1 and trnV, which lack dihydrouridine arms. Phylogenetic trees were constructed and analyzed based on the PCG+rRNA from 38 mitogenomes, using maximum likelihood and Bayesian inference methods, showed that M. lateralis and Chorosoma macilentum Stål, 1858 grouped together in the tribe Chorosomatini. In addition, Coreoidea and Pyrrhocoroidea were sister groups among the superfamilies of Trichophora, and Rhopalidae was a sister group to Alydidae + Coreidae.


Sign in / Sign up

Export Citation Format

Share Document