scholarly journals The chloroplast genome of Amygdalus L. (Rosaceae) reveals the phylogenetic relationship and divergence time

BMC Genomics ◽  
2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Zhongyu Du ◽  
Ke Lu ◽  
Kai Zhang ◽  
Yiming He ◽  
Haitao Wang ◽  
...  

Abstract Background Limited access to genetic information has greatly hindered our understanding of the molecular evolution, phylogeny, and differentiation time of subg. Amygdalus. This study reported complete chloroplast (cp) genome sequences of subg. Amygdalus, which further enriched the available valuable resources of complete cp genomes of higher plants and deepened our understanding of the divergence time and phylogenetic relationships of subg. Amygdalus. Results The results showed that subg. Amygdalus species exhibited a tetrad structure with sizes ranging from 157,736 bp (P. kansuensis) to 158,971 bp (P. davidiana), a pair of inverted repeat regions (IRa/IRb) that ranged from 26,137–26,467 bp, a large single-copy region that ranged from 85,757–86,608 bp, and a small single-copy region that ranged from 19,020–19,133 bp. The average GC content of the complete cp genomes in the 12 species was 36.80%. We found that the structure of the subg. Amygdalus complete cp genomes was highly conserved, and the 12 subg. Amygdalus species had an rps19 pseudogene. There was not rearrangement of the complete cp genome in the 12 subg. Amygdalus species. All 12 subg. Amygdalus species clustered into one clade based on both Bayesian inference and maximum likelihood. The divergence time analyses based on the complete cp genome sequences showed that subg. Amygdalus species diverged approximately 15.65 Mya. Conclusion Our results provide data on the genomic structure of subg. Amygdalus and elucidates their phylogenetic relationships and divergence time.

2020 ◽  
Author(s):  
Zhenchao Zhang ◽  
Zhongliang Dai ◽  
Yuemei Yao ◽  
Yongfei Pan ◽  
Guosheng Sun ◽  
...  

Abstract Backgrounds: Broccoli (Brassica. oleracea var. italica L.) is known as one of the most nutritionally rich vegetables, as well as rich in functional components that benefit to health. The main purposes of this research were sequencing, assembling and annotation of chloroplast genome of broccoli based on Illumina HiSeq2500 sequencing platform. Results: The size of the broccoli cp genome is 153,364 bp, including two inverted repeat (IR) regions of 26,197 bp each, separated by a small single copy (SSC) region of 17,834 bp and a large single copy (LSC) region of 83,136 bp. The GC content of the complete genome is 36.36%, while those of SSC, LSC, and IR are 29.1%, 34.15% and 42.35%, respectively. It harbors 134 functional genes, including 87 protein-coding genes, 39 tRNAs and 8 rRNAs, with 31 duplicates in the IRs. The most abundant amino acid in the protein-coding genes is leucine, while the least is cysteine. Codon usage frequency showed bias for A/T-ending codons in the cp genome. In the repeat structure analysis, a total of 34 repeat sequences and 291 simple sequence repeat (SSRs) were detected in the work. Although cp genomic structure and size are highly conserved, the SC-IR boundary regions are variable between the 7 cp genomes. The phylogenetic relationships based on complete cp genome from 9 species suggest that B. oleracea var. italica is closely related to Brassica juncea. Conclusions: The complete cp genome sequence was obtained and annotated for broccoli for the first time. The information acquired from this research will be useful for further species identification, population genetics and biological research of broccoli.


2019 ◽  
Vol 2019 ◽  
pp. 1-17 ◽  
Author(s):  
Samaila S. Yaradua ◽  
Dhafer A. Alzahrani ◽  
Enas J. Albokhary ◽  
Abidina Abba ◽  
Abubakar Bello

The complete chloroplast genome of J. flava, an endangered medicinal plant in Saudi Arabia, was sequenced and compared with cp genome of three Acanthaceae species to characterize the cp genome, identify SSRs, and also detect variation among the cp genomes of the sampled Acanthaceae. NOVOPlasty was used to assemble the complete chloroplast genome from the whole genome data. The cp genome of J. flava was 150, 888bp in length with GC content of 38.2%, and has a quadripartite structure; the genome harbors one pair of inverted repeat (IRa and IRb 25, 500bp each) separated by large single copy (LSC, 82, 995 bp) and small single copy (SSC, 16, 893 bp). There are 132 genes in the genome, which includes 80 protein coding genes, 30 tRNA, and 4 rRNA; 113 are unique while the remaining 19 are duplicated in IR regions. The repeat analysis indicates that the genome contained all types of repeats with palindromic occurring more frequently; the analysis also identified total number of 98 simple sequence repeats (SSR) of which majority are mononucleotides A/T and are found in the intergenic spacer. The comparative analysis with other cp genomes sampled indicated that the inverted repeat regions are conserved than the single copy regions and the noncoding regions show high rate of variation than the coding region. All the genomes have ndhF and ycf1 genes in the border junction of IRb and SSC. Sequence divergence analysis of the protein coding genes showed that seven genes (petB, atpF, psaI, rpl32, rpl16, ycf1, and clpP) are under positive selection. The phylogenetic analysis revealed that Justiceae is sister to Ruellieae. This study reported the first cp genome of the largest genus in Acanthaceae and provided resources for studying genetic diversity of J. flava as well as resolving phylogenetic relationships within the core Acanthaceae.


PeerJ ◽  
2016 ◽  
Vol 4 ◽  
pp. e2699 ◽  
Author(s):  
Wenpan Dong ◽  
Chao Xu ◽  
Delu Li ◽  
Xiaobai Jin ◽  
Ruili Li ◽  
...  

TheHaloxylongenus belongs to the Amaranthaceae (formerly Chenopodiaceae) family. The small trees or shrubs in this genus are referred to as the King of psammophytic plants, and perform important functions in environmental protection, including wind control and sand fixation in deserts. To better understand these beneficial plants, we sequenced the chloroplast (cp) genomes ofHaloxylon ammodendron(HA) andHaloxylon persicum(HP) and conducted comparative genomic analyses on these and two other representative Amaranthaceae species. Similar to other higher plants, we found that theHaloxyloncp genome is a quadripartite, double-stranded, circular DNA molecule of 151,570 bp in HA and 151,586 bp in HP. It contains a pair of inverted repeats (24,171 bp in HA and 24,177 bp in HP) that separate the genome into a large single copy region of 84,214 bp in HA and 84,217 bp in HP, and a small single copy region of 19,014 bp in HA and 19,015 bp in HP. EachHaloxyloncp genome contains 112 genes, including 78 coding, 30 tRNA, and four ribosomal RNA genes. We detected 59 different simple sequence repeat loci, including 44 mono-nucleotide, three di-nucleotide, one tri-nucleotide, and 11 tetra-nucleotide repeats. Comparative analysis revealed only 67 mutations between the two species, including 44 substitutions, 23 insertions/deletions, and two micro-inversions. The two inversions, with lengths of 14 and 3 bp, occur in thepetA-psbJ intergenic region andrpl16 intron, respectively, and are predicted to form hairpin structures with repeat sequences of 27 and 19 bp, respectively, at the two ends. The ratio of transitions to transversions was 0.76. These results are valuable for future studies onHaloxylongenetic diversity and will enhance our understanding of the phylogenetic evolution of Amaranthaceae.


2021 ◽  
Author(s):  
Hui Jiang ◽  
Jing Tian ◽  
Jiaxin Yang ◽  
Xiang Dong ◽  
Zhixiang Zhong ◽  
...  

Abstract Background: Polystachya Hook. is a large pantropical orchid genus (c. 240 species) distributed in Africa, southern Asia and the Americas, with the centre of diversity in Africa. Chloroplast (cp) genomes of plants are highly conserved and can provide much more informative DNA sites and generate much better resolution for plant phylogenies. However, for Polystachya, the whole cp genome including its structure features are yet unknown and its phylogenetic placement of the genus within the Orchidaceae is still unclear.Results: In this study, the complete cp genomes of six Polystachya species were assembled based on genome skimming. We subjected them to comparative genomic analyses and reconstructed their phylogenetic relationships. The results exhibited that the cp genomes had a typical quadripartite structure with conserved genome arrangement and moderate divergence. The cp genomes of the six Polystachya species ranged from 145,484 bp to 149,274 bp in length and had almost similar GC content of 36.9%-37.0%. Gene annotation revealed 113 unique genes. In additions, 19 genes are duplicated in the inverted regions, and 17 gene possessed intron. Comparative analysis of the overall sequence identity among six complete cp genomes confirmed that for both coding and non-coding regions in Polystachya, SC regions exhibit higher sequence variation than IRs. Furthermore, there were various amplifications in the IR region among the six Polystachya species. Most of the protein-coding genes of these species had a high degree of codon preference. We screened out specific SSR and found seven relatively highly variable loci. Moreover, 13 genes were discovered with significant positive selection. Phylogenetic analysis suggested that the six Polystachya species formed a monophyletic clade and had more closely related to tribe Vandeae. Phylogenetic relationships of the family Orchidaceae inferred from the 85 cp genome sequences were generally consistent with previous studies and robust. Conclusions: Our study reported the complete cp genomes of the six Polystachya species, and provided detailed structural analysis and comparative analysis results, which can contribute to the development of DNA markers for use in the study of genetic variability and evolutionary studies in Polystachya. In addition, the present results further demonstrate the phylogenetic position of Polystachya.


Molecules ◽  
2018 ◽  
Vol 23 (9) ◽  
pp. 2137 ◽  
Author(s):  
Xiang-Xiao Meng ◽  
Yan-Fang Xian ◽  
Li Xiang ◽  
Dong Zhang ◽  
Yu-Hua Shi ◽  
...  

The genus Sanguisorba, which contains about 30 species around the world and seven species in China, is the source of the medicinal plant Sanguisorba officinalis, which is commonly used as a hemostatic agent as well as to treat burns and scalds. Here we report the complete chloroplast (cp) genome sequences of four Sanguisorba species (S. officinalis, S. filiformis, S. stipulata, and S. tenuifolia var. alba). These four Sanguisorba cp genomes exhibit typical quadripartite and circular structures, and are 154,282 to 155,479 bp in length, consisting of large single-copy regions (LSC; 84,405–85,557 bp), small single-copy regions (SSC; 18,550–18,768 bp), and a pair of inverted repeats (IRs; 25,576–25,615 bp). The average GC content was ~37.24%. The four Sanguisorba cp genomes harbored 112 different genes arranged in the same order; these identical sections include 78 protein-coding genes, 30 tRNA genes, and four rRNA genes, if duplicated genes in IR regions are counted only once. A total of 39–53 long repeats and 79–91 simple sequence repeats (SSRs) were identified in the four Sanguisorba cp genomes, which provides opportunities for future studies of the population genetics of Sanguisorba medicinal plants. A phylogenetic analysis using the maximum parsimony (MP) method strongly supports a close relationship between S. officinalis and S. tenuifolia var. alba, followed by S. stipulata, and finally S. filiformis. The availability of these cp genomes provides valuable genetic information for future studies of Sanguisorba identification and provides insights into the evolution of the genus Sanguisorba.


2021 ◽  
Vol 46 (1) ◽  
pp. 162-174
Author(s):  
Ming-Hui Yan ◽  
Chun-Yang Li ◽  
Peter W. Fritsch ◽  
Jie Cai ◽  
Heng-Chang Wang

Abstract—The phylogenetic relationships among 11 out of the 12 genera of the angiosperm family Styracaceae have been largely resolved with DNA sequence data based on all protein-coding genes of the plastome. The only genus that has not been phylogenomically investigated in the family with molecular data is the monotypic genus Parastyrax, which is extremely rare in the wild and difficult to collect. To complete the sampling of the genera comprising the Styracaceae, examine the plastome composition of Parastyrax, and further explore the phylogenetic relationships of the entire family, we sequenced the whole plastome of P. lacei and incorporated it into the Styracaceae dataset for phylogenetic analysis. Similar to most others in the family, the plastome is 158189 bp in length and contains a large single-copy region of 88085 bp and a small single-copy region of 18540 bp separated by two inverted-repeat regions of 25781 bp each. A total of 113 genes was predicted, including 79 protein-coding genes, 30 tRNA genes, and four rRNA genes. Phylogenetic relationships among all 12 genera of the family were constructed with 79 protein-coding genes. Consistent with a previous study, Styrax, Huodendron, and a clade of Alniphyllum + Bruinsmia were successively sister to the remainder of the family. Parastyrax was strongly supported as sister to an internal clade comprising seven other genera of the family, whereas Halesia and Pterostyrax were both recovered as polyphyletic, as in prior studies. However, when we employed either the whole plastome or the large- or small-single copy regions as datasets, Pterostyrax was resolved as monophyletic with 100% support, consistent with expectations based on morphology and indicating that non-coding regions of the Styracaceae plastome contain informative phylogenetic signal. Conversely Halesia was still resolved as polyphyletic but with novel strong support.


Genome ◽  
2011 ◽  
Vol 54 (8) ◽  
pp. 663-673 ◽  
Author(s):  
Dai-Yong Kuang ◽  
Hong Wu ◽  
Ya-Ling Wang ◽  
Lian-Ming Gao ◽  
Shou-Zhou Zhang ◽  
...  

Here, we report a completely sequenced plastome using Illumina/Solexa sequencing-by-synthesis (SBS) technology. The plastome of Magnolia kwangsiensis Figlar & Noot. is 159 667 bp in length with a typical quadripartite structure: 88 030 bp large single-copy (LSC) and 18 669 bp small single-copy (SSC) regions, separated by two 26 484 bp inverted repeat (IR) regions. The overall predicted gene number is 129, among which 17 genes are duplicated in IR regions. The plastome of M. kwangsiensis is identical in its gene order to previously published plastomes of magnoliids. Furthermore, the C-to-U type RNA editing frequency of 114 seed plants is positively correlated with plastome GC content and plastome length, whereas plastome length is not correlated with GC content. A total of 16 potential putative barcoding or low taxonomic level phylogenetic study markers in Magnoliaceae were detected by comparing the coding and noncoding regions of the plastome of M. kwangsiensis with that of Liriodendron tulipifera L. At least eight markers might be applied not only to Magnoliaceae but also to other taxa. The 86 mononucleotide cpSSRs that distributed in single-copy noncoding regions are highly valuable to study population genetics and conservation genetics of this endangered rare species.


2021 ◽  
Author(s):  
Jiaxin Yang ◽  
Guoxiong Hu ◽  
Guangwan Hu

Abstract Background Handeliodendron Rehder and Eurycorymbus Hand.-Mazz. are the monotypic genera in the Sapindaceae family. The phylogenetic relationship of these endangered species Handeliodendron bodinieri (Lévl.) Rehd. and Eurycorymbus cavaleriei (Lévl.) Rehd. et Hand.-Mazz. with other members of Sapindaceae s.l. is not well resolved. A previous study concluded that the genus Aesculus might be paraphyletic because Handeliodendron was nested within it based on small DNA fragments. Thus, their chloroplast genomic information and comparative genomic analysis with other Sapindaceae species are necessary and crucial to understand the circumscription and plastome evolution of this family. Results The chloroplast genome sizes of Handeliodendron bodinieri and Eurycorymbus cavaleriei are 151,271 and 158,690 bp, respectively. Results showed that a total of 114 unique genes were annotated in H. bodinieri and E. cavaleriei, and the ycf1 gene contained abundant SSRs in both genomes. Comparative analysis revealed that gene content, PCGs, and total GC content were remarkably similar or identical within 13 genera from Sapindaceae, and the chloroplast genome size of four genera was generally smaller within the family, including Acer, Dipteronia, Aesculus, and Handeliodendron. IR boundaries of the H. bodinieri showed a significant contraction, whereas it presented a notable expansion in E. cavaleriei cp genome. Ycf1, ndhC-trnV-UAC, and rpl32-trnL-UAG-ccsA were remarkably divergent regions in the Sapindaceae species. Phylogenetic analysis based on different datasets, including whole chloroplast genome sequences, coding sequences, large single-copy, small single-copy, and inverted repeat regions, consistently demonstrated that H. bodinieri was sister to the clade consisted of Aesculus chinensis and A. wangii, strongly support Eurycorymbus cavaleriei as sister to Dodonaea viscosa. Conclusion This study revealed that the cp genome size of the Hippocastanoideae was generally smaller across Sapindaceae, and three highly divergent regions could be used as the specific DNA barcodes within Sapindaceae. Phylogenetic results strongly support that the subdivision of four subfamilies within Sapindaceae, and Handeliodendron is not nested within the genus Aesculus.


PeerJ ◽  
2018 ◽  
Vol 6 ◽  
pp. e6032 ◽  
Author(s):  
Zhenyu Zhao ◽  
Xin Wang ◽  
Yi Yu ◽  
Subo Yuan ◽  
Dan Jiang ◽  
...  

Dioscorea L., the largest genus of the family Dioscoreaceae with over 600 species, is not only an important food but also a medicinal plant. The identification and classification of Dioscorea L. is a rather difficult task. In this study, we sequenced five Dioscorea chloroplast genomes, and analyzed with four other chloroplast genomes of Dioscorea species from GenBank. The Dioscorea chloroplast genomes displayed the typical quadripartite structure of angiosperms, which consisted of a pair of inverted repeats separated by a large single-copy region, and a small single-copy region. The location and distribution of repeat sequences and microsatellites were determined, and the rapidly evolving chloroplast genome regions (trnK-trnQ, trnS-trnG, trnC-petN, trnE-trnT, petG-trnW-trnP, ndhF, trnL-rpl32, and ycf1) were detected. Phylogenetic relationships of Dioscorea inferred from chloroplast genomes obtained high support even in shortest internodes. Thus, chloroplast genome sequences provide potential molecular markers and genomic resources for phylogeny and species identification.


2018 ◽  
Vol 123 (5) ◽  
pp. 857-865 ◽  
Author(s):  
Jacqueline Heckenhauer ◽  
Ovidiu Paun ◽  
Mark W Chase ◽  
Peter S Ashton ◽  
A S Kamariah ◽  
...  

Abstract Background and Aims Phylogenetic relationships within tribe Shoreeae, containing the main elements of tropical forests in Southeast Asia, present a long-standing problem in the systematics of Dipterocarpaceae. Sequencing whole plastomes using next-generation sequencing- (NGS) based genome skimming is increasingly employed for investigating phylogenetic relationships of plants. Here, the usefulness of complete plastid genome sequences in resolving phylogenetic relationships within Shoreeae is evaluated. Methods A pipeline to obtain alignments of whole plastid genome sequences across individuals with different amounts of available data is presented. In total, 48 individuals, representing 37 species and four genera of the ecologically and economically important tribe Shoreeae sensu Ashton, were investigated. Phylogenetic trees were reconstructed using maximum parsimony, maximum likelihood and Bayesian inference. Key Results Here, the first fully sequenced plastid genomes for the tribe Shoreeae are presented. Their size, GC content and gene order are comparable with those of other members of Malvales. Phylogenomic analyses demonstrate that whole plastid genomes are useful for inferring phylogenetic relationships among genera and groups of Shorea (Shoreeae) but fail to provide well-supported phylogenetic relationships among some of the most closely related species. Discordance in placement of Parashorea was observed between phylogenetic trees obtained from plastome analyses and those obtained from nuclear single nucleotide polymorphism (SNP) data sets identified in restriction-site associated sequencing (RADseq). Conclusions Phylogenomic analyses of the entire plastid genomes are useful for inferring phylogenetic relationships at lower taxonomic levels, but are not sufficient for detailed phylogenetic reconstructions of closely related species groups in Shoreeae. Discordance in placement of Parashorea was further investigated for evidence of ancient hybridization.


Sign in / Sign up

Export Citation Format

Share Document