scholarly journals Comparative genomics and phylogenetic relationships of two endemic and endangered species (Handeliodendron bodinieri and Eurycorymbus cavaleriei) of two monotypic genera within Sapindales

BMC Genomics ◽  
2022 ◽  
Vol 23 (1) ◽  
Author(s):  
Jiaxin Yang ◽  
Guoxiong Hu ◽  
Guangwan Hu

Abstract Background Handeliodendron Rehder and Eurycorymbus Hand.-Mazz. are the monotypic genera in the Sapindaceae family. The phylogenetic relationship of these endangered species Handeliodendron bodinieri (Lévl.) Rehd. and Eurycorymbus cavaleriei (Lévl.) Rehd. et Hand.-Mazz. with other members of Sapindaceae s.l. is not well resolved. A previous study concluded that the genus Aesculus might be paraphyletic because Handeliodendron was nested within it based on small DNA fragments. Thus, their chloroplast genomic information and comparative genomic analysis with other Sapindaceae species are necessary and crucial to understand the circumscription and plastome evolution of this family. Results The chloroplast genome sizes of Handeliodendron bodinieri and Eurycorymbus cavaleriei are 151,271 and 158,690 bp, respectively. Results showed that a total of 114 unique genes were annotated in H. bodinieri and E. cavaleriei, and the ycf1 gene contained abundant SSRs in both genomes. Comparative analysis revealed that gene content, PCGs, and total GC content were remarkably similar or identical within 13 genera from Sapindaceae, and the chloroplast genome size of four genera was generally smaller within the family, including Acer, Dipteronia, Aesculus, and Handeliodendron. IR boundaries of the H. bodinieri showed a significant contraction, whereas it presented a notable expansion in E. cavaleriei cp genome. Ycf1, ndhC-trnV-UAC, and rpl32-trnL-UAG-ccsA were remarkably divergent regions in the Sapindaceae species. Analysis of selection pressure showed that there are a few positively selected genes. Phylogenetic analysis based on different datasets, including whole chloroplast genome sequences, coding sequences, large single-copy, small single-copy, and inverted repeat regions, consistently demonstrated that H. bodinieri was sister to the clade consisting of Aesculus chinensis and A. wangii and strongly support Eurycorymbus cavaleriei as sister to Dodonaea viscosa. Conclusion This study revealed that the cp genome size of the Hippocastanoideae was generally smaller compared to the other subfamilies within Sapindaceae, and three highly divergent regions could be used as the specific DNA barcodes within Sapindaceae. Phylogenetic results strongly support that the subdivision of four subfamilies within Sapindaceae, and Handeliodendron is not nested within the genus Aesculus.

2021 ◽  
Author(s):  
Jiaxin Yang ◽  
Guoxiong Hu ◽  
Guangwan Hu

Abstract Background Handeliodendron Rehder and Eurycorymbus Hand.-Mazz. are the monotypic genera in the Sapindaceae family. The phylogenetic relationship of these endangered species Handeliodendron bodinieri (Lévl.) Rehd. and Eurycorymbus cavaleriei (Lévl.) Rehd. et Hand.-Mazz. with other members of Sapindaceae s.l. is not well resolved. A previous study concluded that the genus Aesculus might be paraphyletic because Handeliodendron was nested within it based on small DNA fragments. Thus, their chloroplast genomic information and comparative genomic analysis with other Sapindaceae species are necessary and crucial to understand the circumscription and plastome evolution of this family. Results The chloroplast genome sizes of Handeliodendron bodinieri and Eurycorymbus cavaleriei are 151,271 and 158,690 bp, respectively. Results showed that a total of 114 unique genes were annotated in H. bodinieri and E. cavaleriei, and the ycf1 gene contained abundant SSRs in both genomes. Comparative analysis revealed that gene content, PCGs, and total GC content were remarkably similar or identical within 13 genera from Sapindaceae, and the chloroplast genome size of four genera was generally smaller within the family, including Acer, Dipteronia, Aesculus, and Handeliodendron. IR boundaries of the H. bodinieri showed a significant contraction, whereas it presented a notable expansion in E. cavaleriei cp genome. Ycf1, ndhC-trnV-UAC, and rpl32-trnL-UAG-ccsA were remarkably divergent regions in the Sapindaceae species. Phylogenetic analysis based on different datasets, including whole chloroplast genome sequences, coding sequences, large single-copy, small single-copy, and inverted repeat regions, consistently demonstrated that H. bodinieri was sister to the clade consisted of Aesculus chinensis and A. wangii, strongly support Eurycorymbus cavaleriei as sister to Dodonaea viscosa. Conclusion This study revealed that the cp genome size of the Hippocastanoideae was generally smaller across Sapindaceae, and three highly divergent regions could be used as the specific DNA barcodes within Sapindaceae. Phylogenetic results strongly support that the subdivision of four subfamilies within Sapindaceae, and Handeliodendron is not nested within the genus Aesculus.


2020 ◽  
Author(s):  
Zhenchao Zhang ◽  
Zhongliang Dai ◽  
Yuemei Yao ◽  
Yongfei Pan ◽  
Guosheng Sun ◽  
...  

Abstract Backgrounds: Broccoli (Brassica. oleracea var. italica L.) is known as one of the most nutritionally rich vegetables, as well as rich in functional components that benefit to health. The main purposes of this research were sequencing, assembling and annotation of chloroplast genome of broccoli based on Illumina HiSeq2500 sequencing platform. Results: The size of the broccoli cp genome is 153,364 bp, including two inverted repeat (IR) regions of 26,197 bp each, separated by a small single copy (SSC) region of 17,834 bp and a large single copy (LSC) region of 83,136 bp. The GC content of the complete genome is 36.36%, while those of SSC, LSC, and IR are 29.1%, 34.15% and 42.35%, respectively. It harbors 134 functional genes, including 87 protein-coding genes, 39 tRNAs and 8 rRNAs, with 31 duplicates in the IRs. The most abundant amino acid in the protein-coding genes is leucine, while the least is cysteine. Codon usage frequency showed bias for A/T-ending codons in the cp genome. In the repeat structure analysis, a total of 34 repeat sequences and 291 simple sequence repeat (SSRs) were detected in the work. Although cp genomic structure and size are highly conserved, the SC-IR boundary regions are variable between the 7 cp genomes. The phylogenetic relationships based on complete cp genome from 9 species suggest that B. oleracea var. italica is closely related to Brassica juncea. Conclusions: The complete cp genome sequence was obtained and annotated for broccoli for the first time. The information acquired from this research will be useful for further species identification, population genetics and biological research of broccoli.


Viruses ◽  
2020 ◽  
Vol 12 (12) ◽  
pp. 1373
Author(s):  
Sang Guen Kim ◽  
Sung Bin Lee ◽  
Sib Sankar Giri ◽  
Hyoun Joong Kim ◽  
Sang Wha Kim ◽  
...  

Jumbo phages, which have a genome size of more than 200 kb, have recently been reported for the first time. However, limited information is available regarding their characteristics because few jumbo phages have been isolated. Therefore, in this study, we aimed to isolate and characterize other jumbo phages. We performed comparative genomic analysis of three Erwinia phages (pEa_SNUABM_12, pEa_SNUABM_47, and pEa_SNUABM_50), each of which had a genome size of approximately 360 kb (32.5% GC content). These phages were predicted to harbor 546, 540, and 540 open reading frames with 32, 34, and 35 tRNAs, respectively. Almost all of the genes in these phages could not be functionally annotated but showed high sequence similarity with genes encoded in Serratia phage BF, a member of Eneladusvirus. The detailed comparative and phylogenetic analyses presented in this study contribute to our understanding of the diversity and evolution of Erwinia phage and the genus Eneladusvirus.


2019 ◽  
Vol 2019 ◽  
pp. 1-17 ◽  
Author(s):  
Samaila S. Yaradua ◽  
Dhafer A. Alzahrani ◽  
Enas J. Albokhary ◽  
Abidina Abba ◽  
Abubakar Bello

The complete chloroplast genome of J. flava, an endangered medicinal plant in Saudi Arabia, was sequenced and compared with cp genome of three Acanthaceae species to characterize the cp genome, identify SSRs, and also detect variation among the cp genomes of the sampled Acanthaceae. NOVOPlasty was used to assemble the complete chloroplast genome from the whole genome data. The cp genome of J. flava was 150, 888bp in length with GC content of 38.2%, and has a quadripartite structure; the genome harbors one pair of inverted repeat (IRa and IRb 25, 500bp each) separated by large single copy (LSC, 82, 995 bp) and small single copy (SSC, 16, 893 bp). There are 132 genes in the genome, which includes 80 protein coding genes, 30 tRNA, and 4 rRNA; 113 are unique while the remaining 19 are duplicated in IR regions. The repeat analysis indicates that the genome contained all types of repeats with palindromic occurring more frequently; the analysis also identified total number of 98 simple sequence repeats (SSR) of which majority are mononucleotides A/T and are found in the intergenic spacer. The comparative analysis with other cp genomes sampled indicated that the inverted repeat regions are conserved than the single copy regions and the noncoding regions show high rate of variation than the coding region. All the genomes have ndhF and ycf1 genes in the border junction of IRb and SSC. Sequence divergence analysis of the protein coding genes showed that seven genes (petB, atpF, psaI, rpl32, rpl16, ycf1, and clpP) are under positive selection. The phylogenetic analysis revealed that Justiceae is sister to Ruellieae. This study reported the first cp genome of the largest genus in Acanthaceae and provided resources for studying genetic diversity of J. flava as well as resolving phylogenetic relationships within the core Acanthaceae.


2010 ◽  
Vol 78 (12) ◽  
pp. 5214-5222 ◽  
Author(s):  
Alina Nakhamchik ◽  
Caroline Wilde ◽  
Henry Chong ◽  
Dean A. Rowe-Magnus

ABSTRACT The most intensely studied of the Vibrio vulnificus virulence factors is the capsular polysaccharide (CPS). All virulent strains produce copious amounts of CPS. Acapsular strains are avirulent. The structure of the CPS from the clinical isolate ATCC 27562 is unusual. It is serine modified and contains, surprisingly, N-acetylmuramic acid. We identified the complete 25-kb CPS biosynthesis locus from ATCC 27562. It contained 21 open reading frames and was allelic to O-antigen biosynthesis loci. Two of the genes, murA CPS and murB CPS, were paralogs of the murA PG and murB PG genes of the peptidoglycan biosynthesis pathway; only a single copy of these genes is present in the strain CMCP6 and YJ016 genomes. Although MurACPS and MurBCPS were functional when expressed in Escherichia coli, lesions in either gene had no effect on CPS production, virulence, or growth in V. vulnificus; disruption of 8 other genes within the locus resulted in an acapsular phenotype and attenuated virulence. Thus, murA CPS and murB CPS were functional but redundant. Comparative genomic analysis revealed that while completely different CPS biosynthesis loci were found in the same chromosomal region in other V. vulnificus strains, most of the CPS locus of ATCC 27562 was conserved in another marine bacterium, Shewanella putrefaciens strain 200. However, the average GC content of the CPS locus was significantly lower than the average GC content of either genome. Furthermore, several of the encoded proteins appeared to be of Gram-positive and archaebacterial origin. These data indicate that the horizontal transfer of intact and partial CPS loci drives CPS diversity in marine bacteria.


2021 ◽  
Vol 11 ◽  
Author(s):  
Yongtan Li ◽  
Yan Dong ◽  
Yichao Liu ◽  
Xiaoyue Yu ◽  
Minsheng Yang ◽  
...  

In this study, we assembled and annotated the chloroplast (cp) genome of the Euonymus species Euonymus fortunei, Euonymus phellomanus, and Euonymus maackii, and performed a series of analyses to investigate gene structure, GC content, sequence alignment, and nucleic acid diversity, with the objectives of identifying positive selection genes and understanding evolutionary relationships. The results indicated that the Euonymus cp genome was 156,860–157,611bp in length and exhibited a typical circular tetrad structure. Similar to the majority of angiosperm chloroplast genomes, the results yielded a large single-copy region (LSC) (85,826–86,299bp) and a small single-copy region (SSC) (18,319–18,536bp), separated by a pair of sequences (IRA and IRB; 26,341–26,700bp) with the same encoding but in opposite directions. The chloroplast genome was annotated to 130–131 genes, including 85–86 protein coding genes, 37 tRNA genes, and eight rRNA genes, with GC contents of 37.26–37.31%. The GC content was variable among regions and was highest in the inverted repeat (IR) region. The IR boundary of Euonymus happened expanding resulting that the rps19 entered into IR region and doubled completely. Such fluctuations at the border positions might be helpful in determining evolutionary relationships among Euonymus. The simple-sequence repeats (SSRs) of Euonymus species were composed primarily of single nucleotides (A)n and (T)n, and were mostly 10–12bp in length, with an obvious A/T bias. We identified several loci with suitable polymorphism with the potential use as molecular markers for inferring the phylogeny within the genus Euonymus. Signatures of positive selection were seen in rpoB protein encoding genes. Based on data from the whole chloroplast genome, common single copy genes, and the LSC, SSC, and IR regions, we constructed an evolutionary tree of Euonymus and related species, the results of which were consistent with traditional taxonomic classifications. It showed that E. fortunei sister to the Euonymus japonicus, whereby E. maackii appeared as sister to Euonymus hamiltonianus. Our study provides important genetic information to support further investigations into the phylogenetic development and adaptive evolution of Euonymus species.


2018 ◽  
Author(s):  
Zerui Yang ◽  
Yuying Huang ◽  
Xiasheng Zheng ◽  
Song Huang ◽  
Lingling Liang

Lycium chinense Mill, an important Chinese herbal medicine, is emphasized as a healthy food and is widely used as a dietary supplement. Here we sequenced and analyzed the complete chloroplast (CP) genome of the L. chinense, which is 155,756 bp in length and with 37.8% GC content. This CP genome consists of a pair of inverted repeat regions (IRa and IRb) of 25,476 bp, separated by a large single-copy region (LSC) and a small single-copy region (SSC), with length of 86,595 and 18,209 bp, respectively. Annotation results revealed that the L. chinense CP genome contains 114 genes, 16 of which are duplicated genes. Most of the 85 protein-coding genes have a usual ATG start codon, except for 3 genes including rps12, psbL and ndhD. Furthermore, most of the simple sequence repeats (SSRs) are short polyadenine or polythymine repeats that contribute to the high AT content of the chloroplast genome. Revealing of the complete sequences and annotation of the L. chinense chloroplast genome will facilitate phylogenic, population and genetic engineering research investigations involving this particular species.


2018 ◽  
Author(s):  
Zerui Yang ◽  
Yuying Huang ◽  
Xiasheng Zheng ◽  
Song Huang ◽  
Lingling Liang

Lycium chinense Mill, an important Chinese herbal medicine, is emphasized as a healthy food and is widely used as a dietary supplement. Here we sequenced and analyzed the complete chloroplast (CP) genome of the L. chinense, which is 155,756 bp in length and with 37.8% GC content. This CP genome consists of a pair of inverted repeat regions (IRa and IRb) of 25,476 bp, separated by a large single-copy region (LSC) and a small single-copy region (SSC), with length of 86,595 and 18,209 bp, respectively. Annotation results revealed that the L. chinense CP genome contains 114 genes, 16 of which are duplicated genes. Most of the 85 protein-coding genes have a usual ATG start codon, except for 3 genes including rps12, psbL and ndhD. Furthermore, most of the simple sequence repeats (SSRs) are short polyadenine or polythymine repeats that contribute to the high AT content of the chloroplast genome. Revealing of the complete sequences and annotation of the L. chinense chloroplast genome will facilitate phylogenic, population and genetic engineering research investigations involving this particular species.


2018 ◽  
Vol 19 (8) ◽  
pp. 2443 ◽  
Author(s):  
Xuan Li ◽  
Yongfu Li ◽  
Mingyue Zang ◽  
Mingzhi Li ◽  
Yanming Fang

Quercus acutissima, an important endemic and ecological plant of the Quercus genus, is widely distributed throughout China. However, there have been few studies on its chloroplast genome. In this study, the complete chloroplast (cp) genome of Q. acutissima was sequenced, analyzed, and compared to four species in the Fagaceae family. The size of the Q. acutissima chloroplast genome is 161,124 bp, including one large single copy (LSC) region of 90,423 bp and one small single copy (SSC) region of 19,068 bp, separated by two inverted repeat (IR) regions of 51,632 bp. The GC content of the whole genome is 36.08%, while those of LSC, SSC, and IR are 34.62%, 30.84%, and 42.78%, respectively. The Q. acutissima chloroplast genome encodes 136 genes, including 88 protein-coding genes, four ribosomal RNA genes, and 40 transfer RNA genes. In the repeat structure analysis, 31 forward and 22 inverted long repeats and 65 simple-sequence repeat loci were detected in the Q. acutissima cp genome. The existence of abundant simple-sequence repeat loci in the genome suggests the potential for future population genetic work. The genome comparison revealed that the LSC region is more divergent than the SSC and IR regions, and there is higher divergence in noncoding regions than in coding regions. The phylogenetic relationships of 25 species inferred that members of the Quercus genus do not form a clade and that Q. acutissima is closely related to Q. variabilis. This study identified the unique characteristics of the Q. acutissima cp genome, which will provide a theoretical basis for species identification and biological research.


2021 ◽  
Vol 53 (4) ◽  
Author(s):  
Jean N. Hakizimana ◽  
Jean B. Ntirandekura ◽  
Clara Yona ◽  
Lionel Nyabongo ◽  
Gladson Kamwendo ◽  
...  

AbstractSeveral African swine fever (ASF) outbreaks in domestic pigs have been reported in Burundi and Malawi and whole-genome sequences of circulating outbreak viruses in these countries are limited. In the present study, complete genome sequences of ASF viruses (ASFV) that caused the 2018 outbreak in Burundi (BUR/18/Rutana) and the 2019 outbreak in Malawi (MAL/19/Karonga) were produced using Illumina next-generation sequencing (NGS) platform and compared with other previously described ASFV complete genomes. The complete nucleotide sequences of BUR/18/Rutana and MAL/19/Karonga were 176,564 and 183,325 base pairs long with GC content of 38.62 and 38.48%, respectively. The MAL/19/Karonga virus had a total of 186 open reading frames (ORFs) while the BUR/18/Rutana strain had 151 ORFs. After comparative genomic analysis, the MAL/19/Karonga virus showed greater than 99% nucleotide identity with other complete nucleotides sequences of p72 genotype II viruses previously described in Tanzania, Europe and Asia including the Georgia 2007/1 isolate. The Burundian ASFV BUR/18/Rutana exhibited 98.95 to 99.34% nucleotide identity with genotype X ASFV previously described in Kenya and in Democratic Republic of the Congo (DRC). The serotyping results classified the BUR/18/Rutana and MAL/19/Karonga ASFV strains in serogroups 7 and 8, respectively. The results of this study provide insight into the genetic structure and antigenic diversity of ASFV strains circulating in Burundi and Malawi. This is important in order to understand the transmission dynamics and genetic evolution of ASFV in eastern Africa, with an ultimate goal of designing an efficient risk management strategy against ASF transboundary spread.


Sign in / Sign up

Export Citation Format

Share Document