scholarly journals Comparative Genomics of 86 Whole-Genome Sequences in the Six Species of the Elizabethkingia Genus Reveals Intraspecific and Interspecific Divergence

2019 ◽  
Vol 9 (1) ◽  
Author(s):  
Chih-Yu Liang ◽  
Chih-Hui Yang ◽  
Chung-Hsu Lai ◽  
Yi-Han Huang ◽  
Jiun-Nong Lin

AbstractBacteria of the genus Elizabethkingia are emerging infectious agents that can cause infection in humans. The number of published whole-genome sequences of Elizabethkingia is rapidly increasing. In this study, we used comparative genomics to investigate the genomes of the six species in the Elizabethkingia genus, namely E. meningoseptica, E. anophelis, E. miricola, E. bruuniana, E. ursingii, and E. occulta. In silico DNA–DNA hybridization, whole-genome sequence-based phylogeny, pan genome analysis, and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses were performed, and clusters of orthologous groups were evaluated. Of the 86 whole-genome sequences available in GenBank, 21 were complete genome sequences and 65 were shotgun sequences. In silico DNA–DNA hybridization clearly delineated the six Elizabethkingia species. Phylogenetic analysis confirmed that E. bruuniana, E. ursingii, and E. occulta were closer to E. miricola than to E. meningoseptica and E. anophelis. A total of 2,609 clusters of orthologous groups were identified among the six type strains of the Elizabethkingia genus. Metabolism-related clusters of orthologous groups accounted for the majority of gene families in KEGG analysis. New genes were identified that substantially increased the total repertoire of the pan genome after the addition of 86 Elizabethkingia genomes, which suggests that Elizabethkingia has shown adaptive evolution to environmental change. This study presents a comparative genomic analysis of Elizabethkingia, and the results of this study provide knowledge that facilitates a better understanding of this microorganism.

2020 ◽  
Author(s):  
Antonio Roberto Gomes de Farias ◽  
Wilson José da Silva Junior ◽  
José Bandeira do Nascimento Junior ◽  
Valdir de Queiroz Balbino ◽  
Ana Maria Benko-Iseppon ◽  
...  

Abstract Background Xanthomonas citri pv. viticola is one of the most critical grapevine diseases in the Northeast of Brazil, presenting a high risk to Brazilian and worldwide areas of grape production. The X.citri pv. viticola epithet was recently proposed to be changed from X. campestris pv. v iticola based on multilocus sequence analysis and whole-genome sequences. Besides, genomics has revolutionized the field of bacteriology, by associating genome sequencing with comparative analysis such as in silico analysis such as DNA-DNA hybridization, average nucleotide identity, distance between genomes, pan-genomic approach, and phylogenomic, providing valuable insights and knowledge about virulence factors and contributing to increase the understanding and clarifying the taxonomic relationship of Xanthomonas and others prokaryotic species.Results We used the whole-genome sequence of three Brazilian strains and the pathotype to characterize X.citri pv. viticola accessions plus 124 whole-genome sequences of Xanthomonas species available in NCBI, comprising 13 species and 15 pathovars. The whole-genome sequence structure of X. citri pv. viticola was shown presents a high level of conservation concerning other X. citri species. Pan-genomic approaches, average nucleotide identity analysis, and in silico DNA-DNA hybridization were carried out, allowing X.citri pv. viticola characterization and inferences on the phylogenetic relationships within Xanthomonas . The analysis of the sequence of the 128 genomes clustered the Xanthomonas strains in eight main groups according to the recently proposed classification in all approaches used. Also, the analysis revealed that X. hortorum and X. gardneri should be classified as a single species, and the strain 17 of X. campestris and XC01 of X. citri pv. mangiferaeindicae widely described in the literature are misclassified.Conclusions We performed the genomic characterization of three representative Brazilian strains of Xcv . The genomic approaches based in the pan-genome, average nucleotide identity, and in silico DNA-DNA hybridization support the proposed taxonomic position of X.citri pv. viticola and of the recently proposed Xanthomonas species and pathovars. In addition, we detected species delimitation of the misclassified Xanthomonas strains with extensive studies reported in the literature.


Author(s):  
Viola Kurm ◽  
Ilse Houwers ◽  
Claudia E. Coipan ◽  
Peter Bonants ◽  
Cees Waalwijk ◽  
...  

AbstractIdentification and classification of members of the Ralstonia solanacearum species complex (RSSC) is challenging due to the heterogeneity of this complex. Whole genome sequence data of 225 strains were used to classify strains based on average nucleotide identity (ANI) and multilocus sequence analysis (MLSA). Based on the ANI score (>95%), 191 out of 192(99.5%) RSSC strains could be grouped into the three species R. solanacearum, R. pseudosolanacearum, and R. syzygii, and into the four phylotypes within the RSSC (I,II, III, and IV). R. solanacearum phylotype II could be split in two groups (IIA and IIB), from which IIB clustered in three subgroups (IIBa, IIBb and IIBc). This division by ANI was in accordance with MLSA. The IIB subgroups found by ANI and MLSA also differed in the number of SNPs in the primer and probe sites of various assays. An in-silico analysis of eight TaqMan and 11 conventional PCR assays was performed using the whole genome sequences. Based on this analysis several cases of potential false positives or false negatives can be expected upon the use of these assays for their intended target organisms. Two TaqMan assays and two PCR assays targeting the 16S rDNA sequence should be able to detect all phylotypes of the RSSC. We conclude that the increasing availability of whole genome sequences is not only useful for classification of strains, but also shows potential for selection and evaluation of clade specific nucleic acid-based amplification methods within the RSSC.


2021 ◽  
Vol 368 (5) ◽  
Author(s):  
Chen Yang ◽  
Zhe Liu ◽  
Shuai Yu ◽  
Kun Ye ◽  
Xin Li ◽  
...  

Abstract Elizabethkingia are found to cause severe neonatal meningitis, nosocomial pneumonia, endocarditis and bacteremia. However, there are few studies on Elizabethkingia genus by comparative genomic analysis. In this study, three species of Elizabethkingia were found: E. meningoseptica, E. anophelis and E. miricola. Resistance genes and associated proteins of seven classes of antibiotics including beta-lactams, aminoglycosides, macrolides, tetracyclines, quinolones, sulfonamides and glycopeptides, as well as multidrug resistance efflux pumps were identified from 20 clinical isolates of Elizabethkingia by whole-genome sequence. Genotype and phenotype displayed a good consistency in beta-lactams, aminoglycosides and glycopeptides, while contradictions exhibited in tetracyclines, quinolones and sulfonamides. Virulence factors and associated genes such as hsp60 (htpB), exopolysaccharide (EPS) (galE/pgi), Mg2+ transport (mgtB/mgtE) and catalase (katA/katG) existed in all clinical and reference strains. The functional analysis of the clusters of orthologous groups indicated that ‘metabolism’ occupied the largest part in core genome, ‘information storage and processing’ was the largest group in both accessory genome and unique genome. Abundant mobile elements were identified in E. meningoseptica and E. anophelis. The most significant finding in our study was that a single clone of E. anophelis had been circulating within diversities of departments in a clinical setting for nearly 18 months.


2019 ◽  
Vol 6 (Supplement_2) ◽  
pp. S295-S295
Author(s):  
Hoan J Lee ◽  
Joon Kee Lee ◽  
Yun Young Choi ◽  
Ji Young Park ◽  
Moon-Woo Seong ◽  
...  

Abstract Background This study applied high-throughput whole-genome sequencing (WGS) technologies to investigate the comparative genomics of 30 M. pneumoniae strains isolated from children with pneumonia in South Korea during two epidemics from 2010 to 2016 in comparison with a global collection of 48 Mycoplasma pneumoniae strains which includes seven countries ranging from 1944 to 2017. Methods A total number of 30 M. pneumoniae strains were selected for whole-genome sequence analysis from two epidemics, 2010–2012 and 2014–2016. Next-generation sequencing (NGS) of all M. pneumoniae strains was performed using the Illumina MiSeq desktop sequencer. Comparative genomic analysis was performed using BLAST Ring Image Generator (BRIG), MAUVE, MAFFT, CLC Phylogeny Module, SnpEff, and Pathosystems Resource Integration Center (PATRIC). Results The 30 Korean strains had approximately 40% GC content and ranged from 815,686 to 818,669 base pairs, coding for a total of 809 to 828 genes. Overall, BRIG revealed 99% to>99% similarity among strains. The genomic similarity dropped to approximately 95% in the P1 type 2 strains when aligned to the reference M129 genome, which corresponded to the region of the p1 gene. MAUVE detected four subtype-specific of which were all hypothetical proteins except for one tRNA insertion in all P1 type 1 strains. eBURST analysis demonstrated two clonal complexes which are accordant with the known P1 typing, with higher diversity among P1 type 2 strains. The phylogenetic tree constructed with 78 genomes including 48 genomes outside Korea, formed three clusters, in which the sequence type 3 strains from Korea were divided into two P1 type 1 clusters. Conclusion The comparative genomics of the 78 M. pneumoniae strains including 30 strains from Korea by WGS reveals structural diversity and phylogenetic associations, even though the similarity across the strains was very high. Disclosures All authors: No reported disclosures.


Diversity ◽  
2018 ◽  
Vol 10 (4) ◽  
pp. 113 ◽  
Author(s):  
Sunitha Joseph ◽  
Rebecca O’Connor ◽  
Abdullah Al Mutery ◽  
Mick Watson ◽  
Denis Larkin ◽  
...  

Whole genome assemblies are crucial for understanding a wide range of aspects of falcon biology, including morphology, ecology, and physiology, and are thus essential for their care and conservation. A key aspect of the genome of any species is its karyotype, which can then be linked to the whole genome sequence to generate a so-called chromosome-level assembly. Chromosome-level assemblies are essential for marker assisted selection and genotype-phenotype correlations in breeding regimes, as well as determining patterns of gross genomic evolution. To date, only two falcon species have been sequenced and neither initially were assembled to the chromosome level. Falcons have atypical avian karyotypes with fewer chromosomes than other birds, presumably brought about by wholesale fusion. To date, however, published chromosome preparations are of poor quality, few chromosomes have been distinguished and standard ideograms have not been made. The purposes of this study were to generate analyzable karyotypes and ideograms of peregrine, saker, and gyr falcons, report on our recent generation of chromosome level sequence assemblies of peregrine and saker falcons, and for the first time, sequence the gyr falcon genome. Finally, we aimed to generate comparative genomic data between all three species and the reference chicken genome. Results revealed a diploid number of 2n = 50 for peregrine falcon and 2n = 52 for saker and gyr through high quality banded chromosomes. Standard ideograms that are generated here helped to map predicted chromosomal fragments (PCFs) from the genome sequences directly to chromosomes and thus generate chromosome level sequence assemblies for peregrine and saker falcons. Whole genome sequencing was successful in gyr falcon, but read depth and coverage was not sufficient to generate a chromosome level assembly. Nonetheless, comparative genomics revealed no differences in genome organization between gyr and saker falcons. When compared to peregrine falcon, saker/gyr differed by one interchromosomal and seven intrachromosomal rearrangements (a fusion plus seven inversions), whereas peregrine and saker/gyr differ from the reference chicken genome by 14/13 fusions (11 microchromosomal) and six fissions. The chromosomal differences between the species could potentially provide the basis of a screening test for hybrid animals.


2020 ◽  
Author(s):  
Chen Yang ◽  
Zhe Liu ◽  
Shuai Yu ◽  
Kun Ye ◽  
Xin Li ◽  
...  

Abstract Background: There are increasing researches on whole-genome sequences for clinical strains of Elizabethkingia genus which can cause severe infection in humans, while few studies on the comparative genomics of species in the Elizabethkingia genus in China have been conducted. Methods: The Elizabethkingia genus, isolated in a tertiary hospital of Beijing, China, were re-identified and analyzed through in silico DNA-DNA hybridization (DDH), whole-genome sequence-based phylogeny. Antibiotic resistance genes, antimicrobial resistance-associated proteins, virulence factors were identified, and clusters of orthologous groups were evaluated by Kyoto Encyclopedia of Genes and Genomes (KEGG). The clinical information of patients infected by these organisms was collected and the characteristics were analyzed. Results: There were three species among 20 clinical isolates of Elizabethkingia genus: E. meningoseptica, E. anophelis and E. miricola. E. anophelis accounted for the majority. E. meningoseptica exhibited higher GC content and possessed carbapenemase-encoding genes of blaGOB-16 and blaB-12 while E. anophelis carried genes of blaCME-1. Multiple kinds of antimicrobial resistance-associated proteins were predicted and the virulence factors about adherence, biofilm formation, iron and magnesium uptake, stress adaptation, and immune evasion were discovered. Among 2622 clusters of core genomes identified from the three species of the Elizabethkingia genus, the majority of genes were metabolism-related. Pan genome displayed an upregulation, while the core genome displayed a downregulation with the addition of new genes for the 20 Elizabethkingia strains. Conclusions: The composition was different in antimicrobial resistance-related, virulence-related and metabolism-related genes depending upon species of Elizabethkingia. An adaptive evolution of Elizabethkingia to environmental change including hospital settings has been developed.


F1000Research ◽  
2016 ◽  
Vol 5 ◽  
pp. 2003 ◽  
Author(s):  
Michael P. Heaton ◽  
Timothy P.L. Smith ◽  
Jacky K. Carnahan ◽  
Veronica Basnayake ◽  
Jiansheng Qiu ◽  
...  

The availability of whole genome sequence (WGS) data has made it possible to discover protein variantsin silico. However, existing bovine WGS databases do not show data in a form conducive to protein variant analysis, and tend to under represent the breadth of genetic diversity in global beef cattle. Thus, our first aim was to use 96 beef sires, sharing minimal pedigree relationships, to create a searchable and publicly viewable set of mapped genomes relevant for 19 popular breeds of U.S. cattle. Our second aim was to identify protein variants encoded by the bovine endothelial PAS domain-containing protein 1 gene (EPAS1), a gene associated with pulmonary hypertension in Angus cattle. The identity and quality of genomic sequences were verified by comparing WGS genotypes to those derived from other methods. The average read depth, genotype scoring rate, and genotype accuracy exceeded 14, 99%, and 99%, respectively. The 96 genomes were used to discover four amino acid variants encoded byEPAS1(E270Q, P362L, A671G, and L701F) and confirm two variants previously associated with disease (A606T and G610S). The sixEPAS1missense mutations were verified with matrix-assisted laser desorption/ionization time-of-flight mass spectrometry assays, and their frequencies were estimated in a separate collection of 1154 U.S. cattle representing 46 breeds. A rooted phylogenetic tree of eight polypeptide sequences provided a framework for evaluating the likely order of mutations and potential impact ofEPAS1alleles on the adaptive response to chronic hypoxia in U.S. cattle. This public, whole genome resource facilitatesin silicoidentification of protein variants in diverse types of U.S. beef cattle, and provides a means of translating WGS data into a practical biological and evolutionary context for generating and testing hypotheses.


2017 ◽  
Vol 5 (28) ◽  
Author(s):  
Sara Jones ◽  
Raji Prasad ◽  
Anjana S. Nair ◽  
Sanjai Dharmaseelan ◽  
Remya Usha ◽  
...  

ABSTRACT We report here the whole-genome sequence of six clinical isolates of influenza A(H1N1)pdm09, isolated from Kerala, India. Amino acid analysis of all gene segments from the A(H1N1)pdm09 isolates obtained in 2014 and 2015 identified several new mutations compared to the 2009 A(H1N1) pandemic strain.


2020 ◽  
Author(s):  
Zhong Peng ◽  
Junyang Liu ◽  
Wan Liang ◽  
Fei Wang ◽  
Li Wang ◽  
...  

Abstract Background: Different typing systems including capsular genotyping, lipopolysaccharide (LPS) genotyping, multilocus sequence typing (MLST), and virulence genotyping based on the detection of different virulence factor-encoding gene (VFG) profiles have been applied to characterize Pasteurella multocida strains from different host species. However, these methods require much time and effort in laboratories. Particularly, relying on one of these methods is difficult to address the biology of P. multocida from host species. Recently, we found that assigning P. multocida strains according to the combination of their capsular, LPS, and MLST genotypes (marked as capsular genotype: LPS genotype: MLST genotype) could help address the biological characteristics of P. multocida circulation in multiple hosts. However, it is still lack of a rapid, efficient, intelligent and cost-saving tool to diagnose P. multocida according to this system. Results: We have developed an intelligent genotyping and host tropism prediction tool PmGT for P. multocida strains according to their whole genome sequences by using machine learning and web 2.0 technologies. By using this tool, the capsular genotypes, LPS genotypes, and MLST genotypes as well as the main VFGs of P. multocida isolates in different host species were determined based on whole genome sequences. The results revealed a closer association between the genotypes and pasteurellosis rather than between genotypes and host species. Finally, we also used PmGT to predict the host species of P. multocida strains with the same capsular: lipopolysaccharide: MLST genotypes. Conclusions: With the advent of high-quality, inexpensive DNA sequencing, this platform represents a more efficient and cost-saving tool for P. multocida diagnosis in both epidemiological studies and clinical settings.


Sign in / Sign up

Export Citation Format

Share Document