scholarly journals Candida auris Whole-Genome Sequence Benchmark Dataset for Phylogenomic Pipelines

2021 ◽  
Vol 7 (3) ◽  
pp. 214
Author(s):  
Rory M. Welsh ◽  
Elizabeth Misas ◽  
Kaitlin Forsberg ◽  
Meghan Lyman ◽  
Nancy A. Chow

Candida auris is a multidrug-resistant pathogen that represents a serious public health threat due to its rapid global emergence, increasing incidence of healthcare-associated outbreaks, and high rates of antifungal resistance. Whole-genome sequencing and genomic surveillance have the potential to bolster C. auris surveillance networks moving forward. Laboratories conducting genomic surveillance need to be able to compare analyses from various national and international surveillance partners to ensure that results are mutually trusted and understood. Therefore, we established an empirical outbreak benchmark dataset consisting of 23 C. auris genomes to help validate comparisons of genomic analyses and facilitate communication among surveillance networks. Our outbreak benchmark dataset represents a polyclonal phylogeny with three subclades. The genomes in this dataset are from well-vetted studies that are supported by multiple lines of evidence, which demonstrate that the whole-genome sequencing data, phylogenetic tree, and epidemiological data are all in agreement. This C. auris benchmark set allows for standardized comparisons of phylogenomic pipelines, ultimately promoting effective C. auris collaborations.

2017 ◽  
Author(s):  
Emily J. Goldstein ◽  
William T. Harvey ◽  
Gavin S. Wilkie ◽  
Samantha J. Shepherd ◽  
Alasdair R. MacLean ◽  
...  

AbstractGenetic surveillance of seasonal influenza is largely focused upon sequencing of the haemagglutinin gene. Consequently, our understanding of the contribution of the remaining seven gene segments to the evolution and epidemiological dynamics of seasonal influenza is relatively limited. The increased availability of next generation sequencing technologies allows rapid and economic whole genome sequencing (WGS). Here, 150 influenza A(H3N2) positive clinical specimens with linked epidemiological data, from the 2014/15 season in Scotland, were sequenced directly using both Sanger sequencing of the HA1 region and WGS using the Illumina MiSeq platform. Sequences generated by both methods were highly consistent and WGS provided on average >90% whole genome coverage. As reported in other European countries during 2014/15, all strains belonged to genetic group 3C, with subgroup 3C.2a predominating. Inter-subgroup reassortants were identified (9%), including three 3C.3 viruses descended from a single reassortment event, which had persisted in the population. Significant phylogenetic associations with cases of severe acute respiratory illness observed herein warrant further investigation. Severe cases were also more likely to be associated with reassortant viruses (odds ratio: 4.4 (1.3-15.5)) and occur later in the season. These results suggest that increased levels of WGS, linked to clinical and epidemiological data, could improve influenza surveillance.


Heredity ◽  
2021 ◽  
Author(s):  
Axel Jensen ◽  
Mette Lillie ◽  
Kristofer Bergström ◽  
Per Larsson ◽  
Jacob Höglund

AbstractThe use of genetic markers in the context of conservation is largely being outcompeted by whole-genome data. Comparative studies between the two are sparse, and the knowledge about potential effects of this methodology shift is limited. Here, we used whole-genome sequencing data to assess the genetic status of peripheral populations of the wels catfish (Silurus glanis), and discuss the results in light of a recent microsatellite study of the same populations. The Swedish populations of the wels catfish have suffered from severe declines during the last centuries and persists in only a few isolated water systems. Fragmented populations generally are at greater risk of extinction, for example due to loss of genetic diversity, and may thus require conservation actions. We sequenced individuals from the three remaining native populations (Båven, Emån, and Möckeln) and one reintroduced population of admixed origin (Helge å), and found that genetic diversity was highest in Emån but low overall, with strong differentiation among the populations. No signature of recent inbreeding was found, but a considerable number of short runs of homozygosity were present in all populations, likely linked to historically small population sizes and bottleneck events. Genetic substructure within any of the native populations was at best weak. Individuals from the admixed population Helge å shared most genetic ancestry with the Båven population (72%). Our results are largely in agreement with the microsatellite study, and stresses the need to protect these isolated populations at the northern edge of the distribution of the species.


2017 ◽  
Vol 114 (38) ◽  
pp. 10166-10171 ◽  
Author(s):  
Christoph Lippert ◽  
Riccardo Sabatini ◽  
M. Cyrus Maher ◽  
Eun Yong Kang ◽  
Seunghak Lee ◽  
...  

Prediction of human physical traits and demographic information from genomic data challenges privacy and data deidentification in personalized medicine. To explore the current capabilities of phenotype-based genomic identification, we applied whole-genome sequencing, detailed phenotyping, and statistical modeling to predict biometric traits in a cohort of 1,061 participants of diverse ancestry. Individually, for a large fraction of the traits, their predictive accuracy beyond ancestry and demographic information is limited. However, we have developed a maximum entropy algorithm that integrates multiple predictions to determine which genomic samples and phenotype measurements originate from the same person. Using this algorithm, we have reidentified an average of >8 of 10 held-out individuals in an ethnically mixed cohort and an average of 5 of either 10 African Americans or 10 Europeans. This work challenges current conceptions of personal privacy and may have far-reaching ethical and legal implications.


Sign in / Sign up

Export Citation Format

Share Document