scholarly journals Contaminating DNA in human saliva alters the detection of variants from whole genome sequencing

2020 ◽  
Vol 10 (1) ◽  
Author(s):  
C. A. Samson ◽  
W. Whitford ◽  
R. G. Snell ◽  
J. C. Jacobsen ◽  
K. Lehnert

Abstract Cells obtained from human saliva are commonly used as an alternative DNA source when blood is difficult or less convenient to collect. Although DNA extracted from saliva is considered to be of comparable quality to that derived from blood, recent studies have shown that non-human contaminating DNA derived from saliva can confound whole genome sequencing results. The most concerning complication is that non-human reads align to the human reference genome using standard methodology, which can critically affect the resulting variant genotypes identified in a genome. We identified clusters of anomalous variants in saliva DNA derived reads which aligned in an atypical manner. These reads had only short regions of identity to the human reference sequence, flanked by soft clipped sequence. Sequence comparisons of atypically aligning reads from eight human saliva-derived samples to RefSeq genomes revealed the majority to be of bacterial origin (63.46%). To partition the non-human reads during the alignment step, a decoy of the most prevalent bacterial genome sequences was designed and utilised. This reduced the number of atypically aligning reads when trialled on the eight saliva-derived samples by 44% and most importantly prevented the associated anomalous genotype calls. Saliva derived DNA is often contaminated by DNA from other species. This can lead to non-human reads aligning to the human reference genome using current alignment best-practices, impacting variant identification. This problem can be diminished by using a bacterial decoy in the alignment process.

2019 ◽  
Vol 9 (10) ◽  
pp. 3213-3223 ◽  
Author(s):  
Giovanna Cáceres ◽  
María E. López ◽  
María I. Cádiz ◽  
Grazyella M. Yoshida ◽  
Ana Jedlicki ◽  
...  

Nile tilapia (Oreochromis niloticus) is one of the most cultivated and economically important species in world aquaculture. Intensive production promotes the use of monosex animals, due to an important dimorphism that favors male growth. Currently, the main mechanism to obtain all-male populations is the use of hormones in feeding during larval and fry phases. Identifying genomic regions associated with sex determination in Nile tilapia is a research topic of great interest. The objective of this study was to identify genomic variants associated with sex determination in three commercial populations of Nile tilapia. Whole-genome sequencing of 326 individuals was performed, and a total of 2.4 million high-quality bi-allelic single nucleotide polymorphisms (SNPs) were identified after quality control. A genome-wide association study (GWAS) was conducted to identify markers associated with the binary sex trait (males = 1; females = 0). A mixed logistic regression GWAS model was fitted and a genome-wide significant signal comprising 36 SNPs, spanning a genomic region of 536 kb in chromosome 23 was identified. Ten out of these 36 genetic variants intercept the anti-Müllerian (Amh) hormone gene. Other significant SNPs were located in the neighboring Amh gene region. This gene has been strongly associated with sex determination in several vertebrate species, playing an essential role in the differentiation of male and female reproductive tissue in early stages of development. This finding provides useful information to better understand the genetic mechanisms underlying sex determination in Nile tilapia.


2020 ◽  
Author(s):  
Alexander Smetanin ◽  
Nikita Moshkov ◽  
Tatiana V. Tatarinova

AbstractSummaryWe developed PyLAE - a new tool for determining local ancestry along a genome using whole-genome sequencing data or high-density genotyping experiments. PyLAE can process an arbitrarily large number of ancestral populations (with or without an informative prior). Since PyLAE does not involve estimation of many parameters, it can process thousands of genomes within a day. Computational efficiency, straightforward presentation of results, and an ease of installation makes PyLAE a useful tool to study admixed populations.Availability and implementationThe source code and installation manual are available at https://github.com/smetam/pylae.


2021 ◽  
Author(s):  
Severin Einspanier ◽  
Tamara Susanto ◽  
Nicole Metz ◽  
Pieter J. Wolters ◽  
Vivianne G.A.A. Vleeshouwers ◽  
...  

Early blight of potato is caused by the fungal pathogen Alternaria solani and is an increasing problem worldwide. The primary strategy to control the disease is applying fungicides such as succinate dehydrogenase inhibitors (SDHI). SDHI-resistant strains, showing reduced sensitivity to treatments, appeared in Germany in 2013, five years after introduction of SDHIs. Two primary mutations in the Sdh complex (SdhB-H278Y and SdhC-H134R) have been frequently found throughout Europe. How these resistances arose and spread, and whether they are linked to other genomic features, remains unknown. We performed whole-genome sequencing for A. solani isolates from potato fields across Europe (Germany, Sweden, Belgium, and Serbia) to better understand the pathogen's genetic diversity in general and understand the development and spread of the genetic mutations that lead to SDHI resistance. We used ancestry analysis and phylogenetics to determine the genetic background of 48 isolates. The isolates can be grouped into 7 genotypes. These genotypes do not show a geographical pattern but appear spread throughout Europe. The Sdh mutations appear in different genetic backgrounds, suggesting they arose independently, and the observed admixtures might indicate a higher adaptive potential in the fungus than previously thought. Our research gives insights into the genetic diversity of A. solani on a genome level. The mixed occurrence of different genotypes and apparent admixture in the populations indicate higher genomic complexity than anticipated. The conclusion that SDHI tolerance arose multiple times independently has important implications for future fungicide resistance management strategies. These should not solely focus on preventing the spread of isolates between locations but also on limiting population size and the selective pressure posed by fungicides in a given field to avoid the rise of new mutations in other genetic backgrounds.


2020 ◽  
Author(s):  
Sihao Xiao ◽  
Zhentian Kai ◽  
David Brown ◽  
Claire L Shovlin ◽  

SUMMARYWhole genome sequencing (WGS) is championed by the UK National Health Service (NHS) to identify genetic variants that cause particular diseases. The full potential of WGS has yet to be realised as early data analytic steps prioritise protein-coding genes, and effectively ignore the less well annotated non-coding genome which is rich in transcribed and critical regulatory regions. To address, we developed a filter, which we call GROFFFY, and validated in WGS data from hereditary haemorrhagic telangiectasia patients within the 100,000 Genomes Project. Before filter application, the mean number of DNA variants compared to human reference sequence GRCh38 was 4,867,167 (range 4,786,039-5,070,340), and one-third lay within intergenic areas. GROFFFY removed a mean of 2,812,015 variants per DNA. In combination with allele frequency and other filters, GROFFFY enabled a 99.56% reduction in variant number. The proportion of intergenic variants was maintained, and no pathogenic variants in disease genes were lost. We conclude that the filter applied to NHS diagnostic samples in the 100,000 Genomes pipeline offers an efficient method to prioritise intergenic, intronic and coding gDNA variants. Reducing the overwhelming number of variants while retaining functional genome variation of importance to patients, enhances the near-term value of WGS in clinical diagnostics.


2021 ◽  
Vol 12 ◽  
Author(s):  
Annika Brinkmann ◽  
Sophie-Luisa Ulm ◽  
Steven Uddin ◽  
Sophie Förster ◽  
Dominique Seifert ◽  
...  

Since the emergence of the Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2) in December 2019, the scientific community has been sharing data on epidemiology, diagnostic methods, and whole-genomic sequences almost in real time. The latter have already facilitated phylogenetic analyses, transmission chain tracking, protein modeling, the identification of possible therapeutic targets, timely risk assessment, and identification of novel variants. We have established and evaluated an amplification-based approach for whole-genome sequencing of SARS-CoV-2. It can be used on the miniature-sized and field-deployable sequencing device Oxford Nanopore MinION, with sequencing library preparation time of 10 min. We show that the generation of 50,000 total reads per sample is sufficient for a near complete coverage (>90%) of the SARS-CoV-2 genome directly from patient samples even if virus concentration is low (Ct 35, corresponding to approximately 5 genome copies per reaction). For patient samples with high viral load (Ct 18–24), generation of 50,000 reads in 1–2 h was shown to be sufficient for a genome coverage of >90%. Comparison to Illumina data reveals an accuracy that suffices to identify virus mutants. AmpliCoV can be applied whenever sequence information on SARS-CoV-2 is required rapidly, for instance for the identification of circulating virus mutants.


2019 ◽  
Vol 9 (1) ◽  
Author(s):  
Rueben G. Das ◽  
Doreen Becker ◽  
Vidhya Jagannathan ◽  
Orly Goldstein ◽  
Evelyn Santana ◽  
...  

Abstract Congenital stationary night blindness (CSNB), in the complete form, is caused by dysfunctions in ON-bipolar cells (ON-BCs) which are secondary neurons of the retina. We describe the first disease causative variant associated with CSNB in the dog. A genome-wide association study using 12 cases and 11 controls from a research colony determined a 4.6 Mb locus on canine chromosome 32. Subsequent whole-genome sequencing identified a 1 bp deletion in LRIT3 segregating with CSNB. The canine mutant LRIT3 gives rise to a truncated protein with unaltered subcellular expression in vitro. Genetic variants in LRIT3 have been associated with CSNB in patients although there is limited evidence regarding its apparently critical function in the mGluR6 pathway in ON-BCs. We determine that in the canine CSNB retina, the mutant LRIT3 is correctly localized to the region correlating with the ON-BC dendritic tips, albeit with reduced immunolabelling. The LRIT3-CSNB canine model has direct translational potential enabling studies to help understand the CSNB pathogenesis as well as to develop new therapies targeting the secondary neurons of the retina.


BMC Genomics ◽  
2020 ◽  
Vol 21 (1) ◽  
Author(s):  
E. A. Hisey ◽  
H. Hermans ◽  
Z. T. Lounsberry ◽  
F. Avila ◽  
R. A. Grahn ◽  
...  

Abstract Background Distichiasis, an ocular disorder in which aberrant cilia (eyelashes) grow from the opening of the Meibomian glands of the eyelid, has been reported in Friesian horses. These misplaced cilia can cause discomfort, chronic keratitis, and corneal ulceration, potentially impacting vision due to corneal fibrosis, or, if secondary infection occurs, may lead to loss of the eye. Friesian horses represent the vast majority of reported cases of equine distichiasis, and as the breed is known to be affected with inherited monogenic disorders, this condition was hypothesized to be a simply inherited Mendelian trait. Results A genome wide association study (GWAS) was performed using the Axiom 670 k Equine Genotyping array (MNEc670k) utilizing 14 cases and 38 controls phenotyped for distichiasis. An additive single locus mixed linear model (EMMAX) approach identified a 1.83 Mb locus on ECA5 and a 1.34 Mb locus on ECA13 that reached genome-wide significance (pcorrected = 0.016 and 0.032, respectively). Only the locus on ECA13 withstood replication testing (p = 1.6 × 10− 5, cases: n = 5 and controls: n = 37). A 371 kb run of homozygosity (ROH) on ECA13 was found in 13 of the 14 cases, providing evidence for a recessive mode of inheritance. Haplotype analysis (hapQTL) narrowed the region of association on ECA13 to 163 kb. Whole-genome sequencing data from 3 cases and 2 controls identified a 16 kb deletion within the ECA13 associated haplotype (ECA13:g.178714_195130del). Functional annotation data supports a tissue-specific regulatory role of this locus. This deletion was associated with distichiasis, as 18 of the 19 cases were homozygous (p = 4.8 × 10− 13). Genotyping the deletion in 955 horses from 54 different breeds identified the deletion in only 11 non-Friesians, all of which were carriers, suggesting that this could be causal for this Friesian disorder. Conclusions This study identified a 16 kb deletion on ECA13 in an intergenic region that was associated with distichiasis in Friesian horses. Further functional analysis in relevant tissues from cases and controls will help to clarify the precise role of this deletion in normal and abnormal eyelash development and investigate the hypothesis of incomplete penetrance.


2018 ◽  
Vol 59 ◽  
pp. 1-6 ◽  
Author(s):  
Mohamed M.H. Abdelbary ◽  
Laurence Senn ◽  
Estelle Moulin ◽  
Guy Prod'hom ◽  
Antony Croxatto ◽  
...  

2017 ◽  
Vol 94 (2) ◽  
pp. 151-157 ◽  
Author(s):  
Jason C Kwong ◽  
Eric P F Chow ◽  
Kerrie Stevens ◽  
Timothy P Stinear ◽  
Torsten Seemann ◽  
...  

ObjectivesDrug-resistant Neisseria gonorrhoeae are now a global public health threat. Direct transmission of antibiotic-resistant gonococci between individuals has been proposed as a driver for the increased transmission of resistance, but direct evidence of such transmission is limited. Whole-genome sequencing (WGS) has superior resolution to investigate outbreaks and disease transmission compared with traditional molecular typing methods such as multilocus sequence typing (MLST) and N. gonorrhoeae multiantigen sequence (NG-MAST). We therefore aimed to systematically investigate the transmission of N. gonorrhoeae between men in sexual partnerships using WGS to compare isolates and their resistance to antibiotics at a genome level.Methods458 couples from a large prospective cohort of men who have sex with men (MSM) tested for gonorrhoea together between 2005 and 2014 were included, and WGS was conducted on all isolates from couples where both men were culture-positive for N. gonorrhoeae. Resistance-determining sequences were identified from genome assemblies, and comparison of isolates between and within individuals was performed by pairwise single nucleotide polymorphism and pangenome comparisons, and in silico predictions of NG-MAST and MLST.ResultsFor 33 of 34 (97%; 95% CI 85% to 100%) couples where both partners were positive for gonorrhoea, the resistance-determining genes and mutations were identical in isolates from each partner (94 isolates in total). Resistance determinants in isolates from 23 of 23 (100%; 95% CI 86% to 100%) men with multisite infections were also identical within an individual. These partner and within-host isolates were indistinguishable by NG-MAST, MLST and whole genomic comparisons.ConclusionsThese data support the transmission of antibiotic-resistant strains between sexual partners as a key driver of resistance rates in gonorrhoea among MSM. This improved understanding of the transmission dynamics of N. gonorrhoeae between sexual partners will inform treatment and prevention guidelines.


Sign in / Sign up

Export Citation Format

Share Document