Whole-genome sequencing of eukaryotes: From sequencing of DNA fragments to a genome assembly

2017 ◽  
Vol 53 (6) ◽  
pp. 631-639 ◽  
Author(s):  
K. S. Zadesenets ◽  
N. I. Ershov ◽  
N. B. Rubtsov
2021 ◽  
Vol 12 ◽  
Author(s):  
Ruihan Li ◽  
Xiaoai Wang ◽  
Chao Bian ◽  
Zijian Gao ◽  
Yuanwei Zhang ◽  
...  

An adult Sinocyclocheilus maitianheensis, a surface-dwelling golden-line barbel fish, was collected from Maitian river (Kunming City, Yunnan Province, China) for whole-genome sequencing, assembly, and annotation. We obtained a genome assembly of 1.7 Gb with a scaffold N50 of 1.4 Mb and a contig N50 of 24.7 kb. A total of 39,977 protein-coding genes were annotated. Based on a comparative phylogenetic analysis of five Sinocyclocheilus species and other five representative vertebrates with published genome sequences, we found that S. maitianheensis is close to Sinocyclocheilus anophthalmus (a cave-restricted species with similar locality). Moreover, the assembled genomes of S. maitianheensis and other four Sinocyclocheilus counterparts were used for a fourfold degenerative third-codon transversion (4dTv) analysis. The recent whole-genome duplication (WGD) event was therefore estimated to occur about 18.1 million years ago. Our results also revealed a decreased tendency of copy number in many important genes related to immunity and apoptosis in cave-restricted Sinocyclocheilus species. In summary, we report the first genome assembly of S. maitianheensis, which provides a valuable genetic resource for comparative studies on cavefish biology, species protection, and practical aquaculture of this potentially economical fish.


PeerJ ◽  
2021 ◽  
Vol 9 ◽  
pp. e12446
Author(s):  
Darlene D. Wagner ◽  
Heather A. Carleton ◽  
Eija Trees ◽  
Lee S. Katz

Background Whole genome sequencing (WGS) has gained increasing importance in responses to enteric bacterial outbreaks. Common analysis procedures for WGS, single nucleotide polymorphisms (SNPs) and genome assembly, are highly dependent upon WGS data quality. Methods Raw, unprocessed WGS reads from Escherichia coli, Salmonella enterica, and Shigella sonnei outbreak clusters were characterized for four quality metrics: PHRED score, read length, library insert size, and ambiguous nucleotide composition. PHRED scores were strongly correlated with improved SNPs analysis results in E. coli and S. enterica clusters. Results Assembly quality showed only moderate correlations with PHRED scores and library insert size, and then only for Salmonella. To improve SNP analyses and assemblies, we compared seven read-healing pipelines to improve these four quality metrics and to see how well they improved SNP analysis and genome assembly. The most effective read healing pipelines for SNPs analysis incorporated quality-based trimming, fixed-width trimming, or both. The Lyve-SET SNPs pipeline showed a more marked improvement than the CFSAN SNP Pipeline, but the latter performed better on raw, unhealed reads. For genome assembly, SPAdes enabled significant improvements in healed E. coli reads only, while Skesa yielded no significant improvements on healed reads. Conclusions PHRED scores will continue to be a crucial quality metric albeit not of equal impact across all types of analyses for all enteric bacteria. While trimming-based read healing performed well for SNPs analyses, different read healing approaches are likely needed for genome assembly or other, emerging WGS analysis methodologies.


2019 ◽  
Vol 9 (10) ◽  
pp. 3213-3223 ◽  
Author(s):  
Giovanna Cáceres ◽  
María E. López ◽  
María I. Cádiz ◽  
Grazyella M. Yoshida ◽  
Ana Jedlicki ◽  
...  

Nile tilapia (Oreochromis niloticus) is one of the most cultivated and economically important species in world aquaculture. Intensive production promotes the use of monosex animals, due to an important dimorphism that favors male growth. Currently, the main mechanism to obtain all-male populations is the use of hormones in feeding during larval and fry phases. Identifying genomic regions associated with sex determination in Nile tilapia is a research topic of great interest. The objective of this study was to identify genomic variants associated with sex determination in three commercial populations of Nile tilapia. Whole-genome sequencing of 326 individuals was performed, and a total of 2.4 million high-quality bi-allelic single nucleotide polymorphisms (SNPs) were identified after quality control. A genome-wide association study (GWAS) was conducted to identify markers associated with the binary sex trait (males = 1; females = 0). A mixed logistic regression GWAS model was fitted and a genome-wide significant signal comprising 36 SNPs, spanning a genomic region of 536 kb in chromosome 23 was identified. Ten out of these 36 genetic variants intercept the anti-Müllerian (Amh) hormone gene. Other significant SNPs were located in the neighboring Amh gene region. This gene has been strongly associated with sex determination in several vertebrate species, playing an essential role in the differentiation of male and female reproductive tissue in early stages of development. This finding provides useful information to better understand the genetic mechanisms underlying sex determination in Nile tilapia.


2020 ◽  
Author(s):  
Alexander Smetanin ◽  
Nikita Moshkov ◽  
Tatiana V. Tatarinova

AbstractSummaryWe developed PyLAE - a new tool for determining local ancestry along a genome using whole-genome sequencing data or high-density genotyping experiments. PyLAE can process an arbitrarily large number of ancestral populations (with or without an informative prior). Since PyLAE does not involve estimation of many parameters, it can process thousands of genomes within a day. Computational efficiency, straightforward presentation of results, and an ease of installation makes PyLAE a useful tool to study admixed populations.Availability and implementationThe source code and installation manual are available at https://github.com/smetam/pylae.


2021 ◽  
Author(s):  
Severin Einspanier ◽  
Tamara Susanto ◽  
Nicole Metz ◽  
Pieter J. Wolters ◽  
Vivianne G.A.A. Vleeshouwers ◽  
...  

Early blight of potato is caused by the fungal pathogen Alternaria solani and is an increasing problem worldwide. The primary strategy to control the disease is applying fungicides such as succinate dehydrogenase inhibitors (SDHI). SDHI-resistant strains, showing reduced sensitivity to treatments, appeared in Germany in 2013, five years after introduction of SDHIs. Two primary mutations in the Sdh complex (SdhB-H278Y and SdhC-H134R) have been frequently found throughout Europe. How these resistances arose and spread, and whether they are linked to other genomic features, remains unknown. We performed whole-genome sequencing for A. solani isolates from potato fields across Europe (Germany, Sweden, Belgium, and Serbia) to better understand the pathogen's genetic diversity in general and understand the development and spread of the genetic mutations that lead to SDHI resistance. We used ancestry analysis and phylogenetics to determine the genetic background of 48 isolates. The isolates can be grouped into 7 genotypes. These genotypes do not show a geographical pattern but appear spread throughout Europe. The Sdh mutations appear in different genetic backgrounds, suggesting they arose independently, and the observed admixtures might indicate a higher adaptive potential in the fungus than previously thought. Our research gives insights into the genetic diversity of A. solani on a genome level. The mixed occurrence of different genotypes and apparent admixture in the populations indicate higher genomic complexity than anticipated. The conclusion that SDHI tolerance arose multiple times independently has important implications for future fungicide resistance management strategies. These should not solely focus on preventing the spread of isolates between locations but also on limiting population size and the selective pressure posed by fungicides in a given field to avoid the rise of new mutations in other genetic backgrounds.


2021 ◽  
Vol 12 ◽  
Author(s):  
Annika Brinkmann ◽  
Sophie-Luisa Ulm ◽  
Steven Uddin ◽  
Sophie Förster ◽  
Dominique Seifert ◽  
...  

Since the emergence of the Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2) in December 2019, the scientific community has been sharing data on epidemiology, diagnostic methods, and whole-genomic sequences almost in real time. The latter have already facilitated phylogenetic analyses, transmission chain tracking, protein modeling, the identification of possible therapeutic targets, timely risk assessment, and identification of novel variants. We have established and evaluated an amplification-based approach for whole-genome sequencing of SARS-CoV-2. It can be used on the miniature-sized and field-deployable sequencing device Oxford Nanopore MinION, with sequencing library preparation time of 10 min. We show that the generation of 50,000 total reads per sample is sufficient for a near complete coverage (>90%) of the SARS-CoV-2 genome directly from patient samples even if virus concentration is low (Ct 35, corresponding to approximately 5 genome copies per reaction). For patient samples with high viral load (Ct 18–24), generation of 50,000 reads in 1–2 h was shown to be sufficient for a genome coverage of >90%. Comparison to Illumina data reveals an accuracy that suffices to identify virus mutants. AmpliCoV can be applied whenever sequence information on SARS-CoV-2 is required rapidly, for instance for the identification of circulating virus mutants.


2019 ◽  
Vol 9 (1) ◽  
Author(s):  
Rueben G. Das ◽  
Doreen Becker ◽  
Vidhya Jagannathan ◽  
Orly Goldstein ◽  
Evelyn Santana ◽  
...  

Abstract Congenital stationary night blindness (CSNB), in the complete form, is caused by dysfunctions in ON-bipolar cells (ON-BCs) which are secondary neurons of the retina. We describe the first disease causative variant associated with CSNB in the dog. A genome-wide association study using 12 cases and 11 controls from a research colony determined a 4.6 Mb locus on canine chromosome 32. Subsequent whole-genome sequencing identified a 1 bp deletion in LRIT3 segregating with CSNB. The canine mutant LRIT3 gives rise to a truncated protein with unaltered subcellular expression in vitro. Genetic variants in LRIT3 have been associated with CSNB in patients although there is limited evidence regarding its apparently critical function in the mGluR6 pathway in ON-BCs. We determine that in the canine CSNB retina, the mutant LRIT3 is correctly localized to the region correlating with the ON-BC dendritic tips, albeit with reduced immunolabelling. The LRIT3-CSNB canine model has direct translational potential enabling studies to help understand the CSNB pathogenesis as well as to develop new therapies targeting the secondary neurons of the retina.


BMC Genomics ◽  
2020 ◽  
Vol 21 (1) ◽  
Author(s):  
E. A. Hisey ◽  
H. Hermans ◽  
Z. T. Lounsberry ◽  
F. Avila ◽  
R. A. Grahn ◽  
...  

Abstract Background Distichiasis, an ocular disorder in which aberrant cilia (eyelashes) grow from the opening of the Meibomian glands of the eyelid, has been reported in Friesian horses. These misplaced cilia can cause discomfort, chronic keratitis, and corneal ulceration, potentially impacting vision due to corneal fibrosis, or, if secondary infection occurs, may lead to loss of the eye. Friesian horses represent the vast majority of reported cases of equine distichiasis, and as the breed is known to be affected with inherited monogenic disorders, this condition was hypothesized to be a simply inherited Mendelian trait. Results A genome wide association study (GWAS) was performed using the Axiom 670 k Equine Genotyping array (MNEc670k) utilizing 14 cases and 38 controls phenotyped for distichiasis. An additive single locus mixed linear model (EMMAX) approach identified a 1.83 Mb locus on ECA5 and a 1.34 Mb locus on ECA13 that reached genome-wide significance (pcorrected = 0.016 and 0.032, respectively). Only the locus on ECA13 withstood replication testing (p = 1.6 × 10− 5, cases: n = 5 and controls: n = 37). A 371 kb run of homozygosity (ROH) on ECA13 was found in 13 of the 14 cases, providing evidence for a recessive mode of inheritance. Haplotype analysis (hapQTL) narrowed the region of association on ECA13 to 163 kb. Whole-genome sequencing data from 3 cases and 2 controls identified a 16 kb deletion within the ECA13 associated haplotype (ECA13:g.178714_195130del). Functional annotation data supports a tissue-specific regulatory role of this locus. This deletion was associated with distichiasis, as 18 of the 19 cases were homozygous (p = 4.8 × 10− 13). Genotyping the deletion in 955 horses from 54 different breeds identified the deletion in only 11 non-Friesians, all of which were carriers, suggesting that this could be causal for this Friesian disorder. Conclusions This study identified a 16 kb deletion on ECA13 in an intergenic region that was associated with distichiasis in Friesian horses. Further functional analysis in relevant tissues from cases and controls will help to clarify the precise role of this deletion in normal and abnormal eyelash development and investigate the hypothesis of incomplete penetrance.


2017 ◽  
Vol 94 (2) ◽  
pp. 151-157 ◽  
Author(s):  
Jason C Kwong ◽  
Eric P F Chow ◽  
Kerrie Stevens ◽  
Timothy P Stinear ◽  
Torsten Seemann ◽  
...  

ObjectivesDrug-resistant Neisseria gonorrhoeae are now a global public health threat. Direct transmission of antibiotic-resistant gonococci between individuals has been proposed as a driver for the increased transmission of resistance, but direct evidence of such transmission is limited. Whole-genome sequencing (WGS) has superior resolution to investigate outbreaks and disease transmission compared with traditional molecular typing methods such as multilocus sequence typing (MLST) and N. gonorrhoeae multiantigen sequence (NG-MAST). We therefore aimed to systematically investigate the transmission of N. gonorrhoeae between men in sexual partnerships using WGS to compare isolates and their resistance to antibiotics at a genome level.Methods458 couples from a large prospective cohort of men who have sex with men (MSM) tested for gonorrhoea together between 2005 and 2014 were included, and WGS was conducted on all isolates from couples where both men were culture-positive for N. gonorrhoeae. Resistance-determining sequences were identified from genome assemblies, and comparison of isolates between and within individuals was performed by pairwise single nucleotide polymorphism and pangenome comparisons, and in silico predictions of NG-MAST and MLST.ResultsFor 33 of 34 (97%; 95% CI 85% to 100%) couples where both partners were positive for gonorrhoea, the resistance-determining genes and mutations were identical in isolates from each partner (94 isolates in total). Resistance determinants in isolates from 23 of 23 (100%; 95% CI 86% to 100%) men with multisite infections were also identical within an individual. These partner and within-host isolates were indistinguishable by NG-MAST, MLST and whole genomic comparisons.ConclusionsThese data support the transmission of antibiotic-resistant strains between sexual partners as a key driver of resistance rates in gonorrhoea among MSM. This improved understanding of the transmission dynamics of N. gonorrhoeae between sexual partners will inform treatment and prevention guidelines.


Sign in / Sign up

Export Citation Format

Share Document