Gene and Genome Sequencing: Interpreting Genetic Variation at the Nucleotide Level

Advancing RNA Virus Discovery and Biology with Whole Genome Sequencing

10.21007/etd.cghs.2021.0551 ◽

2021 ◽

Author(s):

Mariah Taylor ◽

Keyword(s):

Genetic Variation ◽

Amino Acid ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Rna Virus ◽

Animal Health ◽

Functional Domains ◽

Whole Genome ◽

Coding Region

Two RNA virus families that pose a threat to human and animal health are Hantaviridae and Coronaviridae. These RNA viruses, which originate in wildlife, continue, and will continue, to cause disease; hence, it is critical that scientific research define the mechanisms by which these viruses spill over and adapt to new hosts to become endemic. One gap in our ability to define these mechanisms is the lack of whole genome sequences for many of these viruses. To address this gap, I developed a versatile amplicon-based whole-genome sequencing (WGS) approach to identify viral genomes of hantaviruses and severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) within reservoir and spillover hosts. I used the amplicon-based WGS approach to define the genetic plasticity of viral RNA within pathogenic and nonpathogenic hantavirus species. The standing genetic variation of Andes orthohantavirus and Prospect Hill orthohantavirus was mapped, and amino acid changes occurring outside of functional domains were identified within the nucleocapsid and glycoprotein. I observed several amino acid changes in functional domains of the RNA-dependent RNA polymerase, as well as single nucleotide polymorphisms (SNPs) within the 3’ non-coding region (NCR) of the S-segment. To identify whether virus adaptation would occur within the S- and L-segments, we attempted to adapt hantaviruses in vitro in a spillover host model through passaging experiments. In early passages we identified few mutations in the M-segment, with the majority being identified in the S-segment 3’ NCR and the L-segment. This work suggests that hantavirus adaptation occurs in the S- and L-segments, although the effect of these mutants on pathology is yet to be determined. While sequencing laboratory isolates is easily accomplished, sequencing low concentrations of virus within the reservoir is a formidable task.
I further translated our amplicon-based WGS approach into a pan-oligonucleotide amplicon-based WGS approach to sequence hantavirus vRNA and mRNA from reservoir and spillover hosts in Ukraine. This approach successfully identified a novel Puumala orthohantavirus (PUUV) strain in Ukraine, and using Bayesian phylogenetics we found this strain to be associated with the PUUV Latvian lineage. Early in the SARS-CoV-2 pandemic, I applied the knowledge gained in the hantavirus WGS efforts to sequencing SARS-CoV-2 from nasopharyngeal swabs collected in April 2020. The genetic diversity of 45 SARS-CoV-2 isolates was evaluated with the methods I developed. We identified D614G, a notable mutation known for increasing transmission, in over 90% of our isolates. Two major lineages, A and B, distinguish SARS-CoV-2 variants worldwide. While most of our isolates were found within lineage B, we also identified one isolate within lineage A. We performed in vitro work which confirmed that lineage A isolates replicate poorly in the trachea as compared to the nasal cavity. Five of these isolates, which presented a unique array of mutations, were assessed in the keratin 18 human angiotensin-converting enzyme 2 (K18-hACE2) mouse model for immunological and pathogenic response. We identified a difference in pathogenesis between the A and B lineages, with emphysema being common amongst lineage A isolates. Additionally, we discovered a small cohort of likely SNPs that defined the late induction of eosinophils during infection. In summary, this work will further define the dynamics of genetic variation and plasticity within virus populations that cause disease outbreaks and will allow a deeper understanding of the virus-host relationship.

Whole genome sequencing of emerging multidrug resistant Candida auris isolates in India demonstrates low genetic variation

New Microbes and New Infections ◽

10.1016/j.nmni.2016.07.003 ◽

2016 ◽

Vol 13 ◽

pp. 77-82 ◽

Cited By ~ 90

Author(s):

C. Sharma ◽

N. Kumar ◽

R. Pandey ◽

J.F. Meis ◽

A. Chowdhary

Keyword(s):

Genetic Variation ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Multidrug Resistant ◽

Whole Genome ◽

Candida Auris

Peromyscus as a Mammalian Epigenetic Model

Genetics Research International ◽

10.1155/2012/179159 ◽

2012 ◽

Vol 2012 ◽

pp. 1-11 ◽

Cited By ~ 11

Author(s):

Kimberly R. Shorter ◽

Janet P. Crossland ◽

Denessia Webb ◽

Gabor Szalai ◽

Michael R. Felder ◽

...

Keyword(s):

Genetic Variation ◽

Genome Sequencing ◽

Environmental Effects ◽

Genetic Variants ◽

Life Histories ◽

Coat Color ◽

Deer Mice ◽

Epigenetic Variation ◽

Natural Genetic Variation ◽

Epigenetic Effects

Deer mice (Peromyscus) offer an opportunity for studying the effects of natural genetic/epigenetic variation with several advantages over other mammalian models. These advantages include the ability to study natural genetic variation and behaviors not present in other models. Moreover, their life histories in diverse habitats are well studied. Peromyscus resources include genome sequencing in progress, a nascent genetic map, and >90,000 ESTs. Here we review epigenetic studies and relevant areas of research involving Peromyscus models. These include differences in epigenetic control between species and substance effects on behavior. We also present new data on the epigenetic effects of diet on coat color using a Peromyscus model of agouti overexpression. We suggest that Peromyscus models have great potential for tying natural genetic variants to environmental effects in producing specific epigenetic effects.

Whole Genome Sequencing of A Candidate Strain for FMDV Vaccine: Genomic Structure and Genetic Variation

Molecular Pathogens ◽

10.5376/mp.2011.02.0001 ◽

2011 ◽

Cited By ~ 1

Author(s):

Li Huachun

Keyword(s):

Genetic Variation ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Genomic Structure ◽

Whole Genome ◽

Candidate Strain

Analyses of X-linked and autosomal genetic variation in population-scale whole genome sequencing

Nature Genetics ◽

10.1038/ng.877 ◽

2011 ◽

Vol 43 (8) ◽

pp. 741-743 ◽

Cited By ~ 59

Author(s):

Srikanth Gottipati ◽

Leonardo Arbiza ◽

Adam Siepel ◽

Andrew G Clark ◽

Alon Keinan

Keyword(s):

Genetic Variation ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Whole Genome ◽

Population Scale

Whole-genome sequencing for an enhanced understanding of genetic variation among South Africans

Nature Communications ◽

10.1038/s41467-017-00663-9 ◽

2017 ◽

Vol 8 (1) ◽

Cited By ~ 34

Author(s):

Ananyo Choudhury ◽

Michèle Ramsay ◽

Scott Hazelhurst ◽

Shaun Aron ◽

Soraya Bardien ◽

...

Keyword(s):

Genetic Variation ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Whole Genome ◽

South Africans

Characterizing and Interpreting Genetic Variation from Personal Genome Sequencing

Methods in Molecular Biology - Genomic Structural Variants ◽

10.1007/978-1-61779-507-7_17 ◽

2011 ◽

pp. 343-367 ◽

Cited By ~ 4

Author(s):

Anna C. V. Johansson ◽

Lars Feuk

Keyword(s):

Genetic Variation ◽

Genome Sequencing ◽

Personal Genome ◽

Personal Genome Sequencing

Corrigendum and follow-up: Whole genome sequencing of multiple CRISPR-edited mouse lines suggests no excess mutations

10.1101/154450 ◽

2017 ◽

Cited By ~ 4

Author(s):

Kellie A. Schaefer ◽

Benjamin W Darbro ◽

Diana F. Colgan ◽

Stephen H. Tsang ◽

Alexander G. Bassuk ◽

...

Keyword(s):

Genetic Variation ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Whole Genome ◽

Mouse Line ◽

Genome Data ◽

Parental Lines ◽

Mouse Lines

Our previous publication suggested CRISPR-Cas9 editing at the zygotic stage might unexpectedly introduce a multitude of subtle but unintended mutations, an interpretation that, not surprisingly, raised numerous questions. The key issue is that since parental lines were not available, might the reported variants have been inherited? To expand upon the limited available whole genome data on whether CRISPR-edited mice show more genetic variation, whole-genome sequencing was performed on two other mouse lines that had undergone a CRISPR-editing procedure. Again, parents were not available for either the Capn5 or the Fblim1 CRISPR-edited mouse lines, so strain controls were examined. We also include verification of variants detected in the initial mouse line. Taken together, these whole-genome-sequencing-level results support the idea that in specific cases, CRISPR-Cas9 editing can precisely edit the genome at the organismal level and may not introduce numerous, unintended, off-target mutations.

Mid-pass whole genome sequencing enables biomedical genetic studies of diverse populations

BMC Genomics ◽

10.1186/s12864-021-07949-9 ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Anne-Katrin Emde ◽

Amanda Phipps-Green ◽

Murray Cadzow ◽

C. Scott Gallagher ◽

Tanya J. Major ◽

...

Keyword(s):

Genetic Variation ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Whole Genome ◽

Diverse Populations ◽

Human Genetic Variation ◽

Underrepresented Populations ◽

Financial Barriers ◽

Genetic Studies ◽

Disease Relevance

Background: Historically, geneticists have relied on genotyping arrays and imputation to study human genetic variation. However, an underrepresentation of diverse populations has resulted in arrays that poorly capture global genetic variation, and a lack of reference panels. This has contributed to deepening global health disparities. Whole genome sequencing (WGS) better captures genetic variation but remains prohibitively expensive. Thus, we explored WGS at “mid-pass” 1-7x coverage. Results: Here, we developed and benchmarked methods for mid-pass sequencing. When applied to a population without an existing genomic reference panel, 4x mid-pass performed consistently well across ethnicities, with high recall (98%) and precision (97.5%). Conclusion: Compared to array data imputed into 1000 Genomes, mid-pass performed better across all metrics and identified novel population-specific variants with potential disease relevance. We hope our work will reduce financial barriers for geneticists from underrepresented populations to characterize their genomes prior to biomedical genetic applications.
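
The recall and precision figures quoted in this abstract are standard benchmarking metrics. The sketch below shows how they are usually computed for a variant call set against a truth set. It is an illustration only, not the authors' benchmarking pipeline: the function name `precision_recall`, the `(chromosome, position, ref, alt)` variant key, and all variant values are hypothetical.

```python
def precision_recall(called, truth):
    """Return (precision, recall) for a call set measured against a truth set.

    precision = true positives / all called variants
    recall    = true positives / all truth variants
    """
    called, truth = set(called), set(truth)
    true_pos = len(called & truth)  # variants present in both sets
    precision = true_pos / len(called) if called else 0.0
    recall = true_pos / len(truth) if truth else 0.0
    return precision, recall


# Hypothetical variants keyed by (chromosome, position, ref allele, alt allele).
truth = {
    ("chr1", 100, "A", "G"),
    ("chr1", 250, "C", "T"),
    ("chr2", 75, "G", "A"),
    ("chr2", 300, "T", "C"),
}
called = {
    ("chr1", 100, "A", "G"),
    ("chr1", 250, "C", "T"),
    ("chr2", 75, "G", "A"),
    ("chr3", 10, "A", "C"),  # false positive: not in the truth set
}

precision, recall = precision_recall(called, truth)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

Exact matching on the variant key is a simplification; real benchmarks typically normalize variant representation before comparing call sets.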

Faculty Opinions recommendation of Whole genome sequencing of emerging multidrug resistant Candida auris isolates in India demonstrates low genetic variation.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.726739133.793538212 ◽

2017 ◽

Author(s):

Anna Skiada

Keyword(s):

Genetic Variation ◽

Whole Genome Sequencing ◽

Genome Sequencing ◽

Multidrug Resistant ◽

Whole Genome ◽

Candida Auris