scholarly journals Identifying Selection in the Within-Host Evolution of Influenza Using Viral Sequence Data

2014 ◽  
Vol 10 (7) ◽  
pp. e1003755 ◽  
Author(s):  
Christopher J. R. Illingworth ◽  
Andrej Fischer ◽  
Ville Mustonen
2020 ◽  
Vol 12 (s1) ◽  
Author(s):  
Rami Kantor ◽  
John P. Fulton ◽  
Jon Steingrimsson ◽  
Vladimir Novitsky ◽  
Mark Howison ◽  
...  

AbstractGreat efforts are devoted to end the HIV epidemic as it continues to have profound public health consequences in the United States and throughout the world, and new interventions and strategies are continuously needed. The use of HIV sequence data to infer transmission networks holds much promise to direct public heath interventions where they are most needed. As these new methods are being implemented, evaluating their benefits is essential. In this paper, we recognize challenges associated with such evaluation, and make the case that overcoming these challenges is key to the use of HIV sequence data in routine public health actions to disrupt HIV transmission networks.


Author(s):  
Lina Wang ◽  
Fengzhen Chen ◽  
Xueqin Guo ◽  
Lijin You ◽  
Xiaoxia Yang ◽  
...  

AbstractMotivationThe Coronavirus Disease 2019 (COVID-19) pandemic poses a huge threat to human public health. Viral sequence data plays an important role in the scientific prevention and control of epidemics. A comprehensive virus database will be vital useful for virus data retrieval and deep analysis. To promote sharing of virus data, several virus databases and related analyzing tools have been created.ResultsTo facilitate virus research and promote the global sharing of virus data, we present here VirusDIP, a one-stop service platform for archive, integration, access, analysis of virus data. It accepts the submission of viral sequence data from all over the world and currently integrates data resources from the National GeneBank Database (CNGBdb), Global initiative on sharing all influenza data (GISAID), and National Center for Biotechnology Information (NCBI). Moreover, based on the comprehensive data resources, BLAST sequence alignment tool and multi-party security computing tools are deployed for multi-sequence alignment, phylogenetic tree building and global trusted sharing. VirusDIP is gradually establishing cooperation with more databases, and paving the way for the analysis of virus origin and evolution. All public data in VirusDIP are freely available for all researchers worldwide.Availabilityhttps://db.cngb.org/virus/[email protected]


2017 ◽  
Author(s):  
John T. McCrone ◽  
Robert J. Woods ◽  
Emily T. Martin ◽  
Ryan E. Malosh ◽  
Arnold S. Monto ◽  
...  

AbstractThe global evolutionary dynamics of influenza virus ultimately derive from processes that take place within and between infected individuals. Here we define the dynamics of influenza A virus populations in human hosts through next generation sequencing of 249 specimens from 200 individuals collected over 6290 person-seasons of observation. Because these viruses were collected over 5 seasons from individuals in a prospective community-based cohort, they are broadly representative of natural human infections with seasonal viruses. We used viral sequence data from 35 serially sampled individuals to estimate a within host effective population size of 30-70 and an in vivo mutation rate of 4x10−5 per nucleotide per cellular infectious cycle. These estimates are consistent across several models and robust to the models' underlying assumptions. We also identified 43 epidemiologically linked and genetically validated transmission pairs. Maximum likelihood optimization of multiple transmission models estimates an effective transmission bottleneck of 1-2 distinct genomes. Our data suggest that positive selection of novel viral variants is inefficient at the level of the individual host and that genetic drift and other stochastic processes dominate the within and between host evolution of influenza A viruses.


Author(s):  
Gianluigi Rossi ◽  
Joseph Crispell ◽  
Daniel Balaz ◽  
Samantha J. Lycett ◽  
Richard J. Delahay ◽  
...  

AbstractEstablished methods for whole-genome-sequencing (WGS) technology allow for the detection of single-nucleotide polymorphisms (SNPs) in the pathogen genomes sourced from host samples. The information obtained can be used to track the pathogen’s evolution in time and potentially identify ‘who-infected-whom’ with unprecedented accuracy. Successful methods include ‘phylodynamic approaches’ that integrate evolutionary and epidemiological data. However, they are typically computationally intensive, require extensive data, and are best applied when there is a strong molecular clock signal and substantial pathogen diversity.To determine how much transmission information can be inferred when pathogen genetic diversity is low and metadata limited, we propose an analytical approach that combines pathogen WGS data and sampling times from infected hosts. It accounts for ‘between-scale’ processes, in particular within-host pathogen evolution and between-host transmission. We applied this to a well-characterised population with an endemic Mycobacterium bovis (the causative agent of bovine/zoonotic tuberculosis, bTB) infection.Our results show that, even with such limited data and low diversity, the computation of the transmission probability between host pairs can help discriminate between likely and unlikely infection pathways and therefore help to identify potential transmission networks, but can be sensitive to assumptions about within-host evolution.


2021 ◽  
Vol 118 (23) ◽  
pp. e2023202118
Author(s):  
Michael J. Tisza ◽  
Christopher B. Buck

Despite remarkable strides in microbiome research, the viral component of the microbiome has generally presented a more challenging target than the bacteriome. This gap persists, even though many thousands of shotgun sequencing runs from human metagenomic samples exist in public databases, and all of them encompass large amounts of viral sequence data. The lack of a comprehensive database for human-associated viruses has historically stymied efforts to interrogate the impact of the virome on human health. This study probes thousands of datasets to uncover sequences from over 45,000 unique virus taxa, with historically high per-genome completeness. Large publicly available case-control studies are reanalyzed, and over 2,200 strong virus–disease associations are found.


2010 ◽  
Vol 7 (48) ◽  
pp. 1119-1127 ◽  
Author(s):  
J. Conrad Stack ◽  
J. David Welch ◽  
Matt J. Ferrari ◽  
Beth U. Shapiro ◽  
Bryan T. Grenfell

With more emphasis being put on global infectious disease monitoring, viral genetic data are being collected at an astounding rate, both within and without the context of a long-term disease surveillance plan. Concurrent with this increase have come improvements to the sophisticated and generalized statistical techniques used for extracting population-level information from genetic sequence data. However, little research has been done on how the collection of these viral sequence data can or does affect the efficacy of the phylogenetic algorithms used to analyse and interpret them. In this study, we use epidemic simulations to consider how the collection of viral sequence data clarifies or distorts the picture, provided by the phylogenetic algorithms, of the underlying population dynamics of the simulated viral infection over many epidemic cycles. We find that sampling protocols purposefully designed to capture sequences at specific points in the epidemic cycle, such as is done for seasonal influenza surveillance, lead to a significantly better view of the underlying population dynamics than do less-focused collection protocols. Our results suggest that the temporal distribution of samples can have a significant effect on what can be inferred from genetic data, and thus highlight the importance of considering this distribution when designing or evaluating protocols and analysing the data collected thereunder.


eLife ◽  
2018 ◽  
Vol 7 ◽  
Author(s):  
John T McCrone ◽  
Robert J Woods ◽  
Emily T Martin ◽  
Ryan E Malosh ◽  
Arnold S Monto ◽  
...  

The evolutionary dynamics of influenza virus ultimately derive from processes that take place within and between infected individuals. Here we define influenza virus dynamics in human hosts through sequencing of 249 specimens from 200 individuals collected over 6290 person-seasons of observation. Because these viruses were collected from individuals in a prospective community-based cohort, they are broadly representative of natural infections with seasonal viruses. Consistent with a neutral model of evolution, sequence data from 49 serially sampled individuals illustrated the dynamic turnover of synonymous and nonsynonymous single nucleotide variants and provided little evidence for positive selection of antigenic variants. We also identified 43 genetically-validated transmission pairs in this cohort. Maximum likelihood optimization of multiple transmission models estimated an effective transmission bottleneck of 1–2 genomes. Our data suggest that positive selection is inefficient at the level of the individual host and that stochastic processes dominate the host-level evolution of influenza viruses.


2020 ◽  
Author(s):  
Colin Young ◽  
Sarah Meng ◽  
Niema Moshiri

AbstractThe use of computational techniques to analyze viral sequence data and ultimately inform public health intervention has become increasingly common in the realm of epidemiology. These methods typically attempt to make epidemiological inferences based on multiple sequence alignments and phylogenies estimated from the raw sequence data. Like all estimation techniques, multiple sequence alignment and phylogenetic inference tools are error-prone, and the impacts of such imperfections on downstream epidemiological inferences are poorly understood. To address this, we executed multiple commonly-used workflows for conducting viral phylogenetic analyses on simulated viral sequence data modeling HIV, HCV, and Ebola, and we computed multiple methods of accuracy motivated by transmission clustering techniques. For multiple sequence alignment, MAFFT consistently outperformed MUSCLE and Clustal Omega in both accuracy and runtime. For phylogenetic inference, FastTree 2, IQ-TREE, RAxML-NG, and PhyML had similar topological accuracies, but branch lengths and pairwise distances were consistently most accurate in phylogenies inferred by RAxML-NG. However, FastTree 2 was orders of magnitude faster than the other tools, and when the other tools were used to optimize branch lengths along a fixed topology provided by FastTree 2 (i.e., no tree search), the resulting phylogenies had accuracies that were indistinguishable from their original counterparts, but with a fraction of the runtime. Our results indicate that an ideal workflow for viral phylogenetic inference is to (1) use MAFFT to perform MSA, (2) use FastTree 2 under the GTR model with discrete gamma-distributed site-rate heterogeneity to quickly obtain a reasonable tree topology, and (3) use RAxML-NG to optimize branch lengths along the fixed FastTree 2 topology.


2017 ◽  
Author(s):  
Gytis Dudas ◽  
Luiz Max Carvalho ◽  
Andrew Rambaut ◽  
Trevor Bedford ◽  
Ali M. Somily ◽  
...  

AbstractMiddle East respiratory syndrome coronavirus (MERS-CoV) is a zoonotic virus from camels causing significant mortality and morbidity in humans in the Arabian Peninsula. The epidemiology of the virus remains poorly understood, and while case-based and seroepidemiological studies have been employed extensively throughout the epidemic, viral sequence data have not been utilised to their full potential. Here we use existing MERS-CoV sequence data to explore its phylodynamics in two of its known major hosts, humans and camels. We employ structured coalescent models to show that long-term MERS-CoV evolution occurs exclusively in camels, whereas humans act as a transient, and ultimately terminal host. By analysing the distribution of human outbreak cluster sizes and zoonotic introduction times we show that human outbreaks in the Arabian peninsula are driven by seasonally varying zoonotic transfer of viruses from camels. Without heretofore unseen evolution of host tropism, MERS-CoV is unlikely to become endemic in humans.


Sign in / Sign up

Export Citation Format

Share Document