scholarly journals Dating the Common Ancestor from an NCBI Tree of 83688 High-Quality and Full-Length SARS-CoV-2 Genomes

Viruses ◽  
2021 ◽  
Vol 13 (9) ◽  
pp. 1790
Author(s):  
Xuhua Xia

All dating studies involving SARS-CoV-2 are problematic. Previous studies have dated the most recent common ancestor (MRCA) between SARS-CoV-2 and its close relatives from bats and pangolins. However, the evolutionary rate thus derived is expected to differ from the rate estimated from sequence divergence of SARS-CoV-2 lineages. Here, I present dating results for the first time from a large phylogenetic tree with 86,582 high-quality full-length SARS-CoV-2 genomes. The tree contains 83,688 genomes with full specification of collection time. Such a large tree spanning a period of about 1.5 years offers an excellent opportunity for dating the MRCA of the sampled SARS-CoV-2 genomes. The MRCA is dated 16 August 2019, with the evolutionary rate estimated to be 0.05526 mutations/genome/day. The Pearson correlation coefficient (r) between the root-to-tip distance (D) and the collection time (T) is 0.86295. The NCBI tree also includes 10 SARS-CoV-2 genomes isolated from cats, collected over roughly the same time span as human COVID-19 infection. The MRCA from these cat-derived SARS-CoV-2 is dated 30 July 2019, with r = 0.98464. While the dating method is well known, I have included detailed illustrations so that anyone can repeat the analysis and obtain the same dating results. With 16 August 2019 as the date of the MRCA of sampled SARS-CoV-2 genomes, archived samples from respiratory or digestive tracts collected around or before 16 August 2019, or those that are not descendants of the existing SARS-CoV-2 lineages, should be particularly valuable for tracing the origin of SARS-CoV-2.

2012 ◽  
Vol 93 (5) ◽  
pp. 1035-1045 ◽  
Author(s):  
A. R. Patrício ◽  
L. H. Herbst ◽  
A. Duarte ◽  
X. Vélez-Zuazo ◽  
N. Santos Loureiro ◽  
...  

A global phylogeny for chelonid fibropapilloma-associated herpesvirus (CFPHV), the most likely aetiological agent of fibropapillomatosis (FP) in sea turtles, was inferred, using dated sequences, through Bayesian Markov chain Monte Carlo analysis and used to estimate the virus evolutionary rate independent of the evolution of the host, and to resolve the phylogenetic positions of new haplotypes from Puerto Rico and the Gulf of Guinea. Four phylogeographical groups were identified: eastern Pacific, western Atlantic/eastern Caribbean, mid-west Pacific and Atlantic. The latter comprises the Gulf of Guinea and Puerto Rico, suggesting recent virus gene flow between these two regions. One virus haplotype from Florida remained elusive, representing either an independent lineage sharing a common ancestor with all other identified virus variants or an Atlantic representative of the lineage giving rise to the eastern Pacific group. The virus evolutionary rate ranged from 1.62×10−4 to 2.22×10−4 substitutions per site per year, which is much faster than what is expected for a herpesvirus. The mean time for the most recent common ancestor of the modern virus variants was estimated at 192.90–429.71 years ago, which, although more recent than previous estimates, still supports an interpretation that the global FP pandemic is not the result of a recent acquisition of a virulence mutation(s). The phylogeographical pattern obtained seems partially to reflect sea turtle movements, whereas altered environments appear to be implicated in current FP outbreaks and in the modern evolutionary history of CFPHV.


Author(s):  
Kenneth Siu-Sing Leung ◽  
Timothy Ting-Leung Ng ◽  
Alan Ka-Lun Wu ◽  
Miranda Chong-Yee Yau ◽  
Hiu-Yin Lao ◽  
...  

AbstractInitial cases of COVID-19 reported in Hong Kong were mostly imported from China. However, most cases reported in February 2020 were locally-acquired infections, indicating local community transmissions. We extracted the demographic, clinical and epidemiological data from 50 COVID-19 patients, who accounted for 53.8% of the cases in Hong Kong by February 2020. Whole-genome sequencing of the SARS-CoV-2 were conducted to determine the phylogenetic relatedness and transmission dynamics. Only three (6.0%) patients required ICU admission. Phylogenetic analysis identified six transmission clusters. All locally-acquired cases harboured a common mutation Orf3a G251V and were clustered in two subclades in global phylogeny of SARS-CoV-2. The estimated time to the most recent common ancestor of local COVID-2019 outbreak was December 24, 2019 with an evolutionary rate of 3.04×10−3 substitutions per site per year. The reproduction number value was 1.84. Social distancing and vigilant epidemiological control are crucial to the containment of COVID-19 transmission.Article summary linesA combined epidemiological and phylogenetic analysis of early COVID-19 outbreak in Hong Kong revealed that a SARS-CoV-2 variant with ORF3a G251V mutation accounted for all locally acquired cases, and that asymptomatic carriers could be a huge public health risk for COVID-19 control.


Author(s):  
Asher D. Cutter

Chapter 5, “Genealogy in evolution,” introduces branching tree diagrams as an intuitive way to visualize the evolutionary relationships between alleles, haplotypes, individuals, and species. It describes the nomenclature of gene tree topologies, the stochasticity in tree shape across genes, and the notion of a most recent common ancestor. This chapter also covers reverse-time genealogical thinking with coalescent theory and how it integrates with predictions about nucleotide polymorphism and the site frequency spectrum. An overview of how phylogenies show between-species genealogical relationships is used to highlight the concepts of orthology and homoplasy, how to calculate and interpret different metrics of DNA sequence divergence, the role of ancestral polymorphism in creating distinct gene trees, the multiple mutational hits problem, and factors that influence calculations of the time to the most recent common ancestor for species trees versus gene trees. This chapter surveys how to think of evolution in terms of genealogies that relate gene copies within a species or among species, and how to connect ideas about gene trees to other ideas in molecular population genetics.


Genetics ◽  
1998 ◽  
Vol 150 (3) ◽  
pp. 1187-1198 ◽  
Author(s):  
Mikkel H Schierup ◽  
Xavier Vekemans ◽  
Freddy B Christiansen

Abstract Expectations for the time scale and structure of allelic genealogies in finite populations are formed under three models of sporophytic self-incompatibility. The models differ in the dominance interactions among the alleles that determine the self-incompatibility phenotype: In the SSIcod model, alleles act codominantly in both pollen and style, in the SSIdom model, alleles form a dominance hierarchy, and in SSIdomcod, alleles are codominant in the style and show a dominance hierarchy in the pollen. Coalescence times of alleles rarely differ more than threefold from those under gametophytic self-incompatibility, and transspecific polymorphism is therefore expected to be equally common. The previously reported directional turnover process of alleles in the SSIdomcod model results in coalescence times lower and substitution rates higher than those in the other models. The SSIdom model assumes strong asymmetries in allelic action, and the most recessive extant allele is likely to be the most recent common ancestor. Despite these asymmetries, the expected shape of the allele genealogies does not deviate markedly from the shape of a neutral gene genealogy. The application of the results to sequence surveys of alleles, including interspecific comparisons, is discussed.


Author(s):  
Wenjun Cheng ◽  
Tianjiao Ji ◽  
Shuaifeng Zhou ◽  
Yong Shi ◽  
Lili Jiang ◽  
...  

AbstractEchovirus 6 (E6) is associated with various clinical diseases and is frequently detected in environmental sewage. Despite its high prevalence in humans and the environment, little is known about its molecular phylogeography in mainland China. In this study, 114 of 21,539 (0.53%) clinical specimens from hand, foot, and mouth disease (HFMD) cases collected between 2007 and 2018 were positive for E6. The complete VP1 sequences of 87 representative E6 strains, including 24 strains from this study, were used to investigate the evolutionary genetic characteristics and geographical spread of E6 strains. Phylogenetic analysis based on VP1 nucleotide sequence divergence showed that, globally, E6 strains can be grouped into six genotypes, designated A to F. Chinese E6 strains collected between 1988 and 2018 were found to belong to genotypes C, E, and F, with genotype F being predominant from 2007 to 2018. There was no significant difference in the geographical distribution of each genotype. The evolutionary rate of E6 was estimated to be 3.631 × 10-3 substitutions site-1 year-1 (95% highest posterior density [HPD]: 3.2406 × 10-3-4.031 × 10-3 substitutions site-1 year-1) by Bayesian MCMC analysis. The most recent common ancestor of the E6 genotypes was traced back to 1863, whereas their common ancestor in China was traced back to around 1962. A small genetic shift was detected in the Chinese E6 population size in 2009 according to Bayesian skyline analysis, which indicated that there might have been an epidemic around that year.


Author(s):  
Ya-Fang Hu ◽  
Li-Ping Jia ◽  
Fang-Yuan Yu ◽  
Li-Ying Liu ◽  
Qin-Wei Song ◽  
...  

Abstract Background Coxsackievirus A16 (CVA16) is one of the major etiological agents of hand, foot and mouth disease (HFMD). This study aimed to investigate the molecular epidemiology and evolutionary characteristics of CVA16. Methods Throat swabs were collected from children with HFMD and suspected HFMD during 2010–2019. Enteroviruses (EVs) were detected and typed by real-time reverse transcription-polymerase chain reaction (RT-PCR) and RT-PCR. The genotype, evolutionary rate, the most recent common ancestor, population dynamics and selection pressure of CVA16 were analyzed based on viral protein gene (VP1) by bioinformatics software. Results A total of 4709 throat swabs were screened. EVs were detected in 3180 samples and 814 were CVA16 positive. More than 81% of CVA16-positive children were under 5 years old. The prevalence of CVA16 showed obvious periodic fluctuations with a high level during 2010–2012 followed by an apparent decline during 2013–2017. However, the activities of CVA16 increased gradually during 2018–2019. All the Beijing CVA16 strains belonged to sub-genotype B1, and B1b was the dominant strain. One B1c strain was detected in Beijing for the first time in 2016. The estimated mean evolutionary rate of VP1 gene was 4.49 × 10–3 substitution/site/year. Methionine gradually fixed at site-23 of VP1 since 2012. Two sites were detected under episodic positive selection, one of which (site-223) located in neutralizing linear epitope PEP71. Conclusions The dominant strains of CVA16 belonged to clade B1b and evolved in a fast evolutionary rate during 2010–2019 in Beijing. To provide more favorable data for HFMD prevention and control, it is necessary to keep attention on molecular epidemiological and evolutionary characteristics of CVA16.


Genetics ◽  
1999 ◽  
Vol 151 (3) ◽  
pp. 1217-1228 ◽  
Author(s):  
Carsten Wiuf ◽  
Jotun Hein

Abstract In this article we discuss the ancestry of sequences sampled from the coalescent with recombination with constant population size 2N. We have studied a number of variables based on simulations of sample histories, and some analytical results are derived. Consider the leftmost nucleotide in the sequences. We show that the number of nucleotides sharing a most recent common ancestor (MRCA) with the leftmost nucleotide is ≈log(1 + 4N Lr)/4Nr when two sequences are compared, where L denotes sequence length in nucleotides, and r the recombination rate between any two neighboring nucleotides per generation. For larger samples, the number of nucleotides sharing MRCA with the leftmost nucleotide decreases and becomes almost independent of 4N Lr. Further, we show that a segment of the sequences sharing a MRCA consists in mean of 3/8Nr nucleotides, when two sequences are compared, and that this decreases toward 1/4Nr nucleotides when the whole population is sampled. A measure of the correlation between the genealogies of two nucleotides on two sequences is introduced. We show analytically that even when the nucleotides are separated by a large genetic distance, but share MRCA, the genealogies will show only little correlation. This is surprising, because the time until the two nucleotides shared MRCA is reciprocal to the genetic distance. Using simulations, the mean time until all positions in the sample have found a MRCA increases logarithmically with increasing sequence length and is considerably lower than a theoretically predicted upper bound. On the basis of simulations, it turns out that important properties of the coalescent with recombinations of the whole population are reflected in the properties of a sample of low size.


Botany ◽  
2013 ◽  
Vol 91 (9) ◽  
pp. 605-613 ◽  
Author(s):  
Claudia Ciotir ◽  
Chris Yesson ◽  
Joanna Freeland

Understanding the spatial distribution of genetic diversity and its evolutionary history is an essential part of developing effective biodiversity management plans. This may be particularly true when considering the value of peripheral or disjunct populations. Although conservation decisions are often made with reference to geopolitical boundaries, many policy-makers also consider global distributions, and therefore a species’ global status may temper its regional status. Many disjunct populations can be found in the Great Lakes region of North America, including those of Bartonia paniculata subsp. paniculata, a species that has been designated as threatened in Canada but globally secure. We compared chloroplast sequences between disjunct (Canada) and core (USA) populations of B. paniculata subsp. paniculata separated by 600 km, which is the minimum distance between disjunct and core populations in this subspecies. We found that although lineages within the disjunct populations shared a relatively recent common ancestor, the genetic divergence between plants from Ontario and New Jersey was substantially greater than expected for a consubspecific comparison. A coalescence-based analysis dated the most recent common ancestor of the Canadian and US populations at approximately 534 000 years ago with the lower confidence estimate at 226 000 years ago. This substantially predates the Last Glacial Maximum and suggests that disjunct and core populations have followed independent evolutionary trajectories throughout multiple glacial–interglacial cycles. Our findings provide important insight into the diverse processes that have resulted in numerous disjunct species in the Great Lakes region and highlight a need for additional work on Canadian B. paniculata subsp. paniculata taxonomy prior to a reevaluation of its conservation value.


Author(s):  
Satoshi Nakano ◽  
Takao Fujisawa ◽  
Bin Chang ◽  
Yutaka Ito ◽  
Hideki Akeda ◽  
...  

After the introduction of the seven-valent pneumococcal conjugate vaccine, the global spread of multidrug resistant serotype 19A-ST320 strains became a public health concern. In Japan, the main genotype of serotype 19A was ST3111, and the identification rate of ST320 was low. Although the isolates were sporadically detected in both adults and children, their origin remains unknown. Thus, by combining pneumococcal isolates collected in three nationwide pneumococcal surveillance studies conducted in Japan between 2008 and 2020, we analyzed 56 serotype 19A-ST320 isolates along with 931 global isolates, using whole-genome sequencing to uncover the transmission route of the globally distributed clone in Japan. The clone was frequently detected in Okinawa Prefecture, where the U.S. returned to Japan in 1972. Phylogenetic analysis demonstrated that the isolates from Japan were genetically related to those from the U.S.; therefore, the common ancestor may have originated in the U.S. In addition, Bayesian analysis suggested that the time to the most recent common ancestor of the isolates form Japan and the U.S. was approximately the 1990s to 2000, suggesting the possibility that the common ancestor could have already spread in the U.S. before the Taiwan 19F-14 isolate was first identified in a Taiwanese hospital in 1997. The phylogeographical analysis supported the transmission of the clone from the U.S. to Japan, but the analysis could be influenced by sampling bias. These results suggested the possibility that the serotype 19A-ST320 clone had already spread in the U.S. before being imported into Japan.


Sign in / Sign up

Export Citation Format

Share Document