Phylogenetic analysis of SARS-CoV-2 data is difficult

Author(s):  
Benoit Morel ◽  
Pierre Barbera ◽  
Lucas Czech ◽  
Ben Bettisworth ◽  
Lukas Hübner ◽  
...  

Abstract Numerous studies covering some aspects of SARS-CoV-2 data analyses are being published on a daily basis, including a regularly updated phylogeny on nextstrain.org. Here, we review the difficulties of inferring reliable phylogenies by example of a data snapshot comprising a quality-filtered subset of 8,736 out of all 16,453 virus sequences available on May 5, 2020 from gisaid.org. We find that it is difficult to infer a reliable phylogeny on these data due to the large number of sequences in conjunction with the low number of mutations. We further find that rooting the inferred phylogeny with some degree of confidence either via the bat and pangolin outgroups or by applying novel computational methods on the ingroup phylogeny does not appear to be credible. Finally, an automatic classification of the current sequences into sub-classes using the mPTP tool for molecular species delimitation is also, as might be expected, not possible, as the sequences are too closely related. We conclude that, although the application of phylogenetic methods to disentangle the evolution and spread of COVID-19 provides some insight, results of phylogenetic analyses, in particular those conducted under the default settings of current phylogenetic inference tools, as well as downstream analyses on the inferred phylogenies, should be considered and interpreted with extreme caution.
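One reason a large number of sequences makes tree search hard can be illustrated with a standard combinatorial fact: the number of distinct unrooted binary tree topologies on n taxa is (2n-5)!!. The sketch below is an illustration added here, not a computation from the paper; the function name is hypothetical, and the result is returned as a base-10 logarithm because the count itself is far too large to print.

```python
import math


def log10_unrooted_topologies(n_taxa: int) -> float:
    """Return log10 of (2n-5)!!, the number of unrooted binary
    tree topologies on n_taxa labelled tips (n_taxa >= 3)."""
    if n_taxa < 3:
        raise ValueError("need at least 3 taxa")
    # (2n-5)!! = 3 * 5 * ... * (2n-5); summing logarithms avoids
    # building an integer with tens of thousands of digits.
    return sum(math.log10(k) for k in range(3, 2 * n_taxa - 4, 2))


if __name__ == "__main__":
    # 8,736 is the size of the quality-filtered subset in the abstract.
    for n in (5, 50, 8736):
        print(n, round(log10_unrooted_topologies(n), 1))
```

For the 8,736 sequences in the snapshot, the count has more than 30,000 decimal digits, so any inference tool can only sample a vanishing fraction of candidate trees, and a low number of mutations gives little signal to discriminate among them.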

Phytotaxa ◽  
2014 ◽  
Vol 159 (4) ◽  
pp. 241 ◽  
Author(s):  
Yu-lan Peng ◽  
Yu Zhang ◽  
Xin-fen Gao ◽  
Lin-jing Tong ◽  
Liang Li ◽  
...  

The systematic position of Paraixeris humifusa (Asteraceae) is hard to define, because the circumscription of Paraixeris, Youngia and Crepidiastrum, three closely related genera in subtribe Crepidinae (Cichorieae), is not clear. This paper reports on the relationships between 30 species in subtribe Crepidinae, based on an analysis of nucleotides from one nuclear (ITS) and three chloroplast DNA regions (trnL-F, rps16 and atpB-rbcL). The phylogenetic analyses used maximum parsimony with maximum likelihood inference. The monophyly of Crepidiastrum in the most recent generic classification of Shih & Kilian (2011) is explored. The results show that 12 species in Crepidiastrum constitute a monophyletic group, and that Paraixeris humifusa should be treated as Youngia humifusa.


Paleobiology ◽  
1990 ◽  
Vol 16 (1) ◽  
pp. 25-48 ◽  
Author(s):  
Rich Mooi

Convincing hypotheses of the origin of major invertebrate groups are difficult to make in the absence of phylogenetic analyses. In spite of this, several scenarios exist for the origin of the unusual echinoid order Clypeasteroida. I expand upon the most probable of these models by performing a phylogenetic analysis on three clypeasteroid suborders, the enigmatic fossil genus Togocyamus, and the extinct Oligopygoida. This analysis shows that the oligopygoids are the sister group of the Clypeasteroida plus Togocyamus. The latter is here considered a plesion (extinct sister group) to the crown group Clypeasteroida. Within that order, the suborder Clypeasterina is the sister group to the Laganina plus Scutellina. A new classification of all these taxa is presented. The phylogeny is based on 47 characters and incorporates data on external appendages, Aristotle's lantern anatomy, and test structure of irregular echinoids, as well as new information on the morphology of Togocyamus. The earliest clypeasteroids had a lantern similar to that of adult oligopygoids, which in turn inherited their lantern from a cassiduloid-like ancestor that retained the lantern into adulthood. This lantern is absent in adult cassiduloids. Subsequent changes, including modification of the lantern into a crushing mill, extreme flattening of the test, and proliferation of food-gathering tube feet have allowed clypeasteroids to become epifaunal inhabitants of environments characterized by fine, shifting substrates, a habitat previously inaccessible to most other irregular echinoids.


2008 ◽  
Vol 82 (23) ◽  
pp. 11545-11554 ◽  
Author(s):  
Zhiguo Liang ◽  
A. S. Manoj Kumar ◽  
Morris S. Jones ◽  
Nick J. Knowles ◽  
Howard L. Lipton

ABSTRACT The Cardiovirus genus of the family Picornaviridae includes two distinct species, Encephalomyocarditis virus and Theilovirus. We now report the complete nucleotide sequences of three Theiler's murine encephalomyelitis virus (TMEV) strains (TO Yale, TOB15, and Vie 415HTR) and of Vilyuisk human encephalomyelitis virus (VHEV). This information, together with the recently reported sequences of divergent theiloviruses (Theiler's-like rat virus [TRV] and Saffold viruses 1 and 2 [SAFV-1 and SAFV-2]), enables an updated phylogenetic analysis as well as a reexamination of several gene products important in the pathogenesis of this emerging group of viruses. In the light of the known neurotropism of TMEV and the new human SAFV-1 and SAFV-2, the resulting data suggest the existence of theiloviruses that cause human central nervous system infections. Our phylogenetic analyses point to the classification of presently known theiloviruses into five types: TMEV, VHEV, TRV, SAFV-1, and SAFV-2.


Zootaxa ◽  
2007 ◽  
Vol 1668 (1) ◽  
pp. 327-341 ◽  
Author(s):  
GREGORY D. EDGECOMBE

Breakthroughs in centipede systematics over the past 25 years have included: a stable morphology-based cladogram for ordinal interrelationships that is largely congruent with well-sampled nuclear ribosomal genes; the discovery of mid Palaeozoic crown-group fossils, including Silurian-Devonian stem-group Scutigeromorpha and an extinct order in the Middle Devonian; and, a web-based catalogue of all centipede species globally. Challenges include species delimitation in several groups, conflict between different kinds of molecular data (nuclear coding genes versus ribosomal genes), the inter-familial relationships and classification of the Geophilomorpha in particular, and effecting a synthesis between microanatomical studies of selected ‘model’ species and dense taxonomic sampling in numerical phylogenetic analyses.


PeerJ ◽  
2017 ◽  
Vol 5 ◽  
pp. e3055 ◽  
Author(s):  
Andrea Cau

Bayesian phylogenetic methods integrating simultaneously morphological and stratigraphic information have been applied increasingly among paleontologists. Most of these studies have used Bayesian methods as an alternative to the widely-used parsimony analysis, to infer macroevolutionary patterns and relationships among species-level or higher taxa. Among recently introduced Bayesian methodologies, the Fossilized Birth-Death (FBD) model allows incorporation of hypotheses on ancestor-descendant relationships in phylogenetic analyses including fossil taxa. Here, the FBD model is used to infer the relationships among an ingroup formed exclusively by fossil individuals, i.e., dipnoan tooth plates from four localities in the Ain el Guettar Formation of Tunisia. Previous analyses of this sample compared the results of phylogenetic analysis using parsimony with stratigraphic methods, inferred a high diversity (five or more genera) in the Ain el Guettar Formation, and interpreted it as an artifact inflated by depositional factors. In the analysis performed here, the uncertainty on the chronostratigraphic relationships among the specimens was included among the prior settings. The results of the analysis confirm the referral of most of the specimens to the taxa Asiatoceratodus, Equinoxiodus, Lavocatodus and Neoceratodus, but reject those to Ceratodus and Ferganoceratodus. The resulting phylogeny constrained the evolution of the Tunisian sample exclusively in the Early Cretaceous, contrasting with the previous scenario inferred by the stratigraphically-calibrated topology resulting from parsimony analysis. The phylogenetic framework also suggests that (1) the sampled localities are laterally equivalent, (2) but three localities are restricted to the youngest part of the section; both results are in agreement with previous stratigraphic analyses of these localities. The FBD model of specimen-level units provides a novel tool for phylogenetic inference among fossils but also for independent tests of stratigraphic scenarios.


2020 ◽  
pp. 1-37
Author(s):  
Markéta Kirstová ◽  
Robin Kundrata ◽  
Petr Kočárek

Abstract We present herein the first phylogenetic analysis of the genus Chelidura and the taxonomic revision of the genus Chelidurella, stat. restit., based on DNA sequences. The results confirm the generic status of Chelidurella Verhoeff, 1902 and Mesochelidura Verhoeff, 1902, and they are removed from the synonymy with Chelidura and reinstated as valid genera. Many individual Chelidurella species are defined based on the combination of a few variable characters on the pygidium and forceps, and the systematics and phylogeny of this genus are unclear. The validity of most of the species is revisited here by molecular phylogenetic analyses, and individual morphological characters are evaluated for their relevance in the identification of all described species. We describe two new species to science, Chelidurella galvagnii Kirstová & Kočárek, sp. nov. from Austria, and C. pseudovignai Kočárek & Kirstová, sp. nov. from Italy and Austria; two species, C. guentheri Galvagni, 1994 and C. tatrica Chládek, 2017 are newly synonymized. Critical diagnostic characters are illustrated, and an identification key for males of Chelidurella is provided.


Pathogens ◽  
2021 ◽  
Vol 10 (2) ◽  
pp. 241
Author(s):  
Joon Moh Park ◽  
Jachoon Koo ◽  
Se Won Kang ◽  
Sung Hee Jo ◽  
Jeong Mee Park

Rhodococcus fascians is an important pathogen that infects various herbaceous perennials and reduces their economic value. In this study, we examined R. fascians isolates carrying a virulence gene from symptomatic lily plants grown in South Korea. Phylogenetic analysis using the nucleotide sequences of 16S rRNA, vicA, and fasD led to the classification of the isolates into four different strains of R. fascians. Inoculation of Nicotiana benthamiana with these isolates slowed root growth and resulted in symptoms of leafy gall. These findings elucidate the diversification of domestic pathogenic R. fascians and may lead to an accurate causal diagnosis to help reduce economic losses in the bulb market.


Pathogens ◽  
2021 ◽  
Vol 10 (1) ◽  
pp. 41
Author(s):  
Marcos Godoy ◽  
Daniel A. Medina ◽  
Rudy Suarez ◽  
Sandro Valenzuela ◽  
Jaime Romero ◽  
...  

Piscine orthoreovirus (PRV) belongs to the family Reoviridae and has been described mainly in association with salmonid infections. The genome of PRV consists of about 23,600 bp, with 10 segments of double-stranded RNA, classified as small (S1 to S4), medium (M1, M2 and M3) and large (L1, L2 and L3); these range approximately from 1000 bp (segment S4) to 4000 bp (segment L1). How the genetic variation among PRV strains affects the virulence for salmonids is still poorly understood. The aim of this study was to describe the molecular phylogeny of PRV based on an extensive sequence analysis of the S1 and M2 segments of PRV available in the GenBank database to date (May 2020). The analysis was extended to include new PRV sequences for S1 and M2 segments. In addition, subgenotype classifications were assigned to previously published unclassified sequences. It was concluded that the phylogenetic trees are consistent with the original classification using the PRV genomic segment S1, which differentiates PRV into two major genotypes, I and II, and each of these into two subgenotypes, designated as Ia and Ib, and IIa and IIb, respectively. Moreover, some clusters of country- and host-specific PRV subgenotypes were observed in the subset of sequences used. This work strengthens the subgenotype classification of PRV based on the S1 segment and can be used to enhance research on the virulence of PRV.


2021 ◽  
Vol 16 (1) ◽  
pp. 711-718
Author(s):  
Thuan Duc Lao ◽  
Hanh Van Trinh ◽  
Loi Vuong ◽  
Luyen Tien Vu ◽  
Thuy Ai Huyen Le ◽  
...  

Abstract The entomopathogenic fungus T011, parasitizing a nymph of Cicada collected in a coffee garden in Dak Lak Province, Vietnam, was preliminarily identified by morphology as Isaria cicadae, belonging to order Hypocreales and family Clavicipitaceae. To confirm the identity of T011, phylogenetic analysis of a concatenated set of multiple genes (ITS, nrLSU, nrSSU, Rpb1, and Tef1) was applied to support the identification. Genomic DNA was isolated from the dried sample T011. PCR and sequencing were used to amplify and sequence the ITS, nrLSU, nrSSU, Rpb1, and Tef1 genes. For phylogenetic analysis, the concatenated data of the target genes were analyzed in MEGA X with 1,000 bootstrap replicates using the neighbor-joining, maximum likelihood, and maximum parsimony methods. The concatenated dataset contained 62 sequences belonging to order Hypocreales, family Clavicipitaceae, and 2 outgroup sequences belonging to order Hypocreales, genus Verticillium. The phylogenetic analysis indicated that T011 was placed in the Cordyceps subclade and formed a monophyletic group with the reference Cordyceps cicadae (teleomorph of Isaria cicadae) with a high bootstrap value. The phylogenetic result was strongly supported by our morphological analysis, which described the specimen as Isaria cicadae. In summary, phylogenetic analyses based on the concatenated dataset were successfully applied to strengthen the identification of T011 as Isaria cicadae.
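The 1,000-replicate bootstrap mentioned above rests on a simple resampling step: alignment columns are drawn with replacement to build pseudo-alignments of the original length, and a tree is then inferred from each. The following is a minimal sketch of that resampling step only, under stated assumptions: the toy sequences and the function name are hypothetical, and this is not how MEGA X is implemented internally.

```python
import random


def bootstrap_alignment(alignment: dict[str, str],
                        rng: random.Random) -> dict[str, str]:
    """Build one bootstrap replicate by sampling alignment
    columns (sites) with replacement."""
    lengths = {len(seq) for seq in alignment.values()}
    if len(lengths) != 1:
        raise ValueError("all aligned sequences must have the same length")
    n_sites = lengths.pop()
    # Draw n_sites column indices with replacement.
    cols = [rng.randrange(n_sites) for _ in range(n_sites)]
    # Apply the same column draw to every sequence so sites stay aligned.
    return {name: "".join(seq[i] for i in cols)
            for name, seq in alignment.items()}


# Hypothetical toy alignment; real input would be the concatenated genes.
toy = {"T011": "ACGTAC", "ref_A": "ACGTTC", "outgroup": "TCGAAC"}
rng = random.Random(0)  # fixed seed so the sketch is reproducible
replicates = [bootstrap_alignment(toy, rng) for _ in range(1000)]
```

The bootstrap value of a clade is then the fraction of replicate trees in which that clade appears; the tree inference itself is left out of this sketch.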


2018 ◽  
Vol 44 (1) ◽  
pp. 20
Author(s):  
Eloiza Teles Caldart ◽  
Helena Mata ◽  
Cláudio Wageck Canal ◽  
Ana Paula Ravazzolo

Background: Phylogenetic analyses are an essential part of the exploratory assessment of nucleic acid and amino acid sequences. Particularly in virology, they are able to delineate the evolution and epidemiology of disease etiologic agents and/or the evolutionary path of their hosts. The objective of this review is to help researchers who want to use phylogenetic analyses as a tool in virology and molecular epidemiology studies, presenting the most commonly used methodologies, describing the importance of the different techniques, their peculiar vocabulary and some examples of their use in virology. Review: This article starts by presenting basic concepts of molecular epidemiology and molecular evolution, emphasizing their relevance in the context of viral infectious diseases. It presents a section on the vocabulary relevant to the subject, bringing readers to the minimum level of knowledge needed throughout this literature review. Within its main subject, the text explains what a molecular phylogenetic analysis is, starting from a multiple alignment of nucleotide or amino acid sequences. The different software used to perform multiple alignments may apply different algorithms. To build a phylogeny based on amino acid or nucleotide sequences it is necessary to produce a data matrix based on a model for nucleotide or amino acid replacement, also called an evolutionary model. There are a number of evolutionary models available, varying in complexity according to the number of parameters (transition, transversion, GC content, nucleotide position in the codon, among others). Some papers presented herein provide techniques that can be used to choose evolutionary models. After the model is chosen, the next step is to opt for a phylogenetic reconstruction method that best fits the available data and the selected model. Here we present the most common reconstruction methods currently used, describing their principles, advantages and disadvantages. Distance methods, for example, are simpler and faster; however, they do not provide reliable estimations when the sequences are highly divergent. The accuracy of the analysis with probabilistic models (neighbour joining, maximum likelihood and Bayesian inference) strongly depends on the adherence of the actual data to the chosen evolutionary model. Finally, we also explore topology confidence tests, especially the most used one, the bootstrap. To assist the reader, this review presents figures to explain specific situations discussed in the text and numerous examples of previously published scientific articles in virology that demonstrate the importance of the techniques discussed herein, as well as their judicious use. Conclusion: The DNA sequence is not only a record of phylogeny and divergence times, but also keeps signs of how the evolutionary process has shaped its history and also the elapsed time in the evolutionary process of the population. Analyses of genomic sequences by molecular phylogeny have demonstrated a broad spectrum of applications. It is important to note that for the different available data and different purposes of phylogenies, reconstruction methods and evolutionary models should be wisely chosen. This review provides a theoretical basis for the choice of evolutionary models and phylogenetic reconstruction methods best suited to each situation. In addition, it presents examples of diverse applications of molecular phylogeny in virology.
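The remark that distance methods become unreliable for highly divergent sequences can be made concrete with the simplest nucleotide substitution model, Jukes-Cantor (JC69), which converts the observed proportion of differing sites p into an estimated number of substitutions per site, d = -(3/4) ln(1 - 4p/3). The sketch below is an illustration chosen here, not a method taken from the review; it assumes gap-free, equal-length aligned sequences, and the function names are hypothetical.

```python
import math


def p_distance(a: str, b: str) -> float:
    """Proportion of aligned sites at which two sequences differ."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be aligned, equal-length and non-empty")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def jc69_distance(p: float) -> float:
    """Jukes-Cantor corrected distance for an observed proportion p.

    The correction diverges as p approaches 0.75, the value expected
    for unrelated random sequences, so saturated pairs return infinity.
    """
    if p >= 0.75:
        return math.inf
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


if __name__ == "__main__":
    for p in (0.01, 0.25, 0.60, 0.74):
        print(p, jc69_distance(p))
```

For small p the corrected distance stays close to p, but it grows without bound near p = 0.75, so a small sampling error in p produces a large error in d. That is one way to see why distance-based estimates degrade for highly divergent sequences.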

