scholarly journals Phylommand - a command line software package for phylogenetics

F1000Research ◽  
2016 ◽  
Vol 5 ◽  
pp. 2903
Author(s):  
Martin Ryberg

Phylogenetics is an intrinsic part of many analyses in evolutionary biology and ecology, and as the amount of data available for these analyses is increasing rapidly the need for automated pipelines to deal with the data also increases. Phylommand is a package of four programs to create, manipulate, and/or analyze phylogenetic trees or pairwise alignments. It is built to be easily implemented in software workflows, both directly on the command prompt, and executed using scripts. Inputs can be taken from standard input or a file, and the behavior of the programs can be changed through switches. By using standard file formats for phylogenetic analyses, such as newick, nexus, phylip, and fasta, phylommand is widely compatible with other software.

2021 ◽  
Vol 9 ◽  
Author(s):  
Caio Ribeiro ◽  
Lucas Oliveira ◽  
Romina Batista ◽  
Marcos De Sousa

The use of Ultraconserved Elements (UCEs) as genetic markers in phylogenomics has become popular and has provided promising results. Although UCE data can be easily obtained from targeted enriched sequencing, the protocol for in silico analysis of UCEs consist of the execution of heterogeneous and complex tools, a challenge for scientists without training in bioinformatics. Developing tools with the adoption of best practices in research software can lessen this problem by improving the execution of computational experiments, thus promoting better reproducibility. We present UCEasy, an easy-to-install and easy-to-use software package with a simple command line interface that facilitates the computational analysis of UCEs from sequencing samples, following the best practices of research software. UCEasy is a wrapper that standardises, automates and simplifies the quality control of raw reads, assembly and extraction and alignment of UCEs, generating at the end a data matrix with different levels of completeness that can be used to infer phylogenetic trees. We demonstrate the functionalities of UCEasy by reproducing the published results of phylogenomic studies of the bird genus Turdus (Aves) and of Adephaga families (Coleoptera) containing genomic datasets to efficiently extract UCEs.


Author(s):  
Jakob McBroome ◽  
Bryan Thornlow ◽  
Angie S Hinrichs ◽  
Alexander Kramer ◽  
Nicola De Maio ◽  
...  

Abstract The vast scale of SARS-CoV-2 sequencing data has made it increasingly challenging to comprehensively analyze all available data using existing tools and file formats. To address this, we present a database of SARS-CoV-2 phylogenetic trees inferred with unrestricted public sequences, which we update daily to incorporate new sequences. Our database uses the recently-proposed mutation-annotated tree (MAT) format to efficiently encode the tree with branches labeled with parsimony-inferred mutations, as well as Nextstrain clade and Pango lineage labels at clade roots. As of June 9, 2021, our SARS-CoV-2 MAT consists of 834,521 sequences and provides a comprehensive view of the virus' evolutionary history using public data. We also present matUtils—a command-line utility for rapidly querying, interpreting and manipulating the MATs. Our daily-updated SARS-CoV-2 MAT database and matUtils software are available at http://hgdownload.soe.ucsc.edu/goldenPath/wuhCor1/UShER_SARS-CoV-2/ and https://github.com/yatisht/usher, respectively.


2015 ◽  
Author(s):  
Lucas Czech ◽  
Jaime Huerta-Cepas ◽  
Alexandros Stamatakis

AbstractPhylogenetic trees are routinely visualized to present and interpret the evolutionary relationships of species. Virtually all empirical evolutionary data studies contain a visualization of the inferred tree with branch support values. Ambiguous semantics in tree file formats can lead to erroneous tree visualizations and therefore to incorrect interpretations of phylogenetic analyses.Here, we discuss problems that can and do arise when displaying branch values on trees after re-rooting. Branch values are typically stored as node labels in the widely-used Newick tree format. However, such values are attributes of branches. Storing them as node labels can therefore yield errors when re-rooting trees. This depends on the mostly implicit semantics that tools deploy to interpret node labels.We reviewed 10 tree viewers and 10 bioinformatics toolkits that can display and re-root trees. We found that 14 out of 20 of these tools do not permit users to select the semantics of node labels. Thus, unaware users might obtain incorrect results when rooting trees inferred by common phylogenetic inference programs. We illustrate such incorrect mappings for several test cases and real examples taken from the literature. This review has already led to improvements and workarounds in 8 of the tested tools. We suggest tools should provide an option that explicitly forces users to define the semantics of node labels.


ALGAE ◽  
2021 ◽  
Vol 36 (4) ◽  
pp. 333-340
Author(s):  
Seongmin Cheon ◽  
Sung-Gwon Lee ◽  
Hyun-Hee Hong ◽  
Hyun-Gwan Lee ◽  
Kwang Young Kim ◽  
...  

Phylotranscriptomics is the study of phylogenetic relationships among taxa based on their DNA sequences derived from transcriptomes. Because of the relatively low cost of transcriptome sequencing compared with genome sequencing and the fact that phylotranscriptomics is almost as reliable as phylogenomics, the phylotranscriptomic analysis has recently emerged as the preferred method for studying evolutionary biology. However, it is challenging to perform transcriptomic and phylogenetic analyses together without programming expertise. This study presents a protocol for phylotranscriptomic analysis to aid marine biologists unfamiliar with UNIX command-line interface and bioinformatics tools. Here, we used transcriptomes to reconstruct a molecular phylogeny of dinoflagellate protists, a diverse and globally abundant group of marine plankton organisms whose large and complex genomic sequences have impeded conventional phylogenic analysis based on genomic data. We hope that our proposed protocol may serve as practical and helpful information for the training and education of novice phycologists.


2016 ◽  
Author(s):  
Stephen R. Bond ◽  
Karl E. Keat ◽  
Sofia N. Barreira ◽  
Andreas D. Baxevanis

AbstractThe ability to manipulate sequence, alignment, and phylogenetic tree files has become an increasingly important skill in the life sciences, whether to generate summary information or to prepare data for further downstream analysis. The command line can be an extremely powerful environment for interacting with these resources, but only if the user has the appropriate general-purpose tools on hand. BuddySuite is a collection of four independent yet interrelated command-line toolkits that facilitate each step in the workflow of sequence discovery, curation, alignment, and phylogenetic reconstruction. Most common sequence, alignment, and tree file formats are automatically detected and parsed, and over 100 tools have been implemented for manipulating these data. The project has been engineered to easily accommodate the addition of new tools, it is written in the popular programming language Python, and is hosted on the Python Package Index and GitHub to maximize accessibility. Documentation for each BuddySuite tool, including usage examples, is available at http://tiny.cc/buddysuite_wiki. All software is open source and freely available through http://research.nhgri.nih.gov/software/BuddySuite.


2012 ◽  
Vol 39 (2) ◽  
pp. 217-233 ◽  
Author(s):  
J. David Archibald

Studies of the origin and diversification of major groups of plants and animals are contentious topics in current evolutionary biology. This includes the study of the timing and relationships of the two major clades of extant mammals – marsupials and placentals. Molecular studies concerned with marsupial and placental origin and diversification can be at odds with the fossil record. Such studies are, however, not a recent phenomenon. Over 150 years ago Charles Darwin weighed two alternative views on the origin of marsupials and placentals. Less than a year after the publication of On the origin of species, Darwin outlined these in a letter to Charles Lyell dated 23 September 1860. The letter concluded with two competing phylogenetic diagrams. One showed marsupials as ancestral to both living marsupials and placentals, whereas the other showed a non-marsupial, non-placental as being ancestral to both living marsupials and placentals. These two diagrams are published here for the first time. These are the only such competing phylogenetic diagrams that Darwin is known to have produced. In addition to examining the question of mammalian origins in this letter and in other manuscript notes discussed here, Darwin confronted the broader issue as to whether major groups of animals had a single origin (monophyly) or were the result of “continuous creation” as advocated for some groups by Richard Owen. Charles Lyell had held similar views to those of Owen, but it is clear from correspondence with Darwin that he was beginning to accept the idea of monophyly of major groups.


2016 ◽  
Vol 371 (1691) ◽  
pp. 20150225 ◽  
Author(s):  
Daniele Silvestro ◽  
Alexander Zizka ◽  
Christine D. Bacon ◽  
Borja Cascales-Miñana ◽  
Nicolas Salamin ◽  
...  

Methods in historical biogeography have revolutionized our ability to infer the evolution of ancestral geographical ranges from phylogenies of extant taxa, the rates of dispersals, and biotic connectivity among areas. However, extant taxa are likely to provide limited and potentially biased information about past biogeographic processes, due to extinction, asymmetrical dispersals and variable connectivity among areas. Fossil data hold considerable information about past distribution of lineages, but suffer from largely incomplete sampling. Here we present a new dispersal–extinction–sampling (DES) model, which estimates biogeographic parameters using fossil occurrences instead of phylogenetic trees. The model estimates dispersal and extinction rates while explicitly accounting for the incompleteness of the fossil record. Rates can vary between areas and through time, thus providing the opportunity to assess complex scenarios of biogeographic evolution. We implement the DES model in a Bayesian framework and demonstrate through simulations that it can accurately infer all the relevant parameters. We demonstrate the use of our model by analysing the Cenozoic fossil record of land plants and inferring dispersal and extinction rates across Eurasia and North America. Our results show that biogeographic range evolution is not a time-homogeneous process, as assumed in most phylogenetic analyses, but varies through time and between areas. In our empirical assessment, this is shown by the striking predominance of plant dispersals from Eurasia into North America during the Eocene climatic cooling, followed by a shift in the opposite direction, and finally, a balance in biotic interchange since the middle Miocene. We conclude by discussing the potential of fossil-based analyses to test biogeographic hypotheses and improve phylogenetic methods in historical biogeography.


PLoS ONE ◽  
2020 ◽  
Vol 15 (12) ◽  
pp. e0240953
Author(s):  
Christian Schulz ◽  
Eivind Almaas

Approaches for systematizing information of relatedness between organisms is important in biology. Phylogenetic analyses based on sets of highly conserved genes are currently the basis for the Tree of Life. Genome-scale metabolic reconstructions contain high-quality information regarding the metabolic capability of an organism and are typically restricted to metabolically active enzyme-encoding genes. While there are many tools available to generate draft reconstructions, expert-level knowledge is still required to generate and manually curate high-quality genome-scale metabolic models and to fill gaps in their reaction networks. Here, we use the tool AutoKEGGRec to construct 975 genome-scale metabolic draft reconstructions encoded in the KEGG database without further curation. The organisms are selected across all three domains, and their metabolic networks serve as basis for generating phylogenetic trees. We find that using all reactions encoded, these metabolism-based comparisons give rise to a phylogenetic tree with close similarity to the Tree of Life. While this tree is quite robust to reasonable levels of noise in the metabolic reaction content of an organism, we find a significant heterogeneity in how much noise an organism may tolerate before it is incorrectly placed in the tree. Furthermore, by using the protein sequences for particular metabolic functions and pathway sets, such as central carbon-, nitrogen-, and sulfur-metabolism, as basis for the organism comparisons, we generate highly specific phylogenetic trees. We believe the generation of phylogenetic trees based on metabolic reaction content, in particular when focused on specific functions and pathways, could aid the identification of functionally important metabolic enzymes and be of value for genome-scale metabolic modellers and enzyme-engineers.


2020 ◽  
Vol 76 (01) ◽  
pp. 6334-2020
Author(s):  
ZEYNEP AKKUTAY-YOLDAR ◽  
TAYLAN KOÇ B.

Canine parvovirus (CPV) type 2 is the causative agent of acute hemorrhagic enteritis and high mortality in the affected dogs. Numerous studies have been done to understand the origin of the virus and to exhibit new variants and circulating strains. This report describes the detection and genomic characterization of CPV strains from indoor and outdoor dogs in Ankara, Turkey. Samples were sent to our laboratory due to clinical symptoms in puppies. We tested blood and swab samples to determine the presence of canine parvovirus (CPV) in three puppies and two adult dogs by reverse transcription-polymerase chain reaction (RT-PCR) using VP2 (capsid protein) region primers of canine parvoviruses. Following that, to provide molecular characterization data Maximum Likelihood (ML) method was used for phylogenetic analyses. Constructed phylogenetic trees from the aligned nucleotide sequences revealed that our CPV strains demonstrated high genetic similarities, with 100% identity match on nucleotide alignments with each other and classified in CPV-2b genotypes.They have placed on a monophyletic clade as a sister branch with CPV VAC S quantum with 98.9% nucleotide homology. Our findings suggest that CPV-2b is actual and frequently seen variant in Turkey and shows high similarities with other CPV variants and a bit less with FPVs in Turkey and around the world. CPV causes high mortality and morbidity in dogs and to develop effective vaccines for protection of dogs in Turkey where there are few numbers of studies that have been done, field strains should be isolated and characterised.


Author(s):  
Emmanuelle Jousselin ◽  
Marianne Elias

During the last two decades, ecological speciation has been a major research theme in evolutionary biology. Ecological speciation occurs when reproductive isolation between populations evolves as a result of niche differentiation. Phytophagous insects represent model systems for the study of this evolutionary process. The host-plants on which these insects feed and often spend parts of their life cycle constitute ideal agents of divergent selection for these organisms. Adaptation to feeding on different host-plant species can potentially lead to ecological specialization of populations and subsequent speciation. This process is thought to have given birth to the astonishing diversity of phytophagous insects and is often put forward in macroevolutionary scenarios of insect diversification. Consequently, numerous phylogenetic studies on phytophagous insects have aimed at testing whether speciation driven by host-plant adaptation is the main pathway for the diversification of the groups under investigation. The increasing availability of comprehensive and well-resolved phylogenies and the recent developments in phylogenetic comparative methods are offering an unprecedented opportunity to test hypotheses on insect diversification at a macroevolutionary scale, in a robust phylogenetic framework. Our purpose here is to review the contribution of phylogenetic analyses to investigate the importance of plant-mediated speciation in the diversification of phytophagous insects and to present suggestions for future developments in this field.


Sign in / Sign up

Export Citation Format

Share Document