A multi-modal algorithm based on an NSGA-II scheme for phylogenetic tree inference

Biosystems ◽  
2022 ◽  
pp. 104606
Author(s):  
Manuel Villalobos-Cid ◽  
César Rivera ◽  
Eduardo I. Kessi-Pérez ◽  
Mario Inostroza-Ponta
2019 ◽  
Vol 23 (5) ◽  
pp. 776-787 ◽  
Author(s):  
Manuel Villalobos-Cid ◽  
Marcio Dorn ◽  
Rodrigo Ligabue-Braun ◽  
Mario Inostroza-Ponta

2021 ◽  
Vol 82 (1-2) ◽  
Author(s):  
Lena Collienne ◽  
Alex Gavryushkin

AbstractMany popular algorithms for searching the space of leaf-labelled (phylogenetic) trees are based on tree rearrangement operations. Under any such operation, the problem is reduced to searching a graph where vertices are trees and (undirected) edges are given by pairs of trees connected by one rearrangement operation (sometimes called a move). Most popular are the classical nearest neighbour interchange, subtree prune and regraft, and tree bisection and reconnection moves. The problem of computing distances, however, is $${\mathbf {N}}{\mathbf {P}}$$ N P -hard in each of these graphs, making tree inference and comparison algorithms challenging to design in practice. Although anked phylogenetic trees are one of the central objects of interest in applications such as cancer research, immunology, and epidemiology, the computational complexity of the shortest path problem for these trees remained unsolved for decades. In this paper, we settle this problem for the ranked nearest neighbour interchange operation by establishing that the complexity depends on the weight difference between the two types of tree rearrangements (rank moves and edge moves), and varies from quadratic, which is the lowest possible complexity for this problem, to $${\mathbf {N}}{\mathbf {P}}$$ N P -hard, which is the highest. In particular, our result provides the first example of a phylogenetic tree rearrangement operation for which shortest paths, and hence the distance, can be computed efficiently. Specifically, our algorithm scales to trees with tens of thousands of leaves (and likely hundreds of thousands if implemented efficiently).


2020 ◽  
Author(s):  
Jeremy M. Brown ◽  
Genevieve G. Mount ◽  
Kyle A. Gallivan ◽  
James Wilgenbusch

All phylogenetic studies are built around sets of trees. Tree sets carry different kinds of information depending on the data and approaches used to generate them, but ultimately the variation they contain and their structure is what drives new phylogenetic insights. In order to better understand the variation in and structure of phylogenetic tree sets, we need tools that are generic, flexible, and exploratory. These tools can serve as natural complements to more formal, statistical investigations and allow us to flag surprising or unexpected observations, better understand the results of model-based studies, as well as build intuition. Here, we describe such a set of tools and provide examples of how they can be applied to relevant questions in phylogenetics, phylogenomics, and species-tree inference. These tools include both visualization techniques and quantitative summaries and are currently implemented in the TreeScaper software package (Huang et al. 2016).


2020 ◽  
Vol 10 ◽  
Author(s):  
Pin Wu ◽  
Linjun Hou ◽  
Yingdong Zhang ◽  
Liye Zhang

2020 ◽  
Author(s):  
Dana Azouri ◽  
Shiran Abadi ◽  
Yishay Mansour ◽  
Itay Mayrose ◽  
Tal Pupko

Abstract Inferring a phylogenetic tree, which describes the evolutionary relationships among a set of organisms, genes, or genomes, is a fundamental step in numerous evolutionary studies. With the aim of making tree inference feasible for problems involving more than a handful of sequences, current algorithms for phylogenetic tree reconstruction utilize various heuristic approaches. Such approaches rely on performing costly likelihood optimizations, and thus evaluate only a subset of all potential trees. Consequently, all existing methods suffer from the known tradeoff between accuracy and running time. Here, we train a machine-learning algorithm over an extensive cohort of empirical data to predict the neighboring trees that increase the likelihood, without actually computing their likelihood. This provides means to safely discard a large set of the search space, thus avoiding numerous expensive likelihood computations. Our analyses suggest that machine-learning approaches can make heuristic tree searches substantially faster without losing accuracy and thus could be incorporated for narrowing down the examined neighboring trees of each intermediate tree in any tree search methodology.


2017 ◽  
Vol 66 (5) ◽  
pp. 698-714 ◽  
Author(s):  
Yongliang Zhai ◽  
Bouchard-Côté Alexandre

2021 ◽  
Vol 11 (1) ◽  
Author(s):  
David Kerk ◽  
Jordan F. Mattice ◽  
Mario E. Valdés-Tresanco ◽  
Sergei Yu Noskov ◽  
Kenneth K.-S. Ng ◽  
...  

AbstractPhosphoprotein phosphatase (PPP) enzymes are ubiquitous proteins involved in cellular signaling pathways and other functions. Here we have traced the origin of the PPP sequences of Eukaryotes and their radiation. Using a bacterial PPP Hidden Markov Model (HMM) we uncovered “BacterialPPP-Like” sequences in Archaea. A HMM derived from eukaryotic PPP enzymes revealed additional, unique sequences in Archaea and Bacteria that were more like the eukaryotic PPP enzymes then the bacterial PPPs. These sequences formed the basis of phylogenetic tree inference and sequence structural analysis allowing the history of these sequence types to be elucidated. Our phylogenetic tree data strongly suggest that eukaryotic PPPs ultimately arose from ancestors in the Asgard archaea. We have clarified the radiation of PPPs within Eukaryotes, substantially expanding the range of known organisms with PPP subtypes (Bsu1, PP7, PPEF/RdgC) previously thought to have a more restricted distribution. Surprisingly, sequences from the Methanosarcinaceae (Euryarchaeota) form a strongly supported sister group to eukaryotic PPPs in our phylogenetic analysis. This strongly suggests an intimate association between an Asgard ancestor and that of the Methanosarcinaceae. This is highly reminiscent of the syntrophic association recently demonstrated between the cultured Lokiarchaeal species Prometheoarchaeum and a methanogenic bacterial species.


Author(s):  
Yo Yamamoto ◽  
Hidemoto Nakada ◽  
Hidetoshi Shimodaira ◽  
Satoshi Matsuoka

Sign in / Sign up

Export Citation Format

Share Document