scholarly journals Integration of Anatomy Ontologies and Evo-Devo Using Structured Markov Models Suggests a New Framework for Modeling Discrete Phenotypic Traits

2019 ◽  
Vol 68 (5) ◽  
pp. 698-716 ◽  
Author(s):  
Sergei Tarasov

Abstract Modeling discrete phenotypic traits for either ancestral character state reconstruction or morphology-based phylogenetic inference suffers from ambiguities of character coding, homology assessment, dependencies, and selection of adequate models. These drawbacks occur because trait evolution is driven by two key processes—hierarchical and hidden—which are not accommodated simultaneously by the available phylogenetic methods. The hierarchical process refers to the dependencies between anatomical body parts, while the hidden process refers to the evolution of gene regulatory networks (GRNs) underlying trait development. Herein, I demonstrate that these processes can be efficiently modeled using structured Markov models (SMM) equipped with hidden states, which resolves the majority of the problems associated with discrete traits. Integration of SMM with anatomy ontologies can adequately incorporate the hierarchical dependencies, while the use of the hidden states accommodates hidden evolution of GRNs and substitution rate heterogeneity. I assess the new models using simulations and theoretical synthesis. The new approach solves the long-standing “tail color problem,” in which the trait is scored for species with tails of different colors or no tails. It also presents a previously unknown issue called the “two-scientist paradox,” in which the nature of coding the trait and the hidden processes driving the trait’s evolution are confounded; failing to account for the hidden process may result in a bias, which can be avoided by using hidden state models. All this provides a clear guideline for coding traits into characters. This article gives practical examples of using the new framework for phylogenetic inference and comparative analysis.

2017 ◽  
Author(s):  
Sergei Tarasov

AbstractModeling discrete phenotypic traits for either ancestral character state reconstruction or morphology-based phylogenetic inference suffers from ambiguities of character coding, homology assessment, dependencies, and selection of adequate models. These drawbacks occur because trait evolution is driven by two key processes – hierarchical and hidden – which are not accommodated simultaneously by the available phylogenetic methods. The hierarchical process refers to the dependencies between anatomical body parts, while the hidden process refers to the evolution of gene regulatory networks underlying trait development. Herein, I demonstrate that these processes can be efficiently modeled using structured Markov models equipped with hidden states, which resolves the majority of the problems associated with discrete traits. Integration of structured Markov models with anatomy ontologies can adequately incorporate the hierarchical dependencies, while the use of the hidden states accommodates hidden evolution of gene regulatory networks and substitution rate heterogeneity. I assess the new models using simulations and theoretical synthesis. The new approach solves the long-standing tail color problem (that aims at coding tail when it is absent) and presents a previously unknown issue called the “two-scientist paradox”. The latter issue refers to the confounding nature of the coding of a trait and the hidden processes driving the trait’s evolution; failing to account for the hidden process may result in a bias, which can be avoided by using hidden state models. All this provides a clear guideline for coding traits into characters. This paper gives practical examples of using the new framework for phylogenetic inference and comparative analysis.


Botany ◽  
2013 ◽  
Vol 91 (9) ◽  
pp. 573-591 ◽  
Author(s):  
Virginia Ramírez-Cruz ◽  
Gastón Guzmán ◽  
Alma Rosa Villalobos-Arámbula ◽  
Aarón Rodríguez ◽  
P. Brandon Matheny ◽  
...  

The genus Psilocybe contains iconic species of fungi renowned for their hallucinogenic properties. Recently, Psilocybe also included non-hallucinogenic species that have since been shifted to the genus Deconica. Here, we reconstruct a multigene phylogeny for Psilocybe, Deconica, and other exemplars of the families Hymenogastraceae and Strophariaceae sensu stricto (s. str.), using three nuclear markers (nLSU-rRNA, 5.8S rRNA, and rpb1). Our results confirm the monophyly of Deconica within Strophariaceae s. str., as well as numerous robust infrageneric relationships. Psilocybe is also recovered as a monophyletic group in the Hymenogastraceae, in which two principal lineages are recognized, including several nested subgroups. Most sections of Psilocybe following classifications based on morphological features are not supported in these analyses. Ancestral character state reconstruction analyses suggest that basidiospore shape in frontal view and spore wall thickness, commonly used to characterize sections in Deconica and Psilocybe, are homoplastic. Chrysocystidia, sterile cells located in the hymenium, evolved on at least two occasions in the Strophariaceae s. str., including in a novel lineage of Deconica.


2019 ◽  
Author(s):  
Sergei Tarasov

Abstract What constitutes a discrete morphological character versus character state has been long discussed in the systematics literature but the consensus on this issue is still missing. Different methods of classifying organismal features into characters and character states (CCSs) can dramatically affect the results of phylogenetic analyses. Here, I show that, in the framework of Markov models, the modular structure of the gene regulatory network (GRN) underlying trait development, and the hierarchical nature of GRN evolution, essentially remove the distinction between morphological CCS, thus endowing the CCS with an invariant property with respect to each other. This property allows the states of one character to be represented as several individual characters and vice versa. In practice, this means that a phenotype can be encoded using a set of characters or just one complex character with numerous states. The representation of a phenotype using one complex character can be implemented in Markov models of trait evolution by properly structuring transition rate matrix.


2015 ◽  
Vol 84 (2) ◽  
pp. 129-148 ◽  
Author(s):  
Valentin Rineau ◽  
Anaïs Grand ◽  
René Zaragüeta ◽  
Michel Laurin

Phenotypic characters are essential to study the evolution of extant and extinct life forms and to reconstruct the tree of life. Inside the cladistics theory, parsimony is used by a large majority of systematists working on phenotypic characters, whereas 3ta is much less widespread but has triggered important debates. Many important differences in the interpretation of the cladistic theory exist between these methods, e.g. meaning and treatment of reversals, character representation as ‘data-matrices’ in parsimony (ordered and unordered), and as rooted trees (hierarchies) in 3ta. Although 3ta has received severe criticism, mostly focused in the use of software intended to be used in parsimony, only a few empirical studies have compared these methods so far. We present the results of simulations of the evolution of phenotypic traits under a Brownian motion model to characterize differences in sensitivity between parsimony and 3ta to (1) outgroup branch length, which affects the reliability of ancestral character state estimates, (2) character state ordering scheme, and (3) ingroup branch lengths that reflect the geological age of studied taxa. Our results show that the ‘nihilistic’ attitude of leaving multistate characters unordered when criteria to order are available (e.g. , similarity, ontogeny, etc…) can decrease resolving power of the method (by 13.4% to 29.3%) and increase the occurrence of artefactual clades (by 5% to 15.6%). Increasing outgroup branch length significantly decreases resolving power and increases artefactual resolution, at least for paleontological trees. All simulations show that ordered parsimony is always superior to 3ta in tested parameter space. These results depend on the assumption in parsimony that reversals (as implied by the Brownian motion, as in most other models) can be evidence for the support of a clade a posteriori from an analysis or a priori on simulations with a known pattern. We discuss implications of these points of view compared to the assumption inherent in 3ta (i.e. , that reversals should not support a clade as other synapomorphies do) on evolutionary models.


Author(s):  
Yaping Li ◽  
Enrico Zio ◽  
Ershun Pan

Degradation is an unavoidable phenomenon in industrial systems. Hidden Markov models (HMMs) have been used for degradation modeling. In particular, segmental HMMs have been developed to model the explicit relationship between degradation signals and hidden states. However, existing segmental HMMs deal only with univariate cases, whereas in real systems, signals from various sensors are collected simultaneously, which makes it necessary to adapt the segmental HMMs to deal with multivariate processes. Also, to make full use of the information from the sensors, it is important to differentiate stable signals from deteriorating ones, but there is no good way for this, especially in multivariate processes. In this paper, the multivariate exponentially weighted moving average (MEWMA) control chart is employed to identify deteriorating multivariate signals. Specifically, the MEWMA statistic is used as a comprehensive indicator for differentiating multivariate observations. Likelihood Maximization is used to estimate the model parameters. To avoid underflow, the forward and backward probabilities are normalized. In order to assess degradation, joint probabilities are defined and derived. Further, the occurrence probability of each degradation state at the current time, as well as in the future, is derived. The Commercial Modular Aero-Propulsion System Simulation (C-MAPSS) dataset of NASA is employed for comparative analysis. In terms of degradation assessment and prediction, the proposed model performs very well in general. By sensitivity analysis, we show that in order to improve further the performance of the method, the weight of the chart should be set relatively small, whereas the method is not sensitive to the change of the in-control average run length (ARL).


2020 ◽  
Vol 70 (1) ◽  
pp. 181-189
Author(s):  
Guy Baele ◽  
Mandev S Gill ◽  
Paul Bastide ◽  
Philippe Lemey ◽  
Marc A Suchard

Abstract Markov models of character substitution on phylogenies form the foundation of phylogenetic inference frameworks. Early models made the simplifying assumption that the substitution process is homogeneous over time and across sites in the molecular sequence alignment. While standard practice adopts extensions that accommodate heterogeneity of substitution rates across sites, heterogeneity in the process over time in a site-specific manner remains frequently overlooked. This is problematic, as evolutionary processes that act at the molecular level are highly variable, subjecting different sites to different selective constraints over time, impacting their substitution behavior. We propose incorporating time variability through Markov-modulated models (MMMs), which extend covarion-like models and allow the substitution process (including relative character exchange rates as well as the overall substitution rate) at individual sites to vary across lineages. We implement a general MMM framework in BEAST, a popular Bayesian phylogenetic inference software package, allowing researchers to compose a wide range of MMMs through flexible XML specification. Using examples from bacterial, viral, and plastid genome evolution, we show that MMMs impact phylogenetic tree estimation and can substantially improve model fit compared to standard substitution models. Through simulations, we show that marginal likelihood estimation accurately identifies the generative model and does not systematically prefer the more parameter-rich MMMs. To mitigate the increased computational demands associated with MMMs, our implementation exploits recent developments in BEAGLE, a high-performance computational library for phylogenetic inference. [Bayesian inference; BEAGLE; BEAST; covarion, heterotachy; Markov-modulated models; phylogenetics.]


2017 ◽  
Vol 114 (35) ◽  
pp. 9403-9408 ◽  
Author(s):  
Elodie Renvoisé ◽  
Kathryn D. Kavanagh ◽  
Vincent Lazzari ◽  
Teemu J. Häkkinen ◽  
Ritva Rice ◽  
...  

Much of the basic information about individual organ development comes from studies using model species. Whereas conservation of gene regulatory networks across higher taxa supports generalizations made from a limited number of species, generality of mechanistic inferences remains to be tested in tissue culture systems. Here, using mammalian tooth explants cultured in isolation, we investigate self-regulation of patterning by comparing developing molars of the mouse, the model species of mammalian research, and the bank vole. A distinct patterning difference between the vole and the mouse molars is the alternate cusp offset present in the vole. Analyses of both species using 3D reconstructions of developing molars and jaws, computational modeling of cusp patterning, and tooth explants cultured with small braces show that correct cusp offset requires constraints on the lateral expansion of the developing tooth. Vole molars cultured without the braces lose their cusp offset, and mouse molars cultured with the braces develop a cusp offset. Our results suggest that cusp offset, which changes frequently in mammalian evolution, is more dependent on the 3D support of the developing jaw than other aspects of tooth shape. This jaw–tooth integration of a specific aspect of the tooth phenotype indicates that organs may outsource specific aspects of their morphology to be regulated by adjacent body parts or organs. Comparative studies of morphologically different species are needed to infer the principles of organogenesis.


Author(s):  
Sergei Tarasov ◽  
Istvan Miko ◽  
Matthew Yoder ◽  
Josef Uyeda

Ancestral character state reconstruction has been long used to gain insight into the evolution of individual traits in organisms. However, organismal anatomies (= entire phenotypes) are not merely ensembles of individual traits, rather they are complex systems where traits interact with each other due to anatomical dependencies (when one trait depends on the presence of another trait) and developmental constraints. Comparative phylogenetics has been largely lacking a method for reconstructing the evolution of entire organismal anatomies or organismal body regions. Herein, we present a new approach named PARAMO (Phylogenetic Ancestral Reconstruction of Anatomy by Mapping Ontologies, Tarasov and Uyeda 2019) that takes into account anatomical dependencies and uses stochastic maps (i.e., phylogenetic trees with an instance of mapped evolutionary history of characters, Huelsenbeck et al. 2003) along with anatomy ontologies to reconstruct organismal anatomies. Our approach treats the entire phenotype or its component body regions as single complex characters and allows exploring and comparing phenotypic evolution at different levels of anatomical hierarchy. These complex characters are constructed by ontology-informed amalgamation of elementary characters (i.e., those coded in character matrix) using stochastic maps. In our approach, characters are linked with the terms from an anatomy ontology, which allows viewing them not just as an ensemble of character state tokens but as entities that have their own biological meaning provided by the ontology. This ontology-informed framework provides new opportunities for tracking phenotypic radiations and anatomical evolution of organisms, which we explore using a large dataset for the insect order Hymenoptera (sawflies, wasps, ants and bees).


Author(s):  
Wanling Song ◽  
Anna L. Duncan ◽  
Mark S.P. Sansom

AbstractG protein-coupled receptors (GPCRs) play key roles in cellular signalling. GPCRs are suggested to form dimers and higher order oligomers in response to activation. However, we do not fully understand GPCR activation at larger scales and in an in vivo context. We have characterised oligomeric configurations of the adenosine 2a receptor (A2aR) by combining large-scale molecular dynamics simulations with Markov state models. Receptor activation results in enhanced oligomerisation, more diverse oligomer populations, and a more connected oligomerisation network. The active state conformation of the A2aR shifts protein-protein association interfaces to those involving intracellular loop ICL3 and transmembrane helix TM6. Binding of PIP2 to A2aR stabilises protein-protein interactions via PIP2-mediated association interfaces. These results indicate that A2aR oligomerisation is responsive to the local membrane lipid environment. This in turn suggests a modulatory effect on A2aR whereby a given oligomerisation profile favours the dynamic formation of specific supra-molecular signalling complexes.


Sign in / Sign up

Export Citation Format

Share Document