scholarly journals Linking Fold, Function and Phylogeny: A Comparative Genomics View on Protein (Domain) Evolution

2008 ◽  
Vol 9 (2) ◽  
pp. 88-96 ◽  
Author(s):  
Aartjan te Velthuis ◽  
Christoph Bagowski
2014 ◽  
Vol 206 (4) ◽  
pp. 1328-1336 ◽  
Author(s):  
Anna R. Kersting ◽  
Eshchar Mizrachi ◽  
Erich Bornberg-Bauer ◽  
Alexander A. Myburg

2010 ◽  
Vol 11 (5) ◽  
pp. 368-376 ◽  
Author(s):  
Christoph P. Bagowski ◽  
Wouter Bruins ◽  
Aartjan J.W. te Velthuis

2004 ◽  
Vol 1 (1) ◽  
pp. 52-63
Author(s):  
M. Dünßer ◽  
R. Lampidis ◽  
S. Schmidt ◽  
D. Seipel ◽  
T. Dandekar

Summary Integration of data in pathogenomics is achieved here considering three different levels of cellular complexity: (i) genome and comparative genomics, (ii) enzyme cascades and pathway analysis, (iii) networks including metabolic network analysis.After direct sequence annotation exploiting tools for protein domain annotation (e.g. AnDOM) and analysis of regulatory elements (e.g. the RNA analyzer tool) the analysis results from extensive comparative genomics are integrated for the first level, pathway alignment adds data for the pathway level, elementary mode analysis and metabolite databanks add to the third level of cellular complexity. For efficient data integration of all data the XML based platform myBSMLStudio2003 is discussed and developed here. It integrates XQuery capabilities, automatic scripting updates for sequence annotation and a JESS expert system shell for functional annotation. In the context of genome annotation platforms in place (GenDB, PEDANT) these different tools and approaches presented here allow improved functional genome annotation as well as data integration in pathogenomics.


2019 ◽  
Vol 374 (1786) ◽  
pp. 20190088 ◽  
Author(s):  
David López-Escardó ◽  
Xavier Grau-Bové ◽  
Amy Guillaumet-Adkins ◽  
Marta Gut ◽  
Michael E. Sieracki ◽  
...  

Understanding the origins of animal multicellularity is a fundamental biological question. Recent genome data have unravelled the role that co-option of pre-existing genes played in the origin of animals. However, there were also some important genetic novelties at the onset of Metazoa. To have a clear understanding of the specific genetic innovations and how they appeared, we need the broadest taxon sampling possible, especially among early-branching animals and their unicellular relatives. Here, we take advantage of single-cell genomics to expand our understanding of the genomic diversity of choanoflagellates, the sister-group to animals. With these genomes, we have performed an updated and taxon-rich reconstruction of protein evolution from the Last Eukaryotic Common Ancestor (LECA) to animals. Our novel data re-defines the origin of some genes previously thought to be metazoan-specific, like the POU transcription factor, which we show appeared earlier in evolution. Moreover, our data indicate that the acquisition of new genes at the stem of Metazoa was mainly driven by duplications and protein domain rearrangement processes at the stem of Metazoa. Furthermore, our analysis allowed us to reveal protein domains that are essential to the maintenance of animal multicellularity. Our analyses also demonstrate the utility of single-cell genomics from uncultured taxa to address evolutionary questions. This article is part of a discussion meeting issue ‘Single cell ecology’.


2018 ◽  
Author(s):  
Gregg W.C. Thomas ◽  
Elias Dohmen ◽  
Daniel S.T. Hughes ◽  
Shwetha C. Murali ◽  
Monica Poelchau ◽  
...  

AbstractBackgroundArthropods comprise the largest and most diverse phylum on Earth and play vital roles in nearly every ecosystem. Their diversity stems in part from variations on a conserved body plan, resulting from and recorded in adaptive changes in the genome. Dissection of the genomic record of sequence change enables broad questions regarding genome evolution to be addressed, even across hyper-diverse taxa within arthropods.ResultsUsing 76 whole genome sequences representing 21 orders spanning more than 500 million years of arthropod evolution, we document changes in gene and protein domain content and provide temporal and phylogenetic context for interpreting these innovations. We identify many novel gene families that arose early in the evolution of arthropods and during the diversification of insects into modern orders. We reveal unexpected variation in patterns of DNA methylation across arthropods and examples of gene family and protein domain evolution coincident with the appearance of notable phenotypic and physiological adaptations such as flight, metamorphosis, sociality and chemoperception.ConclusionsThese analyses demonstrate how large-scale comparative genomics can provide broad new insights into the genotype to phenotype map and generate testable hypotheses about the evolution of animal diversity.


2020 ◽  
Vol 49 (D1) ◽  
pp. D1464-D1471 ◽  
Author(s):  
Guignon Valentin ◽  
Toure Abdel ◽  
Droc Gaëtan ◽  
Dufayard Jean-François ◽  
Conte Matthieu ◽  
...  

Abstract Comparative genomics is the analysis of genomic relationships among different species and serves as a significant base for evolutionary and functional genomic studies. GreenPhylDB (https://www.greenphyl.org) is a database designed to facilitate the exploration of gene families and homologous relationships among plant genomes, including staple crops critically important for global food security. GreenPhylDB is available since 2007, after the release of the Arabidopsis thaliana and Oryza sativa genomes and has undergone multiple releases. With the number of plant genomes currently available, it becomes challenging to select a single reference for comparative genomics studies but there is still a lack of databases taking advantage several genomes by species for orthology detection. GreenPhylDBv5 introduces the concept of comparative pangenomics by harnessing multiple genome sequences by species. We created 19 pangenes and processed them with other species still relying on one genome. In total, 46 plant species were considered to build gene families and predict their homologous relationships through phylogenetic-based analyses. In addition, since the previous publication, we rejuvenated the website and included a new set of original tools including protein-domain combination, tree topologies searches and a section for users to store their own results in order to support community curation efforts.


F1000Research ◽  
2020 ◽  
Vol 9 ◽  
pp. 112
Author(s):  
Ajith Harish ◽  
David Morrison

Background: Locating the root node of the “tree of life” (ToL) is one of the hardest problems in phylogenetics. The root-node or the universal common ancestor (UCA) divides descendants into organismal domains. Two notable variants of the two-domains ToL (2D-ToL) have gained support recently. One 2D-ToL posits that eukaryotes (organisms with nuclei) and akaryotes (organisms without nuclei) are sister clades that diverged from the UCA and that Asgard archaea are sister to other archaea, whereas the other proposes that eukaryotes emerged within archaea and places Asgard archaea sister to eukaryotes. Williams et al. (Nature Ecol. Evol. 4: 138–147; 2020) re-evaluated the data and methods that support the competing two-domains proposals and concluded that eukaryotes are the closest relatives of Asgard archaea. Critique: We argue that important aspects of estimating evolutionary relatedness and assessing phylogenetic signal in empirical data were overlooked. We focus on phylogenetic character reconstructions necessary to describe the UCA or its closest descendants in the absence of reliable fossils. It is well known that different character types present different perspectives on evolutionary history that relate to different phylogenetic depths. Which 2D-ToL is better supported depends on which kind of molecular features – protein-domains or their component amino acids – are better for resolving common ancestors at the roots of clades. In practice, this involves reconstructing character compositions of the ancestral nodes all the way back to the UCA. We believe the criticisms of 2D-ToL focus on superficial aspects of the data and reflects common misunderstandings of phylogenetic reconstructions using protein domains (folds).   Clarifications: Models of protein domain evolution support more reliable phylogenetic reconstructions. In contrast, even the best available amino acid substitution models fail to resolve the archaeal radiation, despite employing thousands of genes. Therefore, the primary domains Eukaryotes and Akaryotes are better supported in a 2D-ToL.


2015 ◽  
Author(s):  
Brian K Mannakee ◽  
Ryan N Gutenkunst

The long-held principle that functionally important proteins evolve slowly has recently been challenged by studies in mice and yeast showing that the severity of a protein knockout only weakly predicts that protein's rate of evolution. However, the relevance of these studies to evolutionary changes within proteins is unknown, because amino acid substitutions, unlike knockouts, often only slightly perturb protein activity. To quantify the phenotypic effect of small biochemical perturbations, we developed an approach to use computational systems biology models to measure the influence of individual reaction rate constants on network dynamics. We show that this dynamical influence is predictive of protein domain evolutionary rate in vertebrates and yeast, even after controlling for expression level and breadth, network topology, and knockout effect. Thus, our results not only demonstrate the importance of protein domain function in determining evolutionary rate, but also the power of systems biology modeling to uncover unanticipated evolutionary forces.


2006 ◽  
Vol 357 (1) ◽  
pp. 9-17 ◽  
Author(s):  
J. Ignasi Lucas ◽  
Vicente Arnau ◽  
Ignacio Marín

Sign in / Sign up

Export Citation Format

Share Document