scholarly journals The bounded coalescent model: conditioning a genealogy on a minimum root date

2022 ◽  
Author(s):  
Jake Carson ◽  
Alice Ledda ◽  
Luca Ferretti ◽  
Matt Keeling ◽  
Xavier Didelot

The coalescent model represents how individuals sampled from a population may have originated from a last common ancestor. The bounded coalescent model is obtained by conditioning the coalescent model such that the last common ancestor must have existed after a certain date. This conditioned model arises in a variety of applications, such as speciation, horizontal gene transfer or transmission analysis, and yet the bounded coalescent model has not been previously analysed in detail. Here we describe a new algorithm to simulate from this model directly, without resorting to rejection sampling. We show that this direct simulation algorithm is more computationally efficient than the rejection sampling approach. We also show how to calculate the probability of the last common ancestor occurring after a given date, which is required to compute the probability of realisations under the bounded coalescent model. Our results are applicable in both the isochronous (when all samples have the same date) and heterochronous (where samples can have different dates) settings. We explore the effect of setting a bound on the date of the last common ancestor, and show that it affects a number of properties of the resulting phylogenies. All our methods are implemented in a new R package called BoundedCoalescent which is freely available online.

mSystems ◽  
2021 ◽  
Vol 6 (1) ◽  
Author(s):  
Anoop Singh ◽  
Mohita Gaur ◽  
Vishal Sharma ◽  
Palak Khanna ◽  
Ankur Bothra ◽  
...  

ABSTRACT Clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated (Cas) genes are conserved genetic elements in many prokaryotes, including Mycobacterium tuberculosis, the causative agent of tuberculosis. Although knowledge of CRISPR locus variability has been utilized in M. tuberculosis strain genotyping, its evolutionary path in Mycobacteriaceae is not well understood. In this study, we have performed a comparative analysis of 141 mycobacterial genomes and identified the exclusive presence of the CRISPR-Cas type III-A system in M. tuberculosis complex (MTBC). Our global phylogenetic analysis of CRISPR repeats and Cas10 proteins offers evidence of horizontal gene transfer (HGT) of the CRISPR-Cas module in the last common ancestor of MTBC and Mycobacterium canettii from a Streptococcus-like environmental bacterium. Additionally, our results show that the variation of CRISPR-Cas organization in M. tuberculosis lineages, especially in the Beijing sublineage of lineage 2, is due to the transposition of insertion sequence IS6110. The direct repeat (DR) region of the CRISPR-Cas locus acts as a hot spot for IS6110 insertion. We show in M. tuberculosis H37Rv that the repeat at the 5′ end of CRISPR1 of the forward strand is an atypical repeat made up partly of IS-terminal inverted repeat and partly CRISPR DR. By tracing an undetectable spacer sequence in the DR region, the two CRISPR loci could theoretically be joined to reconstruct the ancestral single CRISPR-Cas locus organization, as seen in M. canettii. This study retracing the evolutionary events of HGT and IS6110-driven genomic deletions helps us to better understand the strain-specific variations in M. tuberculosis lineages. IMPORTANCE Comparative genomic analysis of prokaryotes has led to a better understanding of the biology of several pathogenic microorganisms. One such clinically important pathogen is M. tuberculosis, the leading cause of bacterial infection worldwide. Recent evidence on the functionality of the CRISPR-Cas system in M. tuberculosis has brought back focus on these conserved genetic elements, present in many prokaryotes. Our study advances understanding of mycobacterial CRISPR-Cas origin and its diversity among the different species. We provide phylogenetic evidence of acquisition of CRISPR-Cas type III-A in the last common ancestor shared between MTBC and M. canettii, by HGT-mediated events. The most likely source of HGT was an environmental Firmicutes bacterium. Genomic mapping of the CRISPR loci showed the IS6110 transposition-driven variations in M. tuberculosis strains. Thus, this study offers insights into events related to the evolution of CRISPR-Cas in M. tuberculosis lineages.


2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Evy van Berlo ◽  
Alejandra P. Díaz-Loyo ◽  
Oscar E. Juárez-Mora ◽  
Mariska E. Kret ◽  
Jorg J. M. Massen

AbstractYawning is highly contagious, yet both its proximate mechanism(s) and its ultimate causation remain poorly understood. Scholars have suggested a link between contagious yawning (CY) and sociality due to its appearance in mostly social species. Nevertheless, as findings are inconsistent, CY’s function and evolution remains heavily debated. One way to understand the evolution of CY is by studying it in hominids. Although CY has been found in chimpanzees and bonobos, but is absent in gorillas, data on orangutans are missing despite them being the least social hominid. Orangutans are thus interesting for understanding CY’s phylogeny. Here, we experimentally tested whether orangutans yawn contagiously in response to videos of conspecifics yawning. Furthermore, we investigated whether CY was affected by familiarity with the yawning individual (i.e. a familiar or unfamiliar conspecific and a 3D orangutan avatar). In 700 trials across 8 individuals, we found that orangutans are more likely to yawn in response to yawn videos compared to control videos of conspecifics, but not to yawn videos of the avatar. Interestingly, CY occurred regardless of whether a conspecific was familiar or unfamiliar. We conclude that CY was likely already present in the last common ancestor of humans and great apes, though more converging evidence is needed.


Author(s):  
Bennett J Kapili ◽  
Anne E Dekas

Abstract Motivation Linking microbial community members to their ecological functions is a central goal of environmental microbiology. When assigned taxonomy, amplicon sequences of metabolic marker genes can suggest such links, thereby offering an overview of the phylogenetic structure underpinning particular ecosystem functions. However, inferring microbial taxonomy from metabolic marker gene sequences remains a challenge, particularly for the frequently sequenced nitrogen fixation marker gene, nitrogenase reductase (nifH). Horizontal gene transfer in recent nifH evolutionary history can confound taxonomic inferences drawn from the pairwise identity methods used in existing software. Other methods for inferring taxonomy are not standardized and require manual inspection that is difficult to scale. Results We present Phylogenetic Placement for Inferring Taxonomy (PPIT), an R package that infers microbial taxonomy from nifH amplicons using both phylogenetic and sequence identity approaches. After users place query sequences on a reference nifH gene tree provided by PPIT (n = 6317 full-length nifH sequences), PPIT searches the phylogenetic neighborhood of each query sequence and attempts to infer microbial taxonomy. An inference is drawn only if references in the phylogenetic neighborhood are: (1) taxonomically consistent and (2) share sufficient pairwise identity with the query, thereby avoiding erroneous inferences due to known horizontal gene transfer events. We find that PPIT returns a higher proportion of correct taxonomic inferences than BLAST-based approaches at the cost of fewer total inferences. We demonstrate PPIT on deep-sea sediment and find that Deltaproteobacteria are the most abundant potential diazotrophs. Using this dataset we show that emending PPIT inferences based on visual inspection of query sequence placement can achieve taxonomic inferences for nearly all sequences in a query set. We additionally discuss how users can apply PPIT to the analysis of other marker genes. Availability PPIT is freely available to non-commercial users at https://github.com/bkapili/ppit. Installation includes a vignette that demonstrates package use and reproduces the nifH amplicon analysis discussed here. The raw nifH amplicon sequence data have been deposited in the GenBank, EMBL, and DDBJ databases under BioProject number PRJEB37167. Supplementary information Supplementary data are available at Bioinformatics online.


2015 ◽  
Vol 112 (29) ◽  
pp. 9070-9075 ◽  
Author(s):  
Purushottam D. Dixit ◽  
Tin Yau Pang ◽  
F. William Studier ◽  
Sergei Maslov

An approximation to the ∼4-Mbp basic genome shared by 32 strains ofEscherichia colirepresenting six evolutionary groups has been derived and analyzed computationally. A multiple alignment of the 32 complete genome sequences was filtered to remove mobile elements and identify the most reliable ∼90% of the aligned length of each of the resulting 496 basic-genome pairs. Patterns of single base-pair mutations (SNPs) in aligned pairs distinguish clonally inherited regions from regions where either genome has acquired DNA fragments from diverged genomes by homologous recombination since their last common ancestor. Such recombinant transfer is pervasive across the basic genome, mostly between genomes in the same evolutionary group, and generates many unique mosaic patterns. The six least-diverged genome pairs have one or two recombinant transfers of length ∼40–115 kbp (and few if any other transfers), each containing one or more gene clusters known to confer strong selective advantage in some environments. Moderately diverged genome pairs (0.4–1% SNPs) show mosaic patterns of interspersed clonal and recombinant regions of varying lengths throughout the basic genome, whereas more highly diverged pairs within an evolutionary group or pairs between evolutionary groups having >1.3% SNPs have few clonal matches longer than a few kilobase pairs. Many recombinant transfers appear to incorporate fragments of the entering DNA produced by restriction systems of the recipient cell. A simple computational model can closely fit the data. Most recombinant transfers seem likely to be due to generalized transduction by coevolving populations of phages, which could efficiently distribute variability throughout bacterial genomes.


2021 ◽  
Author(s):  
Ksenia Juravel ◽  
Luis Porras ◽  
Sebastian Hoehna ◽  
Davide Pisani ◽  
Gert Wörheide

An accurate phylogeny of animals is needed to clarify their evolution, ecology, and impact on shaping the biosphere. Although multi-gene alignments of up to several hundred thousand amino acids are nowadays routinely used to test hypotheses of animal relationships, some nodes towards the root of the animal phylogeny are proving hard to resolve. While the relationships of the non-bilaterian lineages, primarily sponges (Porifera) and comb jellies (Ctenophora), have received much attention since more than a decade, controversies about the phylogenetic position of the worm-like bilaterian lineage Xenacoelomorpha and the monophyly of the "Superphylum" Deuterostomia have more recently emerged. Here we independently analyse novel genome gene content and morphological datasets to assess patterns of phylogenetic congruence with previous amino-acid derived phylogenetic hypotheses. Using statistical hypothesis testing, we show that both our datasets very strongly support sponges as the sister group of all the other animals, Xenoacoelomorpha as the sister group of the other Bilateria, and largely support monophyletic Deuterostomia. Based on these results, we conclude that the last common animal ancestor may have been a simple, filter-feeding organism without a nervous system and muscles, while the last common ancestor of Bilateria might have been a small, acoelomate-like worm without a through gut.


Development ◽  
2002 ◽  
Vol 129 (9) ◽  
pp. 2121-2128
Author(s):  
Damon T. Page

In vertebrates (deuterostomes), brain patterning depends on signals from adjacent tissues. For example, holoprosencephaly, the most common brain anomaly in humans, results from defects in signaling between the embryonic prechordal plate (consisting of the dorsal foregut endoderm and mesoderm) and the brain. I have examined whether a similar mechanism of brain development occurs in the protostome Drosophila, and find that the foregut and mesoderm act to pattern the fly embryonic brain. When the foregut and mesoderm of Drosophila are ablated, brain patterning is disrupted. The loss of Hedgehog expressed in the foregut appears to mediate this effect, as it does in vertebrates. One mechanism whereby these defects occur is a disruption of normal apoptosis in the brain. These data argue that the last common ancestor of protostomes and deuterostomes had a prototype of the brains present in modern animals, and also suggest that the foregut and mesoderm contributed to the patterning of this ‘proto-brain’. They also argue that the foreguts of protostomes and deuterostomes, which have traditionally been assigned to different germ layers, are actually homologous.


2011 ◽  
Vol 50 ◽  
pp. 19-42 ◽  
Author(s):  
Elie Dassa

In recent years, our understanding of the functioning of ABC (ATP-binding cassette) systems has been boosted by the combination of biochemical and structural approaches. However, the origin and the distribution of ABC proteins among living organisms are difficult to understand in a phylogenetic perspective, because it is hard to discriminate orthology and paralogy, due to the existence of horizontal gene transfer. In this chapter, I present an update of the classification of ABC systems and discuss a hypothetical scenario of their evolution. The hypothetical presence of ABC ATPases in the last common ancestor of modern organisms is discussed, as well as the additional possibility that ABC systems might have been transmitted to eukaryotes, after the two endosymbiosis events that led to the constitution of eukaryotic organelles. I update the functional information of selected ABC systems and introduce new families of ABC proteins that have been included recently into this vast superfamily, thanks to the availability of high-resolution three-dimensional structures.


eLife ◽  
2014 ◽  
Vol 3 ◽  
Author(s):  
Tera C Levin ◽  
Allison J Greaney ◽  
Laura Wetzel ◽  
Nicole King

The origin of animal multicellularity may be reconstructed by comparing animals with one of their closest living relatives, the choanoflagellate Salpingoeca rosetta. Just as animals develop from a single cell–the zygote–multicellular rosettes of S. rosetta develop from a founding cell. To investigate rosette development, we established forward genetics in S. rosetta. We find that the rosette defect of one mutant, named Rosetteless, maps to a predicted C-type lectin, a class of signaling and adhesion genes required for the development and innate immunity in animals. Rosetteless protein is essential for rosette development and forms an extracellular layer that coats and connects the basal poles of each cell in rosettes. This study provides the first link between genotype and phenotype in choanoflagellates and raises the possibility that a protein with C-type lectin-like domains regulated development in the last common ancestor of choanoflagellates and animals.


2018 ◽  
Vol 122 ◽  
pp. 84-92 ◽  
Author(s):  
Mark Grabowski ◽  
Kevin G. Hatala ◽  
William L. Jungers

Sign in / Sign up

Export Citation Format

Share Document