A genome-wide Approximate Bayesian Computation approach suggests only limited numbers of soft sweeps in humans over the last 100,000 years

Over the last 100,000 years, humans have spread across the globe and encountered a highly diverse set of environments to which they have had to adapt. Genome-wide scans of selection are powerful to detect selective sweeps. However, because of unknown fractions of undetected sweeps and false discoveries, the numbers of detected sweeps often poorly reflect actual numbers of selective sweeps in populations. The thousands of soft sweeps on standing variation recently evidenced in humans have also been interpreted as a majority of mis-classified neutral regions. In such a context, the extent of human adaptation remains little understood. We present a new rationale to estimate these actual numbers of sweeps expected over the last 100,000 years (denoted by X) from genome-wide population data, both considering hard sweeps and selective sweeps on standing variation. We implemented an approximate Bayesian computation framework and showed, based on computer simulations, that such a method can properly estimate X. We then jointly estimated the number of selective sweeps, their mean intensity and age in several 1000G African, European and Asian populations. Our estimations of X, found weakly sensitive to demographic misspecifications, revealed very limited numbers of sweeps regardless the frequency of the selected alleles at the onset of selection and the completion of sweeps. We estimated ∼80 sweeps in average across fifteen 1000G populations when assuming incomplete sweeps only and ∼140 selective sweeps in non-African populations when incorporating complete sweeps in our simulations. The method proposed may help to address controversies on the number of selective sweeps in populations, guiding further genome-wide investigations of recent positive selection.

Download Full-text

Sporadic Occurrence of Recent Selective Sweeps from Standing Variation in Humans as Revealed by an Approximate Bayesian Computation Approach

Genetics ◽

10.1093/genetics/iyab161 ◽

2021 ◽

Author(s):

Guillaume Laval ◽

Etienne Patin ◽

Pierre Boutillier ◽

Lluis Quintana-Murci

Keyword(s):

Approximate Bayesian Computation ◽

Sequence Data ◽

Machine Learning Algorithms ◽

Whole Genome Sequence ◽

Bayesian Computation ◽

Human Adaptation ◽

Selective Sweeps ◽

Standing Variation ◽

Genome Wide ◽

Approximate Bayesian

Abstract During their dispersals over the last 100,000 years, modern humans have been exposed to a large variety of environments, resulting in genetic adaptation. While genome-wide scans for the footprints of positive Darwinian selection have increased knowledge of genes and functions potentially involved in human local adaptation, they have globally produced evidence of a limited contribution of selective sweeps in humans. Conversely, studies based on machine learning algorithms suggest that recent sweeps from standing variation are widespread in humans, an observation that has been recently questioned. Here, we sought to formally quantify the number of recent selective sweeps in humans, by leveraging approximate Bayesian computation and whole-genome sequence data. Our computer simulations revealed suitable ABC estimations, regardless of the frequency of the selected alleles at the onset of selection and the completion of sweeps. Under a model of recent selection from standing variation, we inferred that an average of 68 (from 56 to 79) and 140 (from 94 to 198) sweeps occurred over the last 100,000 years of human history, in African and Eurasian populations, respectively. The former estimation is compatible with human adaptation rates estimated since divergence with chimps, and reveal numbers of sweeps per generation per site in the range of values estimated in Drosophila. Our results confirm the rarity of selective sweeps in humans and show a low contribution of sweeps from standing variation to recent human adaptation.

Download Full-text

Approximate Bayesian computation to recalibrate individual-based models with population data: Illustration with a forest simulation model

Ecological Modelling ◽

10.1016/j.ecolmodel.2014.09.023 ◽

2015 ◽

Vol 306 ◽

pp. 278-286 ◽

Cited By ~ 15

Author(s):

Guillaume Lagarrigues ◽

Franck Jabot ◽

Valentine Lafond ◽

Benoit Courbaud

Keyword(s):

Simulation Model ◽

Approximate Bayesian Computation ◽

Population Data ◽

Bayesian Computation ◽

Individual Based Models ◽

Forest Simulation Model ◽

Forest Simulation ◽

Approximate Bayesian

Download Full-text

Inferring Population Size History from Large Samples of Genome-Wide Molecular Data - An Approximate Bayesian Computation Approach

PLoS Genetics ◽

10.1371/journal.pgen.1005877 ◽

2016 ◽

Vol 12 (3) ◽

pp. e1005877 ◽

Cited By ~ 75

Author(s):

Simon Boitard ◽

Willy Rodríguez ◽

Flora Jay ◽

Stefano Mona ◽

Frédéric Austerlitz

Keyword(s):

Population Size ◽

Approximate Bayesian Computation ◽

Molecular Data ◽

Bayesian Computation ◽

Large Samples ◽

Genome Wide ◽

Approximate Bayesian

Download Full-text

Phylogenomics Reveals an Ancient Hybrid Origin of the Persian Walnut

Molecular Biology and Evolution ◽

10.1093/molbev/msz112 ◽

2019 ◽

Vol 36 (11) ◽

pp. 2451-2461 ◽

Cited By ~ 14

Author(s):

Bo-Wen Zhang ◽

Lin-Lin Xu ◽

Nan Li ◽

Peng-Cheng Yan ◽

Xin-Hua Jiang ◽

...

Keyword(s):

Approximate Bayesian Computation ◽

Juglans Regia ◽

Phylogenetic Network ◽

Single Copy ◽

Hybrid Origin ◽

Bayesian Computation ◽

Persian Walnut ◽

Genome Wide ◽

American Black ◽

Approximate Bayesian

Abstract Persian walnut (Juglans regia) is cultivated worldwide for its high-quality wood and nuts, but its origin has remained mysterious because in phylogenies it occupies an unresolved position between American black walnuts and Asian butternuts. Equally unclear is the origin of the only American butternut, J. cinerea. We resequenced the whole genome of 80 individuals from 19 of the 22 species of Juglans and assembled the genome of its relatives Pterocarya stenoptera and Platycarya strobilacea. Using phylogenetic-network analysis of single-copy nuclear genes, genome-wide site pattern probabilities, and Approximate Bayesian Computation, we discovered that J. regia (and its landrace J. sigillata) arose as a hybrid between the American and the Asian lineages and that J. cinerea resulted from massive introgression from an immigrating Asian butternut into the genome of an American black walnut. Approximate Bayesian Computation modeling placed the hybrid origin in the late Pliocene, ∼3.45 My, with both parental lineages since having gone extinct in Europe.

Download Full-text

Colonization and/or mitochondrial selective sweeps across the North Atlantic intertidal assemblage revealed by multi-taxa approximate Bayesian computation

Molecular Ecology ◽

10.1111/j.1365-294x.2010.04790.x ◽

2010 ◽

Vol 19 (20) ◽

pp. 4505-4519 ◽

Cited By ~ 44

Author(s):

KATRIINA L. ILVES ◽

WEN HUANG ◽

JOHN P. WARES ◽

MICHAEL J. HICKERSON

Keyword(s):

North Atlantic ◽

Approximate Bayesian Computation ◽

Bayesian Computation ◽

Selective Sweeps ◽

The North ◽

The North Atlantic ◽

Approximate Bayesian ◽

Intertidal Assemblage

Download Full-text

Gaussian process enhanced semi-automatic approximate Bayesian computation: parameter inference in a stochastic differential equation system for chemotaxis

Journal of Computational Physics ◽

10.1016/j.jcp.2020.109999 ◽

2020 ◽

pp. 109999

Author(s):

Agnieszka Borowska ◽

Diana Giurghita ◽

Dirk Husmeier

Keyword(s):

Differential Equation ◽

Stochastic Differential Equation ◽

Gaussian Process ◽

Approximate Bayesian Computation ◽

Equation System ◽

Bayesian Computation ◽

Differential Equation System ◽

Parameter Inference ◽

Approximate Bayesian

Download Full-text

Weighted approximate Bayesian computation via Sanov’s theorem

Computational Statistics ◽

10.1007/s00180-021-01093-4 ◽

2021 ◽

Author(s):

Cecilia Viscardi ◽

Michele Boreale ◽

Fabio Corradi

Keyword(s):

Large Deviations ◽

Posterior Distribution ◽

Approximate Bayesian Computation ◽

Bayesian Computation ◽

Information Theoretic ◽

Discrete Random Variables ◽

Positive Weights ◽

Approximate Bayesian ◽

Information Theoretic Method ◽

Computational Resources

AbstractWe consider the problem of sample degeneracy in Approximate Bayesian Computation. It arises when proposed values of the parameters, once given as input to the generative model, rarely lead to simulations resembling the observed data and are hence discarded. Such “poor” parameter proposals do not contribute at all to the representation of the parameter’s posterior distribution. This leads to a very large number of required simulations and/or a waste of computational resources, as well as to distortions in the computed posterior distribution. To mitigate this problem, we propose an algorithm, referred to as the Large Deviations Weighted Approximate Bayesian Computation algorithm, where, via Sanov’s Theorem, strictly positive weights are computed for all proposed parameters, thus avoiding the rejection step altogether. In order to derive a computable asymptotic approximation from Sanov’s result, we adopt the information theoretic “method of types” formulation of the method of Large Deviations, thus restricting our attention to models for i.i.d. discrete random variables. Finally, we experimentally evaluate our method through a proof-of-concept implementation.

Download Full-text

Inference of Brain Networks with Approximate Bayesian Computation – assessing face validity with an example application in Parkinsonism

NeuroImage ◽

10.1016/j.neuroimage.2021.118020 ◽

2021 ◽

pp. 118020

Author(s):

Timothy O. West ◽

Luc Berthouze ◽

Simon F. Farmer ◽

Hayriye Cagnan ◽

Vladimir Litvak

Keyword(s):

Approximate Bayesian Computation ◽

Brain Networks ◽

Face Validity ◽

Bayesian Computation ◽

Approximate Bayesian

Download Full-text

Flow parameter estimation using laser absorption spectroscopy and approximate Bayesian computation

Experiments in Fluids ◽

10.1007/s00348-020-03122-2 ◽

2021 ◽

Vol 62 (2) ◽

Author(s):

Jason D. Christopher ◽

Olga A. Doronina ◽

Dan Petrykowski ◽

Torrey R. S. Hayden ◽

Caelan Lapointe ◽

...

Keyword(s):

Parameter Estimation ◽

Absorption Spectroscopy ◽

Approximate Bayesian Computation ◽

Flow Parameter ◽

Bayesian Computation ◽

Laser Absorption Spectroscopy ◽

Laser Absorption ◽

Approximate Bayesian

Download Full-text

Approximate Bayesian Computation for Discrete Spaces

Entropy ◽

10.3390/e23030312 ◽

2021 ◽

Vol 23 (3) ◽

pp. 312

Author(s):

Ilze A. Auzina ◽

Jakub M. Tomczak

Keyword(s):

Approximate Bayesian Computation ◽

Likelihood Function ◽

Real Life ◽

Random Variables ◽

Bayesian Computation ◽

Inference Problem ◽

Markov Kernel ◽

Inference Problems ◽

Binary Neural Network ◽

Approximate Bayesian

Many real-life processes are black-box problems, i.e., the internal workings are inaccessible or a closed-form mathematical expression of the likelihood function cannot be defined. For continuous random variables, likelihood-free inference problems can be solved via Approximate Bayesian Computation (ABC). However, an optimal alternative for discrete random variables is yet to be formulated. Here, we aim to fill this research gap. We propose an adjusted population-based MCMC ABC method by re-defining the standard ABC parameters to discrete ones and by introducing a novel Markov kernel that is inspired by differential evolution. We first assess the proposed Markov kernel on a likelihood-based inference problem, namely discovering the underlying diseases based on a QMR-DTnetwork and, subsequently, the entire method on three likelihood-free inference problems: (i) the QMR-DT network with the unknown likelihood function, (ii) the learning binary neural network, and (iii) neural architecture search. The obtained results indicate the high potential of the proposed framework and the superiority of the new Markov kernel.

Download Full-text