scholarly journals Bayesian Nonparametric Inference of Population Size Changes from Sequential Genealogies

2015 ◽  
Author(s):  
Julia A Palacios ◽  
John Wakeley ◽  
Sohini Ramachandran

Sophisticated inferential tools coupled with the coalescent model have recently emerged for estimating past population sizes from genomic data. Accurate methods are available for data from a single locus or from independent loci. Recent methods that model recombination require small sample sizes, make constraining assumptions about population size changes, and do not report measures of uncertainty for estimates. Here, we develop a Gaussian process-based Bayesian nonparametric method coupled with a sequentially Markov coalescent model which allows accurate inference of population sizes over time from a set of genealogies. In contrast to current methods, our approach considers a broad class of recombination events, including those that do not change local genealogies. We show that our method outperforms recent likelihood-based methods that rely on discretization of the parameter space. We illustrate the application of our method to multiple demographic histories, including population bottlenecks and exponential growth. In simulation, our Bayesian approach produces point estimates four times more accurate than maximum likelihood estimation (based on the sum of absolute differences between the truth and the estimated values). Further, our method's credible intervals for population size as a function of time cover 90 percent of true values across multiple demographic scenarios, enabling formal hypothesis testing about population size differences over time. Using genealogies estimated with ARGweaver, we apply our method to European and Yoruban samples from the 1000 Genomes Project and confirm key known aspects of population size history over the past 150,000 years.

1987 ◽  
Vol 1 (3) ◽  
pp. 349-366
Author(s):  
Jaxk H. Reeves ◽  
Ashim Mallik ◽  
William P. McCormick

A sequential procedure to select optimal prices based on maximum likelihood estimation is considered. Asymptotic properties of the pricing scheme and the concommitant estimation problem are examined. For small sample sizes, simulation results show that the proposed procedure has high efficiency relative to the best procedure when the parameter is known.


1996 ◽  
Vol 12 (1) ◽  
pp. 1-29 ◽  
Author(s):  
Richard A. Davis ◽  
William T.M. Dunsmuir

This paper considers maximum likelihood estimation for the moving average parameter θ in an MA(1) model when θ is equal to or close to 1. A derivation of the limit distribution of the estimate θLM, defined as the largest of the local maximizers of the likelihood, is given here for the first time. The theory presented covers, in a unified way, cases where the true parameter is strictly inside the unit circle as well as the noninvertible case where it is on the unit circle. The asymptotic distribution of the maximum likelihood estimator subMLE is also described and shown to differ, but only slightly, from that of θLM. Of practical significance is the fact that the asymptotic distribution for either estimate is surprisingly accurate even for small sample sizes and for values of the moving average parameter considerably far from the unit circle.


Author(s):  
Francisco C. Ceballos ◽  
Kanat Gürün ◽  
N. Ezgi Altınışık ◽  
Hasan Can Gemici ◽  
Cansu Karamurat ◽  
...  

SummaryThe history of human inbreeding is controversial. The development of sedentary agricultural societies may have had opposite influences on inbreeding levels. On the one hand, agriculture and food surplus may have diminished inbreeding by increasing population sizes and lowering endogamy, i.e. inbreeding due to population isolation. On the other hand, increased sedentism, as well as the advent of private property may have promoted inbreeding through the emergence of consanguineous marriage customs or via ethnic and caste endogamy. The net impact is unknown, and to date, no systematic study on the temporal frequency of inbreeding in human societies has been conducted. Here we present a new approach for reliable estimation of runs of homozygosity (ROH) in genomes with ≥3x mean coverage across >1 million SNPs, and apply this to 440 ancient Eurasian genomes from the last 15,000 years. We show that the frequency of inbreeding, as measured by ROH, has decreased over time. The strongest effect is associated with the Neolithic transition, but the trend has since continued, indicating a population size effect on inbreeding prevalence. We further show that most inbreeding in our historical sample can be attributed to endogamy, although singular cases of high consanguinity can also be found in the archaeogenomic record.HighlightsA study of 440 ancient genomes shows inbreeding decreased over time.The decrease appears linked with population size increase due to farming.Extreme consanguineous matings did occur among farmers, but rarely.


2019 ◽  
Author(s):  
Elodie M Garnier ◽  
Nastasia Fouret ◽  
Médéric Descoins

The scientific community encourages the use of raw data graphs to improve the reliability and transparency of the results presented in articles. However, the current methods used to visualize raw data are limited to one or two numerical variables per graph and/or small sample sizes. In the behavioral sciences, numerous variables must be plotted together in order to gain insight into the behavior in question. In this paper, we present ViSiElse, an R-package offering a new approach in the visualization of raw data. ViSiElse was developed with the open-source software R to visualize behavioral observations over time based on raw time data extracted from visually recorded sessions of experimental observations. ViSiElse gives a global overview of a process by creating a visualization of the timestamps for multiple actions and all participants into a single graph; individual or group behavior can then be easily assessed. Additional features allow users to further inspect their data by including summary statistics and time constraints.


PeerJ ◽  
2020 ◽  
Vol 8 ◽  
pp. e8341
Author(s):  
Elodie M. Garnier ◽  
Nastasia Fouret ◽  
Médéric Descoins

The scientific community encourages the use of raw data graphs to improve the reliability and transparency of the results presented in articles. However, the current methods used to visualize raw data are limited to one or two numerical variables per graph and/or small sample sizes. In the behavioral sciences, numerous variables must be plotted together in order to gain insight into the behavior in question. In this article, we present ViSiElse, an R-package offering a new approach in the visualization of raw data. ViSiElse was developed with the open-source software R to visualize behavioral observations over time based on raw time data extracted from visually recorded sessions of experimental observations. ViSiElse gives a global overview of a process by creating a visualization of the timestamps for multiple actions and all participants into a single graph; individual or group behavior can then be easily assessed. Additional features allow users to further inspect their data by including summary statistics and time constraints.


2010 ◽  
Vol 28 (1) ◽  
pp. 317-338 ◽  
Author(s):  
Annette Tyree Debisette ◽  
Irene Sandvold ◽  
Barbara Easterling ◽  
Angela Martinelli

The purpose of this chapter is to present an analysis of selected published nursing workforce studies published between the years of 2005 and 2010. Thirteen nursing workforce studies were reviewed and analyzed using a modification of the method suggested by Ganong (1987). Nursing workforce studies were selected based on the following criteria: (1) the date of publication was between the years of 2005 and 2010; (2) the primary focus was on nurses working in practice; or, as students or faculty in nursing educational programs. When reviewed, the 13 studies (1) lacked uniform measures among databases; (2) lacked longitudinal studies that followed the respondent over time from the beginning of their career to retirement; (3) had response rates that contributed to small sample sizes or sampling frame that did not take into consideration all characteristics of interest; (4) lacked attention to an interdisciplinary mix of providers; and (5) implied the need for future study on intergenerational characteristics due to shifting demographics in the profession and nursing workforce.


PeerJ ◽  
2020 ◽  
Vol 8 ◽  
pp. e8502 ◽  
Author(s):  
Patcharee Maneerat ◽  
Sa-aat Niwitpong ◽  
Suparat Niwitpong

Natural disasters such as drought and flooding are the consequence of severe rainfall fluctuation, and rainfall amount data often contain both zero and positive observations, thus making them fit a delta-lognormal distribution. By way of comparison, rainfall dispersion may not be similar in enclosed regions if the topography and the drainage basin are different, so it can be evaluated by the ratio of variances. To estimate this, credible intervals using the highest posterior density based on the normal-gamma prior (HPD-NG) and the method of variance estimates recovery (MOVER) for the ratio of delta-lognormal variances are proposed. Monte Carlo simulation was used to assess the performance of the proposed methods in terms of coverage probability and relative average length. The results of the study reveal that HPD-NG performed very well and was able to meet the requirements in various situations, even with a large difference between the proportions of zeros. However, MOVER is the recommended method for equal small sample sizes. Natural rainfall datasets for the northern and northeastern regions of Thailand are used to illustrate the practical use of the proposed credible intervals.


2008 ◽  
Vol 15 (6) ◽  
pp. 1033-1039 ◽  
Author(s):  
P. Ribereau ◽  
A. Guillou ◽  
P. Naveau

Abstract. Since the pioneering work of Landwehr et al. (1979), Hosking et al. (1985) and their collaborators, the Probability Weighted Moments (PWM) method has been very popular, simple and efficient to estimate the parameters of the Generalized Extreme Value (GEV) distribution when modeling the distribution of maxima (e.g., annual maxima of precipitations) in the Identically and Independently Distributed (IID) context. When the IID assumption is not satisfied, a flexible alternative, the Maximum Likelihood Estimation (MLE) approach offers an elegant way to handle non-stationarities by letting the GEV parameters to be time dependent. Despite its qualities, the MLE applied to the GEV distribution does not always provide accurate return level estimates, especially for small sample sizes or heavy tails. These drawbacks are particularly true in some non-stationary situations. To reduce these negative effects, we propose to extend the PWM method to a more general framework that enables us to model temporal covariates and provide accurate GEV-based return levels. Theoretical properties of our estimators are discussed. Small and moderate sample sizes simulations in a non-stationary context are analyzed and two brief applications to annual maxima of CO2 and seasonal maxima of cumulated daily precipitations are presented.


2019 ◽  
Author(s):  
Elodie M Garnier ◽  
Nastasia Fouret ◽  
Médéric Descoins

The scientific community encourages the use of raw data graphs to improve the reliability and transparency of the results presented in articles. However, the current methods used to visualize raw data are limited to one or two numerical variables per graph and/or small sample sizes. In the behavioral sciences, numerous variables must be plotted together in order to gain insight into the behavior in question. In this paper, we present ViSiElse, an R-package offering a new approach in the visualization of raw data. ViSiElse was developed with the open-source software R to visualize behavioral observations over time based on raw time data extracted from visually recorded sessions of experimental observations. ViSiElse gives a global overview of a process by creating a visualization of the timestamps for multiple actions and all participants into a single graph; individual or group behavior can then be easily assessed. Additional features allow users to further inspect their data by including summary statistics and time constraints.


2016 ◽  
Vol 47 (4) ◽  
pp. 164-173
Author(s):  
T. Bubová ◽  
M. Kulma ◽  
V. Vrabec

AbstractIn recent decades, changes in meadows maintenance have reduced the populations of endangered butterfly speciesPhengaris nausithous(Bergsträsser, 1779) andP. teleius(Bergsträsser, 1779). Currently, meadows are either abandoned or intensively used. Unfortunately, both these managements are considered unfavourable for grassland butterfly species. In this study, the effect of suitable meadow management on population sizes of both the above mentionedPhengarisspecies was investigated. The experiment was performed at the locality Dolní Labe (Děčín, Czech Republic). The most suitable models, based on the lowest values of Akaike’s information criterion corrected for small sample sizes, were selected using MARK statistical software. The results were subsequently compared with data obtained from this locality prior to the management application. Unexpectedly, no significant positive effects were found. To reach the desirable status, suitable management practices should be applied for long-term. To verify the management effect on the population size, the meadows were divided into three groups: (i) application of favourable management, (ii) mowing in inappropriate term, (iii) without management. Based on the statistical evaluation, the management application proved to be the most favourable option for both studied butterflies species.


Sign in / Sign up

Export Citation Format

Share Document