The method of weighted likelihood functions

Author(s):  
Hans Rudolf Lerche


Geosciences ◽
2021 ◽  
Vol 11 (4) ◽  
pp. 150
Author(s):  
Nilgün Güdük ◽  
Miguel de la Varga ◽  
Janne Kaukolinna ◽  
Florian Wellmann

Structural geological models are widely used to represent relevant geological interfaces and property distributions in the subsurface. Considering the inherent uncertainty of these models, the non-uniqueness of geophysical inverse problems, and the growing availability of data, there is a need for methods that integrate different types of data consistently and consider the uncertainties quantitatively. Probabilistic inference provides a suitable tool for this purpose. Using a Bayesian framework, geological modeling can be considered an integral part of the inversion and thereby naturally constrain geophysical inversion procedures. This integration prevents geologically unrealistic results and provides the opportunity to include geological and geophysical information in the inversion. This information can come from different sources and is added to the framework through likelihood functions. We applied this methodology to the structurally complex Kevitsa deposit in Finland. We started with an interpretation-based 3D geological model and defined the uncertainties in our geological model through probability density functions. Airborne magnetic data and geological interpretations of borehole data were used to define geophysical and geological likelihoods, respectively. The geophysical data were linked to the uncertain structural parameters through the rock properties. The result of the inverse problem was an ensemble of realized models. These structural models and their uncertainties are visualized using information entropy, which allows for quantitative analysis. Our results show that, with our methodology, well-defined likelihood functions can add meaningful information to the initial model without requiring a computationally heavy full-grid inversion, that discrepancies between model and data are spotted more easily, and that the complementary strengths of different types of data can be integrated into one framework.
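The core mechanics of the approach, constraining an uncertain structural parameter with several likelihoods at once, can be sketched in a few lines. The toy Metropolis sampler below is purely illustrative: the single interface depth `z`, the linear "magnetic" forward model, and all numeric values are hypothetical stand-ins, not taken from the Kevitsa study.

```python
import numpy as np

# Hypothetical setup: one uncertain interface depth z with a Gaussian prior,
# a "magnetic" datum that depends linearly on z, and a "borehole" pick of z.
rng = np.random.default_rng(0)
z_true = 120.0
obs_mag = 0.05 * z_true + rng.normal(0.0, 0.2)   # synthetic magnetic datum
obs_bore = z_true + rng.normal(0.0, 5.0)         # synthetic borehole pick

def log_post(z):
    lp = -0.5 * ((z - 100.0) / 30.0) ** 2              # prior: N(100, 30^2)
    lp += -0.5 * ((obs_mag - 0.05 * z) / 0.2) ** 2     # geophysical likelihood
    lp += -0.5 * ((obs_bore - z) / 5.0) ** 2           # geological likelihood
    return lp

# Plain Metropolis sampling over the structural parameter yields an ensemble
# of realized models, as described in the abstract.
samples, z = [], 100.0
for _ in range(20000):
    zp = z + rng.normal(0.0, 5.0)                      # random-walk proposal
    if np.log(rng.uniform()) < log_post(zp) - log_post(z):
        z = zp
    samples.append(z)
samples = np.array(samples[5000:])                     # drop burn-in
```

The posterior ensemble concentrates near the value jointly supported by prior, magnetic, and borehole terms, which is the sense in which the likelihoods "add information" without a full grid inversion.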


2021 ◽  
pp. 1-25
Author(s):  
Yu-Chin Hsu ◽  
Ji-Liang Shiu

Under a Mundlak-type correlated random effect (CRE) specification, we first show that the average likelihood of a parametric nonlinear panel data model is the convolution of the conditional distribution of the model and the distribution of the unobserved heterogeneity. Hence, the distribution of the unobserved heterogeneity can be recovered by means of a Fourier transformation without imposing a distributional assumption on the CRE specification. We subsequently construct a semiparametric family of average likelihood functions of observables by combining the conditional distribution of the model and the recovered distribution of the unobserved heterogeneity, and show that the parameters in the nonlinear panel data model and in the CRE specification are identifiable. Based on the identification result, we propose a sieve maximum likelihood estimator. Compared with conventional parametric CRE approaches, the advantage of our method is that it is not subject to misspecification of the distribution of the CRE. Furthermore, we show that the average partial effects are identifiable and extend our results to dynamic nonlinear panel data models.
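The Fourier step, recovering the heterogeneity distribution by deconvolution, can be illustrated with a toy Gaussian example (the distributions here are hypothetical stand-ins, not the paper's panel model): the empirical characteristic function of the observable is divided by the known characteristic function of the conditional part.

```python
import numpy as np

rng = np.random.default_rng(1)
a = rng.normal(2.0, 0.5, size=100000)        # unobserved heterogeneity
y = a + rng.normal(0.0, 1.0, size=a.size)    # observable: a convolution of the two

t = np.linspace(-2.0, 2.0, 41)
cf_y = np.array([np.exp(1j * ti * y).mean() for ti in t])  # empirical cf of y
cf_e = np.exp(-0.5 * t ** 2)                 # known cf of the N(0, 1) part
cf_a = cf_y / cf_e                           # recovered cf of the heterogeneity

# For comparison: the true N(2, 0.5^2) characteristic function.
cf_true = np.exp(1j * 2.0 * t - 0.125 * t ** 2)
```

Dividing characteristic functions inverts the convolution; in practice the division is only stable where the denominator is well away from zero, which is why `t` is restricted to a bounded window here.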


Genetics ◽  
1997 ◽  
Vol 147 (4) ◽  
pp. 1855-1861 ◽  
Author(s):  
Montgomery Slatkin ◽  
Bruce Rannala

Abstract A theory is developed that provides the sampling distribution of low-frequency alleles at a single locus under the assumption that each allele is the result of a unique mutation. The number of copies of each allele is assumed to follow a linear birth-death process with sampling. If the population is of constant size, standard results from the theory of birth-death processes show that the distribution of the number of copies of each allele is logarithmic and that the joint distribution of the numbers of copies of k alleles found in a sample of size n follows the Ewens sampling distribution. If the population from which the sample was obtained was increasing in size, if there are different selective classes of alleles, or if there are differences in penetrance among alleles, the Ewens distribution no longer applies. Likelihood functions for a given set of observations are obtained under different alternative hypotheses. These results are applied to published data from the BRCA1 locus (associated with early-onset breast cancer) and the factor VIII locus (associated with hemophilia A) in humans. In both cases, the sampling distribution of alleles allows rejection of the null hypothesis, but relatively small deviations from the null model can account for the data. In particular, roughly the same population growth rate appears consistent with both data sets.
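The Ewens sampling distribution invoked above is easy to write down and check numerically. This standalone sketch (not the authors' likelihood code) computes the probability of an allelic configuration and verifies that the probabilities sum to one over all partitions of a small sample.

```python
from math import factorial, prod
from collections import Counter

def partitions(n, max_part=None):
    """Yield all integer partitions of n as lists of parts."""
    if n == 0:
        yield []
        return
    if max_part is None:
        max_part = n
    for k in range(min(n, max_part), 0, -1):
        for rest in partitions(n - k, k):
            yield [k] + rest

def ewens_prob(part, theta):
    """Ewens sampling formula for a configuration of allele copy numbers."""
    n = sum(part)
    rising = prod(theta + i for i in range(n))   # theta (theta+1) ... (theta+n-1)
    p = factorial(n) / rising
    for j, aj in Counter(part).items():          # aj alleles with j copies each
        p *= (theta / j) ** aj / factorial(aj)
    return p

# The probabilities over all allelic configurations of a sample of 6 sum to 1.
total = sum(ewens_prob(p, theta=2.0) for p in partitions(6))
```

For instance, the all-singletons configuration of six alleles has probability theta^6 divided by the rising factorial, 64/5040 for theta = 2, which the function reproduces.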


METRON ◽  
2021 ◽  
Author(s):  
Giovanni Saraceno ◽  
Claudio Agostinelli ◽  
Luca Greco

Abstract A weighted likelihood technique for robust estimation of multivariate wrapped distributions of data points scattered on a p-dimensional torus is proposed. The occurrence of outliers in the sample at hand can badly compromise inference under standard techniques such as the maximum likelihood method. There is therefore a need to handle such model inadequacies in the fitting process through a robust technique that effectively downweights observations not following the assumed model. The use of a robust method can also help in situations of hidden and unexpected substructures in the data. Here, it is suggested to build a set of data-dependent weights based on the Pearson residuals and to solve the corresponding weighted likelihood estimating equations. In particular, robust estimation is carried out by using a Classification EM algorithm whose M-step is enhanced by the computation of weights based on the current parameter values. The finite-sample behavior of the proposed method has been investigated through a Monte Carlo numerical study and real data examples.
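A minimal one-dimensional sketch of the weighting idea follows, using ordinary normal data rather than wrapped distributions and a Hellinger residual adjustment function as one common choice: Pearson residuals compare a kernel density estimate with the assumed model, and observations with large residuals are downweighted in the estimating equation.

```python
import numpy as np

rng = np.random.default_rng(2)
x = np.concatenate([rng.normal(0.0, 1.0, 180), np.full(20, 10.0)])  # 10% outliers

def normal_pdf(u, mu, sd=1.0):
    return np.exp(-0.5 * ((u - mu) / sd) ** 2) / (sd * np.sqrt(2.0 * np.pi))

# Kernel density estimate of the data, which enters the Pearson residuals.
h = 1.06 * np.std(x) * x.size ** (-0.2)          # Silverman-type bandwidth
fhat = normal_pdf(x[:, None], x[None, :], h).mean(axis=1)

mu = x.mean()                                    # start from the non-robust estimate
for _ in range(50):
    delta = fhat / normal_pdf(x, mu) - 1.0       # Pearson residuals
    A = 2.0 * (np.sqrt(delta + 1.0) - 1.0)       # Hellinger residual adjustment
    w = np.clip((A + 1.0) / (delta + 1.0), 0.0, 1.0)
    mu = np.sum(w * x) / np.sum(w)               # weighted likelihood update of the mean
```

Observations consistent with the model keep weights near one, while the outliers at 10 receive essentially zero weight, so the fixed point of the weighted estimating equation sits near the bulk of the data.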


2013 ◽  
Vol 45 (1) ◽  
pp. 164-185 ◽  
Author(s):  
Pavel V. Gapeev ◽  
Albert N. Shiryaev

We study the Bayesian problems of detecting a change in the drift rate of an observable diffusion process with linear and exponential penalty costs for a detection delay. The optimal times of alarms are found as the first times at which the weighted likelihood ratios hit stochastic boundaries depending on the current observations. The proof is based on the reduction of the initial problems to appropriate three-dimensional optimal stopping problems and the analysis of the associated parabolic-type free-boundary problems. We provide closed-form estimates for the value functions and the boundaries, under certain nontrivial relations between the coefficients of the observable diffusion.
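The flavor of such likelihood-ratio detection rules can be conveyed with a discretized toy example. The sketch below is a CUSUM-type (minimax, not Bayesian) rule for a drift change in a discretized diffusion, with an arbitrary constant threshold standing in for the stochastic boundaries derived in the paper; all numbers are illustrative.

```python
import numpy as np

rng = np.random.default_rng(3)
dt, mu1, tau = 0.01, 2.0, 2000            # drift switches from 0 to mu1 at step tau
inc = rng.normal(0.0, np.sqrt(dt), 4000)  # Brownian increments of the observation
inc[tau:] += mu1 * dt                     # add the post-change drift

# CUSUM statistic: running log-likelihood ratio of "drift mu1" vs "drift 0",
# floored at zero; an alarm is raised when it crosses the threshold.
s, alarm = 0.0, None
for k, dx in enumerate(inc):
    s = max(0.0, s + mu1 * dx - 0.5 * mu1 ** 2 * dt)
    if s > 8.0:
        alarm = k
        break
```

Before the change the statistic drifts back toward zero; after the change it grows roughly linearly until it hits the threshold, which is the hitting-time structure the optimal rules share.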


2018 ◽  
Vol 30 (11) ◽  
pp. 3072-3094 ◽  
Author(s):  
Hongqiao Wang ◽  
Jinglai Li

We consider Bayesian inference problems with computationally intensive likelihood functions. We propose a Gaussian process (GP)–based method to approximate the joint distribution of the unknown parameters and the data, built on recent work (Kandasamy, Schneider, & Póczos, 2015). In particular, we write the joint density approximately as a product of an approximate posterior density and an exponentiated GP surrogate. We then provide an adaptive algorithm to construct such an approximation, where an active learning method is used to choose the design points. With numerical examples, we illustrate that the proposed method has competitive performance against existing approaches for Bayesian computation.
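The two ingredients, a GP surrogate for an expensive log-likelihood and variance-based active learning to choose design points, can be sketched with a toy one-dimensional target. The kernel, length-scale, and target below are illustrative choices, not the paper's.

```python
import numpy as np

def loglik(theta):
    """Stand-in for an expensive log-likelihood (hypothetical toy target)."""
    return -0.5 * ((theta - 1.0) / 0.5) ** 2

def kern(a, b, ls=0.8):
    """Squared-exponential covariance."""
    return np.exp(-0.5 * (a[:, None] - b[None, :]) ** 2 / ls ** 2)

grid = np.linspace(-1.0, 3.0, 201)
X = np.array([-1.0, 1.0, 3.0])                  # initial design points
for _ in range(12):                             # active-learning loop
    y = loglik(X)                               # "expensive" evaluations
    K = kern(X, X) + 1e-6 * np.eye(X.size)      # jitter for numerical stability
    Ks = kern(grid, X)
    mean = Ks @ np.linalg.solve(K, y)           # GP posterior mean on the grid
    var = 1.0 - np.einsum('ij,ji->i', Ks, np.linalg.solve(K, Ks.T))
    X = np.append(X, grid[np.argmax(var)])      # query where the GP is least sure
```

Each round spends one expensive evaluation where the surrogate is most uncertain, so the GP mean becomes an increasingly faithful proxy for the log-likelihood that downstream Bayesian computation can use.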


2014 ◽  
Author(s):  
Sean Ruddy ◽  
Marla Johnson ◽  
Elizabeth Purdom

The prevalence of sequencing experiments in genomics has led to an increased use of count-data methods in analyzing high-throughput genomic data. Shrinkage methods remain important for improving the performance of the statistical methods involved. A common example is gene expression data, where the counts per gene are often modeled as some form of an over-dispersed Poisson. In this case, shrinkage estimates of the per-gene dispersion parameter have led to improved estimation of dispersion when the number of samples is small. We address a different count setting introduced by the use of sequencing data: comparing differential proportional usage via an over-dispersed binomial model. This is motivated by our interest in testing for differential exon skipping in mRNA-Seq experiments. We introduce a novel method developed by modeling the dispersion based on the double binomial distribution proposed by Efron (1986). Our method (WEB-Seq) is an empirical Bayes strategy that produces a shrunken estimate of the dispersion, effectively detects differential proportional usage, and has close ties to the weighted-likelihood strategy of edgeR developed for gene expression data (Robinson and Smyth, 2007; Robinson et al., 2010). We analyze its behavior on simulated and real data sets and show that our method is fast and powerful and gives accurate control of the FDR compared to alternative approaches. We provide an implementation of our methods in the R package DoubleExpSeq, available on CRAN.
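The dispersion-shrinkage idea can be sketched in the edgeR weighted-likelihood spirit; this is a moment-based toy, not WEB-Seq's double-binomial machinery. Noisy per-gene dispersion estimates are pulled toward a common value with a chosen prior weight, reducing mean squared error when replicates are few.

```python
import numpy as np

rng = np.random.default_rng(4)
G, n = 500, 4                   # genes and samples; few replicates, as is typical
mu, phi = 100.0, 0.1            # NB mean and true dispersion (var = mu + phi * mu^2)
r = 1.0 / phi                   # NB size parameter
counts = rng.negative_binomial(r, r / (r + mu), size=(G, n))

m = counts.mean(axis=1)
v = counts.var(axis=1, ddof=1)
phi_hat = np.clip((v - m) / m ** 2, 0.0, None)    # noisy per-gene moment estimate

prior = np.median(phi_hat)                        # common dispersion across genes
w_prior = 10.0                                    # prior weight (pseudo-observations)
phi_shrunk = (w_prior * prior + n * phi_hat) / (w_prior + n)

mse_raw = np.mean((phi_hat - phi) ** 2)
mse_shrunk = np.mean((phi_shrunk - phi) ** 2)
```

The weighted average trades a little bias toward the common dispersion for a large reduction in variance, which is the shrinkage effect the abstract describes.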

