scholarly journals Bias Adaptive Statistical Inference Learning Agents for Learning from Human Feedback

Author(s):  
Jonathan I Watson

We present a novel technique for learning behaviors from ahuman provided feedback signal that is distorted by system-atic bias. Our technique, which we refer to as BASIL, modelsthe feedback signal as being separable into a heuristic evalu-ation of the utility of an action and a bias value that is drawnfrom a parametric distribution probabilistically, where thedistribution is defined by unknown parameters. We presentthe general form of the technique as well as a specific algo-rithm for integrating the technique with the TAMER algo-rithm for bias values drawn from a normal distribution. Wetest our algorithm against standard TAMER in the domain ofTetris using a synthetic oracle that provides feedback undervarying levels of distortion. We find our algorithm can learnvery quickly under bias distortions that entirely stymie thelearning of classic TAMER.

Symmetry ◽  
2019 ◽  
Vol 11 (12) ◽  
pp. 1509
Author(s):  
Guillermo Martínez-Flórez ◽  
Artur J. Lemonte ◽  
Hugo S. Salinas

The univariate power-normal distribution is quite useful for modeling many types of real data. On the other hand, multivariate extensions of this univariate distribution are not common in the statistic literature, mainly skewed multivariate extensions that can be bimodal, for example. In this paper, based on the univariate power-normal distribution, we extend the univariate power-normal distribution to the multivariate setup. Structural properties of the new multivariate distributions are established. We consider the maximum likelihood method to estimate the unknown parameters, and the observed and expected Fisher information matrices are also derived. Monte Carlo simulation results indicate that the maximum likelihood approach is quite effective to estimate the model parameters. An empirical application of the proposed multivariate distribution to real data is provided for illustrative purposes.


2018 ◽  
Vol 10 (04) ◽  
pp. 1850009 ◽  
Author(s):  
Gamze Ozel ◽  
Emrah Altun ◽  
Morad Alizadeh ◽  
Mahdieh Mozafari

In this paper, a new heavy-tailed distribution is used to model data with a strong right tail, as often occuring in practical situations. The proposed distribution is derived from the log-normal distribution, by using odd log-logistic distribution. Statistical properties of this distribution, including hazard function, moments, quantile function, and asymptotics, are derived. The unknown parameters are estimated by the maximum likelihood estimation procedure. For different parameter settings and sample sizes, a simulation study is performed and the performance of the new distribution is compared to beta log-normal. The new lifetime model can be very useful and its superiority is illustrated by means of two real data sets.


Mathematics ◽  
2021 ◽  
Vol 9 (21) ◽  
pp. 2703
Author(s):  
Ke Wu ◽  
Liang Wang ◽  
Li Yan ◽  
Yuhlong Lio

In this paper, statistical inference and prediction issue of left truncated and right censored dependent competing risk data are studied. When the latent lifetime is distributed by Marshall–Olkin bivariate Rayleigh distribution, the maximum likelihood estimates of unknown parameters are established, and corresponding approximate confidence intervals are also constructed by using a Fisher information matrix and asymptotic approximate theory. Furthermore, Bayesian estimates and associated high posterior density credible intervals of unknown parameters are provided based on general flexible priors. In addition, when there is an order restriction between unknown parameters, the point and interval estimates based on classical and Bayesian frameworks are discussed too. Besides, the prediction issue of a censored sample is addressed based on both likelihood and Bayesian methods. Finally, extensive simulation studies are conducted to investigate the performance of the proposed methods, and two real-life examples are presented for illustration purposes.


Methodology ◽  
2009 ◽  
Vol 5 (1) ◽  
pp. 35-39
Author(s):  
Emilia I. De la Fuente ◽  
Gustavo R. Cañadas ◽  
Joan Guàrdia ◽  
Luis M. Lozano

After almost a century of debate among renowned statisticians, 21st century traditional Statistical Inference is marked by controversy over the application of the procedures for hypothesis estimation and contrast. The aim of this paper is twofold: First, to present various debatable issues that arise when the mean in a Normal distribution of known precision is contrasted and second, to argue the suitability of Bayesian philosophy for the analysis of research data.


2016 ◽  
Vol 84 (3) ◽  
pp. 371-389 ◽  
Author(s):  
Beate Franke ◽  
Jean‐François Plante ◽  
Ribana Roscher ◽  
En‐shiun Annie Lee ◽  
Cathal Smyth ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document