Normal approximation for mixtures of normal distributions and the evolution of phenotypic traits

Krzysztof Bartoszek; Torkel Erhardsson

doi:10.1017/apr.2020.53

Normal approximation for mixtures of normal distributions and the evolution of phenotypic traits

Advances in Applied Probability ◽

10.1017/apr.2020.53 ◽

2021 ◽

Vol 53 (1) ◽

pp. 162-188

Author(s):

Krzysztof Bartoszek ◽

Torkel Erhardsson

Keyword(s):

Normal Distribution ◽

Phenotypic Trait ◽

Phenotypic Traits ◽

Normal Distributions ◽

Average Value ◽

Conditional Moments ◽

Mixture Of Normal Distributions ◽

Explicit Bounds ◽

Mixtures Of Normal Distributions ◽

Parameter Values

AbstractExplicit bounds are given for the Kolmogorov and Wasserstein distances between a mixture of normal distributions, by which we mean that the conditional distribution given some $\sigma$ -algebra is normal, and a normal distribution with properly chosen parameter values. The bounds depend only on the first two moments of the first two conditional moments given the $\sigma$ -algebra. The proof is based on Stein’s method. As an application, we consider the Yule–Ornstein–Uhlenbeck model, used in the field of phylogenetic comparative methods. We obtain bounds for both distances between the distribution of the average value of a phenotypic trait over n related species, and a normal distribution. The bounds imply and extend earlier limit theorems by Bartoszek and Sagitov.

Download Full-text

The Distribution of Semantic Fields in Author’s Texts

Cybernetics and Information Technologies ◽

10.1515/cait-2016-0043 ◽

2016 ◽

Vol 16 (3) ◽

pp. 195-204

Author(s):

Bohdan Pavlyshenko

Keyword(s):

Vector Space ◽

Normal Distribution ◽

Frequency Distribution ◽

Null Hypothesis ◽

T Test ◽

Normal Distributions ◽

Semantic Fields ◽

Mixture Of Normal Distributions ◽

Student’S T ◽

Student’S T Test

Abstract The paper describes the analysis of frequency distribution of semantic fields of nouns and verbs in the texts of English fiction. To such distributions, we applied Shapiro-Wilk test. The null hypothesis of normal distribution of semantic fields frequencies in the array of texts under analysis is rejected for some semantic fields. This makes it possible to consider the frequency distribution of semantic fields as a categorized mixture of normal distributions. As a factor of categorization, we chose text authorship. We divided the author’s categories with rejected hypothesis of normal distribution into subcategories with normal distribution. Paired Student’s t-test for the distributions of semantic fields in the texts of different authors revealed a measure of authorship representation in the structure of semantic fields. The analysis of the results showed that the author’s idiolect is represented in the vector space of semantic fields. Such a space can be used in the analysis of the authorship and author’s idiolect of texts.

Download Full-text

Finite mixtures of normal distributions in the study of the error in altimetry

Advances in Cartography and GIScience of the ICA ◽

10.5194/ica-adv-3-13-2021 ◽

2021 ◽

Vol 3 ◽

pp. 1-8

Author(s):

José Rodríguez-Avi ◽

Francisco Javier Ariza-López

Keyword(s):

Finite Mixtures ◽

Mixture Of Distributions ◽

Normal Distributions ◽

Mixture Of Normal Distributions ◽

Mixtures Of Normal Distributions

Abstract. The modelling of the altimetric error is proposed by means of the mixture of normal distributions. This alternative allows to avoid the problems of lack of normality of the altimetric error and that have been indicated numerous times. The conceptual bases of the mixture of distributions are presented and its application is demonstrated with an applied example. In the example, the altimetric errors existing between a DEM with 5 × 5 m resolution and another DEM with 2 × 2 m resolution are modelled, which is considered as a reference. The application demonstrates the feasibility and power of analysis of the proposal made.

Download Full-text

ESTIMATION OF THE FROCINI CRITERIA AND OMEGA SQUARE CRITERIA STATISTICS BY THE STATISTICAL TESTS METHOD FOR A MIXTURE OF NORMAL DISTRIBUTIONS

Siberian Journal of Science and Technology ◽

10.31772/2587-6066-2019-20-1-28-34 ◽

2019 ◽

Vol 20 (1) ◽

pp. 28-34

Author(s):

S. V. Ushanov ◽

◽

D. A. Ogurtsov ◽

Keyword(s):

Statistical Tests ◽

Normal Distributions ◽

Mixture Of Normal Distributions

Download Full-text

Quantiles for finite mixtures of normal distributions

International Journal of Mathematical Education in Science and Technology ◽

10.1080/00207390500433103 ◽

2006 ◽

Vol 37 (3) ◽

pp. 352-358 ◽

Cited By ~ 3

Author(s):

Mezbahur Rahman ◽

Rumanur Rahman ◽

Larry M. Pearson

Keyword(s):

Finite Mixtures ◽

Normal Distributions ◽

Mixtures Of Normal Distributions

Download Full-text

NON-NORMAL STATISTICAL TOLERANCE ANALYSIS USING ANALYTICAL CONVOLUTION METHOD

Journal of Advanced Manufacturing Systems ◽

10.1142/s0219686708001218 ◽

2008 ◽

Vol 07 (01) ◽

pp. 127-130 ◽

Cited By ~ 2

Author(s):

S. G. LIU ◽

P. WANG ◽

Z. G. LI

Keyword(s):

Normal Distribution ◽

Closed Loop ◽

Tolerance Analysis ◽

Accurate Method ◽

Probability Density Functions ◽

Triangular Distribution ◽

Normal Distributions ◽

Statistical Tolerance ◽

Statistical Tolerance Analysis ◽

Convolution Method

In statistical tolerance analysis, it is usually assumed that the statistical tolerance is normally distributed. But in practice, there are many non-normal distributions, such as uniform distribution, triangular distribution, etc. The simple way to analyze non-normal distributions is to approximately represent it with normal distribution, but the accuracy is low. Monte-Carlo simulation can analyze non-normal distributions with higher accuracy, but is time consuming. Convolution method is an accurate method to analyze statistical tolerance, but there are few reported works about it because of the difficulty. In this paper, analytical convolution is used to analyze non-normal distribution, and the probability density functions of closed loop component are obtained. Comparing with other methods, convolution method is accurate and faster.

Download Full-text

Using Mixture of Normal Distributions to Detect Treatment Effects when the Frequentist Method Fails

RAS Oncology & Therapy ◽

10.51520/2766-2586-9 ◽

2021 ◽

Vol 2 (1) ◽

Author(s):

Anthony Orlando

Keyword(s):

Dose Response ◽

Linear Models ◽

Controlled Trial ◽

Response To Treatment ◽

Double Blind ◽

Underlying Assumption ◽

Normal Distributions ◽

Efficacy Measure ◽

Baseline Weight ◽

Mixture Of Normal Distributions

Background: Results from a clinical trial can either support the efficacy and safety of a new compound or fail to provide such evidence. One reason for ‘non[1]positive’ result is due to the underlying assumption of normality and homogeneity of variances, which are quite often violated when analyzing data from clinical trials, despite randomization. A question of interest is can we obtain more informative results when using mixture of normal distributions or linear models (MLMs) in such cases. Introduction: MLM can be used when traditional methods fail. MLMs “search” within the variability in data to identify components or subgroups of individuals (also known as latent classes) who have common intercepts and common slopes of change in a variable/endpoint of interest but whose intercepts and slopes are different from other subsets of patients. Thus, MLMs can be used to identify subgroups of patients exhibiting differential response to treatment within each treatment arm. The purpose of our study was to examine the usefulness of using MLM in such circumstances. Methods: Data of 155 subjects taken from a Multicenter, randomized, double blind, placebo controlled trial that evaluated the efficacy of Cpn10, administered twice weekly subcutaneously to treat Rheumatoid Arthritis was taken to evaluate the usefulness of MLM. The primary efficacy measure ACR20 was analyzed using a 3-step process: first, MLM was used to estimate RA duration using a 3-component model. The second step took the results of the first step to inform the logistic model and its analyses. Model was fitted with an intercept, MLM components, treatment arm, RA duration (linear and quadratic), dose response (modeled as an interaction effect), age and baseline weight. LOCF was used to impute for missing data. Data was analyzed using MLM and SAS v 9.0. Results: The model was a good fit to the data with a likelihood ratio significant at p=0.026, and a significant increase in the -2log L. We also observed low p-values for those variables that were non normal. Overall and for the 75 mg dose, Cpn 10 was efficacious relative to placebo, p<0.050. We also observed that dose response was significant at p><0.15 Conclusion: The use of MLM adds value because it can be used to understand the disease experience or the value of treatment when traditional statistical methods cannot. Key words: Mixture of linear models, normality, entropy.

Download Full-text

Mean mixtures of normal distributions: properties, inference and application

Metrika ◽

10.1007/s00184-018-0692-x ◽

2018 ◽

Vol 82 (4) ◽

pp. 501-528 ◽

Cited By ~ 3

Author(s):

Hossein Negarestani ◽

Ahad Jamalizadeh ◽

Sobhan Shafiei ◽

Narayanaswamy Balakrishnan

Keyword(s):

Normal Distributions ◽

Mixtures Of Normal Distributions

Download Full-text

How to Distinguish Languages and Dialects

Computational Linguistics ◽

10.1162/coli_a_00366 ◽

2020 ◽

Vol 45 (4) ◽

pp. 823-831 ◽

Cited By ~ 2

Author(s):

Søren Wichmann

Keyword(s):

Temporal Distance ◽

Strong Tendency ◽

Data Sets ◽

Language Group ◽

Lexical Information ◽

Normal Distributions ◽

Mixture Of Normal Distributions ◽

The Mean ◽

One To One

The terms “language” and “dialect” are ingrained, but linguists nevertheless tend to agree that it is impossible to apply a non-arbitrary distinction such that two speech varieties can be identified as either distinct languages or two dialects of one and the same language. A database of lexical information for more than 7,500 speech varieties, however, unveils a strong tendency for linguistic distances to be bimodally distributed. For a given language group the linguistic distances pertaining to either cluster can be teased apart, identifying a mixture of normal distributions within the data and then separating them fitting curves and finding the point where they cross. The thresholds identified are remarkably consistent across data sets, qualifying their mean as a universal criterion for distinguishing between language and dialect pairs. The mean of the thresholds identified translates into a temporal distance of around one to one-and-a-half millennia (1,075–1,635 years).

Download Full-text

The failure rate properties of a bimodal mixture of normal distributions in an unequal variance case

Statistics & Probability Letters ◽

10.1016/j.spl.2008.01.083 ◽

2008 ◽

Vol 78 (14) ◽

pp. 2006-2009 ◽

Cited By ~ 1

Author(s):

Fuxiang Liu ◽

Yanyan Liu

Keyword(s):

Failure Rate ◽

Unequal Variance ◽

Normal Distributions ◽

Mixture Of Normal Distributions

Download Full-text

Hierarchical models with scale mixtures of normal distributions

Test ◽

10.1007/bf02564434 ◽

1997 ◽

Vol 6 (1) ◽

pp. 205-221 ◽

Cited By ~ 36

Author(s):

S. T. B. Choy ◽

A. F. M. Smith

Keyword(s):

Hierarchical Models ◽

Normal Distributions ◽

Scale Mixtures ◽

Mixtures Of Normal Distributions

Download Full-text