scholarly journals Exact distributions of statistics for making inferences on mixed models under the default covariance structure

Author(s):  
Samaradasa Weerahandi ◽  
Ching-Ray Yu
Data ◽  
2020 ◽  
Vol 5 (1) ◽  
pp. 6 ◽  
Author(s):  
Alberto Gianinetti

Germination data are discrete and binomial. Although analysis of variance (ANOVA) has long been used for the statistical analysis of these data, generalized linear mixed models (GzLMMs) provide a more consistent theoretical framework. GzLMMs are suitable for final germination percentages (FGP) as well as longitudinal studies of germination time-courses. Germination indices (i.e., single-value parameters summarizing the results of a germination assay by combining the level and rapidity of germination) and other data with a Gaussian error distribution can be analyzed too. There are, however, different kinds of GzLMMs: Conditional (i.e., random effects are modeled as deviations from the general intercept with a specific covariance structure), marginal (i.e., random effects are modeled solely as a variance/covariance structure of the error terms), and quasi-marginal (some random effects are modeled as deviations from the intercept and some are modeled as a covariance structure of the error terms) models can be applied to the same data. It is shown that: (a) For germination data, conditional, marginal, and quasi-marginal GzLMMs tend to converge to a similar inference; (b) conditional models are the first choice for FGP; (c) marginal or quasi-marginal models are more suited for longitudinal studies, although conditional models lead to a congruent inference; (d) in general, common random factors are better dealt with as random intercepts, whereas serial correlation is easier to model in terms of the covariance structure of the error terms; (e) germination indices are not binomial and can be easier to analyze with a marginal model; (f) in boundary conditions (when some means approach 0% or 100%), conditional models with an integral approximation of true likelihood are more appropriate; in non-boundary conditions, (g) germination data can be fitted with default pseudo-likelihood estimation techniques, on the basis of the SAS-based code templates provided here; (h) GzLMMs are remarkably good for the analysis of germination data except if some means are 0% or 100%. In this case, alternative statistical approaches may be used, such as survival analysis or linear mixed models (LMMs) with transformed data, unless an ad hoc data adjustment in estimates of limit means is considered, either experimentally or computationally. This review is intended as a basic tutorial for the application of GzLMMs, and is, therefore, of interest primarily to researchers in the agricultural sciences.


2018 ◽  
Vol 19 (1) ◽  
Author(s):  
Andrey Ziyatdinov ◽  
Miquel Vázquez-Santiago ◽  
Helena Brunel ◽  
Angel Martinez-Perez ◽  
Hugues Aschard ◽  
...  

2016 ◽  
Vol 25 (6) ◽  
pp. 2506-2520 ◽  
Author(s):  
Xicheng Fang ◽  
Jialiang Li ◽  
Weng Kee Wong ◽  
Bo Fu

Mixed-effects models are increasingly used in many areas of applied science. Despite their popularity, there is virtually no systematic approach for examining the homogeneity of the random-effects covariance structure commonly assumed for such models. We propose two tests for evaluating the homogeneity of the covariance structure assumption across subjects: one is based on the covariance matrices computed from the fitted model and the other is based on the empirical variation computed from the estimated random effects. We used simulation studies to compare performances of the two tests for detecting violations of the homogeneity assumption in the mixed-effects models and showed that they were able to identify abnormal clusters of subjects with dissimilar random-effects covariance structures; in particular, their removal from the fitted model might change the signs and the magnitudes of important predictors in the analysis. In a case study, we applied our proposed tests to a longitudinal cohort study of rheumatoid arthritis patients and compared their abilities to ascertain whether the assumption of covariance homogeneity for subject-specific random effects holds.


2021 ◽  
Author(s):  
Mohammed Sultan ◽  
Ritbano Ahmed

Abstract The linear mixed model is one of the common models used to analyze the longitudinal data;it may comprise of separate (Univariate), joint Bivariate, and joint Multivariate linear mixed model, which is predicted on the number of response variables incorporated in the analysis. Adjusting for correlation matrix and covariance matrix between and within subjects is one reason why modern longitudinal data analysis techniques are deemed more appropriate than some of the previous methods of analysis. Some studies assume that the correlation between observation is zero. However, it is unlikely that repeated measurements on the same individual Will actually be independent. To that end, comparing the different linear mixed models identifying the appropriate model demonstrates that the evolution of patients with congestive heart failure is necessary.In this study the separate, bivariate, and multivariate linear mixed models were compared with different covariance and correlation structures. Finally, a multivariate linear mixed model with autoregressive order one correlation structure and unstructured covariance structure for random effects, to consider within and between patient's variations, was considered as a best model to depict the evolution of patients with congestive heart failure.


2016 ◽  
Vol 14 (2) ◽  
pp. e0703 ◽  
Author(s):  
Marcin Studnicki ◽  
Wiesław Mądry ◽  
Kinga Noras ◽  
Elżbieta Wójcik-Gront ◽  
Edward Gacek

The main objectives of multi-environmental trials (METs) are to assess cultivar adaptation patterns under different environmental conditions and to investigate genotype by environment (G×E) interactions. Linear mixed models (LMMs) with more complex variance-covariance structures have become recognized and widely used for analyzing METs data. Best practice in METs analysis is to carry out a comparison of competing models with different variance-covariance structures. Improperly chosen variance-covariance structures may lead to biased estimation of means resulting in incorrect conclusions. In this work we focused on adaptive response of cultivars on the environments modeled by the LMMs with different variance-covariance structures. We identified possible limitations of inference when using an inadequate variance-covariance structure. In the presented study we used the dataset on grain yield for 63 winter wheat cultivars, evaluated across 18 locations, during three growing seasons (2008/2009-2010/2011) from the Polish Post-registration Variety Testing System. For the evaluation of variance-covariance structures and the description of cultivars adaptation to environments, we calculated adjusted means for the combination of cultivar and location in models with different variance-covariance structures. We concluded that in order to fully describe cultivars adaptive patterns modelers should use the unrestricted variance-covariance structure. The restricted compound symmetry structure may interfere with proper interpretation of cultivars adaptive patterns. We found, that the factor-analytic structure is also a good tool to describe cultivars reaction on environments, and it can be successfully used in METs data after determining the optimal component number for each dataset.


2011 ◽  
Vol 140 (8) ◽  
pp. 1439-1445 ◽  
Author(s):  
P. M. POLGREEN ◽  
J. D. SPARKS ◽  
L. A. POLGREEN ◽  
M. YANG ◽  
M. L. HARRIS ◽  
...  

SUMMARYIn order to characterize the association between county-level risk factors and the incidence of Cryptosporidium in the 2007 Iowa outbreak, we used generalized linear mixed models with the number of Cryptosporidium cases per county as the dependent variable. We employed a spatial power covariance structure, which assumed that the correlation between the numbers of cases in any two counties decreases as the distance between them increases. County population size was included in the model to adjust for population differences. Independent variables included the number of pools in specific pool categories (large, small, spa, wading, waterslide) and pool-owner classes (apartment, camp, country club or health club, hotel, municipal, school, other) as well as the proportion of residents aged <5 years. We found that increases in the number of bigger pools, pools with more heterogeneous mixing (municipal pools vs. country club or apartment pools), and pools catering to young children (wading pools) are associated with more cases at the county level.


Sign in / Sign up

Export Citation Format

Share Document