An Explained Variation Measure for Ordinal Response Models With Comparisons to Other Ordinal R² Measures

2006 ◽  
Vol 34 (4) ◽  
pp. 469-520 ◽  
Author(s):  
Michael G. Lacy
Author(s):  
Lacramioara Balan ◽  
Rajesh Paleti

Traditional crash databases that record police-reported injury severity data are prone to misclassification errors. Ignoring these errors in discrete ordered response models used for analyzing injury severity can lead to biased and inconsistent parameter estimates. In this study, a mixed generalized ordered response (MGOR) model that quantifies misclassification rates in the injury severity variable and adjusts the bias in parameter estimates associated with misclassification was developed. The proposed model does this by considering the observed injury severity outcome as a realization from a discrete random variable that depends on true latent injury severity that is unobservable to the analyst. The model was used to analyze misclassification rates in police-reported injury severity in the 2014 General Estimates System (GES) data. The model found that only 68.23% and 62.75% of possible and non-incapacitating injuries were correctly recorded in the GES data. Moreover, comparative analysis with the MGOR model that ignores misclassification not only has lower data fit but also considerable bias in both the parameter and elasticity estimates. The model developed in this study can be used to analyze misclassification errors in ordinal response variables in other empirical contexts.


2017 ◽  
Vol 11 (2) ◽  
pp. 3407-3445 ◽  
Author(s):  
Maria Iannario ◽  
Anna Clara Monti ◽  
Domenico Piccolo ◽  
Elvezio Ronchetti

2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Yiran Zhang ◽  
Kellie J. Archer

Abstract Background Acute myeloid leukemia (AML) is a heterogeneous cancer of the blood, though specific recurring cytogenetic abnormalities in AML are strongly associated with attaining complete response after induction chemotherapy, remission duration, and survival. Therefore recurring cytogenetic abnormalities have been used to segregate patients into favorable, intermediate, and adverse prognostic risk groups. However, it is unclear how expression of genes is associated with these prognostic risk groups. We postulate that expression of genes monotonically associated with these prognostic risk groups may yield important insights into leukemogenesis. Therefore, in this paper we propose penalized Bayesian ordinal response models to predict prognostic risk group using gene expression data. We consider a double exponential prior, a spike-and-slab normal prior, a spike-and-slab double exponential prior, and a regression-based approach with variable inclusion indicators for modeling our high-dimensional ordinal response, prognostic risk group, and identify genes through hypothesis tests using Bayes factor. Results Gene expression was ascertained using Affymetrix HG-U133Plus2.0 GeneChips for 97 favorable, 259 intermediate, and 97 adverse risk AML patients. When applying our penalized Bayesian ordinal response models, genes identified for model inclusion were consistent among the four different models. Additionally, the genes included in the models were biologically plausible, as most have been previously associated with either AML or other types of cancer. Conclusion These findings demonstrate that our proposed penalized Bayesian ordinal response models are useful for performing variable selection for high-dimensional genomic data and have the potential to identify genes relevantly associated with an ordinal phenotype.


2014 ◽  
Vol 13 ◽  
pp. CIN.S20806 ◽  
Author(s):  
Kellie J. Archer ◽  
Jiayi Hou ◽  
Qing Zhou ◽  
Kyle Ferber ◽  
John G. Layne ◽  
...  

High-throughput genomic assays are performed using tissue samples with the goal of classifying the samples as normal < pre-malignant < malignant or by stage of cancer using a small set of molecular features. In such cases, molecular features monotonically associated with the ordinal response may be important to disease development; that is, an increase in the phenotypic level (stage of cancer) may be mechanistically linked through a monotonic association with gene expression or methylation levels. Though traditional ordinal response modeling methods exist, they assume independence among the predictor variables and require the number of samples ( n) to exceed the number of covariates ( P) included in the model. In this paper, we describe our ordinalgmifs R package, available from the Comprehensive R Archive Network, which can fit a variety of ordinal response models when the number of predictors ( P) exceeds the sample size ( n). R code illustrating usage is also provided.


2021 ◽  
Author(s):  
Yiran Zhang ◽  
Kellie J. Archer

Abstract Background: Acute myeloid leukemia (AML) is a heterogeneous cancer of the blood, though specific recurring cytogenetic abnormalities in AML strongly are associated with attaining complete response after induction chemotherapy, remission duration, and survival. Therefore recurring cytogenetic abnormalities have been used to segregate patients into favorable, intermediate, and adverse prognostic risk groups. However, it is unclear how expression of genes is associated with these prognostic risk groups. We postulate that expression of genes monotonically associated with these prognostic risk groups may yield important insights into leukemogenesis. Therefore, in this paper we propose penalized Bayesian ordinal response models to predict prognostic risk group using gene expression data. We consider a double exponential prior, a spike-and-slab normal prior, a spike-and-slab double exponential prior, and a regression-based approach with variable inclusion indicators for modeling our high-dimensional ordinal response, prognostic risk group, and identify genes through hypothesis tests using Bayes Factor. Results: Gene expression was ascertained using Affymetrix HG-U133Plus2.0 GeneChips for 97 favorable, 259 intermediate, and 97 adverse risk AML patients. When applying our penalized Bayesian ordinal response models, genes identified for model inclusion were consistent among the four different models. Additionally, the genes included in the models were biologically plausible, as most have been previously associated with either AML or other types of cancer. Conclusion: These findings demonstrate that our proposed penalized Bayesian ordinal response models are useful for performing variable selection for high-dimensional genomic data and have the potential to identify genes relevantly associated with an ordinal phenotype.


2003 ◽  
Vol 28 (1) ◽  
pp. 31-44 ◽  
Author(s):  
Leonardo Grilli ◽  
Carla Rampichini

Multivariate multilevel models for ordinal variables are quite complex with respect to both interpretation and estimation. The specification in terms of a multivariate latent distribution and a set of thresholds helps in the interpretation of the variance-covariance parameters. However, most existing estimation algorithms for multilevel models can be used only if the model is reparameterized as a univariate model with an additional dummy bottom level. Moreover, the univariate formulation allows the model to be cast in the framework of Generalized Linear Latent and Mixed Models ( Rabe-Hesketh, Pickles, & Skrondal, 2001a ), a rather general class that includes, as special cases, structural equations and factor models. This article outlines the multivariate latent distribution specification and the corresponding interpretation issues; it then shows the univariate formulation, along with some alternative parameterizations that are useful in the estimation phase. An application to student ratings data illustrates the interpretation of the parameters and the estimation procedures, with a discussion of some computational issues.


Sign in / Sign up

Export Citation Format

Share Document