To tune or not to tune, a case study of ridge logistic regression in small or sparse datasets

2021
Vol 21 (1)
Author(s): Hana Šinkovec, Georg Heinze, Rok Blagus, Angelika Geroldinger

Abstract
Background: For finite samples with binary outcomes, penalized logistic regression such as ridge logistic regression has the potential to achieve smaller mean squared errors (MSE) of coefficients and predictions than maximum likelihood estimation. There is evidence, however, that ridge logistic regression can result in highly variable calibration slopes in small or sparse data situations.
Methods: In this paper, we elaborate on this issue by performing a comprehensive simulation study, investigating the performance of ridge logistic regression in terms of coefficients and predictions and comparing it to Firth's correction, which has been shown to perform well in low-dimensional settings. In addition to tuned ridge regression, where the penalty strength is estimated from the data by minimizing some measure of the out-of-sample prediction error or an information criterion, we also considered ridge regression with a pre-specified degree of shrinkage. We included 'oracle' models in the simulation study, in which the complexity parameter was chosen based on the true event probabilities (prediction oracle) or the true regression coefficients (explanation oracle), to demonstrate the capability of ridge regression if the truth were known.
Results: The performance of ridge regression strongly depends on the choice of complexity parameter. As shown in our simulation and illustrated by a data example, values optimized in small or sparse datasets are negatively correlated with the optimal values and suffer from substantial variability, which translates into large MSE of coefficients and large variability of calibration slopes. In contrast, in our simulations, pre-specifying the degree of shrinkage prior to fitting led to accurate coefficients and predictions even in non-ideal settings such as those encountered with rare outcomes or sparse predictors.
Conclusions: Applying tuned ridge regression in small or sparse datasets is problematic, as it results in unstable coefficients and predictions. In contrast, determining the degree of shrinkage according to some meaningful prior assumptions about the true effects has the potential to reduce bias and stabilize the estimates.
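The contrast the abstract draws between data-driven tuning and pre-specified shrinkage can be sketched in a few lines. The following is an illustrative sketch only, not the authors' code: the simulated dataset, the penalty grid, and the fixed penalty value C=0.1 are hypothetical choices (in scikit-learn, C is the inverse of the ridge penalty strength).

```python
# Illustrative sketch (not the authors' code): tuned vs. pre-specified ridge
# penalties for logistic regression on a small, sparse binary dataset.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression, LogisticRegressionCV

# Hypothetical small/sparse setting: 50 observations, 10 predictors, rare outcome.
X, y = make_classification(n_samples=50, n_features=10, n_informative=3,
                           weights=[0.9, 0.1], random_state=42)

# Tuned ridge: penalty strength chosen by cross-validated deviance (log loss).
# In small samples this tuning step is itself highly variable.
tuned = LogisticRegressionCV(Cs=20, cv=5, penalty="l2",
                             scoring="neg_log_loss", max_iter=5000)
tuned.fit(X, y)

# Pre-specified shrinkage: a fixed penalty chosen from prior assumptions about
# plausible effect sizes, decided before looking at the data.
fixed = LogisticRegression(penalty="l2", C=0.1, max_iter=5000)
fixed.fit(X, y)

print("tuned C:", tuned.C_[0])
print("tuned coefficients:", tuned.coef_.round(2))
print("fixed-penalty coefficients:", fixed.coef_.round(2))
```

Rerunning the sketch with different random seeds makes the instability of the tuned penalty visible, which is the behaviour the simulation study quantifies.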

Author(s): Jochen Juskowiak, Bernd Bertsche

Different Weibull lifetime models are presented whose scale, shape and minimum lifetime parameters are stress-dependent. This allows the lifetime of products following a Weibull distribution to be described and predicted more accurately wherever stress dependence applies to the failure mechanism, as is the case, for instance, for failures due to fatigue, on which this paper focuses. The proposed procedure encompasses a two-step maximum likelihood estimation and a Fisher matrix (FM) confidence bounds calculation, followed by a model evaluation. This model evaluation is conducted by means of a general plausibility check (PC), a likelihood ratio test (LRT) and the Bayesian information criterion (BIC). Their applicability to accelerated life test data is discussed and validated using test data. Finally, a simulation study confirms a wide range of applicability.
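To make the idea of stress-dependent Weibull parameters concrete, here is a minimal sketch of maximum likelihood estimation for a two-parameter Weibull whose scale follows an inverse power law in stress. The link function, the parameter values, and the simulated accelerated-life-test data are all assumptions for illustration, not the models proposed in the paper.

```python
# Illustrative sketch (assumed model): MLE of a Weibull whose scale is
# stress-dependent via an inverse power law, eta(s) = exp(a - b * log(s)).
import numpy as np
from scipy.optimize import minimize

def neg_log_lik(params, t, s):
    a, b, log_beta = params
    beta = np.exp(log_beta)              # shape parameter, kept positive
    eta = np.exp(a - b * np.log(s))      # stress-dependent scale
    z = t / eta
    # Weibull log-density: log(beta/eta) + (beta-1)*log(z) - z**beta
    return -np.sum(np.log(beta / eta) + (beta - 1) * np.log(z) - z**beta)

# Hypothetical accelerated-life-test data: failure times at three stress levels.
rng = np.random.default_rng(1)
stress = np.repeat([1.0, 1.5, 2.0], 20)
true_eta = np.exp(6.0 - 2.0 * np.log(stress))
times = true_eta * rng.weibull(2.5, size=stress.size)

fit = minimize(neg_log_lik, x0=[5.0, 1.0, 0.5], args=(times, stress),
               method="Nelder-Mead")
a_hat, b_hat, beta_hat = fit.x[0], fit.x[1], np.exp(fit.x[2])
print("scale-link estimates:", a_hat, b_hat, "shape estimate:", beta_hat)
```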


2021
pp. 263208432199622
Author(s): Tim Mathes, Oliver Kuss

Background: Meta-analysis of systematically reviewed studies on interventions is the cornerstone of evidence-based medicine. In the following, we introduce the common-beta beta-binomial (BB) model for meta-analysis with binary outcomes and elucidate its equivalence to panel count data models.
Methods: We present a variation of the standard "common-rho" BB model (BBST), namely a "common-beta" BB model. This model has an interesting connection to fixed-effect negative binomial regression models (FE-NegBin) for panel count data. Using this equivalence, it is possible to estimate an extension of the FE-NegBin with an additional multiplicative overdispersion term (RE-NegBin) while preserving a closed-form likelihood. An advantage of the connection to econometric models is that the models can be implemented easily, because "standard" statistical software for panel count data can be used. We illustrate the methods with two real-world example datasets. Furthermore, we show the results of a small-scale simulation study that compares the new models to the BBST; the input parameters of the simulation were informed by actually performed meta-analyses.
Results: In both example datasets, the NegBin models, in particular the RE-NegBin, showed a smaller effect and had narrower 95% confidence intervals. In our simulation study, median bias was negligible for all methods, but the upper quartile of median bias suggested that the BBST is most affected by positive bias. Regarding coverage probability, the BBST and the RE-NegBin model outperformed the FE-NegBin model.
Conclusion: For meta-analyses with binary outcomes, the considered common-beta BB models may be valuable extensions to the family of BB models.
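The practical point that "standard" panel-count software suffices can be illustrated as follows. This is a rough sketch under simplifying assumptions, not the authors' FE-NegBin or RE-NegBin estimators: arm-level event counts are treated as counts with study indicators and log group size as an offset, and the toy dataset is entirely hypothetical.

```python
# Illustrative sketch (an approximation, not the paper's exact estimators):
# arm-level meta-analysis counts fitted with a negative binomial regression
# using study dummies and log group size as offset.
import numpy as np
import pandas as pd
import statsmodels.api as sm

data = pd.DataFrame({
    "study":     ["s1", "s1", "s2", "s2", "s3", "s3", "s4", "s4", "s5", "s5"],
    "treatment": [1, 0, 1, 0, 1, 0, 1, 0, 1, 0],
    "events":    [5, 9, 2, 6, 11, 15, 4, 7, 8, 13],
    "n":         [50, 48, 30, 32, 100, 98, 40, 41, 75, 77],
})

# Design matrix: treatment effect plus study indicators (fixed effects).
X = pd.get_dummies(data[["treatment", "study"]], columns=["study"],
                   drop_first=True).astype(float)
X = sm.add_constant(X)

model = sm.NegativeBinomial(data["events"], X, offset=np.log(data["n"]))
fit = model.fit(disp=False)

# exp(treatment coefficient) is interpretable as a risk (rate) ratio.
print(fit.summary())
print("treatment rate ratio:", np.exp(fit.params["treatment"]))
```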


Author(s): Duha Hamed, Ahmad Alzaghal

Abstract: A new generalized class of Lindley distributions is introduced in this paper. This new class, called the T-Lindley{Y} class of distributions, is generated using the quantile functions of the uniform, exponential, Weibull, log-logistic, logistic and Cauchy distributions. Statistical properties including the modes, moments and Shannon's entropy are discussed. Three new generalized Lindley distributions are investigated in more detail. The unknown parameters are estimated by maximum likelihood, and a simulation study was carried out. Lastly, the usefulness of this new proposed class in fitting lifetime data is illustrated using four different data sets. In the application section, the strength of members of the T-Lindley{Y} class in modeling both unimodal and bimodal data sets is presented; a member of the T-Lindley{Y} class of distributions outperformed other known distributions in modeling unimodal and bimodal lifetime data sets.
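A quantile-function construction of this kind lends itself to simulation by composing the pieces. The sketch below assumes the general T-R{Y} recipe (draw T, map it through the Y distribution function, then apply the Lindley quantile); the particular Weibull-Lindley{exponential} member, its parameter values, and the numerical inversion of the Lindley CDF are illustrative assumptions, not necessarily one of the three members studied in the paper.

```python
# Illustrative sketch: simulate from a hypothetical Weibull-Lindley{exponential}
# member via X = Q_Lindley(F_exp(T)) with T ~ Weibull. The Lindley quantile is
# obtained by numerically inverting its CDF,
# F(x) = 1 - (1 + theta*x/(1+theta)) * exp(-theta*x).
import numpy as np
from scipy.optimize import brentq

def lindley_cdf(x, theta):
    return 1.0 - (1.0 + theta * x / (1.0 + theta)) * np.exp(-theta * x)

def lindley_quantile(p, theta):
    # invert the CDF on a comfortably wide bracket
    return brentq(lambda x: lindley_cdf(x, theta) - p, 1e-12, 1e3)

def rvs_weibull_lindley_exp(size, theta, weib_shape, rng):
    t = rng.weibull(weib_shape, size=size)     # T ~ Weibull(shape)
    u = 1.0 - np.exp(-t)                       # F_Y(T) with Y ~ Exp(1)
    return np.array([lindley_quantile(p, theta) for p in u])

rng = np.random.default_rng(0)
sample = rvs_weibull_lindley_exp(1000, theta=1.5, weib_shape=2.0, rng=rng)
print("sample mean:", sample.mean(), "sample sd:", sample.std())
```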


2021
pp. 174077452110101
Author(s): Jennifer Proper, John Connett, Thomas Murray

Background: Bayesian response-adaptive designs, which adaptively alter the allocation ratio in favor of the better-performing treatment, are often criticized for engendering a non-trivial probability of a subject imbalance in favor of the inferior treatment, inflating the type I error rate, and increasing sample size requirements. Implementations of these designs using Thompson sampling have generally assumed a simple beta-binomial probability model in the literature; however, the effect of these choices on the resulting design operating characteristics relative to other reasonable alternatives has not been fully examined. Motivated by the Advanced REperfusion STrategies for Refractory Cardiac Arrest trial, we posit that a logistic probability model coupled with an urn or permuted block randomization method will alleviate some of the practical limitations engendered by the conventional implementation of a two-arm Bayesian response-adaptive design with binary outcomes. In this article, we discuss to what extent this solution works and when it does not.
Methods: A computer simulation study was performed to evaluate the relative merits of a Bayesian response-adaptive design for the Advanced REperfusion STrategies for Refractory Cardiac Arrest trial using Thompson sampling based on a logistic regression probability model coupled with either an urn or permuted block randomization method that limits deviations from the evolving target allocation ratio. The different implementations of the response-adaptive design were evaluated for type I error rate control across various null response rates and for power, among other performance metrics.
Results: The logistic regression probability model engenders smaller average sample sizes with similar power, better control over the type I error rate, and more favorable treatment arm sample size distributions than the conventional beta-binomial probability model, and designs using the alternative randomization methods have a negligible chance of a sample size imbalance in the wrong direction.
Conclusion: Pairing the logistic regression probability model with either of the alternative randomization methods results in a much improved response-adaptive design with regard to important operating characteristics, including type I error rate control and the risk of a sample size imbalance in favor of the inferior treatment.
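For readers unfamiliar with Thompson sampling, the conventional beta-binomial implementation that the abstract uses as its comparator can be sketched briefly. This is a generic illustration, not the trial's actual algorithm: the Beta(1, 1) priors, the tempering exponent c, and the interim counts are hypothetical.

```python
# Illustrative sketch (not the trial's algorithm): Thompson sampling allocation
# for a two-arm binary-outcome trial under a simple beta-binomial model. The
# target allocation to arm 1 is P(p1 > p0 | data), often tempered by a power c.
import numpy as np

def thompson_allocation(events, totals, c=0.5, n_draws=100_000, rng=None):
    if rng is None:
        rng = np.random.default_rng(0)
    # Beta(1, 1) priors updated with observed successes/failures per arm
    p0 = rng.beta(1 + events[0], 1 + totals[0] - events[0], n_draws)
    p1 = rng.beta(1 + events[1], 1 + totals[1] - events[1], n_draws)
    prob_1_better = np.mean(p1 > p0)
    # c < 1 tempers the adaptation; c = 0 recovers fixed 1:1 randomization
    w1 = prob_1_better**c
    w0 = (1.0 - prob_1_better)**c
    return w1 / (w0 + w1)          # probability of randomizing to arm 1

# Hypothetical interim data: 12/40 responses on arm 0, 18/40 on arm 1.
print(thompson_allocation(events=[12, 18], totals=[40, 40]))
```

The designs studied in the paper replace the beta-binomial posterior with a logistic regression model and constrain how far realized assignments may drift from this evolving target via urn or permuted block randomization.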


2005
Vol 01 (01)
pp. 129-145
Author(s): Xiaobo Zhou, Xiaodong Wang, Edward R. Dougherty

In microarray-based cancer classification, gene selection is an important issue owing to the large number of variables (gene expressions) and the small number of experimental conditions. Many gene-selection and classification methods have been proposed; however, most of these treat gene selection and classification separately, and not under the same model. We propose a Bayesian approach to gene selection using the logistic regression model. The Akaike information criterion (AIC), the Bayesian information criterion (BIC) and the minimum description length (MDL) principle are used in constructing the posterior distribution of the chosen genes. The same logistic regression model is then used for cancer classification. Fast implementation issues for these methods are discussed. The proposed methods are tested on several data sets, including those arising from hereditary breast cancer, small round blue-cell tumors, lymphoma, and acute leukemia. The experimental results indicate that the proposed methods achieve high classification accuracies on these data sets. Some robustness and sensitivity properties of the proposed methods are also discussed. Finally, mixing logistic-regression-based gene selection with other classification methods and mixing logistic-regression-based classification with other gene-selection methods are considered.
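The role of an information criterion in ranking gene subsets can be shown with a small example. This is an illustrative sketch only, not the authors' Bayesian sampler: the synthetic expression matrix, the exhaustive search over two-gene subsets, and the use of BIC alone are simplifying assumptions.

```python
# Illustrative sketch: score candidate gene subsets for a binary classification
# problem by the BIC of a logistic regression fit, one of the criteria the
# paper uses to construct the posterior over gene sets.
import itertools
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
n_samples, n_genes = 60, 8
X = rng.normal(size=(n_samples, n_genes))                  # toy expression matrix
logit_true = 1.5 * X[:, 0] - 2.0 * X[:, 3]                 # two truly relevant genes
y = rng.binomial(1, 1.0 / (1.0 + np.exp(-logit_true)))

def bic_of_subset(genes):
    design = sm.add_constant(X[:, list(genes)])
    fit = sm.Logit(y, design).fit(disp=False)
    return fit.bic

# Exhaustively score all two-gene subsets; the paper's approach avoids this
# exhaustive search, which is infeasible for thousands of genes.
scores = {g: bic_of_subset(g) for g in itertools.combinations(range(n_genes), 2)}
best = min(scores, key=scores.get)
print("best two-gene subset by BIC:", best, "BIC:", round(scores[best], 2))
```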


2018
Vol 28 (12)
pp. 3534-3549
Author(s): Arman Alam Siddique, Mireille E Schnitzer, Asma Bahamyirou, Guanbo Wang, Timothy H Holtz, ...

This paper investigates different approaches for causal estimation under multiple concurrent medications. Our parameter of interest is the marginal mean counterfactual outcome under different combinations of medications. We explore parametric and non-parametric methods to estimate the generalized propensity score. We then apply three causal estimation approaches (inverse probability of treatment weighting, propensity score adjustment, and targeted maximum likelihood estimation) to estimate the causal parameter of interest. Focusing on the estimation of the expected outcome under the most prevalent regimens, we compare the results obtained using these methods in a simulation study with four potentially concurrent medications. We perform a second simulation study in which some combinations of medications may occur rarely or not occur at all in the dataset. Finally, we apply the methods explored to contrast the probability of patient treatment success for the most prevalent regimens of antimicrobial agents for patients with multidrug-resistant pulmonary tuberculosis.
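Of the three estimation approaches mentioned, inverse probability of treatment weighting with a generalized propensity score is the most direct to sketch. The example below is hypothetical and not the paper's analysis: the regimen labels, covariates, outcome model, and the choice of a multinomial logistic regression for the generalized propensity score are all illustrative assumptions.

```python
# Illustrative sketch (not the paper's analysis): generalized propensity score
# via multinomial logistic regression, then IPTW estimates of the marginal
# mean outcome under each medication regimen.
import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
n = 2000
covs = pd.DataFrame({"age": rng.normal(50, 10, n),
                     "severity": rng.normal(0, 1, n)})
regimen = rng.choice(["A", "A+B", "A+C"], size=n, p=[0.5, 0.3, 0.2])
outcome = rng.binomial(1, 0.3 + 0.1 * (regimen == "A+B"))   # treatment success

# Generalized propensity score: P(regimen | covariates)
gps_model = LogisticRegression(max_iter=1000).fit(covs, regimen)
gps = gps_model.predict_proba(covs)                # n x (number of regimens)
classes = list(gps_model.classes_)

# IPTW estimate of the marginal mean outcome under each regimen
for r in classes:
    treated = regimen == r
    weights = 1.0 / gps[treated, classes.index(r)]
    est = np.sum(weights * outcome[treated]) / np.sum(weights)
    print(f"estimated marginal success probability under {r}: {est:.3f}")
```

When some regimens are rare, as in the paper's second simulation study, the estimated propensities in the denominator become small and the weights unstable, which is precisely the setting in which the compared estimators diverge.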

