Sample size re-estimation without unblinding for normally distributed outcomes with unknown variance

A SEM-based approach using likelihood-based confidence interval (LBCI) has been proposed to form confidence intervals for unstandardized and standardized indirect effect in mediation models. However, when used with the maximum likelihood estimation, this approach requires that the variables are multivariate normally distributed. This can affect the LBCIs of unstandardized and standardized effect differently. In the present study, the robustness of this approach when the predictor is not normally distributed but the error terms are conditionally normal, which does not violate the distributional assumption of ordinary least squares (OLS) estimation, is compared to four other approaches: nonparametric bootstrapping, two variants of LBCI, LBCI assuming the predictor is fixed (LBCI-Fixed-X) and LBCI based on ADF estimation (LBCI-ADF), and Monte Carlo. A simulation study was conducted using a simple mediation model and a serial mediation model, manipulating the distribution of the predictor. The Monte Carlo method performed worst among the methods. LBCI and LBCI-Fixed-X had suboptimal performance when the distributions had high kurtosis and the population indirect effects were medium to large. In some conditions, the problem was severe even when the sample size was large. LBCI-ADF and nonparametric bootstrapping had coverage probabilities close to the nominal value in nearly all conditions, although the coverage probabilities were still suboptimal for the serial mediation model when the sample size was small with respect to the model. Implications of these findings in the context of this special case of nonnormal data were discussed.

Download Full-text

Blinded Sample Size Recalculation for Normally Distributed Outcomes Using Long- and Short-term Data

Biometrical Journal ◽

10.1002/bimj.200390060 ◽

2003 ◽

Vol 45 (8) ◽

pp. 915-930 ◽

Cited By ~ 7

Author(s):

K. Wüst ◽

M. Kieser

Keyword(s):

Sample Size ◽

Short Term ◽

Term Data ◽

Normally Distributed

Download Full-text

A comparative review of methods for comparing means using partially paired data

Statistical Methods in Medical Research ◽

10.1177/0962280215577111 ◽

2015 ◽

Vol 26 (3) ◽

pp. 1323-1340 ◽

Cited By ~ 22

Author(s):

Beibei Guo ◽

Ying Yuan

Keyword(s):

Missing Data ◽

Maximum Likelihood ◽

Sample Size ◽

T Test ◽

Type I ◽

Paired Data ◽

Modified Maximum Likelihood ◽

Comparative Review ◽

Paired T Test ◽

Normally Distributed

In medical experiments with the objective of testing the equality of two means, data are often partially paired by design or because of missing data. The partially paired data represent a combination of paired and unpaired observations. In this article, we review and compare nine methods for analyzing partially paired data, including the two-sample t-test, paired t-test, corrected z-test, weighted t-test, pooled t-test, optimal pooled t-test, multiple imputation method, mixed model approach, and the test based on a modified maximum likelihood estimate. We compare the performance of these methods through extensive simulation studies that cover a wide range of scenarios with different effect sizes, sample sizes, and correlations between the paired variables, as well as true underlying distributions. The simulation results suggest that when the sample size is moderate, the test based on the modified maximum likelihood estimator is generally superior to the other approaches when the data is normally distributed and the optimal pooled t-test performs the best when the data is not normally distributed, with well-controlled type I error rates and high statistical power; when the sample size is small, the optimal pooled t-test is to be recommended when both variables have missing data and the paired t-test is to be recommended when only one variable has missing data.

Download Full-text

An integrated selection formulation for the best normal mean: the unequal and unknown variance case

Journal of Applied Mathematics and Decision Sciences ◽

10.1155/s1173912602000020 ◽

2002 ◽

Vol 6 (1) ◽

pp. 23-42 ◽

Cited By ~ 1

Author(s):

Pinyuen Chen ◽

Jun-Lue Zhang

Keyword(s):

Sample Size ◽

Parameter Space ◽

Correct Selection ◽

Indifference Zone ◽

Unknown Variance ◽

Normal Mean ◽

Favorable Configuration ◽

Preference Zone ◽

Selected Subset ◽

Least Favorable Configuration

This paper considers an integrated formulation in selecting the best normal mean in the case of unequal and unknown variances. The formulation separates the parameter space into two disjoint parts, the preference zone (PZ) and the indifference zone (IZ). In the PZ we insist on selecting the best for a correct selection (CS1) but in the IZ we define any selected subset to be correct (CS2) if it contains the best population. We find the least favorable configuration (LFC) and the worst configuration (WC) respectively in PZ and IZ. We derive formulas for P(CS1|LFC), P(CS2|WC) and the bounds for the expected sample size E(N). We also give tables for the procedure parameters to implement the proposed procedure. An example is given to illustrate how to apply the procedure and how to use the table.

Download Full-text

The Distributional Behavior of Futures Price Spreads

Journal of Agricultural and Applied Economics ◽

10.1017/s1074070800027838 ◽

2000 ◽

Vol 32 (1) ◽

pp. 73-87

Author(s):

Min-Kyoung Kim ◽

Raymond M. Leuthold

Keyword(s):

Sample Size ◽

Temporal Aggregation ◽

Normal Distributions ◽

Time Period ◽

Futures Price ◽

Larger Sample ◽

Relative Spread ◽

Normally Distributed

AbstractThe distributional behavior of futures price spreads is examined for four commodities: corn, live cattle, gold and T-bonds. Remarkably different results are found over commodities, time period, and sample size. Actual spread changes for the smaller sample size of gold and T-bonds and for corn produce more normal distributions for weekly than for daily differencing intervals, while all live cattle spreads for actual changes are normally distributed. However, the larger sample size of both gold and T-bonds and the relative spread changes for corn and live cattle do not become more normally distributed under temporal aggregation of the data.

Download Full-text