The Sampling Distribution of the Mean

2021 ◽  
pp. 111-140
Author(s):  
James A. Middleton
2013 ◽  
Vol 6 (4) ◽  
pp. 937-948 ◽  
Author(s):  
M. Toohey ◽  
T. von Clarmann

Abstract. Climatologies of atmospheric observations are often produced by binning measurements according to latitude and calculating zonal means. The uncertainty in these climatological means is characterised by the standard error of the mean (SEM). However, the usual estimator of the SEM, i.e., the sample standard deviation divided by the square root of the sample size, holds only for uncorrelated randomly sampled measurements. Measurements of the atmospheric state along a satellite orbit cannot always be considered as independent because (a) the time-space interval between two nearest observations is often smaller than the typical scale of variations in the atmospheric state, and (b) the regular time-space sampling pattern of a satellite instrument strongly deviates from random sampling. We have developed a numerical experiment where global chemical fields from a chemistry climate model are sampled according to real sampling patterns of satellite-borne instruments. As case studies, the model fields are sampled using sampling patterns of the Michelson Interferometer for Passive Atmospheric Sounding (MIPAS) and Atmospheric Chemistry Experiment Fourier-Transform Spectrometer (ACE-FTS) satellite instruments. Through an iterative subsampling technique, and by incorporating information on the random errors of the MIPAS and ACE-FTS measurements, we produce empirical estimates of the standard error of monthly mean zonal mean model O3 in 5° latitude bins. We find that generally the classic SEM estimator is a conservative estimate of the SEM, i.e., the empirical SEM is often less than or approximately equal to the classic estimate. Exceptions occur only when natural variability is larger than the random measurement error, and specifically in instances where the zonal sampling distribution shows non-uniformity with a similar zonal structure as variations in the sampled field, leading to maximum sensitivity to arbitrary phase shifts between the sample distribution and sampled field. The occurrence of such instances is thus very sensitive to slight changes in the sampling distribution, and to the variations in the measured field. This study highlights the need for caution in the interpretation of the oft-used classically computed SEM, and outlines a relatively simple methodology that can be used to assess one component of the uncertainty in monthly mean zonal mean climatologies produced from measurements from satellite-borne instruments.


2012 ◽  
Vol 151 ◽  
pp. 678-684
Author(s):  
Jian Mei Xu ◽  
Lun Bai

The sampling distribution of the coefficient of variation for a normal population is theoretically deduced, as well as its mean and variance. The conditions under which the mean and variance of the sampling distribution exist are studied, and the affecting factors on the sampling distribution shape are discussed.


1998 ◽  
Vol 25 (3) ◽  
pp. 192-195 ◽  
Author(s):  
Jennifer L. Dyck ◽  
Nancy R. Gee

In this article, we describe a hands-on, in-class demonstration using M&M's® candy to illustrate the concept of the sampling distribution of the mean. With the class serving as the population, each student receives a small package of M&M's. The instructor draws samples from the population and constructs an actual sampling distribution. Students in two statistics courses received either the M&M demonstration or a comparable demonstration using a textbook example. They took a quiz on their knowledge and rated their attitudes toward the demonstration. Results indicated that students who participated in the M&M demonstration answered more questions correctly on the quiz, believed they had learned more, enjoyed class more, and had fewer negative feelings toward the demonstration than those who received the textbook example demonstration.


Author(s):  
Ann E. Watkins ◽  
Anna Bargagliotti ◽  
Christine Franklin

2012 ◽  
Vol 44 (02) ◽  
pp. 429-451
Author(s):  
Hosam M. Mahmoud ◽  
Robert T. Smythe

The ‘coupon collection problem’ refers to a class of occupancy problems in which j identical items are distributed, independently and at random, to n cells, with no restrictions on multiple occupancy. Identifying the cells as coupons, a coupon is ‘collected’ if the cell is occupied by one or more of the distributed items; thus, some coupons may never be collected, whereas others may be collected once or twice or more. We call the number of coupons collected exactly r times coupons of type r. The coupon collection model we consider is general, in that a random number of purchases occurs at each stage of collecting a large number of coupons; the sample sizes at each stage are independent and identically distributed according to a sampling distribution. The joint behavior of the various types is an intricate problem. In fact, there is a variety of joint central limit theorems (and other limit laws) that arise according to the interrelation between the mean, variance, and range of the sampling distribution, and of course the phase (how far we are in the collection processes). According to an appropriate combination of the mean of the sampling distribution and the number of available coupons, the phase is sublinear, linear, or superlinear. In the sublinear phase, the normalization that produces a Gaussian limit law for uncollected coupons can be used to obtain a multivariate central limit law for at most two other types — depending on the rates of growth of the mean and variance of the sampling distribution, we may have a joint central limit theorem between types 0 and 1, or between types 0, 1, and 2. In the linear phase we have a multivariate central limit theorem among the types 0, 1,…, k for any fixed k.


Sign in / Sign up

Export Citation Format

Share Document