- Understand the Central Limit Theorem and calculate a sampling distribution using the mean and standard deviation of a normally distributed random variable.
- Understand the relationship between the Central Limit Theorem and the normal approximation of a sampling distribution.
In the previous lesson, you learned that sampling is an important tool for determining the characteristics of a population. Although the parameters of the population (mean, standard deviation, etc.) were unknown, random sampling was used to yield reliable estimates of these values. The estimates were plotted on graphs to provide a visual representation of the distribution of the sample means for various sample sizes. It is now time to define some properties of a sampling distribution of sample means and to examine what we can conclude about the entire population based on these properties.
Central Limit Theorem
The Central Limit Theorem is a very important theorem in statistics. It basically confirms what might be an intuitive truth to you: that as you increase the sample size for a random variable, the distribution of the sample means better approximates a normal distribution.
Before going any further, you should become familiar with (or reacquaint yourself with) the symbols that are commonly used when dealing with properties of the sampling distribution of sample means. These symbols are shown in the table below:
Sx¯ or σx¯
The Central Limit Theorem states the following:
Example: Suppose you wanted to answer the question, “What is the probability that a random sample of 20 families in Canada will have an average of 1.5 pets or fewer?” where the mean of the population is 0.8 and the standard deviation of the population is 1.2.
Using technology, a sketch of this problem is as follows:
The shaded area shows the probability that the sample mean is less than 1.5.
Therefore, the probability that the sample mean will be below 1.5 is 0.9937. In other words, with a random sample of 20 families, it is almost definite that the average number of pets per family will be less than 1.5.
The properties associated with the Central Limit Theorem are displayed in the diagram below:
The vertical axis now reads probability density, rather than frequency, since frequency can only be used when you are dealing with a finite number of sample means. Sampling distributions, on the other hand, are theoretical depictions of an infinite number of sample means, and probability density is the relative density of the selections from within this set.
Example: A random sample of size 40 is selected from a known population with a mean of 23.5 and a standard deviation of 4.3. Samples of the same size are repeatedly collected, allowing a sampling distribution of sample means to be drawn.
a) What is the expected shape of the resulting distribution?
b) Where is the sampling distribution of sample means centered?
c) What is the approximate standard deviation of the sample means?
The question indicates that multiple samples of size 40 are being collected from a known population, multiple sample means are being calculated, and then the sampling distribution of the sample means is being studied. Therefore, an understanding of the Central Limit Theorem is necessary to answer the question.
a) The sampling distribution of the sample means will be approximately bell-shaped.
b) The sampling distribution of the sample means will be centered about the population mean of 23.5.
c) The approximate standard deviation of the sample means is 0.68, which can be calculated as shown below:
a) What is the population mean?
b) Using technology, determine the mean of the sample means.
c) What is the population standard deviation?
d) Using technology, determine the standard deviation of the sample means.
e) As the sample size increases, what value will the mean of the sample means approach?
f) As the sample size increases, what value will the the standard deviation of the sample means approach?
On the Web
http://tinyurl.com/2f969wj Explore how the sample size and the number of samples affect the mean and standard deviation of the distribution of sample means.
Point to Consider
- How does sample size affect the variation in sample results?
For an explanation of the Central Limit Theorem (16.0), see Lutemann, The Central Limit Theorem, Part 1 of 2 (2:29).
For the second part of the explanation of the Central Limit Theorem (16.0), see Lutemann, The Central Limit Theorem, Part 2 of 2 (4:39).
For an example of using the Central Limit Theorem (9.0), see jsnider3675, Application of the Central Limit Theorem, Part 1 (5:44).
For the continuation of an example using the Central Limit Theorem (9.0), see jsnider3675, Application of the Central Limit Theorem, Part 2 (6:38).
- The lifetimes of a certain type of calculator battery are normally distributed. The mean lifetime is 400 days, with a standard deviation of 50 days. For a sample of 6000 new batteries, determine how many batteries will last:
- between 360 and 460 days.
- more than 320 days.
- less than 280 days.