- Use Student’s -distribution to estimate population mean intervals for smaller samples.
- Understand how the shape of Student’s -distribution corresponds to the sample size (which corresponds to a measure called the degrees of freedom).
Back in the early 1900’s, a chemist at a brewery in Ireland discovered that when he was working with very small samples, the distributions of the means differed significantly from the normal distribution. He noticed that as his sample sizes changed, the shape of the distribution changed as well. He published his results under the pseudonym ‘Student’, and this concept and the distributions for small sample sizes are now known as Student’s -distributions.
Hypothesis Testing with Small Populations and Sample Sizes
Student's -distributions are a family of distributions that, like the normal distribution, are symmetrical, bell-shaped, and centered on a mean. However, the distribution shape changes as the sample size changes. Therefore, there is a specific shape, or distribution, for every sample of a given size (see figure below; each distribution has a different value of , the number of degrees of freedom, which is 1 less than the size of the sample).
We use Student's -distributions in hypothesis testing the same way that we use the normal distribution. Each row in the -distribution table (see link below) represents a different -distribution, and each distribution is associated with a unique number of degrees of freedom (the number of observations minus one). The column headings in the table represent the portion of the area in the tails of the distribution. We use the numbers in the table just as we use -scores.
http://tinyurl.com/ygcc5g9 Follow this link to the Student’s -table.
As the number of observations gets larger, the -distribution approaches the shape of the normal distribution. In general, once the sample size is large enoughusually about 120we would use the normal distribution or a -table instead.
In calculating the -test statistic, we use the following formula:
is the test statistic and has degrees of freedom.
is the sample mean.
is the population mean under the null hypothesis.
is the sample standard deviation.
is the sample size.
is the estimated standard error.
Example: A high school athletic director is asked if football players are doing as well academically as the other student athletes at his school. We know from a previous study that the average GPA for the student athletes is 3.10 and that the standard deviation of the sample is 0.54. After an initiative to help improve the GPA of student athletes, the athletic director samples 20 football players and finds that their GPA is 3.18. Is there a significant improvement? Use a 0.05 significance level.
First, we establish our null and alternative hypotheses:
Next, we use our alpha level of 0.05 and the -distribution table to find our critical values. For a two-tailed test with 19 degrees of freedom and a 0.05 level of significance, our critical values are equal to .
Finally, in calculating the test statistic, we use the formula as shown:
This means that the observed sample mean of the GPA of football players of 3.18 is 0.66 standard errors above the hypothesized value of 3.10. Because the value of the test statistic is less than the critical value of 2.093, we fail to reject the null hypothesis.
Therefore, we can conclude that the difference between the sample mean and the hypothesized value is not sufficient to attribute it to anything other than sampling error. Thus, the athletic director can conclude that the mean academic performance of football players does not differ from the mean performance of other student athletes.
Example: The masses of newly-produced bus tokens are estimated to have a mean of 3.16 grams. A random sample of 11 tokens was removed from the production line, and the mean weight of the tokens was calculated to be 3.21 grams, with a standard deviation of 0.067. What is the value of the test statistic for a test to determine how the mean differs from the estimated mean?
The test statistic for this problem can be calculated as follows:
If the value of from the sample is between the tails of the distribution of constructed by assuming the null hypothesis is true, then the null hypothesis is, in fact, true. On the other hand, if the value of from the sample is way out in a tail of the -distribution, then there is evidence to reject the null hypothesis. When the distribution of is known, if the null hypothesis is true, the location of the value of on the distribution will be between the tails and outside the critical region. The most common method used to determine if this is the case is to find a -value (observed significance level). The -value is a probability that is computed with the assumption that the null hypothesis is true.
The -value for a two-sided test is the area under the -distribution with degrees of freedom of that lies above and below . This -value can be calculated by using technology.
Technology Note: Using the 'tcdf(' Command on the TI-83/84 Calculator to Calculate Probabilities Associated with the -Distribution
Press [2ND][DIST] and use the down arrow to select 'tcdf('. The syntax for this command is 'tcdf(lower bound, upper bound, degrees of freedom)'. This command will return the total area under both tails. To calculate the area under one tail, divide by 2 as shown below:
This means that there is only a 0.016 chance of getting a value of as large as or even larger than the one from this sample. The small -value tells us that the sample is inconsistent with the null hypothesis. Therefore, the population mean differs from the estimated mean of 3.16.
When the -value is close to zero, there is strong evidence against the null hypothesis. On the other hand, when the -value is large, the result from the sample is consistent with the estimated or hypothesized mean, and there is no evidence against the null hypothesis.
A visual picture of the -value can be obtained by using a graphing calculator as follows:
The spread of any -distribution is greater than that of a standard normal distribution. This is due to the fact that in the denominator of the formula, has been replaced with . Since is a random quantity changing with various samples, the variability in is greater, resulting in a larger spread.
Notice that in the first distribution graph shown above, the spread of the inner curve is small, but in the second graph, both distributions are basically overlapping and are roughly normal. This is due to the increase in the degrees of freedom.
To further illustrate this point, the -distributions for 1 and 12 degrees of freedom can be graphed on a graphing calculator. To do so, first press [Y=][2ND][DISTR], choose the 'tpdf(' command, enter 'X' and 1, separated by commas, and close the parentheses. Then go down to Y2 and repeat the process, this time entering 12 instead of 1. Finally, make sure your window is set correctly and press [GRAPH].
The -distributions for 1 and 12 degrees of freedom should look similar to the ones shown below (df denotes degrees of freedom):
Notice the difference in the two distributions. The one with 12 degrees of freedom approximates a normal curve.
The -distribution can be used with any statistic having a bell-shaped distribution. We already know that the Central Limit Theorem states that the sampling distribution of a statistic will be close to normal with a large enough sample size, but, in fact, the Central Limit Theorem predicts a roughly normal distribution under any of the following conditions:
- The population distribution is normal.
- The sampling distribution is symmetric and the sample size is .
- The sampling distribution is moderately skewed and the sample size is .
- The sample size is greater than 30, without outliers.
In addition to the fact that the -distribution can be used with any bell-shaped distribution, it also has some unique properties. These properties are as follows:
- The mean of the distribution equals zero.
- The population standard deviation is unknown.
- The variance is equal to the degrees of freedom divided by the degrees of freedom minus 2. This means that the degrees of freedom must be greater than two to avoid the expression being undefined.
- The variance is always greater than 1, although it approaches 1 as the degrees of freedom increase. This is due to the fact that as the degrees of freedom increase, the distribution is becoming more of a normal distribution.
- Although the -distribution is bell-shaped, the smaller sample sizes produce a flatter curve. The distribution is not as mound-shaped as a normal distribution, and the tails are thicker. As the sample size increases and approaches 30, the distribution approaches a normal distribution.
- The population is unimodal and symmetric.
Example: Duracell manufactures batteries that the CEO claims will last 300 hours under normal use. A researcher randomly selected 15 batteries from the production line and tested these batteries. The tested batteries had a mean life span of 290 hours, with a standard deviation of 50 hours. If the CEO’s claim were true, what is the probability that 15 randomly selected batteries would have a life span of no more than 290 hours?
Using a graphing calculator or a table, the cumulative probability is shown to be 0.226, which means that if the true life span of a battery were 300 hours, there is a 22.6% chance that the life span of the 15 tested batteries would be less than or equal to 290 hours. This is not a high enough level of confidence to reject the null hypothesis and count the discrepancy as significant.
Note: To find this answer using a graphing calculator, press [2ND][DISTR], select the 'tcdf(' command, enter , 0.7745967, and 14, separated by commas, and press [ENTER]. Subtract the result from 1, and then divide by 2.
Example: You have just taken ownership of a pizza shop. The previous owner told you that you would save money if you bought the mozzarella cheese in a 4.5-pound slab. Each time you purchase a slab of cheese, you weigh it to ensure that you are receiving 72 ounces of cheese. The results of 7 random measurements are 70, 69, 73, 68, 71, 69 and 71 ounces, respectively. Find the test statistic for this scenario.
Begin the problem by determining the mean of the sample and the sample standard deviation. This can be done using a graphing calculator. You should find that and . Now calculate the test statistic as follows:
Example: In the last example, the test statistic for testing that the mean weight of the cheese wasn’t 72 ounces was computed. Find and interpret the -value.
The test statistic computed in the last example was . Using technology, the -value is 0.0262. In other words, the probability that 7 random measurements would give a value of greater than 2.9315 or less than is about 0.0262.
Example: In the previous example, the -value for testing that the mean weight of cheese wasn’t 72 ounces was determined.
a) State the hypotheses.
b) Would the null hypothesis be rejected at the 10% level? The 5% level? The 1% level?
b) Because the -value of 0.0262 is less than both 0.10 and 0.05, the null hypothesis would be rejected at these levels. However, the -value is greater than 0.01, so the null hypothesis would not be rejected if this level of confidence was required.
A test of significance is done when a claim is made about the value of a population parameter. The test can only be conducted if the random sample taken from the population came from a distribution that is normal or approximately normal. When the sample size is small, you must use instead of to complete the significance test for a mean.
Points to Consider
- Is there a way to determine where the -statistic lies on a distribution?
- If a way does exist, what is the meaning of its placement?
For an explanation of the -distribution and an example using it (7.0)(17.0), see bionicturtledotcom, Student's t distribution (8:32).
- You intend to use simulation to construct an approximate -distribution with 8 degrees of freedom by taking random samples from a population with bowling scores that are normally distributed with mean, , and standard deviation, .
- Explain how you will do one run of this simulation.
- Produce four values of using this simulation.
- The dean from UCLA is concerned that the students’ grade point averages have changed dramatically in recent years. The graduating seniors’ mean GPA over the last five years is 2.75. The dean randomly samples 30 seniors from the last graduating class and finds that their mean GPA is 2.85, with a sample standard deviation of 0.65. Would a -distribution now be the appropriate sampling distribution for the mean? Why or why not?
- Using the appropriate -distribution, test the same null hypothesis with a sample of 30.
- With a sample size of 30, do you need to have a larger or smaller difference between the hypothesized population mean and the sample mean than with a sample size of 256 to obtain statistical significance? Explain your answer.