5.3: Applications of the Normal Distribution
Learning Objectives
 Apply the characteristics of the normal distribution to solving problems.
Introduction
The normal distribution is the foundation for statistical inference and will be an essential part of many of those topics in later chapters. In the meantime, this section will cover some of the types of questions that can be answered using the properties of a normal distribution. The first examples deal with more theoretical questions that will help you master basic understandings and computational skills, while the later problems will provide examples with real data, or at least a real context.
Unknown Value Problems
If you truly understand the relationship between the area under a density curve and the mean, standard deviation, and \begin{align*}z\end{align*}score, you should be able to solve problems in which you are provided all but one of these values and are asked to calculate the remaining value. While perhaps not directly practical, it is the thorough understanding of these calculations that will lead to a high degree of comfort when a more relevant context is provided.
In the last lesson we found the probability, or area under the density curve. What if you are asked to find a value that gives a particular probability?
Example:
Given a normally distributed random variable \begin{align*}x\end{align*} with \begin{align*}\mu = 35\end{align*} and \begin{align*}\sigma = 7.4\end{align*}, what is the value of \begin{align*}x\end{align*} where the probability of experiencing a value less than that is \begin{align*}80\%\end{align*}?
Solution:
As suggested before, it is important and helpful to sketch the distribution.
If we had to estimate an actual value first, we know from the empirical rule that about \begin{align*}84\%\end{align*} of the data is below one standard deviation to the right of the mean.
\begin{align*}\mu + 1\ \ \sigma = 35 + 7.4 = 42.4\end{align*}
We expect the answer to be slightly below this value.
When we were given a value of the variable and were asked to find the percentage or probability, we used the \begin{align*}z\end{align*}table or a normalcdf command. But how do we find a value given the percentage? Again, the table has its limitations in this case and graphing calculators or computer software are much more convenient and accurate. The command on the TI83/84 calculator is invNorm. You may have seen it already in the distribution menu.
The syntax for this command is:
InvNorm (percentage or probability to the left, mean, standard deviation)
Enter the values in the correct order:
Unknown Mean or Standard Deviation
Example:
For a normally distributed random variable, \begin{align*}\sigma= 4.5, x = 20\end{align*}, and \begin{align*}p = .05\end{align*}, Estimate \begin{align*}\mu\end{align*}
Solution:
First draw a sketch:
Remember that about \begin{align*}95\%\end{align*} of the data is within \begin{align*}2\end{align*} standard deviations of the mean. This would leave \begin{align*}2.5\%\end{align*} of the data in the lower tail, so our \begin{align*}5\%\end{align*} value must be less than \begin{align*}9 \;\mathrm{units}\end{align*} from the mean.
Because we do not know the mean, we have to use the standard normal curve and calculate a \begin{align*}z\end{align*}score using the invNorm command. The result \begin{align*}(1.645)\end{align*} confirms the prediction that the value should be less than \begin{align*}2\end{align*} standard deviations from the mean.
In one of the few instances in beginning statistics that we use algebra, plug in the known quantities into the \begin{align*}z\end{align*}score formula:
\begin{align*}z &= \frac{x  \mu} {\sigma}\\ 1.645 & \approx \frac{20  \mu} {4.5}\\ 1.645 * 4.5 & \approx 20  \mu\\ 7.402  20 & \approx  \mu\\ 27.402 & \approx  \mu\\ \mu & \approx 27.402\end{align*}
Example:
For a normally distributed random variable, \begin{align*}\mu = 83, x = 94\end{align*}, and \begin{align*}p = .90\end{align*}, find \begin{align*}\sigma\end{align*}.
Solution:
Again, let’s first look at a sketch of the distribution.
Since about \begin{align*}97.5\%\end{align*} of the data is below \begin{align*}2\end{align*} standard deviations, it seems reasonable to estimate that the \begin{align*}x\end{align*}value is less than two standard deviations away and \begin{align*}\sigma\end{align*} might be around \begin{align*}7\end{align*} or \begin{align*}8\end{align*}.
Again, use invNorm to calculate the \begin{align*}z\end{align*}score. Remember that we are not entering a mean or standard deviation, so the result is from \begin{align*}\mu = 0\end{align*} and \begin{align*}\sigma = 1\end{align*}.
Use the \begin{align*}z\end{align*}score formula and solve for \begin{align*}\sigma\end{align*}:
\begin{align*}z & = \frac{x  \mu} {\sigma}\\ 1.282 & \approx \frac{94  83} {\sigma}\\ \sigma & \approx \frac{11} {1.282}\\ \sigma & \approx 8.583\end{align*}
Drawing a Distribution on the Calculator
As you saw in Lesson 1 of this chapter, the TI83/84 will also draw the distribution for you. But before doing that, we need to set an appropriate window (see screen below) and delete or turn off any functions or plots. Let’s use the last example and draw the shaded region of the normal curve with \begin{align*}\mu = 83\end{align*} and \begin{align*}\sigma = 8.583\end{align*} below \begin{align*}94\end{align*}. Remember from the empirical rule that we probably want to show about \begin{align*}3\end{align*} standard deviations away from \begin{align*}83\end{align*} in either direction. If we use \begin{align*}9\end{align*} as an estimate for \begin{align*}\sigma\end{align*}, then we should open our window \begin{align*}27 \;\mathrm{units}\end{align*} above and below \begin{align*}83\end{align*}. The \begin{align*}y\end{align*}settings can be a bit tricky, but with a little practice you will get used to determining the maximum percentage of area near the mean.
The reason that we went below the \begin{align*}x\end{align*}axis is to leave room for the text as you will see.
Now press [2nd] [DISTR]> and arrow over to the Draw option.
Choose the ShadeNorm command. You enter the values just as if you were doing a normalcdf calculation:
ShadeNorm(lower bound, upper bound, mean, standard deviation)
Press [ENTER] to see the result.
Normalpdf on the Calculator
You may have noticed that the first option in the distribution menu is Normalpdf, which stands for a normal probability density function. It is the option you used in lesson 5.1 to draw the graph of the normal distribution. Many students wonder what this function is for and occasionally even use it by mistake to calculate what they think are cumulative probabilities. This function is actually the mathematical formula for drawing the normal distribution. You can find this formula in the resources at the end of the lesson if you are interested. The numbers this formula returns are not really useful to us statistically. The primary useful purpose for this function is to draw the normal curve.
As you did in Lesson 5.1, plot Y1=Normalpdf with the window shown below. Be sure to turn off any plots and clear out any functions. Enter \begin{align*}x\end{align*} and close the parentheses. Because we did not specify a mean and standard deviation, we will draw the standard normal curve. Enter the window settings necessary to fit most of the curve on the screen as shown below (think about the empirical rule to help with this).
Normal Distributions with Real Data
The foundation of collecting surveys, samples, and experiments is most often based on the normal distribution as you will learn in later chapters. Here are two examples.
Example:
The Information Centre of the National Health Service in Britain collects and publishes a great deal of information and statistics on health issues affecting the population. One such comprehensive data set tracks information about the health of children\begin{align*}^1\end{align*}. According to their statistics, in 2006 the mean height of \begin{align*}12\end{align*} yearold boys was \begin{align*}152.9 \;\mathrm{cm}\end{align*} with a standard deviation estimate of approximately \begin{align*}8.5 \;\mathrm{cm}\end{align*} (these are not the exact figures for the population and in later chapters we will learn how they are calculated and how accurate they may be, but for now we will assume that they are a reasonable estimate of the true parameters).
Part 1 If \begin{align*}12\;\mathrm{year}\end{align*} old Cecil is \begin{align*}158 \;\mathrm{cm}\end{align*}, approximately what percentage of all \begin{align*}12\end{align*} yearold boys in Britain is he taller than?
We first must assume that the height of \begin{align*}12\end{align*} yearold boys in Britain is normally distributed. This seems a reasonable assumption to make. As always, the first step should be to draw a sketch and estimate a reasonable answer prior to calculating the percentage. In this case, let’s use the calculator to sketch the distribution and the shading. First decide on an appropriate window that includes about \begin{align*}3\end{align*} standard deviations on either side of the mean. In this case, \begin{align*}3\end{align*} standard deviations is about \begin{align*}25.5 \;\mathrm{cm}\end{align*}, so add and subtract that value to/from the mean to find the horizontal extremes. Then enter the appropriate ShadeNorm command.
From this data, we would estimate Cecil is taller than \begin{align*}73\%\end{align*} of \begin{align*}12\end{align*} yearold boys. We could also phrase this answer as follows: the probability of a randomly selected British \begin{align*}12\end{align*} yearold boy being shorter than Cecil is \begin{align*}0.73\end{align*}. Often with data like this we use percentiles. We would say Cecil is in the \begin{align*}73^{th}\end{align*} percentile for height among \begin{align*}12\end{align*} yearold boys in Britain.
Part 2 How tall would Cecil need to be to be in the top \begin{align*}1\%\end{align*} of all \begin{align*}12\end{align*} yearold boys in Britain?
Here is a sketch:
In this case we are given the percentage, so we need to use the invNorm command.
Cecil would need to be about \begin{align*}173 \;\mathrm{cm}\end{align*} tall to be in the top \begin{align*}1\%\end{align*} of \begin{align*}12\end{align*} yearold boys in Britain.
Example:
Suppose that the distribution of mass of female marine iguanas Puerto Villamil in the Galapagos Islands is approximately normal with a mean mass of \begin{align*}950 \;\mathrm{g}\end{align*} and a standard deviation of \begin{align*}325 \;\mathrm{g}\end{align*}. There are very few young marine iguanas in the populated areas of the islands because feral cats tend to kill them. How rare is it that we would find a female marine iguana with a mass less than \begin{align*}400 \;\mathrm{g}\end{align*} in this area?
Solution:
Using the graphing calculator we need to approximate the probability of being less than \begin{align*}200 \;\mathrm{grams}\end{align*}.
With a probability of approximately \begin{align*}0.045\end{align*}, we could say it is rather unlikely (only about \begin{align*}5\%\end{align*} of the time) that we would find an iguana this small.
Lesson Summary
In order to find the percentage of data in between two values (or the probability of a randomly chosen value being between those values) in a normal distribution, we can use the normalcdf command on the TI83/84 calculator. When you know the percentage or probability, use the invNorm command to find a \begin{align*}z\end{align*}score or value of the variable. In order to use these tools in real situations, we need to know that the distribution of the variable in question is approximately normal. When solving problems using normal probabilities, it helps to draw a sketch of the distribution and shade the appropriate region.
Points to Consider
 How do the probabilities of a standard normal curve apply to making decisions about unknown parameters for a population given a sample?
Review Questions
 Which of the following intervals contains the middle \begin{align*}95\%\end{align*} of the data in a standard normal distribution?
 \begin{align*}z < 2\end{align*}
 \begin{align*}z \le 1.645\end{align*}
 \begin{align*}z \le 1.96\end{align*}
 \begin{align*}1.645 \le z \le 1.645\end{align*}
 \begin{align*}1.96 \le z \le 1.96\end{align*}
 For each of the following problems, \begin{align*}x\end{align*} is a continuous random variable with a normal distribution and the given mean and standard deviation. \begin{align*}P\end{align*} is the probability of a value of the distribution being less than \begin{align*}x\end{align*}. Find the missing value and sketch and shade the distribution.
 \begin{align*}& \text{mean} && \text{Standard deviation} && \text{x} && \text{P} \\ & 85 && 4.5 && && 0.68 \end{align*}
 \begin{align*}& \text{mean} && \text{Standard deviation} && \text{x} && \text{P} \\ & && 1 && 16 && 0.05\end{align*}
 \begin{align*}& \text{mean} && \text{Standard deviation} && \text{x} && \text{P} \\ & 73&&&& 85 && 0.91\end{align*}
 \begin{align*}& \text{mean} && \text{Standard deviation} && \text{x} && \text{P} \\ & 93 && 5 &&&& 0.90\end{align*}
 What is the \begin{align*}z\end{align*}score for the lower quartile in a standard normal distribution?
 The manufacturing process at a metal parts factory produces some slight variation in the diameter of metal ball bearings. The quality control experts claim that the bearings produced have a mean diameter of \begin{align*}1.4 \;\mathrm{cm}\end{align*}. If the diameter is more than \begin{align*}.0035 \;\mathrm{cm}\end{align*} to wide or too narrow, they will not work properly. In order to maintain its reliable reputation, the company wishes to insure that no more than \begin{align*}1/10^{th}\end{align*} of \begin{align*}1\%\end{align*} of the bearings that are made are ineffective. What should the standard deviation of the manufactured bearings be in order to meet this goal?
 Suppose that the wrapper of a certain candy bar lists its weight as \begin{align*}2.13\;\mathrm{ounces}\end{align*}. Naturally, the weights of individual bars vary somewhat. Suppose that the weights of these candy bars vary according to a normal distribution with \begin{align*}\mu = 2.2 \;\mathrm{ounces}\end{align*} and \begin{align*}\sigma = .04 \;\mathrm{ounces}\end{align*}.
 What proportion of candy bars weigh less than the advertised weight?
 What proportion of candy bars weight between \begin{align*}2.2\end{align*} and \begin{align*}2.3 \;\mathrm{ounces}\end{align*}?
 What weight candy bar would be heavier than all but \begin{align*}1\%\end{align*} of the candy bars out there?
 If the manufacturer wants to adjust the production process so no more than \begin{align*}1\end{align*} candy bar in \begin{align*}1000 \;\mathrm{weighs}\end{align*} less than the advertised weight, what should the mean of the actual weights be? (Assuming the standard deviation remains the same)
 If the manufacturer wants to adjust the production process so that the mean remains at \begin{align*}2.2 \;\mathrm{ounces}\end{align*} and no more than \begin{align*}1\end{align*} candy bar in \begin{align*}1000 \;\mathrm{weighs}\end{align*} less than the advertised weight, how small does the standard deviation of the weights need to be??
Review Answers
 e
 \begin{align*}87.1\end{align*}
 \begin{align*}17.64\end{align*}
 \begin{align*}8.96\end{align*}
 \begin{align*}99.41\end{align*}
 \begin{align*}0.674\end{align*}

\begin{align*}\sigma \approx 0.00106\end{align*}
 \begin{align*}\approx 0.04\end{align*}
 \begin{align*}\approx 0.49\end{align*}
 \begin{align*}\approx 2.29 \;\mathrm{ounces}\end{align*}
 \begin{align*}\approx 2.254 \;\mathrm{ounces}\end{align*}
 \begin{align*}\approx 0.023 \;\mathrm{ounces}\end{align*}
References
Other sites of interest
 This one contains the formula for the normal probability density function: http://davidmlane.com/hyperstat/A25726.html
 This one contains some background of the normal distribution, including a picture of Carl Friedrich Gauss, a German mathematician who first used the function: http://www.willamette.edu/~mjaneba/help/normalcurve.html
 This one is highly mathematical: http://en.wikipedia.org/wiki/Normal_distribution
Keywords
Normal Distribution
Density Curve
Standard Normal Curve
Empirical Rule
\begin{align*}Z\end{align*} Scores
Normal Probability Plot (or Normal Quantile Plot)
Cumulative Density Function
Probability Density Function
Inflection Points
Notes/Highlights Having trouble? Report an issue.
Color  Highlighted Text  Notes  

Please Sign In to create your own Highlights / Notes  
Show More 