### The Empirical Rule

You have already learned that 68% of the data in a normal distribution lies within 1 standard deviation of the mean, 95% of the data lies within 2 standard deviations of the mean, and 99.7% of the data lies within 3 standard deviations of the mean. This is referred to as the **Empirical Rule**, which is also known as the 68-95-99.7 Rule. Because the normal curve is symmetric, these percentages split into fixed values for each region to the left and to the right of the mean: 34% between the mean and 1 standard deviation, 13.5% between 1 and 2 standard deviations, 2.35% between 2 and 3 standard deviations, and 0.15% beyond 3 standard deviations.

These percentages are used to answer real-world problems when both the mean and the standard deviation of a data set are known. Also keep in mind that since 99.7% of the data in a normal distribution is within 3 standard deviations of the mean, \begin{align*}1-99.7\%=0.3\%\end{align*} of the data is more than 3 standard deviations from the mean, with 0.15% in each tail.
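As a minimal sketch of how the region percentages combine, the snippet below stores the eight regions of the curve and adds up the ones between two standard-deviation marks. The names `BANDS` and `percent_between` are illustrative choices, not part of the lesson, and the values are kept in hundredths of a percent so the sums stay exact.

```python
import math

# Region percentages in hundredths of a percent, from left to right:
# below -3 SD, -3 to -2, -2 to -1, -1 to mean, mean to +1, +1 to +2, +2 to +3, above +3 SD
BANDS = [15, 235, 1350, 3400, 3400, 1350, 235, 15]


def percent_between(lo, hi):
    """Percentage of data between lo and hi standard deviations from the mean.

    lo and hi must be whole numbers from -3 to 3, or -math.inf / math.inf
    for an open-ended tail (an assumption of this sketch).
    """
    def edge_index(k):
        if k == -math.inf:
            return 0
        if k == math.inf:
            return 8
        if k not in range(-3, 4):
            raise ValueError("the Empirical Rule only covers whole SDs from -3 to 3")
        return int(k) + 4

    return sum(BANDS[edge_index(lo):edge_index(hi)]) / 100


print(percent_between(-1, 1))         # 68.0
print(percent_between(-2, 2))         # 95.0
print(percent_between(-3, 3))         # 99.7
print(percent_between(-2, math.inf))  # 97.5
```

The same function handles any question phrased as "between a and b standard deviations," as long as both ends fall on whole standard deviations.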

#### Real-World Application: Lifetime of Light Bulbs

The lifetimes of a certain type of light bulb are normally distributed. The mean life is 400 hours, and the standard deviation is 75 hours. For a group of 5,000 light bulbs, how many are expected to last each of the following times?

a) between 325 hours and 475 hours

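The interval from 325 hours to 475 hours is within 1 standard deviation of the mean (400 ± 75), so 68% of the light bulbs are expected to last between 325 hours and 475 hours. This means that \begin{align*}5,000 \times 0.68=3,400\end{align*} light bulbs are expected to last between 325 hours and 475 hours.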

b) more than 250 hours

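Since 250 hours is 2 standard deviations below the mean, \begin{align*}95\% + 2.35\% + 0.15\% = 97.5\%\end{align*} of the light bulbs are expected to last more than 250 hours. This means that \begin{align*}5,000 \times 0.975=4,875\end{align*} light bulbs are expected to last more than 250 hours.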

c) less than 250 hours

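Only \begin{align*}2.35\% + 0.15\% = 2.5\%\end{align*} of the light bulbs are expected to last less than 250 hours. This means that \begin{align*}5,000 \times 0.025=125\end{align*} light bulbs are expected to last less than 250 hours.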
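The light bulb counts can be checked with a short script. This is a sketch only: it takes the 68%, 97.5%, and 2.5% shares from the Empirical Rule and, for comparison, computes the shares from the exact normal curve with Python's standard `statistics.NormalDist`. The variable names are illustrative.

```python
from statistics import NormalDist

MEAN_HOURS = 400
SD_HOURS = 75
TOTAL_BULBS = 5000

# Shares of the data taken from the Empirical Rule
rule_shares = {
    "between 325 and 475 hours": 0.68,
    "more than 250 hours": 0.975,
    "less than 250 hours": 0.025,
}
rule_counts = {
    label: round(TOTAL_BULBS * share) for label, share in rule_shares.items()
}

# Shares of the data computed from the exact normal curve
lifetimes = NormalDist(mu=MEAN_HOURS, sigma=SD_HOURS)
exact_shares = {
    "between 325 and 475 hours": lifetimes.cdf(475) - lifetimes.cdf(325),
    "more than 250 hours": 1 - lifetimes.cdf(250),
    "less than 250 hours": lifetimes.cdf(250),
}

for label in rule_shares:
    exact_count = TOTAL_BULBS * exact_shares[label]
    print(f"{label}: rule {rule_counts[label]}, exact curve {exact_count:.0f}")
```

The rule-based counts are 3,400, 4,875, and 125. The exact-curve counts differ slightly because the Empirical Rule rounds the true percentages.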

#### Real-World Application: Bags of Chips

A bag of chips has a mean mass of 70 g, with a standard deviation of 3 g. Assuming a normal distribution, create a normal curve, including all necessary values.

a) If 1,250 bags of chips are processed each day, how many bags will have a mass between 67 g and 73 g?

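Between 67 g and 73 g lies 68% of the data, since these values are 1 standard deviation below and above the mean. If 1,250 bags of chips are processed, \begin{align*}1,250 \times 0.68 = 850\end{align*} bags will have a mass between 67 g and 73 g.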

b) What percentage of the bags of chips will have a mass greater than 64 g?

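Since 64 g is 2 standard deviations below the mean, \begin{align*}95\% + 2.35\% + 0.15\% = 97.5\%\end{align*} of the bags of chips will have a mass greater than 64 g.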
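The "necessary values" on the normal curve are the mean and the marks 1, 2, and 3 standard deviations on either side of it. A minimal sketch of that calculation for the bags of chips follows; the helper name `curve_marks` is a hypothetical choice for illustration.

```python
def curve_marks(mean, sd):
    """Values at -3, -2, -1, 0, +1, +2, +3 standard deviations from the mean."""
    return [mean + k * sd for k in range(-3, 4)]


marks = curve_marks(70, 3)
print(marks)  # [61, 64, 67, 70, 73, 76, 79]

# Part (a): 67 g and 73 g are the marks at -1 SD and +1 SD, so 68% applies.
bags_per_day = 1250
bags_within_one_sd = round(bags_per_day * 0.68)
print(bags_within_one_sd)  # 850
```

Reading the marks off the list shows that 64 g sits at 2 standard deviations below the mean, which is why part (b) adds every region to the right of that mark.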

#### Real-World Application: Half Marathon

The finishing times for people completing a half marathon in 2010 were normally distributed, with a mean of 130 minutes and a standard deviation of 20 minutes. If 1,400,000 people completed a half marathon in 2010, how many people had finishing times between each of the following pairs of times?

a) 90 minutes and 150 minutes

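Between 90 minutes and 150 minutes lies \begin{align*}13.5\% + 68\% = 81.5\%\end{align*} of the data. This means that \begin{align*}1,400,000 \times 0.815 = 1,141,000\end{align*} of the people who completed a half marathon in 2010 had a time between 90 minutes and 150 minutes.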

b) 110 minutes and 130 minutes

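Between 110 minutes and 130 minutes lies 34% of the data. This means that \begin{align*}1,400,000 \times 0.34 = 476,000\end{align*} of the people who completed a half marathon in 2010 had a time between 110 minutes and 130 minutes.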

c) 130 minutes and 190 minutes

Between 130 minutes and 190 minutes lies \begin{align*}34\% + 13.5\% + 2.35\% = 49.85\%\end{align*} of the data. This means that \begin{align*}1,400,000 \times 0.4985 = 697,900\end{align*} of the people who completed a half marathon in 2010 had a time between 130 minutes and 190 minutes.
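The three half marathon answers follow the same steps: convert each time to a number of standard deviations from the mean, add the regions in between, and multiply by the number of finishers. The sketch below shows one way to do this. The names `WIDTHS` and `expected_count` are illustrative, and the function assumes both times fall exactly on whole standard deviations within 3 of the mean.

```python
# Region percentages in hundredths of a percent for
# -3 to -2, -2 to -1, -1 to mean, mean to +1, +1 to +2, +2 to +3 SD
WIDTHS = [235, 1350, 3400, 3400, 1350, 235]


def sd_steps(value, mean, sd):
    """Whole number of standard deviations that value lies from the mean."""
    steps, remainder = divmod(value - mean, sd)
    if remainder != 0 or not -3 <= steps <= 3:
        raise ValueError("value must be a whole number of SDs within 3 of the mean")
    return steps


def expected_count(low, high, mean, sd, total):
    """Expected number of data values between low and high."""
    lo_steps = sd_steps(low, mean, sd)
    hi_steps = sd_steps(high, mean, sd)
    hundredths = sum(WIDTHS[lo_steps + 3:hi_steps + 3])
    return total * hundredths // 10_000


finishers = 1_400_000
print(expected_count(90, 150, 130, 20, finishers))   # 1141000
print(expected_count(110, 130, 130, 20, finishers))  # 476000
print(expected_count(130, 190, 130, 20, finishers))  # 697900
```

Working in hundredths of a percent keeps the arithmetic in whole numbers, which avoids rounding surprises with values such as 49.85%.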

**Points to Consider**

- Is the normal distribution curve the only way to represent data?
- The normal distribution curve shows the spread of the data, but it does not show the actual data values. Do other representations of data show the actual data values?

-->

### Example

#### Example 1

Sudoku is a very popular logic game of number combinations. It originated in the late 1800s in the French newspaper *Le Siècle*. The average times (in minutes) it takes the students in a senior math class to complete a Sudoku puzzle are normally distributed and are listed below. Draw a normal distribution curve to represent this data. Determine the time within which a student must complete a Sudoku puzzle to be in the top 2.5%.

\begin{align*}& 20 \quad 15 \quad 21 \quad 24 \quad 7 \quad \ 19 \quad 10 \quad 17 \quad 15 \quad 22 \quad 31 \quad 19 \quad 20 \quad 21\\ & 21 \quad \ 9 \quad 12 \quad 26 \quad 24 \quad 28 \quad 19 \quad 16 \quad 24 \quad 11 \quad 17 \quad 31 \quad 25 \quad 13\\ & 16 \quad 18 \quad 22 \quad 32 \quad 9 \quad \ 15 \quad 19 \quad 27 \quad 14 \quad 25 \quad 32 \quad 29 \quad \quad \end{align*}

You can use the data from the 1-Var Stats calculation to draw the normal distribution curve:

According to the data, the mean of the times is approximately 19.9 minutes, and the standard deviation is approximately 6.6 minutes. Therefore, the normal distribution curve can be drawn as follows:

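The fastest times are at the left end of the curve. Two standard deviations below the mean is \begin{align*}19.9 - 2(6.6) = 6.7\end{align*} minutes. Since \begin{align*}2.35\% + 0.15\% = 2.5\%\end{align*} of the times are below 6.7 minutes, a student must complete a Sudoku puzzle in less than 6.7 minutes to be in the top 2.5%.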
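The mean, the standard deviation, and the cutoff can be reproduced without a graphing calculator. The sketch below uses Python's standard `statistics` module and assumes the population standard deviation (`pstdev`), since that choice matches the 6.6 minutes quoted above; the sample standard deviation would come out slightly larger.

```python
from statistics import mean, pstdev

# Sudoku completion times in minutes, copied from the example
times = [
    20, 15, 21, 24, 7, 19, 10, 17, 15, 22, 31, 19, 20, 21,
    21, 9, 12, 26, 24, 28, 19, 16, 24, 11, 17, 31, 25, 13,
    16, 18, 22, 32, 9, 15, 19, 27, 14, 25, 32, 29,
]

average = mean(times)
spread = pstdev(times)  # population standard deviation (assumed)

# The fastest 2.5% of times lie more than 2 standard deviations below the mean
cutoff = average - 2 * spread

print(round(average, 1))  # 19.9
print(round(spread, 1))   # 6.6
print(round(cutoff, 1))   # 6.7
```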

### Review

- What percentage of the data in a normal distribution is between 1 standard deviation below the mean and 2 standard deviations above the mean?
- What percentage of the data in a normal distribution is between 3 standard deviations below the mean and 1 standard deviation above the mean?
- What percentage of the data in a normal distribution is more than 2 standard deviations above the mean?
- What percentage of the data in a normal distribution is between 2 standard deviations below the mean and 3 standard deviations above the mean?
- What percentage of the data in a normal distribution is between 3 standard deviations below the mean and the mean?
- What percentage of the data in a normal distribution is more than 1 standard deviation above the mean?
- What percentage of the data in a normal distribution is between the mean and 2 standard deviations above the mean?
- 200 senior high students were asked how long they had to wait in the cafeteria line for lunch. Their responses were found to be normally distributed, with a mean of 15 minutes and a standard deviation of 3.5 minutes. (a) How many students would you expect to wait more than 11.5 minutes? (b) How many students would you expect to wait more than 18.5 minutes? (c) How many students would you expect to wait between 11.5 and 18.5 minutes?
- 350 babies were born at Neo Hospital in the past 6 months. Their weights were found to be normally distributed, with a mean of 6.8 lbs and a standard deviation of 0.5 lbs. (a) How many babies would you expect to weigh more than 7.3 lbs? (b) How many babies would you expect to weigh more than 7.8 lbs? (c) How many babies would you expect to weigh between 6.3 and 7.8 lbs?
- Sheldon has planted seedlings in his garden in the back yard. After 30 days, he measures the heights of the seedlings to determine how much they have grown. The differences in heights can be seen in the table below. The heights are measured in inches. Draw a normal distribution curve to represent the data. Determine what the range of the differences in heights of the seedlings is for the middle 68% of the data. \begin{align*}& 10 \quad 3 \quad 8 \quad 4 \quad \ \ 7 \quad 12 \quad 8 \quad \ 5 \quad 4 \quad \ 9 \quad \ 3 \quad 8\\ & 6 \quad 10 \quad 7 \quad 10 \quad 11 \quad 8 \quad 12 \quad 9 \quad 10 \quad 7 \quad 8 \quad 11\end{align*}

### Review (Answers)

To view the Review answers, open this PDF file and look for section 6.5.