7.3: BoxandWhisker Plots
Learning Objectives
 Construct a boxandwhisker plot.
 Construct and interpret a boxandwhisker plot.
 Construct boxandwhisker plots for comparison.
 Use technology to create boxandwhisker plots.
Introduction
An oil company claims that its premium grade gasoline contains an additive that significantly increases gas mileage. To prove their claim the selected 15 drivers and first filled each of their cars with 45L of regular gasoline and asked them to record their mileage. Then they filled each of the cars with 45L of premium gasoline and again asked them to record their mileage. The results below show the number of kilometers each car traveled.
Display each set of data to explain whether or not the claim made by the oil company is true or false.
We will revisit this problem later in the lesson to determine whether or not the oil company did place an additive in its premium gasoline that improved gas mileage.
BoxandWhisker Plot
A boxandwhisker plot is another type of graph used to display data. It shows how the data are dispersed around a median, but does not show specific values in the data. It does not show a distribution in as much detail as does a stemandleaf plot or a histogram, but it clearly shows where the data is located. This type of graph is often used when the number of data values is large or when two or more data sets are being compared. The center of the distribution, its spread and the range of the data are very obvious form the graph. The boxandwhisker plot (often called a box plot), divides the data into quarters by use of the medians of these quarters.
As we construct a boxandwhisker plot for a given set of data, you will understand how this type of graph is very useful in statistics.
Example 1:
You have a summer job working at Paddy’s Pond which is a recreational fishing spot where children can go to catch salmon which have been raised in a nearby fish hatchery and then transferred into the pond. The cost of fishing depends upon the length of the fish caught (\begin{align*}\$0.75\end{align*}
\begin{align*}& \text{Length of Fish (in.)}\\
& 13 \quad \ 14 \quad \ 6 \quad \ \ \ 9 \quad \ 10\\
& 21 \quad \ 17 \quad \ 15 \quad \ 15 \quad \ 7\\
& 10 \quad \ 13 \quad \ 13 \quad \ \ 8 \quad \ 11\end{align*}
Since the boxandwhisker plot is based on medians, the first step is to organize the data in order from smallest to largest.
\begin{align*}& 6\; \quad \ \ 7 \quad \ \ 8\; \quad \ \ 9 \ \quad \ 10\\
& 10 \quad \ 11 \quad \ 13 \quad \ 13 \quad \ 13\\
& 14 \quad \ 15 \quad \ 15 \quad \ 17 \quad \ 21\end{align*}
\begin{align*}6, 7, 8, 9, 10, 10, 11, 13, \fbox{{\color{blue}13}}, 13, 14, 15, 15, 17, 21\end{align*}
This is an odd number of data, so the median of all the data is the value in the middle position which is 13. There are 7 numbers before and 7 numbers after 13. The next step is the find the median of the first half of the data – the 7 numbers before the median. This is called the lower quartile since it is the first quarter of the data. On the graphing calculator this value is referred to as \begin{align*}Q_1\end{align*}
\begin{align*}6, 7, 8, \fbox{{\color{blue}9}}, 10, 10, 11\end{align*}
The median of the lower quartile is 9.
This step must be repeated for the second half of the data – the 7 numbers below the median of 13. This is called the upper quartile since it is the third quarter of the data. On the graphing calculator this value is referred to as \begin{align*}Q_3\end{align*}
\begin{align*}13, 13, 14, \fbox{{\color{blue}15}}, 15, 17, 21\end{align*}
Now that the medians have all been determined, it is time to construct the actual graph. The graph is drawn above a number line that includes all the values in the data set (graph paper works very well since the numbers can be placed evenly using the lines of the graph paper). Represent the following values by using small vertical lines above their corresponding values on the number line:
\begin{align*}& \text{Smallest Number}  6 && \text{Median of the Lower Quartile}  9 && \text{Median}  13\\
& \text{Median of the Upper Quartile}  15 && \text{Largest Number}  21\end{align*}
The five data values listed above are often called the fivenumber summary for the data set and are used to graph every boxandwhisker plot.
Join the tops and bottoms of the vertical lines that were drawn to represent the three median values. This will complete the box.
The three medians divide the data into four equal parts. In other words:
 Onequarter of the data values are located between 6 and 9
 Onequarter of the data values are located between 9 and 13
 Onequarter of the data values are located between 13 and 15
 Onequarter of the data values are located between 15 and 21
From the boxwhisker, any outliers (unusual data values that can be either low or high) can be easily seen on a box plot. An outlier would create a whisker that would be very long.
The next diagram will show where these numbers are actually located on the boxandwhisker plot.
Each whisker contains 25% of the data and the remaining 50% of the data is contained within the box. It is easy to see the range of the values as well as how these values are distributed around the middle value. The smaller the box, the more consistent the data values are with the median of the data.
Example 2:
After one month of growing, the heights of 30 parsley seed plants were measured and recorded. The measurements (in inches) are shown in the table below.
6  26  23  33  11  26 

22  28  30  40  38  18 
11  37  12  34  49  17 
25  37  46  39  8  27 
16  38  18  23  26  14 
Construct a boxandwhisker plot to represent the data.
The data organized from smallest to largest is shown in the table below. (You could use your calculator to quickly sort these values)
6  8  11  11  12  14 

16  17  18  18  22  23 
23  25  26  26  26  27 
28  30  33  34  37  37 
38  38  39  40  46  49 
There is an even number of data values so the median will be the mean of the two middle values. \begin{align*}Med = \frac{26 + 26}{2} = 26\end{align*}
The TI83 can also be used to create a boxand whisker plot. The fivenumber summary values can be determined by using the trace function of the calculator.
BoxandWhisker plots are very useful when two data sets need to be compared. The graphs are plotted, one above the other, on the same number line. This method can be used to determine whether or not the additive, which the oil company put in their premium gas, improved gas mileage.
From the above boxandwhisker plots, where the blue one represents the regular gasoline and the yellow one the premium gasoline, it is safe to say that the additive in the premium gasoline definitely increases the mileage. However, the value of 500 seems to be an outlier.
Lesson Summary
In this lesson you learned how the medians of a set of data can be used to represent the values in a meaningful graph called the boxandwhisker plot. You also learned that two sets of data can be compared by representing them using boxandwhisker plots graphed on the same number line. In addition, you also learned the importance of the fivenumber summary associated with a data set and how these values can be found on the TI83 when a boxand whisker plot is created using technology.
Points to Consider
 Are there still other ways to represent data graphically?
 We have seen how the mean and the median are used for graphical representations of data. Is the mode ever used to produce a graph?
Review Questions
 Below is the data that represents the amount of money that males spent on prom night, \begin{align*}& 25 \quad \ 60 \quad \ 120 \quad \ 64 \quad \ 65 \quad \ 28 \ \quad \ 110 \quad \ 60\\
& 70 \quad \ 34 \quad \ 35 \ \quad \ 70 \quad \ 58 \quad \ 100 \quad \ 55 \ \quad \ 95\\
& 55 \quad \ 95 \quad \ 93 \quad \ \ 50 \quad \ 75 \quad \ 35 \quad \ \ 40 \ \quad \ 75\\
& 90 \quad \ 40 \quad \ 50 \ \quad \ 80 \quad \ 85 \quad \ 50 \ \quad \ 80 \ \quad \ 47\\
& 50 \quad \ 80 \quad \ 90 \ \quad \ 42 \quad \ 49 \quad \ 84 \ \quad \ 35 \ \quad \ 70\end{align*}
25 60 120 64 65 28 110 6070 34 35 70 58 100 55 9555 95 93 50 75 35 40 7590 40 50 80 85 50 80 4750 80 90 42 49 84 35 70 Construct a boxandwhisker graph to represent the data.  Using the following boxand whisker plot, list three things pieces of information that you can determine from the graph.
 In a recent survey done at a high school cafeteria, a random selection of males and females were asked how much money they spent each month on school lunches. The following boxandwhisker plots compare the responses of males to those of females. The lower one is the response by males
 How much money did the middle 50% of each sex spend on school lunches each month?
 What is the significance of the value \begin{align*}\$42\end{align*}
$42 for males and \begin{align*}\$46\end{align*}$46 for females?  What conclusions can be drawn from the above plots? Explain.
 The following boxandwhisker plot shows final grades last semester. How would you best describe a typical grade in that course?
 Students typically made between 82 and 88.
 Students typically made between 41 and 82.
 Students typically made around 62.
 Students typically made between 58 and 82.
Review Answers
 Three things we can say from the graph are:
 The smallest number is 100
 The largest number is 195
 50% of the data is between 120 and 155
 (Males \begin{align*}\$22\end{align*}
$22  \begin{align*}\$58\end{align*}$58 ) (Females \begin{align*}\$28\end{align*}$28  \begin{align*}\$68\end{align*}$68 )  Median values.
 Females spend more money on lunches than males spend.
 Students typically made between 41 and 82.
Answer Key for Review Questions (even numbers)
2. Three things we can say from the graph are:
 The smallest number is 100
 The largest number is 195
 50% of the data is between 120 and 155
4. Students typically made between 41 and 82.
Vocabulary
 BrokenLine Graph
 A graph with line segments joining points that represent data.
 Continuous Data
 Data which has all meaningful values for the problem.
 Correlation
 A linear relationship between two variables.
 Data
 A set of numbers or observations that have meaning and are collected from a sample or a population.
 Discrete Data
 Data in which the values between the plotted points have no meaning for the problem.
 Double BrokenLine Graph
 Two brokenline graphs plotted on the same axis and used for comparison of data.
 Dot Plot
 A graph that shows the values of a variable along a number line.
 Linear Graph

A graph of a straight line that has an equation in the form \begin{align*}y = mx + b\end{align*}
y=mx+b
 Line of Best Fit
 A line connecting points on a scatter plot that best represents the data.
 Scatter Plot
 A plot of dots that shows the relationship between two variables.
 Bar Graph
 Graph that compares data using equally spaced bars to represent the data.
 Histogram
 A type of bar graph that has no spaces between the bars.
 StemandLeaf Plot
 A type of graph that is similar to a histogram and the data is arranged according to place value.