Designing, Conducting, and Analyzing Surveys
A survey is a way to ask a lot of people a few well-constructed questions. The survey is a series of unbiased questions that the subject must answer. Some advantages of surveys are that they are efficient ways of collecting information from a large number of people, they are relatively easy to administer, a wide variety of information can be collected and they can be focused (researchers can stick to just the questions that interest them.) Some disadvantages of surveys arise from the fact that they depend on the subjects’ motivation, honesty, memory and ability to respond. Moreover, answer choices to survey questions could lead to vague data. For example, the choice “moderately agree” may mean different things to different people or to whoever ends up interpreting the data.
Conducting a Survey
There are various methods for administering a survey. It can be done as a face-to face interview or a phone interview where the researcher is questioning the subject. A different option is to have a self-administered survey where the subject can complete a survey on paper and mail it back, or complete the survey online. There are advantages and disadvantages to each of these methods.
The advantages of face-to-face interviews include fewer misunderstood questions, fewer incomplete responses, higher response rates, and greater control over the environment in which the survey is administered; also, the researcher can collect additional information if any of the respondents’ answers need clarifying. The disadvantages of face-to-face interviews are that they can be expensive and time-consuming and may require a large staff of trained interviewers. In addition, the response can be biased by the appearance or attitude of the interviewer.
The advantages of self-administered surveys are that they are less expensive than interviews, do not require a large staff of experienced interviewers and can be administered in large numbers. In addition, anonymity and privacy encourage more candid and honest responses, and there is less pressure on respondents. The disadvantages of self-administered surveys are that responders are more likely to stop participating mid-way through the survey and respondents cannot ask them to clarify their answers. In addition, there are lower response rates than in personal interviews, and often the respondents who bother to return surveys represent extremes of the population – those people who care about the issue strongly, whichever way their opinion leans.
Designing a Survey
Surveys can take different forms. They can be used to ask only one question or they can ask a series of questions. We can use surveys to test out people’s opinions or to test a hypothesis.
When designing a survey, the following steps are useful:
- Determine the goal of your survey: What question do you want to answer?
- Identify the sample population: Whom will you interview?
- Choose an interviewing method: face-to-face interview, phone interview, self-administered paper survey, or internet survey.
- Decide what questions you will ask in what order, and how to phrase them. (This is important if there is more than one piece of information you are looking for.)
- Conduct the interview and collect the information.
- Analyze the results by making graphs and drawing conclusions.
Constructing a Survey
1. Martha wants to construct a survey that shows which sports students at her school like to play the most.
a) List the goal of the survey.
The goal of the survey is to find the answer to the question: “Which sports do students at Martha’s school like to play the most?”
b) What population sample should she interview?
A sample of the population would include a random sample of the student population in Martha’s school. A good strategy would be to randomly select students (using dice or a random number generator) as they walk into an all-school assembly.
c) How should she administer the survey?
Face-to-face interviews are a good choice in this case. Interviews will be easy to conduct since the survey consists of only one question which can be quickly answered and recorded, and asking the question face to face will help eliminate non-response bias.
d) Create a data collection sheet that she can use to record her results.
In order to collect the data to this simple survey Martha can design a data collection sheet such as the one below:
Sport | Tally |
---|---|
baseball | |
basketball | |
football | |
soccer | |
volleyball | |
swimming |
This is a good, simple data collection sheet because:
- Plenty of space is left for the tally marks.
- Only one question is being asked.
- Many possibilities are included, but space is left at the bottom in case students give answers that Martha didn’t think of.
- The answer from each interviewee can be quickly collected and then the data collector can move on to the next person.
Once the data has been collected, suitable graphs can be made to display the results.
2. Raoul wants to construct a survey that shows how many hours per week the average student at his school works.
a) List the goal of the survey.
The goal of the survey is to find the answer to the question “How many hours per week do you work?”
b) What population sample will he interview?
Raoul suspects that older students might work more hours per week than younger students. He decides that a stratified sample of the student population would be appropriate in this case. The strata are grade levels \begin{align*}9^{th}\end{align*}
c) How would he administer the survey?
Face-to-face interviews are a good choice in this case since the survey consists of two short questions which can be quickly answered and recorded.
d) Create a data collection sheet that Raoul can use to record his results.
In order to collect the data for this survey Raoul designed the data collection sheet shown below:
Grade Level | Number of Hours Worked | Total number of students |
---|---|---|
\begin{align*}9^{th}\end{align*} |
||
\begin{align*}10^{th}\end{align*} |
||
\begin{align*}11^{th}\end{align*} |
||
\begin{align*}12^{th}\end{align*} |
This data collection sheet allows Raoul to write down the actual numbers of hours worked per week by students as opposed to just collecting tally marks for several categories.
Display, Analyze, and Interpret Statistical Survey Data
In the previous section we considered two examples of surveys you might conduct in your school. The first one was designed to find the sport that students like to play the most. The second survey was designed to find out how many hours per week students worked.
For the first survey, students’ choices fit neatly into separate categories. Appropriate ways to display the data might be a pie chart or a bar graph. Let’s revisit this example.
In Example A Martha interviewed 112 students and obtained the following results.
Sport | Tally | |
---|---|---|
baseball | \begin{align*} \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ |\end{align*} |
31 |
basketball | \begin{align*} \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ {||}\end{align*} |
17 |
football | \begin{align*} \bcancel{||||} \ \bcancel{||||} \ {||||}\end{align*} |
14 |
soccer | \begin{align*} \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ {|||}\end{align*} |
28 |
volleyball | \begin{align*} \bcancel{||||} \ {||||}\end{align*} |
9 |
swimming | \begin{align*} \bcancel{||||} \ {|||}\end{align*} |
8 |
gymnastics | \begin{align*}{|||}\end{align*} |
3 |
fencing | \begin{align*}{||}\end{align*} |
2 |
Total: 112 |
a) Make a bar graph of the results showing the percentage of students in each category.
To make a bar graph, we list the sport categories on the \begin{align*}x-\end{align*}
To find the percentage of students in each category, we divide the number of students in each category by the total number of students surveyed:
Sport | Percentage |
---|---|
baseball | \begin{align*} \frac{31}{112}=.28= 28\%\end{align*} |
basketball | \begin{align*} \frac{17}{112}=.15=15\% \end{align*} |
football | \begin{align*} \frac{14}{112}=.125=12.5\%\end{align*} |
soccer | \begin{align*} \frac{28}{112}=.25=25\%\end{align*} |
volleyball | \begin{align*} \frac{9}{112}=.08=8\%\end{align*} |
swimming | \begin{align*} \frac{8}{112}=.07=7\%\end{align*} |
gymnastic | \begin{align*} \frac{3}{112}=.025=2.5\%\end{align*} |
fencing | \begin{align*} \frac{2}{112}=.02 = 2\%\end{align*} |
Now we can make a graph where the height of each bar represents the percentage of students in each category:
b. Make a pie chart of the collected information, showing the percentage of students in each category.
To make a pie chart, we find the percentage of the students in each category by dividing the number of students in each category as in part a. The central angle of each slice of the pie is found by multiplying the percentage of students in each category by 360 degrees (the total number of degrees in a circle). To draw a pie-chart by hand, you can use a protractor to measure the central angles that you find for each category.
Sport | Percentage | Central angle |
---|---|---|
baseball | \begin{align*} \frac{31}{112}=.28=28\%\end{align*} |
\begin{align*}.28 \times 360^{\circ} = 101^{\circ}\end{align*} |
basketball | \begin{align*} \frac{17}{112}=.15 = 15\%\end{align*} |
\begin{align*}.15 \times 360^{\circ} = 54^{\circ}\end{align*} |
football | \begin{align*} \frac{14}{112}=.125 = 12.5\%\end{align*} |
\begin{align*}.125 \times 360^{\circ} = 45^{\circ}\end{align*} |
soccer | \begin{align*} \frac{28}{112}=.25 = 25\% \end{align*} |
\begin{align*}.25 \times 360^{\circ} = 90^{\circ}\end{align*} |
volleyball | \begin{align*} \frac{9}{112}=.08 = 8\%\end{align*} |
\begin{align*}.08 \times 360^{\circ} = 29^{\circ}\end{align*} |
swimming | \begin{align*} \frac{8}{112}=.07 = 7\% \end{align*} |
\begin{align*}.07 \times 360^{\circ} = 25^{\circ}\end{align*} |
gymnastics | \begin{align*} \frac{3}{112}=.025=2.5\% \end{align*} |
\begin{align*}.025 \times 360^{\circ} = 9^{\circ}\end{align*} |
fencing | \begin{align*} \frac{2}{112}=.02 = 2\% \end{align*} |
\begin{align*}.02 \times 360^{\circ} = 7^{\circ}\end{align*} |
Here is the pie-chart that represents the percentage of students in each category:
For the second survey, actual numerical data can be collected from each student. In this case we can display the data using a stem-and-leaf plot, a frequency table, a histogram, or a box-and-whisker plot.
Examples
In the second example Raoul found that that 30% of the students at his school are in \begin{align*}9^{th}\end{align*}
Grade Level | Number of hours worked | Total number of students |
---|---|---|
\begin{align*}9^{th}\end{align*} |
0, 5, 4, 0, 0, 10, 5, 6, 0, 0, 2, 4, 0, 8, 0, 5, 7, 0 |
18 |
\begin{align*}10^{th}\end{align*} |
6, 10, 12, 0, 10, 15, 0, 0, 8, 5, 0, 7, 10, 12, 0, 0 |
16 |
\begin{align*}11^{th}\end{align*} |
0, 12, 15, 18, 10, 0, 0, 20, 8, 15, 10, 15, 0, 5 |
14 |
\begin{align*}12^{th}\end{align*} |
22, 15, 12, 15, 10, 0, 18, 20, 10, 0, 12, 16 |
12 |
Example 1
Construct a stem-and-leaf plot of the collected data
The ordered stem-and-leaf plot looks as follows:
\begin{align*}& 0 \qquad 0 \ 0 \ 0 \ 0 \ 0 \ 0 \ 0 \ 0 \ 0 \ 0 \ 0 \ 0 \ 0 \ 0 \ 0 \ 0 \ 0 \ 0 \ 0 \ 0 \ 2 \ 4 \ 4 \ 5 \ 5 \ 5 \ 5 \ 5 \ 6 \ 6 \ 7 \ 7 \ 8 \ 8 \ 8\\
& 1 \qquad 0 \ 0 \ 0 \ 0 \ 0 \ 0 \ 0 \ 0 \ 2 \ 2 \ 2 \ 2 \ 2 \ 5 \ 5 \ 5 \ 5 \ 5 \ 5 \ 6 \ 8 \ 8\\
& 2 \qquad 0 \ 0 \ 2\end{align*}
We can easily see from the stem-and-leaf plot that the mode of the data is 0. This makes sense because many students do not work in high school.
Example 2
Construct a frequency table with bin size of 5.
We construct the frequency table by counting how many students fit in each category.
Hours worked | Frequency |
---|---|
\begin{align*}0 \le x < 5\end{align*} | 23 |
\begin{align*}5 \le x < 10\end{align*} | 12 |
\begin{align*}10 \le x < 15\end{align*} | 13 |
\begin{align*}15 \le x < 20\end{align*} | 9 |
\begin{align*}20 \le x < 25\end{align*} | 3 |
Example 3
Draw a histogram of the data.
The histogram associated with this frequency table is shown below.
Example 4
Find the five number summary of the data and draw a box-and-whisker plot.
The five number summary is as follows:
smallest number = 0
largest number = 22
Since there are 60 data points, \begin{align*}\left ( \frac{n+1}{2} \right ) = 30.5\end{align*}. The median is the mean of the \begin{align*}30^{th}\end{align*} and the \begin{align*}31^{st}\end{align*} values:
median = 6.5
Since each half of the list has 30 values in it, then the first and third quartiles are the medians of each of the smaller lists. The first quartile is the mean of the \begin{align*}15^{th}\end{align*} and \begin{align*}16^{th}\end{align*} values:
first quartile = 0
The third quartile is the mean of the \begin{align*}45^{th}\end{align*} and \begin{align*}46^{th}\end{align*} values:
third quartile = 12
The associated box-and-whisker plot is shown below.
Review
- Make a pie chart for the problem in the Guided Practice. Specifically a total of 60 students in four groups composed of: 18 ninth grade students, 16 tenth grade students, 14 eleventh grade students, and 12 twelfth grade students.
- Melissa conducted a survey to answer the question “What sport do high school students like to watch on TV the most?” She collected the following information on her data collection sheet.
Sport | Tally | |
---|---|---|
baseball | \begin{align*} \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ ||\end{align*} | 32 |
basketball | \begin{align*} \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ {|||}\end{align*} | 28 |
football | \begin{align*} \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ {||||}\end{align*} | 24 |
soccer | \begin{align*} \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ {|||} \end{align*} | 18 |
gymnastics | \begin{align*} \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ {||||} \end{align*} | 19 |
figure skating | \begin{align*} \bcancel{||||} \ {|||} \end{align*} | 8 |
hockey | \begin{align*} \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ {|||} \end{align*} | 18 |
Total: | 147 |
a) Make a pie-chart of the results showing the percentage of people in each category.
b) Make a bar-graph of the results.
- Samuel conducted a survey to answer the following question: “What is the favorite kind of pie of the people living in my town?” By standing in front of his grocery store, he collected the following information on his data collection sheet:
Type of pie | Tally | |
---|---|---|
apple | \begin{align*}\bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ ||\end{align*} | 37 |
pumpkin | \begin{align*}\bcancel{||||} \ \bcancel{||||} \ |||\end{align*} | 13 |
lemon meringue | \begin{align*}\bcancel{||||} \ ||\end{align*} | 7 |
chocolate mousse | \begin{align*}\bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ |||\end{align*} | 23 |
cherry | \begin{align*}||||\end{align*} | 4 |
chicken pot pie | \begin{align*}\bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ |\end{align*} | 31 |
other | \begin{align*} \bcancel{||||} \ ||\end{align*} | 7 |
Total: | 122 |
a) Make a pie chart of the results showing the percentage of people in each category.
b) Make a bar graph of the results.
- Myra conducted a survey of people at her school to see “In which month does a person’s birthday fall?” She collected the following information in her data collection sheet:
Month | Tally | |
---|---|---|
January | \begin{align*} \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ {|}\end{align*} | 16 |
February | \begin{align*} \bcancel{||||} \ \bcancel{||||} \ {|||}\end{align*} | 13 |
March | \begin{align*} \bcancel{||||} \ \bcancel{||||} \ {||}\end{align*} | 12 |
April | \begin{align*} \bcancel{||||} \ \bcancel{||||} \ {|}\end{align*} | 11 |
May | \begin{align*} \bcancel{||||} \ \bcancel{||||} \ {|||}\end{align*} | 13 |
June | \begin{align*} \bcancel{||||} \ \bcancel{||||} \ {||}\end{align*} | 12 |
July | \begin{align*} \bcancel{||||} \ {||||}\end{align*} | 9 |
August | \begin{align*} \bcancel{||||} \ {||}\end{align*} | 7 |
September | \begin{align*} \bcancel{||||} \ {||||}\end{align*} | 9 |
October | \begin{align*} \bcancel{||||} \ {|||}\end{align*} | 8 |
November | \begin{align*} \bcancel{||||} \ \bcancel{||||} \ {|||}\end{align*} | 13 |
December | \begin{align*} \bcancel{||||} \ \bcancel{||||} \ {|||}\end{align*} | 13 |
Total: | 136 |
a) Make a pie chart of the results showing the percentage of people whose birthday falls in each month.
b) Make a bar graph of the results.
- Nam-Ling conducted a survey that answers the question “Which student would you vote for in your school’s elections?” She collected the following information:
Candidate | \begin{align*}9^{th}\end{align*} graders | \begin{align*}10^{th}\end{align*} graders | \begin{align*}11^{th}\end{align*} graders | \begin{align*}12^{th}\end{align*} graders | Total |
---|---|---|---|---|---|
Susan Cho | \begin{align*}\bcancel{||||} \ \bcancel{||||}\end{align*} | \begin{align*}||\end{align*} | \begin{align*}|\end{align*} | \begin{align*}\bcancel{||||} \ |\end{align*} | 19 |
Margarita Martinez | \begin{align*}\bcancel{||||} \ ||\end{align*} | \begin{align*}\bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ \bcancel{||||}\end{align*} | \begin{align*}||||\end{align*} | 31 | |
Steve Coogan | \begin{align*}\bcancel{||||}\end{align*} | \begin{align*}||||\end{align*} | \begin{align*}||\end{align*} | \begin{align*}\bcancel{||||}\end{align*} | 16 |
Solomon Duning | \begin{align*}\bcancel{||||} \ |\end{align*} | \begin{align*}|\end{align*} | \begin{align*}\bcancel{||||} \ \bcancel{||||} \ \bcancel{||||} \ |||\end{align*} | \begin{align*}|\end{align*} | 26 |
Juan Rios | \begin{align*}\bcancel{||||} \ |||\end{align*} | \begin{align*}|||\end{align*} | \begin{align*}\bcancel{||||} \end{align*} | \begin{align*}\bcancel{||||} \ \bcancel{||||} \ ||\end{align*} | 28 |
Total | 36 | 30 | 30 | 24 | 120 |
a) Make a pie chart of the results showing the percentage of people planning to vote for each candidate.
b) Make a bar graph of the results.
- Graham conducted a survey to find how many hours of TV teenagers watch each week in the United States. He collaborated with three friends that lived in different parts of the US and found the following information:
Part of the country | Number of hours of TV watched per week | Total number of teens |
---|---|---|
West Coast | 10, 12, 8, 20, 6, 0, 15, 18, 12, 22, 9, 5, 16, 12, 10, 18, 10, 20, 24, 8 | 20 |
Mid West | 20, 12, 24, 10, 8, 26, 34, 15, 18, 6, 22, 16, 10, 20, 15, 25, 32, 12, 18, 22 | 20 |
New England | 16, 9, 12, 0, 6, 10, 15, 24, 20, 30, 15, 10, 12, 8, 28, 32, 24, 12, 10, 10 | 20 |
South | 24, 22, 12, 32, 30, 20, 25, 15, 10, 14, 10, 12, 24, 28, 32, 38, 20, 25, 15, 12 | 20 |
a) Make a stem-and-leaf plot of the data.
b) Decide on an appropriate bin size and construct a frequency table.
c) Make a histogram of the results.
d) Find the five-number summary of the data and construct a box-and-whisker plot.
In exercises 7-10, consider the following survey questions.
- “What do students in your high-school like to spend their money on?”
- Which categories would you include on your data collection sheet?
- Design the data collection sheet that can be used to collect this information.
- Conduct the survey. This activity is best done as a group with each person contributing at least 20 results.
- Make a pie chart of the results showing the percentage of people in each category.
- Make a bar graph of the results.
- “What is the height of students in your class?”
- Design the data collection sheet that can be used to collect this information.
- Conduct the survey. This activity is best done as a group with each person contributing at least 20 results.
- Make a stem-and-leaf plot of the data.
- Decide on an appropriate bin size and construct a frequency table.
- Make a histogram of the results.
- Find the five-number summary of the data and construct a box-and-whisker plot.
- “How much allowance money do students in your school get per week?”
- Design the data collection sheet that can be used to collect this information,
- Conduct the survey. This activity is best done as a group with each person contributing at least 20 results.
- Make a stem-and-leaf plot of the data.
- Decide on an appropriate bin size and construct a frequency table.
- Make a histogram of the results.
- Find the five-number summary of the data and construct a box-and-whisker plot.
- “What time do students in your school get up in the morning during the school week?”
- Design the data collection sheet that can be used to collect this information.
- Conduct the survey. This activity is best done as a group with each person contributing at least 20 results.
- Make a stem-and-leaf plot of the data.
- Decide on an appropriate bin size and construct a frequency table.
- Make a histogram of the results.
- Find the five-number summary of the data and construct a box-and-whisker plot.
Review (Answers)
To view the Review answers, open this PDF file and look for section 13.14.
R