Datasets and Statistical Questions: Background You’ll Need 2

Variability

The question “Do states with state-wide, hands-free cell phone laws tend to have fewer automobile accidents?” anticipates variability. To vary means to differ, and we expect the different U.S. states to have different numbers of accidents. We also expect the number of accidents to differ among states that all have a hands-free cell phone law. We anticipate that there might be a difference in the number of accidents in states with such laws compared to the number of accidents in states without those laws, and our hope in answering the question would be to use statistics to determine whether there is a pattern or some kind of general trend that describes this difference, or this variation.

Note that in order for a question to anticipate variability, the answer to the question must involve multiple data points so that there is a possibility for the data to vary.

Example

What does it mean for a question to anticipate variability? Choose the best answer from the choices below.

a) The question may be answered incorrectly depending on who is being asked.

b) The answer may vary dependent upon when the question is being asked.

c) The question must involve multiple data points so that there is a possibility for the data to vary.

Questions that anticipate variability often include words and phrases that signal patterns or tendencies that could be revealed using the methods of statistics. Work in a small group to answer the remaining two questions.

Question 2

Which of the following questions anticipate variability? Select all that apply. There may be more than one correct answer.

  1. a) Do states with more public transportation options tend to have less traffic?
  2. b) Which states typically have the highest speed limits?
  3. c) What percentage of registered vehicles in New York are motorcycles?
  4. d) Do states with more rural areas tend to have more people who drive trucks?
  5. e) How many highways are there in the U.S. interstate highway system?

Now you try putting it all together to identify which questions are good, statistical questions by completing the table in Question 3.

Question 3

A statistical question has both qualities we’ve discussed, so it anticipates variability in the data and it does not have an exact answer.

For example, consider the question “Which state gave the most speeding tickets in 2019?” This question anticipates variability in that we would need to check the number of tickets given by each state, and those values would most certainly vary. However, once we had the list of ticket numbers from 2019, we would just identify the largest number. Therefore, the question has an exact answer, so it is still not a statistical question.

For each of the following questions, decide if the question anticipates variability and whether it has an exact answer. Based on those facts, decide if the question is a statistical question.

Question Does it have an exact answer? Does it anticipate variability? Is it a statistical question?
Do highways or roads that are not highways tend to see more traffic?
How many automobile accidents in the United States were attributed to distracted drivers in 2020?
Do younger people tend to get more speeding tickets than older people?
What percentage of states allows right turns on red lights?
Which state has the most registered vehicles?
Are people who live in Southern states more likely to drive SUVs?

You’ve had practice in this activity determining whether a question has an exact answer and anticipates variability or not. Hopefully, you feel more comfortable identifying the characteristics of a good statistical question now. It’s time to move on to the What to Know page to extend your understanding.