Corequisite Support Activity for 2A: Random Sampling

What you’ll need to know:

In this support activity you’ll become familiar with the following:

  • Use technology to create a dotplot from a dataset.
  • Answer questions about a variable using a dotplot.
  • Use a random number generator to select a random sample.
  • Anticipate sample-to-sample variability.

You will also have an opportunity to refresh the following skills:

  • Calculate a sample mean by hand

In the next preview assignment and in the next class, you will need to interpret features of a dotplot, use a random number generator to select a random sample from a finite population, and calculate the arithmetic mean. In this activity, you will become familiar with the data analysis tools that will be used throughout this course. You can access these tools on any smart device, including a phone.

Some new vocabulary will appear in this section of course material. These are terms you’ll discuss in greater terms later, that you’ll use throughout the course, and that you may have seen before. As you work through this assignment, try to draw the statistical meaning of the words random and sample as they are used in context.

The two tools in this activity are the Describing and Exploring Quantitative Data tool at  https://dcmathpathways.shinyapps.io/EDA_quantitative/ and the Generate Random Numbers tool at
https://dcmathpathways.shinyapps.io/RandomNumbers/. They’ll also be linked below as you need them.

Work in pairs during this activity if possible, in close proximity so that you can share and compare the outputs from the tool. If more than one of you share a device to complete the activity, switch roles halfway through so everyone gets practice using the tools.

Interpreting Dotplots

[insert image of a generic dotplot with labels, arrows pointing to individual observations, horizontal axis]

A dotplot is a graphical display of the distribution of a quantitative variable. It shows the variable’s possible values and the frequency of each value. In this corequisite support activity, you will use technology to generate a dotplot and then use the dotplot to describe the features of the distribution. You will explore other ways of visualizing a quantitative variable in Forming Connections [3C].

question 1

Go to the Describing and Exploring Quantitative Data tool at https://dcmathpathways.shinyapps.io/EDA_quantitative/.

Step 1) Select the Single Group tab.

Step 2) Locate the drop-down menu under Enter Data and select From Textbook.

Step 3) Locate the drop-down menu under Dataset and select Cereal Sodium Content.

Step 4) Under Choose Type of Plot, uncheck Histogram and Boxplot and check Dotplot. Then adjust the Dot size to 0.5 and the Bin width to 10.

Use the dotplot displayed in the tool to answer the following questions.

 

Part A: Which type of variable is cereal sodium content?

 

Part B: Describe the typical value of cereal sodium content.

 

Part C: How many observations of cereal sodium content are less than 100 milligrams (mg)? Approximately what are the values of these observations?

 

Part D: How many observations of cereal sodium content are between 200 and 300 mg (including 200 and 300)?

You’ll be using the tools frequently throughout this course, and detailed instructions will be provided for each of the first few times you do, so don’t worry if it doesn’t yet feel comfortable.

Let’s turn our focus now to another tool you’ll need to use soon, the random number generator.

Calculating a Sample Mean

In the upcoming course activity, you’ll need to take a sample from a population and ensure that the sampling method you use is unbiased. We’ll learn more about what that really means later. For now, we’ll focus on using the random number generator to select a random sample, then calculate the mean of that sample.

Random Sample

The following table (continued on the next page) displays the number of drivers involved in fatal collisions per billion miles for each of the 50 states.

Ordered Number State Number of Drivers Ordered Number State Number of Drivers
1 Alabama 18.8 26 Montana 21.4
2 Alaska 18.1 27 Nebraska 14.9
3 Arizona 18.6 28 Nevada 14.7
4 Arkansas 22.4 29 New Hampshire 11.6
5 California 12.0 30 New Jersey 11.2
6 Colorado 13.6 31 New Mexico 18.4
7 Connecticut 10.8 32 New York 12.3
8 Delaware 16.2 33 North Carolina 16.8
9 Florida 17.9 34 North Dakota 23.9
10 Georgia 15.6 35 Ohio 14.1
11 Hawaii 17.5 36 Oklahoma 19.9
12 Idaho 15.3 37 Oregon 12.8
13 Illinois 12.8 38 Pennsylvania 18.2
14 Indiana 14.5 39 Rhode Island 11.1
15 Iowa 15.7 40 South Carolina 23.9
16 Kansas 17.8 41 South Dakota 19.4
17 Kentucky 21.4 42 Tennessee 19.5
18 Louisiana 20.5 43 Texas 19.4
19 Maine 15.1 44 Utah 11.3
20 Maryland 12.5 45 Vermont 13.6
21 Massachusetts 8.2 46 Virginia 12.7
22 Michigan 14.1 47 Washington 10.6
23 Minnesota 9.6 48 West Virginia 23.8
24 Mississippi 17.6 49 Wisconsin 13.8
25 Missouri 16.1 50 Wyoming 17.4

We would like to select a random sample of six states from this list. We will select this sample “without replacement,” meaning that we cannot select the same state twice. The word random, used statistically, means that each individual in the population has the same chance of being selected in the sample.

Question 2

Describe how you could use cards to select a random sample of six states, without replacement. In your description, include the number of cards you would need, what you would write on each card, and how you would select your sample.

Now, instead of using cards, you are going to use a random number generator to select a random sample of six states. You’ll use the tool to do this. If you work with a partner, use the sample you generate together in Question 3 to answer Question 4.

question 3

Go to the Generate Random Numbers tool at https://dcmathpathways.shinyapps.io/RandomNumbers/.

Step 1) Select the Random Numbers tab.

Step 2) Under “Choose Minimum,” select “1.”

Step 3) Under “Choose Maximum,” select “50.”

Step 4) Under “How many numbers do you want to generate,” select “6.”

Step 5) Under “Sample with Replacement,” select “No.”

Step 6) Click “Generate.” This will generate six random numbers between 1 and 50. These six numbers correspond to the states chosen for your sample (locate the number next to its corresponding state name in the data list above). Fill in the following table with the corresponding state and number of drivers involved in fatal collisions per billion miles for each of your randomly-generated numbers.

Randomly

Generated Number

State Number of Drivers

Involved in Fatal Collisions

per Billion Miles

Sample Mean

To understand what the typical number of drivers involved in fatal collisions might be, we can calculate the mean or average of the values. The mean is calculated by adding the values and then dividing the total by the number of values in the dataset. When we calculate the mean of a random sample, we call this the sample mean.

Recall

Core skill:

See the example below, then answer Question 4 using the random sample you generated from the data table during Question 3.

example

Suppose a random sample of states was taken from the list above: Minnesota, Nevada, West Virginia, Tennessee, Alaska, Indiana.

To calculate the mean number of drivers involved in fatal collisions per billion miles for this sample, we take the sum of the number of drivers per state in the sample, then divide by the total number of states in sample.

  1. What are the number of drivers involved in fatal collisions per billions associated with each state in the sample? Look these up in the data table.

  2. Calculate the sample mean.

Now you try it.

question 4

Calculate the sample mean (i.e., average) number of drivers involved in fatal collisions per billion miles for the six states in your randomly-selected sample.

question 5

If you were to use the random number generator to generate another simple random sample of six states, would you get the same six numbers you found in Question 3? Would you get the same value for the sample mean number of drivers involved in fatal collisions per billion miles you found in Question 4? Explain.

You’ve had an introduction to the analysis tool, learned about dotplots, had a chance to refresh your skills at computing the mean, and learned how to use the random number generator in order to choose a random sample from a dataset. Hopefully, you are feeling secure with these new abilities and are ready to move on to the next section and activity.