Calculating a Sample Mean
In the upcoming course activity, you’ll need to take a sample from a population and ensure that the sampling method you use is unbiased. We’ll learn more about what that really means later. For now, we’ll focus on using the random number generator to select a random sample, then calculate the mean of that sample.
Random Sample
The following table (continued on the next page) displays the number of drivers involved in fatal collisions per billion miles for each of the 50 states.
| Ordered Number | State | Number of Drivers | Ordered Number | State | Number of Drivers | |
| 1 | Alabama | 18.8 | 26 | Montana | 21.4 | |
| 2 | Alaska | 18.1 | 27 | Nebraska | 14.9 | |
| 3 | Arizona | 18.6 | 28 | Nevada | 14.7 | |
| 4 | Arkansas | 22.4 | 29 | New Hampshire | 11.6 | |
| 5 | California | 12.0 | 30 | New Jersey | 11.2 | |
| 6 | Colorado | 13.6 | 31 | New Mexico | 18.4 | |
| 7 | Connecticut | 10.8 | 32 | New York | 12.3 | |
| 8 | Delaware | 16.2 | 33 | North Carolina | 16.8 | |
| 9 | Florida | 17.9 | 34 | North Dakota | 23.9 | |
| 10 | Georgia | 15.6 | 35 | Ohio | 14.1 | |
| 11 | Hawaii | 17.5 | 36 | Oklahoma | 19.9 | |
| 12 | Idaho | 15.3 | 37 | Oregon | 12.8 | |
| 13 | Illinois | 12.8 | 38 | Pennsylvania | 18.2 | |
| 14 | Indiana | 14.5 | 39 | Rhode Island | 11.1 | |
| 15 | Iowa | 15.7 | 40 | South Carolina | 23.9 | |
| 16 | Kansas | 17.8 | 41 | South Dakota | 19.4 | |
| 17 | Kentucky | 21.4 | 42 | Tennessee | 19.5 | |
| 18 | Louisiana | 20.5 | 43 | Texas | 19.4 | |
| 19 | Maine | 15.1 | 44 | Utah | 11.3 | |
| 20 | Maryland | 12.5 | 45 | Vermont | 13.6 | |
| 21 | Massachusetts | 8.2 | 46 | Virginia | 12.7 | |
| 22 | Michigan | 14.1 | 47 | Washington | 10.6 | |
| 23 | Minnesota | 9.6 | 48 | West Virginia | 23.8 | |
| 24 | Mississippi | 17.6 | 49 | Wisconsin | 13.8 | |
| 25 | Missouri | 16.1 | 50 | Wyoming | 17.4 |
We would like to select a random sample of six states from this list. We will select this sample “without replacement,” meaning that we cannot select the same state twice. The word random, used statistically, means that each individual in the population has the same chance of being selected in the sample.
Question 2
Describe how you could use cards to select a random sample of six states, without replacement. In your description, include the number of cards you would need, what you would write on each card, and how you would select your sample.
Now, instead of using cards, you are going to use a random number generator to select a random sample of six states. You’ll use the tool to do this. If you work with a partner, use the sample you generate together in Question 3 to answer Question 4.
question 3
Go to the Generate Random Numbers tool at https://dcmathpathways.shinyapps.io/RandomNumbers/.
Step 1) Select the Random Numbers tab.
Step 2) Under “Choose Minimum,” select “1.”
Step 3) Under “Choose Maximum,” select “50.”
Step 4) Under “How many numbers do you want to generate,” select “6.”
Step 5) Under “Sample with Replacement,” select “No.”
Step 6) Click “Generate.” This will generate six random numbers between 1 and 50. These six numbers correspond to the states chosen for your sample (locate the number next to its corresponding state name in the data list above). Fill in the following table with the corresponding state and number of drivers involved in fatal collisions per billion miles for each of your randomly-generated numbers.
| Randomly
Generated Number |
State | Number of Drivers
Involved in Fatal Collisions per Billion Miles |
Sample Mean
To understand what the typical number of drivers involved in fatal collisions might be, we can calculate the mean or average of the values. The mean is calculated by adding the values and then dividing the total by the number of values in the dataset. When we calculate the mean of a random sample, we call this the sample mean.
Recall
Core skill:
See the example below, then answer Question 4 using the random sample you generated from the data table during Question 3.
example
Suppose a random sample of states was taken from the list above: Minnesota, Nevada, West Virginia, Tennessee, Alaska, Indiana.
To calculate the mean number of drivers involved in fatal collisions per billion miles for this sample, we take the sum of the number of drivers per state in the sample, then divide by the total number of states in sample.
- What are the number of drivers involved in fatal collisions per billions associated with each state in the sample? Look these up in the data table.
Show Answer
- Calculate the sample mean.
Show Answer
Now you try it.
question 4
Calculate the sample mean (i.e., average) number of drivers involved in fatal collisions per billion miles for the six states in your randomly-selected sample.
question 5
If you were to use the random number generator to generate another simple random sample of six states, would you get the same six numbers you found in Question 3? Would you get the same value for the sample mean number of drivers involved in fatal collisions per billion miles you found in Question 4? Explain.
You’ve had an introduction to the analysis tool, learned about dotplots, had a chance to refresh your skills at computing the mean, and learned how to use the random number generator in order to choose a random sample from a dataset. Hopefully, you are feeling secure with these new abilities and are ready to move on to the next section and activity.