Interpreting the Mean and Median of a Dataset: Apply It 1

Is It Worth It?

Consider this scenario. A college basketball player is skilled enough to make an NBA roster and is thinking about dropping out of college this year.

Lots of hundred dollars bills in a fan shape held in front of someone

question 1

In this activity, you’ll use a distribution of professional basketball salaries to see how outliers influence different measures of center and how averages can be misleading.

recall

Before beginning this activity, take a moment to recall the meanings of the terms left-skewed, right-skewed, symmetric, and outlier. You’ll need to be able to use those terms to describe features of a data set.

Core skill:

video placement

[Intro: Starting from a sentence or two discussing Question 1, remind students that they have recently been working to calculate and interpret the mean and median of a data set. That is, the median is the value that splits the data in half, with half the observations above the mean and half below, regardless of the presence of skew or outliers. The median is fixed. But the mean is not; it gets pulled to the left or right of the mean under the presence of skew or outliers. The mean is sensitive to extreme values. So when we see that the mean is higher than the median, we say that it has been “pulled to the right,” and we understand the quantitative variable is skewed right. Likewise, if the mean is smaller, we’ll say it’s been “pulled to the left,” and we understand the quantitative variable is skewed left. If the mean and median are similar, though, we understand that the distribution is symmetric. In this activity, we’ll use a distribution of professional basketball salaries to explore how skew arises in a quantitative variable and why we must be careful to consider all the characteristics of a quantitative variable’s distribution before deciding if the mean or median would be more responsible to use as a measure of a “typical” value. ]

Below is a dotplot of NBA salaries[1] for Texas players in the 2017–2018 season:

A dotplot labeled "Texas Player Salaries ($)," numbered in increments of 5 million from 0 to 25 million. There are several high stacks of dots between 0 and 5 million. Above 5 million, there is only one stack and it has two dots. There are also several individual dots. One dot near 0 is labeled "Chris Johnson: $25,000." Another dot is labeled "Chris Paul: $24,599,495." One more dot is labeled "James Harden: $28,299,399."

question 2

question 3


  1.  NBA player salary data set (2017-2018). (2018) Kaggle. Retrieved from https://www.kaggle.com/koki25ando/salary