We saw that the median and the mean employee salaries for January were the same. What understanding can we take from that information?
Interactive Example
What can we understand about the median and mean employee salaries for January being the same? Fill in the blanks to answer the following questions.
- The median of the data set implies that ____________ made more than [latex]\$4,500[/latex] in January and _________ made less.
- The mean of the data set implies that if the January salaries had been added up and evenly distributed across all six employees, each person would have received ________________.
Show Answer
- Half the employees made more than [latex]\$4,500[/latex] and half made less.
- Each person would have received [latex]\$4,500[/latex]. That is, the average salary was [latex]\$4,500[/latex] for January.
It was interesting that the mean and the median were identical values. This tells us that the the salaries are evenly distributed among high and low values; the distribution is symmetrical, without skew. But what happens if we change one of the values in the data set? Let’s move on to questions 8 – 10 to find out.
What happens to the mean and median if we change one of the values in the data set?
Recall that the data set of employee salaries from February includes a big raise from one employee. First calculate the median of this set to answer Question 8 below, then consider how we might expect the mean of the February salaries compares to the mean of the January salaries.
Here is the February salary table again for convenience.
| Employee |
Monthly Salary in February
(in thousands of dollars)
|
| Employee 1 |
[latex]4[/latex] |
| Employee 2 |
[latex]8[/latex] |
| Employee 3 |
[latex]3[/latex] |
| Employee 4 |
[latex]5[/latex] |
| Employee 5 |
[latex]6[/latex] |
| Employee 6 |
[latex]3[/latex] |
Before you get the median and the mean from the technology, or before you calculate the mean by hand, first think about what you think will be true about the February mean compared to the January mean and why.
question 8
Hint
Remember that one of the salaries changed when an employee received a big raise in February.
question 9
Hint
What do you think? Consider what changed in the set of salaries from January to February. Is the total higher or lower than it was in January?
question 10
Hint
See the processes listed above.
Interactive example
Was the mean you calculated for February salaries higher, lower, or similar? What do you think caused that to be true? Click below for a discussion after you enter your answers to Questions 8 – 10.
Here are a couple of good questions to ask about it.
The mean is now higher than the median. They were identical in January.
- Did the increase in one salary cause the mean to rise?
- Would that always happen if a data value increases?
- How could we predict mathematically how much the mean would increase under the increase of a single value?
- We could predict the increase in mean mathematically by taking the difference in the January salary and the February salary then distributing that difference out evenly among the employees.
- Ex. One salary increased by [latex]$2,000[/latex]. If we divide the [latex]$2,000[/latex] across all six employees, we’ll have the amount by which the new mean is higher.
- [latex]\dfrac{$2,000}{6}=\$333.33[/latex]
- For January, [latex]\bar{x}=$4,500[/latex] and for February, [latex]\bar{x}=$4,833.33[/latex]. The mean increased by $333.33.
But why did the median stay the same? Would the median always be roughly the same if a data value changes?
- If the middle-most number or two numbers didn’t change, the median won’t change.
- What would happen though, if instead of Employee 2 receiving the raise, Employee 1 had received it instead? What would the new median be?
- The January median of the data set [latex]3, 3, 4, 5, 6, 6[/latex] is the mean of [latex]4[/latex] and [latex]5[/latex] in thousands of dollars, or [latex]\$4,500[/latex]
- Changing one of the salaries from [latex]6[/latex] thousand to [latex]8[/latex] thousand didn’t affect the middle two numbers.
- But changing the [latex]4[/latex] to an [latex]8[/latex] would require the reordering of the values.
- [latex]3, 3, 5, 6, 6, 8[/latex] now yields a median of [latex]5.5[/latex]
Now let’s consider a slightly different question.
question 11
Hint
Would there still be six data values (employee salary) in the set?
It may take some time before you really feel comfortable interpreting means and medians and understanding what they imply about a data set. A key idea to take from this activity is that, while the median stays relatively fixed in a data set if one value changes by a large amount, the mean does not. This tells us that the mean is sensitive to the presence of extreme values in the data set.