Forming Connections in Five Number Summary in Box Plots and Datasets: 4D – 27

objectives for this activity

During this activity, you will:

Click on a skill above to jump to its location in this activity.

video placement

[Intro: The dataset used in this activity comes from the independent Tax Policy Center, a non-partisan group. We’ll look at the Tax Policy Center’s distributional tables of tax cuts by income levels, which were constructed in February 2018, in order to apply public policy analysis to a tax reform bill signed into law in 2017. Specifically, we’ll use this situation to explore how boxplots can be used to compare distributions across several populations and draw inferences about them. As you read the two quotes at the start of the activity, consider the differences in them and ways in which you might investigate the accuracy of their claims.]

Good Tax Policy?

A calculator that has a display reading "tax" sitting on top of 100 dollar bills.

Read these two quotes about the U.S. 2017 Tax Reform bill:[1]

“We’re doing everything we can to reduce the tax burden on you and your family. By eliminating tax breaks and loopholes, we will ensure that the benefits are focused on the middle class, the working men and women, not the highest income earners. Our framework includes our explicit commitment that tax reform will protect low-income and middle-income households, not the wealthy and well connected.” – President Donald Trump in 2017, prior to the bill’s enactment[2]

“You remember just a few years ago when Trump and my Republican colleagues voted for almost $2 trillion in tax breaks for the wealthiest people in this country and the largest corporations.” – Bernie Sanders in 2021, after the bill was enacted[3]

question 1

Describe the differences in how these two politicians characterized the effects of the 2017 Tax Reform. How would you investigate whether these claims were accurate or misleading?

In this activity, you’ll see how to use a boxplot to provide a visual summary of a quantitative variable, and how boxplots can be used to compare the distributions of multiple populations. Since we’ll be using data from the Tax Policy Center to analyze the claims made in the two quotes above, let’s first investigate a statement the Center made about the average tax cut experienced from the law.

According to the independent Tax Policy Center, the average income tax cut as a result of the law was $1,260.[4] However, as we’ve seen before, averages can be misleading and uninformative. Two boxplots are provided below, each displaying a hypothetical distribution of 5,000 tax cuts that would result in a mean tax cut of $1,260.

Two box plots. The horizontal axis is labeled "Household Tax Costs ($ thousands) and is numbered in increments of one. The top is labeled A and has its lowest point at zero and its highest at approximately 2.3. The lower end of the box is at approximately 0.5 while the upper end is at approximately 2. The center line is at approximately 1.4. For plot B, the lowest point is at 0 and the highest point is at approximately 1.25. The low end of the box is at approximately 0.05 while the upper end is at approximately 0.5. The middle line is at approximately 0.1. Above the high point of plot B, there are lots of individual dots very close together all the way up the rest of the box plot.

Using Boxplots to Compare Multiple Distributions

question 2

According to Boxplot A, what percent of households received tax cuts of $500 or higher?

question 3

According to Boxplot B, what percent of households received tax cuts of $500 or higher?

Now, let’s compare these distributions.

question 4

Compare a measure of center within both distributions. What do you observe about the center in Boxplot A as compared to the center in Boxplot B?

question 5

Compare the IQR and total range of each distribution. What do you observe about the IQR and range in Boxplot A as compared to Boxplot B?

question 6

Compare the shapes of the distributions and explain how both distributions could have the same mean value.

question 7

Which distribution (A or B) supports the idea that the tax pretty equally benefitted a broad swath of the American population? Explain your reasoning.

question 8

Which distribution (A or B) supports the idea that the tax cuts mostly benefitted a smaller minority of households?

Drawing inferences from boxplots

Now let’s look at the data from the 2017 Tax Reform bill.

The data from the Tax Policy Center in the table below separates income groups into quintiles. Quintiles divide datasets into five parts, from the lowest 20% of values up to the highest 20% of values.

The Tax Policy Center released the following data about the actual tax cuts households experienced because of the 2017 Tax Reform bill.[5]

Household Tax Cuts
Income Group Mean Tax Cut
Lowest Quintile $40
Second Quintile $320
Middle Quintile $780
Fourth Quintile $1,480
Top Quintile $5,790
Top 1 Percent $32,650
Top 0.1 Percent $89,060

question 9

Based on the table above, do you believe the median tax cut is close to the median displayed in Boxplot A, close to the median in Boxplot B, or somewhere in between? Explain. Note: Your answer won’t be exact.

question 10

Where would you predict the maximum tax cut in the 2017 Tax Cut bill to be, relative to the maximum displayed in Boxplot B? Explain.

question 11

Based on all your previous analyses, critique the following statement: “Only the wealthy benefitted from the 2017 Tax Reform. The middle class didn’t benefit at all.”

question 12

Based on all your previous analyses, critique the following statement: “The typical American household paid $1,260 less in taxes because of the 2017 Tax Reform.”

video placement

Wrap up: “There are many perspectives to take and questions to answer when trying to sort out as complicated a situation as this one. We are only presented a small amount of information in this activity so we limited ourselves to questions we could answer using the data we had. It is of interest, though, to compare these tax cuts given in dollar amounts to their equivalencies in terms of percent of peoples’ incomes. [voice over the table given in the hidden text below]. The Tax Policy Center also provides this information. The top income groups received both the largest cuts in dollar amounts but also the largest percent growth in after-tax income as a result of the bill (the top 0.1 percent don’t follow this trend, however). But hopefully you’ve been able to understand from this activity how boxplots provide a quick glance, a summary, of the data to make comparisons based on median, skew, outliers, and percentiles. Let’s take a look at a few general five-number summaries to make sure you feel good about what you’ve learned here.” — Provide a few five-number summaries and ask students to sketch boxplots of them. Then show the answers for comparison. — “You’ve seen that boxplots provide visual summaries of quantitative variables that can be used to compare the distributions of multiple populations. Hopefully, you feel that you are now comfortable using boxplots to compare and draw inferences.”

 


  1. Data and lesson context adapted from Skew the Script, www.skewthescript.org
  2. Administration of Donald J. Trump. (2017, September 27). Remarks in Indianapolis, Indiana. Govinfo.gov. https://www.govinfo.gov/content/pkg/DCPD-201700693/html/DCPD-201700693.htm 
  3. Cooper, A. (Interviewer). (2021, January 29.). CNN.
  4. Tax Policy Center. (2018, February 16). T18-0025 - The Tax Cuts and Jobs Act (TCJA): All provisions and individual income tax provisions; distribution of federal tax change by expanded cash income percentile, 2018. htpps://www.taxpolicycenter.org/model-estimates/individual-income-tax-provisions-tax-cuts-and-jobs-act-tcja-february-2019/t18-0025
  5.  Tax Policy Center. (2018, February 16). T18-0025 - The Tax Cuts and Jobs Act (TCJA): All provisions and individual income tax provisions; distribution of federal tax change by expanded cash income percentile, 2018. https://www.taxpolicycenter.org/model-estimates/individual-income-tax-provisions-tax-cuts-and-jobs-act-tcja-february-2018/t18-0025