15C Coreq

In the next preview assignment and in the next class, you will need to compute relative frequencies and find expected counts based on certain proportions.
Flight Frequencies
In the next in-class activity, we will be comparing distributions of a categorical variable for multiple populations. In particular, we will be looking at whether different airlines have the same distribution of flight status. Our categorical variable will be flight status, and the populations we are comparing are the flights for different airlines. The values of our categorical variable are On-Time, Delayed, Canceled, and Diverted. The following table is a two-way table(also called a contingency table), and it gives the counts for each value of the variable flight status for Delta Airlines and Southwest Airlines arrivals at the Atlanta airport in March 2021.[1] Notice that each row gives the distribution of flight status for an individual airline.
On-Time Flights Delayed Flights Canceled Flights Diverted Flights Total
Delta Airlines 12,716 904 23 8 13,651
Southwest Airlines 2,240 299 22 1 2,562
For Delta Airlines, we can find the relative frequency of on-time flights, or the proportion of on-time flights, by looking at the ratio of on-time flights to the total number of flights: [latex]\frac{12,716}{13,651}≈0.9315[/latex] So, about 93.15% of Delta Airlines’ arriving flights in Atlanta in March 2021 were on time. We can performa similar computation for each value of the variable flight status to obtain the relative frequency distribution of flight status for Delta Airlines in terms of percentages, as shown in the following table.
On-Time Percentage Flights Delayed Percentage Flights Canceled Percentage Flights Diverted Percentage Flights Total
Delta Airlines 93.15% 6.62% 0.17% 0.06% 100%

Question 1

1) Complete thefollowingtable for the relative frequency distribution of flight statusfor Southwest Airlines by percentage. Round to the nearest hundredth.

On-Time Percentage Flights Delayed Percentage Flights Canceled Percentage Flights Diverted Percentage Flights Total
Southwest Airlines 100%

We can also consider the total number of flights for each value of the variable flight statusand look at the overall relative frequency for each. For example, there were 12,716+2,240=14,956on-time flights in total for both airlines. The total number of flights overall for both airlines was13,651+2,562=16,213flights. Then,the overall relative frequency for on-time flights was: [latex]\frac{14,956}{16,213}≈0.92246962[/latex] So, about 92.25% of flights were on time for both airlines combined. In this case, we will keep more decimal places to avoid rounding errors in our next computations.

Question 2

2) Find the overall relative frequencyexpressed as a percentagefor each value of the variable flight statusand complete the following table.

On-Time Percentage Flights Delayed Percentage Flights Canceled Percentage Flights Diverted Percentage Flights Total
Overall 92.246962% 100%
Now, let’s imagine that Delta Airlines and Southwest Airlines both had flight status proportions that matched the overall proportions. For example, Delta Airlines would have 92.246962% of its flights arrive on time. Since Delta Airlines had 13,651 flights arrive in Atlanta in total in March 2021, 92.246962% of 13,651=0.92246962×13,651≈12,592.633is the number of flights that we would expect to be on time. This is called the expected count of on-time flights if Delta Airlines’ distribution matched the overall proportions. Similarly, for Southwest Airlines, we would expect to have 0.92246962×2,562≈2,363.367 on-time flights if its distribution matched the overall proportions. Notice that these expected counts do not have to be whole numbers because they are theoretical values. Notice also that there were 14,956 total on-time flights for these two airlines in March 2021, so once we knew that Delta Airlines would be expected to have 12,592.633 on-time flights if its distribution matched the overall proportions, we could have found the expected number of on-time flights for Southwest Airlines by subtracting:
14,956−12,592.633=2,363.367
We see that we get the same expected count as we did when we used the percentage.

Question 3

3) Complete the following table of expected counts for flights if both airlines had flight status distributions that matched the overall proportions.

Expected On-Time Flights Expected Delayed Flights Expected Canceled Flights Expected Diverted Flights Total Flights
Delta Airlines 12,592.633 13,651
Southwest Airlines 2,363.367 2,562
Total Flights 14,956 1,203 45 9 16,213

Question 4

4)As we’ve seen in previous in-class activities, we can compare the observed and expected counts by calculating the difference between the observed count and the expected count. (So, observed–expected for each cell of the table.)
Part A: Notice that for Delta Airlines’ on-time flights, the difference is 12,716−12,592.633=123.367, so Delta Airlineshad 123.367 more on-time flights than would be expected if DeltaAirlines’distribution matched the overall proportions. Find and interpret the differencefor SouthwestAirlines’delayed flights.
Part B:When the difference between an observed count and thecorrespondingexpected countis negative, itmeans the expected countwas larger than the observed count, so there were fewer observed values than expected. Find and interpret the differencefor SouthwestAirlines’on-time flights.

  1. U.S.Department of Transportation,Bureau of Transportation Statistics. (n.d.).On-time performance -Reporting operating carrier flight delays at a glance.https://www.transtats.bts.gov/HomeDrillChart_Month.asp?5ry_lrn4=FDFD&N44_Qry=E&5ry_Pn44vr4=DDD&5ry_Nv42146=DDD&heY_fryrp6lrn4=FDFE&heY_fryrp6Z106u=F