{"id":5493,"date":"2022-09-14T15:52:33","date_gmt":"2022-09-14T15:52:33","guid":{"rendered":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/?post_type=chapter&#038;p=5493"},"modified":"2022-10-05T12:40:39","modified_gmt":"2022-10-05T12:40:39","slug":"15c-coreq","status":"publish","type":"chapter","link":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/chapter\/15c-coreq\/","title":{"raw":"15C Coreq","rendered":"15C Coreq"},"content":{"raw":"<div id=\"bp-page-1\" class=\"page\" data-page-number=\"1\" data-loaded=\"true\">\r\n<div class=\"textLayer\">In the next preview assignment and in the next class, you will need to compute relative frequencies and find expected counts based on certain proportions.<\/div>\r\n<div class=\"textLayer\">Flight Frequencies<\/div>\r\n<div class=\"textLayer\">In the next in-class activity, we will be comparing distributions of a categorical variable for multiple populations. In particular, we will be looking at whether different airlines have the same distribution of flight status. Our categorical variable will be flight status, and the populations we are comparing are the flights for different airlines. The values of our categorical variable are On-Time, Delayed, Canceled, and Diverted. The following table is a two-way table(also called a contingency table), and it gives the counts for each value of the variable flight status for Delta Airlines and Southwest Airlines arrivals at the Atlanta airport in March 2021.[footnote]U.S.Department of Transportation,Bureau of Transportation Statistics. (n.d.).On-time performance -Reporting operating carrier flight delays at a glance.https:\/\/www.transtats.bts.gov\/HomeDrillChart_Month.asp?5ry_lrn4=FDFD&amp;N44_Qry=E&amp;5ry_Pn44vr4=DDD&amp;5ry_Nv42146=DDD&amp;heY_fryrp6lrn4=FDFE&amp;heY_fryrp6Z106u=F[\/footnote] Notice that each row gives the distribution of flight status for an individual airline.<\/div>\r\n<div>\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td><\/td>\r\n<td><strong>On-Time Flights<\/strong><\/td>\r\n<td><strong>Delayed Flights<\/strong><\/td>\r\n<td><strong>Canceled Flights<\/strong><\/td>\r\n<td><strong>Diverted Flights<\/strong><\/td>\r\n<td><strong>Total<\/strong><\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong>Delta Airlines<\/strong><\/td>\r\n<td>12,716<\/td>\r\n<td>904<\/td>\r\n<td>23<\/td>\r\n<td>8<\/td>\r\n<td>13,651<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong>Southwest Airlines<\/strong><\/td>\r\n<td>2,240<\/td>\r\n<td>299<\/td>\r\n<td>22<\/td>\r\n<td>1<\/td>\r\n<td>2,562<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<div class=\"textLayer\">For Delta Airlines, we can find the relative frequency of on-time flights, or the proportion of on-time flights, by looking at the ratio of on-time flights to the total number of flights: [latex]\\frac{12,716}{13,651}\u22480.9315[\/latex] So, about 93.15% of Delta Airlines\u2019 arriving flights in Atlanta in March 2021 were on time. We can performa similar computation for each value of the variable flight status to obtain the relative frequency distribution of flight status for Delta Airlines in terms of percentages, as shown in the following table.<\/div>\r\n<\/div>\r\n<div id=\"bp-page-2\" class=\"page\" data-page-number=\"2\" data-loaded=\"true\">\r\n<div class=\"textLayer\">\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td><\/td>\r\n<td><strong>On-Time Percentage Flights<\/strong><\/td>\r\n<td><strong>Delayed Percentage Flights<\/strong><\/td>\r\n<td><strong>Canceled Percentage Flights<\/strong><\/td>\r\n<td><strong>Diverted Percentage Flights<\/strong><\/td>\r\n<td><strong>Total<\/strong><\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong>Delta Airlines<\/strong><\/td>\r\n<td>93.15%<\/td>\r\n<td>6.62%<\/td>\r\n<td>0.17%<\/td>\r\n<td>0.06%<\/td>\r\n<td>100%<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 1<\/h3>\r\n1) Complete thefollowingtable for the relative frequency distribution of flight statusfor Southwest Airlines by percentage. Round to the nearest hundredth.\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td><\/td>\r\n<td><strong>On-Time Percentage Flights<\/strong><\/td>\r\n<td><strong>Delayed Percentage Flights<\/strong><\/td>\r\n<td><strong>Canceled Percentage Flights<\/strong><\/td>\r\n<td><strong>Diverted Percentage Flights<\/strong><\/td>\r\n<td><strong>Total<\/strong><\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong>Southwest Airlines<\/strong><\/td>\r\n<td><\/td>\r\n<td><\/td>\r\n<td><\/td>\r\n<td><\/td>\r\n<td>100%<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\nWe can also consider the total number of flights for each value of the variable flight statusand look at the overall relative frequency for each. For example, there were 12,716+2,240=14,956on-time flights in total for both airlines. The total number of flights overall for both airlines was13,651+2,562=16,213flights. Then,the overall relative frequency for on-time flights was: [latex]\\frac{14,956}{16,213}\u22480.92246962[\/latex] So, about 92.25% of flights were on time for both airlines combined. In this case, we will keep more decimal places to avoid rounding errors in our next computations.\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 2<\/h3>\r\n2) Find the overall relative frequencyexpressed as a percentagefor each value of the variable flight statusand complete the following table.\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td><\/td>\r\n<td><strong>On-Time Percentage Flights<\/strong><\/td>\r\n<td><strong>Delayed Percentage Flights<\/strong><\/td>\r\n<td><strong>Canceled Percentage Flights<\/strong><\/td>\r\n<td><strong>Diverted Percentage Flights<\/strong><\/td>\r\n<td><strong>Total<\/strong><\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong>Overall<\/strong><\/td>\r\n<td>92.246962%<\/td>\r\n<td><\/td>\r\n<td><\/td>\r\n<td><\/td>\r\n<td>100%<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<\/div>\r\n<div class=\"textLayer\">Now, let\u2019s imagine that Delta Airlines and Southwest Airlines both had flight status proportions that matched the overall proportions. For example, Delta Airlines would have 92.246962% of its flights arrive on time. Since Delta Airlines had 13,651 flights arrive in Atlanta in total in March 2021, 92.246962% of 13,651=0.92246962\u00d713,651\u224812,592.633is the number of flights that we would expect to be on time. This is called the expected count of on-time flights if Delta Airlines\u2019 distribution matched the overall proportions. Similarly, for Southwest Airlines, we would expect to have 0.92246962\u00d72,562\u22482,363.367 on-time flights if its distribution matched the overall proportions. Notice that these expected counts do not have to be whole numbers because they are theoretical values. Notice also that there were 14,956 total on-time flights for these two airlines in March 2021, so once we knew that Delta Airlines would be expected to have 12,592.633 on-time flights if its distribution matched the overall proportions, we could have found the expected number of on-time flights for Southwest Airlines by subtracting:<\/div>\r\n<div class=\"textLayer\">14,956\u221212,592.633=2,363.367<\/div>\r\n<div class=\"textLayer\">We see that we get the same expected count as we did when we used the percentage.<\/div>\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 3<\/h3>\r\n3) Complete the following table of expected counts for flights if both airlines had flight status distributions that matched the overall proportions.\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td><\/td>\r\n<td><strong>Expected On-Time Flights<\/strong><\/td>\r\n<td><strong>Expected Delayed Flights<\/strong><\/td>\r\n<td><strong>Expected Canceled Flights<\/strong><\/td>\r\n<td><strong>Expected Diverted Flights<\/strong><\/td>\r\n<td><strong>Total Flights<\/strong><\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong>Delta Airlines<\/strong><\/td>\r\n<td>12,592.633<\/td>\r\n<td><\/td>\r\n<td><\/td>\r\n<td><\/td>\r\n<td>13,651<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong>Southwest Airlines<\/strong><\/td>\r\n<td>2,363.367<\/td>\r\n<td><\/td>\r\n<td><\/td>\r\n<td><\/td>\r\n<td>2,562<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong>Total Flights<\/strong><\/td>\r\n<td>14,956<\/td>\r\n<td>1,203<\/td>\r\n<td>45<\/td>\r\n<td>9<\/td>\r\n<td>16,213<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<\/div>\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 4<\/h3>\r\n<div class=\"textLayer\">4)As we\u2019ve seen in previous in-class activities, we can compare the observed and expected counts by calculating the difference between the observed count and the expected count. (So, observed\u2013expected for each cell of the table.)<\/div>\r\n<div class=\"textLayer\">Part A: Notice that for Delta Airlines\u2019 on-time flights, the difference is 12,716\u221212,592.633=123.367, so Delta Airlineshad 123.367 more on-time flights than would be expected if DeltaAirlines\u2019distribution matched the overall proportions. Find and interpret the differencefor SouthwestAirlines\u2019delayed flights.<\/div>\r\n<div class=\"textLayer\">Part B:When the difference between an observed count and thecorrespondingexpected countis negative, itmeans the expected countwas larger than the observed count, so there were fewer observed values than expected. Find and interpret the differencefor SouthwestAirlines\u2019on-time flights.<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>","rendered":"<div id=\"bp-page-1\" class=\"page\" data-page-number=\"1\" data-loaded=\"true\">\n<div class=\"textLayer\">In the next preview assignment and in the next class, you will need to compute relative frequencies and find expected counts based on certain proportions.<\/div>\n<div class=\"textLayer\">Flight Frequencies<\/div>\n<div class=\"textLayer\">In the next in-class activity, we will be comparing distributions of a categorical variable for multiple populations. In particular, we will be looking at whether different airlines have the same distribution of flight status. Our categorical variable will be flight status, and the populations we are comparing are the flights for different airlines. The values of our categorical variable are On-Time, Delayed, Canceled, and Diverted. The following table is a two-way table(also called a contingency table), and it gives the counts for each value of the variable flight status for Delta Airlines and Southwest Airlines arrivals at the Atlanta airport in March 2021.<a class=\"footnote\" title=\"U.S.Department of Transportation,Bureau of Transportation Statistics. (n.d.).On-time performance -Reporting operating carrier flight delays at a glance.https:\/\/www.transtats.bts.gov\/HomeDrillChart_Month.asp?5ry_lrn4=FDFD&amp;N44_Qry=E&amp;5ry_Pn44vr4=DDD&amp;5ry_Nv42146=DDD&amp;heY_fryrp6lrn4=FDFE&amp;heY_fryrp6Z106u=F\" id=\"return-footnote-5493-1\" href=\"#footnote-5493-1\" aria-label=\"Footnote 1\"><sup class=\"footnote\">[1]<\/sup><\/a> Notice that each row gives the distribution of flight status for an individual airline.<\/div>\n<div>\n<table>\n<tbody>\n<tr>\n<td><\/td>\n<td><strong>On-Time Flights<\/strong><\/td>\n<td><strong>Delayed Flights<\/strong><\/td>\n<td><strong>Canceled Flights<\/strong><\/td>\n<td><strong>Diverted Flights<\/strong><\/td>\n<td><strong>Total<\/strong><\/td>\n<\/tr>\n<tr>\n<td><strong>Delta Airlines<\/strong><\/td>\n<td>12,716<\/td>\n<td>904<\/td>\n<td>23<\/td>\n<td>8<\/td>\n<td>13,651<\/td>\n<\/tr>\n<tr>\n<td><strong>Southwest Airlines<\/strong><\/td>\n<td>2,240<\/td>\n<td>299<\/td>\n<td>22<\/td>\n<td>1<\/td>\n<td>2,562<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<div class=\"textLayer\">For Delta Airlines, we can find the relative frequency of on-time flights, or the proportion of on-time flights, by looking at the ratio of on-time flights to the total number of flights: [latex]\\frac{12,716}{13,651}\u22480.9315[\/latex] So, about 93.15% of Delta Airlines\u2019 arriving flights in Atlanta in March 2021 were on time. We can performa similar computation for each value of the variable flight status to obtain the relative frequency distribution of flight status for Delta Airlines in terms of percentages, as shown in the following table.<\/div>\n<\/div>\n<div id=\"bp-page-2\" class=\"page\" data-page-number=\"2\" data-loaded=\"true\">\n<div class=\"textLayer\">\n<table>\n<tbody>\n<tr>\n<td><\/td>\n<td><strong>On-Time Percentage Flights<\/strong><\/td>\n<td><strong>Delayed Percentage Flights<\/strong><\/td>\n<td><strong>Canceled Percentage Flights<\/strong><\/td>\n<td><strong>Diverted Percentage Flights<\/strong><\/td>\n<td><strong>Total<\/strong><\/td>\n<\/tr>\n<tr>\n<td><strong>Delta Airlines<\/strong><\/td>\n<td>93.15%<\/td>\n<td>6.62%<\/td>\n<td>0.17%<\/td>\n<td>0.06%<\/td>\n<td>100%<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 1<\/h3>\n<p>1) Complete thefollowingtable for the relative frequency distribution of flight statusfor Southwest Airlines by percentage. Round to the nearest hundredth.<\/p>\n<table>\n<tbody>\n<tr>\n<td><\/td>\n<td><strong>On-Time Percentage Flights<\/strong><\/td>\n<td><strong>Delayed Percentage Flights<\/strong><\/td>\n<td><strong>Canceled Percentage Flights<\/strong><\/td>\n<td><strong>Diverted Percentage Flights<\/strong><\/td>\n<td><strong>Total<\/strong><\/td>\n<\/tr>\n<tr>\n<td><strong>Southwest Airlines<\/strong><\/td>\n<td><\/td>\n<td><\/td>\n<td><\/td>\n<td><\/td>\n<td>100%<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>We can also consider the total number of flights for each value of the variable flight statusand look at the overall relative frequency for each. For example, there were 12,716+2,240=14,956on-time flights in total for both airlines. The total number of flights overall for both airlines was13,651+2,562=16,213flights. Then,the overall relative frequency for on-time flights was: [latex]\\frac{14,956}{16,213}\u22480.92246962[\/latex] So, about 92.25% of flights were on time for both airlines combined. In this case, we will keep more decimal places to avoid rounding errors in our next computations.<\/p>\n<\/div>\n<\/div>\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 2<\/h3>\n<p>2) Find the overall relative frequencyexpressed as a percentagefor each value of the variable flight statusand complete the following table.<\/p>\n<table>\n<tbody>\n<tr>\n<td><\/td>\n<td><strong>On-Time Percentage Flights<\/strong><\/td>\n<td><strong>Delayed Percentage Flights<\/strong><\/td>\n<td><strong>Canceled Percentage Flights<\/strong><\/td>\n<td><strong>Diverted Percentage Flights<\/strong><\/td>\n<td><strong>Total<\/strong><\/td>\n<\/tr>\n<tr>\n<td><strong>Overall<\/strong><\/td>\n<td>92.246962%<\/td>\n<td><\/td>\n<td><\/td>\n<td><\/td>\n<td>100%<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<div class=\"textLayer\">Now, let\u2019s imagine that Delta Airlines and Southwest Airlines both had flight status proportions that matched the overall proportions. For example, Delta Airlines would have 92.246962% of its flights arrive on time. Since Delta Airlines had 13,651 flights arrive in Atlanta in total in March 2021, 92.246962% of 13,651=0.92246962\u00d713,651\u224812,592.633is the number of flights that we would expect to be on time. This is called the expected count of on-time flights if Delta Airlines\u2019 distribution matched the overall proportions. Similarly, for Southwest Airlines, we would expect to have 0.92246962\u00d72,562\u22482,363.367 on-time flights if its distribution matched the overall proportions. Notice that these expected counts do not have to be whole numbers because they are theoretical values. Notice also that there were 14,956 total on-time flights for these two airlines in March 2021, so once we knew that Delta Airlines would be expected to have 12,592.633 on-time flights if its distribution matched the overall proportions, we could have found the expected number of on-time flights for Southwest Airlines by subtracting:<\/div>\n<div class=\"textLayer\">14,956\u221212,592.633=2,363.367<\/div>\n<div class=\"textLayer\">We see that we get the same expected count as we did when we used the percentage.<\/div>\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 3<\/h3>\n<p>3) Complete the following table of expected counts for flights if both airlines had flight status distributions that matched the overall proportions.<\/p>\n<table>\n<tbody>\n<tr>\n<td><\/td>\n<td><strong>Expected On-Time Flights<\/strong><\/td>\n<td><strong>Expected Delayed Flights<\/strong><\/td>\n<td><strong>Expected Canceled Flights<\/strong><\/td>\n<td><strong>Expected Diverted Flights<\/strong><\/td>\n<td><strong>Total Flights<\/strong><\/td>\n<\/tr>\n<tr>\n<td><strong>Delta Airlines<\/strong><\/td>\n<td>12,592.633<\/td>\n<td><\/td>\n<td><\/td>\n<td><\/td>\n<td>13,651<\/td>\n<\/tr>\n<tr>\n<td><strong>Southwest Airlines<\/strong><\/td>\n<td>2,363.367<\/td>\n<td><\/td>\n<td><\/td>\n<td><\/td>\n<td>2,562<\/td>\n<\/tr>\n<tr>\n<td><strong>Total Flights<\/strong><\/td>\n<td>14,956<\/td>\n<td>1,203<\/td>\n<td>45<\/td>\n<td>9<\/td>\n<td>16,213<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 4<\/h3>\n<div class=\"textLayer\">4)As we\u2019ve seen in previous in-class activities, we can compare the observed and expected counts by calculating the difference between the observed count and the expected count. (So, observed\u2013expected for each cell of the table.)<\/div>\n<div class=\"textLayer\">Part A: Notice that for Delta Airlines\u2019 on-time flights, the difference is 12,716\u221212,592.633=123.367, so Delta Airlineshad 123.367 more on-time flights than would be expected if DeltaAirlines\u2019distribution matched the overall proportions. Find and interpret the differencefor SouthwestAirlines\u2019delayed flights.<\/div>\n<div class=\"textLayer\">Part B:When the difference between an observed count and thecorrespondingexpected countis negative, itmeans the expected countwas larger than the observed count, so there were fewer observed values than expected. Find and interpret the differencefor SouthwestAirlines\u2019on-time flights.<\/div>\n<\/div>\n<\/div>\n<\/div>\n<hr class=\"before-footnotes clear\" \/><div class=\"footnotes\"><ol><li id=\"footnote-5493-1\">U.S.Department of Transportation,Bureau of Transportation Statistics. (n.d.).On-time performance -Reporting operating carrier flight delays at a glance.https:\/\/www.transtats.bts.gov\/HomeDrillChart_Month.asp?5ry_lrn4=FDFD&amp;N44_Qry=E&amp;5ry_Pn44vr4=DDD&amp;5ry_Nv42146=DDD&amp;heY_fryrp6lrn4=FDFE&amp;heY_fryrp6Z106u=F <a href=\"#return-footnote-5493-1\" class=\"return-footnote\" aria-label=\"Return to footnote 1\">&crarr;<\/a><\/li><\/ol><\/div>","protected":false},"author":23592,"menu_order":7,"template":"","meta":{"_candela_citation":"[]","CANDELA_OUTCOMES_GUID":"","pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-5493","chapter","type-chapter","status-publish","hentry"],"part":5479,"_links":{"self":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters\/5493","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/users\/23592"}],"version-history":[{"count":4,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters\/5493\/revisions"}],"predecessor-version":[{"id":5620,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters\/5493\/revisions\/5620"}],"part":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/parts\/5479"}],"metadata":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters\/5493\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/media?parent=5493"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapter-type?post=5493"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/contributor?post=5493"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/license?post=5493"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}