Question 1
1) Suppose that we obtain enough evidence to reject the null hypothesis; that is, we decide that there is strong evidence to support the idea that the flight status distributions are not the same for at least one of the three airlines. At that point, what else might you want to know about the situation?
Question 2
Question 3
3) Let’sproceed with this chi-square test at a significance level of 0.01. Continueusing the data analysis tool. What is thevalue of thechi-square test statistic obtained from the test?
Question 4
4) What is the P-value obtained from the chi-square test? What does the P-valuerepresent and what does it tell you?
Question 5
5)What is the conclusion of our hypothesis test? State your conclusion in context. Even though we have already drawn a conclusion from our hypothesis test, there is still some information we can glean by looking at the difference between the observed count and the expected count for each cell.The data analysis toolcalls this difference the residualfor that cell (and the idea is similar to the concept of residuals you saw when looking at the differencesbetween observed values and predicted values in the linear regression context). Residuals are calculated using the formula:Residual=Observed−ExpectedSince the values in our cells may vary quite a bit, it’s a good idea to look atwhat the data analysis toolcallsstandardized residualsinstead.These are sometimes referred to as Standardized Pearson residuals.These are values that standardize the residuals so that if the null hypothesis is assumed to be true, they can be interpreted as normal z-scores. In particular, most standardized residuals for a given test will fall between −2 and 2. We can use these standardized residuals to determine how far off our observed countis from what was expected if the null hypothesis istrue (i.e., if the distributionsare really the same). The sign of the standardized residual tells us whether we observed more cases in that cell than weexpected (a positive residual) or fewer cases than we expected (a negative residual).
Question 6
Question 7
7) Discuss this result in terms of practical significance and statisticalsignificance. Can we safely conclude that any of our three airlines are doing muchbetter than the others?
- U.S. Department of Transportation, Bureau of Transportation Statistics. (n.d.). On-time performance -Reporting operating carrier flight delays at a glance. https://www.transtats.bts.gov/HomeDrillChart_Month.asp?5ry_lrn4=FDFD&N44_Qry=E&5ry_Pn44vr4=DDD&5ry_Nv42146=DDD&heY_fryrp6lrn4=FDFE&heY_fryrp6Z106u=F ↵