{"id":287,"date":"2021-07-14T15:59:07","date_gmt":"2021-07-14T15:59:07","guid":{"rendered":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/chapter\/two-population-means-with-unknown-standard-deviations\/"},"modified":"2023-12-05T09:38:01","modified_gmt":"2023-12-05T09:38:01","slug":"two-population-means-with-unknown-standard-deviations","status":"publish","type":"chapter","link":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/chapter\/two-population-means-with-unknown-standard-deviations\/","title":{"raw":"Testing for Two Population Means","rendered":"Testing for Two Population Means"},"content":{"raw":"<div class=\"textbox learning-objectives\">\r\n<h3>Learning Outcomes<\/h3>\r\n<section>\r\n<ul>\r\n \t<li>Conduct a hypothesis test for a difference in two population means with unknown standard deviations and interpret the conclusion in context<\/li>\r\n<\/ul>\r\n<\/section><\/div>\r\n<ol>\r\n \t<li>The two independent samples are simple random samples from two distinct populations.<\/li>\r\n \t<li>For the two distinct populations:\r\n<ul>\r\n \t<li>if the sample sizes are small, the distributions are important (should be normal)<\/li>\r\n \t<li>if the sample sizes are large, the distributions are not important (need not be normal)<\/li>\r\n<\/ul>\r\n<\/li>\r\n<\/ol>\r\n<strong>Note:<\/strong> The test comparing two independent population means with unknown and possibly unequal population standard deviations is called the Aspin-Welch t-test. The degrees of freedom formula was developed by Aspin-Welch.\r\n\r\nThe comparison of two population means is very common. A difference between the two samples depends on both the means and the standard deviations. Very different means can occur by chance if there is great variation among the individual samples. In order to account for the variation, we take the difference of the sample means, [latex]\\displaystyle\\overline{{X}}_{{1}}-\\overline{{X}}_{{2}} [\/latex], and divide by the standard error in order to standardize the difference. The result is a t-score test statistic.\r\n\r\nBecause we do not know the population standard deviations, we estimate them using the two-sample standard deviations from our independent samples. For the hypothesis test, we calculate the estimated standard deviation, or <strong>standard error<\/strong>, of <strong>the difference in sample means, <\/strong>[latex]\\displaystyle\\overline{{X}}_{{1}}-\\overline{{X}}_{{2}} [\/latex].\r\n\r\nThe standard error is: [latex]\\displaystyle\\sqrt{\\frac{(s_1)^2}{n_1}+\\frac{(s_2)^2}{n_2}} [\/latex]\r\n<div class=\"textbox examples\">\r\n<h3>Recall: Order of Operations<\/h3>\r\n<div class=\"textbox shaded\">\r\n\r\nWhen simplifying mathematical expressions perform the operations in the following order:\r\n1. <strong>P<\/strong>arentheses and other Grouping Symbols\r\n<ul id=\"fs-id1171104029952\">\r\n \t<li>Simplify all expressions inside the parentheses or other grouping symbols, working on the innermost parentheses first.<\/li>\r\n<\/ul>\r\n2. <strong>E<\/strong>xponents\r\n<ul id=\"fs-id1171104407077\">\r\n \t<li>Simplify all expressions with exponents.<\/li>\r\n<\/ul>\r\n3. <strong>M<\/strong>ultiplication and <strong>D<\/strong>ivision\r\n<ul id=\"fs-id1171103140103\">\r\n \t<li>Perform all multiplication and division in order from left to right. These operations have equal priority.<\/li>\r\n<\/ul>\r\n4. <strong>A<\/strong>ddition and <strong>S<\/strong>ubtraction\r\n<ul id=\"fs-id1171104002792\">\r\n \t<li>Perform all addition and subtraction in order from left to right. These operations have equal priority.<\/li>\r\n<\/ul>\r\n<\/div>\r\nFor the standard error formula, you would follow the following steps:\r\n\r\nFirst, calculate [latex]\\frac{(s_1)^2}{n_1}[\/latex] by removing the parentheses by squaring the standard deviation of the first data set and then divide by n of the first data set.\r\n\r\nSecond, calculate [latex]\\frac{(s_2)^2}{n_2}[\/latex] by removing the parentheses by squaring the standard deviation of the second data set and then divide by n of the second data set.\r\n\r\nThird, add what you got in both previous steps and then take the square root of the sum.\r\n\r\n<\/div>\r\nThe test statistic (<em>t<\/em>-score) is calculated as follows: [latex]t= \\dfrac{(\\overline{x}_1-\\overline{x}_2)-(\\mu_1-\\mu_2)}{\\displaystyle\\sqrt{\\frac{(s_1)^2}{n_1}+\\frac{(s_2)^2}{n_2}}} [\/latex]\r\n\r\n<strong>Where:<\/strong>\r\n<ul>\r\n \t<li><em>s<\/em><sub>1<\/sub> and <em>s<\/em><sub>2<\/sub>, the sample standard deviations, are estimates of <em>\u03c3<\/em><sub>1<\/sub> and <em>\u03c3<\/em><sub>2<\/sub>, respectively.<\/li>\r\n \t<li><em>\u03c3<\/em><sub>1<\/sub> and <em>\u03c3<\/em><sub>1<\/sub> are the unknown population standard deviations.<\/li>\r\n \t<li>[latex]\\displaystyle\\overline{{x}}_{{1}} [\/latex] and [latex]\\overline{{x}}_{{2}} [\/latex] are the sample means.<\/li>\r\n \t<li>[latex]\\mu_1 [\/latex] and [latex]\\mu_2[\/latex] are the population means.<\/li>\r\n<\/ul>\r\nThe number of <strong>degrees of freedom (<em data-redactor-tag=\"em\">df<\/em>)<\/strong> requires a somewhat complicated calculation. However, a computer or calculator calculates it easily. The <em>df<\/em> are not always a whole number. The test statistic calculated previously is approximated by the Student's <em>t<\/em>-distribution with <em>df<\/em> as follows:\r\n\r\n[latex]\\displaystyle{df}=\\dfrac{((\\dfrac{(s_1)^2}{n_1})+(\\dfrac{(s_2)^2}{n_2}))^2}{(\\dfrac{1}{n_1-1})(\\dfrac{(s_1)^2}{n_1})^2+(\\dfrac{1}{n_2-1})(\\dfrac{(s_2)^2}{n_2})^2} [\/latex]\r\n\r\nWhen both sample sizes <em>n<\/em><sub>1<\/sub> and <em>n<\/em><sub>2<\/sub> are five or larger, the Student's <em>t<\/em> approximation is very good. Notice that the sample variances (<em>s<\/em><sub>1<\/sub>)<sup>2<\/sup> and (<em>s<\/em><sub>2<\/sub>)<sup>2<\/sup> are not pooled. (If the question comes up, do not pool the variances.)\r\n\r\n<strong>Note:<\/strong> It is not necessary to compute this by hand. A calculator or computer easily computes it.\r\n<div class=\"textbox exercises\">\r\n<h3>Example 1<\/h3>\r\n<strong>Independent groups<\/strong>\r\n\r\nThe average amount of time boys and girls aged seven to 11 spend playing sports each day is believed to be the same. A study is done and data are collected, resulting in the data in the table below. Each populations has a normal distribution.\r\n<table style=\"height: 33px;\">\r\n<thead>\r\n<tr style=\"height: 11px;\">\r\n<th style=\"height: 11px; width: 25.6406px;\"><\/th>\r\n<th style=\"height: 11px; width: 67.6406px;\">Sample Size<\/th>\r\n<th style=\"height: 11px; width: 272.641px;\">Average Number of Hours Playing Sports Per Day<\/th>\r\n<th style=\"height: 11px; width: 149.641px;\">Sample Standard Deviation<\/th>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr style=\"height: 11px;\">\r\n<td style=\"height: 11px; width: 25.6406px;\">Girls<\/td>\r\n<td style=\"height: 11px; width: 67.6406px;\">9<\/td>\r\n<td style=\"height: 11px; width: 272.641px;\">2<\/td>\r\n<td style=\"height: 11px; width: 149.641px;\">0.866<\/td>\r\n<\/tr>\r\n<tr style=\"height: 11px;\">\r\n<td style=\"height: 11px; width: 25.6406px;\">Boys<\/td>\r\n<td style=\"height: 11px; width: 67.6406px;\">16<\/td>\r\n<td style=\"height: 11px; width: 272.641px;\">3.2<\/td>\r\n<td style=\"height: 11px; width: 149.641px;\">1.00<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\nIs there a difference in the mean amount of time boys and girls aged seven to 11 play sports each day? Test at the 5% level of significance.\r\n\r\n[reveal-answer q=\"67452\"]Show Answer[\/reveal-answer]\r\n[hidden-answer a=\"67452\"]\r\n\r\n<strong>The population standard deviations are not known. <\/strong>Let <em>g<\/em> be the subscript for girls and <em>b<\/em> be the subscript for boys. Then, <em>\u03bc<sub data-redactor-tag=\"sub\">g<\/sub><\/em> is the population mean for girls and <em>\u03bc<sub data-redactor-tag=\"sub\">b<\/sub><\/em> is the population mean for boys. This is a test of two <strong>independent groups<\/strong>, two population <strong>means<\/strong>.\r\n\r\n<strong>Random variable: <\/strong>[latex]\\displaystyle\\overline{{X}}_{{{g}}}-\\overline{{X}}_{{b}} [\/latex] = difference in the sample mean amount of time girls and boys play sports each day.\r\n\r\n[latex]H_0:\\mu_g=\\mu_b [\/latex]; [latex]H_0:\\mu_g-\\mu_b=0 [\/latex]\r\n\r\n[latex]H_a:\\mu_g\\neq\\mu_b [\/latex]; [latex]H_a:\\mu_g-\\mu_b\\neq{0} [\/latex]\r\n\r\nThe words \"<strong>the same<\/strong>\" tell you <em>H<sub data-redactor-tag=\"sub\">0<\/sub><\/em> has an equal sign. Since there are no other words to indicate <em>H<sub data-redactor-tag=\"sub\">a<\/sub><\/em>, assume it says \"<strong>is different<\/strong>.\" This is a two-tailed test.\r\n\r\n<strong>Distribution for the test:<\/strong> Use <em>t<sub data-redactor-tag=\"sub\">df<\/sub><\/em> where <em>df<\/em> is calculated using the <em>df <\/em>formula for independent groups, two population means. Using a calculator, <em>df<\/em> is approximately 18.8462. <strong>Do not pool the variances.<\/strong>\r\n\r\n<strong>Calculate the <em data-redactor-tag=\"em\">p<\/em>-value using a Student's <em>t<\/em>-distribution:<\/strong> <em>p<\/em>-value = 0.0054\r\n\r\n<strong>Graph:<\/strong>\r\n\r\n<img class=\"aligncenter wp-image-2110 size-full\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5668\/2021\/07\/01161712\/219dc5b9cc62017e5f30888eb68e03577cf4b353.jpeg\" alt=\"This is a normal distribution curve representing the difference in the average amount of time girls and boys play sports all day. The mean is equal to zero, and the values -1.2, 0, and 1.2 are labeled on the horizontal axis. Two vertical lines extend from -1.2 and 1.2 to the curve. The region to the left of x = -1.2 and the region to the right of x = 1.2 are shaded to represent the p-value. The area of each region is 0.0028.\" width=\"488\" height=\"208\" \/>\r\n\r\n<em>s<sub>g<\/sub><\/em> = 0.866\r\n\r\n<em>s<sub>b<\/sub><\/em> = 1\r\n\r\nSo, [latex]\\displaystyle\\overline{x}_g-\\overline{x}_b=2-3.2=-1.2 [\/latex]\r\n\r\nHalf the <em>p<\/em>-value is below \u20131.2 and half is above 1.2.\r\n\r\n<strong>Make a decision:<\/strong> Since <em>\u03b1<\/em> &gt; <em>p<\/em>-value, reject <em>H<sub>0<\/sub><\/em>. This means you reject <em>\u03bc<sub>g<\/sub><\/em> = <em>\u03bc<sub>b<\/sub><\/em>. The means are different.\r\n\r\n<header>\r\n<h4 class=\"os-title\" data-type=\"title\"><span class=\"os-title-label\">USING THE TI-83, 83+, 84, 84+ CALCULATOR<\/span><\/h4>\r\n<\/header><section>\r\n<div class=\"os-note-body\"><\/div>\r\n<\/section>\r\n<ul>\r\n \t<li>Press <code style=\"line-height: 1.6em;\">STAT<\/code>.<\/li>\r\n \t<li>Arrow over to <code style=\"line-height: 1.6em;\">TESTS<\/code> and press <code style=\"line-height: 1.6em;\">4:2-SampTTest<\/code>.<\/li>\r\n \t<li>Arrow over to Stats and press <code style=\"line-height: 1.6em;\">ENTER<\/code>.<\/li>\r\n \t<li>Arrow down and enter <code style=\"line-height: 1.6em;\">2<\/code> for the first sample mean, [latex]0.866[\/latex] for Sx1, <code style=\"line-height: 1.6em;\">9<\/code> for n1, <code style=\"line-height: 1.6em;\">3.2<\/code> for the second sample mean, <code style=\"line-height: 1.6em;\">1<\/code> for Sx2, and <code style=\"line-height: 1.6em;\">16<\/code> for n2.<\/li>\r\n \t<li>Arrow down to \u03bc1: and arrow to <code style=\"line-height: 1.6em;\">does not equal<\/code> \u03bc2.<\/li>\r\n \t<li>Press <code style=\"line-height: 1.6em;\">ENTER<\/code>.<\/li>\r\n \t<li>Arrow down to Pooled: and<code style=\"line-height: 1.6em;\">No<\/code>.<\/li>\r\n \t<li>Press <code style=\"line-height: 1.6em;\">ENTER<\/code>.<\/li>\r\n \t<li>Arrow down to <code style=\"line-height: 1.6em;\">Calculate<\/code> and press <code style=\"line-height: 1.6em;\">ENTER<\/code>.<\/li>\r\n \t<li>The <em style=\"font-size: 1rem; orphans: 1; text-align: initial;\">p<\/em><span style=\"font-size: 1rem; orphans: 1; text-align: initial;\">-value is <\/span><em style=\"font-size: 1rem; orphans: 1; text-align: initial;\">p<\/em><span style=\"font-size: 1rem; orphans: 1; text-align: initial;\"> = 0.0054, the <\/span><em style=\"font-size: 1rem; orphans: 1; text-align: initial;\">dfs<\/em><span style=\"font-size: 1rem; orphans: 1; text-align: initial;\"> are approximately 18.8462, and the test statistic is \u20133.14.<\/span><\/li>\r\n \t<li>Do the procedure again but instead of Calculate do Draw.<\/li>\r\n<\/ul>\r\n<strong>Conclusion: <\/strong>At the 5% level of significance, the sample data show there is sufficient evidence to conclude that the mean number of hours that girls and boys aged seven to 11 play sports per day is different (the mean number of hours boys aged seven to 11 play sports per day is greater than the mean number of hours played by girls OR the mean number of hours girls aged seven to 11 play sports per day is greater than the mean number of hours played by boys).\r\n\r\n[\/hidden-answer]\r\n\r\n<\/div>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>try it 1<\/h3>\r\nTwo samples are shown in the table. Both have normal distributions. The means for the two populations are thought to be the same. Is there a difference in the means? Test at the 5% level of significance.\r\n<table>\r\n<thead>\r\n<tr>\r\n<th><\/th>\r\n<th>Sample Size<\/th>\r\n<th>Sample Mean<\/th>\r\n<th>Sample Standard Deviation<\/th>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td>Population A<\/td>\r\n<td>25<\/td>\r\n<td>5<\/td>\r\n<td>1<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Population B<\/td>\r\n<td>16<\/td>\r\n<td>4.7<\/td>\r\n<td>1.2<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n[reveal-answer q=\"14467\"]Show Answer[\/reveal-answer]\r\n[hidden-answer a=\"14467\"]\r\n\r\nThe <em>p<\/em>-value is 0.4125, which is much higher than 0.05, so we decline to reject the null hypothesis. There is not sufficient evidence to conclude that the means of the two populations are not the same.\r\n\r\n[\/hidden-answer]\r\n\r\n<\/div>\r\n<strong>Note:<\/strong> When the sum of the sample sizes is larger than 30 (<em>n<\/em><sub>1<\/sub> + <em>n<\/em><sub>2<\/sub> &gt; 30) you can use the normal distribution to approximate the Student's <em>t<\/em>.\r\n<div class=\"textbox exercises\">\r\n<h3>Example 2<\/h3>\r\nA study is done by a community group in two neighboring colleges to determine which one graduates students with more math classes. College A samples 11 graduates. Their average is four math classes with a standard deviation of 1.5 math classes. College B samples nine graduates. Their average is 3.5 math classes with a standard deviation of one math class. The community group believes that a student who graduates from College A <strong>has taken more math classes<\/strong>, on the average. Both populations have a normal distribution. Test at a 1% significance level. Answer the following questions.\r\n<ol>\r\n \t<li>Is this a test of two means or two proportions?<\/li>\r\n \t<li>Are the populations standard deviations known or unknown?<\/li>\r\n \t<li>Which distribution do you use to perform the test?<\/li>\r\n \t<li>What is the random variable?<\/li>\r\n \t<li>What are the null and alternate hypotheses?<\/li>\r\n \t<li>Is this test right-, left-, or two-tailed?<\/li>\r\n \t<li>What is the <em>p<\/em>-value?<\/li>\r\n \t<li>Do you reject or not reject the null hypothesis?<\/li>\r\n<\/ol>\r\n[reveal-answer q=\"493064\"]Show Answer[\/reveal-answer]\r\n[hidden-answer a=\"493064\"]\r\n<ol>\r\n \t<li>two means<\/li>\r\n \t<li>unknown<\/li>\r\n \t<li>Student's <em>t<\/em><\/li>\r\n \t<li>[latex]\\displaystyle\\overline{{X}}_{{{A}}}-\\overline{{X}}_{{B}} [\/latex]<\/li>\r\n \t<li><em>H<sub>0<\/sub><\/em>:<em> \u03bc<sub>A<\/sub><\/em>\u2264<em> \u03bc<sub>B<\/sub><\/em>\r\n<em> H<sub>a<\/sub><\/em>:<em> \u03bc<sub>A<\/sub><\/em>&gt;<em> \u03bc<sub>B<\/sub><\/em><\/li>\r\n \t<li>right\r\n<img class=\"alignnone wp-image-2112 size-full\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5668\/2021\/07\/01161847\/70782e8d6a8ad9163855f668c5223866c7e0315d.jpeg\" alt=\"This is a normal distribution curve with mean equal to 0. A vertical line near the tail of the curve to the right of zero extends from the axis to the curve. The region under the curve to the right of the line is shaded.\" width=\"487\" height=\"240\" \/><\/li>\r\n \t<li>0.1928<\/li>\r\n \t<li>do not reject<\/li>\r\n<\/ol>\r\n<strong>Conclusion: <\/strong>At the 1% level of significance, from the sample data, there is not sufficient evidence to conclude that a student who graduates from College A has taken more math classes, on the average, than a student who graduates from College B.\r\n\r\n[\/hidden-answer]\r\n\r\n<\/div>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>try it 2<\/h3>\r\nA study is done to determine if Company A retains its workers longer than Company B. Company A samples 15 workers, and their average time with the company is five years with a standard deviation of 1.2. Company B samples 20 workers, and their average time with the company is 4.5 years with a standard deviation of 0.8. The populations are normally distributed.\r\n<ol>\r\n \t<li>Are the population standard deviations known?<\/li>\r\n \t<li>Conduct an appropriate hypothesis test. At the 5% significance level, what is your conclusion?<\/li>\r\n<\/ol>\r\n[reveal-answer q=\"196475\"]Show Answer[\/reveal-answer]\r\n[hidden-answer a=\"196475\"]\r\n<ol>\r\n \t<li>They are unknown.<\/li>\r\n \t<li>The <em style=\"font-size: 1rem; orphans: 1; text-align: initial;\">p<\/em><span style=\"font-size: 1rem; orphans: 1; text-align: initial;\">-value = 0.0878. At the 5% level of significance, there is insufficient evidence to conclude that the workers of Company A stay longer with the company.<\/span><\/li>\r\n<\/ol>\r\n[\/hidden-answer]\r\n\r\n<\/div>\r\nhttps:\/\/www.youtube.com\/embed\/mvye6X_0upA\r\n<div class=\"textbox exercises\">\r\n<h3>Example 3<\/h3>\r\nA professor at a large community college wanted to determine whether there is a difference in the means of final exam scores between students who took his statistics course online and the students who took his face-to-face statistics class. He believed that the mean of the final exam scores for the online class would be lower than that of the face-to-face class. Was the professor correct? The randomly selected 30 final exam scores from each group are listed in the two tables below:\r\n\r\nOnline Class:\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>67.6<\/td>\r\n<td>41.2<\/td>\r\n<td>85.3<\/td>\r\n<td>55.9<\/td>\r\n<td>82.4<\/td>\r\n<td>91.2<\/td>\r\n<td>73.5<\/td>\r\n<td>94.1<\/td>\r\n<td>64.7<\/td>\r\n<td>64.7<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>70.6<\/td>\r\n<td>38.2<\/td>\r\n<td>61.8<\/td>\r\n<td>88.2<\/td>\r\n<td>70.6<\/td>\r\n<td>58.8<\/td>\r\n<td>91.2<\/td>\r\n<td>73.5<\/td>\r\n<td>82.4<\/td>\r\n<td>35.5<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>94.1<\/td>\r\n<td>88.2<\/td>\r\n<td>64.7<\/td>\r\n<td>55.9<\/td>\r\n<td>88.2<\/td>\r\n<td>97.1<\/td>\r\n<td>85.3<\/td>\r\n<td>61.8<\/td>\r\n<td>79.4<\/td>\r\n<td>79.4<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\nFace-to-face Class:\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td>77.9<\/td>\r\n<td>95.3<\/td>\r\n<td>81.2<\/td>\r\n<td>74.1<\/td>\r\n<td>98.8<\/td>\r\n<td>88.2<\/td>\r\n<td>85.9<\/td>\r\n<td>92.9<\/td>\r\n<td>87.1<\/td>\r\n<td>88.2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>69.4<\/td>\r\n<td>57.6<\/td>\r\n<td>69.4<\/td>\r\n<td>67.1<\/td>\r\n<td>97.6<\/td>\r\n<td>85.9<\/td>\r\n<td>88.2<\/td>\r\n<td>91.8<\/td>\r\n<td>78.8<\/td>\r\n<td>71.8<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>98.8<\/td>\r\n<td>61.2<\/td>\r\n<td>92.9<\/td>\r\n<td>90.6<\/td>\r\n<td>97.6<\/td>\r\n<td>100<\/td>\r\n<td>95.3<\/td>\r\n<td>83.5<\/td>\r\n<td>92.9<\/td>\r\n<td>89.4<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\nIs the mean of the Final Exam scores of the online class lower than the mean of the Final Exam scores of the face-to-face class? Test at a 5% significance level. Answer the following questions:\r\n<ol>\r\n \t<li>Is this a test of two means or two proportions?<\/li>\r\n \t<li>Are the population standard deviations known or unknown?<\/li>\r\n \t<li>Which distribution do you use to perform the test?<\/li>\r\n \t<li>What is the random variable?<\/li>\r\n \t<li>What are the null and alternative hypotheses? Write the null and alternative hypotheses in words and in symbols.<\/li>\r\n \t<li>Is this test right, left, or two tailed?<\/li>\r\n \t<li>What is the <em>p<\/em>-value?<\/li>\r\n \t<li>Do you reject or not reject the null hypothesis?<\/li>\r\n \t<li>At the ___ level of significance, from the sample data, there ______ (is\/is not) sufficient evidence to conclude that ______.<\/li>\r\n<\/ol>\r\n(Review the conclusion in Example 2, and write yours in a similar fashion)\r\n\r\nBe careful not to mix up the information for Group 1 and Group 2!\r\n\r\n[reveal-answer q=\"984147\"]Show Answer[\/reveal-answer]\r\n[hidden-answer a=\"984147\"]\r\n\r\n<header>\r\n<h4 class=\"os-title\" data-type=\"title\"><span class=\"os-title-label\">USING THE TI-83, 83+, 84, 84+ CALCULATOR<\/span><\/h4>\r\n<\/header>First put the data for each group into two lists (such as L1 and L2). Press STAT. Arrow over to TESTS and press 4:2SampTTest. Make sure Data is highlighted and press ENTER. Arrow down and enter L1 for the first list and L2 for the second list. Arrow down to\r\n<em>\u03bc<\/em><sub>1<\/sub>: and arrow to \u2260 <em>\u03bc<\/em><sub>2<\/sub> (does not equal). Press ENTER. Arrow down to Pooled: No. Press ENTER. Arrow down to Calculate and press ENTER.\r\n<ol>\r\n \t<li>two means<\/li>\r\n \t<li>unknown<\/li>\r\n \t<li>Student's <em>t<\/em><\/li>\r\n \t<li>[latex]\\displaystyle\\overline{{X}}_{{1}}-\\overline{{X}}_{{2}} [\/latex]\r\n<ol>\r\n \t<li><em>H<sub data-redactor-tag=\"sub\">0<\/sub><\/em>: <em>\u03bc<sub data-redactor-tag=\"sub\">1<\/sub><\/em> = <em>\u03bc<sub data-redactor-tag=\"sub\">2<\/sub><\/em> Null hypothesis: the means of the final exam scores are equal for the online and face-to-face statistics classes.<\/li>\r\n \t<li><em>H<sub data-redactor-tag=\"sub\">a<\/sub><\/em>: <em>\u03bc<sub data-redactor-tag=\"sub\">1<\/sub><\/em> &lt; <em>\u03bc<sub data-redactor-tag=\"sub\">2<\/sub><\/em> Alternative hypothesis: the mean of the final exam scores of the online class is less than the mean of the final exam scores of the face-to-face class.<\/li>\r\n<\/ol>\r\n<\/li>\r\n \t<li>left-tailed<\/li>\r\n \t<li><em>p<\/em>-value = 0.0011\r\n<img class=\"alignnone wp-image-2114 size-full\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5668\/2021\/07\/01162554\/30b8f681a7471d241b0674f60f2c05602b6ee9c5.jpeg\" alt=\"This is a normal distribution curve with mean equal to zero. A vertical line near the tail of the curve to the left of zero extends from the axis to the curve. The region under the curve to the left of the line is shaded representing p-value = 0.0011.\" width=\"487\" height=\"177\" \/><\/li>\r\n \t<li>reject the null hypothesis<\/li>\r\n \t<li>The professor was correct. The evidence shows that the mean of the final exam scores for the online class is lower than that of the face-to-face class. At the <span style=\"text-decoration: underline;\">5%<\/span> level of significance, from the sample data, there <span style=\"text-decoration: underline;\">is<\/span> (is\/is not) sufficient evidence to conclude that the mean of the final exam scores for the online class is less than <span style=\"text-decoration: underline;\">the mean of final exam scores of the face-to-face class.<\/span><\/li>\r\n<\/ol>\r\n[\/hidden-answer]\r\n\r\n<\/div>","rendered":"<div class=\"textbox learning-objectives\">\n<h3>Learning Outcomes<\/h3>\n<section>\n<ul>\n<li>Conduct a hypothesis test for a difference in two population means with unknown standard deviations and interpret the conclusion in context<\/li>\n<\/ul>\n<\/section>\n<\/div>\n<ol>\n<li>The two independent samples are simple random samples from two distinct populations.<\/li>\n<li>For the two distinct populations:\n<ul>\n<li>if the sample sizes are small, the distributions are important (should be normal)<\/li>\n<li>if the sample sizes are large, the distributions are not important (need not be normal)<\/li>\n<\/ul>\n<\/li>\n<\/ol>\n<p><strong>Note:<\/strong> The test comparing two independent population means with unknown and possibly unequal population standard deviations is called the Aspin-Welch t-test. The degrees of freedom formula was developed by Aspin-Welch.<\/p>\n<p>The comparison of two population means is very common. A difference between the two samples depends on both the means and the standard deviations. Very different means can occur by chance if there is great variation among the individual samples. In order to account for the variation, we take the difference of the sample means, [latex]\\displaystyle\\overline{{X}}_{{1}}-\\overline{{X}}_{{2}}[\/latex], and divide by the standard error in order to standardize the difference. The result is a t-score test statistic.<\/p>\n<p>Because we do not know the population standard deviations, we estimate them using the two-sample standard deviations from our independent samples. For the hypothesis test, we calculate the estimated standard deviation, or <strong>standard error<\/strong>, of <strong>the difference in sample means, <\/strong>[latex]\\displaystyle\\overline{{X}}_{{1}}-\\overline{{X}}_{{2}}[\/latex].<\/p>\n<p>The standard error is: [latex]\\displaystyle\\sqrt{\\frac{(s_1)^2}{n_1}+\\frac{(s_2)^2}{n_2}}[\/latex]<\/p>\n<div class=\"textbox examples\">\n<h3>Recall: Order of Operations<\/h3>\n<div class=\"textbox shaded\">\n<p>When simplifying mathematical expressions perform the operations in the following order:<br \/>\n1. <strong>P<\/strong>arentheses and other Grouping Symbols<\/p>\n<ul id=\"fs-id1171104029952\">\n<li>Simplify all expressions inside the parentheses or other grouping symbols, working on the innermost parentheses first.<\/li>\n<\/ul>\n<p>2. <strong>E<\/strong>xponents<\/p>\n<ul id=\"fs-id1171104407077\">\n<li>Simplify all expressions with exponents.<\/li>\n<\/ul>\n<p>3. <strong>M<\/strong>ultiplication and <strong>D<\/strong>ivision<\/p>\n<ul id=\"fs-id1171103140103\">\n<li>Perform all multiplication and division in order from left to right. These operations have equal priority.<\/li>\n<\/ul>\n<p>4. <strong>A<\/strong>ddition and <strong>S<\/strong>ubtraction<\/p>\n<ul id=\"fs-id1171104002792\">\n<li>Perform all addition and subtraction in order from left to right. These operations have equal priority.<\/li>\n<\/ul>\n<\/div>\n<p>For the standard error formula, you would follow the following steps:<\/p>\n<p>First, calculate [latex]\\frac{(s_1)^2}{n_1}[\/latex] by removing the parentheses by squaring the standard deviation of the first data set and then divide by n of the first data set.<\/p>\n<p>Second, calculate [latex]\\frac{(s_2)^2}{n_2}[\/latex] by removing the parentheses by squaring the standard deviation of the second data set and then divide by n of the second data set.<\/p>\n<p>Third, add what you got in both previous steps and then take the square root of the sum.<\/p>\n<\/div>\n<p>The test statistic (<em>t<\/em>-score) is calculated as follows: [latex]t= \\dfrac{(\\overline{x}_1-\\overline{x}_2)-(\\mu_1-\\mu_2)}{\\displaystyle\\sqrt{\\frac{(s_1)^2}{n_1}+\\frac{(s_2)^2}{n_2}}}[\/latex]<\/p>\n<p><strong>Where:<\/strong><\/p>\n<ul>\n<li><em>s<\/em><sub>1<\/sub> and <em>s<\/em><sub>2<\/sub>, the sample standard deviations, are estimates of <em>\u03c3<\/em><sub>1<\/sub> and <em>\u03c3<\/em><sub>2<\/sub>, respectively.<\/li>\n<li><em>\u03c3<\/em><sub>1<\/sub> and <em>\u03c3<\/em><sub>1<\/sub> are the unknown population standard deviations.<\/li>\n<li>[latex]\\displaystyle\\overline{{x}}_{{1}}[\/latex] and [latex]\\overline{{x}}_{{2}}[\/latex] are the sample means.<\/li>\n<li>[latex]\\mu_1[\/latex] and [latex]\\mu_2[\/latex] are the population means.<\/li>\n<\/ul>\n<p>The number of <strong>degrees of freedom (<em data-redactor-tag=\"em\">df<\/em>)<\/strong> requires a somewhat complicated calculation. However, a computer or calculator calculates it easily. The <em>df<\/em> are not always a whole number. The test statistic calculated previously is approximated by the Student&#8217;s <em>t<\/em>-distribution with <em>df<\/em> as follows:<\/p>\n<p>[latex]\\displaystyle{df}=\\dfrac{((\\dfrac{(s_1)^2}{n_1})+(\\dfrac{(s_2)^2}{n_2}))^2}{(\\dfrac{1}{n_1-1})(\\dfrac{(s_1)^2}{n_1})^2+(\\dfrac{1}{n_2-1})(\\dfrac{(s_2)^2}{n_2})^2}[\/latex]<\/p>\n<p>When both sample sizes <em>n<\/em><sub>1<\/sub> and <em>n<\/em><sub>2<\/sub> are five or larger, the Student&#8217;s <em>t<\/em> approximation is very good. Notice that the sample variances (<em>s<\/em><sub>1<\/sub>)<sup>2<\/sup> and (<em>s<\/em><sub>2<\/sub>)<sup>2<\/sup> are not pooled. (If the question comes up, do not pool the variances.)<\/p>\n<p><strong>Note:<\/strong> It is not necessary to compute this by hand. A calculator or computer easily computes it.<\/p>\n<div class=\"textbox exercises\">\n<h3>Example 1<\/h3>\n<p><strong>Independent groups<\/strong><\/p>\n<p>The average amount of time boys and girls aged seven to 11 spend playing sports each day is believed to be the same. A study is done and data are collected, resulting in the data in the table below. Each populations has a normal distribution.<\/p>\n<table style=\"height: 33px;\">\n<thead>\n<tr style=\"height: 11px;\">\n<th style=\"height: 11px; width: 25.6406px;\"><\/th>\n<th style=\"height: 11px; width: 67.6406px;\">Sample Size<\/th>\n<th style=\"height: 11px; width: 272.641px;\">Average Number of Hours Playing Sports Per Day<\/th>\n<th style=\"height: 11px; width: 149.641px;\">Sample Standard Deviation<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr style=\"height: 11px;\">\n<td style=\"height: 11px; width: 25.6406px;\">Girls<\/td>\n<td style=\"height: 11px; width: 67.6406px;\">9<\/td>\n<td style=\"height: 11px; width: 272.641px;\">2<\/td>\n<td style=\"height: 11px; width: 149.641px;\">0.866<\/td>\n<\/tr>\n<tr style=\"height: 11px;\">\n<td style=\"height: 11px; width: 25.6406px;\">Boys<\/td>\n<td style=\"height: 11px; width: 67.6406px;\">16<\/td>\n<td style=\"height: 11px; width: 272.641px;\">3.2<\/td>\n<td style=\"height: 11px; width: 149.641px;\">1.00<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Is there a difference in the mean amount of time boys and girls aged seven to 11 play sports each day? Test at the 5% level of significance.<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q67452\">Show Answer<\/span><\/p>\n<div id=\"q67452\" class=\"hidden-answer\" style=\"display: none\">\n<p><strong>The population standard deviations are not known. <\/strong>Let <em>g<\/em> be the subscript for girls and <em>b<\/em> be the subscript for boys. Then, <em>\u03bc<sub data-redactor-tag=\"sub\">g<\/sub><\/em> is the population mean for girls and <em>\u03bc<sub data-redactor-tag=\"sub\">b<\/sub><\/em> is the population mean for boys. This is a test of two <strong>independent groups<\/strong>, two population <strong>means<\/strong>.<\/p>\n<p><strong>Random variable: <\/strong>[latex]\\displaystyle\\overline{{X}}_{{{g}}}-\\overline{{X}}_{{b}}[\/latex] = difference in the sample mean amount of time girls and boys play sports each day.<\/p>\n<p>[latex]H_0:\\mu_g=\\mu_b[\/latex]; [latex]H_0:\\mu_g-\\mu_b=0[\/latex]<\/p>\n<p>[latex]H_a:\\mu_g\\neq\\mu_b[\/latex]; [latex]H_a:\\mu_g-\\mu_b\\neq{0}[\/latex]<\/p>\n<p>The words &#8220;<strong>the same<\/strong>&#8221; tell you <em>H<sub data-redactor-tag=\"sub\">0<\/sub><\/em> has an equal sign. Since there are no other words to indicate <em>H<sub data-redactor-tag=\"sub\">a<\/sub><\/em>, assume it says &#8220;<strong>is different<\/strong>.&#8221; This is a two-tailed test.<\/p>\n<p><strong>Distribution for the test:<\/strong> Use <em>t<sub data-redactor-tag=\"sub\">df<\/sub><\/em> where <em>df<\/em> is calculated using the <em>df <\/em>formula for independent groups, two population means. Using a calculator, <em>df<\/em> is approximately 18.8462. <strong>Do not pool the variances.<\/strong><\/p>\n<p><strong>Calculate the <em data-redactor-tag=\"em\">p<\/em>-value using a Student&#8217;s <em>t<\/em>-distribution:<\/strong> <em>p<\/em>-value = 0.0054<\/p>\n<p><strong>Graph:<\/strong><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-2110 size-full\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5668\/2021\/07\/01161712\/219dc5b9cc62017e5f30888eb68e03577cf4b353.jpeg\" alt=\"This is a normal distribution curve representing the difference in the average amount of time girls and boys play sports all day. The mean is equal to zero, and the values -1.2, 0, and 1.2 are labeled on the horizontal axis. Two vertical lines extend from -1.2 and 1.2 to the curve. The region to the left of x = -1.2 and the region to the right of x = 1.2 are shaded to represent the p-value. The area of each region is 0.0028.\" width=\"488\" height=\"208\" \/><\/p>\n<p><em>s<sub>g<\/sub><\/em> = 0.866<\/p>\n<p><em>s<sub>b<\/sub><\/em> = 1<\/p>\n<p>So, [latex]\\displaystyle\\overline{x}_g-\\overline{x}_b=2-3.2=-1.2[\/latex]<\/p>\n<p>Half the <em>p<\/em>-value is below \u20131.2 and half is above 1.2.<\/p>\n<p><strong>Make a decision:<\/strong> Since <em>\u03b1<\/em> &gt; <em>p<\/em>-value, reject <em>H<sub>0<\/sub><\/em>. This means you reject <em>\u03bc<sub>g<\/sub><\/em> = <em>\u03bc<sub>b<\/sub><\/em>. The means are different.<\/p>\n<header>\n<h4 class=\"os-title\" data-type=\"title\"><span class=\"os-title-label\">USING THE TI-83, 83+, 84, 84+ CALCULATOR<\/span><\/h4>\n<\/header>\n<section>\n<div class=\"os-note-body\"><\/div>\n<\/section>\n<ul>\n<li>Press <code style=\"line-height: 1.6em;\">STAT<\/code>.<\/li>\n<li>Arrow over to <code style=\"line-height: 1.6em;\">TESTS<\/code> and press <code style=\"line-height: 1.6em;\">4:2-SampTTest<\/code>.<\/li>\n<li>Arrow over to Stats and press <code style=\"line-height: 1.6em;\">ENTER<\/code>.<\/li>\n<li>Arrow down and enter <code style=\"line-height: 1.6em;\">2<\/code> for the first sample mean, [latex]0.866[\/latex] for Sx1, <code style=\"line-height: 1.6em;\">9<\/code> for n1, <code style=\"line-height: 1.6em;\">3.2<\/code> for the second sample mean, <code style=\"line-height: 1.6em;\">1<\/code> for Sx2, and <code style=\"line-height: 1.6em;\">16<\/code> for n2.<\/li>\n<li>Arrow down to \u03bc1: and arrow to <code style=\"line-height: 1.6em;\">does not equal<\/code> \u03bc2.<\/li>\n<li>Press <code style=\"line-height: 1.6em;\">ENTER<\/code>.<\/li>\n<li>Arrow down to Pooled: and<code style=\"line-height: 1.6em;\">No<\/code>.<\/li>\n<li>Press <code style=\"line-height: 1.6em;\">ENTER<\/code>.<\/li>\n<li>Arrow down to <code style=\"line-height: 1.6em;\">Calculate<\/code> and press <code style=\"line-height: 1.6em;\">ENTER<\/code>.<\/li>\n<li>The <em style=\"font-size: 1rem; orphans: 1; text-align: initial;\">p<\/em><span style=\"font-size: 1rem; orphans: 1; text-align: initial;\">-value is <\/span><em style=\"font-size: 1rem; orphans: 1; text-align: initial;\">p<\/em><span style=\"font-size: 1rem; orphans: 1; text-align: initial;\"> = 0.0054, the <\/span><em style=\"font-size: 1rem; orphans: 1; text-align: initial;\">dfs<\/em><span style=\"font-size: 1rem; orphans: 1; text-align: initial;\"> are approximately 18.8462, and the test statistic is \u20133.14.<\/span><\/li>\n<li>Do the procedure again but instead of Calculate do Draw.<\/li>\n<\/ul>\n<p><strong>Conclusion: <\/strong>At the 5% level of significance, the sample data show there is sufficient evidence to conclude that the mean number of hours that girls and boys aged seven to 11 play sports per day is different (the mean number of hours boys aged seven to 11 play sports per day is greater than the mean number of hours played by girls OR the mean number of hours girls aged seven to 11 play sports per day is greater than the mean number of hours played by boys).<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div class=\"textbox key-takeaways\">\n<h3>try it 1<\/h3>\n<p>Two samples are shown in the table. Both have normal distributions. The means for the two populations are thought to be the same. Is there a difference in the means? Test at the 5% level of significance.<\/p>\n<table>\n<thead>\n<tr>\n<th><\/th>\n<th>Sample Size<\/th>\n<th>Sample Mean<\/th>\n<th>Sample Standard Deviation<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>Population A<\/td>\n<td>25<\/td>\n<td>5<\/td>\n<td>1<\/td>\n<\/tr>\n<tr>\n<td>Population B<\/td>\n<td>16<\/td>\n<td>4.7<\/td>\n<td>1.2<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q14467\">Show Answer<\/span><\/p>\n<div id=\"q14467\" class=\"hidden-answer\" style=\"display: none\">\n<p>The <em>p<\/em>-value is 0.4125, which is much higher than 0.05, so we decline to reject the null hypothesis. There is not sufficient evidence to conclude that the means of the two populations are not the same.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<p><strong>Note:<\/strong> When the sum of the sample sizes is larger than 30 (<em>n<\/em><sub>1<\/sub> + <em>n<\/em><sub>2<\/sub> &gt; 30) you can use the normal distribution to approximate the Student&#8217;s <em>t<\/em>.<\/p>\n<div class=\"textbox exercises\">\n<h3>Example 2<\/h3>\n<p>A study is done by a community group in two neighboring colleges to determine which one graduates students with more math classes. College A samples 11 graduates. Their average is four math classes with a standard deviation of 1.5 math classes. College B samples nine graduates. Their average is 3.5 math classes with a standard deviation of one math class. The community group believes that a student who graduates from College A <strong>has taken more math classes<\/strong>, on the average. Both populations have a normal distribution. Test at a 1% significance level. Answer the following questions.<\/p>\n<ol>\n<li>Is this a test of two means or two proportions?<\/li>\n<li>Are the populations standard deviations known or unknown?<\/li>\n<li>Which distribution do you use to perform the test?<\/li>\n<li>What is the random variable?<\/li>\n<li>What are the null and alternate hypotheses?<\/li>\n<li>Is this test right-, left-, or two-tailed?<\/li>\n<li>What is the <em>p<\/em>-value?<\/li>\n<li>Do you reject or not reject the null hypothesis?<\/li>\n<\/ol>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q493064\">Show Answer<\/span><\/p>\n<div id=\"q493064\" class=\"hidden-answer\" style=\"display: none\">\n<ol>\n<li>two means<\/li>\n<li>unknown<\/li>\n<li>Student&#8217;s <em>t<\/em><\/li>\n<li>[latex]\\displaystyle\\overline{{X}}_{{{A}}}-\\overline{{X}}_{{B}}[\/latex]<\/li>\n<li><em>H<sub>0<\/sub><\/em>:<em> \u03bc<sub>A<\/sub><\/em>\u2264<em> \u03bc<sub>B<\/sub><\/em><br \/>\n<em> H<sub>a<\/sub><\/em>:<em> \u03bc<sub>A<\/sub><\/em>&gt;<em> \u03bc<sub>B<\/sub><\/em><\/li>\n<li>right<br \/>\n<img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-2112 size-full\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5668\/2021\/07\/01161847\/70782e8d6a8ad9163855f668c5223866c7e0315d.jpeg\" alt=\"This is a normal distribution curve with mean equal to 0. A vertical line near the tail of the curve to the right of zero extends from the axis to the curve. The region under the curve to the right of the line is shaded.\" width=\"487\" height=\"240\" \/><\/li>\n<li>0.1928<\/li>\n<li>do not reject<\/li>\n<\/ol>\n<p><strong>Conclusion: <\/strong>At the 1% level of significance, from the sample data, there is not sufficient evidence to conclude that a student who graduates from College A has taken more math classes, on the average, than a student who graduates from College B.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div class=\"textbox key-takeaways\">\n<h3>try it 2<\/h3>\n<p>A study is done to determine if Company A retains its workers longer than Company B. Company A samples 15 workers, and their average time with the company is five years with a standard deviation of 1.2. Company B samples 20 workers, and their average time with the company is 4.5 years with a standard deviation of 0.8. The populations are normally distributed.<\/p>\n<ol>\n<li>Are the population standard deviations known?<\/li>\n<li>Conduct an appropriate hypothesis test. At the 5% significance level, what is your conclusion?<\/li>\n<\/ol>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q196475\">Show Answer<\/span><\/p>\n<div id=\"q196475\" class=\"hidden-answer\" style=\"display: none\">\n<ol>\n<li>They are unknown.<\/li>\n<li>The <em style=\"font-size: 1rem; orphans: 1; text-align: initial;\">p<\/em><span style=\"font-size: 1rem; orphans: 1; text-align: initial;\">-value = 0.0878. At the 5% level of significance, there is insufficient evidence to conclude that the workers of Company A stay longer with the company.<\/span><\/li>\n<\/ol>\n<\/div>\n<\/div>\n<\/div>\n<p><iframe loading=\"lazy\" id=\"oembed-1\" title=\"One-tailed and two-tailed tests | Inferential statistics | Probability and Statistics | Khan Academy\" width=\"500\" height=\"281\" src=\"https:\/\/www.youtube.com\/embed\/mvye6X_0upA?feature=oembed&#38;rel=0\" frameborder=\"0\" allowfullscreen=\"allowfullscreen\"><\/iframe><\/p>\n<div class=\"textbox exercises\">\n<h3>Example 3<\/h3>\n<p>A professor at a large community college wanted to determine whether there is a difference in the means of final exam scores between students who took his statistics course online and the students who took his face-to-face statistics class. He believed that the mean of the final exam scores for the online class would be lower than that of the face-to-face class. Was the professor correct? The randomly selected 30 final exam scores from each group are listed in the two tables below:<\/p>\n<p>Online Class:<\/p>\n<table>\n<tbody>\n<tr>\n<td>67.6<\/td>\n<td>41.2<\/td>\n<td>85.3<\/td>\n<td>55.9<\/td>\n<td>82.4<\/td>\n<td>91.2<\/td>\n<td>73.5<\/td>\n<td>94.1<\/td>\n<td>64.7<\/td>\n<td>64.7<\/td>\n<\/tr>\n<tr>\n<td>70.6<\/td>\n<td>38.2<\/td>\n<td>61.8<\/td>\n<td>88.2<\/td>\n<td>70.6<\/td>\n<td>58.8<\/td>\n<td>91.2<\/td>\n<td>73.5<\/td>\n<td>82.4<\/td>\n<td>35.5<\/td>\n<\/tr>\n<tr>\n<td>94.1<\/td>\n<td>88.2<\/td>\n<td>64.7<\/td>\n<td>55.9<\/td>\n<td>88.2<\/td>\n<td>97.1<\/td>\n<td>85.3<\/td>\n<td>61.8<\/td>\n<td>79.4<\/td>\n<td>79.4<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Face-to-face Class:<\/p>\n<table>\n<tbody>\n<tr>\n<td>77.9<\/td>\n<td>95.3<\/td>\n<td>81.2<\/td>\n<td>74.1<\/td>\n<td>98.8<\/td>\n<td>88.2<\/td>\n<td>85.9<\/td>\n<td>92.9<\/td>\n<td>87.1<\/td>\n<td>88.2<\/td>\n<\/tr>\n<tr>\n<td>69.4<\/td>\n<td>57.6<\/td>\n<td>69.4<\/td>\n<td>67.1<\/td>\n<td>97.6<\/td>\n<td>85.9<\/td>\n<td>88.2<\/td>\n<td>91.8<\/td>\n<td>78.8<\/td>\n<td>71.8<\/td>\n<\/tr>\n<tr>\n<td>98.8<\/td>\n<td>61.2<\/td>\n<td>92.9<\/td>\n<td>90.6<\/td>\n<td>97.6<\/td>\n<td>100<\/td>\n<td>95.3<\/td>\n<td>83.5<\/td>\n<td>92.9<\/td>\n<td>89.4<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Is the mean of the Final Exam scores of the online class lower than the mean of the Final Exam scores of the face-to-face class? Test at a 5% significance level. Answer the following questions:<\/p>\n<ol>\n<li>Is this a test of two means or two proportions?<\/li>\n<li>Are the population standard deviations known or unknown?<\/li>\n<li>Which distribution do you use to perform the test?<\/li>\n<li>What is the random variable?<\/li>\n<li>What are the null and alternative hypotheses? Write the null and alternative hypotheses in words and in symbols.<\/li>\n<li>Is this test right, left, or two tailed?<\/li>\n<li>What is the <em>p<\/em>-value?<\/li>\n<li>Do you reject or not reject the null hypothesis?<\/li>\n<li>At the ___ level of significance, from the sample data, there ______ (is\/is not) sufficient evidence to conclude that ______.<\/li>\n<\/ol>\n<p>(Review the conclusion in Example 2, and write yours in a similar fashion)<\/p>\n<p>Be careful not to mix up the information for Group 1 and Group 2!<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q984147\">Show Answer<\/span><\/p>\n<div id=\"q984147\" class=\"hidden-answer\" style=\"display: none\">\n<header>\n<h4 class=\"os-title\" data-type=\"title\"><span class=\"os-title-label\">USING THE TI-83, 83+, 84, 84+ CALCULATOR<\/span><\/h4>\n<\/header>\n<p>First put the data for each group into two lists (such as L1 and L2). Press STAT. Arrow over to TESTS and press 4:2SampTTest. Make sure Data is highlighted and press ENTER. Arrow down and enter L1 for the first list and L2 for the second list. Arrow down to<br \/>\n<em>\u03bc<\/em><sub>1<\/sub>: and arrow to \u2260 <em>\u03bc<\/em><sub>2<\/sub> (does not equal). Press ENTER. Arrow down to Pooled: No. Press ENTER. Arrow down to Calculate and press ENTER.<\/p>\n<ol>\n<li>two means<\/li>\n<li>unknown<\/li>\n<li>Student&#8217;s <em>t<\/em><\/li>\n<li>[latex]\\displaystyle\\overline{{X}}_{{1}}-\\overline{{X}}_{{2}}[\/latex]\n<ol>\n<li><em>H<sub data-redactor-tag=\"sub\">0<\/sub><\/em>: <em>\u03bc<sub data-redactor-tag=\"sub\">1<\/sub><\/em> = <em>\u03bc<sub data-redactor-tag=\"sub\">2<\/sub><\/em> Null hypothesis: the means of the final exam scores are equal for the online and face-to-face statistics classes.<\/li>\n<li><em>H<sub data-redactor-tag=\"sub\">a<\/sub><\/em>: <em>\u03bc<sub data-redactor-tag=\"sub\">1<\/sub><\/em> &lt; <em>\u03bc<sub data-redactor-tag=\"sub\">2<\/sub><\/em> Alternative hypothesis: the mean of the final exam scores of the online class is less than the mean of the final exam scores of the face-to-face class.<\/li>\n<\/ol>\n<\/li>\n<li>left-tailed<\/li>\n<li><em>p<\/em>-value = 0.0011<br \/>\n<img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-2114 size-full\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5668\/2021\/07\/01162554\/30b8f681a7471d241b0674f60f2c05602b6ee9c5.jpeg\" alt=\"This is a normal distribution curve with mean equal to zero. A vertical line near the tail of the curve to the left of zero extends from the axis to the curve. The region under the curve to the left of the line is shaded representing p-value = 0.0011.\" width=\"487\" height=\"177\" \/><\/li>\n<li>reject the null hypothesis<\/li>\n<li>The professor was correct. The evidence shows that the mean of the final exam scores for the online class is lower than that of the face-to-face class. At the <span style=\"text-decoration: underline;\">5%<\/span> level of significance, from the sample data, there <span style=\"text-decoration: underline;\">is<\/span> (is\/is not) sufficient evidence to conclude that the mean of the final exam scores for the online class is less than <span style=\"text-decoration: underline;\">the mean of final exam scores of the face-to-face class.<\/span><\/li>\n<\/ol>\n<\/div>\n<\/div>\n<\/div>\n\n\t\t\t <section class=\"citations-section\" role=\"contentinfo\">\n\t\t\t <h3>Candela Citations<\/h3>\n\t\t\t\t\t <div>\n\t\t\t\t\t\t <div id=\"citation-list-287\">\n\t\t\t\t\t\t\t <div class=\"licensing\"><div class=\"license-attribution-dropdown-subheading\">CC licensed content, Shared previously<\/div><ul class=\"citation-list\"><li>Two Population Means with Unknown Standard Deviations. <strong>Provided by<\/strong>: OpenStax. <strong>Located at<\/strong>: <a target=\"_blank\" href=\"https:\/\/openstax.org\/books\/introductory-statistics\/pages\/10-1-two-population-means-with-unknown-standard-deviations\">https:\/\/openstax.org\/books\/introductory-statistics\/pages\/10-1-two-population-means-with-unknown-standard-deviations<\/a>. <strong>License<\/strong>: <em><a target=\"_blank\" rel=\"license\" href=\"https:\/\/creativecommons.org\/licenses\/by\/4.0\/\">CC BY: Attribution<\/a><\/em>. <strong>License Terms<\/strong>: Access for free at https:\/\/openstax.org\/books\/introductory-statistics\/pages\/1-introduction<\/li><li>Introductory Statistics. <strong>Authored by<\/strong>: Barbara Illowsky, Susan Dean. <strong>Provided by<\/strong>: OpenStax. <strong>Located at<\/strong>: <a target=\"_blank\" href=\"https:\/\/openstax.org\/books\/introductory-statistics\/pages\/1-introduction\">https:\/\/openstax.org\/books\/introductory-statistics\/pages\/1-introduction<\/a>. <strong>License<\/strong>: <em><a target=\"_blank\" rel=\"license\" href=\"https:\/\/creativecommons.org\/licenses\/by\/4.0\/\">CC BY: Attribution<\/a><\/em>. <strong>License Terms<\/strong>: Access for free at https:\/\/openstax.org\/books\/introductory-statistics\/pages\/1-introduction<\/li><li>Prealgebra. <strong>Provided by<\/strong>: OpenStax. <strong>Located at<\/strong>: <a target=\"_blank\" href=\"https:\/\/openstax.org\/books\/prealgebra\/pages\/1-introduction\">https:\/\/openstax.org\/books\/prealgebra\/pages\/1-introduction<\/a>. <strong>License<\/strong>: <em><a target=\"_blank\" rel=\"license\" href=\"https:\/\/creativecommons.org\/licenses\/by\/4.0\/\">CC BY: Attribution<\/a><\/em>. <strong>License Terms<\/strong>: Access for free at https:\/\/openstax.org\/books\/prealgebra\/pages\/1-introduction<\/li><\/ul><div class=\"license-attribution-dropdown-subheading\">All rights reserved content<\/div><ul class=\"citation-list\"><li>One-tailed and two-tailed tests | Inferential statistics | Probability and Statistics | Khan Academy. <strong>Authored by<\/strong>: Khan Academy. <strong>Located at<\/strong>: <a target=\"_blank\" href=\"https:\/\/www.youtube.com\/embed\/mvye6X_0upA\">https:\/\/www.youtube.com\/embed\/mvye6X_0upA<\/a>. <strong>License<\/strong>: <em>All Rights Reserved<\/em>. <strong>License Terms<\/strong>: Standard YouTube License<\/li><\/ul><\/div>\n\t\t\t\t\t\t <\/div>\n\t\t\t\t\t <\/div>\n\t\t\t <\/section>","protected":false},"author":169134,"menu_order":7,"template":"","meta":{"_candela_citation":"[{\"type\":\"cc\",\"description\":\"Two Population Means with Unknown Standard Deviations\",\"author\":\"\",\"organization\":\"OpenStax\",\"url\":\"https:\/\/openstax.org\/books\/introductory-statistics\/pages\/10-1-two-population-means-with-unknown-standard-deviations\",\"project\":\"\",\"license\":\"cc-by\",\"license_terms\":\"Access for free at https:\/\/openstax.org\/books\/introductory-statistics\/pages\/1-introduction\"},{\"type\":\"copyrighted_video\",\"description\":\"One-tailed and two-tailed tests | Inferential statistics | Probability and Statistics | Khan Academy\",\"author\":\"Khan Academy\",\"organization\":\"\",\"url\":\"https:\/\/www.youtube.com\/embed\/mvye6X_0upA\",\"project\":\"\",\"license\":\"arr\",\"license_terms\":\"Standard YouTube License\"},{\"type\":\"cc\",\"description\":\"Introductory Statistics\",\"author\":\"Barbara Illowsky, Susan Dean\",\"organization\":\"OpenStax\",\"url\":\"https:\/\/openstax.org\/books\/introductory-statistics\/pages\/1-introduction\",\"project\":\"\",\"license\":\"cc-by\",\"license_terms\":\"Access for free at https:\/\/openstax.org\/books\/introductory-statistics\/pages\/1-introduction\"},{\"type\":\"cc\",\"description\":\"Prealgebra\",\"author\":\"\",\"organization\":\"OpenStax\",\"url\":\"https:\/\/openstax.org\/books\/prealgebra\/pages\/1-introduction\",\"project\":\"\",\"license\":\"cc-by\",\"license_terms\":\"Access for free at https:\/\/openstax.org\/books\/prealgebra\/pages\/1-introduction\"}]","CANDELA_OUTCOMES_GUID":"","pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-287","chapter","type-chapter","status-publish","hentry"],"part":285,"_links":{"self":[{"href":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/wp-json\/pressbooks\/v2\/chapters\/287","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/wp-json\/wp\/v2\/users\/169134"}],"version-history":[{"count":34,"href":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/wp-json\/pressbooks\/v2\/chapters\/287\/revisions"}],"predecessor-version":[{"id":4054,"href":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/wp-json\/pressbooks\/v2\/chapters\/287\/revisions\/4054"}],"part":[{"href":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/wp-json\/pressbooks\/v2\/parts\/285"}],"metadata":[{"href":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/wp-json\/pressbooks\/v2\/chapters\/287\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/wp-json\/wp\/v2\/media?parent=287"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/wp-json\/pressbooks\/v2\/chapter-type?post=287"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/wp-json\/wp\/v2\/contributor?post=287"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/wp-json\/wp\/v2\/license?post=287"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}