{"id":562,"date":"2017-04-15T03:27:32","date_gmt":"2017-04-15T03:27:32","guid":{"rendered":"https:\/\/courses.lumenlearning.com\/conceptstest1\/chapter\/estimating-the-difference-in-two-population-means\/"},"modified":"2022-08-01T16:05:32","modified_gmt":"2022-08-01T16:05:32","slug":"estimating-the-difference-in-two-population-means","status":"publish","type":"chapter","link":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/chapter\/estimating-the-difference-in-two-population-means\/","title":{"raw":"Estimating the Difference in Two Population Means","rendered":"Estimating the Difference in Two Population Means"},"content":{"raw":"<div class=\"textbox learning-objectives\">\r\n<h3>Learning outcomes<\/h3>\r\n<ul>\r\n \t<li>Construct a confidence interval to estimate a difference in two population means (when conditions are met). Interpret the confidence interval in context.<\/li>\r\n<\/ul>\r\n<\/div>\r\n<h2>Confidence Interval to Estimate \u03bc<sub>1<\/sub> \u2212 \u03bc<sub>2<\/sub><\/h2>\r\nIn a hypothesis test, when the sample evidence leads us to reject the null hypothesis, we conclude that the population means differ or that one is larger than the other. An obvious next question is <em>how much larger?<\/em> In practice, when the sample mean difference is statistically significant, our next step is often to calculate a confidence interval to estimate the size of the population mean difference.\r\n\r\nThe confidence interval gives us a range of reasonable values for the difference in population means \u03bc<sub>1<\/sub> \u2212 \u03bc<sub>2<\/sub>. We call this the <em>two-sample T-interval<\/em> or the <em>confidence interval<\/em> to estimate a difference in two population means. 
The form of the confidence interval is similar to others we have seen.\r\n<p style=\"text-align: center;\">[latex]\\begin{array}{l}(\\mathrm{sample}\\text{}\\mathrm{statistic})\\text{}&amp;PlusMinus;\\text{}(\\mathrm{margin}\\text{}\\mathrm{of}\\text{}\\mathrm{error})\\\\ (\\mathrm{sample}\\text{}\\mathrm{statistic})\\text{}&amp;PlusMinus;\\text{}(\\mathrm{critical}\\text{}\\mathrm{T-value})(\\mathrm{standard}\\text{}\\mathrm{error})\\end{array}[\/latex]<\/p>\r\n\r\n<h3><strong>Sample Statistic<\/strong><\/h3>\r\nSince we\u2019re estimating the difference between two population means, the sample statistic is the difference between the means of the two independent samples: [latex]{\\stackrel{\u00af}{x}}_{1}-{\\stackrel{\u00af}{x}}_{2}[\/latex].\r\n<h3><strong>Critical T-Value<\/strong><\/h3>\r\nThe critical T-value comes from the T-model, just as it did in \"Estimating a Population Mean.\" Again, this value depends on the degrees of freedom (<em>df<\/em>). For two-sample T-tests and two-sample T-intervals, the <em>df<\/em> value is based on a complicated formula that we do not cover in this course. We either give the <em>df<\/em> or use technology to find the <em>df<\/em>.\r\n<h3><strong>Standard Error<\/strong><\/h3>\r\nThe estimated standard error for the two-sample T-interval uses the same formula we used for the two-sample T-test. 
(As usual, <em>s<\/em><sub>1<\/sub> and <em>s<\/em><sub>2<\/sub> denote the sample standard deviations, and <em>n<\/em><sub>1<\/sub> and <em>n<\/em><sub>2<\/sub> denote the sample sizes.)\r\n<p style=\"text-align: center;\">[latex]\\sqrt{\\frac{{{s}_{1}}^{2}}{{n}_{1}}+\\frac{{{s}_{2}}^{2}}{{n}_{2}}}[\/latex]<\/p>\r\nPutting all this together gives us the following formula for the two-sample T-interval.\r\n<p style=\"text-align: center;\">[latex]({\\stackrel{\u00af}{x}}_{1}\\text{\u2212}{\\stackrel{\u00af}{x}}_{2})\\text{}&amp;PlusMinus;\\text{}{T}_{c}\\text{}\u22c5\\text{}\\sqrt{\\frac{{{s}_{1}}^{2}}{{n}_{1}}+\\frac{{{s}_{2}}^{2}}{{n}_{2}}}[\/latex]<\/p>\r\n\r\n<h3><strong>Conditions for Use<\/strong><\/h3>\r\nThe conditions for using this two-sample T-interval are the same as the conditions for using the two-sample T-test.\r\n<ul>\r\n \t<li>The two <em>random <\/em>samples are <em>independent<\/em> and <em>representative.<\/em><\/li>\r\n \t<li>The variable is <em>normally distributed in both populations<\/em>. If it is not known, <em>samples of more than 30 <\/em>will have a difference in sample means that can be modeled adequately by the T-distribution. As we discussed in \"Hypothesis Test for a Population Mean,\" T-procedures are robust even when the variable is not normally distributed in the population. If checking normality in the populations is impossible, then we look at the distribution in the samples. If a histogram or dotplot of the data does not show extreme skew or outliers, we take it as a sign that the variable is not heavily skewed in the populations, and we use the inference procedure.<\/li>\r\n<\/ul>\r\n<div class=\"textbox exercises\">\r\n<h3>Example<\/h3>\r\n<h2>Confidence Interval for the \u201cCalories and Context\u201d Study<\/h2>\r\nIn the preceding few pages, we worked through a two-sample T-test for the \u201ccalories and context\u201d example. 
In this example, we use the sample data to find a two-sample T-interval for \u03bc<sub>1<\/sub> \u2212 \u03bc<sub>2<\/sub> at the 90% confidence level.\r\n\r\n<strong>Recap of the Situation<\/strong>\r\n<ul>\r\n \t<li>Population 1: Let \u03bc<sub>1<\/sub> be the mean number of calories purchased by women eating with other women.<\/li>\r\n \t<li>Population 2: Let \u03bc<sub>2<\/sub> be the mean number of calories purchased by women eating with men.<\/li>\r\n<\/ul>\r\n<strong>Sample Statistics<\/strong>\r\n<table>\r\n<tbody>\r\n<tr class=\"oli_table\">\r\n<th align=\"center\"><\/th>\r\n<th align=\"center\">Size (n)<\/th>\r\n<th align=\"center\">[latex]\\mathrm{Mean}\\text{}(\\stackrel{\u00af}{x})[\/latex]<\/th>\r\n<th align=\"center\">SD (s)<\/th>\r\n<\/tr>\r\n<tr class=\"oli_table\">\r\n<th align=\"center\">Sample 1<\/th>\r\n<td align=\"center\">45<\/td>\r\n<td align=\"center\">850<\/td>\r\n<td align=\"center\">252<\/td>\r\n<\/tr>\r\n<tr class=\"oli_table\">\r\n<th align=\"center\">Sample 2<\/th>\r\n<td align=\"center\">27<\/td>\r\n<td align=\"center\">719<\/td>\r\n<td align=\"center\">322<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<strong>Standard Error<\/strong>\r\n\r\nWe found that the estimated standard error of the difference in sample means is approximately 72.47.\r\n<p style=\"text-align: center;\">[latex]\\sqrt{\\frac{{{s}_{1}}^{2}}{{n}_{1}}+\\frac{{{s}_{2}}^{2}}{{n}_{2}}}\\text{}=\\text{}\\sqrt{\\frac{{252}^{2}}{45}+\\frac{{322}^{2}}{27}}\\text{}\\approx \\text{}72.47[\/latex]<\/p>\r\n<strong>Critical T-value<\/strong>\r\n\r\nFor these two independent samples, <em>df<\/em> = 45. 
We find the critical T-value using the same simulation we used in \"Estimating a Population Mean.\"\r\n\r\n<img class=\"alignnone\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/1729\/2017\/04\/15032728\/m10_inference_mean_topic_10_4_m10_est_diff_two_pop_means_1_image7.png\" alt=\"For a 90% confidence interval with df = 45, the critical T-value = 1.6790.\" width=\"499\" height=\"296\" \/>\r\n\r\nReading from the simulation, we see that the critical T-value is 1.6790.\r\n\r\n<strong>Confidence Interval<\/strong>\r\n\r\nWe can now put all this together to compute the confidence interval:\r\n<p style=\"text-align: center;\">[latex]({\\stackrel{\u00af}{x}}_{1}-{\\stackrel{\u00af}{x}}_{2})\\text{}&amp;PlusMinus;\\text{}{T}_{c}\\text{}\u22c5\\text{}\\mathrm{SE}\\text{}=\\text{}(850-719)\\text{}&amp;PlusMinus;\\text{}(1.6790)(72.47)\\text{}\\approx \\text{}131\\text{}&amp;PlusMinus;\\text{}122[\/latex]<\/p>\r\nExpressing this as an interval gives us:\r\n<p style=\"text-align: center;\">[latex](\\mathrm{9,\\; 253})[\/latex]<\/p>\r\n<strong>Interpretation<\/strong>\r\n\r\nWe are 90% confident that the true value of \u03bc<sub>1<\/sub> \u2212 \u03bc<sub>2<\/sub> is between 9 and 253 calories. We can be more specific about the populations. We are 90% confident that at Indiana University of Pennsylvania, undergraduate women eating with women order, on average, between 9.32 and 252.68 more calories than undergraduate women eating with men.\r\n\r\n<\/div>\r\nIn this next activity, we focus on interpreting confidence intervals and evaluating a statistics project conducted by students in an introductory statistics course.\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Try It<\/h3>\r\n<h2>Improving Children\u2019s Math Skills<\/h2>\r\nStudents in an introductory statistics course at Los Medanos College designed an experiment to study the impact of subliminal messages on improving children\u2019s math skills. 
The students were inspired by a similar study at City University of New York, as described in David Moore\u2019s textbook <em>The Basic Practice of Statistics<\/em> <cite>(4th ed., W. H. Freeman, 2007)<\/cite>. The participants were 11 children who attended an afterschool tutoring program at a local church. The children ranged in age from 8 to 11. All received tutoring in arithmetic skills. At the beginning of each tutoring session, the children watched a short video with a religious message that ended with a promotional message for the church.\r\n\r\nThe statistics students added a slide that said, \u201cI work hard and I am good at math.\u201d This slide flashed quickly during the promotional message, so quickly that no one was aware of the slide. Children who attended the tutoring sessions on Mondays watched the video with the extra slide. Children who attended the tutoring sessions on Wednesday watched the video without the extra slide. The experiment lasted 4 weeks. The children took a pretest and posttest in arithmetic. Here are some of the results:\r\n\r\n<img class=\"alignnone\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/1729\/2017\/04\/15032731\/m10_inference_mean_topic_10_4_m10_est_diff_two_pop_means_1_image10.png\" alt=\"Table of means and standard deviations for the treatment group, the control group, and the overall sample\" width=\"679\" height=\"186\" \/>https:\/\/assess.lumenlearning.com\/practice\/10bbd676-7ed8-476f-897b-43ac6076b4d2\r\n\r\nhttps:\/\/assess.lumenlearning.com\/practice\/d3b4eb57-4545-435c-aad2-74708a8de739\r\n\r\nhttps:\/\/assess.lumenlearning.com\/practice\/d9349a45-af23-4c1f-9db7-e0c6791c95a0\r\n\r\n<\/div>\r\n<h2><strong>Let\u2019s Summarize<\/strong><\/h2>\r\nHypothesis tests and confidence intervals for two means can answer research questions about two populations or two treatments that involve quantitative data. 
In \"Inference for a Difference between Population Means,\" we focused on studies that produced two independent samples. Previously, in \"Hypothesis Test for a Population Mean,\" we looked at matched-pairs studies in which individual data points in one sample are naturally paired with the individual data points in the other sample.\r\n\r\nThe hypotheses for two population means are similar to those for two population proportions.\r\n\r\nThe null hypothesis, H<sub>0<\/sub>, is a statement of \u201cno effect\u201d or \u201cno difference.\u201d\r\n<ul style=\"list-style-type: none;\">\r\n \t<li>H<sub>0<\/sub>: \u03bc<sub>1<\/sub> - \u03bc<sub>2<\/sub> = 0, which is the same as H<sub>0<\/sub>: \u03bc<sub>1<\/sub> = \u03bc<sub>2<\/sub><\/li>\r\n<\/ul>\r\nThe alternative hypothesis, H<sub>a<\/sub>, takes one of the following three forms:\r\n<ul style=\"list-style-type: none;\">\r\n \t<li>H<sub>a<\/sub>: \u03bc<sub>1<\/sub> - \u03bc<sub>2<\/sub> &lt; 0, which is the same as H<sub>a<\/sub>: \u03bc<sub>1<\/sub> &lt; \u03bc<sub>2<\/sub><\/li>\r\n \t<li>H<sub>a<\/sub>: \u03bc<sub>1<\/sub> - \u03bc<sub>2<\/sub> &gt; 0, which is the same as H<sub>a<\/sub>: \u03bc<sub>1<\/sub> &gt; \u03bc<sub>2<\/sub><\/li>\r\n \t<li>H<sub>a<\/sub>: \u03bc<sub>1<\/sub> - \u03bc<sub>2<\/sub> \u2260 0, which is the same as H<sub>a<\/sub>: \u03bc<sub>1<\/sub> \u2260 \u03bc<sub>2<\/sub><\/li>\r\n<\/ul>\r\nAs usual, how we collect the data determines whether we can use it in the inference procedure. We have our usual two requirements for data collection.\r\n<ul>\r\n \t<li>Samples must be random in order to remove or minimize bias.<\/li>\r\n \t<li>Samples must be representative of the population in question.<\/li>\r\n<\/ul>\r\nWe use the two-sample hypothesis test and confidence interval when the following conditions are met:\r\n<ul>\r\n \t<li>The two random samples are independent.<\/li>\r\n \t<li>The variable is normally distributed in both populations. 
If this is not known, samples of more than 30 will have a difference in sample means that can be modeled adequately by the t-distribution. As we discussed in \"Hypothesis Test for a Population Mean,\" t-procedures are robust even when the variable is not normally distributed in the population. Therefore, if checking normality in the populations is impossible, then we look at the distribution in the samples. If a histogram or dotplot of the data does not show extreme skew or outliers, we take it as a sign that the variable is not heavily skewed in the populations, and we use the inference procedure.<\/li>\r\n<\/ul>\r\n<h3><strong>Formulas:<\/strong><\/h3>\r\nThe confidence interval for \u03bc<sub>1<\/sub> \u2212 \u03bc<sub>2<\/sub> is\r\n<p style=\"text-align: center;\">[latex]({\\stackrel{\u00af}{x}}_{1}\\text{}\\text{\u2212}\\text{}{\\stackrel{\u00af}{x}}_{2})\\text{}&amp;PlusMinus;\\text{}{T}_{c}\\text{}\u22c5\\text{}\\sqrt{\\frac{{{s}_{1}}^{2}}{{n}_{1}}+\\frac{{{s}_{2}}^{2}}{{n}_{2}}}[\/latex]<\/p>\r\nHypothesis test for H<sub>0<\/sub>: \u03bc<sub>1<\/sub> - \u03bc<sub>2<\/sub> = 0 is\r\n<p style=\"text-align: center;\">[latex]T\\text{}=\\text{}\\frac{(\\mathrm{Observed}\\text{}\\mathrm{difference}\\text{}\\mathrm{in}\\text{}\\mathrm{sample}\\text{}\\mathrm{means})\\text{}-\\text{}(\\mathrm{Hypothesized}\\text{}\\mathrm{difference}\\text{}\\mathrm{in}\\text{}\\mathrm{population}\\text{}\\mathrm{means})}{\\mathrm{Standard}\\text{}\\mathrm{error}}[\/latex]<\/p>\r\n<p style=\"text-align: center;\">[latex]T\\text{}=\\text{}\\frac{({\\stackrel{\u00af}{x}}_{1}-{\\stackrel{\u00af}{x}}_{2})\\text{}-\\text{}({\u03bc}_{1}-{\u03bc}_{2})}{\\sqrt{\\frac{{{s}_{1}}^{2}}{{n}_{1}}+\\frac{{{s}_{2}}^{2}}{{n}_{2}}}}[\/latex]<\/p>\r\nWe use technology to find the degrees of freedom to determine P-values and critical t-values for confidence intervals. 
(In most problems in this section, we provided the degrees of freedom for you.)\r\n<h2>Contribute!<\/h2><div style=\"margin-bottom: 8px;\">Did you have an idea for improving this content? We\u2019d love your input.<\/div><a href=\"https:\/\/docs.google.com\/document\/d\/1Uu0eAxvoEkrIyjvNa7XSGIHtGO3Wp8qFjwF_dUZ1YTU\" target=\"_blank\" style=\"font-size: 10pt; font-weight: 600; color: #077fab; text-decoration: none; border: 2px solid #077fab; border-radius: 7px; padding: 5px 25px; text-align: center; cursor: pointer; line-height: 1.5em;\">Improve this page<\/a><a style=\"margin-left: 16px;\" target=\"_blank\" href=\"https:\/\/docs.google.com\/document\/d\/1vy-T6DtTF-BbMfpVEI7VP_R7w2A4anzYZLXR8Pk4Fu4\">Learn More<\/a>","rendered":"<div class=\"textbox learning-objectives\">\n<h3>Learning outcomes<\/h3>\n<ul>\n<li>Construct a confidence interval to estimate a difference in two population means (when conditions are met). Interpret the confidence interval in context.<\/li>\n<\/ul>\n<\/div>\n<h2>Confidence Interval to Estimate \u03bc<sub>1<\/sub> \u2212 \u03bc<sub>2<\/sub><\/h2>\n<p>In a hypothesis test, when the sample evidence leads us to reject the null hypothesis, we conclude that the population means differ or that one is larger than the other. An obvious next question is <em>how much larger?<\/em> In practice, when the sample mean difference is statistically significant, our next step is often to calculate a confidence interval to estimate the size of the population mean difference.<\/p>\n<p>The confidence interval gives us a range of reasonable values for the difference in population means \u03bc<sub>1<\/sub> \u2212 \u03bc<sub>2<\/sub>. We call this the <em>two-sample T-interval<\/em> or the <em>confidence interval<\/em> to estimate a difference in two population means. 
The form of the confidence interval is similar to others we have seen.<\/p>\n<p style=\"text-align: center;\">[latex]\\begin{array}{l}(\\mathrm{sample}\\text{}\\mathrm{statistic})\\text{}&PlusMinus;\\text{}(\\mathrm{margin}\\text{}\\mathrm{of}\\text{}\\mathrm{error})\\\\ (\\mathrm{sample}\\text{}\\mathrm{statistic})\\text{}&PlusMinus;\\text{}(\\mathrm{critical}\\text{}\\mathrm{T-value})(\\mathrm{standard}\\text{}\\mathrm{error})\\end{array}[\/latex]<\/p>\n<h3><strong>Sample Statistic<\/strong><\/h3>\n<p>Since we\u2019re estimating the difference between two population means, the sample statistic is the difference between the means of the two independent samples: [latex]{\\stackrel{\u00af}{x}}_{1}-{\\stackrel{\u00af}{x}}_{2}[\/latex].<\/p>\n<h3><strong>Critical T-Value<\/strong><\/h3>\n<p>The critical T-value comes from the T-model, just as it did in &#8220;Estimating a Population Mean.&#8221; Again, this value depends on the degrees of freedom (<em>df<\/em>). For two-sample T-tests and two-sample T-intervals, the <em>df<\/em> value is based on a complicated formula that we do not cover in this course. We either give the <em>df<\/em> or use technology to find the <em>df<\/em>.<\/p>\n<h3><strong>Standard Error<\/strong><\/h3>\n<p>The estimated standard error for the two-sample T-interval uses the same formula we used for the two-sample T-test. 
(As usual, <em>s<\/em><sub>1<\/sub> and <em>s<\/em><sub>2<\/sub> denote the sample standard deviations, and <em>n<\/em><sub>1<\/sub> and <em>n<\/em><sub>2<\/sub> denote the sample sizes.)<\/p>\n<p style=\"text-align: center;\">[latex]\\sqrt{\\frac{{{s}_{1}}^{2}}{{n}_{1}}+\\frac{{{s}_{2}}^{2}}{{n}_{2}}}[\/latex]<\/p>\n<p>Putting all this together gives us the following formula for the two-sample T-interval.<\/p>\n<p style=\"text-align: center;\">[latex]({\\stackrel{\u00af}{x}}_{1}\\text{\u2212}{\\stackrel{\u00af}{x}}_{2})\\text{}&PlusMinus;\\text{}{T}_{c}\\text{}\u22c5\\text{}\\sqrt{\\frac{{{s}_{1}}^{2}}{{n}_{1}}+\\frac{{{s}_{2}}^{2}}{{n}_{2}}}[\/latex]<\/p>\n<h3><strong>Conditions for Use<\/strong><\/h3>\n<p>The conditions for using this two-sample T-interval are the same as the conditions for using the two-sample T-test.<\/p>\n<ul>\n<li>The two <em>random <\/em>samples are <em>independent<\/em> and <em>representative.<\/em><\/li>\n<li>The variable is <em>normally distributed in both populations<\/em>. If it is not known, <em>samples of more than 30 <\/em>will have a difference in sample means that can be modeled adequately by the T-distribution. As we discussed in &#8220;Hypothesis Test for a Population Mean,&#8221; T-procedures are robust even when the variable is not normally distributed in the population. If checking normality in the populations is impossible, then we look at the distribution in the samples. If a histogram or dotplot of the data does not show extreme skew or outliers, we take it as a sign that the variable is not heavily skewed in the populations, and we use the inference procedure.<\/li>\n<\/ul>\n<div class=\"textbox exercises\">\n<h3>Example<\/h3>\n<h2>Confidence Interval for the \u201cCalories and Context\u201d Study<\/h2>\n<p>In the preceding few pages, we worked through a two-sample T-test for the \u201ccalories and context\u201d example. 
In this example, we use the sample data to find a two-sample T-interval for \u03bc<sub>1<\/sub> \u2212 \u03bc<sub>2<\/sub> at the 90% confidence level.<\/p>\n<p><strong>Recap of the Situation<\/strong><\/p>\n<ul>\n<li>Population 1: Let \u03bc<sub>1<\/sub> be the mean number of calories purchased by women eating with other women.<\/li>\n<li>Population 2: Let \u03bc<sub>2<\/sub> be the mean number of calories purchased by women eating with men.<\/li>\n<\/ul>\n<p><strong>Sample Statistics<\/strong><\/p>\n<table>\n<tbody>\n<tr class=\"oli_table\">\n<th align=\"center\"><\/th>\n<th align=\"center\">Size (n)<\/th>\n<th align=\"center\">[latex]\\mathrm{Mean}\\text{}(\\stackrel{\u00af}{x})[\/latex]<\/th>\n<th align=\"center\">SD (s)<\/th>\n<\/tr>\n<tr class=\"oli_table\">\n<th align=\"center\">Sample 1<\/th>\n<td align=\"center\">45<\/td>\n<td align=\"center\">850<\/td>\n<td align=\"center\">252<\/td>\n<\/tr>\n<tr class=\"oli_table\">\n<th align=\"center\">Sample 2<\/th>\n<td align=\"center\">27<\/td>\n<td align=\"center\">719<\/td>\n<td align=\"center\">322<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><strong>Standard Error<\/strong><\/p>\n<p>We found that the estimated standard error of the difference in sample means is approximately 72.47.<\/p>\n<p style=\"text-align: center;\">[latex]\\sqrt{\\frac{{{s}_{1}}^{2}}{{n}_{1}}+\\frac{{{s}_{2}}^{2}}{{n}_{2}}}\\text{}=\\text{}\\sqrt{\\frac{{252}^{2}}{45}+\\frac{{322}^{2}}{27}}\\text{}\\approx \\text{}72.47[\/latex]<\/p>\n<p><strong>Critical T-value<\/strong><\/p>\n<p>For these two independent samples, <em>df<\/em> = 45. 
We find the critical T-value using the same simulation we used in &#8220;Estimating a Population Mean.&#8221;<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/1729\/2017\/04\/15032728\/m10_inference_mean_topic_10_4_m10_est_diff_two_pop_means_1_image7.png\" alt=\"For a 90% confidence interval with df = 45, the critical T-value = 1.6790.\" width=\"499\" height=\"296\" \/><\/p>\n<p>Reading from the simulation, we see that the critical T-value is 1.6790.<\/p>\n<p><strong>Confidence Interval<\/strong><\/p>\n<p>We can now put all this together to compute the confidence interval:<\/p>\n<p style=\"text-align: center;\">[latex]({\\stackrel{\u00af}{x}}_{1}-{\\stackrel{\u00af}{x}}_{2})\\text{}&PlusMinus;\\text{}{T}_{c}\\text{}\u22c5\\text{}\\mathrm{SE}\\text{}=\\text{}(850-719)\\text{}&PlusMinus;\\text{}(1.6790)(72.47)\\text{}\\approx \\text{}131\\text{}&PlusMinus;\\text{}122[\/latex]<\/p>\n<p>Expressing this as an interval gives us:<\/p>\n<p style=\"text-align: center;\">[latex](\\mathrm{9,\\; 253})[\/latex]<\/p>\n<p><strong>Interpretation<\/strong><\/p>\n<p>We are 90% confident that the true value of \u03bc<sub>1<\/sub> \u2212 \u03bc<sub>2<\/sub> is between 9 and 253 calories. We can be more specific about the populations. 
We are 90% confident that at Indiana University of Pennsylvania, undergraduate women eating with women order, on average, between 9.32 and 252.68 more calories than undergraduate women eating with men.<\/p>\n<\/div>\n<p>In this next activity, we focus on interpreting confidence intervals and evaluating a statistics project conducted by students in an introductory statistics course.<\/p>\n<div class=\"textbox key-takeaways\">\n<h3>Try It<\/h3>\n<h2>Improving Children\u2019s Math Skills<\/h2>\n<p>Students in an introductory statistics course at Los Medanos College designed an experiment to study the impact of subliminal messages on improving children\u2019s math skills. The students were inspired by a similar study at City University of New York, as described in David Moore\u2019s textbook <em>The Basic Practice of Statistics<\/em> <cite>(4th ed., W. H. Freeman, 2007)<\/cite>. The participants were 11 children who attended an afterschool tutoring program at a local church. The children ranged in age from 8 to 11. All received tutoring in arithmetic skills. At the beginning of each tutoring session, the children watched a short video with a religious message that ended with a promotional message for the church.<\/p>\n<p>The statistics students added a slide that said, \u201cI work hard and I am good at math.\u201d This slide flashed quickly during the promotional message, so quickly that no one was aware of the slide. Children who attended the tutoring sessions on Mondays watched the video with the extra slide. Children who attended the tutoring sessions on Wednesday watched the video without the extra slide. The experiment lasted 4 weeks. The children took a pretest and posttest in arithmetic. 
Here are some of the results:<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/1729\/2017\/04\/15032731\/m10_inference_mean_topic_10_4_m10_est_diff_two_pop_means_1_image10.png\" alt=\"Table of means and standard deviations for the treatment group, the control group, and the overall sample\" width=\"679\" height=\"186\" \/>https:\/\/assess.lumenlearning.com\/practice\/10bbd676-7ed8-476f-897b-43ac6076b4d2<\/p>\n<p>\t<iframe id=\"assessment_practice_d3b4eb57-4545-435c-aad2-74708a8de739\" class=\"resizable\" src=\"https:\/\/assess.lumenlearning.com\/practice\/d3b4eb57-4545-435c-aad2-74708a8de739?iframe_resize_id=assessment_practice_id_d3b4eb57-4545-435c-aad2-74708a8de739\" frameborder=\"0\" style=\"border:none;width:100%;height:100%;min-height:300px;\"><br \/>\n\t<\/iframe><\/p>\n<p>\t<iframe id=\"assessment_practice_d9349a45-af23-4c1f-9db7-e0c6791c95a0\" class=\"resizable\" src=\"https:\/\/assess.lumenlearning.com\/practice\/d9349a45-af23-4c1f-9db7-e0c6791c95a0?iframe_resize_id=assessment_practice_id_d9349a45-af23-4c1f-9db7-e0c6791c95a0\" frameborder=\"0\" style=\"border:none;width:100%;height:100%;min-height:300px;\"><br \/>\n\t<\/iframe><\/p>\n<\/div>\n<h2><strong>Let\u2019s Summarize<\/strong><\/h2>\n<p>Hypothesis tests and confidence intervals for two means can answer research questions about two populations or two treatments that involve quantitative data. In &#8220;Inference for a Difference between Population Means,&#8221; we focused on studies that produced two independent samples. 
Previously, in &#8220;Hypothesis Test for a Population Mean,&#8221; we looked at matched-pairs studies in which individual data points in one sample are naturally paired with the individual data points in the other sample.<\/p>\n<p>The hypotheses for two population means are similar to those for two population proportions.<\/p>\n<p>The null hypothesis, H<sub>0<\/sub>, is a statement of \u201cno effect\u201d or \u201cno difference.\u201d<\/p>\n<ul style=\"list-style-type: none;\">\n<li>H<sub>0<\/sub>: \u03bc<sub>1<\/sub> &#8211; \u03bc<sub>2<\/sub> = 0, which is the same as H<sub>0<\/sub>: \u03bc<sub>1<\/sub> = \u03bc<sub>2<\/sub><\/li>\n<\/ul>\n<p>The alternative hypothesis, H<sub>a<\/sub>, takes one of the following three forms:<\/p>\n<ul style=\"list-style-type: none;\">\n<li>H<sub>a<\/sub>: \u03bc<sub>1<\/sub> &#8211; \u03bc<sub>2<\/sub> &lt; 0, which is the same as H<sub>a<\/sub>: \u03bc<sub>1<\/sub> &lt; \u03bc<sub>2<\/sub><\/li>\n<li>H<sub>a<\/sub>: \u03bc<sub>1<\/sub> &#8211; \u03bc<sub>2<\/sub> &gt; 0, which is the same as H<sub>a<\/sub>: \u03bc<sub>1<\/sub> &gt; \u03bc<sub>2<\/sub><\/li>\n<li>H<sub>a<\/sub>: \u03bc<sub>1<\/sub> &#8211; \u03bc<sub>2<\/sub> \u2260 0, which is the same as H<sub>a<\/sub>: \u03bc<sub>1<\/sub> \u2260 \u03bc<sub>2<\/sub><\/li>\n<\/ul>\n<p>As usual, how we collect the data determines whether we can use it in the inference procedure. We have our usual two requirements for data collection.<\/p>\n<ul>\n<li>Samples must be random in order to remove or minimize bias.<\/li>\n<li>Samples must be representative of the population in question.<\/li>\n<\/ul>\n<p>We use the two-sample hypothesis test and confidence interval when the following conditions are met:<\/p>\n<ul>\n<li>The two random samples are independent.<\/li>\n<li>The variable is normally distributed in both populations. If this is not known, samples of more than 30 will have a difference in sample means that can be modeled adequately by the t-distribution. 
As we discussed in &#8220;Hypothesis Test for a Population Mean,&#8221; t-procedures are robust even when the variable is not normally distributed in the population. Therefore, if checking normality in the populations is impossible, then we look at the distribution in the samples. If a histogram or dotplot of the data does not show extreme skew or outliers, we take it as a sign that the variable is not heavily skewed in the populations, and we use the inference procedure.<\/li>\n<\/ul>\n<h3><strong>Formulas:<\/strong><\/h3>\n<p>The confidence interval for \u03bc<sub>1<\/sub> \u2212 \u03bc<sub>2<\/sub> is<\/p>\n<p style=\"text-align: center;\">[latex]({\\stackrel{\u00af}{x}}_{1}\\text{}\\text{\u2212}\\text{}{\\stackrel{\u00af}{x}}_{2})\\text{}&PlusMinus;\\text{}{T}_{c}\\text{}\u22c5\\text{}\\sqrt{\\frac{{{s}_{1}}^{2}}{{n}_{1}}+\\frac{{{s}_{2}}^{2}}{{n}_{2}}}[\/latex]<\/p>\n<p>Hypothesis test for H<sub>0<\/sub>: \u03bc<sub>1<\/sub> &#8211; \u03bc<sub>2<\/sub> = 0 is<\/p>\n<p style=\"text-align: center;\">[latex]T\\text{}=\\text{}\\frac{(\\mathrm{Observed}\\text{}\\mathrm{difference}\\text{}\\mathrm{in}\\text{}\\mathrm{sample}\\text{}\\mathrm{means})\\text{}-\\text{}(\\mathrm{Hypothesized}\\text{}\\mathrm{difference}\\text{}\\mathrm{in}\\text{}\\mathrm{population}\\text{}\\mathrm{means})}{\\mathrm{Standard}\\text{}\\mathrm{error}}[\/latex]<\/p>\n<p style=\"text-align: center;\">[latex]T\\text{}=\\text{}\\frac{({\\stackrel{\u00af}{x}}_{1}-{\\stackrel{\u00af}{x}}_{2})\\text{}-\\text{}({\u03bc}_{1}-{\u03bc}_{2})}{\\sqrt{\\frac{{{s}_{1}}^{2}}{{n}_{1}}+\\frac{{{s}_{2}}^{2}}{{n}_{2}}}}[\/latex]<\/p>\n<p>We use technology to find the degrees of freedom to determine P-values and critical t-values for confidence intervals. (In most problems in this section, we provided the degrees of freedom for you.)<\/p>\n<h2>Contribute!<\/h2>\n<div style=\"margin-bottom: 8px;\">Did you have an idea for improving this content? 
We\u2019d love your input.<\/div>\n<p><a href=\"https:\/\/docs.google.com\/document\/d\/1Uu0eAxvoEkrIyjvNa7XSGIHtGO3Wp8qFjwF_dUZ1YTU\" target=\"_blank\" style=\"font-size: 10pt; font-weight: 600; color: #077fab; text-decoration: none; border: 2px solid #077fab; border-radius: 7px; padding: 5px 25px; text-align: center; cursor: pointer; line-height: 1.5em;\">Improve this page<\/a><a style=\"margin-left: 16px;\" target=\"_blank\" href=\"https:\/\/docs.google.com\/document\/d\/1vy-T6DtTF-BbMfpVEI7VP_R7w2A4anzYZLXR8Pk4Fu4\">Learn More<\/a><\/p>\n\n\t\t\t <section class=\"citations-section\" role=\"contentinfo\">\n\t\t\t <h3>Candela Citations<\/h3>\n\t\t\t\t\t <div>\n\t\t\t\t\t\t <div id=\"citation-list-562\">\n\t\t\t\t\t\t\t <div class=\"licensing\"><div class=\"license-attribution-dropdown-subheading\">CC licensed content, Shared previously<\/div><ul class=\"citation-list\"><li>Concepts in Statistics. <strong>Provided by<\/strong>: Open Learning Initiative. <strong>Located at<\/strong>: <a target=\"_blank\" href=\"http:\/\/oli.cmu.edu\">http:\/\/oli.cmu.edu<\/a>. 
<strong>License<\/strong>: <em><a target=\"_blank\" rel=\"license\" href=\"https:\/\/creativecommons.org\/licenses\/by\/4.0\/\">CC BY: Attribution<\/a><\/em><\/li><\/ul><\/div>\n\t\t\t\t\t\t <\/div>\n\t\t\t\t\t <\/div>\n\t\t\t <\/section>","protected":false},"author":163,"menu_order":21,"template":"","meta":{"_candela_citation":"[{\"type\":\"cc\",\"description\":\"Concepts in Statistics\",\"author\":\"\",\"organization\":\"Open Learning Initiative\",\"url\":\"http:\/\/oli.cmu.edu\",\"project\":\"\",\"license\":\"cc-by\",\"license_terms\":\"\"}]","CANDELA_OUTCOMES_GUID":"d6f3bc4b-4e7a-48a0-b7f7-3a39bb769d8d, adbcb690-a047-4fa3-9228-ae785aca45ab","pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-562","chapter","type-chapter","status-publish","hentry"],"part":474,"_links":{"self":[{"href":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/wp-json\/pressbooks\/v2\/chapters\/562","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/wp-json\/wp\/v2\/users\/163"}],"version-history":[{"count":9,"href":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/wp-json\/pressbooks\/v2\/chapters\/562\/revisions"}],"predecessor-version":[{"id":2818,"href":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/wp-json\/pressbooks\/v2\/chapters\/562\/revisions\/2818"}],"part":[{"href":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/wp-json\/pressbooks\/v2\/parts\/474"}],"metadata":[{"href":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/wp-json\/pressbooks\/v2\/chapters\/562\/metadata\/"}],"wp:attachment":[{"
href":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/wp-json\/wp\/v2\/media?parent=562"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/wp-json\/pressbooks\/v2\/chapter-type?post=562"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/wp-json\/wp\/v2\/contributor?post=562"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/wp-json\/wp\/v2\/license?post=562"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}