{"id":266,"date":"2016-04-21T22:43:41","date_gmt":"2016-04-21T22:43:41","guid":{"rendered":"https:\/\/courses.lumenlearning.com\/introstats1xmaster\/?post_type=chapter&#038;p=266"},"modified":"2017-10-18T21:18:16","modified_gmt":"2017-10-18T21:18:16","slug":"the-central-limit-theorem-for-sample-means-averages","status":"publish","type":"chapter","link":"https:\/\/courses.lumenlearning.com\/suny-fmcc-introstats1\/chapter\/the-central-limit-theorem-for-sample-means-averages\/","title":{"raw":"The Central Limit Theorem for Sample Means (Averages)","rendered":"The Central Limit Theorem for Sample Means (Averages)"},"content":{"raw":"<div class=\"textbox learning-objectives\">\r\n<h3>Learning Objectives<\/h3>\r\nSuppose\u00a0<em>X<\/em> is a random variable with a distribution that may be known or unknown (it can be any distribution). Using a subscript that matches the random variable, suppose:\r\n<ol>\r\n \t<li><em>\u03bc<\/em><em><sub data-redactor-tag=\"sub\">X<\/sub><\/em> = the mean of <em>X<\/em><\/li>\r\n \t<li><em>\u03c3<\/em><em><sub data-redactor-tag=\"sub\">X<\/sub><\/em> = the standard deviation of <em>X<\/em><\/li>\r\n<\/ol>\r\nIf you draw random samples of size\u00a0<em>n<\/em>, then as <em>n<\/em> increases, the random variable [latex]\\displaystyle\\overline{{X}}[\/latex].\r\n\r\n<\/div>\r\n&nbsp;\r\n\r\nThe\u00a0<strong>central limit theorem<\/strong> for sample means says that if you keep drawing larger and larger samples (such as rolling one, two, five, and finally, ten dice) and <strong>calculating their means<\/strong>, the sample means form their own <strong>normal distribution<\/strong> (the sampling distribution). The normal distribution has the same mean as the original distribution and a variance that equals the original variance divided by, the sample size. The variable <em>n<\/em> is the number of values that are averaged together, not the number of times the experiment is done.\r\n\r\nTo put it more formally, if you draw random samples of size\u00a0<em>n<\/em>, the distribution of the random variable [latex]\\displaystyle\\overline{{X}}[\/latex], which consists of sample means, is called the <strong>sampling distribution of the mean<\/strong>. The sampling distribution of the mean approaches a normal distribution as <em>n<\/em>, the <strong>sample size<\/strong>, increases.\r\n\r\nThe random variable\u00a0[latex]\\displaystyle\\overline{{X}}[\/latex] in one sample.\r\n\r\n[latex]\\displaystyle\\frac{{\\overline{X}-{\\mu}_{x}}}{{\\frac{{\\sigma{x}}}{{\\sqrt{n}}}}}\\\\[\/latex]\r\n\r\n[latex]\\displaystyle{\\mu}_{x}[\/latex] is the average of both X and [latex]\\displaystyle\\overline{X}[\/latex]\r\n[latex]\\displaystyle{\\sigma}\\overline{x} = \\frac{{\\overline{X}-{\\mu}_{x}}}{{\\frac{{\\sigma{x}}}{{\\sqrt{n}}}}}\\\\[\/latex] = standard deviation of [latex]\\displaystyle\\overline{{X}}\\\\[\/latex] and is called the standard error of the mean.\r\nTo find probabilities for means on the calculator, follow these steps:\r\n<ul>\r\n \t<li>2nd DISTR<\/li>\r\n \t<li>2:normalcdf<\/li>\r\n \t<li>normalcdf \u00a0(Lower value of the area, upper value of the area, mean,[latex]\\displaystyle\\sqrt{\\frac{{\\text{standard deviation}}}{{\\text{sample size}}}}\\\\[\/latex]<\/li>\r\n \t<li>where: <em data-redactor-tag=\"em\">mean<\/em> is the mean of the original distribution <em data-redactor-tag=\"em\">standard deviation<\/em> is the standard deviation of the original distribution <em data-redactor-tag=\"em\">sample size<\/em> =<em data-redactor-tag=\"em\">n<\/em><\/li>\r\n<\/ul>\r\n<div class=\"textbox exercises\">\r\n<h3>Example<\/h3>\r\nAn unknown distribution has a mean of 90 and a standard deviation of 15. Samples of size<em data-redactor-tag=\"em\">n<\/em> = 25 are drawn randomly from the population.\r\n<ol>\r\n \t<li>Find the probability that the <strong data-redactor-tag=\"strong\">sample mean<\/strong> is between 85 and 92.<\/li>\r\n \t<li>Find the value that is two standard deviations above the expected value, 90, of the sample mean.<\/li>\r\n<\/ol>\r\n<a href=\"https:\/\/courses.candelalearning.com\/introstats1xmaster\/wp-content\/uploads\/sites\/635\/2015\/06\/Screen-Shot-2015-06-08-at-8.51.05-AM.png\"><img class=\"aligncenter wp-image-516\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/132\/2016\/04\/21214600\/Screen-Shot-2015-06-08-at-8.51.05-AM.png\" alt=\"Graph of probability P(85&lt; x bar &lt; 92)\" width=\"484\" height=\"236\" \/><\/a>\r\n\r\nSolution:\r\n\r\n<code data-redactor-tag=\"code\">normalcdf:\u00a0<\/code>(lower value, upper value, mean, standard error of the mean)\r\nThe parameter list is abbreviated (lower value, upper value, <em data-redactor-tag=\"em\">\u03bc<\/em>,[latex]\\displaystyle\\frac{{\\sigma}}{{\\sqrt{n}}}\\\\[\/latex]\r\n\r\n<code data-redactor-tag=\"code\">normalcdf:\u00a0<\/code>(85,92,90,\u00a0[latex]\\displaystyle\\frac{{15}}{{\\sqrt{25}}}\\\\[\/latex]\u00a0= 0.6997\r\n\r\nTo find the value that is two standard deviations above the expected value 90, use the formula:\r\n<ul>\r\n \t<li>value = [latex]\\displaystyle{\\mu}_{x}[\/latex] + (# of STDEVs)[latex]\\displaystyle\\left(\\frac{{{\\sigma}_{x}}}{{\\sqrt{n}}}\\right)\\\\[\/latex]<\/li>\r\n \t<li>value = 90 + 2 ([latex]\\displaystyle\\frac{{15}}{{\\sqrt{25}}}\\\\[\/latex] )= 96<\/li>\r\n<\/ul>\r\nThe value that is two standard deviations above the expected value is 96.\u00a0The standard error of the mean is\u00a0[latex]\\displaystyle\\frac{{\\sigma}}{{\\sqrt{n}}}\\\\[\/latex] =\u00a0[latex]\\displaystyle\\frac{{15}}{{\\sqrt{25}}}\\\\[\/latex] = 3.\r\n\r\nRecall that the standard error of the mean is a description of how far (on average) that the sample mean will be from the population mean in repeated simple random samples of size <em data-redactor-tag=\"em\">n<\/em>.\r\n\r\n<\/div>\r\nhttps:\/\/www.youtube.com\/embed\/FXZ2O1Lv-KE\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Try it<\/h3>\r\nAn unknown distribution has a mean of 45 and a standard deviation of eight. Samples of size <em data-redactor-tag=\"em\">n<\/em> = 30 are drawn randomly from the population. Find the probability that the sample mean is between 42 and 50.\r\n\r\nP(42 &lt; [latex]\\displaystyle\\overline{x}[\/latex] &lt; 50) = 42, 50, 45,\u00a0[latex]\\displaystyle\\frac{{8}}{{\\sqrt{30}}}\\\\[\/latex] = 0.9797\r\n\r\n<\/div>\r\n<h3><\/h3>\r\n<div class=\"textbox exercises\">\r\n<h3>Example<\/h3>\r\nThe length of time, in hours, it takes an \"over 40\" group of people to play one soccer match is normally distributed with a <strong data-redactor-tag=\"strong\">mean of two hours<\/strong> and a <strong data-redactor-tag=\"strong\">standard deviation of 0.5 hours<\/strong>. A <strong data-redactor-tag=\"strong\">s<\/strong><strong data-redactor-tag=\"strong\">ample of size <em data-redactor-tag=\"em\">n<\/em> = 50<\/strong> is drawn randomly from the population. Find the probability that the <strong data-redactor-tag=\"strong\">sample mean<\/strong> is between 1.8 hours and 2.3 hours.\r\n\r\nThe length of time, in hours, it takes an \"over 40\" group of people to play one soccer match is normally distributed with a\u00a0<span data-redactor-tag=\"strong\">mean of two hours<\/span>\u00a0and a\u00a0<span data-redactor-tag=\"strong\">standard deviation of 0.5 hours<\/span>. A\u00a0<span data-redactor-tag=\"strong\">s<\/span><span data-redactor-tag=\"strong\">ample of size\u00a0<em data-redactor-tag=\"em\">n<\/em>\u00a0= 50<\/span>\u00a0is drawn randomly from the population. Find the probability that the\u00a0<span data-redactor-tag=\"strong\">sample mean<\/span>\u00a0is between 1.8 hours and 2.3 hours.\r\n\r\nLet [latex]\\displaystyle\\overline{X}[\/latex] =\u00a0the <strong data-redactor-tag=\"strong\">mean<\/strong> time, in hours, it takes to play one soccer match.\r\n\r\nIf [latex]{\\mu}_{x}[\/latex]= _________,\u00a0[latex]{\\sigma}_{x}[\/latex] = __________, and <em data-redactor-tag=\"em\">n<\/em> = ___________, then <em data-redactor-tag=\"em\">X<\/em>~ <em data-redactor-tag=\"em\">N<\/em>(______, ______) by the <strong data-redactor-tag=\"strong\">central limit theorem for means<\/strong>.\r\n\r\n[latex]{\\mu}_{x}[\/latex] = 2\r\n[latex]{\\sigma}_{x}[\/latex]\u00a0 =\u00a0 0.5\r\nn\u00a0 =\u00a0 50 and\r\nX~N\u00a0 (2, [latex]\\frac{{0.5}}{{\\sqrt{50}}}\\\\[\/latex])\r\n\r\nFind\u00a0P(1.8 &lt; [latex]\\displaystyle\\overline{x}\\\\[\/latex] &lt; 2.3)\r\n\r\nSolution:\r\n\r\nP(1.8 &lt; [latex]\\displaystyle\\overline{x}\\\\[\/latex] &lt; 2.3)\r\n\r\nP(1.8 &lt; [latex]\\displaystyle\\overline{x}\\\\[\/latex] &lt; 2.3) = \u00a00.9977\r\n\r\n<code data-redactor-tag=\"code\">normalcdf<\/code>\u00a0:\u00a0(1.8,\u00a02.3,\u00a02,\u00a0[latex]\\displaystyle\\frac{{0.5}}{{\\sqrt{50}}}\\\\[\/latex]) = 0.9977\r\n\r\nThe probability that the mean time is between 1.8 hours and 2.3 hours is 0.9977.\r\n\r\n<\/div>\r\n<h3><\/h3>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Try it<\/h3>\r\nThe length of time taken on the SAT for a group of students is normally distributed with a mean of 2.5 hours and a standard deviation of 0.25 hours. A sample size of <em data-redactor-tag=\"em\">n<\/em> = 60 is drawn randomly from the population. Find the probability that the sample mean is between two hours and three hours.\r\n\r\nP(2 &lt; [latex]\\overline{x}[\/latex] &lt; 3) = normalcdf( 2, 3, 2.5,\u00a0[latex]\\frac{0.25}{\\sqrt{60}}[\/latex] = 1\r\n\r\nTo find percentiles for means on the calculator, follow these steps.\r\n<ul>\r\n \t<li>2nd DIStR<\/li>\r\n \t<li>3:invNorm<\/li>\r\n \t<li><em data-redactor-tag=\"em\">k<\/em> = invNorm (area to the left of k, mean[latex]\\displaystyle\\sqrt{\\frac{{\\text{standard deviation}}}{{\\text{sample size}}}}\\\\[\/latex]),\u00a0where: <em data-redactor-tag=\"em\">k<\/em> = the <em data-redactor-tag=\"em\">k<\/em>th percentile <em data-redactor-tag=\"em\">mean<\/em> is the mean of the original distribution <em data-redactor-tag=\"em\">standard deviation<\/em> is the standard deviation of the original distribution <em data-redactor-tag=\"em\">sample size<\/em> = <em data-redactor-tag=\"em\">n<\/em><\/li>\r\n<\/ul>\r\n<\/div>\r\n<h3><\/h3>\r\n<div class=\"textbox exercises\">\r\n<h3>Example<\/h3>\r\nIn a recent study reported Oct. 29, 2012 on the Flurry Blog, the mean age of tablet users is 34 years. Suppose the standard deviation is 15 years. Take a sample of size <em data-redactor-tag=\"em\">n<\/em> = 100.\r\n<ol>\r\n \t<li>What are the mean and standard deviation for the sample mean ages of tablet users?<\/li>\r\n \t<li>What does the distribution look like?<\/li>\r\n \t<li>Find the probability that the sample mean age is more than 30 years (the reported mean age of tablet users in this particular study).<\/li>\r\n \t<li>Find the 95th percentile for the sample mean age (to one decimal place).<\/li>\r\n<\/ol>\r\nSolution:\r\n<ol>\r\n \t<li>Since the sample mean tends to target the population mean, we have <em data-redactor-tag=\"em\">\u03bc<sub data-redactor-tag=\"sub\">X<\/sub><\/em> = <em data-redactor-tag=\"em\">\u03bc<\/em> = 34. The sample standard deviation is given by [latex]\\displaystyle\\frac{{\\sigma}}{{\\sqrt{n}}}=\\frac{{15}}{{\\sqrt{100}}}=\\frac{{15}}{{10}}={1.5}\\\\[\/latex]<\/li>\r\n \t<li>The central limit theorem states that for large sample sizes(<em data-redactor-tag=\"em\">n<\/em>), the sampling distribution will be approximately normal.<\/li>\r\n \t<li>The probability that the sample mean age is more than 30 is given by <em data-redactor-tag=\"em\">P<\/em>(<em data-redactor-tag=\"em\">\u03a7<\/em> &gt; 30) = <code data-redactor-tag=\"code\">normalcdf<\/code>(30,E99,34,1.5) = 0.9962<\/li>\r\n \t<li>Let <em data-redactor-tag=\"em\">k<\/em> = the 95th percentile. k = invNorm(0.95, 34, [latex]\\displaystyle\\frac{{15}}{{\\sqrt{100}}}\\\\[\/latex]) = 36.5<\/li>\r\n<\/ol>\r\n<\/div>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Try it<\/h3>\r\nIn an article on Flurry Blog, a gaming marketing gap for men between the ages of 30 and 40 is identified. You are researching a startup game targeted at the 35-year-old demographic. Your idea is to develop a strategy game that can be played by men from their late 20s through their late 30s. Based on the article's data, industry research shows that the average strategy player is 28 years old with a standard deviation of 4.8 years. You take a sample of 100 randomly selected gamers. If your target market is 29- to 35-year-olds, should you continue with your development strategy?\r\n\r\nYou need to determine the probability for men whose mean age is between 29 and 35 years of age wanting to play a strategy game.\u00a0Find\u00a0P(29 &lt; [latex]\\displaystyle\\overline{x}[\/latex] &lt; 35) = normalcdf = 0.0186\r\n\r\n(29, 35, 28,\u00a0[latex]\\displaystyle\\frac{{4.8}}{{\\sqrt{100}}}\\\\[\/latex] = 0.0186\r\n\r\nYou can conclude there is approximately a 2% chance that your game will be played by men whose mean age is between 29 and 35.\r\n\r\n<\/div>\r\n<h3><\/h3>\r\n<div class=\"textbox exercises\">\r\n<h3>Example<\/h3>\r\nThe mean number of minutes for app engagement by a tablet user is 8.2 minutes. Suppose the standard deviation is one minute. Take a sample of 60.\r\n<ol>\r\n \t<li>What are the mean and standard deviation for the sample mean number of app engagement by a tablet user?<\/li>\r\n \t<li>What is the standard error of the mean?<\/li>\r\n \t<li>Find the 90th percentile for the sample mean time for app engagement for a tablet user. Interpret this value in a complete sentence.<\/li>\r\n \t<li>Find the probability that the sample mean is between eight minutes and 8.5 minutes.<\/li>\r\n<\/ol>\r\nSolution:\r\n<ol>\r\n \t<li>[latex]\\displaystyle{\\mu}_{\\overline{x}}={\\mu}=8.2\\\\[\/latex],\u00a0[latex]\\displaystyle{\\sigma}_{\\overline{x}}=\\frac{{\\sigma}}{{\\sqrt{n}}}=\\frac{{1}}{{\\sqrt{60}}} = 0.13\\\\[\/latex]This allows us to calculate the probability of sample means of a particular distance from the mean, in repeated samples of size 60.<\/li>\r\n \t<li>Let <em data-redactor-tag=\"em\">k<\/em> = the 90th percentile, k = invNorm(0.9, 8.2, \u00a0[latex]\\displaystyle\\frac{{1}}{{\\sqrt{60}}}\\\\[\/latex]) = 8.37.\u00a0This values indicates that 90 percent of the average app engagement time for table users is less than 8.37 minutes.<\/li>\r\n \t<li>P(8 &lt; [latex]\\displaystyle\\overline{x}\\\\[\/latex] &lt; 8.5) = normalcdf(8, 8.5, 8.2[latex]\\displaystyle\\frac{{1}}{{\\sqrt{60}}}\\\\[\/latex]) = 0.9293<\/li>\r\n<\/ol>\r\n<\/div>\r\nhttps:\/\/www.youtube.com\/embed\/J1twbrHel3o\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Try it<\/h3>\r\nCans of a cola beverage claim to contain 16 ounces. The amounts in a sample are measured and the statistics are <em data-redactor-tag=\"em\">n<\/em> = 34,[latex]\\displaystyle\\overline{x}[\/latex]\u00a0= 16.01 ounces. If the cans are filled so that <em data-redactor-tag=\"em\">\u03bc<\/em> = 16.00 ounces (as labeled) and <em data-redactor-tag=\"em\">\u03c3<\/em>= 0.143 ounces, find the probability that a sample of 34 cans will have an average amount greater than 16.01 ounces. Do the results suggest that cans are filled with an amount greater than 16 ounces?\r\n\r\nWe have P([latex]\\displaystyle\\overline{x}[\/latex] &gt; 16.01) = normalcdf(16.01, E99, 16,\u00a0[latex]\\displaystyle\\frac{{0.143}}{{\\sqrt{34}}}\\\\[\/latex]= 0.3417\r\n\r\nSince there is a 34.17% probability that the average sample weight is greater than 16.01 ounces, we should be skeptical of the company's claimed volume. If I am a consumer, I should be glad that I am probably receiving free cola. If I am the manufacturer, I need to determine if my bottling processes are outside of acceptable limits.\r\n\r\n<\/div>\r\n&nbsp;\r\n<h2>References<\/h2>\r\nBaran, Daya. \"20 Percent of Americans Have Never Used Email.\"WebGuild, 2010. Available online at http:\/\/www.webguild.org\/20080519\/20-percent-of-americans-have-never-used-email (accessed May 17, 2013).\r\n\r\nData from The Flurry Blog, 2013. Available online at http:\/\/blog.flurry.com (accessed May 17, 2013).\r\n\r\nData from the United States Department of Agriculture.\r\n<h2>Concept Review<\/h2>\r\nIn a population whose distribution may be known or unknown, if the size ( <em data-redactor-tag=\"em\">n<\/em>) of samples is sufficiently large, the distribution of the sample means will be approximately normal. The mean of the sample means will equal the population mean. The standard deviation of the distribution of the sample means, called the standard error of the mean, is equal to the population standard deviation divided by the square root of the sample size (<em data-redactor-tag=\"em\">n<\/em>).\r\n<h2>Formula Review<\/h2>\r\nThe Central Limit Theorem for Sample Means:[latex]\\displaystyle\\overline{X}{\\sim}{N}({\\mu}_{x},\\frac{{{\\sigma}_{x}}}{{\\sqrt{n}}})[\/latex]","rendered":"<div class=\"textbox learning-objectives\">\n<h3>Learning Objectives<\/h3>\n<p>Suppose\u00a0<em>X<\/em> is a random variable with a distribution that may be known or unknown (it can be any distribution). Using a subscript that matches the random variable, suppose:<\/p>\n<ol>\n<li><em>\u03bc<\/em><em><sub data-redactor-tag=\"sub\">X<\/sub><\/em> = the mean of <em>X<\/em><\/li>\n<li><em>\u03c3<\/em><em><sub data-redactor-tag=\"sub\">X<\/sub><\/em> = the standard deviation of <em>X<\/em><\/li>\n<\/ol>\n<p>If you draw random samples of size\u00a0<em>n<\/em>, then as <em>n<\/em> increases, the random variable [latex]\\displaystyle\\overline{{X}}[\/latex].<\/p>\n<\/div>\n<p>&nbsp;<\/p>\n<p>The\u00a0<strong>central limit theorem<\/strong> for sample means says that if you keep drawing larger and larger samples (such as rolling one, two, five, and finally, ten dice) and <strong>calculating their means<\/strong>, the sample means form their own <strong>normal distribution<\/strong> (the sampling distribution). The normal distribution has the same mean as the original distribution and a variance that equals the original variance divided by, the sample size. The variable <em>n<\/em> is the number of values that are averaged together, not the number of times the experiment is done.<\/p>\n<p>To put it more formally, if you draw random samples of size\u00a0<em>n<\/em>, the distribution of the random variable [latex]\\displaystyle\\overline{{X}}[\/latex], which consists of sample means, is called the <strong>sampling distribution of the mean<\/strong>. The sampling distribution of the mean approaches a normal distribution as <em>n<\/em>, the <strong>sample size<\/strong>, increases.<\/p>\n<p>The random variable\u00a0[latex]\\displaystyle\\overline{{X}}[\/latex] in one sample.<\/p>\n<p>[latex]\\displaystyle\\frac{{\\overline{X}-{\\mu}_{x}}}{{\\frac{{\\sigma{x}}}{{\\sqrt{n}}}}}\\\\[\/latex]<\/p>\n<p>[latex]\\displaystyle{\\mu}_{x}[\/latex] is the average of both X and [latex]\\displaystyle\\overline{X}[\/latex]<br \/>\n[latex]\\displaystyle{\\sigma}\\overline{x} = \\frac{{\\overline{X}-{\\mu}_{x}}}{{\\frac{{\\sigma{x}}}{{\\sqrt{n}}}}}\\\\[\/latex] = standard deviation of [latex]\\displaystyle\\overline{{X}}\\\\[\/latex] and is called the standard error of the mean.<br \/>\nTo find probabilities for means on the calculator, follow these steps:<\/p>\n<ul>\n<li>2nd DISTR<\/li>\n<li>2:normalcdf<\/li>\n<li>normalcdf \u00a0(Lower value of the area, upper value of the area, mean,[latex]\\displaystyle\\sqrt{\\frac{{\\text{standard deviation}}}{{\\text{sample size}}}}\\\\[\/latex]<\/li>\n<li>where: <em data-redactor-tag=\"em\">mean<\/em> is the mean of the original distribution <em data-redactor-tag=\"em\">standard deviation<\/em> is the standard deviation of the original distribution <em data-redactor-tag=\"em\">sample size<\/em> =<em data-redactor-tag=\"em\">n<\/em><\/li>\n<\/ul>\n<div class=\"textbox exercises\">\n<h3>Example<\/h3>\n<p>An unknown distribution has a mean of 90 and a standard deviation of 15. Samples of size<em data-redactor-tag=\"em\">n<\/em> = 25 are drawn randomly from the population.<\/p>\n<ol>\n<li>Find the probability that the <strong data-redactor-tag=\"strong\">sample mean<\/strong> is between 85 and 92.<\/li>\n<li>Find the value that is two standard deviations above the expected value, 90, of the sample mean.<\/li>\n<\/ol>\n<p><a href=\"https:\/\/courses.candelalearning.com\/introstats1xmaster\/wp-content\/uploads\/sites\/635\/2015\/06\/Screen-Shot-2015-06-08-at-8.51.05-AM.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-516\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/132\/2016\/04\/21214600\/Screen-Shot-2015-06-08-at-8.51.05-AM.png\" alt=\"Graph of probability P(85&lt; x bar &lt; 92)\" width=\"484\" height=\"236\" \/><\/a><\/p>\n<p>Solution:<\/p>\n<p><code data-redactor-tag=\"code\">normalcdf:\u00a0<\/code>(lower value, upper value, mean, standard error of the mean)<br \/>\nThe parameter list is abbreviated (lower value, upper value, <em data-redactor-tag=\"em\">\u03bc<\/em>,[latex]\\displaystyle\\frac{{\\sigma}}{{\\sqrt{n}}}\\\\[\/latex]<\/p>\n<p><code data-redactor-tag=\"code\">normalcdf:\u00a0<\/code>(85,92,90,\u00a0[latex]\\displaystyle\\frac{{15}}{{\\sqrt{25}}}\\\\[\/latex]\u00a0= 0.6997<\/p>\n<p>To find the value that is two standard deviations above the expected value 90, use the formula:<\/p>\n<ul>\n<li>value = [latex]\\displaystyle{\\mu}_{x}[\/latex] + (# of STDEVs)[latex]\\displaystyle\\left(\\frac{{{\\sigma}_{x}}}{{\\sqrt{n}}}\\right)\\\\[\/latex]<\/li>\n<li>value = 90 + 2 ([latex]\\displaystyle\\frac{{15}}{{\\sqrt{25}}}\\\\[\/latex] )= 96<\/li>\n<\/ul>\n<p>The value that is two standard deviations above the expected value is 96.\u00a0The standard error of the mean is\u00a0[latex]\\displaystyle\\frac{{\\sigma}}{{\\sqrt{n}}}\\\\[\/latex] =\u00a0[latex]\\displaystyle\\frac{{15}}{{\\sqrt{25}}}\\\\[\/latex] = 3.<\/p>\n<p>Recall that the standard error of the mean is a description of how far (on average) that the sample mean will be from the population mean in repeated simple random samples of size <em data-redactor-tag=\"em\">n<\/em>.<\/p>\n<\/div>\n<p><iframe loading=\"lazy\" id=\"oembed-1\" title=\"Sampling distribution of the sample mean | Probability and Statistics | Khan Academy\" width=\"500\" height=\"375\" src=\"https:\/\/www.youtube.com\/embed\/FXZ2O1Lv-KE?feature=oembed&#38;rel=0\" frameborder=\"0\" allowfullscreen=\"allowfullscreen\"><\/iframe><\/p>\n<div class=\"textbox key-takeaways\">\n<h3>Try it<\/h3>\n<p>An unknown distribution has a mean of 45 and a standard deviation of eight. Samples of size <em data-redactor-tag=\"em\">n<\/em> = 30 are drawn randomly from the population. Find the probability that the sample mean is between 42 and 50.<\/p>\n<p>P(42 &lt; [latex]\\displaystyle\\overline{x}[\/latex] &lt; 50) = 42, 50, 45,\u00a0[latex]\\displaystyle\\frac{{8}}{{\\sqrt{30}}}\\\\[\/latex] = 0.9797<\/p>\n<\/div>\n<h3><\/h3>\n<div class=\"textbox exercises\">\n<h3>Example<\/h3>\n<p>The length of time, in hours, it takes an &#8220;over 40&#8221; group of people to play one soccer match is normally distributed with a <strong data-redactor-tag=\"strong\">mean of two hours<\/strong> and a <strong data-redactor-tag=\"strong\">standard deviation of 0.5 hours<\/strong>. A <strong data-redactor-tag=\"strong\">s<\/strong><strong data-redactor-tag=\"strong\">ample of size <em data-redactor-tag=\"em\">n<\/em> = 50<\/strong> is drawn randomly from the population. Find the probability that the <strong data-redactor-tag=\"strong\">sample mean<\/strong> is between 1.8 hours and 2.3 hours.<\/p>\n<p>The length of time, in hours, it takes an &#8220;over 40&#8221; group of people to play one soccer match is normally distributed with a\u00a0<span data-redactor-tag=\"strong\">mean of two hours<\/span>\u00a0and a\u00a0<span data-redactor-tag=\"strong\">standard deviation of 0.5 hours<\/span>. A\u00a0<span data-redactor-tag=\"strong\">s<\/span><span data-redactor-tag=\"strong\">ample of size\u00a0<em data-redactor-tag=\"em\">n<\/em>\u00a0= 50<\/span>\u00a0is drawn randomly from the population. Find the probability that the\u00a0<span data-redactor-tag=\"strong\">sample mean<\/span>\u00a0is between 1.8 hours and 2.3 hours.<\/p>\n<p>Let [latex]\\displaystyle\\overline{X}[\/latex] =\u00a0the <strong data-redactor-tag=\"strong\">mean<\/strong> time, in hours, it takes to play one soccer match.<\/p>\n<p>If [latex]{\\mu}_{x}[\/latex]= _________,\u00a0[latex]{\\sigma}_{x}[\/latex] = __________, and <em data-redactor-tag=\"em\">n<\/em> = ___________, then <em data-redactor-tag=\"em\">X<\/em>~ <em data-redactor-tag=\"em\">N<\/em>(______, ______) by the <strong data-redactor-tag=\"strong\">central limit theorem for means<\/strong>.<\/p>\n<p>[latex]{\\mu}_{x}[\/latex] = 2<br \/>\n[latex]{\\sigma}_{x}[\/latex]\u00a0 =\u00a0 0.5<br \/>\nn\u00a0 =\u00a0 50 and<br \/>\nX~N\u00a0 (2, [latex]\\frac{{0.5}}{{\\sqrt{50}}}\\\\[\/latex])<\/p>\n<p>Find\u00a0P(1.8 &lt; [latex]\\displaystyle\\overline{x}\\\\[\/latex] &lt; 2.3)<\/p>\n<p>Solution:<\/p>\n<p>P(1.8 &lt; [latex]\\displaystyle\\overline{x}\\\\[\/latex] &lt; 2.3)<\/p>\n<p>P(1.8 &lt; [latex]\\displaystyle\\overline{x}\\\\[\/latex] &lt; 2.3) = \u00a00.9977<\/p>\n<p><code data-redactor-tag=\"code\">normalcdf<\/code>\u00a0:\u00a0(1.8,\u00a02.3,\u00a02,\u00a0[latex]\\displaystyle\\frac{{0.5}}{{\\sqrt{50}}}\\\\[\/latex]) = 0.9977<\/p>\n<p>The probability that the mean time is between 1.8 hours and 2.3 hours is 0.9977.<\/p>\n<\/div>\n<h3><\/h3>\n<div class=\"textbox key-takeaways\">\n<h3>Try it<\/h3>\n<p>The length of time taken on the SAT for a group of students is normally distributed with a mean of 2.5 hours and a standard deviation of 0.25 hours. A sample size of <em data-redactor-tag=\"em\">n<\/em> = 60 is drawn randomly from the population. Find the probability that the sample mean is between two hours and three hours.<\/p>\n<p>P(2 &lt; [latex]\\overline{x}[\/latex] &lt; 3) = normalcdf( 2, 3, 2.5,\u00a0[latex]\\frac{0.25}{\\sqrt{60}}[\/latex] = 1<\/p>\n<p>To find percentiles for means on the calculator, follow these steps.<\/p>\n<ul>\n<li>2nd DIStR<\/li>\n<li>3:invNorm<\/li>\n<li><em data-redactor-tag=\"em\">k<\/em> = invNorm (area to the left of k, mean[latex]\\displaystyle\\sqrt{\\frac{{\\text{standard deviation}}}{{\\text{sample size}}}}\\\\[\/latex]),\u00a0where: <em data-redactor-tag=\"em\">k<\/em> = the <em data-redactor-tag=\"em\">k<\/em>th percentile <em data-redactor-tag=\"em\">mean<\/em> is the mean of the original distribution <em data-redactor-tag=\"em\">standard deviation<\/em> is the standard deviation of the original distribution <em data-redactor-tag=\"em\">sample size<\/em> = <em data-redactor-tag=\"em\">n<\/em><\/li>\n<\/ul>\n<\/div>\n<h3><\/h3>\n<div class=\"textbox exercises\">\n<h3>Example<\/h3>\n<p>In a recent study reported Oct. 29, 2012 on the Flurry Blog, the mean age of tablet users is 34 years. Suppose the standard deviation is 15 years. Take a sample of size <em data-redactor-tag=\"em\">n<\/em> = 100.<\/p>\n<ol>\n<li>What are the mean and standard deviation for the sample mean ages of tablet users?<\/li>\n<li>What does the distribution look like?<\/li>\n<li>Find the probability that the sample mean age is more than 30 years (the reported mean age of tablet users in this particular study).<\/li>\n<li>Find the 95th percentile for the sample mean age (to one decimal place).<\/li>\n<\/ol>\n<p>Solution:<\/p>\n<ol>\n<li>Since the sample mean tends to target the population mean, we have <em data-redactor-tag=\"em\">\u03bc<sub data-redactor-tag=\"sub\">X<\/sub><\/em> = <em data-redactor-tag=\"em\">\u03bc<\/em> = 34. The sample standard deviation is given by [latex]\\displaystyle\\frac{{\\sigma}}{{\\sqrt{n}}}=\\frac{{15}}{{\\sqrt{100}}}=\\frac{{15}}{{10}}={1.5}\\\\[\/latex]<\/li>\n<li>The central limit theorem states that for large sample sizes(<em data-redactor-tag=\"em\">n<\/em>), the sampling distribution will be approximately normal.<\/li>\n<li>The probability that the sample mean age is more than 30 is given by <em data-redactor-tag=\"em\">P<\/em>(<em data-redactor-tag=\"em\">\u03a7<\/em> &gt; 30) = <code data-redactor-tag=\"code\">normalcdf<\/code>(30,E99,34,1.5) = 0.9962<\/li>\n<li>Let <em data-redactor-tag=\"em\">k<\/em> = the 95th percentile. k = invNorm(0.95, 34, [latex]\\displaystyle\\frac{{15}}{{\\sqrt{100}}}\\\\[\/latex]) = 36.5<\/li>\n<\/ol>\n<\/div>\n<div class=\"textbox key-takeaways\">\n<h3>Try it<\/h3>\n<p>In an article on Flurry Blog, a gaming marketing gap for men between the ages of 30 and 40 is identified. You are researching a startup game targeted at the 35-year-old demographic. Your idea is to develop a strategy game that can be played by men from their late 20s through their late 30s. Based on the article&#8217;s data, industry research shows that the average strategy player is 28 years old with a standard deviation of 4.8 years. You take a sample of 100 randomly selected gamers. If your target market is 29- to 35-year-olds, should you continue with your development strategy?<\/p>\n<p>You need to determine the probability for men whose mean age is between 29 and 35 years of age wanting to play a strategy game.\u00a0Find\u00a0P(29 &lt; [latex]\\displaystyle\\overline{x}[\/latex] &lt; 35) = normalcdf = 0.0186<\/p>\n<p>(29, 35, 28,\u00a0[latex]\\displaystyle\\frac{{4.8}}{{\\sqrt{100}}}\\\\[\/latex] = 0.0186<\/p>\n<p>You can conclude there is approximately a 2% chance that your game will be played by men whose mean age is between 29 and 35.<\/p>\n<\/div>\n<h3><\/h3>\n<div class=\"textbox exercises\">\n<h3>Example<\/h3>\n<p>The mean number of minutes for app engagement by a tablet user is 8.2 minutes. Suppose the standard deviation is one minute. Take a sample of 60.<\/p>\n<ol>\n<li>What are the mean and standard deviation for the sample mean number of app engagement by a tablet user?<\/li>\n<li>What is the standard error of the mean?<\/li>\n<li>Find the 90th percentile for the sample mean time for app engagement for a tablet user. Interpret this value in a complete sentence.<\/li>\n<li>Find the probability that the sample mean is between eight minutes and 8.5 minutes.<\/li>\n<\/ol>\n<p>Solution:<\/p>\n<ol>\n<li>[latex]\\displaystyle{\\mu}_{\\overline{x}}={\\mu}=8.2\\\\[\/latex],\u00a0[latex]\\displaystyle{\\sigma}_{\\overline{x}}=\\frac{{\\sigma}}{{\\sqrt{n}}}=\\frac{{1}}{{\\sqrt{60}}} = 0.13\\\\[\/latex]This allows us to calculate the probability of sample means of a particular distance from the mean, in repeated samples of size 60.<\/li>\n<li>Let <em data-redactor-tag=\"em\">k<\/em> = the 90th percentile, k = invNorm(0.9, 8.2, \u00a0[latex]\\displaystyle\\frac{{1}}{{\\sqrt{60}}}\\\\[\/latex]) = 8.37.\u00a0This values indicates that 90 percent of the average app engagement time for table users is less than 8.37 minutes.<\/li>\n<li>P(8 &lt; [latex]\\displaystyle\\overline{x}\\\\[\/latex] &lt; 8.5) = normalcdf(8, 8.5, 8.2[latex]\\displaystyle\\frac{{1}}{{\\sqrt{60}}}\\\\[\/latex]) = 0.9293<\/li>\n<\/ol>\n<\/div>\n<p><iframe loading=\"lazy\" id=\"oembed-2\" title=\"Standard error of the mean | Inferential statistics | Probability and Statistics | Khan Academy\" width=\"500\" height=\"375\" src=\"https:\/\/www.youtube.com\/embed\/J1twbrHel3o?feature=oembed&#38;rel=0\" frameborder=\"0\" allowfullscreen=\"allowfullscreen\"><\/iframe><\/p>\n<div class=\"textbox key-takeaways\">\n<h3>Try it<\/h3>\n<p>Cans of a cola beverage claim to contain 16 ounces. The amounts in a sample are measured and the statistics are <em data-redactor-tag=\"em\">n<\/em> = 34,[latex]\\displaystyle\\overline{x}[\/latex]\u00a0= 16.01 ounces. If the cans are filled so that <em data-redactor-tag=\"em\">\u03bc<\/em> = 16.00 ounces (as labeled) and <em data-redactor-tag=\"em\">\u03c3<\/em>= 0.143 ounces, find the probability that a sample of 34 cans will have an average amount greater than 16.01 ounces. Do the results suggest that cans are filled with an amount greater than 16 ounces?<\/p>\n<p>We have P([latex]\\displaystyle\\overline{x}[\/latex] &gt; 16.01) = normalcdf(16.01, E99, 16,\u00a0[latex]\\displaystyle\\frac{{0.143}}{{\\sqrt{34}}}\\\\[\/latex]= 0.3417<\/p>\n<p>Since there is a 34.17% probability that the average sample weight is greater than 16.01 ounces, we should be skeptical of the company&#8217;s claimed volume. If I am a consumer, I should be glad that I am probably receiving free cola. If I am the manufacturer, I need to determine if my bottling processes are outside of acceptable limits.<\/p>\n<\/div>\n<p>&nbsp;<\/p>\n<h2>References<\/h2>\n<p>Baran, Daya. &#8220;20 Percent of Americans Have Never Used Email.&#8221;WebGuild, 2010. Available online at http:\/\/www.webguild.org\/20080519\/20-percent-of-americans-have-never-used-email (accessed May 17, 2013).<\/p>\n<p>Data from The Flurry Blog, 2013. Available online at http:\/\/blog.flurry.com (accessed May 17, 2013).<\/p>\n<p>Data from the United States Department of Agriculture.<\/p>\n<h2>Concept Review<\/h2>\n<p>In a population whose distribution may be known or unknown, if the size ( <em data-redactor-tag=\"em\">n<\/em>) of samples is sufficiently large, the distribution of the sample means will be approximately normal. The mean of the sample means will equal the population mean. The standard deviation of the distribution of the sample means, called the standard error of the mean, is equal to the population standard deviation divided by the square root of the sample size (<em data-redactor-tag=\"em\">n<\/em>).<\/p>\n<h2>Formula Review<\/h2>\n<p>The Central Limit Theorem for Sample Means:[latex]\\displaystyle\\overline{X}{\\sim}{N}({\\mu}_{x},\\frac{{{\\sigma}_{x}}}{{\\sqrt{n}}})[\/latex]<\/p>\n\n\t\t\t <section class=\"citations-section\" role=\"contentinfo\">\n\t\t\t <h3>Candela Citations<\/h3>\n\t\t\t\t\t <div>\n\t\t\t\t\t\t <div id=\"citation-list-266\">\n\t\t\t\t\t\t\t <div class=\"licensing\"><div class=\"license-attribution-dropdown-subheading\">CC licensed content, Shared previously<\/div><ul class=\"citation-list\"><li>Introductory Statistics . <strong>Authored by<\/strong>: Barbara Illowski, Susan Dean. <strong>Provided by<\/strong>: Open Stax. <strong>Located at<\/strong>: <a target=\"_blank\" href=\"http:\/\/cnx.org\/contents\/30189442-6998-4686-ac05-ed152b91b9de@17.44\">http:\/\/cnx.org\/contents\/30189442-6998-4686-ac05-ed152b91b9de@17.44<\/a>. <strong>License<\/strong>: <em><a target=\"_blank\" rel=\"license\" href=\"https:\/\/creativecommons.org\/licenses\/by\/4.0\/\">CC BY: Attribution<\/a><\/em>. <strong>License Terms<\/strong>: Download for free at http:\/\/cnx.org\/contents\/30189442-6998-4686-ac05-ed152b91b9de@17.44<\/li><\/ul><div class=\"license-attribution-dropdown-subheading\">All rights reserved content<\/div><ul class=\"citation-list\"><li>Sampling distribution of the sample mean | Probability and Statistics | Khan Academy. <strong>Authored by<\/strong>: Khan Academy. <strong>Located at<\/strong>: <a target=\"_blank\" href=\"https:\/\/www.youtube.com\/embed\/FXZ2O1Lv-KE\">https:\/\/www.youtube.com\/embed\/FXZ2O1Lv-KE<\/a>. <strong>License<\/strong>: <em>All Rights Reserved<\/em>. <strong>License Terms<\/strong>: Standard YouTube License<\/li><li>Standard error of the mean | Inferential statistics | Probability and Statistics | Khan Academy. <strong>Authored by<\/strong>: Khan Academy. <strong>Located at<\/strong>: <a target=\"_blank\" href=\"https:\/\/www.youtube.com\/embed\/J1twbrHel3o\">https:\/\/www.youtube.com\/embed\/J1twbrHel3o<\/a>. <strong>License<\/strong>: <em>All Rights Reserved<\/em>. <strong>License Terms<\/strong>: Standard YouTube LIcense<\/li><\/ul><\/div>\n\t\t\t\t\t\t <\/div>\n\t\t\t\t\t <\/div>\n\t\t\t <\/section>","protected":false},"author":21,"menu_order":2,"template":"","meta":{"_candela_citation":"[{\"type\":\"copyrighted_video\",\"description\":\"Sampling distribution of the sample mean | Probability and Statistics | Khan Academy\",\"author\":\"Khan Academy\",\"organization\":\"\",\"url\":\"https:\/\/www.youtube.com\/embed\/FXZ2O1Lv-KE\",\"project\":\"\",\"license\":\"arr\",\"license_terms\":\"Standard YouTube License\"},{\"type\":\"copyrighted_video\",\"description\":\"Standard error of the mean | Inferential statistics | Probability and Statistics | Khan Academy\",\"author\":\"Khan Academy\",\"organization\":\"\",\"url\":\"https:\/\/www.youtube.com\/embed\/J1twbrHel3o\",\"project\":\"\",\"license\":\"arr\",\"license_terms\":\"Standard YouTube LIcense\"},{\"type\":\"cc\",\"description\":\"Introductory Statistics \",\"author\":\"Barbara Illowski, Susan Dean\",\"organization\":\"Open Stax\",\"url\":\"http:\/\/cnx.org\/contents\/30189442-6998-4686-ac05-ed152b91b9de@17.44\",\"project\":\"\",\"license\":\"cc-by\",\"license_terms\":\"Download for free at http:\/\/cnx.org\/contents\/30189442-6998-4686-ac05-ed152b91b9de@17.44\"}]","CANDELA_OUTCOMES_GUID":"","pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-266","chapter","type-chapter","status-publish","hentry"],"part":256,"_links":{"self":[{"href":"https:\/\/courses.lumenlearning.com\/suny-fmcc-introstats1\/wp-json\/pressbooks\/v2\/chapters\/266","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/courses.lumenlearning.com\/suny-fmcc-introstats1\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/courses.lumenlearning.com\/suny-fmcc-introstats1\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/suny-fmcc-introstats1\/wp-json\/wp\/v2\/users\/21"}],"version-history":[{"count":5,"href":"https:\/\/courses.lumenlearning.com\/suny-fmcc-introstats1\/wp-json\/pressbooks\/v2\/chapters\/266\/revisions"}],"predecessor-version":[{"id":1577,"href":"https:\/\/courses.lumenlearning.com\/suny-fmcc-introstats1\/wp-json\/pressbooks\/v2\/chapters\/266\/revisions\/1577"}],"part":[{"href":"https:\/\/courses.lumenlearning.com\/suny-fmcc-introstats1\/wp-json\/pressbooks\/v2\/parts\/256"}],"metadata":[{"href":"https:\/\/courses.lumenlearning.com\/suny-fmcc-introstats1\/wp-json\/pressbooks\/v2\/chapters\/266\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/courses.lumenlearning.com\/suny-fmcc-introstats1\/wp-json\/wp\/v2\/media?parent=266"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/suny-fmcc-introstats1\/wp-json\/pressbooks\/v2\/chapter-type?post=266"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/suny-fmcc-introstats1\/wp-json\/wp\/v2\/contributor?post=266"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/suny-fmcc-introstats1\/wp-json\/wp\/v2\/license?post=266"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}