{"id":51,"date":"2022-05-20T16:59:05","date_gmt":"2022-05-20T16:59:05","guid":{"rendered":"https:\/\/courses.lumenlearning.com\/alphamodule\/chapter\/z-score-and-the-empirical-rule-what-to-know\/"},"modified":"2022-07-11T19:47:13","modified_gmt":"2022-07-11T19:47:13","slug":"z-score-and-the-empirical-rule-what-to-know","status":"publish","type":"chapter","link":"https:\/\/courses.lumenlearning.com\/alphamodule\/chapter\/z-score-and-the-empirical-rule-what-to-know\/","title":{"raw":"Z-Score and the Empirical Rule: Learn It 1","rendered":"Z-Score and the Empirical Rule: Learn It 1"},"content":{"raw":"<div class=\"textbox learning-objectives\">\r\n<h3>Learning Goals<\/h3>\r\nAfter completing this section, you should feel comfortable performing these skills.\r\n<ul>\r\n \t<li><a href=\"#defineZscore\">Define the standardized value, or z-score.<\/a><\/li>\r\n \t<li><a href=\"#convert\">Use technology to convert values into standardized scores.<\/a><\/li>\r\n \t<li><a href=\"#identNumStdDev\">Use a dotplot and histogram to identify the number of standard deviations from the mean of certain observations.<\/a><\/li>\r\n \t<li><a href=\"#calcZscore\">Calculate a value's standardized score by hand to determine its location relative to the mean.<\/a><\/li>\r\n \t<li><a href=\"#defineEmp\">Define the Empirical Rule.<\/a><\/li>\r\n<\/ul>\r\nClick on a skill above to jump to its location in this section.\r\n\r\n<\/div>\r\nIn the next activity, you will need to be able to convert values into standardized values (also called standardized scores or z-scores) and use a value\u2019s standardized value to determine whether the value is above, below, or equal to the mean. You will also need to be able to explain the Empirical Rule. In this section, we'll use a data set to explore how to perform necessary calculations by hand and using technology.\r\n<h2>Standardized Values<\/h2>\r\nYou learned in\u00a0<a href=\"https:\/\/courses.lumenlearning.com\/exemplarstatistics\/chapter\/comparing-variability-of-data sets-what-to-know\/\"><em>Comparing Variability of Data Sets: What to Know<\/em><\/a> that a standard deviation is a measure for how spread out observations are from the mean.\r\n\r\nA <strong>standardized value<\/strong>, or <strong>z-score<\/strong>, is the number of standard deviations an observation is away from the mean.\r\n\r\nFor example, in this section we will analyze runtimes (in minutes) of G-rated movies to learn how to calculate standardized values. Within this context, the standardized value, or z-score, is the number of standard deviations a particular movie runtime is from the mean.\r\n\r\nIt is important to note that the distance of a particular movie runtime from the mean is not measured in minutes; rather it is measured in standard deviations. Thus, a z-score of\u00a0[latex]-2.3[\/latex] is an observation that is\u00a0[latex]2.3[\/latex] standard deviations <em>below<\/em> the mean, and a z-score of\u00a0[latex]2.3[\/latex] is an observation that is\u00a0[latex]2.3[\/latex] standard deviations <em>above<\/em> the mean. It is important to note that z-scores do not have units associated with them.\r\n<p style=\"text-align: left;\"><strong>Z-Score Formula\u00a0\u00a0<\/strong>The value of an observation is <strong>standardized<\/strong> using the formula [latex]z=\\dfrac{x-\\mu}{\\sigma}[\/latex], where [latex]x[\/latex] represents the value of the observation, [latex]\\mu[\/latex] represents the population mean, [latex]\\sigma[\/latex] represents the population standard deviation, and [latex]z[\/latex] represents the standardized value, or z-score.<\/p>\r\nBefore we use the formula to convert values into standardized values, let's recap our understanding of standard deviation. In <a href=\"https:\/\/courses.lumenlearning.com\/exemplarstatistics\/chapter\/comparing-variability-of-data sets-what-to-know\/\"><em>Comparing Variability of Data Sets: What to Know<\/em><\/a>, you learned to understand standard deviation as a measure of variability in a data set. You looked at the statistical components that went into the formulas for standard deviation and variance and saw that larger standard deviations could represent more variability, and vice-versa. We'd like to shift that perspective now and look at a unit of standard deviation as a distance from the mean of a data set in a distribution.\r\n<div class=\"textbox tryit\">\r\n<h3>standard deviation as a unit of distance<\/h3>\r\n<span style=\"background-color: #99cc00;\">[Perspective video -- a 3-instructor video showing how to think about standard deviation as a unit of distance in a distribution -- i.e., illustrating values so many standard deviations above and below the mean of a bell-shaped, unimodal, symmetric distribution. Show how adding or subtracting std devs can obtain a certain value at that location in the distribution. Show that a value's z-score (negative or positive) is that many std deviations away from the mean in that direction.]<\/span>\r\n\r\n<\/div>\r\nSee the example below for a demonstration, then try it out using the Movie Runtimes database to answer the questions below.\r\n<div class=\"textbox exercises\">\r\n<h3>inTeractive example<\/h3>\r\nLet's return again to the data set Sleep Study: Average Sleep, which we used in\u00a0<em><a href=\"https:\/\/courses.lumenlearning.com\/exemplarstatistics\/chapter\/comparing-variability-of-data sets-what-to-know\/\">Comparing Variability of Data Sets: What to Know<\/a><\/em> to learn about standard deviation as a measure of the variability of a data set.\r\n\r\nOpen the tool at\u00a0<a href=\"https:\/\/dcmathpathways.shinyapps.io\/EDA_quantitative\/\">https:\/\/dcmathpathways.shinyapps.io\/EDA_quantitative\/<\/a>\u00a0and select the Sleep Study: Average Sleep data set.\u00a0Display a histogram and dotplot and make a note of the mean and standard deviation in the descriptive statistics. Round your final answers to the questions below to 3 decimal places, as needed.\r\n<ol>\r\n \t<li>Describe the shape of the data set using the histogram and dotplot. For practice, display a boxplot as well and note the visual clues that you can use to determine the shape of the distribution from the boxplot.<\/li>\r\n \t<li>How does the relationship between the mean and median (given in descriptive statistics) help to support your analysis?<\/li>\r\n \t<li>What are the mean and standard deviation of the data set?<\/li>\r\n \t<li>What number of sleep hours lies one standard deviation above the mean? What value lies one standard deviation below?<\/li>\r\n \t<li>What number of sleep hours lie two standard deviations above and below the mean?<\/li>\r\n<\/ol>\r\n[reveal-answer q=\"599366\"]Show Answer[\/reveal-answer]\r\n[hidden-answer a=\"599366\"]\r\n<ol>\r\n \t<li>The distribution is unimodal and approximately symmetric. A few outliers lie to either side of the distribution but do so evenly.<\/li>\r\n \t<li>The mean and median are approximately equal, which supports that the outliers are approximately evenly distributed.<\/li>\r\n \t<li><span style=\"font-size: 1rem; orphans: 1; text-align: initial;\">[latex]\\bar{x}=7.97[\/latex] and [latex]s=0.965[\/latex]<\/span><\/li>\r\n \t<li>The standard deviation is 0.965. We can add that to the mean to determine the value exactly one standard deviation above the mean. Likewise, we can subtract that from the mean to find the value exactly one standard deviation below the mean.\r\n<ul>\r\n \t<li>[latex]7.97 + 0.965 = 8.935[\/latex]:\r\n<ul>\r\n \t<li>[latex]8.935[\/latex] hours of sleep lies <strong>one<\/strong> standard deviation <strong>above<\/strong> the mean.<\/li>\r\n \t<li>That is, the standardized value for the observation [latex]8.935[\/latex] hours is [latex]1[\/latex[] Its z-score is <strong>[latex]1[\/latex]<\/strong>.<\/li>\r\n<\/ul>\r\n<\/li>\r\n \t<li>[latex]7.97 - 0.965 = 7.005[\/latex]:\r\n<ul>\r\n \t<li>[latex]7.005[\/latex] hours of sleep lies <strong>one<\/strong> standard deviation <strong>below<\/strong> the mean.<\/li>\r\n \t<li>That is, the standardized value for the observation [latex]7.005[\/latex] hours is [latex]-1[\/latex]. Its z-score is <strong>[latex]-1[\/latex]<\/strong><\/li>\r\n<\/ul>\r\n<\/li>\r\n<\/ul>\r\n<\/li>\r\n \t<li>The standard deviation is [latex]0.965[\/latex]. We can add twice that to the mean to determine the value exactly two standard deviations above the mean. Likewise, we can subtract [latex]2*0.965[\/latex] from the mean to find the value exactly one standard deviation below the mean.\r\n<ul>\r\n \t<li>[latex]7.97 + 2*0.965 = 9.9[\/latex]:\r\n<ul>\r\n \t<li>[latex]9.9[\/latex] hours of sleep lies <strong>two<\/strong> standard deviations\u00a0<strong>above<\/strong> the mean.<\/li>\r\n \t<li>That is, the standardized value for the observation [latex]9.9[\/latex] hours is [latex]2[\/latex[] Its z-score is <strong>[latex]2[\/latex]<\/strong>.<\/li>\r\n<\/ul>\r\n<\/li>\r\n \t<li>[latex]7.97 - 2*0.965 = 6.04[\/latex]:\r\n<ul>\r\n \t<li>[latex]6.04[\/latex] hours of sleep lies <strong>two<\/strong> standard deviations\u00a0<strong>below<\/strong> the mean.<\/li>\r\n \t<li>That is, the standardized value for the observation [latex]6.04[\/latex] hours is [latex]-2[\/latex]. Its z-score is\u00a0<strong>[latex]-2[\/latex].<\/strong><\/li>\r\n<\/ul>\r\n<\/li>\r\n<\/ul>\r\n<\/li>\r\n<\/ol>\r\n[\/hidden-answer]\r\n\r\n<\/div>\r\n","rendered":"<div class=\"textbox learning-objectives\">\n<h3>Learning Goals<\/h3>\n<p>After completing this section, you should feel comfortable performing these skills.<\/p>\n<ul>\n<li><a href=\"#defineZscore\">Define the standardized value, or z-score.<\/a><\/li>\n<li><a href=\"#convert\">Use technology to convert values into standardized scores.<\/a><\/li>\n<li><a href=\"#identNumStdDev\">Use a dotplot and histogram to identify the number of standard deviations from the mean of certain observations.<\/a><\/li>\n<li><a href=\"#calcZscore\">Calculate a value&#8217;s standardized score by hand to determine its location relative to the mean.<\/a><\/li>\n<li><a href=\"#defineEmp\">Define the Empirical Rule.<\/a><\/li>\n<\/ul>\n<p>Click on a skill above to jump to its location in this section.<\/p>\n<\/div>\n<p>In the next activity, you will need to be able to convert values into standardized values (also called standardized scores or z-scores) and use a value\u2019s standardized value to determine whether the value is above, below, or equal to the mean. You will also need to be able to explain the Empirical Rule. In this section, we&#8217;ll use a data set to explore how to perform necessary calculations by hand and using technology.<\/p>\n<h2>Standardized Values<\/h2>\n<p>You learned in\u00a0<a href=\"https:\/\/courses.lumenlearning.com\/exemplarstatistics\/chapter\/comparing-variability-of-data sets-what-to-know\/\"><em>Comparing Variability of Data Sets: What to Know<\/em><\/a> that a standard deviation is a measure for how spread out observations are from the mean.<\/p>\n<p>A <strong>standardized value<\/strong>, or <strong>z-score<\/strong>, is the number of standard deviations an observation is away from the mean.<\/p>\n<p>For example, in this section we will analyze runtimes (in minutes) of G-rated movies to learn how to calculate standardized values. Within this context, the standardized value, or z-score, is the number of standard deviations a particular movie runtime is from the mean.<\/p>\n<p>It is important to note that the distance of a particular movie runtime from the mean is not measured in minutes; rather it is measured in standard deviations. Thus, a z-score of\u00a0[latex]-2.3[\/latex] is an observation that is\u00a0[latex]2.3[\/latex] standard deviations <em>below<\/em> the mean, and a z-score of\u00a0[latex]2.3[\/latex] is an observation that is\u00a0[latex]2.3[\/latex] standard deviations <em>above<\/em> the mean. It is important to note that z-scores do not have units associated with them.<\/p>\n<p style=\"text-align: left;\"><strong>Z-Score Formula\u00a0\u00a0<\/strong>The value of an observation is <strong>standardized<\/strong> using the formula [latex]z=\\dfrac{x-\\mu}{\\sigma}[\/latex], where [latex]x[\/latex] represents the value of the observation, [latex]\\mu[\/latex] represents the population mean, [latex]\\sigma[\/latex] represents the population standard deviation, and [latex]z[\/latex] represents the standardized value, or z-score.<\/p>\n<p>Before we use the formula to convert values into standardized values, let&#8217;s recap our understanding of standard deviation. In <a href=\"https:\/\/courses.lumenlearning.com\/exemplarstatistics\/chapter\/comparing-variability-of-data sets-what-to-know\/\"><em>Comparing Variability of Data Sets: What to Know<\/em><\/a>, you learned to understand standard deviation as a measure of variability in a data set. You looked at the statistical components that went into the formulas for standard deviation and variance and saw that larger standard deviations could represent more variability, and vice-versa. We&#8217;d like to shift that perspective now and look at a unit of standard deviation as a distance from the mean of a data set in a distribution.<\/p>\n<div class=\"textbox tryit\">\n<h3>standard deviation as a unit of distance<\/h3>\n<p><span style=\"background-color: #99cc00;\">[Perspective video &#8212; a 3-instructor video showing how to think about standard deviation as a unit of distance in a distribution &#8212; i.e., illustrating values so many standard deviations above and below the mean of a bell-shaped, unimodal, symmetric distribution. Show how adding or subtracting std devs can obtain a certain value at that location in the distribution. Show that a value&#8217;s z-score (negative or positive) is that many std deviations away from the mean in that direction.]<\/span><\/p>\n<\/div>\n<p>See the example below for a demonstration, then try it out using the Movie Runtimes database to answer the questions below.<\/p>\n<div class=\"textbox exercises\">\n<h3>inTeractive example<\/h3>\n<p>Let&#8217;s return again to the data set Sleep Study: Average Sleep, which we used in\u00a0<em><a href=\"https:\/\/courses.lumenlearning.com\/exemplarstatistics\/chapter\/comparing-variability-of-data sets-what-to-know\/\">Comparing Variability of Data Sets: What to Know<\/a><\/em> to learn about standard deviation as a measure of the variability of a data set.<\/p>\n<p>Open the tool at\u00a0<a href=\"https:\/\/dcmathpathways.shinyapps.io\/EDA_quantitative\/\">https:\/\/dcmathpathways.shinyapps.io\/EDA_quantitative\/<\/a>\u00a0and select the Sleep Study: Average Sleep data set.\u00a0Display a histogram and dotplot and make a note of the mean and standard deviation in the descriptive statistics. Round your final answers to the questions below to 3 decimal places, as needed.<\/p>\n<ol>\n<li>Describe the shape of the data set using the histogram and dotplot. For practice, display a boxplot as well and note the visual clues that you can use to determine the shape of the distribution from the boxplot.<\/li>\n<li>How does the relationship between the mean and median (given in descriptive statistics) help to support your analysis?<\/li>\n<li>What are the mean and standard deviation of the data set?<\/li>\n<li>What number of sleep hours lies one standard deviation above the mean? What value lies one standard deviation below?<\/li>\n<li>What number of sleep hours lie two standard deviations above and below the mean?<\/li>\n<\/ol>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q599366\">Show Answer<\/span><\/p>\n<div id=\"q599366\" class=\"hidden-answer\" style=\"display: none\">\n<ol>\n<li>The distribution is unimodal and approximately symmetric. A few outliers lie to either side of the distribution but do so evenly.<\/li>\n<li>The mean and median are approximately equal, which supports that the outliers are approximately evenly distributed.<\/li>\n<li><span style=\"font-size: 1rem; orphans: 1; text-align: initial;\">[latex]\\bar{x}=7.97[\/latex] and [latex]s=0.965[\/latex]<\/span><\/li>\n<li>The standard deviation is 0.965. We can add that to the mean to determine the value exactly one standard deviation above the mean. Likewise, we can subtract that from the mean to find the value exactly one standard deviation below the mean.\n<ul>\n<li>[latex]7.97 + 0.965 = 8.935[\/latex]:\n<ul>\n<li>[latex]8.935[\/latex] hours of sleep lies <strong>one<\/strong> standard deviation <strong>above<\/strong> the mean.<\/li>\n<li>That is, the standardized value for the observation [latex]8.935[\/latex] hours is [latex]1[\/latex[] Its z-score is <strong>[latex]1[\/latex]<\/strong>.<\/li>\n<\/ul>\n<\/li>\n<li>[latex]7.97 - 0.965 = 7.005[\/latex]:\n<ul>\n<li>[latex]7.005[\/latex] hours of sleep lies <strong>one<\/strong> standard deviation <strong>below<\/strong> the mean.<\/li>\n<li>That is, the standardized value for the observation [latex]7.005[\/latex] hours is [latex]-1[\/latex]. Its z-score is <strong>[latex]-1[\/latex]<\/strong><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<\/li>\n<li>The standard deviation is [latex]0.965[\/latex]. We can add twice that to the mean to determine the value exactly two standard deviations above the mean. Likewise, we can subtract [latex]2*0.965[\/latex] from the mean to find the value exactly one standard deviation below the mean.\n<ul>\n<li>[latex]7.97 + 2*0.965 = 9.9[\/latex]:\n<ul>\n<li>[latex]9.9[\/latex] hours of sleep lies <strong>two<\/strong> standard deviations\u00a0<strong>above<\/strong> the mean.<\/li>\n<li>That is, the standardized value for the observation [latex]9.9[\/latex] hours is [latex]2[\/latex[] Its z-score is <strong>[latex]2[\/latex]<\/strong>.<\/li>\n<\/ul>\n<\/li>\n<li>[latex]7.97 - 2*0.965 = 6.04[\/latex]:\n<ul>\n<li>[latex]6.04[\/latex] hours of sleep lies <strong>two<\/strong> standard deviations\u00a0<strong>below<\/strong> the mean.<\/li>\n<li>That is, the standardized value for the observation [latex]6.04[\/latex] hours is [latex]-2[\/latex]. Its z-score is\u00a0<strong>[latex]-2[\/latex].<\/strong><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<\/div>\n","protected":false},"author":17533,"menu_order":49,"template":"","meta":{"_candela_citation":"[]","CANDELA_OUTCOMES_GUID":"","pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-51","chapter","type-chapter","status-publish","hentry"],"part":20,"_links":{"self":[{"href":"https:\/\/courses.lumenlearning.com\/alphamodule\/wp-json\/pressbooks\/v2\/chapters\/51","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/courses.lumenlearning.com\/alphamodule\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/courses.lumenlearning.com\/alphamodule\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/alphamodule\/wp-json\/wp\/v2\/users\/17533"}],"version-history":[{"count":5,"href":"https:\/\/courses.lumenlearning.com\/alphamodule\/wp-json\/pressbooks\/v2\/chapters\/51\/revisions"}],"predecessor-version":[{"id":611,"href":"https:\/\/courses.lumenlearning.com\/alphamodule\/wp-json\/pressbooks\/v2\/chapters\/51\/revisions\/611"}],"part":[{"href":"https:\/\/courses.lumenlearning.com\/alphamodule\/wp-json\/pressbooks\/v2\/parts\/20"}],"metadata":[{"href":"https:\/\/courses.lumenlearning.com\/alphamodule\/wp-json\/pressbooks\/v2\/chapters\/51\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/courses.lumenlearning.com\/alphamodule\/wp-json\/wp\/v2\/media?parent=51"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/alphamodule\/wp-json\/pressbooks\/v2\/chapter-type?post=51"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/alphamodule\/wp-json\/wp\/v2\/contributor?post=51"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/alphamodule\/wp-json\/wp\/v2\/license?post=51"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}