{"id":450,"date":"2021-12-20T14:34:21","date_gmt":"2021-12-20T14:34:21","guid":{"rendered":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/?post_type=chapter&#038;p=450"},"modified":"2022-02-17T20:10:37","modified_gmt":"2022-02-17T20:10:37","slug":"what-to-know-about-4c","status":"publish","type":"chapter","link":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/chapter\/what-to-know-about-4c\/","title":{"raw":"What to Know About Interpreting the Mean and Median of a Dataset: 4C - 23","rendered":"What to Know About Interpreting the Mean and Median of a Dataset: 4C &#8211; 23"},"content":{"raw":"<div class=\"textbox learning-objectives\">\r\n<h3>goals for this section<\/h3>\r\nAfter completing this section, you should feel comfortable performing these skills.\r\n<ul>\r\n \t<li><a href=\"#IntMeanMedian\">Interpret the median of a dataset.<\/a><\/li>\r\n \t<li><a href=\"#IntMeanMedian\">Interpret the mean of a dataset.<\/a><\/li>\r\n \t<li><a href=\"#IdentSkew\">Identify whether a dataset is left-skewed, symmetric, or right-skewed.<\/a><\/li>\r\n \t<li><a href=\"#IdentSkew\">Identify in which dataset the mean is greater than, less than, or approximately equal to the median.<\/a><\/li>\r\n \t<li><a href=\"#resistant\">Identify which of the mean or median is resistant to skew.<\/a><\/li>\r\n<\/ul>\r\nClick on a skill above to jump to its location in this section.\r\n\r\n<\/div>\r\nWhen examining the distribution of a quantitative variable using a histogram or a dotplot, we often find that the distribution follows a bell shape with a mound of observances in the middle of the distribution and even amounts of data falling to the right and left. But sometimes a distribution's values are bunched up to one side or the other, with a few observations stretching way out to the other side. You may recall from <em>What to Know About Applications of Histograms: 3D <\/em>that there are specialized statistical terms we use for these different distribution shapes: skewness and symmetry. In this section, you'll learn that there are certain ways the mean of the data relates to the median under these different shapes.\r\n<h2>Using Skew to Describe Datasets<\/h2>\r\nRecall that we say a quantitative variable has a <strong>right-skewed<\/strong> distribution or a <strong>positive skew<\/strong> if there is a \"tail\" of infrequent values on the right (upper) end of the distribution. We say a dataset has an approximately <strong>symmetric<\/strong> distribution if values are similarly distributed on either side of the mean\/median. We say a dataset has a <strong>left-skewed<\/strong> distribution or a <strong>negative skew<\/strong> if there is a \"tail\" of infrequent values on the left (lower) end of the distribution.\r\n<div class=\"textbox tryit\">\r\n<h3>skewed distributions<\/h3>\r\n<span style=\"background-color: #ffff99;\">I'd like an animation here (super simple) of a data set that moves from right skew to symmetry to left skew with a slider students can manipulate. The labels would change over the slider: right skew \/ roughly symmetric \/ roughly symmetric \/ left skew.<\/span>\r\n\r\n<\/div>\r\nIn the next activity, you'll need to calculate and interpret the mean and median in skewed distributions. Let's get some practice with these skills using data collected around the T.V. show\u00a0<em>Friends<\/em>.\r\n<h3 id=\"IntMeanMedian\">Interpreting Mean and Median<\/h3>\r\n<em>Friends<\/em> was a popular American television show that aired from 1994 to 2004. The show followed a group of six friends living in New York City and chronicled their relationships and day-to-day adventures. The show became known in popular culture for its comedy and for the closeness of its cast.[footnote]Encyclopedia Britannica. (n.d.). Friends. In <em>Encyclopedia Britannica.com<\/em>. https:\/\/www.britannica.com\/topic\/Friends[\/footnote]\r\n\r\nThe following table lists the number of U.S. viewers of each episode of the 10th and final season of Friends.[footnote]Mock, T. (2020). <em>A weekly data project aimed at the R ecosystem<\/em>. TidyTuesday. https:\/\/github.com\/rfordatascience\/tidytuesday\/blob\/master\/data\/2020\/2020-09-08\/readme.md#friends_infocsv[\/footnote]\r\n<div align=\"left\">\r\n<table><caption class=\"center\"><span style=\"text-transform: uppercase;\">Friends Final Season Viewers by episode<\/span><strong>\r\n<\/strong><\/caption>\r\n<tbody>\r\n<tr>\r\n<td style=\"text-align: center;\"><strong>Episode Number<\/strong><\/td>\r\n<td style=\"text-align: center;\"><strong>Episode Title<\/strong><\/td>\r\n<td style=\"text-align: center;\"><strong>Air Date<\/strong><\/td>\r\n<td style=\"text-align: center;\"><strong>U.S. Viewers (Millions)<\/strong><\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"text-align: center;\"><strong>1<\/strong><\/td>\r\n<td style=\"text-align: center;\">The One After Joey and Rachel Kiss<\/td>\r\n<td style=\"text-align: center;\">9\/25\/03<\/td>\r\n<td style=\"text-align: center;\">24.54<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"text-align: center;\"><strong>2<\/strong><\/td>\r\n<td style=\"text-align: center;\">The One Where Ross Is Fine<\/td>\r\n<td style=\"text-align: center;\">10\/2\/03<\/td>\r\n<td style=\"text-align: center;\">22.38<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"text-align: center;\"><strong>3<\/strong><\/td>\r\n<td style=\"text-align: center;\">The One with Ross's Tan<\/td>\r\n<td style=\"text-align: center;\">10\/9\/03<\/td>\r\n<td style=\"text-align: center;\">21.87<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"text-align: center;\"><strong>4<\/strong><\/td>\r\n<td style=\"text-align: center;\">The One with the Cake<\/td>\r\n<td style=\"text-align: center;\">10\/23\/03<\/td>\r\n<td style=\"text-align: center;\">18.77<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"text-align: center;\"><strong>5<\/strong><\/td>\r\n<td style=\"text-align: center;\">The One Where Rachel's Sister Babysits<\/td>\r\n<td style=\"text-align: center;\">10\/30\/03<\/td>\r\n<td style=\"text-align: center;\">19.37<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"text-align: center;\"><strong>6<\/strong><\/td>\r\n<td style=\"text-align: center;\">The One with Ross's Grant<\/td>\r\n<td style=\"text-align: center;\">11\/6\/03<\/td>\r\n<td style=\"text-align: center;\">20.38<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"text-align: center;\"><strong>7<\/strong><\/td>\r\n<td style=\"text-align: center;\">The One with the Home Study<\/td>\r\n<td style=\"text-align: center;\">11\/13\/03<\/td>\r\n<td style=\"text-align: center;\">20.21<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"text-align: center;\"><strong>8<\/strong><\/td>\r\n<td style=\"text-align: center;\">The One with the Late Thanksgiving<\/td>\r\n<td style=\"text-align: center;\">11\/20\/03<\/td>\r\n<td style=\"text-align: center;\">20.66<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"text-align: center;\"><strong>9<\/strong><\/td>\r\n<td style=\"text-align: center;\">The One with the Birth Mother<\/td>\r\n<td style=\"text-align: center;\">1\/8\/04<\/td>\r\n<td style=\"text-align: center;\">25.49<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"text-align: center;\"><strong>10<\/strong><\/td>\r\n<td style=\"text-align: center;\">The One Where Chandler Gets Caught<\/td>\r\n<td style=\"text-align: center;\">1\/15\/04<\/td>\r\n<td style=\"text-align: center;\">26.68<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"text-align: center;\"><strong>11<\/strong><\/td>\r\n<td style=\"text-align: center;\">The One Where the Stripper Cries<\/td>\r\n<td style=\"text-align: center;\">2\/5\/04<\/td>\r\n<td style=\"text-align: center;\">24.91<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"text-align: center;\"><strong>12<\/strong><\/td>\r\n<td style=\"text-align: center;\">The One with Phoebe's Wedding<\/td>\r\n<td style=\"text-align: center;\">2\/12\/04<\/td>\r\n<td style=\"text-align: center;\">25.9<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"text-align: center;\"><strong>13<\/strong><\/td>\r\n<td style=\"text-align: center;\">The One Where Joey Speaks French<\/td>\r\n<td style=\"text-align: center;\">2\/19\/04<\/td>\r\n<td style=\"text-align: center;\">24.27<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"text-align: center;\"><strong>14<\/strong><\/td>\r\n<td style=\"text-align: center;\">The One with Princess Consuela<\/td>\r\n<td style=\"text-align: center;\">2\/26\/04<\/td>\r\n<td style=\"text-align: center;\">22.83<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"text-align: center;\"><strong>15<\/strong><\/td>\r\n<td style=\"text-align: center;\">The One Where Estelle Dies<\/td>\r\n<td style=\"text-align: center;\">4\/22\/04<\/td>\r\n<td style=\"text-align: center;\">22.64<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"text-align: center;\"><strong>16<\/strong><\/td>\r\n<td style=\"text-align: center;\">The One with Rachel's Going Away Party<\/td>\r\n<td style=\"text-align: center;\">4\/29\/04<\/td>\r\n<td style=\"text-align: center;\">24.51<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"text-align: center;\"><strong>17<\/strong><\/td>\r\n<td style=\"text-align: center;\">The Last One*<\/td>\r\n<td style=\"text-align: center;\">5\/6\/04<\/td>\r\n<td style=\"text-align: center;\">52.46<\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"text-align: center;\"><strong>18<\/strong><\/td>\r\n<td style=\"text-align: center;\">The Last One*<\/td>\r\n<td style=\"text-align: center;\">5\/6\/04<\/td>\r\n<td style=\"text-align: center;\">52.46<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<table class=\"fin-table gridded\"><caption class=\"center\">\u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0*Note: the final two episodes aired back-to-back on the same night\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/caption>\r\n<thead><\/thead>\r\n<\/table>\r\n<span style=\"font-size: 1rem; text-align: initial;\">We'll use technology to analyze this dataset.<\/span>\r\n\r\n<\/div>\r\n<div class=\"textbox\">\r\n\r\nGo to the <em>Describing and Exploring Quantitative Variables<\/em> tool at <a href=\"https:\/\/dcmathpathways.shinyapps.io\/EDA_quantitative\/\" target=\"_blank\" rel=\"noopener\">https:\/\/dcmathpathways.shinyapps.io\/EDA_quantitative\/<\/a>.\r\n<p style=\"padding-left: 30px;\">Step 1) Select the <strong>Single Group<\/strong> tab.<\/p>\r\n<p style=\"padding-left: 30px;\">Step 2) Locate the drop-down menu under <strong>Enter Data<\/strong> and select <strong>Your Own<\/strong>.<\/p>\r\n<p style=\"padding-left: 30px;\">Step 3) Under\u00a0<strong>Do you have<\/strong>, select\u00a0<strong>Individual Observations<\/strong>.<\/p>\r\n<p style=\"padding-left: 30px;\">Step 4)\u00a0Under <strong>Name of Variable<\/strong>, type \u201cU.S. Viewers (Millions).\u201d<\/p>\r\n<p style=\"padding-left: 30px;\">Step 5) Cut and paste or enter the data presented in the above table for U.S. Viewers (Millions).<\/p>\r\n\r\n<\/div>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>question 1<\/h3>\r\nUse the tool to calculate the median episode viewership for Season 10 of Friends.\u00a0You can scroll in the observations entry box to verify that you pasted the data correctly.\r\n\r\n[reveal-answer q=\"253029\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"253029\"]The median will be located in Descriptive Statistics.[\/hidden-answer]\r\n\r\n<\/div>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>question 2<\/h3>\r\nWhich of the following does the median tell you about the number of people who watched episodes of Friends during Season 10?\r\n<p style=\"padding-left: 30px;\">a) Half the episodes in Season 10 of Friends had more than 23.5 million viewers, and half the episodes had fewer than 23.5 million viewers.<\/p>\r\n<p style=\"padding-left: 30px;\">b) The most common episode viewership was 23.5 million viewers per episode during Season 10.<\/p>\r\n<p style=\"padding-left: 30px;\">c) If we took the total number of viewers for the whole season and split them equally among all 18 episodes, each episode would have about 23.5 million viewers.<\/p>\r\n[reveal-answer q=\"497957\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"497957\"]The median is the 50th percentile and splits the data in half.[\/hidden-answer]\r\n\r\n<\/div>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>question 3<\/h3>\r\nUse the tool\u00a0to calculate the mean episode viewership for Season 10 of Friends.\r\n\r\n[reveal-answer q=\"704874\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"704874\"]The mean will be located in Descriptive Statistics[\/hidden-answer]\r\n\r\n<\/div>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>question 4<\/h3>\r\nWhich of the following does the mean tell you about the number of people watching episodes of Friends during Season 10?\r\n<p style=\"padding-left: 30px;\">a) Half the episodes in Season 10 of Friends had more than 26.1 million viewers, and half the episodes had fewer than 26.1 million viewers.<\/p>\r\n<p style=\"padding-left: 30px;\">b) The most common episode viewership was 26.1 million viewers per episode during Season 10.<\/p>\r\n<p style=\"padding-left: 30px;\">c) If we took the total number of viewers for the whole season and split them equally among all 18 episodes, each episode would have about 26.1 million viewers.<\/p>\r\n[reveal-answer q=\"203004\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"203004\"]The mean is what we think of as the \"average\" value in the set.[\/hidden-answer]\r\n\r\n<\/div>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>question 5<\/h3>\r\nThe mean number of viewers is _______ the median number of viewers.\r\n<p style=\"padding-left: 30px;\">a) greater than<\/p>\r\n<p style=\"padding-left: 30px;\">b) less than<\/p>\r\n<p style=\"padding-left: 30px;\">c) roughly equal to<\/p>\r\n[reveal-answer q=\"412918\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"412918\"]Use Descriptive Statistics in the tool to compare them.[\/hidden-answer]\r\n\r\n<\/div>\r\nFor this question, use the following histogram of the Season 10 Friends viewership data.\r\n\r\n<strong><img class=\"alignnone wp-image-1010\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/11194844\/Picture43-300x112.png\" alt=\"A histogram labeled &quot;US Viewers (Millions)&quot; on the x-axis and &quot;Count&quot; on the y-axis. The x-axis is numbered in increments of five from 15 to 55 and the y-axis is numbered in increments of 1 from 0 to 4. For 18-19, the count is 1. For 19-20, the count is 1. For 20-21, the count is 3. For 21-22, the count is 1. For 22-23, the count is 3. For 24-25, the count is 4. For 25-26, the count is 2. For 26-27, the count is 1. For 52-53, the count is 2. For all other ranges, the count is 0.\" width=\"892\" height=\"333\" \/><\/strong>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>question 6<\/h3>\r\nWhich of the following describes the distribution of the data?\r\n<p style=\"padding-left: 30px;\">a) Left-skewed<\/p>\r\n<p style=\"padding-left: 30px;\">b) Symmetric<\/p>\r\n<p style=\"padding-left: 30px;\">c) Right-skewed<\/p>\r\n[reveal-answer q=\"755933\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"755933\"]Refer to the definitions at the beginning of this assignment.[\/hidden-answer]\r\n\r\n<\/div>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>question 7<\/h3>\r\nUse what you see on the histogram to justify your answer to Question 5.\r\n\r\n[reveal-answer q=\"875490\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"875490\"]Consider the implications that the shape of the graph has for the size of the mean relative to the median.[\/hidden-answer]\r\n\r\n<\/div>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>question 8<\/h3>\r\nWhich episodes have unusually high numbers of viewers?\r\n\r\n[reveal-answer q=\"55553\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"55553\"]Refer to the table to locate specific episodes.[\/hidden-answer]\r\n\r\n<\/div>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>question 9<\/h3>\r\nThe last two episodes of Friends aired in a row on the same night. Why do you think these episodes have such high numbers of viewers?\r\n\r\n[reveal-answer q=\"972576\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"972576\"]What do <em>you<\/em> think?[\/hidden-answer]\r\n\r\n<\/div>\r\n<div class=\"textbox tryit\">\r\n<h3>effects of skew on mean and median<\/h3>\r\n<span style=\"background-color: #99cc00;\">[Perspective video -- a 3-instructor video that shows how to think about the tail and the two outliers in the data above together with the fact that the mean is larger than the median to begin to understand that the mean tends to be pulled to the right of the median under a right skew.]\u00a0<\/span>\r\n\r\n<\/div>\r\n<h3 id=\"IdentSkew\">Relating Mean and Median to the Skewness of a Dataset from a Histogram<\/h3>\r\nFor each of the plots of data below, choose the description that matches the shape of the data\u2019s distribution, and then select the choice that gives the relationship between the mean and median for those data. Base your answers on the understanding you established in Questions 1 - 9 about the direction the mean was pulled in under the skewness in the dataset.\r\n<div class=\"textbox key-takeaways\">\r\n<h3>question 10<\/h3>\r\n<img class=\"alignnone wp-image-1012 size-medium\" style=\"background-color: initial;\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/11194917\/ChartPicture2-300x297.jpg\" alt=\"An unlabeled bar graph with seven bars. The bar on the far left is the highest. Moving to the right, each bar is progressively shorter than the last. In most places, this is by approximately the same amount, but it is a larger difference between the third and fourth bars.\" width=\"300\" height=\"297\" \/>\r\n<table style=\"border-collapse: collapse; width: 100%;\" border=\"1\">\r\n<tbody>\r\n<tr>\r\n<td style=\"width: 50%;\"><span style=\"background-color: #ffff00;\">Is the distribution Left-skewed, Symmetric, or Right-skewed?<\/span><\/td>\r\n<td style=\"width: 50%;\"><span style=\"background-color: #ffff00;\">[drop down choices]<\/span><\/td>\r\n<\/tr>\r\n<tr>\r\n<td style=\"width: 50%;\"><span style=\"background-color: #ffff00;\">Is the mean greater than, less than, or roughly equal to the median?<\/span><\/td>\r\n<td style=\"width: 50%;\"><span style=\"background-color: #ffff00;\">[drop down choices]<\/span><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n&nbsp;\r\n\r\n[reveal-answer q=\"16821\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"16821\"]Refer to the definitions at the top of the page and your answer to Question 7 for guidance.[\/hidden-answer]\r\n\r\n<\/div>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>question 11<\/h3>\r\n<img class=\"alignnone wp-image-1013 size-medium\" style=\"background-color: initial; font-size: 0.9em;\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/11194922\/ChartPicture3-300x287.jpg\" alt=\"An unlabeled bar graph with seven bars. The bar on the far right is the highest. Moving from the left, each bar is progressively taller than the last. In most places, this is by approximately the same amount, but it is a larger difference between the fourth and fifth bars.\" width=\"300\" height=\"287\" \/>\r\n\r\nIs the distribution Left-skewed, Symmetric, or Right-skewed?\r\n\r\nIs the mean greater than, less than, or roughly equal to the median?\r\n\r\n[reveal-answer q=\"738598\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"738598\"]Refer to the definitions at the top of the page and your answer to Question 7 for guidance.[\/hidden-answer]\r\n\r\n<span style=\"background-color: #ffff00;\">[see question 10 as option for question\/answer method]<\/span>\r\n\r\n<\/div>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>question 12<\/h3>\r\n<img class=\"alignnone wp-image-1011 size-medium\" style=\"background-color: initial;\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/11194913\/ChartPicture1-300x300.png\" alt=\"An unlabeled bar graph. The bar is the center is the highest and going either direction away from it, the bars get shorter by equal increments.\" width=\"300\" height=\"300\" \/>\r\n\r\nIs the distribution Left-skewed, Symmetric, or Right-skewed?\r\n\r\nIs the mean greater than, less than, or roughly equal to the median?\r\n\r\n[reveal-answer q=\"627907\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"627907\"]Refer to the definitions at the top of the page and your answer to Question 7 for guidance.[\/hidden-answer]\r\n\r\n<span style=\"background-color: #ffff00;\">[see question 10 as option for question\/answer method]<\/span>\r\n\r\n<\/div>\r\n<div class=\"textbox tryit\">\r\n<h3>Resistant and Nonresistant Measures of Center<\/h3>\r\n<span style=\"background-color: #99cc00;\">[Worked example - a 3-instructor video showing a symmetric dataset with the mean and median identical, then, skewing the distribution to show what happens to the mean while the median remains in place.]<\/span>\r\n\r\n<\/div>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>question 13<\/h3>\r\nLook back on your answers to Questions 10, 11, and 12. Which of mean or median appeared to be <strong>resistant<\/strong> to skew? That is, which of the two measures of center is not affected by the skewness of a graph?\r\n<p style=\"padding-left: 30px;\">a) The mean is resistant to skew. The median is sensitive to skew and\/or the presence of outliers.<\/p>\r\n<p style=\"padding-left: 30px;\">b) The median is resistant to skew. The mean is sensitive to skew and\/or the presence of outliers.<\/p>\r\n[reveal-answer q=\"56189\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"56189\"]\r\n\r\nConsider which measure (mean or median) seemed to be \"pulled\" in the direction of the tail in the skewed distributions and which did not.\r\n[\/hidden-answer]\r\n\r\n<\/div>\r\nHopefully, you have noticed that when a distribution is symmetric, the mean and median occupy the same value. But under a skew, the mean is \"pulled\" in the direction of the outliers: greater than the median in the case of positive (right) skew, and less than the median in the case of negative (left) skew. It appears that the mean is affected by the presence of outliers while the median is not.\r\n<h3>Looking ahead<\/h3>\r\nBroadly speaking, we consider a value in a dataset to be an outlier if that value is unusual or extreme, given the other values in the dataset.\r\n\r\nSuppose you have two groups of people:\r\n<ol style=\"list-style-type: lower-roman;\">\r\n \t<li style=\"list-style-type: none;\">\r\n<ol style=\"list-style-type: lower-roman;\">\r\n \t<li style=\"list-style-type: none;\">\r\n<ul>\r\n \t<li style=\"font-weight: 400;\" aria-level=\"1\">Group 1 is made up of five professional basketball players, and Group 2 is made up of four professional basketball players and one kindergartener.<\/li>\r\n \t<li style=\"font-weight: 400;\" aria-level=\"1\">Dataset 1 contains the number of three-pointers each person in Group 1 can make in one minute. Dataset 2 contains the number of three-pointers each person in Group 2 can make in an hour.<\/li>\r\n<\/ul>\r\n<\/li>\r\n<\/ol>\r\n<\/li>\r\n<\/ol>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>question 14<\/h3>\r\nWhich dataset do you think is more likely to contain an outlier?\r\n<p style=\"padding-left: 30px;\">a) Group 1<\/p>\r\n<p style=\"padding-left: 30px;\">b) Group 2<\/p>\r\n[reveal-answer q=\"785920\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"785920\"]Imagine a dotplot of the observations in each dataset.[\/hidden-answer]\r\n\r\n<\/div>\r\n<h2>Summary<\/h2>\r\nIn this section, you've learned about skewed distributions vs. symmetric distributions and how skew affects the mean of a data distribution. You also got some practice calculating and interpreting the mean and median of a dataset. Let's summarize where these skills showed up in the material.\r\n<ol style=\"list-style-type: lower-roman;\">\r\n \t<li style=\"list-style-type: none;\">\r\n<ol style=\"list-style-type: lower-roman;\">\r\n \t<li style=\"list-style-type: none;\">\r\n<ul>\r\n \t<li>In Question 1, you calculated the median of a dataset, and interpreted the median in Question 2.<\/li>\r\n \t<li>In Question 3, you calculated the mean of a dataset, and interpreted the mean in Question 4.<\/li>\r\n \t<li>In Question 5, you began to see how the mean and median relate in a distribution.<\/li>\r\n \t<li>In Questions 6, and 10 - 13, you used statistical terms for skew and extreme values to describe the features of a dataset, and began to make connections between the mean and median under differently shaped distributions.<\/li>\r\n \t<li>In Questions 7 -9, you interpreted the mean and median to make connections between them and the data distribution.<\/li>\r\n \t<li>In Question 13, you identified which of the mean or median is resistant to skew.<\/li>\r\n<\/ul>\r\n<\/li>\r\n<\/ol>\r\n<\/li>\r\n<\/ol>\r\nBeing able to interpret the mean and median with regard to the shape of a distribution and the presence of outliers will be essential skills to use when assessing claims made about data that rely on measures of center. If you feel comfortable with these skills, please move on to the activity!","rendered":"<div class=\"textbox learning-objectives\">\n<h3>goals for this section<\/h3>\n<p>After completing this section, you should feel comfortable performing these skills.<\/p>\n<ul>\n<li><a href=\"#IntMeanMedian\">Interpret the median of a dataset.<\/a><\/li>\n<li><a href=\"#IntMeanMedian\">Interpret the mean of a dataset.<\/a><\/li>\n<li><a href=\"#IdentSkew\">Identify whether a dataset is left-skewed, symmetric, or right-skewed.<\/a><\/li>\n<li><a href=\"#IdentSkew\">Identify in which dataset the mean is greater than, less than, or approximately equal to the median.<\/a><\/li>\n<li><a href=\"#resistant\">Identify which of the mean or median is resistant to skew.<\/a><\/li>\n<\/ul>\n<p>Click on a skill above to jump to its location in this section.<\/p>\n<\/div>\n<p>When examining the distribution of a quantitative variable using a histogram or a dotplot, we often find that the distribution follows a bell shape with a mound of observances in the middle of the distribution and even amounts of data falling to the right and left. But sometimes a distribution&#8217;s values are bunched up to one side or the other, with a few observations stretching way out to the other side. You may recall from <em>What to Know About Applications of Histograms: 3D <\/em>that there are specialized statistical terms we use for these different distribution shapes: skewness and symmetry. In this section, you&#8217;ll learn that there are certain ways the mean of the data relates to the median under these different shapes.<\/p>\n<h2>Using Skew to Describe Datasets<\/h2>\n<p>Recall that we say a quantitative variable has a <strong>right-skewed<\/strong> distribution or a <strong>positive skew<\/strong> if there is a &#8220;tail&#8221; of infrequent values on the right (upper) end of the distribution. We say a dataset has an approximately <strong>symmetric<\/strong> distribution if values are similarly distributed on either side of the mean\/median. We say a dataset has a <strong>left-skewed<\/strong> distribution or a <strong>negative skew<\/strong> if there is a &#8220;tail&#8221; of infrequent values on the left (lower) end of the distribution.<\/p>\n<div class=\"textbox tryit\">\n<h3>skewed distributions<\/h3>\n<p><span style=\"background-color: #ffff99;\">I&#8217;d like an animation here (super simple) of a data set that moves from right skew to symmetry to left skew with a slider students can manipulate. The labels would change over the slider: right skew \/ roughly symmetric \/ roughly symmetric \/ left skew.<\/span><\/p>\n<\/div>\n<p>In the next activity, you&#8217;ll need to calculate and interpret the mean and median in skewed distributions. Let&#8217;s get some practice with these skills using data collected around the T.V. show\u00a0<em>Friends<\/em>.<\/p>\n<h3 id=\"IntMeanMedian\">Interpreting Mean and Median<\/h3>\n<p><em>Friends<\/em> was a popular American television show that aired from 1994 to 2004. The show followed a group of six friends living in New York City and chronicled their relationships and day-to-day adventures. The show became known in popular culture for its comedy and for the closeness of its cast.<a class=\"footnote\" title=\"Encyclopedia Britannica. (n.d.). Friends. In Encyclopedia Britannica.com. https:\/\/www.britannica.com\/topic\/Friends\" id=\"return-footnote-450-1\" href=\"#footnote-450-1\" aria-label=\"Footnote 1\"><sup class=\"footnote\">[1]<\/sup><\/a><\/p>\n<p>The following table lists the number of U.S. viewers of each episode of the 10th and final season of Friends.<a class=\"footnote\" title=\"Mock, T. (2020). A weekly data project aimed at the R ecosystem. TidyTuesday. https:\/\/github.com\/rfordatascience\/tidytuesday\/blob\/master\/data\/2020\/2020-09-08\/readme.md#friends_infocsv\" id=\"return-footnote-450-2\" href=\"#footnote-450-2\" aria-label=\"Footnote 2\"><sup class=\"footnote\">[2]<\/sup><\/a><\/p>\n<div style=\"text-align: left;\">\n<table>\n<caption class=\"center\"><span style=\"text-transform: uppercase;\">Friends Final Season Viewers by episode<\/span><strong><br \/>\n<\/strong><\/caption>\n<tbody>\n<tr>\n<td style=\"text-align: center;\"><strong>Episode Number<\/strong><\/td>\n<td style=\"text-align: center;\"><strong>Episode Title<\/strong><\/td>\n<td style=\"text-align: center;\"><strong>Air Date<\/strong><\/td>\n<td style=\"text-align: center;\"><strong>U.S. Viewers (Millions)<\/strong><\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\"><strong>1<\/strong><\/td>\n<td style=\"text-align: center;\">The One After Joey and Rachel Kiss<\/td>\n<td style=\"text-align: center;\">9\/25\/03<\/td>\n<td style=\"text-align: center;\">24.54<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\"><strong>2<\/strong><\/td>\n<td style=\"text-align: center;\">The One Where Ross Is Fine<\/td>\n<td style=\"text-align: center;\">10\/2\/03<\/td>\n<td style=\"text-align: center;\">22.38<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\"><strong>3<\/strong><\/td>\n<td style=\"text-align: center;\">The One with Ross&#8217;s Tan<\/td>\n<td style=\"text-align: center;\">10\/9\/03<\/td>\n<td style=\"text-align: center;\">21.87<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\"><strong>4<\/strong><\/td>\n<td style=\"text-align: center;\">The One with the Cake<\/td>\n<td style=\"text-align: center;\">10\/23\/03<\/td>\n<td style=\"text-align: center;\">18.77<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\"><strong>5<\/strong><\/td>\n<td style=\"text-align: center;\">The One Where Rachel&#8217;s Sister Babysits<\/td>\n<td style=\"text-align: center;\">10\/30\/03<\/td>\n<td style=\"text-align: center;\">19.37<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\"><strong>6<\/strong><\/td>\n<td style=\"text-align: center;\">The One with Ross&#8217;s Grant<\/td>\n<td style=\"text-align: center;\">11\/6\/03<\/td>\n<td style=\"text-align: center;\">20.38<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\"><strong>7<\/strong><\/td>\n<td style=\"text-align: center;\">The One with the Home Study<\/td>\n<td style=\"text-align: center;\">11\/13\/03<\/td>\n<td style=\"text-align: center;\">20.21<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\"><strong>8<\/strong><\/td>\n<td style=\"text-align: center;\">The One with the Late Thanksgiving<\/td>\n<td style=\"text-align: center;\">11\/20\/03<\/td>\n<td style=\"text-align: center;\">20.66<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\"><strong>9<\/strong><\/td>\n<td style=\"text-align: center;\">The One with the Birth Mother<\/td>\n<td style=\"text-align: center;\">1\/8\/04<\/td>\n<td style=\"text-align: center;\">25.49<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\"><strong>10<\/strong><\/td>\n<td style=\"text-align: center;\">The One Where Chandler Gets Caught<\/td>\n<td style=\"text-align: center;\">1\/15\/04<\/td>\n<td style=\"text-align: center;\">26.68<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\"><strong>11<\/strong><\/td>\n<td style=\"text-align: center;\">The One Where the Stripper Cries<\/td>\n<td style=\"text-align: center;\">2\/5\/04<\/td>\n<td style=\"text-align: center;\">24.91<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\"><strong>12<\/strong><\/td>\n<td style=\"text-align: center;\">The One with Phoebe&#8217;s Wedding<\/td>\n<td style=\"text-align: center;\">2\/12\/04<\/td>\n<td style=\"text-align: center;\">25.9<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\"><strong>13<\/strong><\/td>\n<td style=\"text-align: center;\">The One Where Joey Speaks French<\/td>\n<td style=\"text-align: center;\">2\/19\/04<\/td>\n<td style=\"text-align: center;\">24.27<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\"><strong>14<\/strong><\/td>\n<td style=\"text-align: center;\">The One with Princess Consuela<\/td>\n<td style=\"text-align: center;\">2\/26\/04<\/td>\n<td style=\"text-align: center;\">22.83<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\"><strong>15<\/strong><\/td>\n<td style=\"text-align: center;\">The One Where Estelle Dies<\/td>\n<td style=\"text-align: center;\">4\/22\/04<\/td>\n<td style=\"text-align: center;\">22.64<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\"><strong>16<\/strong><\/td>\n<td style=\"text-align: center;\">The One with Rachel&#8217;s Going Away Party<\/td>\n<td style=\"text-align: center;\">4\/29\/04<\/td>\n<td style=\"text-align: center;\">24.51<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\"><strong>17<\/strong><\/td>\n<td style=\"text-align: center;\">The Last One*<\/td>\n<td style=\"text-align: center;\">5\/6\/04<\/td>\n<td style=\"text-align: center;\">52.46<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\"><strong>18<\/strong><\/td>\n<td style=\"text-align: center;\">The Last One*<\/td>\n<td style=\"text-align: center;\">5\/6\/04<\/td>\n<td style=\"text-align: center;\">52.46<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<table class=\"fin-table gridded\">\n<caption class=\"center\">\u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0*Note: the final two episodes aired back-to-back on the same night\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0<\/caption>\n<thead><\/thead>\n<\/table>\n<p><span style=\"font-size: 1rem; text-align: initial;\">We&#8217;ll use technology to analyze this dataset.<\/span><\/p>\n<\/div>\n<div class=\"textbox\">\n<p>Go to the <em>Describing and Exploring Quantitative Variables<\/em> tool at <a href=\"https:\/\/dcmathpathways.shinyapps.io\/EDA_quantitative\/\" target=\"_blank\" rel=\"noopener\">https:\/\/dcmathpathways.shinyapps.io\/EDA_quantitative\/<\/a>.<\/p>\n<p style=\"padding-left: 30px;\">Step 1) Select the <strong>Single Group<\/strong> tab.<\/p>\n<p style=\"padding-left: 30px;\">Step 2) Locate the drop-down menu under <strong>Enter Data<\/strong> and select <strong>Your Own<\/strong>.<\/p>\n<p style=\"padding-left: 30px;\">Step 3) Under\u00a0<strong>Do you have<\/strong>, select\u00a0<strong>Individual Observations<\/strong>.<\/p>\n<p style=\"padding-left: 30px;\">Step 4)\u00a0Under <strong>Name of Variable<\/strong>, type \u201cU.S. Viewers (Millions).\u201d<\/p>\n<p style=\"padding-left: 30px;\">Step 5) Cut and paste or enter the data presented in the above table for U.S. Viewers (Millions).<\/p>\n<\/div>\n<div class=\"textbox key-takeaways\">\n<h3>question 1<\/h3>\n<p>Use the tool to calculate the median episode viewership for Season 10 of Friends.\u00a0You can scroll in the observations entry box to verify that you pasted the data correctly.<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q253029\">Hint<\/span><\/p>\n<div id=\"q253029\" class=\"hidden-answer\" style=\"display: none\">The median will be located in Descriptive Statistics.<\/div>\n<\/div>\n<\/div>\n<div class=\"textbox key-takeaways\">\n<h3>question 2<\/h3>\n<p>Which of the following does the median tell you about the number of people who watched episodes of Friends during Season 10?<\/p>\n<p style=\"padding-left: 30px;\">a) Half the episodes in Season 10 of Friends had more than 23.5 million viewers, and half the episodes had fewer than 23.5 million viewers.<\/p>\n<p style=\"padding-left: 30px;\">b) The most common episode viewership was 23.5 million viewers per episode during Season 10.<\/p>\n<p style=\"padding-left: 30px;\">c) If we took the total number of viewers for the whole season and split them equally among all 18 episodes, each episode would have about 23.5 million viewers.<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q497957\">Hint<\/span><\/p>\n<div id=\"q497957\" class=\"hidden-answer\" style=\"display: none\">The median is the 50th percentile and splits the data in half.<\/div>\n<\/div>\n<\/div>\n<div class=\"textbox key-takeaways\">\n<h3>question 3<\/h3>\n<p>Use the tool\u00a0to calculate the mean episode viewership for Season 10 of Friends.<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q704874\">Hint<\/span><\/p>\n<div id=\"q704874\" class=\"hidden-answer\" style=\"display: none\">The mean will be located in Descriptive Statistics<\/div>\n<\/div>\n<\/div>\n<div class=\"textbox key-takeaways\">\n<h3>question 4<\/h3>\n<p>Which of the following does the mean tell you about the number of people watching episodes of Friends during Season 10?<\/p>\n<p style=\"padding-left: 30px;\">a) Half the episodes in Season 10 of Friends had more than 26.1 million viewers, and half the episodes had fewer than 26.1 million viewers.<\/p>\n<p style=\"padding-left: 30px;\">b) The most common episode viewership was 26.1 million viewers per episode during Season 10.<\/p>\n<p style=\"padding-left: 30px;\">c) If we took the total number of viewers for the whole season and split them equally among all 18 episodes, each episode would have about 26.1 million viewers.<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q203004\">Hint<\/span><\/p>\n<div id=\"q203004\" class=\"hidden-answer\" style=\"display: none\">The mean is what we think of as the &#8220;average&#8221; value in the set.<\/div>\n<\/div>\n<\/div>\n<div class=\"textbox key-takeaways\">\n<h3>question 5<\/h3>\n<p>The mean number of viewers is _______ the median number of viewers.<\/p>\n<p style=\"padding-left: 30px;\">a) greater than<\/p>\n<p style=\"padding-left: 30px;\">b) less than<\/p>\n<p style=\"padding-left: 30px;\">c) roughly equal to<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q412918\">Hint<\/span><\/p>\n<div id=\"q412918\" class=\"hidden-answer\" style=\"display: none\">Use Descriptive Statistics in the tool to compare them.<\/div>\n<\/div>\n<\/div>\n<p>For this question, use the following histogram of the Season 10 Friends viewership data.<\/p>\n<p><strong><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-1010\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/11194844\/Picture43-300x112.png\" alt=\"A histogram labeled &quot;US Viewers (Millions)&quot; on the x-axis and &quot;Count&quot; on the y-axis. The x-axis is numbered in increments of five from 15 to 55 and the y-axis is numbered in increments of 1 from 0 to 4. For 18-19, the count is 1. For 19-20, the count is 1. For 20-21, the count is 3. For 21-22, the count is 1. For 22-23, the count is 3. For 24-25, the count is 4. For 25-26, the count is 2. For 26-27, the count is 1. For 52-53, the count is 2. For all other ranges, the count is 0.\" width=\"892\" height=\"333\" \/><\/strong><\/p>\n<div class=\"textbox key-takeaways\">\n<h3>question 6<\/h3>\n<p>Which of the following describes the distribution of the data?<\/p>\n<p style=\"padding-left: 30px;\">a) Left-skewed<\/p>\n<p style=\"padding-left: 30px;\">b) Symmetric<\/p>\n<p style=\"padding-left: 30px;\">c) Right-skewed<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q755933\">Hint<\/span><\/p>\n<div id=\"q755933\" class=\"hidden-answer\" style=\"display: none\">Refer to the definitions at the beginning of this assignment.<\/div>\n<\/div>\n<\/div>\n<div class=\"textbox key-takeaways\">\n<h3>question 7<\/h3>\n<p>Use what you see on the histogram to justify your answer to Question 5.<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q875490\">Hint<\/span><\/p>\n<div id=\"q875490\" class=\"hidden-answer\" style=\"display: none\">Consider the implications that the shape of the graph has for the size of the mean relative to the median.<\/div>\n<\/div>\n<\/div>\n<div class=\"textbox key-takeaways\">\n<h3>question 8<\/h3>\n<p>Which episodes have unusually high numbers of viewers?<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q55553\">Hint<\/span><\/p>\n<div id=\"q55553\" class=\"hidden-answer\" style=\"display: none\">Refer to the table to locate specific episodes.<\/div>\n<\/div>\n<\/div>\n<div class=\"textbox key-takeaways\">\n<h3>question 9<\/h3>\n<p>The last two episodes of Friends aired in a row on the same night. Why do you think these episodes have such high numbers of viewers?<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q972576\">Hint<\/span><\/p>\n<div id=\"q972576\" class=\"hidden-answer\" style=\"display: none\">What do <em>you<\/em> think?<\/div>\n<\/div>\n<\/div>\n<div class=\"textbox tryit\">\n<h3>effects of skew on mean and median<\/h3>\n<p><span style=\"background-color: #99cc00;\">[Perspective video &#8212; a 3-instructor video that shows how to think about the tail and the two outliers in the data above together with the fact that the mean is larger than the median to begin to understand that the mean tends to be pulled to the right of the median under a right skew.]\u00a0<\/span><\/p>\n<\/div>\n<h3 id=\"IdentSkew\">Relating Mean and Median to the Skewness of a Dataset from a Histogram<\/h3>\n<p>For each of the plots of data below, choose the description that matches the shape of the data\u2019s distribution, and then select the choice that gives the relationship between the mean and median for those data. Base your answers on the understanding you established in Questions 1 &#8211; 9 about the direction the mean was pulled in under the skewness in the dataset.<\/p>\n<div class=\"textbox key-takeaways\">\n<h3>question 10<\/h3>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-1012 size-medium\" style=\"background-color: initial;\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/11194917\/ChartPicture2-300x297.jpg\" alt=\"An unlabeled bar graph with seven bars. The bar on the far left is the highest. Moving to the right, each bar is progressively shorter than the last. In most places, this is by approximately the same amount, but it is a larger difference between the third and fourth bars.\" width=\"300\" height=\"297\" \/><\/p>\n<table style=\"border-collapse: collapse; width: 100%;\">\n<tbody>\n<tr>\n<td style=\"width: 50%;\"><span style=\"background-color: #ffff00;\">Is the distribution Left-skewed, Symmetric, or Right-skewed?<\/span><\/td>\n<td style=\"width: 50%;\"><span style=\"background-color: #ffff00;\">[drop down choices]<\/span><\/td>\n<\/tr>\n<tr>\n<td style=\"width: 50%;\"><span style=\"background-color: #ffff00;\">Is the mean greater than, less than, or roughly equal to the median?<\/span><\/td>\n<td style=\"width: 50%;\"><span style=\"background-color: #ffff00;\">[drop down choices]<\/span><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q16821\">Hint<\/span><\/p>\n<div id=\"q16821\" class=\"hidden-answer\" style=\"display: none\">Refer to the definitions at the top of the page and your answer to Question 7 for guidance.<\/div>\n<\/div>\n<\/div>\n<div class=\"textbox key-takeaways\">\n<h3>question 11<\/h3>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-1013 size-medium\" style=\"background-color: initial; font-size: 0.9em;\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/11194922\/ChartPicture3-300x287.jpg\" alt=\"An unlabeled bar graph with seven bars. The bar on the far right is the highest. Moving from the left, each bar is progressively taller than the last. In most places, this is by approximately the same amount, but it is a larger difference between the fourth and fifth bars.\" width=\"300\" height=\"287\" \/><\/p>\n<p>Is the distribution Left-skewed, Symmetric, or Right-skewed?<\/p>\n<p>Is the mean greater than, less than, or roughly equal to the median?<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q738598\">Hint<\/span><\/p>\n<div id=\"q738598\" class=\"hidden-answer\" style=\"display: none\">Refer to the definitions at the top of the page and your answer to Question 7 for guidance.<\/div>\n<\/div>\n<p><span style=\"background-color: #ffff00;\">[see question 10 as option for question\/answer method]<\/span><\/p>\n<\/div>\n<div class=\"textbox key-takeaways\">\n<h3>question 12<\/h3>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-1011 size-medium\" style=\"background-color: initial;\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/11194913\/ChartPicture1-300x300.png\" alt=\"An unlabeled bar graph. The bar is the center is the highest and going either direction away from it, the bars get shorter by equal increments.\" width=\"300\" height=\"300\" \/><\/p>\n<p>Is the distribution Left-skewed, Symmetric, or Right-skewed?<\/p>\n<p>Is the mean greater than, less than, or roughly equal to the median?<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q627907\">Hint<\/span><\/p>\n<div id=\"q627907\" class=\"hidden-answer\" style=\"display: none\">Refer to the definitions at the top of the page and your answer to Question 7 for guidance.<\/div>\n<\/div>\n<p><span style=\"background-color: #ffff00;\">[see question 10 as option for question\/answer method]<\/span><\/p>\n<\/div>\n<div class=\"textbox tryit\">\n<h3>Resistant and Nonresistant Measures of Center<\/h3>\n<p><span style=\"background-color: #99cc00;\">[Worked example &#8211; a 3-instructor video showing a symmetric dataset with the mean and median identical, then, skewing the distribution to show what happens to the mean while the median remains in place.]<\/span><\/p>\n<\/div>\n<div class=\"textbox key-takeaways\">\n<h3>question 13<\/h3>\n<p>Look back on your answers to Questions 10, 11, and 12. Which of mean or median appeared to be <strong>resistant<\/strong> to skew? That is, which of the two measures of center is not affected by the skewness of a graph?<\/p>\n<p style=\"padding-left: 30px;\">a) The mean is resistant to skew. The median is sensitive to skew and\/or the presence of outliers.<\/p>\n<p style=\"padding-left: 30px;\">b) The median is resistant to skew. The mean is sensitive to skew and\/or the presence of outliers.<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q56189\">Hint<\/span><\/p>\n<div id=\"q56189\" class=\"hidden-answer\" style=\"display: none\">\n<p>Consider which measure (mean or median) seemed to be &#8220;pulled&#8221; in the direction of the tail in the skewed distributions and which did not.\n<\/p><\/div>\n<\/div>\n<\/div>\n<p>Hopefully, you have noticed that when a distribution is symmetric, the mean and median occupy the same value. But under a skew, the mean is &#8220;pulled&#8221; in the direction of the outliers: greater than the median in the case of positive (right) skew, and less than the median in the case of negative (left) skew. It appears that the mean is affected by the presence of outliers while the median is not.<\/p>\n<h3>Looking ahead<\/h3>\n<p>Broadly speaking, we consider a value in a dataset to be an outlier if that value is unusual or extreme, given the other values in the dataset.<\/p>\n<p>Suppose you have two groups of people:<\/p>\n<ol style=\"list-style-type: lower-roman;\">\n<li style=\"list-style-type: none;\">\n<ol style=\"list-style-type: lower-roman;\">\n<li style=\"list-style-type: none;\">\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\">Group 1 is made up of five professional basketball players, and Group 2 is made up of four professional basketball players and one kindergartener.<\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\">Dataset 1 contains the number of three-pointers each person in Group 1 can make in one minute. Dataset 2 contains the number of three-pointers each person in Group 2 can make in an hour.<\/li>\n<\/ul>\n<\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<div class=\"textbox key-takeaways\">\n<h3>question 14<\/h3>\n<p>Which dataset do you think is more likely to contain an outlier?<\/p>\n<p style=\"padding-left: 30px;\">a) Group 1<\/p>\n<p style=\"padding-left: 30px;\">b) Group 2<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q785920\">Hint<\/span><\/p>\n<div id=\"q785920\" class=\"hidden-answer\" style=\"display: none\">Imagine a dotplot of the observations in each dataset.<\/div>\n<\/div>\n<\/div>\n<h2>Summary<\/h2>\n<p>In this section, you&#8217;ve learned about skewed distributions vs. symmetric distributions and how skew affects the mean of a data distribution. You also got some practice calculating and interpreting the mean and median of a dataset. Let&#8217;s summarize where these skills showed up in the material.<\/p>\n<ol style=\"list-style-type: lower-roman;\">\n<li style=\"list-style-type: none;\">\n<ol style=\"list-style-type: lower-roman;\">\n<li style=\"list-style-type: none;\">\n<ul>\n<li>In Question 1, you calculated the median of a dataset, and interpreted the median in Question 2.<\/li>\n<li>In Question 3, you calculated the mean of a dataset, and interpreted the mean in Question 4.<\/li>\n<li>In Question 5, you began to see how the mean and median relate in a distribution.<\/li>\n<li>In Questions 6, and 10 &#8211; 13, you used statistical terms for skew and extreme values to describe the features of a dataset, and began to make connections between the mean and median under differently shaped distributions.<\/li>\n<li>In Questions 7 -9, you interpreted the mean and median to make connections between them and the data distribution.<\/li>\n<li>In Question 13, you identified which of the mean or median is resistant to skew.<\/li>\n<\/ul>\n<\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<p>Being able to interpret the mean and median with regard to the shape of a distribution and the presence of outliers will be essential skills to use when assessing claims made about data that rely on measures of center. If you feel comfortable with these skills, please move on to the activity!<\/p>\n<hr class=\"before-footnotes clear\" \/><div class=\"footnotes\"><ol><li id=\"footnote-450-1\">Encyclopedia Britannica. (n.d.). Friends. In <em>Encyclopedia Britannica.com<\/em>. https:\/\/www.britannica.com\/topic\/Friends <a href=\"#return-footnote-450-1\" class=\"return-footnote\" aria-label=\"Return to footnote 1\">&crarr;<\/a><\/li><li id=\"footnote-450-2\">Mock, T. (2020). <em>A weekly data project aimed at the R ecosystem<\/em>. TidyTuesday. https:\/\/github.com\/rfordatascience\/tidytuesday\/blob\/master\/data\/2020\/2020-09-08\/readme.md#friends_infocsv <a href=\"#return-footnote-450-2\" class=\"return-footnote\" aria-label=\"Return to footnote 2\">&crarr;<\/a><\/li><\/ol><\/div>","protected":false},"author":25777,"menu_order":17,"template":"","meta":{"_candela_citation":"[]","CANDELA_OUTCOMES_GUID":"","pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-450","chapter","type-chapter","status-publish","hentry"],"part":621,"_links":{"self":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters\/450","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/users\/25777"}],"version-history":[{"count":39,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters\/450\/revisions"}],"predecessor-version":[{"id":3314,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters\/450\/revisions\/3314"}],"part":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/parts\/621"}],"metadata":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters\/450\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/media?parent=450"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapter-type?post=450"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/contributor?post=450"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/license?post=450"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}