{"id":87,"date":"2017-04-15T03:16:20","date_gmt":"2017-04-15T03:16:20","guid":{"rendered":"https:\/\/courses.lumenlearning.com\/conceptstest1\/chapter\/mean-and-median-2-of-2\/"},"modified":"2017-05-28T00:19:10","modified_gmt":"2017-05-28T00:19:10","slug":"mean-and-median-2-of-2","status":"web-only","type":"chapter","link":"https:\/\/courses.lumenlearning.com\/suny-hccc-wm-concepts-statistics\/chapter\/mean-and-median-2-of-2\/","title":{"raw":"Mean and Median (2 of 2)","rendered":"Mean and Median (2 of 2)"},"content":{"raw":"&nbsp;\r\n<div class=\"textbox learning-objectives\">\r\n<h3>Learning Objectives<\/h3>\r\n<ul>\r\n \t<li>Use mean and median to describe the center of a distribution.<\/li>\r\n<\/ul>\r\n<\/div>\r\n<h3>Choosing between Median and Mean<\/h3>\r\nWe now have a choice between two measurements of center. We can use the median, or we can use the mean. How do we decide which measurement to use?\r\n\r\nIn these next examples, we learn that the shape of the distribution and the presence of outliers helps us answer this question.\r\n<div class=\"textbox examples\">\r\n<h3>Example<\/h3>\r\n<h2>Homework Scores with an Outlier<\/h2>\r\nHere is a dotplot of the 26 homework scores earned by a student. Notice that the distribution of scores has an outlier. This student typically scores between 80 and 90 on homework, but there is one score of 0. Which measurement of center gives a better summary of this distribution?\r\n<ul>\r\n \t<li>Median = 84.5<\/li>\r\n \t<li>Mean = 81.8<\/li>\r\n<\/ul>\r\n<img class=\"alignnone\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/1729\/2017\/04\/15031613\/m2_summarizing_data_topic_2_2_Topic2_2MeanandMedian2of2_image1.png\" alt=\"Dotplot of a student's 26 homework scores that shows where the mean and median are for the student.\" width=\"565\" height=\"114\" \/>\r\n\r\nBoth measures of center are in the B grade range, but the median is a better summary of this student\u2019s homework scores. The outlier does not affect the median. This makes sense because the median depends primarily on the order of the data. Changing the lowest score does not affect the order of the scores, so the median is not affected by the value of this point.\r\n\r\nThe mean is not a good summary of this student\u2019s homework scores. The outlier decreases the mean so that the mean is a bit too low to be a representative measure of this student\u2019s typical performance. This makes sense because when we calculate the mean, we first add the scores together, then divide by the number of scores. Every score therefore affects the mean.\r\n\r\nNote: In the distribution above, there are 26 homework scores for this student. If the teacher made fewer homework assignments, a zero would have a greater impact on the mean. We can see this in the distribution below. This distribution has only 10 scores. The one grade of 0 moves the mean into the C grade range.\r\n\r\n<img class=\"alignnone\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/1729\/2017\/04\/15031615\/m2_summarizing_data_topic_2_2_Topic2_2MeanandMedian2of2_image2.png\" alt=\"Dotplot of a student's 10 homework scores, that shows the outlier, mean and median\" width=\"559\" height=\"111\" \/>\r\n\r\n<\/div>\r\n<div class=\"textbox examples\">\r\n<h3>Example<\/h3>\r\n<h2>Skewed Incomes<\/h2>\r\nIn this example, we look at how skewness in a data set affects the mean and median. The following histogram shows the personal income of a large sample of individuals drawn from U.S. census data for the year 2000. Notice that it is strongly skewed to the right. This type of skewness is often present in data sets of variables such as income.\r\n\r\n<img class=\"alignnone\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/1729\/2017\/04\/15031617\/m2_summarizing_data_topic_2_2_Topic2_2MeanandMedian2of2_image3.png\" alt=\"Histogram of U.S. census personal income data for a large sample population. The data is skewed far right \" width=\"545\" height=\"397\" \/>\r\n\r\nThe mean and median for this data set are\r\n<ul>\r\n \t<li>Mean = $24,000<\/li>\r\n \t<li>Median = $16,900<\/li>\r\n<\/ul>\r\nHere again we see that the mean income does not represent the typical income for this sample very well. The small number of people with higher incomes increase the mean. The mean is too high to represent the large number of people making less than $20,000 a year. A small number of high incomes gives the misleading impression that the typical income in the sample is $24,000. The small number of people with higher incomes does not impact the median, so the median income of $16,900 better represents the typical income in this sample.\r\n\r\n<\/div>\r\n<h3>What's the Main Point?<\/h3>\r\nThese examples illustrate some general guidelines for choosing a measure of center:\r\n<ul>\r\n \t<li>Use the mean as a measure of center <em>only<\/em> for distributions that are reasonably symmetric with a central peak. When outliers are present, the mean is not a good choice.<\/li>\r\n<\/ul>\r\n<ul>\r\n \t<li>Use the median as a measure of center for all other cases.<\/li>\r\n<\/ul>\r\nBoth of these examples also highlight another important principle: <em>Always plot the data<\/em>.\r\n\r\nWe need to use a graph to determine the shape of the distribution. By looking at the shape, we can determine which measures of center best describe the data.\r\n<div class=\"textbox exercises\">\r\n<h3>Learn By Doing<\/h3>\r\nhttps:\/\/assessments.lumenlearning.com\/assessments\/3443\r\n\r\nhttps:\/\/assessments.lumenlearning.com\/assessments\/3444\r\n\r\n<\/div>\r\nInstructions for using the simulation:\r\n<ul>\r\n \t<li>To add a point, move the slider to the value you want, then click <strong>Add<\/strong>.<\/li>\r\n \t<li>To remove a point, move the slider to the value you want, then click <strong>Minus<\/strong>.<\/li>\r\n \t<li>To reset the simulation, click the button in the upper left corner that says <strong>Reset<\/strong>.<\/li>\r\n<\/ul>\r\n<a href=\"https:\/\/s3-us-west-2.amazonaws.com\/oerfiles\/Concepts+in+Statistics\/interactives\/meanandmedian\/meanAndMedian.html\" target=\"new\">Click here to open this simulation in its own window.<\/a>\r\n\r\n<iframe id=\"_i_2b\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/oerfiles\/Concepts+in+Statistics\/interactives\/meanandmedian\/meanAndMedian.html\" width=\"750\" height=\"500\"><\/iframe>\r\n<h3><strong>Let\u2019s Summarize<\/strong><\/h3>\r\n<ul>\r\n \t<li>We have two different measurements for determining the center of a distribution: mean and median. When we use the term <em>center<\/em>, we mean a typical value that can represent the distribution of data.<\/li>\r\n<\/ul>\r\n<ul>\r\n \t<li>The <em>mean <\/em>is the average. We calculate the mean by adding the data values and dividing by the number of individual data points.<\/li>\r\n<\/ul>\r\n<ul>\r\n \t<li>The mean has the following properties:\r\n<ul>\r\n \t<li>It is the <em>fair-share<\/em> measure. For example, imagine that you have 10 homework scores. Say that your scores vary, but the mean is 84. Then you have 84(10) = 840 points, which is like having an 84 on each of the 10 assignments.<\/li>\r\n \t<li>The mean is also referred to as the <em>balancing point<\/em> of a distribution. If we measure the distance between each data point and the mean, the distances are balanced on each side of the mean.<\/li>\r\n<\/ul>\r\n<\/li>\r\n \t<li>The <em>median <\/em>is the physical center of the data when we make an ordered list. It has the same number of values above it as below it.<\/li>\r\n \t<li><strong>General Guidelines for Choosing a Measure of Center<\/strong>\r\n<ul>\r\n \t<li>Use the mean as a measure of center <em>only <\/em>for distributions that are reasonably symmetric with a central peak. When outliers are present, the mean is not a good choice.<\/li>\r\n \t<li>Use the median as a measure of center for all other cases.<\/li>\r\n<\/ul>\r\n<\/li>\r\n \t<li><em>Always plot the data. <\/em>We need to use a graph to determine the shape of the distribution. By looking at the shape, we can determine which measures of center best describe the data.<\/li>\r\n<\/ul>\r\n<h3><\/h3>","rendered":"<p>&nbsp;<\/p>\n<div class=\"textbox learning-objectives\">\n<h3>Learning Objectives<\/h3>\n<ul>\n<li>Use mean and median to describe the center of a distribution.<\/li>\n<\/ul>\n<\/div>\n<h3>Choosing between Median and Mean<\/h3>\n<p>We now have a choice between two measurements of center. We can use the median, or we can use the mean. How do we decide which measurement to use?<\/p>\n<p>In these next examples, we learn that the shape of the distribution and the presence of outliers helps us answer this question.<\/p>\n<div class=\"textbox examples\">\n<h3>Example<\/h3>\n<h2>Homework Scores with an Outlier<\/h2>\n<p>Here is a dotplot of the 26 homework scores earned by a student. Notice that the distribution of scores has an outlier. This student typically scores between 80 and 90 on homework, but there is one score of 0. Which measurement of center gives a better summary of this distribution?<\/p>\n<ul>\n<li>Median = 84.5<\/li>\n<li>Mean = 81.8<\/li>\n<\/ul>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/1729\/2017\/04\/15031613\/m2_summarizing_data_topic_2_2_Topic2_2MeanandMedian2of2_image1.png\" alt=\"Dotplot of a student's 26 homework scores that shows where the mean and median are for the student.\" width=\"565\" height=\"114\" \/><\/p>\n<p>Both measures of center are in the B grade range, but the median is a better summary of this student\u2019s homework scores. The outlier does not affect the median. This makes sense because the median depends primarily on the order of the data. Changing the lowest score does not affect the order of the scores, so the median is not affected by the value of this point.<\/p>\n<p>The mean is not a good summary of this student\u2019s homework scores. The outlier decreases the mean so that the mean is a bit too low to be a representative measure of this student\u2019s typical performance. This makes sense because when we calculate the mean, we first add the scores together, then divide by the number of scores. Every score therefore affects the mean.<\/p>\n<p>Note: In the distribution above, there are 26 homework scores for this student. If the teacher made fewer homework assignments, a zero would have a greater impact on the mean. We can see this in the distribution below. This distribution has only 10 scores. The one grade of 0 moves the mean into the C grade range.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/1729\/2017\/04\/15031615\/m2_summarizing_data_topic_2_2_Topic2_2MeanandMedian2of2_image2.png\" alt=\"Dotplot of a student's 10 homework scores, that shows the outlier, mean and median\" width=\"559\" height=\"111\" \/><\/p>\n<\/div>\n<div class=\"textbox examples\">\n<h3>Example<\/h3>\n<h2>Skewed Incomes<\/h2>\n<p>In this example, we look at how skewness in a data set affects the mean and median. The following histogram shows the personal income of a large sample of individuals drawn from U.S. census data for the year 2000. Notice that it is strongly skewed to the right. This type of skewness is often present in data sets of variables such as income.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/1729\/2017\/04\/15031617\/m2_summarizing_data_topic_2_2_Topic2_2MeanandMedian2of2_image3.png\" alt=\"Histogram of U.S. census personal income data for a large sample population. The data is skewed far right\" width=\"545\" height=\"397\" \/><\/p>\n<p>The mean and median for this data set are<\/p>\n<ul>\n<li>Mean = $24,000<\/li>\n<li>Median = $16,900<\/li>\n<\/ul>\n<p>Here again we see that the mean income does not represent the typical income for this sample very well. The small number of people with higher incomes increase the mean. The mean is too high to represent the large number of people making less than $20,000 a year. A small number of high incomes gives the misleading impression that the typical income in the sample is $24,000. The small number of people with higher incomes does not impact the median, so the median income of $16,900 better represents the typical income in this sample.<\/p>\n<\/div>\n<h3>What&#8217;s the Main Point?<\/h3>\n<p>These examples illustrate some general guidelines for choosing a measure of center:<\/p>\n<ul>\n<li>Use the mean as a measure of center <em>only<\/em> for distributions that are reasonably symmetric with a central peak. When outliers are present, the mean is not a good choice.<\/li>\n<\/ul>\n<ul>\n<li>Use the median as a measure of center for all other cases.<\/li>\n<\/ul>\n<p>Both of these examples also highlight another important principle: <em>Always plot the data<\/em>.<\/p>\n<p>We need to use a graph to determine the shape of the distribution. By looking at the shape, we can determine which measures of center best describe the data.<\/p>\n<div class=\"textbox exercises\">\n<h3>Learn By Doing<\/h3>\n<p>\t<iframe id=\"lumen_assessment_3443\" class=\"resizable\" src=\"https:\/\/assessments.lumenlearning.com\/assessments\/load?assessment_id=3443&#38;embed=1&#38;external_user_id=&#38;external_context_id=&#38;iframe_resize_id=lumen_assessment_3443\" frameborder=\"0\" style=\"border:none;width:100%;height:100%;min-height:400px;\"><br \/>\n\t<\/iframe><\/p>\n<p>\t<iframe id=\"lumen_assessment_3444\" class=\"resizable\" src=\"https:\/\/assessments.lumenlearning.com\/assessments\/load?assessment_id=3444&#38;embed=1&#38;external_user_id=&#38;external_context_id=&#38;iframe_resize_id=lumen_assessment_3444\" frameborder=\"0\" style=\"border:none;width:100%;height:100%;min-height:400px;\"><br \/>\n\t<\/iframe><\/p>\n<\/div>\n<p>Instructions for using the simulation:<\/p>\n<ul>\n<li>To add a point, move the slider to the value you want, then click <strong>Add<\/strong>.<\/li>\n<li>To remove a point, move the slider to the value you want, then click <strong>Minus<\/strong>.<\/li>\n<li>To reset the simulation, click the button in the upper left corner that says <strong>Reset<\/strong>.<\/li>\n<\/ul>\n<p><a href=\"https:\/\/s3-us-west-2.amazonaws.com\/oerfiles\/Concepts+in+Statistics\/interactives\/meanandmedian\/meanAndMedian.html\" target=\"new\">Click here to open this simulation in its own window.<\/a><\/p>\n<p><iframe loading=\"lazy\" id=\"_i_2b\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/oerfiles\/Concepts+in+Statistics\/interactives\/meanandmedian\/meanAndMedian.html\" width=\"750\" height=\"500\"><\/iframe><\/p>\n<h3><strong>Let\u2019s Summarize<\/strong><\/h3>\n<ul>\n<li>We have two different measurements for determining the center of a distribution: mean and median. When we use the term <em>center<\/em>, we mean a typical value that can represent the distribution of data.<\/li>\n<\/ul>\n<ul>\n<li>The <em>mean <\/em>is the average. We calculate the mean by adding the data values and dividing by the number of individual data points.<\/li>\n<\/ul>\n<ul>\n<li>The mean has the following properties:\n<ul>\n<li>It is the <em>fair-share<\/em> measure. For example, imagine that you have 10 homework scores. Say that your scores vary, but the mean is 84. Then you have 84(10) = 840 points, which is like having an 84 on each of the 10 assignments.<\/li>\n<li>The mean is also referred to as the <em>balancing point<\/em> of a distribution. If we measure the distance between each data point and the mean, the distances are balanced on each side of the mean.<\/li>\n<\/ul>\n<\/li>\n<li>The <em>median <\/em>is the physical center of the data when we make an ordered list. It has the same number of values above it as below it.<\/li>\n<li><strong>General Guidelines for Choosing a Measure of Center<\/strong>\n<ul>\n<li>Use the mean as a measure of center <em>only <\/em>for distributions that are reasonably symmetric with a central peak. When outliers are present, the mean is not a good choice.<\/li>\n<li>Use the median as a measure of center for all other cases.<\/li>\n<\/ul>\n<\/li>\n<li><em>Always plot the data. <\/em>We need to use a graph to determine the shape of the distribution. By looking at the shape, we can determine which measures of center best describe the data.<\/li>\n<\/ul>\n<h3><\/h3>\n\n\t\t\t <section class=\"citations-section\" role=\"contentinfo\">\n\t\t\t <h3>Candela Citations<\/h3>\n\t\t\t\t\t <div>\n\t\t\t\t\t\t <div id=\"citation-list-87\">\n\t\t\t\t\t\t\t <div class=\"licensing\"><div class=\"license-attribution-dropdown-subheading\">CC licensed content, Shared previously<\/div><ul class=\"citation-list\"><li>Concepts in Statistics. <strong>Provided by<\/strong>: Open Learning Initiative. <strong>Located at<\/strong>: <a target=\"_blank\" href=\"http:\/\/oli.cmu.edu\">http:\/\/oli.cmu.edu<\/a>. <strong>License<\/strong>: <em><a target=\"_blank\" rel=\"license\" href=\"https:\/\/creativecommons.org\/licenses\/by\/4.0\/\">CC BY: Attribution<\/a><\/em><\/li><\/ul><\/div>\n\t\t\t\t\t\t <\/div>\n\t\t\t\t\t <\/div>\n\t\t\t <\/section>","protected":false},"author":163,"menu_order":14,"template":"","meta":{"_candela_citation":"[{\"type\":\"cc\",\"description\":\"Concepts in Statistics\",\"author\":\"\",\"organization\":\"Open Learning Initiative\",\"url\":\"http:\/\/oli.cmu.edu\",\"project\":\"\",\"license\":\"cc-by\",\"license_terms\":\"\"}]","CANDELA_OUTCOMES_GUID":"db2250c6-f9ce-43c8-b1ef-31b3c4e37a6f","pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-87","chapter","type-chapter","status-web-only","hentry"],"part":43,"_links":{"self":[{"href":"https:\/\/courses.lumenlearning.com\/suny-hccc-wm-concepts-statistics\/wp-json\/pressbooks\/v2\/chapters\/87","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/courses.lumenlearning.com\/suny-hccc-wm-concepts-statistics\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/courses.lumenlearning.com\/suny-hccc-wm-concepts-statistics\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/suny-hccc-wm-concepts-statistics\/wp-json\/wp\/v2\/users\/163"}],"version-history":[{"count":7,"href":"https:\/\/courses.lumenlearning.com\/suny-hccc-wm-concepts-statistics\/wp-json\/pressbooks\/v2\/chapters\/87\/revisions"}],"predecessor-version":[{"id":1323,"href":"https:\/\/courses.lumenlearning.com\/suny-hccc-wm-concepts-statistics\/wp-json\/pressbooks\/v2\/chapters\/87\/revisions\/1323"}],"part":[{"href":"https:\/\/courses.lumenlearning.com\/suny-hccc-wm-concepts-statistics\/wp-json\/pressbooks\/v2\/parts\/43"}],"metadata":[{"href":"https:\/\/courses.lumenlearning.com\/suny-hccc-wm-concepts-statistics\/wp-json\/pressbooks\/v2\/chapters\/87\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/courses.lumenlearning.com\/suny-hccc-wm-concepts-statistics\/wp-json\/wp\/v2\/media?parent=87"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/suny-hccc-wm-concepts-statistics\/wp-json\/pressbooks\/v2\/chapter-type?post=87"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/suny-hccc-wm-concepts-statistics\/wp-json\/wp\/v2\/contributor?post=87"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/suny-hccc-wm-concepts-statistics\/wp-json\/wp\/v2\/license?post=87"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}