{"id":69,"date":"2017-04-15T03:15:56","date_gmt":"2017-04-15T03:15:56","guid":{"rendered":"https:\/\/courses.lumenlearning.com\/conceptstest1\/chapter\/histograms-2-of-4\/"},"modified":"2022-08-01T16:04:20","modified_gmt":"2022-08-01T16:04:20","slug":"histograms-2-of-4","status":"publish","type":"chapter","link":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/chapter\/histograms-2-of-4\/","title":{"raw":"Histograms (2 of 4)","rendered":"Histograms (2 of 4)"},"content":{"raw":"<div class=\"textbox learning-objectives\">\r\n<h3>Learning OUTCOMES<\/h3>\r\n<ul>\r\n \t<li>Describe the distribution of quantitative data using a histogram.<\/li>\r\n<\/ul>\r\n<\/div>\r\nWe have discussed two types of graphs that summarize a distribution of a quantitative variable: dotplots and histograms.\r\n\r\nFrom a dotplot, we also described the pattern in the data with statements about shape, center, and spread. We have to be more cautious making similar statements using a histogram because our perception of shape, center, and spread can be affected by how the bins are defined. We investigate this important point in the next example.\r\n<div class=\"textbox exercises\">\r\n<h3>Example<\/h3>\r\nWe used the <em>same set of data <\/em>to construct these three histograms of student scores. Are you surprised by how different the distribution looks in each histogram?\r\n\r\n<img class=\"alignnone\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/1729\/2017\/04\/15031552\/m2_summarizing_data_topic_2_1_Topic2_1Histograms2of4_image1.png\" alt=\"Three histograms illustrating how bin width affects distribution, with the percentages spreading out more in each graph.\" width=\"612\" height=\"163\" \/>\r\n\r\nThe histogram on the left has a bin width of 20. The first bin starts at 40. To create the middle histogram, we changed the bin width to 10 but kept the first bin starting at 40. 
To create the last histogram, we kept the bin width at 10 but started the first bin at 45.\r\n\r\nThese changes affect our description of the shape, center, and spread of this set of data. For example, in the histogram on the left, the distribution looks symmetric with a central peak. In the histogram on the right, the distribution looks slightly skewed to the right. Based on the middle histogram, we might estimate that most students scored between 70 and 80. But the histogram on the right suggests that typical students scored between 65 and 75.\r\n\r\n<strong>Why does changing the bin size and the starting point of the first bin change the histogram so drastically?<\/strong>\r\n\r\nWhen we change the bins, the data gets grouped differently. The different grouping affects the appearance of the histogram.\r\n\r\nTo illustrate this point, we highlighted the five students who scored in the 70s in each histogram.\r\n<ul>\r\n \t<li>In the histogram on the left, these five students are grouped in the middle bin with other students who scored between 60 and 80.<\/li>\r\n \t<li>In the histogram in the middle, these five students form a bin of their own, since no other students scored between 70 and 80.<\/li>\r\n \t<li>In the histogram on the right, these five students are in separate bins.<\/li>\r\n<\/ul>\r\n<img class=\"alignnone\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/1729\/2017\/04\/15031555\/m2_summarizing_data_topic_2_1_Topic2_1Histograms2of4_image2.png\" alt=\"Three histograms showing the importance of appropriately sized bin width. In the first, the highest bar shows between sixty and eighty percent. In the second it expands to show between seventy and eighty percent. In the third, it shows that the highest percentage was in the sixty fifth to seventy fifth percentile. 
\" width=\"689\" height=\"257\" \/>\r\n\r\n<strong>Which histogram gives the most helpful summary of the distribution?<\/strong>\r\n\r\nFor this situation, the middle histogram is probably the most useful summary because the intervals correspond to letter grades.\r\n\r\nOur general advice is as follows:\r\n<ul>\r\n \t<li>Avoid histograms with large bin widths that group data into only a few bins. A histogram constructed with large bin widths will show the distribution as a \u201cskyscraper.\u201d This does not give good information about variability in the distribution.<\/li>\r\n \t<li>Avoid histograms with small bin widths that group data into lots of bins. A histogram constructed with small bin widths will show the distribution as a \u201cpancake.\u201d This does not help us see the pattern in the data.<\/li>\r\n<\/ul>\r\n<\/div>\r\nUse the simulation below to answer the questions in the next Try It.\r\n\r\n<a href=\"https:\/\/s3-us-west-2.amazonaws.com\/oerfiles\/Concepts+in+Statistics\/interactives\/histogram_of_grades\/histogram_of_grades.html\" target=\"new\">Click here to open this simulation in its own window.<\/a>\r\n\r\n<iframe id=\"_i_3a\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/oerfiles\/Concepts+in+Statistics\/interactives\/histogram_of_grades\/histogram_of_grades.html\" width=\"875\" height=\"600\"><\/iframe>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Try It<\/h3>\r\nhttps:\/\/assess.lumenlearning.com\/practice\/cd76a0c5-b572-4257-8ce2-1a22d84dd67b\r\n\r\nhttps:\/\/assess.lumenlearning.com\/practice\/2ee5b099-c645-4f96-826f-a3580403ce33\r\n\r\nhttps:\/\/assess.lumenlearning.com\/practice\/f7381a3e-d40a-43c0-9895-907b52be1a37\r\n\r\n<\/div>\r\nThese next exercises focus on recognizing the shape of a distribution using a histogram. We know that changes in the bin width can change the appearance of the distribution. 
But a histogram with an appropriate bin width can give good information about the shape of the distribution.\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Try It<\/h3>\r\nhttps:\/\/assess.lumenlearning.com\/practice\/b57b0c91-7084-4a47-a6e8-9475a7062aa5\r\n\r\n<\/div>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Try It<\/h3>\r\nhttps:\/\/assess.lumenlearning.com\/practice\/09834a0b-62ee-44f2-a676-3a1450387c5d\r\n\r\nhttps:\/\/assess.lumenlearning.com\/practice\/0458e3ae-90f6-4db6-b42e-edb28d2c78bc\r\n\r\nhttps:\/\/assess.lumenlearning.com\/practice\/35fb9e8f-2e17-44c0-82ce-27397a9a603e\r\n\r\nhttps:\/\/assess.lumenlearning.com\/practice\/00457cde-7aba-4e8b-9663-e691793a2f74\r\n\r\n<\/div>\r\n<h2>Contribute!<\/h2><div style=\"margin-bottom: 8px;\">Did you have an idea for improving this content? We\u2019d love your input.<\/div><a href=\"https:\/\/docs.google.com\/document\/d\/1l9js3166FFNOgXkkJqgr9mEeotESWG3OTKSy8KnYa1I\" target=\"_blank\" style=\"font-size: 10pt; font-weight: 600; color: #077fab; text-decoration: none; border: 2px solid #077fab; border-radius: 7px; padding: 5px 25px; text-align: center; cursor: pointer; line-height: 1.5em;\">Improve this page<\/a><a style=\"margin-left: 16px;\" target=\"_blank\" href=\"https:\/\/docs.google.com\/document\/d\/1vy-T6DtTF-BbMfpVEI7VP_R7w2A4anzYZLXR8Pk4Fu4\">Learn More<\/a>","rendered":"<div class=\"textbox learning-objectives\">\n<h3>Learning OUTCOMES<\/h3>\n<ul>\n<li>Describe the distribution of quantitative data using a histogram.<\/li>\n<\/ul>\n<\/div>\n<p>We have discussed two types of graphs that summarize a distribution of a quantitative variable: dotplots and histograms.<\/p>\n<p>From a dotplot, we also described the pattern in the data with statements about shape, center, and spread. We have to be more cautious making similar statements using a histogram because our perception of shape, center, and spread can be affected by how the bins are defined. 
We investigate this important point in the next example.<\/p>\n<div class=\"textbox exercises\">\n<h3>Example<\/h3>\n<p>We used the <em>same set of data <\/em>to construct these three histograms of student scores. Are you surprised by how different the distribution looks in each histogram?<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/1729\/2017\/04\/15031552\/m2_summarizing_data_topic_2_1_Topic2_1Histograms2of4_image1.png\" alt=\"Three histograms illustrating how bin width affects distribution, with the percentages spreading out more in each graph.\" width=\"612\" height=\"163\" \/><\/p>\n<p>The histogram on the left has a bin width of 20. The first bin starts at 40. To create the middle histogram, we changed the bin width to 10 but kept the first bin starting at 40. To create the last histogram, we kept the bin width at 10 but started the first bin at 45.<\/p>\n<p>These changes affect our description of the shape, center, and spread of this set of data. For example, in the histogram on the left, the distribution looks symmetric with a central peak. In the histogram on the right, the distribution looks slightly skewed to the right. Based on the middle histogram, we might estimate that most students scored between 70 and 80. But the histogram on the right suggests that typical students scored between 65 and 75.<\/p>\n<p><strong>Why does changing the bin size and the starting point of the first bin change the histogram so drastically?<\/strong><\/p>\n<p>When we change the bins, the data gets grouped differently. 
The different grouping affects the appearance of the histogram.<\/p>\n<p>To illustrate this point, we highlighted the five students who scored in the 70s in each histogram.<\/p>\n<ul>\n<li>In the histogram on the left, these five students are grouped in the middle bin with other students who scored between 60 and 80.<\/li>\n<li>In the histogram in the middle, these five students form a bin of their own, since no other students scored between 70 and 80.<\/li>\n<li>In the histogram on the right, these five students are in separate bins.<\/li>\n<\/ul>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/1729\/2017\/04\/15031555\/m2_summarizing_data_topic_2_1_Topic2_1Histograms2of4_image2.png\" alt=\"Three histograms showing the importance of appropriately sized bin width. In the first, the highest bar shows between sixty and eighty percent. In the second it expands to show between seventy and eighty percent. In the third, it shows that the highest percentage was in the sixty fifth to seventy fifth percentile.\" width=\"689\" height=\"257\" \/><\/p>\n<p><strong>Which histogram gives the most helpful summary of the distribution?<\/strong><\/p>\n<p>For this situation, the middle histogram is probably the most useful summary because the intervals correspond to letter grades.<\/p>\n<p>Our general advice is as follows:<\/p>\n<ul>\n<li>Avoid histograms with large bin widths that group data into only a few bins. A histogram constructed with large bin widths will show the distribution as a \u201cskyscraper.\u201d This does not give good information about variability in the distribution.<\/li>\n<li>Avoid histograms with small bin widths that group data into lots of bins. 
A histogram constructed with small bin widths will show the distribution as a \u201cpancake.\u201d This does not help us see the pattern in the data.<\/li>\n<\/ul>\n<\/div>\n<p>Use the simulation below to answer the questions in the next Try It.<\/p>\n<p><a href=\"https:\/\/s3-us-west-2.amazonaws.com\/oerfiles\/Concepts+in+Statistics\/interactives\/histogram_of_grades\/histogram_of_grades.html\" target=\"new\">Click here to open this simulation in its own window.<\/a><\/p>\n<p><iframe loading=\"lazy\" id=\"_i_3a\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/oerfiles\/Concepts+in+Statistics\/interactives\/histogram_of_grades\/histogram_of_grades.html\" width=\"875\" height=\"600\"><\/iframe><\/p>\n<div class=\"textbox key-takeaways\">\n<h3>Try It<\/h3>\n<p>\t<iframe id=\"assessment_practice_cd76a0c5-b572-4257-8ce2-1a22d84dd67b\" class=\"resizable\" src=\"https:\/\/assess.lumenlearning.com\/practice\/cd76a0c5-b572-4257-8ce2-1a22d84dd67b?iframe_resize_id=assessment_practice_id_cd76a0c5-b572-4257-8ce2-1a22d84dd67b\" frameborder=\"0\" style=\"border:none;width:100%;height:100%;min-height:300px;\"><br \/>\n\t<\/iframe><\/p>\n<p>\t<iframe id=\"assessment_practice_2ee5b099-c645-4f96-826f-a3580403ce33\" class=\"resizable\" src=\"https:\/\/assess.lumenlearning.com\/practice\/2ee5b099-c645-4f96-826f-a3580403ce33?iframe_resize_id=assessment_practice_id_2ee5b099-c645-4f96-826f-a3580403ce33\" frameborder=\"0\" style=\"border:none;width:100%;height:100%;min-height:300px;\"><br \/>\n\t<\/iframe><\/p>\n<p>\t<iframe id=\"assessment_practice_f7381a3e-d40a-43c0-9895-907b52be1a37\" class=\"resizable\" src=\"https:\/\/assess.lumenlearning.com\/practice\/f7381a3e-d40a-43c0-9895-907b52be1a37?iframe_resize_id=assessment_practice_id_f7381a3e-d40a-43c0-9895-907b52be1a37\" frameborder=\"0\" style=\"border:none;width:100%;height:100%;min-height:300px;\"><br \/>\n\t<\/iframe><\/p>\n<\/div>\n<p>These next exercises focus on recognizing the shape of a distribution using a histogram. 
We know that changes in the bin width can change the appearance of the distribution. But a histogram with an appropriate bin width can give good information about the shape of the distribution.<\/p>\n<div class=\"textbox key-takeaways\">\n<h3>Try It<\/h3>\n<p>\t<iframe id=\"assessment_practice_b57b0c91-7084-4a47-a6e8-9475a7062aa5\" class=\"resizable\" src=\"https:\/\/assess.lumenlearning.com\/practice\/b57b0c91-7084-4a47-a6e8-9475a7062aa5?iframe_resize_id=assessment_practice_id_b57b0c91-7084-4a47-a6e8-9475a7062aa5\" frameborder=\"0\" style=\"border:none;width:100%;height:100%;min-height:300px;\"><br \/>\n\t<\/iframe><\/p>\n<\/div>\n<div class=\"textbox key-takeaways\">\n<h3>Try It<\/h3>\n<p>\t<iframe id=\"assessment_practice_09834a0b-62ee-44f2-a676-3a1450387c5d\" class=\"resizable\" src=\"https:\/\/assess.lumenlearning.com\/practice\/09834a0b-62ee-44f2-a676-3a1450387c5d?iframe_resize_id=assessment_practice_id_09834a0b-62ee-44f2-a676-3a1450387c5d\" frameborder=\"0\" style=\"border:none;width:100%;height:100%;min-height:300px;\"><br \/>\n\t<\/iframe><\/p>\n<p>\t<iframe id=\"assessment_practice_0458e3ae-90f6-4db6-b42e-edb28d2c78bc\" class=\"resizable\" src=\"https:\/\/assess.lumenlearning.com\/practice\/0458e3ae-90f6-4db6-b42e-edb28d2c78bc?iframe_resize_id=assessment_practice_id_0458e3ae-90f6-4db6-b42e-edb28d2c78bc\" frameborder=\"0\" style=\"border:none;width:100%;height:100%;min-height:300px;\"><br \/>\n\t<\/iframe><\/p>\n<p>\t<iframe id=\"assessment_practice_35fb9e8f-2e17-44c0-82ce-27397a9a603e\" class=\"resizable\" src=\"https:\/\/assess.lumenlearning.com\/practice\/35fb9e8f-2e17-44c0-82ce-27397a9a603e?iframe_resize_id=assessment_practice_id_35fb9e8f-2e17-44c0-82ce-27397a9a603e\" frameborder=\"0\" style=\"border:none;width:100%;height:100%;min-height:300px;\"><br \/>\n\t<\/iframe><\/p>\n<p>\t<iframe id=\"assessment_practice_00457cde-7aba-4e8b-9663-e691793a2f74\" class=\"resizable\" 
src=\"https:\/\/assess.lumenlearning.com\/practice\/00457cde-7aba-4e8b-9663-e691793a2f74?iframe_resize_id=assessment_practice_id_00457cde-7aba-4e8b-9663-e691793a2f74\" frameborder=\"0\" style=\"border:none;width:100%;height:100%;min-height:300px;\"><br \/>\n\t<\/iframe><\/p>\n<\/div>\n<h2>Contribute!<\/h2>\n<div style=\"margin-bottom: 8px;\">Did you have an idea for improving this content? We\u2019d love your input.<\/div>\n<p><a href=\"https:\/\/docs.google.com\/document\/d\/1l9js3166FFNOgXkkJqgr9mEeotESWG3OTKSy8KnYa1I\" target=\"_blank\" style=\"font-size: 10pt; font-weight: 600; color: #077fab; text-decoration: none; border: 2px solid #077fab; border-radius: 7px; padding: 5px 25px; text-align: center; cursor: pointer; line-height: 1.5em;\">Improve this page<\/a><a style=\"margin-left: 16px;\" target=\"_blank\" href=\"https:\/\/docs.google.com\/document\/d\/1vy-T6DtTF-BbMfpVEI7VP_R7w2A4anzYZLXR8Pk4Fu4\">Learn More<\/a><\/p>\n\n\t\t\t <section class=\"citations-section\" role=\"contentinfo\">\n\t\t\t <h3>Candela Citations<\/h3>\n\t\t\t\t\t <div>\n\t\t\t\t\t\t <div id=\"citation-list-69\">\n\t\t\t\t\t\t\t <div class=\"licensing\"><div class=\"license-attribution-dropdown-subheading\">CC licensed content, Shared previously<\/div><ul class=\"citation-list\"><li>Concepts in Statistics. <strong>Provided by<\/strong>: Open Learning Initiative. <strong>Located at<\/strong>: <a target=\"_blank\" href=\"http:\/\/oli.cmu.edu\">http:\/\/oli.cmu.edu<\/a>. 
<strong>License<\/strong>: <em><a target=\"_blank\" rel=\"license\" href=\"https:\/\/creativecommons.org\/licenses\/by\/4.0\/\">CC BY: Attribution<\/a><\/em><\/li><\/ul><\/div>\n\t\t\t\t\t\t <\/div>\n\t\t\t\t\t <\/div>\n\t\t\t <\/section>","protected":false},"author":163,"menu_order":9,"template":"","meta":{"_candela_citation":"[{\"type\":\"cc\",\"description\":\"Concepts in Statistics\",\"author\":\"\",\"organization\":\"Open Learning Initiative\",\"url\":\"http:\/\/oli.cmu.edu\",\"project\":\"\",\"license\":\"cc-by\",\"license_terms\":\"\"}]","CANDELA_OUTCOMES_GUID":"3880a53a-a158-489e-9e22-6ab866dde55f, 15c8f8ce-6aed-42e7-88d0-cc3392d93e27","pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-69","chapter","type-chapter","status-publish","hentry"],"part":43,"_links":{"self":[{"href":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/wp-json\/pressbooks\/v2\/chapters\/69","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/wp-json\/wp\/v2\/users\/163"}],"version-history":[{"count":8,"href":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/wp-json\/pressbooks\/v2\/chapters\/69\/revisions"}],"predecessor-version":[{"id":2717,"href":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/wp-json\/pressbooks\/v2\/chapters\/69\/revisions\/2717"}],"part":[{"href":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/wp-json\/pressbooks\/v2\/parts\/43"}],"metadata":[{"href":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/wp-json\/pressbooks\/v2\/chapters\/69\/metadata\/"}],"wp:attachment":[{"href":"h
ttps:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/wp-json\/wp\/v2\/media?parent=69"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/wp-json\/pressbooks\/v2\/chapter-type?post=69"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/wp-json\/wp\/v2\/contributor?post=69"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/wm-concepts-statistics\/wp-json\/wp\/v2\/license?post=69"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}