{"id":5538,"date":"2022-09-21T15:50:34","date_gmt":"2022-09-21T15:50:34","guid":{"rendered":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/?post_type=chapter&#038;p=5538"},"modified":"2022-10-17T16:32:54","modified_gmt":"2022-10-17T16:32:54","slug":"16d-inclass","status":"publish","type":"chapter","link":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/chapter\/16d-inclass\/","title":{"raw":"16D InClass","rendered":"16D InClass"},"content":{"raw":"<div id=\"bp-page-1\" class=\"page\" data-page-number=\"1\" data-loaded=\"true\">\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 1<\/h3>\r\n<div class=\"textLayer\">1) Choose one of the following options and explain how you made your choice. Data on incomes areusually...<\/div>\r\n<div class=\"textLayer\">a) Skewed right<\/div>\r\n<div class=\"textLayer\">b) Skewed left<\/div>\r\n<div class=\"textLayer\">c) Symmetric<\/div>\r\n<\/div>\r\n<\/div>\r\n<div><img class=\"\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27012312\/Picture97-300x160.jpg\" alt=\"Figurines of golfers on tall stacks of coins next to shorter stacks of coins with figurines of construction workers on them.\" width=\"572\" height=\"305\" \/><\/div>\r\n<div class=\"textLayer\">The data for this in-class activity are from the Gapminder site on global development, which shows 2018 data on countries\u2019 income per person[footnote]Gapminder. (n.d.). GDP per capita in constant PPP dollars. http:\/\/gapm.io\/dgdppc[\/footnote] (in standardized dollar amounts) and life expectancy.[footnote]Gapminder. (n.d.). Life expectancy at birth. http:\/\/gapm.io\/ilexDotplot created atstapplet.comQatarUSASingaporeAlbaniaCredit: iStock\/hyejin kang[\/footnote] Each data point represents a different nation.<\/div>\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 2<\/h3>\r\n<div class=\"textLayer\">2) The following is a dotplot showing the income per person among all the nations in the dataset.<\/div>\r\n<div><img class=\"\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27012319\/Picture981-300x48.png\" alt=\"A dot plot labeled \u201cIncome per Person (in 2011, international dollars),\u201d with the x-axis labeled in increments of 10000. The graph is significantly skewed to the left. Albania is marked at approximately 12000, the USA is marked at approximately 56000, Singapore is marked at approximately 91000, and Qatar is marked at approximately 113000.\" width=\"619\" height=\"99\" \/><\/div>\r\n<div class=\"textLayer\">Part A: Do any of the labeled countries have higher or lower income values than you expected? Explain.<\/div>\r\n<div class=\"ba-Layer ba-Layer--region\" data-resin-fileid=\"910624804412\" data-resin-iscurrent=\"true\" data-resin-feature=\"annotations\" data-testid=\"ba-Layer--region\">\r\n<div class=\"ba-RegionAnnotations-list is-listening\" data-resin-component=\"regionList\"><span style=\"font-size: 1em;\">Part B: Describe the shape of the distribution.<\/span><\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div class=\"ba-Layer ba-Layer--region\" data-resin-fileid=\"910624804412\" data-resin-iscurrent=\"true\" data-resin-feature=\"annotations\" data-testid=\"ba-Layer--region\">\r\n<div class=\"ba-RegionAnnotations-list is-listening\" data-resin-component=\"regionList\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 3<\/h3>\r\n<div class=\"ba-RegionAnnotations-list is-listening\" data-resin-component=\"regionList\"><span style=\"font-size: 1em;\">3) The following is a scatterplot (created using theDCMP Linear Regression tool) that visualizes each nation\u2019s life expectancy as predicted by its income per person. A linear model is fit to the data, with the least square regression equation shown. The fit has an \ud835\udc452value of 46.6%. The following scatterplot is a residual plot from this fit.<\/span><\/div>\r\n<div data-resin-component=\"regionList\"><img class=\"\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27012327\/Picture991-300x113.png\" alt=\"A scatterplot titled \u201cLife Expectancy &amp; Nations\u2019 Incomes.\u201d It is labeled \u201cIncome Per Person (international, 2011 dollars)\u201d on the x-axis and \u201cLife Expectancy\u201d on the y-axis. There is a line whose slope is given as \u201cRegression Line: y =68.5 + 0.000245x.\u201d\" width=\"696\" height=\"262\" \/><\/div>\r\n<div data-resin-component=\"regionList\"><img class=\"\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27012336\/Picture1002-300x94.png\" alt=\"A residual plot labeled \u201cIncome Per Person (international, 2011 dollars)\u201d on the x-axis and \u201cResidual\u201d on the y-axis. There is a horizontal line at y = 0. There are more points for lower x-values and both lower x-values and higher x-values are more likely to be below the horizontal line.\" width=\"696\" height=\"218\" \/><\/div>\r\n<div class=\"ba-RegionAnnotations-list is-listening\" data-resin-component=\"regionList\"><span style=\"font-size: 1em;\">Part A: Is the linear model a good fit for these data? Would you trust predictions or inferences made from a linearmodel? Justify your answer using the \ud835\udc452value, residual plot, and scatterplot.<\/span><\/div>\r\n<div class=\"ba-RegionAnnotations-list is-listening\" data-resin-component=\"regionList\"><span style=\"font-size: 1em;\">Part B: Why do you think the data take this shape in the scatterplot? Explain while referencing the dotplot in Question 2. Life Expectancy &amp; Nations\u2019 Incomes Residual Plot<\/span><\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"bp-page-3\" class=\"page\" data-page-number=\"3\" data-loaded=\"true\">\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 4<\/h3>\r\n<div class=\"textLayer\">4) To handle the right skew in the income data, let\u2019s try different transformations.<\/div>\r\n<div class=\"textLayer\">Part A: The following is a dotplot of incomes per person after taking the square root of all the values. Is the right skew reduced in severity? Is it still present? Explain.<\/div>\r\n<div><img src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27012343\/Picture1014-300x52.png\" alt=\"A dot plot titled \u201cSquare Root of Incomes per Person.\u201d There are more dots on the left than on the right, although it is less skewed than the data was in the previous dot pot.\" \/><\/div>\r\n<div class=\"textLayer\">Part B: The following is a dotplot of incomes per person after taking the base 10 logarithms of all the values. Is the right skew reducedin severity? Is it still present? Explain.<\/div>\r\n<div><img src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27012347\/Picture1022-300x37.png\" alt=\"A dot plot titled \u201cLog base ten of Nations\u2019 Incomes.\u201d The dots are spread relatively evenly across the dot plot.\" \/><\/div>\r\n<div class=\"textLayer\">Part C: Which transformation makes the data more symmetric? Explain.<\/div>\r\n<\/div>\r\n<\/div>\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 5<\/h3>\r\n<div class=\"textLayer\">5) The following is a scatterplot after taking the base 10 log transformations of the income values. A linear model is fit to the data (using log income in place of income on the x-axis), with the least square regression equation shown. The fit has an \ud835\udc452 value of 71.1%. The following scatterplot is a residual plot from this fit: Dotplot created at stapplet.com Dotplot created at stapplet.com Log10(Nations\u2019 Incomes)<\/div>\r\n<div><img src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27012353\/Picture1032-300x110.png\" alt=\"A scatterplot titled \u201cLife Expectancy &amp; Nations\u2019 Incomes.\u201d It is labeled \u201cBase-10 Logarithm of Income Per Person\u201c on the x-axis and \u201cLife Expectancy\u201d on the y-axis. There is a line whose slope is given as \u201cRegression Line: y = 27.7 + 11.3x.\u201d\" \/><\/div>\r\n<div><img src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27012403\/Picture1042-300x92.png\" alt=\"A residual plot labeled \u201cBase-10 Logarithm of Income Per Person\u201d on the x-axis and \u201cResidual\u201d on the y-axis. There is a horizontal line at y = 0. There does not appear to be a pattern to the points.\" \/><\/div>\r\n<div class=\"textLayer\">Part A: Did the transformation result in data for which a linear model provides abetter fit for the data? Explain your answer using the \ud835\udc452 value, residual plot, and scatterplot.<\/div>\r\n<div class=\"textLayer\">Part B: Using the previous scatterplot and the linear regression model, a statistician claims that, \u201cA nation with an income of about $4 per person has a predicted national life expectancy of about 73 years.\u201d Explain what\u2019s wrong with their statement and correct it.<\/div>\r\n<div class=\"textLayer\">Part C: Imagine that we instead looked at the national budget balance per person in every nation. In some nations, the budget balance is negative (more debts than revenue). In such a case, we can no longer use the log transformation. Explain. Life Expectancy &amp; Log (Nations\u2019 Incomes)Residual Plot<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"bp-page-5\" class=\"page\" data-page-number=\"5\" data-loaded=\"true\">\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 6<\/h3>\r\n<div id=\"bp-page-5\" class=\"page\" data-page-number=\"5\" data-loaded=\"true\">\r\n<div class=\"textLayer\">6) In addition to transformations that make right-skew distributions more symmetric, there are transformations that can make left-skew distributions more symmetric.High school GPAs tend to be distributed in a left-skew shape; most students get A\u2019s, B\u2019s, and C\u2019s in their classes, while fewerstudents consistently get lower grades (left tail). The followingis a dataset[footnote]OpenIntro. (n.d.). SAT and GPA data. https:\/\/www.openintro.org\/data\/index.php?data=satgpa[\/footnote] of 1,000 high school student GPAsvisualized as a histogram:<\/div>\r\n<div><img src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27012408\/Picture1051-300x66.png\" alt=\"A histogram labeled \u201cHigh School GPAs of 1000 Students\u201d on the x-axis and \u201cCount\u201d on the y-axis. For 1.8-2, the count is approximately 5. For 2-2.2, the count is approximately 25. For 2.2-2.4, the count is approximately 45. For 2.4-2.6, the count is approximately 75. For 2.6-2.8, the count is approximately 100. For 2.8-3, the count is approximately 90. For 3-3.2, the count is approximately 135. For 3.2-3.4, the count is approximately 115. For 3.4-3.6, the count is approximately 130. For 3.6-3.8, the count is approximately 110. For 3.8-4, the count is approximately 55. For 4-4.2, the count is approximately 120. For 4.4-4.6, the count is approximately 5.\" \/><\/div>\r\n<div class=\"textLayer\">Part A: To try to make this distribution more symmetric, let\u2019s first try to square all the values.The following graph shows the squared GPAs.Is the left skew reduced in severity? Is it still present? Explain.<\/div>\r\n<div><img src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27012412\/Picture1061-300x67.png\" alt=\"A histogram labeled \u201cSquared High School GPAs of 1000 Students\u201d on the x-axis and \u201cCount\u201d on the y-axis. For 2-4, the count is approximately 5. For 4-6, the count is approximately 80. For 6-8, the count is approximately 220. For 8-10, the count is approximately 170. For 10-12, the count is approximately 150. For 12-14, the count is approximately 160. For 14-16, the count is approximately 100. For 16-18, the count is approximately 120. For 20-22, the count is approximately 5.\" \/><\/div>\r\n<div class=\"textLayer\">Part B: The following graph shows the GPAs cubed. Is the left skew reduced in severity? Is it still present? Explain.<\/div>\r\n<div><img src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27012416\/Picture1072-300x68.png\" alt=\"A histogram labeled \u201cCubed High School GPAs of 1000 Students\u201d on the x-axis and \u201cCount\u201d on the y-axis. For 0-9, the count is approximately 20. For 9-18, the count is approximately 150. For 18-27, the count is approximately 170. For 27-36, the count is approximately 250. For 36-45, the count is approximately 140. For 45-54, the count is approximately 120. For 54-63, the count is approximately 60. For 63-72, the count is approximately 125. For 90-99, the count is approximately 5.\" \/><\/div>\r\n<\/div>\r\n<div id=\"bp-page-6\" class=\"page\" data-page-number=\"6\" data-loaded=\"true\">\r\n<div class=\"textLayer\">Part C: If your goal is to make the distribution symmetric, would you use the square or cube transformation of GPA values? Explain.<\/div>\r\n<div class=\"textLayer\">Part D: In some countries, GPA values can be negative. In such cases, a transformation that squares every data value wouldn\u2019t be appropriate. Explain.<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>","rendered":"<div id=\"bp-page-1\" class=\"page\" data-page-number=\"1\" data-loaded=\"true\">\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 1<\/h3>\n<div class=\"textLayer\">1) Choose one of the following options and explain how you made your choice. Data on incomes areusually&#8230;<\/div>\n<div class=\"textLayer\">a) Skewed right<\/div>\n<div class=\"textLayer\">b) Skewed left<\/div>\n<div class=\"textLayer\">c) Symmetric<\/div>\n<\/div>\n<\/div>\n<div><img loading=\"lazy\" decoding=\"async\" class=\"\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27012312\/Picture97-300x160.jpg\" alt=\"Figurines of golfers on tall stacks of coins next to shorter stacks of coins with figurines of construction workers on them.\" width=\"572\" height=\"305\" \/><\/div>\n<div class=\"textLayer\">The data for this in-class activity are from the Gapminder site on global development, which shows 2018 data on countries\u2019 income per person<a class=\"footnote\" title=\"Gapminder. (n.d.). GDP per capita in constant PPP dollars. http:\/\/gapm.io\/dgdppc\" id=\"return-footnote-5538-1\" href=\"#footnote-5538-1\" aria-label=\"Footnote 1\"><sup class=\"footnote\">[1]<\/sup><\/a> (in standardized dollar amounts) and life expectancy.<a class=\"footnote\" title=\"Gapminder. (n.d.). Life expectancy at birth. http:\/\/gapm.io\/ilexDotplot created atstapplet.comQatarUSASingaporeAlbaniaCredit: iStock\/hyejin kang\" id=\"return-footnote-5538-2\" href=\"#footnote-5538-2\" aria-label=\"Footnote 2\"><sup class=\"footnote\">[2]<\/sup><\/a> Each data point represents a different nation.<\/div>\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 2<\/h3>\n<div class=\"textLayer\">2) The following is a dotplot showing the income per person among all the nations in the dataset.<\/div>\n<div><img loading=\"lazy\" decoding=\"async\" class=\"\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27012319\/Picture981-300x48.png\" alt=\"A dot plot labeled \u201cIncome per Person (in 2011, international dollars),\u201d with the x-axis labeled in increments of 10000. The graph is significantly skewed to the left. Albania is marked at approximately 12000, the USA is marked at approximately 56000, Singapore is marked at approximately 91000, and Qatar is marked at approximately 113000.\" width=\"619\" height=\"99\" \/><\/div>\n<div class=\"textLayer\">Part A: Do any of the labeled countries have higher or lower income values than you expected? Explain.<\/div>\n<div class=\"ba-Layer ba-Layer--region\" data-resin-fileid=\"910624804412\" data-resin-iscurrent=\"true\" data-resin-feature=\"annotations\" data-testid=\"ba-Layer--region\">\n<div class=\"ba-RegionAnnotations-list is-listening\" data-resin-component=\"regionList\"><span style=\"font-size: 1em;\">Part B: Describe the shape of the distribution.<\/span><\/div>\n<\/div>\n<\/div>\n<\/div>\n<div class=\"ba-Layer ba-Layer--region\" data-resin-fileid=\"910624804412\" data-resin-iscurrent=\"true\" data-resin-feature=\"annotations\" data-testid=\"ba-Layer--region\">\n<div class=\"ba-RegionAnnotations-list is-listening\" data-resin-component=\"regionList\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 3<\/h3>\n<div class=\"ba-RegionAnnotations-list is-listening\" data-resin-component=\"regionList\"><span style=\"font-size: 1em;\">3) The following is a scatterplot (created using theDCMP Linear Regression tool) that visualizes each nation\u2019s life expectancy as predicted by its income per person. A linear model is fit to the data, with the least square regression equation shown. The fit has an \ud835\udc452value of 46.6%. The following scatterplot is a residual plot from this fit.<\/span><\/div>\n<div data-resin-component=\"regionList\"><img loading=\"lazy\" decoding=\"async\" class=\"\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27012327\/Picture991-300x113.png\" alt=\"A scatterplot titled \u201cLife Expectancy &amp; Nations\u2019 Incomes.\u201d It is labeled \u201cIncome Per Person (international, 2011 dollars)\u201d on the x-axis and \u201cLife Expectancy\u201d on the y-axis. There is a line whose slope is given as \u201cRegression Line: y =68.5 + 0.000245x.\u201d\" width=\"696\" height=\"262\" \/><\/div>\n<div data-resin-component=\"regionList\"><img loading=\"lazy\" decoding=\"async\" class=\"\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27012336\/Picture1002-300x94.png\" alt=\"A residual plot labeled \u201cIncome Per Person (international, 2011 dollars)\u201d on the x-axis and \u201cResidual\u201d on the y-axis. There is a horizontal line at y = 0. There are more points for lower x-values and both lower x-values and higher x-values are more likely to be below the horizontal line.\" width=\"696\" height=\"218\" \/><\/div>\n<div class=\"ba-RegionAnnotations-list is-listening\" data-resin-component=\"regionList\"><span style=\"font-size: 1em;\">Part A: Is the linear model a good fit for these data? Would you trust predictions or inferences made from a linearmodel? Justify your answer using the \ud835\udc452value, residual plot, and scatterplot.<\/span><\/div>\n<div class=\"ba-RegionAnnotations-list is-listening\" data-resin-component=\"regionList\"><span style=\"font-size: 1em;\">Part B: Why do you think the data take this shape in the scatterplot? Explain while referencing the dotplot in Question 2. Life Expectancy &amp; Nations\u2019 Incomes Residual Plot<\/span><\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"bp-page-3\" class=\"page\" data-page-number=\"3\" data-loaded=\"true\">\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 4<\/h3>\n<div class=\"textLayer\">4) To handle the right skew in the income data, let\u2019s try different transformations.<\/div>\n<div class=\"textLayer\">Part A: The following is a dotplot of incomes per person after taking the square root of all the values. Is the right skew reduced in severity? Is it still present? Explain.<\/div>\n<div><img decoding=\"async\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27012343\/Picture1014-300x52.png\" alt=\"A dot plot titled \u201cSquare Root of Incomes per Person.\u201d There are more dots on the left than on the right, although it is less skewed than the data was in the previous dot pot.\" \/><\/div>\n<div class=\"textLayer\">Part B: The following is a dotplot of incomes per person after taking the base 10 logarithms of all the values. Is the right skew reducedin severity? Is it still present? Explain.<\/div>\n<div><img decoding=\"async\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27012347\/Picture1022-300x37.png\" alt=\"A dot plot titled \u201cLog base ten of Nations\u2019 Incomes.\u201d The dots are spread relatively evenly across the dot plot.\" \/><\/div>\n<div class=\"textLayer\">Part C: Which transformation makes the data more symmetric? Explain.<\/div>\n<\/div>\n<\/div>\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 5<\/h3>\n<div class=\"textLayer\">5) The following is a scatterplot after taking the base 10 log transformations of the income values. A linear model is fit to the data (using log income in place of income on the x-axis), with the least square regression equation shown. The fit has an \ud835\udc452 value of 71.1%. The following scatterplot is a residual plot from this fit: Dotplot created at stapplet.com Dotplot created at stapplet.com Log10(Nations\u2019 Incomes)<\/div>\n<div><img decoding=\"async\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27012353\/Picture1032-300x110.png\" alt=\"A scatterplot titled \u201cLife Expectancy &amp; Nations\u2019 Incomes.\u201d It is labeled \u201cBase-10 Logarithm of Income Per Person\u201c on the x-axis and \u201cLife Expectancy\u201d on the y-axis. There is a line whose slope is given as \u201cRegression Line: y = 27.7 + 11.3x.\u201d\" \/><\/div>\n<div><img decoding=\"async\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27012403\/Picture1042-300x92.png\" alt=\"A residual plot labeled \u201cBase-10 Logarithm of Income Per Person\u201d on the x-axis and \u201cResidual\u201d on the y-axis. There is a horizontal line at y = 0. There does not appear to be a pattern to the points.\" \/><\/div>\n<div class=\"textLayer\">Part A: Did the transformation result in data for which a linear model provides abetter fit for the data? Explain your answer using the \ud835\udc452 value, residual plot, and scatterplot.<\/div>\n<div class=\"textLayer\">Part B: Using the previous scatterplot and the linear regression model, a statistician claims that, \u201cA nation with an income of about $4 per person has a predicted national life expectancy of about 73 years.\u201d Explain what\u2019s wrong with their statement and correct it.<\/div>\n<div class=\"textLayer\">Part C: Imagine that we instead looked at the national budget balance per person in every nation. In some nations, the budget balance is negative (more debts than revenue). In such a case, we can no longer use the log transformation. Explain. Life Expectancy &amp; Log (Nations\u2019 Incomes)Residual Plot<\/div>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"bp-page-5\" class=\"page\" data-page-number=\"5\" data-loaded=\"true\">\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 6<\/h3>\n<div id=\"bp-page-5\" class=\"page\" data-page-number=\"5\" data-loaded=\"true\">\n<div class=\"textLayer\">6) In addition to transformations that make right-skew distributions more symmetric, there are transformations that can make left-skew distributions more symmetric.High school GPAs tend to be distributed in a left-skew shape; most students get A\u2019s, B\u2019s, and C\u2019s in their classes, while fewerstudents consistently get lower grades (left tail). The followingis a dataset<a class=\"footnote\" title=\"OpenIntro. (n.d.). SAT and GPA data. https:\/\/www.openintro.org\/data\/index.php?data=satgpa\" id=\"return-footnote-5538-3\" href=\"#footnote-5538-3\" aria-label=\"Footnote 3\"><sup class=\"footnote\">[3]<\/sup><\/a> of 1,000 high school student GPAsvisualized as a histogram:<\/div>\n<div><img decoding=\"async\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27012408\/Picture1051-300x66.png\" alt=\"A histogram labeled \u201cHigh School GPAs of 1000 Students\u201d on the x-axis and \u201cCount\u201d on the y-axis. For 1.8-2, the count is approximately 5. For 2-2.2, the count is approximately 25. For 2.2-2.4, the count is approximately 45. For 2.4-2.6, the count is approximately 75. For 2.6-2.8, the count is approximately 100. For 2.8-3, the count is approximately 90. For 3-3.2, the count is approximately 135. For 3.2-3.4, the count is approximately 115. For 3.4-3.6, the count is approximately 130. For 3.6-3.8, the count is approximately 110. For 3.8-4, the count is approximately 55. For 4-4.2, the count is approximately 120. For 4.4-4.6, the count is approximately 5.\" \/><\/div>\n<div class=\"textLayer\">Part A: To try to make this distribution more symmetric, let\u2019s first try to square all the values.The following graph shows the squared GPAs.Is the left skew reduced in severity? Is it still present? Explain.<\/div>\n<div><img decoding=\"async\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27012412\/Picture1061-300x67.png\" alt=\"A histogram labeled \u201cSquared High School GPAs of 1000 Students\u201d on the x-axis and \u201cCount\u201d on the y-axis. For 2-4, the count is approximately 5. For 4-6, the count is approximately 80. For 6-8, the count is approximately 220. For 8-10, the count is approximately 170. For 10-12, the count is approximately 150. For 12-14, the count is approximately 160. For 14-16, the count is approximately 100. For 16-18, the count is approximately 120. For 20-22, the count is approximately 5.\" \/><\/div>\n<div class=\"textLayer\">Part B: The following graph shows the GPAs cubed. Is the left skew reduced in severity? Is it still present? Explain.<\/div>\n<div><img decoding=\"async\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27012416\/Picture1072-300x68.png\" alt=\"A histogram labeled \u201cCubed High School GPAs of 1000 Students\u201d on the x-axis and \u201cCount\u201d on the y-axis. For 0-9, the count is approximately 20. For 9-18, the count is approximately 150. For 18-27, the count is approximately 170. For 27-36, the count is approximately 250. For 36-45, the count is approximately 140. For 45-54, the count is approximately 120. For 54-63, the count is approximately 60. For 63-72, the count is approximately 125. For 90-99, the count is approximately 5.\" \/><\/div>\n<\/div>\n<div id=\"bp-page-6\" class=\"page\" data-page-number=\"6\" data-loaded=\"true\">\n<div class=\"textLayer\">Part C: If your goal is to make the distribution symmetric, would you use the square or cube transformation of GPA values? Explain.<\/div>\n<div class=\"textLayer\">Part D: In some countries, GPA values can be negative. In such cases, a transformation that squares every data value wouldn\u2019t be appropriate. Explain.<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<hr class=\"before-footnotes clear\" \/><div class=\"footnotes\"><ol><li id=\"footnote-5538-1\">Gapminder. (n.d.). GDP per capita in constant PPP dollars. http:\/\/gapm.io\/dgdppc <a href=\"#return-footnote-5538-1\" class=\"return-footnote\" aria-label=\"Return to footnote 1\">&crarr;<\/a><\/li><li id=\"footnote-5538-2\">Gapminder. (n.d.). Life expectancy at birth. http:\/\/gapm.io\/ilexDotplot created atstapplet.comQatarUSASingaporeAlbaniaCredit: iStock\/hyejin kang <a href=\"#return-footnote-5538-2\" class=\"return-footnote\" aria-label=\"Return to footnote 2\">&crarr;<\/a><\/li><li id=\"footnote-5538-3\">OpenIntro. (n.d.). SAT and GPA data. https:\/\/www.openintro.org\/data\/index.php?data=satgpa <a href=\"#return-footnote-5538-3\" class=\"return-footnote\" aria-label=\"Return to footnote 3\">&crarr;<\/a><\/li><\/ol><\/div>","protected":false},"author":23592,"menu_order":68,"template":"","meta":{"_candela_citation":"[]","CANDELA_OUTCOMES_GUID":"","pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-5538","chapter","type-chapter","status-publish","hentry"],"part":5514,"_links":{"self":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters\/5538","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/users\/23592"}],"version-history":[{"count":3,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters\/5538\/revisions"}],"predecessor-version":[{"id":5645,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters\/5538\/revisions\/5645"}],"part":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/parts\/5514"}],"metadata":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters\/5538\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/media?parent=5538"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapter-type?post=5538"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/contributor?post=5538"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/license?post=5538"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}