{"id":312,"date":"2021-07-14T15:59:12","date_gmt":"2021-07-14T15:59:12","guid":{"rendered":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/chapter\/answers-to-selected-exercises-3\/"},"modified":"2023-12-05T09:50:16","modified_gmt":"2023-12-05T09:50:16","slug":"answers-to-selected-exercises-3","status":"publish","type":"chapter","link":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/chapter\/answers-to-selected-exercises-3\/","title":{"raw":"Answers to Selected Exercises","rendered":"Answers to Selected Exercises"},"content":{"raw":"<h2>Linear Equations - Homework<\/h2>\r\n1.\u00a0dependent variable: fee amount; independent variable: time\r\n\r\n3.\r\n\r\n<section class=\"ui-body\">\r\n<figure id=\"fs-idm133416208\"><span id=\"eip-idp89831856\" data-type=\"media\" data-alt=\"This is a graph of the equation y = 25 + 12.50x. The x-axis is labeled in intervals of 1 from 0 - 7; the y-axis is labeled in intervals of 25 from 0 - 100. The equation's graph is a line that crosses the y-axis at 25 and is sloped up and to the right, rising 12.50 units for every one unit of run.\" data-display=\"block\"><img src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/132\/2016\/04\/21214829\/CNX_Stats_C012_M02_item001anno.jpg\" alt=\"This is a graph of the equation y = 25 + 12.50x. The x-axis is labeled in intervals of 1 from 0 - 7; the y-axis is labeled in intervals of 25 from 0 - 100. The equation's graph is a line that crosses the y-axis at 25 and is sloped up and to the right, rising 12.50 units for every one unit of run.\" width=\"380\" data-media-type=\"image\/jpeg\" \/><\/span><\/figure>\r\n5.\r\n\r\n<section class=\"ui-body\">\r\n<figure id=\"fs-idp120210480\"><span id=\"eip-idp116656032\" data-type=\"media\" data-alt=\"This is a graph of the equation y = 10 + 5x. The x-axis is labeled in intervals of 1 from 0 - 7; the y-axis is labeled in intervals of 10 from 0 - 50. The equation's graph is a line that crosses the y-axis at 10 and is sloped up and to the right, rising 5 units for every one unit of run.\" data-display=\"block\"><img src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/132\/2016\/04\/21214831\/CNX_Stats_C012_M02_item002anno.jpg\" alt=\"This is a graph of the equation y = 10 + 5x. The x-axis is labeled in intervals of 1 from 0 - 7; the y-axis is labeled in intervals of 10 from 0 - 50. The equation's graph is a line that crosses the y-axis at 10 and is sloped up and to the right, rising 5 units for every one unit of run.\" width=\"380\" data-media-type=\"image\/jpeg\" \/><\/span><\/figure>\r\n7.\u00a0<em data-effect=\"italics\">y<\/em> = 6<em data-effect=\"italics\">x<\/em> + 8, 4<em data-effect=\"italics\">y<\/em> = 8, and <em data-effect=\"italics\">y<\/em> + 7 = 3<em data-effect=\"italics\">x<\/em> are all linear equations.\r\n\r\n9.\u00a0The number of flu cases depends on the year. Therefore, year becomes the independent variable and the number of flu cases is the dependent variable.\r\n\r\n11.\u00a0The <em data-effect=\"italics\">y<\/em>-intercept is 50 (<em data-effect=\"italics\">a<\/em> = 50). At the start of the cleaning, the company charges a one-time fee of $50 (this is when <em data-effect=\"italics\">x<\/em> = 0). The slope is 100 (<em data-effect=\"italics\">b<\/em> = 100). For each session, the company charges $100 for each hour they clean.\r\n\r\n13.\u00a012,000 pounds of soil\r\n\r\n15.\u00a0The slope is \u20131.5 (<em data-effect=\"italics\">b<\/em> = \u20131.5). This means the stock is losing value at a rate of $1.50 per hour. The <em data-effect=\"italics\">y<\/em>-intercept is $15 (<em data-effect=\"italics\">a<\/em> = 15). This means the price of the stock before the trading day was $15.\r\n<h2 data-type=\"solution\">Scatter Plots \u2013 Practice<\/h2>\r\n<\/section><\/section>\r\n<div class=\"problem\" data-type=\"problem\">\r\n\r\n17.\u00a0The data appear to be linear with a strong, positive correlation.\r\n\r\n19.\u00a0The data appear to have no correlation.\r\n\r\n<\/div>\r\n<div data-type=\"problem\">\r\n<h2 data-type=\"solution\">The Regression Equation \u2013 Practice<\/h2>\r\n21.\u00a0<em data-effect=\"italics\">\u0177<\/em> = 2.23 + 1.99<em data-effect=\"italics\">x<\/em>\r\n\r\n23.\u00a0The slope is 1.99 (<em data-effect=\"italics\">b<\/em> = 1.99). It means that for every endorsement deal a professional player gets, he gets an average of another $1.99 million in pay each year.\r\n\r\n25.\u00a0It means that there is no correlation between the data sets.\r\n\r\n27.\u00a0Yes, there are enough data points and the value of r is strong enough to show that there is a strong negative correlation between the data sets.\r\n<h2 data-type=\"solution\">Testing the Significance of the Correlation Coefficient \u2013 Practice<\/h2>\r\n<div data-type=\"solution\">\r\n<div class=\"exercise\" data-type=\"exercise\"><section class=\" focusable\" tabindex=\"-1\">\r\n<div id=\"eip-226\" class=\"problem\" data-type=\"problem\"><\/div>\r\n<\/section><\/div>\r\n<\/div>\r\n<\/div>\r\n<div data-type=\"problem\">29.\u00a0<em data-effect=\"italics\">H<sub>a<\/sub><\/em>: <em data-effect=\"italics\">\u03c1<\/em> \u2260 0<\/div>\r\n<div data-type=\"problem\"><\/div>\r\n<div data-type=\"problem\">\r\n<h2 data-type=\"solution\">Prediction \u2013 Practice<\/h2>\r\n<\/div>\r\n<div data-type=\"solution\"><section class=\"ui-body\">\r\n<div class=\"exercise\" data-type=\"exercise\">\r\n\r\n31.\u00a0$250,120\r\n\r\n33.\u00a01,326 acres\r\n\r\n35.\u00a01,125 hours, or when <em data-effect=\"italics\">x<\/em> = 1,125\r\n\r\n37. Check student's solution.\r\n\r\n39.\r\n<div id=\"eip-idp156589440\" data-type=\"list\" data-list-type=\"enumerated\" data-number-style=\"lower-alpha\">\r\n<ol>\r\n \t<li data-type=\"item\">When <em data-effect=\"italics\">x<\/em> = 1985, <em data-effect=\"italics\">\u0177<\/em> = 25,52<\/li>\r\n \t<li data-type=\"item\">When <em data-effect=\"italics\">x<\/em> = 1990, <em data-effect=\"italics\">\u0177<\/em> = 34,275<\/li>\r\n \t<li data-type=\"item\">When <em data-effect=\"italics\">x<\/em> = 1970, <em data-effect=\"italics\">\u0177<\/em> = \u2013725 Why doesn\u2019t this answer make sense? The range of <em data-effect=\"italics\">x<\/em> values was 1981 to 2002; the year 1970 is not in this range. The regression equation does not apply, because predicting for the year 1970 is extrapolation, which requires a different process. Also, a negative number does not make sense in this context, where we are predicting flu cases diagnosed.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<div data-type=\"item\">41.\u00a0Also, the correlation <em data-effect=\"italics\">r<\/em> = 0.4526. If <em data-effect=\"italics\">r<\/em> is compared to the value in the 95% Critical Values of the Sample Correlation Coefficient Table, because <em data-effect=\"italics\">r<\/em> &gt; 0.423, <em data-effect=\"italics\">r<\/em> is significant, and you would think that the line could be used for prediction. But the scatter plot indicates otherwise.<\/div>\r\n<\/div>\r\n<div data-type=\"item\"><\/div>\r\n<div data-type=\"item\">43.\u00a0<span class=\"MathJax\"><span id=\"MathJax-Span-491\" class=\"math\"><span id=\"MathJax-Span-492\" class=\"mrow\"><span id=\"MathJax-Span-493\" class=\"semantics\"><span id=\"MathJax-Span-494\" class=\"mrow\"><span id=\"MathJax-Span-495\" class=\"mover\"><span id=\"MathJax-Span-496\" class=\"mi\">y<\/span><span id=\"MathJax-Span-497\" class=\"mo\">\u02c6<\/span><\/span><\/span><\/span><\/span><\/span><\/span> = 3,448,225 + 1750<em data-effect=\"italics\">x<\/em><\/div>\r\n<\/section>\r\n<div data-type=\"item\"><\/div>\r\n<div data-type=\"item\">45.\u00a0There was an increase in flu cases diagnosed until 1993. From 1993 through 2002, the number of flu cases diagnosed declined each year. It is not appropriate to use a linear regression line to fit the data.<\/div>\r\n<div data-type=\"item\"><\/div>\r\n<div data-type=\"item\">47.\u00a0Since there is no linear association between year and # of flu cases diagnosed, it is not appropriate to calculate a linear correlation coefficient. When there is a linear association and it is appropriate to calculate a correlation, we cannot say that one variable \u201ccauses\u201d the other variable.<\/div>\r\n<div data-type=\"item\"><\/div>\r\n<div data-type=\"item\">\r\n\r\n49. We don\u2019t know if the pre-1981 data was collected from a single year. So we don\u2019t have an accurate <em data-effect=\"italics\">x<\/em> value for this figure.\r\n<p id=\"fs-idm126049344\">Regression equation: <em data-effect=\"italics\">\u0177<\/em> (#Flu Cases) = \u20133,448,225 + 1749.777 (year)<\/p>\r\n\r\n<table id=\"fs-idm85288224\" summary=\"..\">\r\n<thead>\r\n<tr>\r\n<th>Coefficients<\/th>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td>Intercept<\/td>\r\n<td>\u20133,448,225<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><em data-effect=\"italics\">X<\/em> Variable 1<\/td>\r\n<td>1,749.777\r\n\r\n&nbsp;<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<h2 data-type=\"solution\">Outliers \u2013 Practice<\/h2>\r\n<\/div>\r\n<div data-type=\"solution\">\r\n\r\n51.\u00a0Yes, there appears to be an outlier at (6, 58).\r\n\r\n53.\u00a0The potential outlier flattened the slope of the line of best fit because it was below the data set. It made the line of best fit less accurate as a predictor for the data.\r\n\r\n55.\u00a0<em data-effect=\"italics\">s<\/em> = 1.75\r\n<div data-type=\"solution\"><section id=\"fs-idm86899200\" class=\"free-response focusable\" tabindex=\"-1\" data-depth=\"1\">\r\n<div id=\"element-957\" class=\"exercise\" data-type=\"exercise\"><section class=\" focusable\" tabindex=\"-1\">\r\n<div id=\"id44886916\" class=\"problem\" data-type=\"problem\">\r\n<h2>Linear Equations \u2013 Homework<\/h2>\r\n<\/div>\r\n<\/section><\/div>\r\n<\/section><\/div>\r\n57.\r\n\r\n<\/div>\r\n<div data-type=\"solution\">\r\n<ol>\r\n \t<li>independent variable: age; dependent variable: fatalities<\/li>\r\n \t<li>independent variable: # of family members; dependent variable: grocery bill<\/li>\r\n \t<li>independent variable: age of applicant; dependent variable: insurance premium<\/li>\r\n \t<li>independent variable: power consumption; dependent variable: utility<\/li>\r\n \t<li>independent variable: higher education (years); dependent variable: crime rates<\/li>\r\n<\/ol>\r\n<\/div>\r\n<div data-type=\"problem\">\r\n<h2>Scatter Plots \u2013 Homework<\/h2>\r\n59. Check student's solution.\r\n\r\n<\/div>\r\n<section class=\"ui-body\"><section class=\"ui-body\"><section class=\"ui-body\">61.\u00a0For graph: check student\u2019s solution. Note that tuition is the independent variable and salary is the dependent variable.\r\n<div class=\"exercise\" data-type=\"exercise\">63. 13<\/div>\r\n<\/section><\/section>\r\n<div class=\"exercise\" data-type=\"exercise\">\r\n<h2>The Regression Equation - Homework<\/h2>\r\n65.\u00a0It means that 72% of the variation in the dependent variable (<em data-effect=\"italics\">y<\/em>) can be explained by the variation in the independent variable (<em data-effect=\"italics\">x<\/em>).\r\n<h2>Prediction - Homework<\/h2>\r\n<\/div>\r\n<\/section>67.\r\n<ol>\r\n \t<li>\r\n<table id=\"id108047k45\" summary=\"This table presents the age groups in the first column and the number of driver deaths per 100,000 in the second column.\">\r\n<thead>\r\n<tr>\r\n<th>Age<\/th>\r\n<th>Number of Driver Deaths per 100,000<\/th>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td>16\u201319<\/td>\r\n<td data-align=\"center\">38<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>20\u201324<\/td>\r\n<td data-align=\"center\">36<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>25\u201334<\/td>\r\n<td data-align=\"center\">24<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>35\u201354<\/td>\r\n<td data-align=\"center\">20<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>55\u201374<\/td>\r\n<td data-align=\"center\">18<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>75+<\/td>\r\n<td data-align=\"center\">28<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/li>\r\n \t<li>Check student\u2019s solution.<\/li>\r\n \t<li><em data-effect=\"italics\">\u0177<\/em> = 35.5818045 \u2013 0.19182491<em data-effect=\"italics\">x<\/em><\/li>\r\n \t<li><em data-effect=\"italics\">r<\/em> = \u20130.57874\r\n<div data-type=\"newline\"><\/div>\r\nFor four <em data-effect=\"italics\">df<\/em> and alpha = 0.05, the LinRegTTest gives <em data-effect=\"italics\">p<\/em>-value = 0.2288 so we do not reject the null hypothesis; there is not a significant linear relationship between deaths and age.\r\n\r\nUsing the table of critical values for the correlation coefficient, with four <em data-effect=\"italics\">df<\/em>, the critical value is 0.811. The correlation coefficient <em data-effect=\"italics\">r<\/em> = \u20130.57874 is not less than \u20130.811, so we do not reject the null hypothesis.<\/li>\r\n \t<li>There is not a linear relationship between the two variables, as evidenced by a p-value greater than 0.05.<\/li>\r\n<\/ol>\r\n69.\r\n<div class=\"exercise\" data-type=\"exercise\"><section>\r\n<div class=\"solution ui-solution-visible\" data-type=\"solution\"><section class=\"ui-body\">\r\n<ol id=\"fs-idp21667344\" data-number-style=\"lower-alpha\">\r\n \t<li>We wonder if the better discounts appear earlier in the book so we select page as <em data-effect=\"italics\">X<\/em> and discount as <em data-effect=\"italics\">Y<\/em>.<\/li>\r\n \t<li>Check student\u2019s solution.<\/li>\r\n \t<li><em data-effect=\"italics\">\u0177<\/em> = 17.21757 \u2013 0.01412<em data-effect=\"italics\">x<\/em><\/li>\r\n \t<li><em data-effect=\"italics\">r<\/em> = \u2013 0.2752\r\n<div data-type=\"newline\"><\/div>\r\nFor seven <em data-effect=\"italics\">df<\/em> and alpha = 0.05, using LinRegTTest <em data-effect=\"italics\">p<\/em>-value = 0.4736 so we do not reject; there is a not a significant linear relationship between page and discount.\r\n\r\nUsing the table of critical values for the correlation coefficient, with seven <em data-effect=\"italics\">df<\/em>, the critical value is 0.666. The correlation coefficient <em data-effect=\"italics\">xi<\/em> = \u20130.2752 is not less than 0.666 so we do not reject it.<\/li>\r\n \t<li>There is not a significant linear correlation so it appears there is no relationship between the page and the amount of the discount.<\/li>\r\n<\/ol>\r\n<p id=\"fs-idp70584384\">As the page number increases by one page, the discount decreases by $0.01412<\/p>\r\n\r\n<\/section><\/div>\r\n<\/section><\/div>\r\n<div class=\"exercise\" data-type=\"exercise\">\r\n\r\n71.\r\n\r\n<section class=\"ui-body\">\r\n<ol id=\"fs-idp34721504\" data-number-style=\"lower-alpha\">\r\n \t<li>Year is the independent or <em data-effect=\"italics\">x<\/em> variable; the number of letters is the dependent or <em data-effect=\"italics\">y<\/em> variable.<\/li>\r\n \t<li>Check student\u2019s solution.<\/li>\r\n \t<li>no<\/li>\r\n \t<li><em data-effect=\"italics\">\u0177<\/em> = 47.03 \u2013 0.0216<em data-effect=\"italics\">x<\/em><\/li>\r\n \t<li>\u20130.4280<\/li>\r\n \t<li>6; 5<\/li>\r\n \t<li>No, the relationship does not appear to be linear; the correlation is not significant.<\/li>\r\n \t<li>current year: 2013: 3.55 or four letters; this is not an appropriate use of the least-squares line. It is extrapolation.<\/li>\r\n<\/ol>\r\n<h2>Outliers - Homework<\/h2>\r\n<\/section>73.\r\n\r\n<section class=\"ui-body\">\r\n<p id=\"fs-idp172680496\">a. and b. Check student\u2019s solution.<\/p>\r\n<p id=\"fs-idp172680880\">c. The slope of the regression line is -0.3179 with a y-intercept of 32.966. In context, the y-intercept indicates that when there are no returning sparrow hawks, there will be almost 31% new sparrow hawks, which doesn\u2019t make sense since if there are no returning birds, then the new percentage would have to be 100% (this is an example of why we do not extrapolate). The slope tells us that for each percentage increase in returning birds, the percentage of new birds in the colony decreases by 0.3179%.<\/p>\r\n<p id=\"fs-idp36047136\">d. If we examine r2, we see that only 50.238% of the variation in the percent of new birds is explained by the model, and the correlation coefficient, r = 0.71, only indicates a somewhat strong correlation between returning and new percentages.<\/p>\r\n<p id=\"fs-idp137409168\">e. The ordered pair (66, 6) generates the largest residual of 6.0. This means that when the observed return percentage is 66%, our observed new percentage, 6%, is almost 6% less than the predicted new value of 11.98%. If we remove this data pair, we see only an adjusted slope of -0.2723 and an adjusted intercept of 30.606. In other words, even though this data generates the largest residual, it is not an outlier, nor is the data pair an influential point.<\/p>\r\n<p id=\"fs-idp89140080\">f. If there are 70% returning birds, we would expect to see y = -0.2723(70) + 30.606 = 0.115 or 11.5% new birds in the colony.<\/p>\r\n\r\n<\/section>75.\r\n\r\n<section class=\"ui-body\">\r\n<ol id=\"eip-idm38243616\" data-number-style=\"lower-alpha\">\r\n \t<li>Check student\u2019s solution.<\/li>\r\n \t<li>Check student\u2019s solution.<\/li>\r\n \t<li>We have a slope of \u20131.4946 with a <em data-effect=\"italics\">y<\/em>-intercept of 193.88. The slope, in context, indicates that for each additional minute added to the swim time, the heart rate will decrease by 1.5 beats per minute. If the student is not swimming at all, the <em data-effect=\"italics\">y<\/em>-intercept indicates that his heart rate will be 193.88 beats per minute. While the slope has meaning (the longer it takes to swim 2,000 meters, the less effort the heart puts out), the <em data-effect=\"italics\">y<\/em>-intercept does not make sense. If the athlete is not swimming (resting), then his heart rate should be very low.<\/li>\r\n \t<li>Since only 1.5% of the heart rate variation is explained by this regression equation, we must conclude that this association is not explained by a linear relationship.<\/li>\r\n \t<li>The point (34.72, 124) generates the largest residual of \u201311.82. This means that our observed heart rate is almost 12 beats less than our predicted rate of 136 beats per minute. When this point is removed, the slope becomes 1.6914 with the y-intercept changing to 83.694. While the linear association is still very weak, we see that the removed data pair can be considered an influential point in the sense that the <em data-effect=\"italics\">y<\/em>-intercept becomes more meaningful.<\/li>\r\n<\/ol>\r\n77.\u00a0If we remove the two service academies (the tuition is $0.00), we construct a new regression equation of <em data-effect=\"italics\">y<\/em> = \u20130.0009x + 160 with a correlation coefficient of 0.71397 and a coefficient of determination of 0.50976. This allows us to say there is a fairly strong linear association between tuition costs and salaries if the service academies are removed from the data set.\r\n\r\n<\/section>79.\r\n\r\n<section class=\"ui-body\">\r\n<ol data-number-style=\"lower-alpha\">\r\n \t<li>Check student's solution.<\/li>\r\n \t<li>yes<\/li>\r\n \t<li><em data-effect=\"italics\">\u0177<\/em> = \u2212266.8863+0.1656x<\/li>\r\n \t<li>0.9448; Yes<\/li>\r\n \t<li>62.8233; 62.3265<\/li>\r\n \t<li>yes<\/li>\r\n \t<li>yes; (1987, 62.7)<\/li>\r\n \t<li>72.5937; no<\/li>\r\n \t<li>slope = 0.1656.\r\n<div data-type=\"newline\"><\/div>\r\nAs the year increases by one, the percent of workers paid hourly rates tends to increase by 0.1656.<\/li>\r\n<\/ol>\r\n81.\r\n<table id=\"eip-idm171664512\" summary=\"..\">\r\n<thead>\r\n<tr>\r\n<th>Size (ounces)<\/th>\r\n<th>Cost ($)<\/th>\r\n<th>cents\/oz<\/th>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td>16<\/td>\r\n<td>3.99<\/td>\r\n<td>24.94<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>32<\/td>\r\n<td>4.99<\/td>\r\n<td>15.59<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>64<\/td>\r\n<td>5.99<\/td>\r\n<td>9.36<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>200<\/td>\r\n<td>10.99<\/td>\r\n<td>5.50<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\nCheck student\u2019s solution.\r\nThere is a linear relationship for the sizes 16 through 64, but that linear trend does not continue to the 200-oz size.\r\n<em data-effect=\"italics\">\u0177<\/em> = 20.2368 \u2013 0.0819x\r\n<em data-effect=\"italics\">r<\/em> = \u20130.8086\r\n40-oz: 16.96 cents\/oz\r\n90-oz: 12.87 cents\/oz\r\nThe relationship is not linear; the least-squares line is not appropriate.\r\nno outliers\r\nNo, you would be extrapolating. The 300-oz size is outside the range of <em data-effect=\"italics\">x<\/em>.\r\nslope = \u20130.08194; for each additional ounce in size, the cost per ounce decreases by 0.082 cents.\r\n\r\n<\/section>83.\r\n\r\n<section class=\"ui-body\">\r\n<ol id=\"fs-idm47882016\" data-number-style=\"lower-alpha\">\r\n \t<li>Size is <em data-effect=\"italics\">x<\/em>, the independent variable, price is <em data-effect=\"italics\">y<\/em>, the dependent variable.<\/li>\r\n \t<li>Check student\u2019s solution.<\/li>\r\n \t<li>The relationship does not appear to be linear.<\/li>\r\n \t<li><em data-effect=\"italics\">\u0177<\/em> = \u2013745.252 + 54.75569<em data-effect=\"italics\">x<\/em><\/li>\r\n \t<li><em data-effect=\"italics\">r<\/em> = 0.8944, yes it is significant<\/li>\r\n \t<li>32-inch: $1006.93, 50-inch: $1992.53<\/li>\r\n \t<li>No, the relationship does not appear to be linear. However, <em data-effect=\"italics\">r<\/em> is significant.<\/li>\r\n \t<li>yes, the 60-inch TV<\/li>\r\n \t<li>For each additional inch, the price increases by $54.76<\/li>\r\n<\/ol>\r\n85.\r\n\r\n<section class=\"ui-body\">\r\n<ol id=\"fs-idp131482224\" data-number-style=\"lower-alpha\">\r\n \t<li>Let rank be the independent variable and area be the dependent variable.<\/li>\r\n \t<li>Check student\u2019s solution.<\/li>\r\n \t<li>There appears to be a linear relationship, with one outlier.<\/li>\r\n \t<li><em data-effect=\"italics\">\u0177<\/em> (area) = 24177.06 + 1010.478<em data-effect=\"italics\">x<\/em><\/li>\r\n \t<li><em data-effect=\"italics\">r<\/em> = 0.50047, <em data-effect=\"italics\">r<\/em> is not significant so there is no relationship between the variables.<\/li>\r\n \t<li>Alabama: 46407.576 Colorado: 62575.224<\/li>\r\n \t<li>Alabama's estimate is closer than Colorado's estimate.<\/li>\r\n \t<li>If the outlier is removed, there is a linear relationship.<\/li>\r\n \t<li>There is one outlier (Hawaii).<\/li>\r\n \t<li>rank 51: 75711.4; no<\/li>\r\n \t<li>\r\n<table id=\"fs-idp169197456\" summary=\"..\"><caption>\u00a0<\/caption>\r\n<thead>\r\n<tr>\r\n<th>Alabama<\/th>\r\n<th>7<\/th>\r\n<th>1819<\/th>\r\n<th>22<\/th>\r\n<th>52,423<\/th>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td>Colorado<\/td>\r\n<td>8<\/td>\r\n<td>1876<\/td>\r\n<td>38<\/td>\r\n<td>104,100<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Alaska<\/td>\r\n<td>6<\/td>\r\n<td>1959<\/td>\r\n<td>51<\/td>\r\n<td>656,424<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Iowa<\/td>\r\n<td>4<\/td>\r\n<td>1846<\/td>\r\n<td>29<\/td>\r\n<td>56,276<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Maryland<\/td>\r\n<td>8<\/td>\r\n<td>1788<\/td>\r\n<td>7<\/td>\r\n<td>12,407<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Missouri<\/td>\r\n<td>8<\/td>\r\n<td>1821<\/td>\r\n<td>24<\/td>\r\n<td>69,709<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>New Jersey<\/td>\r\n<td>9<\/td>\r\n<td>1787<\/td>\r\n<td>3<\/td>\r\n<td>8,722<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Ohio<\/td>\r\n<td>4<\/td>\r\n<td>1803<\/td>\r\n<td>17<\/td>\r\n<td>44,828<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>South Carolina<\/td>\r\n<td>13<\/td>\r\n<td>1788<\/td>\r\n<td>8<\/td>\r\n<td>32,008<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Utah<\/td>\r\n<td>4<\/td>\r\n<td>1896<\/td>\r\n<td>45<\/td>\r\n<td>84,904<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Wisconsin<\/td>\r\n<td>9<\/td>\r\n<td>1848<\/td>\r\n<td>30<\/td>\r\n<td>65,499<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/li>\r\n \t<li><em data-effect=\"italics\">\u0177<\/em> = \u201387065.3 + 7828.532x<\/li>\r\n \t<li>Alabama: 85,162.404; the prior estimate was closer. Alaska is an outlier.<\/li>\r\n \t<li>yes, with the exception of Hawaii<\/li>\r\n<\/ol>\r\n<\/section><\/section><\/div>","rendered":"<h2>Linear Equations &#8211; Homework<\/h2>\n<p>1.\u00a0dependent variable: fee amount; independent variable: time<\/p>\n<p>3.<\/p>\n<section class=\"ui-body\">\n<figure id=\"fs-idm133416208\"><span id=\"eip-idp89831856\" data-type=\"media\" data-alt=\"This is a graph of the equation y = 25 + 12.50x. The x-axis is labeled in intervals of 1 from 0 - 7; the y-axis is labeled in intervals of 25 from 0 - 100. The equation's graph is a line that crosses the y-axis at 25 and is sloped up and to the right, rising 12.50 units for every one unit of run.\" data-display=\"block\"><img decoding=\"async\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/132\/2016\/04\/21214829\/CNX_Stats_C012_M02_item001anno.jpg\" alt=\"This is a graph of the equation y = 25 + 12.50x. The x-axis is labeled in intervals of 1 from 0 - 7; the y-axis is labeled in intervals of 25 from 0 - 100. The equation's graph is a line that crosses the y-axis at 25 and is sloped up and to the right, rising 12.50 units for every one unit of run.\" width=\"380\" data-media-type=\"image\/jpeg\" \/><\/span><\/figure>\n<p>5.<\/p>\n<section class=\"ui-body\">\n<figure id=\"fs-idp120210480\"><span id=\"eip-idp116656032\" data-type=\"media\" data-alt=\"This is a graph of the equation y = 10 + 5x. The x-axis is labeled in intervals of 1 from 0 - 7; the y-axis is labeled in intervals of 10 from 0 - 50. The equation's graph is a line that crosses the y-axis at 10 and is sloped up and to the right, rising 5 units for every one unit of run.\" data-display=\"block\"><img decoding=\"async\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/132\/2016\/04\/21214831\/CNX_Stats_C012_M02_item002anno.jpg\" alt=\"This is a graph of the equation y = 10 + 5x. The x-axis is labeled in intervals of 1 from 0 - 7; the y-axis is labeled in intervals of 10 from 0 - 50. The equation's graph is a line that crosses the y-axis at 10 and is sloped up and to the right, rising 5 units for every one unit of run.\" width=\"380\" data-media-type=\"image\/jpeg\" \/><\/span><\/figure>\n<p>7.\u00a0<em data-effect=\"italics\">y<\/em> = 6<em data-effect=\"italics\">x<\/em> + 8, 4<em data-effect=\"italics\">y<\/em> = 8, and <em data-effect=\"italics\">y<\/em> + 7 = 3<em data-effect=\"italics\">x<\/em> are all linear equations.<\/p>\n<p>9.\u00a0The number of flu cases depends on the year. Therefore, year becomes the independent variable and the number of flu cases is the dependent variable.<\/p>\n<p>11.\u00a0The <em data-effect=\"italics\">y<\/em>-intercept is 50 (<em data-effect=\"italics\">a<\/em> = 50). At the start of the cleaning, the company charges a one-time fee of $50 (this is when <em data-effect=\"italics\">x<\/em> = 0). The slope is 100 (<em data-effect=\"italics\">b<\/em> = 100). For each session, the company charges $100 for each hour they clean.<\/p>\n<p>13.\u00a012,000 pounds of soil<\/p>\n<p>15.\u00a0The slope is \u20131.5 (<em data-effect=\"italics\">b<\/em> = \u20131.5). This means the stock is losing value at a rate of $1.50 per hour. The <em data-effect=\"italics\">y<\/em>-intercept is $15 (<em data-effect=\"italics\">a<\/em> = 15). This means the price of the stock before the trading day was $15.<\/p>\n<h2 data-type=\"solution\">Scatter Plots \u2013 Practice<\/h2>\n<\/section>\n<\/section>\n<div class=\"problem\" data-type=\"problem\">\n<p>17.\u00a0The data appear to be linear with a strong, positive correlation.<\/p>\n<p>19.\u00a0The data appear to have no correlation.<\/p>\n<\/div>\n<div data-type=\"problem\">\n<h2 data-type=\"solution\">The Regression Equation \u2013 Practice<\/h2>\n<p>21.\u00a0<em data-effect=\"italics\">\u0177<\/em> = 2.23 + 1.99<em data-effect=\"italics\">x<\/em><\/p>\n<p>23.\u00a0The slope is 1.99 (<em data-effect=\"italics\">b<\/em> = 1.99). It means that for every endorsement deal a professional player gets, he gets an average of another $1.99 million in pay each year.<\/p>\n<p>25.\u00a0It means that there is no correlation between the data sets.<\/p>\n<p>27.\u00a0Yes, there are enough data points and the value of r is strong enough to show that there is a strong negative correlation between the data sets.<\/p>\n<h2 data-type=\"solution\">Testing the Significance of the Correlation Coefficient \u2013 Practice<\/h2>\n<div data-type=\"solution\">\n<div class=\"exercise\" data-type=\"exercise\">\n<section class=\"focusable\" tabindex=\"-1\">\n<div id=\"eip-226\" class=\"problem\" data-type=\"problem\"><\/div>\n<\/section>\n<\/div>\n<\/div>\n<\/div>\n<div data-type=\"problem\">29.\u00a0<em data-effect=\"italics\">H<sub>a<\/sub><\/em>: <em data-effect=\"italics\">\u03c1<\/em> \u2260 0<\/div>\n<div data-type=\"problem\"><\/div>\n<div data-type=\"problem\">\n<h2 data-type=\"solution\">Prediction \u2013 Practice<\/h2>\n<\/div>\n<div data-type=\"solution\">\n<section class=\"ui-body\">\n<div class=\"exercise\" data-type=\"exercise\">\n<p>31.\u00a0$250,120<\/p>\n<p>33.\u00a01,326 acres<\/p>\n<p>35.\u00a01,125 hours, or when <em data-effect=\"italics\">x<\/em> = 1,125<\/p>\n<p>37. Check student&#8217;s solution.<\/p>\n<p>39.<\/p>\n<div id=\"eip-idp156589440\" data-type=\"list\" data-list-type=\"enumerated\" data-number-style=\"lower-alpha\">\n<ol>\n<li data-type=\"item\">When <em data-effect=\"italics\">x<\/em> = 1985, <em data-effect=\"italics\">\u0177<\/em> = 25,52<\/li>\n<li data-type=\"item\">When <em data-effect=\"italics\">x<\/em> = 1990, <em data-effect=\"italics\">\u0177<\/em> = 34,275<\/li>\n<li data-type=\"item\">When <em data-effect=\"italics\">x<\/em> = 1970, <em data-effect=\"italics\">\u0177<\/em> = \u2013725 Why doesn\u2019t this answer make sense? The range of <em data-effect=\"italics\">x<\/em> values was 1981 to 2002; the year 1970 is not in this range. The regression equation does not apply, because predicting for the year 1970 is extrapolation, which requires a different process. Also, a negative number does not make sense in this context, where we are predicting flu cases diagnosed.<\/li>\n<\/ol>\n<\/div>\n<div data-type=\"item\">41.\u00a0Also, the correlation <em data-effect=\"italics\">r<\/em> = 0.4526. If <em data-effect=\"italics\">r<\/em> is compared to the value in the 95% Critical Values of the Sample Correlation Coefficient Table, because <em data-effect=\"italics\">r<\/em> &gt; 0.423, <em data-effect=\"italics\">r<\/em> is significant, and you would think that the line could be used for prediction. But the scatter plot indicates otherwise.<\/div>\n<\/div>\n<div data-type=\"item\"><\/div>\n<div data-type=\"item\">43.\u00a0<span class=\"MathJax\"><span id=\"MathJax-Span-491\" class=\"math\"><span id=\"MathJax-Span-492\" class=\"mrow\"><span id=\"MathJax-Span-493\" class=\"semantics\"><span id=\"MathJax-Span-494\" class=\"mrow\"><span id=\"MathJax-Span-495\" class=\"mover\"><span id=\"MathJax-Span-496\" class=\"mi\">y<\/span><span id=\"MathJax-Span-497\" class=\"mo\">\u02c6<\/span><\/span><\/span><\/span><\/span><\/span><\/span> = 3,448,225 + 1750<em data-effect=\"italics\">x<\/em><\/div>\n<\/section>\n<div data-type=\"item\"><\/div>\n<div data-type=\"item\">45.\u00a0There was an increase in flu cases diagnosed until 1993. From 1993 through 2002, the number of flu cases diagnosed declined each year. It is not appropriate to use a linear regression line to fit the data.<\/div>\n<div data-type=\"item\"><\/div>\n<div data-type=\"item\">47.\u00a0Since there is no linear association between year and # of flu cases diagnosed, it is not appropriate to calculate a linear correlation coefficient. When there is a linear association and it is appropriate to calculate a correlation, we cannot say that one variable \u201ccauses\u201d the other variable.<\/div>\n<div data-type=\"item\"><\/div>\n<div data-type=\"item\">\n<p>49. We don\u2019t know if the pre-1981 data was collected from a single year. So we don\u2019t have an accurate <em data-effect=\"italics\">x<\/em> value for this figure.<\/p>\n<p id=\"fs-idm126049344\">Regression equation: <em data-effect=\"italics\">\u0177<\/em> (#Flu Cases) = \u20133,448,225 + 1749.777 (year)<\/p>\n<table id=\"fs-idm85288224\" summary=\"..\">\n<thead>\n<tr>\n<th>Coefficients<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>Intercept<\/td>\n<td>\u20133,448,225<\/td>\n<\/tr>\n<tr>\n<td><em data-effect=\"italics\">X<\/em> Variable 1<\/td>\n<td>1,749.777<\/p>\n<p>&nbsp;<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<h2 data-type=\"solution\">Outliers \u2013 Practice<\/h2>\n<\/div>\n<div data-type=\"solution\">\n<p>51.\u00a0Yes, there appears to be an outlier at (6, 58).<\/p>\n<p>53.\u00a0The potential outlier flattened the slope of the line of best fit because it was below the data set. It made the line of best fit less accurate as a predictor for the data.<\/p>\n<p>55.\u00a0<em data-effect=\"italics\">s<\/em> = 1.75<\/p>\n<div data-type=\"solution\">\n<section id=\"fs-idm86899200\" class=\"free-response focusable\" tabindex=\"-1\" data-depth=\"1\">\n<div id=\"element-957\" class=\"exercise\" data-type=\"exercise\">\n<section class=\"focusable\" tabindex=\"-1\">\n<div id=\"id44886916\" class=\"problem\" data-type=\"problem\">\n<h2>Linear Equations \u2013 Homework<\/h2>\n<\/div>\n<\/section>\n<\/div>\n<\/section>\n<\/div>\n<p>57.<\/p>\n<\/div>\n<div data-type=\"solution\">\n<ol>\n<li>independent variable: age; dependent variable: fatalities<\/li>\n<li>independent variable: # of family members; dependent variable: grocery bill<\/li>\n<li>independent variable: age of applicant; dependent variable: insurance premium<\/li>\n<li>independent variable: power consumption; dependent variable: utility<\/li>\n<li>independent variable: higher education (years); dependent variable: crime rates<\/li>\n<\/ol>\n<\/div>\n<div data-type=\"problem\">\n<h2>Scatter Plots \u2013 Homework<\/h2>\n<p>59. Check student&#8217;s solution.<\/p>\n<\/div>\n<section class=\"ui-body\">\n<section class=\"ui-body\">\n<section class=\"ui-body\">61.\u00a0For graph: check student\u2019s solution. Note that tuition is the independent variable and salary is the dependent variable.<\/p>\n<div class=\"exercise\" data-type=\"exercise\">63. 13<\/div>\n<\/section>\n<\/section>\n<div class=\"exercise\" data-type=\"exercise\">\n<h2>The Regression Equation &#8211; Homework<\/h2>\n<p>65.\u00a0It means that 72% of the variation in the dependent variable (<em data-effect=\"italics\">y<\/em>) can be explained by the variation in the independent variable (<em data-effect=\"italics\">x<\/em>).<\/p>\n<h2>Prediction &#8211; Homework<\/h2>\n<\/div>\n<\/section>\n<p>67.<\/p>\n<ol>\n<li>\n<table id=\"id108047k45\" summary=\"This table presents the age groups in the first column and the number of driver deaths per 100,000 in the second column.\">\n<thead>\n<tr>\n<th>Age<\/th>\n<th>Number of Driver Deaths per 100,000<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>16\u201319<\/td>\n<td data-align=\"center\">38<\/td>\n<\/tr>\n<tr>\n<td>20\u201324<\/td>\n<td data-align=\"center\">36<\/td>\n<\/tr>\n<tr>\n<td>25\u201334<\/td>\n<td data-align=\"center\">24<\/td>\n<\/tr>\n<tr>\n<td>35\u201354<\/td>\n<td data-align=\"center\">20<\/td>\n<\/tr>\n<tr>\n<td>55\u201374<\/td>\n<td data-align=\"center\">18<\/td>\n<\/tr>\n<tr>\n<td>75+<\/td>\n<td data-align=\"center\">28<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/li>\n<li>Check student\u2019s solution.<\/li>\n<li><em data-effect=\"italics\">\u0177<\/em> = 35.5818045 \u2013 0.19182491<em data-effect=\"italics\">x<\/em><\/li>\n<li><em data-effect=\"italics\">r<\/em> = \u20130.57874\n<div data-type=\"newline\"><\/div>\n<p>For four <em data-effect=\"italics\">df<\/em> and alpha = 0.05, the LinRegTTest gives <em data-effect=\"italics\">p<\/em>-value = 0.2288 so we do not reject the null hypothesis; there is not a significant linear relationship between deaths and age.<\/p>\n<p>Using the table of critical values for the correlation coefficient, with four <em data-effect=\"italics\">df<\/em>, the critical value is 0.811. The correlation coefficient <em data-effect=\"italics\">r<\/em> = \u20130.57874 is not less than \u20130.811, so we do not reject the null hypothesis.<\/li>\n<li>There is not a linear relationship between the two variables, as evidenced by a p-value greater than 0.05.<\/li>\n<\/ol>\n<p>69.<\/p>\n<div class=\"exercise\" data-type=\"exercise\">\n<section>\n<div class=\"solution ui-solution-visible\" data-type=\"solution\">\n<section class=\"ui-body\">\n<ol id=\"fs-idp21667344\" data-number-style=\"lower-alpha\">\n<li>We wonder if the better discounts appear earlier in the book so we select page as <em data-effect=\"italics\">X<\/em> and discount as <em data-effect=\"italics\">Y<\/em>.<\/li>\n<li>Check student\u2019s solution.<\/li>\n<li><em data-effect=\"italics\">\u0177<\/em> = 17.21757 \u2013 0.01412<em data-effect=\"italics\">x<\/em><\/li>\n<li><em data-effect=\"italics\">r<\/em> = \u2013 0.2752\n<div data-type=\"newline\"><\/div>\n<p>For seven <em data-effect=\"italics\">df<\/em> and alpha = 0.05, using LinRegTTest <em data-effect=\"italics\">p<\/em>-value = 0.4736 so we do not reject; there is a not a significant linear relationship between page and discount.<\/p>\n<p>Using the table of critical values for the correlation coefficient, with seven <em data-effect=\"italics\">df<\/em>, the critical value is 0.666. The correlation coefficient <em data-effect=\"italics\">xi<\/em> = \u20130.2752 is not less than 0.666 so we do not reject it.<\/li>\n<li>There is not a significant linear correlation so it appears there is no relationship between the page and the amount of the discount.<\/li>\n<\/ol>\n<p id=\"fs-idp70584384\">As the page number increases by one page, the discount decreases by $0.01412<\/p>\n<\/section>\n<\/div>\n<\/section>\n<\/div>\n<div class=\"exercise\" data-type=\"exercise\">\n<p>71.<\/p>\n<section class=\"ui-body\">\n<ol id=\"fs-idp34721504\" data-number-style=\"lower-alpha\">\n<li>Year is the independent or <em data-effect=\"italics\">x<\/em> variable; the number of letters is the dependent or <em data-effect=\"italics\">y<\/em> variable.<\/li>\n<li>Check student\u2019s solution.<\/li>\n<li>no<\/li>\n<li><em data-effect=\"italics\">\u0177<\/em> = 47.03 \u2013 0.0216<em data-effect=\"italics\">x<\/em><\/li>\n<li>\u20130.4280<\/li>\n<li>6; 5<\/li>\n<li>No, the relationship does not appear to be linear; the correlation is not significant.<\/li>\n<li>current year: 2013: 3.55 or four letters; this is not an appropriate use of the least-squares line. It is extrapolation.<\/li>\n<\/ol>\n<h2>Outliers &#8211; Homework<\/h2>\n<\/section>\n<p>73.<\/p>\n<section class=\"ui-body\">\n<p id=\"fs-idp172680496\">a. and b. Check student\u2019s solution.<\/p>\n<p id=\"fs-idp172680880\">c. The slope of the regression line is -0.3179 with a y-intercept of 32.966. In context, the y-intercept indicates that when there are no returning sparrow hawks, there will be almost 31% new sparrow hawks, which doesn\u2019t make sense since if there are no returning birds, then the new percentage would have to be 100% (this is an example of why we do not extrapolate). The slope tells us that for each percentage increase in returning birds, the percentage of new birds in the colony decreases by 0.3179%.<\/p>\n<p id=\"fs-idp36047136\">d. If we examine r2, we see that only 50.238% of the variation in the percent of new birds is explained by the model, and the correlation coefficient, r = 0.71, only indicates a somewhat strong correlation between returning and new percentages.<\/p>\n<p id=\"fs-idp137409168\">e. The ordered pair (66, 6) generates the largest residual of 6.0. This means that when the observed return percentage is 66%, our observed new percentage, 6%, is almost 6% less than the predicted new value of 11.98%. If we remove this data pair, we see only an adjusted slope of -0.2723 and an adjusted intercept of 30.606. In other words, even though this data generates the largest residual, it is not an outlier, nor is the data pair an influential point.<\/p>\n<p id=\"fs-idp89140080\">f. If there are 70% returning birds, we would expect to see y = -0.2723(70) + 30.606 = 0.115 or 11.5% new birds in the colony.<\/p>\n<\/section>\n<p>75.<\/p>\n<section class=\"ui-body\">\n<ol id=\"eip-idm38243616\" data-number-style=\"lower-alpha\">\n<li>Check student\u2019s solution.<\/li>\n<li>Check student\u2019s solution.<\/li>\n<li>We have a slope of \u20131.4946 with a <em data-effect=\"italics\">y<\/em>-intercept of 193.88. The slope, in context, indicates that for each additional minute added to the swim time, the heart rate will decrease by 1.5 beats per minute. If the student is not swimming at all, the <em data-effect=\"italics\">y<\/em>-intercept indicates that his heart rate will be 193.88 beats per minute. While the slope has meaning (the longer it takes to swim 2,000 meters, the less effort the heart puts out), the <em data-effect=\"italics\">y<\/em>-intercept does not make sense. If the athlete is not swimming (resting), then his heart rate should be very low.<\/li>\n<li>Since only 1.5% of the heart rate variation is explained by this regression equation, we must conclude that this association is not explained by a linear relationship.<\/li>\n<li>The point (34.72, 124) generates the largest residual of \u201311.82. This means that our observed heart rate is almost 12 beats less than our predicted rate of 136 beats per minute. When this point is removed, the slope becomes 1.6914 with the y-intercept changing to 83.694. While the linear association is still very weak, we see that the removed data pair can be considered an influential point in the sense that the <em data-effect=\"italics\">y<\/em>-intercept becomes more meaningful.<\/li>\n<\/ol>\n<p>77.\u00a0If we remove the two service academies (the tuition is $0.00), we construct a new regression equation of <em data-effect=\"italics\">y<\/em> = \u20130.0009x + 160 with a correlation coefficient of 0.71397 and a coefficient of determination of 0.50976. This allows us to say there is a fairly strong linear association between tuition costs and salaries if the service academies are removed from the data set.<\/p>\n<\/section>\n<p>79.<\/p>\n<section class=\"ui-body\">\n<ol data-number-style=\"lower-alpha\">\n<li>Check student&#8217;s solution.<\/li>\n<li>yes<\/li>\n<li><em data-effect=\"italics\">\u0177<\/em> = \u2212266.8863+0.1656x<\/li>\n<li>0.9448; Yes<\/li>\n<li>62.8233; 62.3265<\/li>\n<li>yes<\/li>\n<li>yes; (1987, 62.7)<\/li>\n<li>72.5937; no<\/li>\n<li>slope = 0.1656.\n<div data-type=\"newline\"><\/div>\n<p>As the year increases by one, the percent of workers paid hourly rates tends to increase by 0.1656.<\/li>\n<\/ol>\n<p>81.<\/p>\n<table id=\"eip-idm171664512\" summary=\"..\">\n<thead>\n<tr>\n<th>Size (ounces)<\/th>\n<th>Cost ($)<\/th>\n<th>cents\/oz<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>16<\/td>\n<td>3.99<\/td>\n<td>24.94<\/td>\n<\/tr>\n<tr>\n<td>32<\/td>\n<td>4.99<\/td>\n<td>15.59<\/td>\n<\/tr>\n<tr>\n<td>64<\/td>\n<td>5.99<\/td>\n<td>9.36<\/td>\n<\/tr>\n<tr>\n<td>200<\/td>\n<td>10.99<\/td>\n<td>5.50<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Check student\u2019s solution.<br \/>\nThere is a linear relationship for the sizes 16 through 64, but that linear trend does not continue to the 200-oz size.<br \/>\n<em data-effect=\"italics\">\u0177<\/em> = 20.2368 \u2013 0.0819x<br \/>\n<em data-effect=\"italics\">r<\/em> = \u20130.8086<br \/>\n40-oz: 16.96 cents\/oz<br \/>\n90-oz: 12.87 cents\/oz<br \/>\nThe relationship is not linear; the least-squares line is not appropriate.<br \/>\nno outliers<br \/>\nNo, you would be extrapolating. The 300-oz size is outside the range of <em data-effect=\"italics\">x<\/em>.<br \/>\nslope = \u20130.08194; for each additional ounce in size, the cost per ounce decreases by 0.082 cents.<\/p>\n<\/section>\n<p>83.<\/p>\n<section class=\"ui-body\">\n<ol id=\"fs-idm47882016\" data-number-style=\"lower-alpha\">\n<li>Size is <em data-effect=\"italics\">x<\/em>, the independent variable, price is <em data-effect=\"italics\">y<\/em>, the dependent variable.<\/li>\n<li>Check student\u2019s solution.<\/li>\n<li>The relationship does not appear to be linear.<\/li>\n<li><em data-effect=\"italics\">\u0177<\/em> = \u2013745.252 + 54.75569<em data-effect=\"italics\">x<\/em><\/li>\n<li><em data-effect=\"italics\">r<\/em> = 0.8944, yes it is significant<\/li>\n<li>32-inch: $1006.93, 50-inch: $1992.53<\/li>\n<li>No, the relationship does not appear to be linear. However, <em data-effect=\"italics\">r<\/em> is significant.<\/li>\n<li>yes, the 60-inch TV<\/li>\n<li>For each additional inch, the price increases by $54.76<\/li>\n<\/ol>\n<p>85.<\/p>\n<section class=\"ui-body\">\n<ol id=\"fs-idp131482224\" data-number-style=\"lower-alpha\">\n<li>Let rank be the independent variable and area be the dependent variable.<\/li>\n<li>Check student\u2019s solution.<\/li>\n<li>There appears to be a linear relationship, with one outlier.<\/li>\n<li><em data-effect=\"italics\">\u0177<\/em> (area) = 24177.06 + 1010.478<em data-effect=\"italics\">x<\/em><\/li>\n<li><em data-effect=\"italics\">r<\/em> = 0.50047, <em data-effect=\"italics\">r<\/em> is not significant so there is no relationship between the variables.<\/li>\n<li>Alabama: 46407.576 Colorado: 62575.224<\/li>\n<li>Alabama&#8217;s estimate is closer than Colorado&#8217;s estimate.<\/li>\n<li>If the outlier is removed, there is a linear relationship.<\/li>\n<li>There is one outlier (Hawaii).<\/li>\n<li>rank 51: 75711.4; no<\/li>\n<li>\n<table id=\"fs-idp169197456\" summary=\"..\">\n<caption>\u00a0<\/caption>\n<thead>\n<tr>\n<th>Alabama<\/th>\n<th>7<\/th>\n<th>1819<\/th>\n<th>22<\/th>\n<th>52,423<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>Colorado<\/td>\n<td>8<\/td>\n<td>1876<\/td>\n<td>38<\/td>\n<td>104,100<\/td>\n<\/tr>\n<tr>\n<td>Alaska<\/td>\n<td>6<\/td>\n<td>1959<\/td>\n<td>51<\/td>\n<td>656,424<\/td>\n<\/tr>\n<tr>\n<td>Iowa<\/td>\n<td>4<\/td>\n<td>1846<\/td>\n<td>29<\/td>\n<td>56,276<\/td>\n<\/tr>\n<tr>\n<td>Maryland<\/td>\n<td>8<\/td>\n<td>1788<\/td>\n<td>7<\/td>\n<td>12,407<\/td>\n<\/tr>\n<tr>\n<td>Missouri<\/td>\n<td>8<\/td>\n<td>1821<\/td>\n<td>24<\/td>\n<td>69,709<\/td>\n<\/tr>\n<tr>\n<td>New Jersey<\/td>\n<td>9<\/td>\n<td>1787<\/td>\n<td>3<\/td>\n<td>8,722<\/td>\n<\/tr>\n<tr>\n<td>Ohio<\/td>\n<td>4<\/td>\n<td>1803<\/td>\n<td>17<\/td>\n<td>44,828<\/td>\n<\/tr>\n<tr>\n<td>South Carolina<\/td>\n<td>13<\/td>\n<td>1788<\/td>\n<td>8<\/td>\n<td>32,008<\/td>\n<\/tr>\n<tr>\n<td>Utah<\/td>\n<td>4<\/td>\n<td>1896<\/td>\n<td>45<\/td>\n<td>84,904<\/td>\n<\/tr>\n<tr>\n<td>Wisconsin<\/td>\n<td>9<\/td>\n<td>1848<\/td>\n<td>30<\/td>\n<td>65,499<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/li>\n<li><em data-effect=\"italics\">\u0177<\/em> = \u201387065.3 + 7828.532x<\/li>\n<li>Alabama: 85,162.404; the prior estimate was closer. Alaska is an outlier.<\/li>\n<li>yes, with the exception of Hawaii<\/li>\n<\/ol>\n<\/section>\n<\/section>\n<\/div>\n\n\t\t\t <section class=\"citations-section\" role=\"contentinfo\">\n\t\t\t <h3>Candela Citations<\/h3>\n\t\t\t\t\t <div>\n\t\t\t\t\t\t <div id=\"citation-list-312\">\n\t\t\t\t\t\t\t <div class=\"licensing\"><div class=\"license-attribution-dropdown-subheading\">CC licensed content, Shared previously<\/div><ul class=\"citation-list\"><li>Introductory Statistics. <strong>Authored by<\/strong>: Barbara Illowsky, Susan Dean. <strong>Provided by<\/strong>: OpenStax. <strong>Located at<\/strong>: <a target=\"_blank\" href=\"https:\/\/openstax.org\/books\/introductory-statistics\/pages\/1-introduction\">https:\/\/openstax.org\/books\/introductory-statistics\/pages\/1-introduction<\/a>. <strong>License<\/strong>: <em><a target=\"_blank\" rel=\"license\" href=\"https:\/\/creativecommons.org\/licenses\/by\/4.0\/\">CC BY: Attribution<\/a><\/em>. <strong>License Terms<\/strong>: Access for free at https:\/\/openstax.org\/books\/introductory-statistics\/pages\/1-introduction<\/li><\/ul><\/div>\n\t\t\t\t\t\t <\/div>\n\t\t\t\t\t <\/div>\n\t\t\t <\/section>","protected":false},"author":169134,"menu_order":30,"template":"","meta":{"_candela_citation":"[{\"type\":\"cc\",\"description\":\"Introductory Statistics\",\"author\":\"Barbara Illowsky, Susan Dean\",\"organization\":\"OpenStax\",\"url\":\"https:\/\/openstax.org\/books\/introductory-statistics\/pages\/1-introduction\",\"project\":\"\",\"license\":\"cc-by\",\"license_terms\":\"Access for free at https:\/\/openstax.org\/books\/introductory-statistics\/pages\/1-introduction\"}]","CANDELA_OUTCOMES_GUID":"","pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-312","chapter","type-chapter","status-publish","hentry"],"part":303,"_links":{"self":[{"href":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/wp-json\/pressbooks\/v2\/chapters\/312","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/wp-json\/wp\/v2\/users\/169134"}],"version-history":[{"count":6,"href":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/wp-json\/pressbooks\/v2\/chapters\/312\/revisions"}],"predecessor-version":[{"id":4010,"href":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/wp-json\/pressbooks\/v2\/chapters\/312\/revisions\/4010"}],"part":[{"href":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/wp-json\/pressbooks\/v2\/parts\/303"}],"metadata":[{"href":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/wp-json\/pressbooks\/v2\/chapters\/312\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/wp-json\/wp\/v2\/media?parent=312"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/wp-json\/pressbooks\/v2\/chapter-type?post=312"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/wp-json\/wp\/v2\/contributor?post=312"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/introstatscorequisite\/wp-json\/wp\/v2\/license?post=312"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}