{"id":3843,"date":"2022-03-15T23:16:46","date_gmt":"2022-03-15T23:16:46","guid":{"rendered":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/?post_type=chapter&#038;p=3843"},"modified":"2022-06-03T06:56:38","modified_gmt":"2022-06-03T06:56:38","slug":"forming-connections-in-6-a","status":"publish","type":"chapter","link":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/chapter\/forming-connections-in-6-a\/","title":{"raw":"Forming Connections in 6.A: Exploring Lines of Best Fit","rendered":"Forming Connections in 6.A: Exploring Lines of Best Fit"},"content":{"raw":"<div class=\"textbox learning-objectives\">\r\n<h3>Objectives for this activity<\/h3>\r\nDuring this activity you will:\r\n<ul>\r\n \t<li style=\"font-weight: 400;\" aria-level=\"1\">Identify the explanatory and response variables given the context of a study.<\/li>\r\n \t<li style=\"font-weight: 400;\" aria-level=\"1\">Decide when linear regression is appropriate and when it is not appropriate.<\/li>\r\n \t<li style=\"font-weight: 400;\" aria-level=\"1\">Use data analysis tools to generate appropriate scatterplots and the line of best fit.<\/li>\r\n \t<li style=\"font-weight: 400;\" aria-level=\"1\">Use data analysis tools to identify the equation of the line of best fit and the correlation coefficient r.<\/li>\r\n<\/ul>\r\n<\/div>\r\n<h2>Straight Talk About Lines<\/h2>\r\n<img class=\"alignnone wp-image-1178\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/12030216\/Picture120-300x200.jpg\" alt=\"Two people smiling and looking at a whiteboard\" width=\"726\" height=\"484\" \/>\r\n\r\nIn the\u00a0<em>What to Know<\/em> page for this section, you learned definitions of the explanatory and response variables in a bivariate dataset and developed an understanding of when a linear relationship may exist between them. You should have enough background now to identify scenarios of bivariate data for which a linear regression analysis might be appropriate and to calculate and write the equation of a line of best fit. We'll continue to extend that knowledge in this activity as we practice these skills and learn about a measure of strength in a linear relationship: the correlation coefficient, [latex]r[\/latex]. Along the way, you'll deepen your understanding of the concept of a line of best fit and the method of linear regression analysis on a given dataset.\r\n<h3>Explanatory and Response Variables<\/h3>\r\nDuring the previous\u00a0<em>What to Know<\/em>, you were asked to look over your notes from [WTK 5A] and write down three different examples where you had listed an explanatory variable that could be used to predict a response variable. Both of those variables should have been quantitative for the purposes of this activity. You were asked to identify one set with a positive association, one with no (or almost no) association, and one with a negative association.\r\n\r\nPlease retrieve those examples from your notes now and discuss them with a partner to answer Question 1. For added interest, \"test\" your partner to see if they can identify the explanatory and response variables in your scenario.\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 1<\/h3>\r\nIn the preview assignment, you were asked to think about a few scenarios in which an explanatory variable could be used to predict a response variable.\r\n\r\n&nbsp;\r\n\r\nPart A: Share your favorite scenario with a partner and take turns identifying the explanatory and response variables.\r\n\r\n[reveal-answer q=\"714487\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"714487\"]What was your favorite or most interesting scenario? What was your partner's?[\/hidden-answer]\r\n\r\n&nbsp;\r\n\r\nPart B: Draw scatterplots describing the two scenarios, and then sketch the line of best fit for each scatterplot.\r\n\r\n[reveal-answer q=\"183960\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"183960\"]You can either upload your sketches or describe them here.[\/hidden-answer]\r\n\r\n<\/div>\r\nWhen analyzing bivariate data, it is important to first clearly identify the explanatory and response variables and plot the data to identify any visually obvious trends.\r\n\r\nLet\u2019s look at an example.\r\n<div class=\"textbox tryit\">\r\n<h3>Guidance<\/h3>\r\n<span style=\"background-color: #e6daf7;\">[Intro:\u00a0Form into groups of four to continue this activity. As you answer Questions 2 through 5, try to begin to establish for your group a list of steps involved in analyzing bivariate data. We know that the first step is to clearly identify the explanatory and response variables, ensuring they are both quantitative for a Least Squares Regression analysis, and responsibly obtain data. Recall that you should ensure your sampling methods are random and bias is minimized as much as possible. Then, in the second step, you would plot the data to visually assess any present trends. What steps follow after these when performing the analysis? Are there any concerns that arise for you as you follow the example below involving student test scores? Regroup after Question 5 to compare your group's list of steps and concerns with others before moving further in the activity. ]<\/span>\r\n\r\n<\/div>\r\n<h3>Linear Regression Analysis<\/h3>\r\nGeorge, a current student, got a 36 out of 50 on the first midterm (C-). He asked his instructor, \u201cIf I don\u2019t change my study approach, how do you predict I will do on the final exam?\u201d\r\n\r\nOne way to answer this question is to look at the bivariate data of student scores from a previous class. In this case, we choose a random sample of past students who did not seek out additional tutoring and\/or support between the midterm and the final.\r\n\r\nThe following is a dataset from a random sample of past students who did not seek out advice on study skills or additional tutoring between the midterm and the final exam. To protect their anonymity, only first names are shown.\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td><strong>Student First Name<\/strong><\/td>\r\n<td><strong>Midterm Score<\/strong>\r\n\r\n<strong>(out of 50 points)<\/strong><\/td>\r\n<td><strong>Final Exam Score<\/strong>\r\n\r\n<strong>(out of 100 points)<\/strong><\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Joe<\/td>\r\n<td>42<\/td>\r\n<td>64<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Barak<\/td>\r\n<td>52<\/td>\r\n<td>94<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Hillary<\/td>\r\n<td>44<\/td>\r\n<td>87<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Donald<\/td>\r\n<td>25<\/td>\r\n<td>46<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Cher<\/td>\r\n<td>41<\/td>\r\n<td>73<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Katy<\/td>\r\n<td>39<\/td>\r\n<td>73<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Taylor<\/td>\r\n<td>33<\/td>\r\n<td>53<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Miley<\/td>\r\n<td>40<\/td>\r\n<td>77<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Justin<\/td>\r\n<td>35<\/td>\r\n<td>60<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Snoop<\/td>\r\n<td>31<\/td>\r\n<td>62<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Bruno<\/td>\r\n<td>37<\/td>\r\n<td>71<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Kanye<\/td>\r\n<td>49<\/td>\r\n<td>95<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Leonardo<\/td>\r\n<td>38<\/td>\r\n<td>70<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Rosie<\/td>\r\n<td>45<\/td>\r\n<td>80<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Maya<\/td>\r\n<td>49<\/td>\r\n<td>80<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Tyra<\/td>\r\n<td>48<\/td>\r\n<td>82<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Selena<\/td>\r\n<td>50<\/td>\r\n<td>81<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<h3>Using Technology in Analysis<\/h3>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>question 2<\/h3>\r\nIdentify the explanatory and response variables.\r\n\r\nGo to the <em>Linear Regression<\/em> tool at <a href=\"https:\/\/dcmathpathways.shinyapps.io\/LinearRegression\/\">https:\/\/dcmathpathways.shinyapps.io\/LinearRegression\/<\/a> and plot the data using the following inputs:\r\n<ul>\r\n \t<li style=\"font-weight: 400;\" aria-level=\"1\">Under \u201cEnter Data,\u201d select \u201cEnter Own.\u201d<\/li>\r\n \t<li style=\"font-weight: 400;\" aria-level=\"1\">Name the X (explanatory) and Y (response) variables appropriately.<\/li>\r\n \t<li style=\"font-weight: 400;\" aria-level=\"1\">Copy and paste the data from <span style=\"background-color: #ffff00;\">DCMP_STAT_6A_Student_Scores<\/span>\u00a0[<span style=\"background-color: #ffff00;\">link this spreadsheet here<\/span>] or enter the data in the table by hand. Make sure the explanatory variable is in the first column and the response variable is in the second column.<\/li>\r\n \t<li style=\"font-weight: 400;\" aria-level=\"1\">Under \u201cPlot Options,\u201d select \u201cRegression Line.\u201d<\/li>\r\n<\/ul>\r\n<ul>\r\n \t<li style=\"font-weight: 400;\" aria-level=\"1\">Click \u201cSubmit Data\u201d button.<\/li>\r\n<\/ul>\r\n[reveal-answer q=\"436274\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"436274\"]Ensure that the data is entered correctly, that the variables are in the correct column, and that you have selected to show the regression line.[\/hidden-answer]\r\n\r\n&nbsp;\r\n\r\n<\/div>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>question 3<\/h3>\r\nDo you think the line of best fit is a good model of the relationship between midterm and final exam score? Explain.\r\n\r\n[reveal-answer q=\"22743\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"22743\"]How closely do the data appear to follow the line?[\/hidden-answer]\r\n\r\n<\/div>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>question 4<\/h3>\r\nWrite the equation of the least squares regression line using appropriate notation.\r\n\r\n[reveal-answer q=\"971061\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"971061\"]The equation is shown in the data analysis tool.[\/hidden-answer]\r\n\r\n&nbsp;\r\n\r\nPart A: Is the relationship positive or negative?\r\n\r\n[reveal-answer q=\"324890\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"324890\"]Recall the shape of a positive (increasing trend) vs. a negative (decreasing) trend. [\/hidden-answer]\r\n\r\n&nbsp;\r\n\r\nPart B: What is the value of r? Does this value indicate that the linear relationship between the two variables will be strong, moderate, or weak?\r\n\r\n[reveal-answer q=\"983122\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"983122\"]This value is located in the tool under the Model Summary.[\/hidden-answer]\r\n\r\n<\/div>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>question 5<\/h3>\r\nDo you think George should be nervous about the final exam?\r\n\r\n[reveal-answer q=\"873849\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"873849\"]What do <em>you <\/em>think? Did students who struggled on the midterm tend to struggle on the final without getting any help?[\/hidden-answer]\r\n\r\n<\/div>\r\n<div class=\"textbox tryit\">\r\n<h3>Guidance<\/h3>\r\n<span style=\"background-color: #e6daf7;\">[Summary: How did you do with a list of steps for performing a LSR analysis? Generally, the steps can be listed as follows:\r\nStep 1) Identify the explanatory and response variables, then gather data as needed.\r\nStep 2) Plot the data on a scatterplot, placing the explanatory variable along the horizontal (x) axis and the response variable along the vertical (y) axis. <\/span>\r\n\r\n<span style=\"background-color: #e6daf7;\">Step 3) Visually confirm that the data seem to follow a linear paggern.<\/span>\r\n\r\n<span style=\"background-color: #e6daf7;\">Step 4) Calculate and sketch the line of best fit in the plot and obtain the correlation coefficient [latex]r[\/latex]. Visually confirm that the line appears to pass through the data as closely as possible, minimizing how much the data points deviate from the line.<\/span>\r\n\r\n<span style=\"background-color: #e6daf7;\">You may not have listed a Step 5, which we will cover in [6E] later, during which you'll interpret the coefficients (correlation and determination), assess the model accuracy and fit, and make appropriate predictions.<\/span>\r\n\r\nDid you note any concerns about the data collection? Hopefully you discussed the need to make the data anonymous or de-identified since it involved student grades.\r\n\r\n<\/div>\r\nContinue to work in groups for the remainder of the activity. As you consider the opening question in Question 6, try not to spend too much time debating the issue. You might even take a moment to discuss the implications in Part A of answering both \"yes\" and \"no.\"\r\n<h3>Line of Best Fit and The Correlation Coefficient<\/h3>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>question 6<\/h3>\r\nNow, consider the following question: \u201cCan steady driving speed be used to predict fuel efficiency?\u201d\r\n\r\n&nbsp;\r\n\r\nPart A: If you answered \u201cyes,\u201d do you think the relationship between driving speed and fuel efficiency would be positive or negative? If you answered \u201cno,\u201d explain.\r\n\r\n[reveal-answer q=\"427257\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"427257\"]Give your opinion to answer this question. There is no correct or incorrect response.[\/hidden-answer]\r\n\r\n&nbsp;\r\n\r\nPart B: Identify the explanatory and response variables.\r\n\r\n[reveal-answer q=\"158870\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"158870\"]Which variable measures the outcome and which drives it?<span style=\"font-size: 1rem; text-align: initial;\">[\/hidden-answer]<\/span>\r\n\r\n<\/div>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>question 7<\/h3>\r\nGo to the <em>Linear Regression<\/em> tool at <a href=\"https:\/\/dcmathpathways.shinyapps.io\/LinearRegression\/\">https:\/\/dcmathpathways.shinyapps.io\/LinearRegression\/<\/a> and plot the data using the following inputs:\r\n<ul>\r\n \t<li style=\"font-weight: 400;\" aria-level=\"1\">Under \u201cEnter Data,\u201d select \u201cFrom Textbook.\u201d<\/li>\r\n \t<li style=\"font-weight: 400;\" aria-level=\"1\">Under \u201cChoose Dataset,\u201d select \u201cFuel Efficiency and Speed.\u201d<\/li>\r\n<\/ul>\r\n&nbsp;\r\n\r\nPart A: Is the relationship positive or negative?\r\n\r\n[reveal-answer q=\"725124\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"725124\"]Consider the shape of the entire plot before answering.[\/hidden-answer]\r\n\r\n&nbsp;\r\n\r\nPart B: Find the correlation coefficient. Does this value indicate that the linear relationship between the two variables will be strong, moderate, or weak?\r\n\r\n[reveal-answer q=\"527012\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"527012\"]Locate the value in the tool beneath the plot.[\/hidden-answer]\r\n\r\n<\/div>\r\n<div class=\"textbox key-takeaways\">\r\n<h3>question 8<\/h3>\r\nIs a least squares regression line a reasonable model for the relationship between driving speed and fuel efficiency?\r\n\r\n[reveal-answer q=\"850817\"]Hint[\/reveal-answer]\r\n[hidden-answer a=\"850817\"]What do <em>you\u00a0<\/em>think?\u00a0[\/hidden-answer]\r\n\r\n<\/div>\r\n<div class=\"textbox tryit\">\r\n<h3>Guidance<\/h3>\r\n<span style=\"background-color: #e6daf7;\">[Wrap-up: Did you clearly state and justify your conclusions to answer Question 8? If you found that the LSR line is not a reasonable model for the relationship, clearly state why you believe this using both a visual analysis and the value of [latex]r[\/latex]. In this case, all analysis indicates that a linear model would fail to make reasonable predictions for this dataset. There is no linear relationship. As you end the activity, take a look back at the objectives and point out the places where they appeared in the questions. ]<\/span>\r\n\r\n<\/div>\r\n&nbsp;","rendered":"<div class=\"textbox learning-objectives\">\n<h3>Objectives for this activity<\/h3>\n<p>During this activity you will:<\/p>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\">Identify the explanatory and response variables given the context of a study.<\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\">Decide when linear regression is appropriate and when it is not appropriate.<\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\">Use data analysis tools to generate appropriate scatterplots and the line of best fit.<\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\">Use data analysis tools to identify the equation of the line of best fit and the correlation coefficient r.<\/li>\n<\/ul>\n<\/div>\n<h2>Straight Talk About Lines<\/h2>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-1178\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/12030216\/Picture120-300x200.jpg\" alt=\"Two people smiling and looking at a whiteboard\" width=\"726\" height=\"484\" \/><\/p>\n<p>In the\u00a0<em>What to Know<\/em> page for this section, you learned definitions of the explanatory and response variables in a bivariate dataset and developed an understanding of when a linear relationship may exist between them. You should have enough background now to identify scenarios of bivariate data for which a linear regression analysis might be appropriate and to calculate and write the equation of a line of best fit. We&#8217;ll continue to extend that knowledge in this activity as we practice these skills and learn about a measure of strength in a linear relationship: the correlation coefficient, [latex]r[\/latex]. Along the way, you&#8217;ll deepen your understanding of the concept of a line of best fit and the method of linear regression analysis on a given dataset.<\/p>\n<h3>Explanatory and Response Variables<\/h3>\n<p>During the previous\u00a0<em>What to Know<\/em>, you were asked to look over your notes from [WTK 5A] and write down three different examples where you had listed an explanatory variable that could be used to predict a response variable. Both of those variables should have been quantitative for the purposes of this activity. You were asked to identify one set with a positive association, one with no (or almost no) association, and one with a negative association.<\/p>\n<p>Please retrieve those examples from your notes now and discuss them with a partner to answer Question 1. For added interest, &#8220;test&#8221; your partner to see if they can identify the explanatory and response variables in your scenario.<\/p>\n<div class=\"textbox key-takeaways\">\n<h3>Question 1<\/h3>\n<p>In the preview assignment, you were asked to think about a few scenarios in which an explanatory variable could be used to predict a response variable.<\/p>\n<p>&nbsp;<\/p>\n<p>Part A: Share your favorite scenario with a partner and take turns identifying the explanatory and response variables.<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q714487\">Hint<\/span><\/p>\n<div id=\"q714487\" class=\"hidden-answer\" style=\"display: none\">What was your favorite or most interesting scenario? What was your partner&#8217;s?<\/div>\n<\/div>\n<p>&nbsp;<\/p>\n<p>Part B: Draw scatterplots describing the two scenarios, and then sketch the line of best fit for each scatterplot.<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q183960\">Hint<\/span><\/p>\n<div id=\"q183960\" class=\"hidden-answer\" style=\"display: none\">You can either upload your sketches or describe them here.<\/div>\n<\/div>\n<\/div>\n<p>When analyzing bivariate data, it is important to first clearly identify the explanatory and response variables and plot the data to identify any visually obvious trends.<\/p>\n<p>Let\u2019s look at an example.<\/p>\n<div class=\"textbox tryit\">\n<h3>Guidance<\/h3>\n<p><span style=\"background-color: #e6daf7;\">[Intro:\u00a0Form into groups of four to continue this activity. As you answer Questions 2 through 5, try to begin to establish for your group a list of steps involved in analyzing bivariate data. We know that the first step is to clearly identify the explanatory and response variables, ensuring they are both quantitative for a Least Squares Regression analysis, and responsibly obtain data. Recall that you should ensure your sampling methods are random and bias is minimized as much as possible. Then, in the second step, you would plot the data to visually assess any present trends. What steps follow after these when performing the analysis? Are there any concerns that arise for you as you follow the example below involving student test scores? Regroup after Question 5 to compare your group&#8217;s list of steps and concerns with others before moving further in the activity. ]<\/span><\/p>\n<\/div>\n<h3>Linear Regression Analysis<\/h3>\n<p>George, a current student, got a 36 out of 50 on the first midterm (C-). He asked his instructor, \u201cIf I don\u2019t change my study approach, how do you predict I will do on the final exam?\u201d<\/p>\n<p>One way to answer this question is to look at the bivariate data of student scores from a previous class. In this case, we choose a random sample of past students who did not seek out additional tutoring and\/or support between the midterm and the final.<\/p>\n<p>The following is a dataset from a random sample of past students who did not seek out advice on study skills or additional tutoring between the midterm and the final exam. To protect their anonymity, only first names are shown.<\/p>\n<table>\n<tbody>\n<tr>\n<td><strong>Student First Name<\/strong><\/td>\n<td><strong>Midterm Score<\/strong><\/p>\n<p><strong>(out of 50 points)<\/strong><\/td>\n<td><strong>Final Exam Score<\/strong><\/p>\n<p><strong>(out of 100 points)<\/strong><\/td>\n<\/tr>\n<tr>\n<td>Joe<\/td>\n<td>42<\/td>\n<td>64<\/td>\n<\/tr>\n<tr>\n<td>Barak<\/td>\n<td>52<\/td>\n<td>94<\/td>\n<\/tr>\n<tr>\n<td>Hillary<\/td>\n<td>44<\/td>\n<td>87<\/td>\n<\/tr>\n<tr>\n<td>Donald<\/td>\n<td>25<\/td>\n<td>46<\/td>\n<\/tr>\n<tr>\n<td>Cher<\/td>\n<td>41<\/td>\n<td>73<\/td>\n<\/tr>\n<tr>\n<td>Katy<\/td>\n<td>39<\/td>\n<td>73<\/td>\n<\/tr>\n<tr>\n<td>Taylor<\/td>\n<td>33<\/td>\n<td>53<\/td>\n<\/tr>\n<tr>\n<td>Miley<\/td>\n<td>40<\/td>\n<td>77<\/td>\n<\/tr>\n<tr>\n<td>Justin<\/td>\n<td>35<\/td>\n<td>60<\/td>\n<\/tr>\n<tr>\n<td>Snoop<\/td>\n<td>31<\/td>\n<td>62<\/td>\n<\/tr>\n<tr>\n<td>Bruno<\/td>\n<td>37<\/td>\n<td>71<\/td>\n<\/tr>\n<tr>\n<td>Kanye<\/td>\n<td>49<\/td>\n<td>95<\/td>\n<\/tr>\n<tr>\n<td>Leonardo<\/td>\n<td>38<\/td>\n<td>70<\/td>\n<\/tr>\n<tr>\n<td>Rosie<\/td>\n<td>45<\/td>\n<td>80<\/td>\n<\/tr>\n<tr>\n<td>Maya<\/td>\n<td>49<\/td>\n<td>80<\/td>\n<\/tr>\n<tr>\n<td>Tyra<\/td>\n<td>48<\/td>\n<td>82<\/td>\n<\/tr>\n<tr>\n<td>Selena<\/td>\n<td>50<\/td>\n<td>81<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h3>Using Technology in Analysis<\/h3>\n<div class=\"textbox key-takeaways\">\n<h3>question 2<\/h3>\n<p>Identify the explanatory and response variables.<\/p>\n<p>Go to the <em>Linear Regression<\/em> tool at <a href=\"https:\/\/dcmathpathways.shinyapps.io\/LinearRegression\/\">https:\/\/dcmathpathways.shinyapps.io\/LinearRegression\/<\/a> and plot the data using the following inputs:<\/p>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\">Under \u201cEnter Data,\u201d select \u201cEnter Own.\u201d<\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\">Name the X (explanatory) and Y (response) variables appropriately.<\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\">Copy and paste the data from <span style=\"background-color: #ffff00;\">DCMP_STAT_6A_Student_Scores<\/span>\u00a0[<span style=\"background-color: #ffff00;\">link this spreadsheet here<\/span>] or enter the data in the table by hand. Make sure the explanatory variable is in the first column and the response variable is in the second column.<\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\">Under \u201cPlot Options,\u201d select \u201cRegression Line.\u201d<\/li>\n<\/ul>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\">Click \u201cSubmit Data\u201d button.<\/li>\n<\/ul>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q436274\">Hint<\/span><\/p>\n<div id=\"q436274\" class=\"hidden-answer\" style=\"display: none\">Ensure that the data is entered correctly, that the variables are in the correct column, and that you have selected to show the regression line.<\/div>\n<\/div>\n<p>&nbsp;<\/p>\n<\/div>\n<div class=\"textbox key-takeaways\">\n<h3>question 3<\/h3>\n<p>Do you think the line of best fit is a good model of the relationship between midterm and final exam score? Explain.<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q22743\">Hint<\/span><\/p>\n<div id=\"q22743\" class=\"hidden-answer\" style=\"display: none\">How closely do the data appear to follow the line?<\/div>\n<\/div>\n<\/div>\n<div class=\"textbox key-takeaways\">\n<h3>question 4<\/h3>\n<p>Write the equation of the least squares regression line using appropriate notation.<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q971061\">Hint<\/span><\/p>\n<div id=\"q971061\" class=\"hidden-answer\" style=\"display: none\">The equation is shown in the data analysis tool.<\/div>\n<\/div>\n<p>&nbsp;<\/p>\n<p>Part A: Is the relationship positive or negative?<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q324890\">Hint<\/span><\/p>\n<div id=\"q324890\" class=\"hidden-answer\" style=\"display: none\">Recall the shape of a positive (increasing trend) vs. a negative (decreasing) trend. <\/div>\n<\/div>\n<p>&nbsp;<\/p>\n<p>Part B: What is the value of r? Does this value indicate that the linear relationship between the two variables will be strong, moderate, or weak?<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q983122\">Hint<\/span><\/p>\n<div id=\"q983122\" class=\"hidden-answer\" style=\"display: none\">This value is located in the tool under the Model Summary.<\/div>\n<\/div>\n<\/div>\n<div class=\"textbox key-takeaways\">\n<h3>question 5<\/h3>\n<p>Do you think George should be nervous about the final exam?<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q873849\">Hint<\/span><\/p>\n<div id=\"q873849\" class=\"hidden-answer\" style=\"display: none\">What do <em>you <\/em>think? Did students who struggled on the midterm tend to struggle on the final without getting any help?<\/div>\n<\/div>\n<\/div>\n<div class=\"textbox tryit\">\n<h3>Guidance<\/h3>\n<p><span style=\"background-color: #e6daf7;\">[Summary: How did you do with a list of steps for performing a LSR analysis? Generally, the steps can be listed as follows:<br \/>\nStep 1) Identify the explanatory and response variables, then gather data as needed.<br \/>\nStep 2) Plot the data on a scatterplot, placing the explanatory variable along the horizontal (x) axis and the response variable along the vertical (y) axis. <\/span><\/p>\n<p><span style=\"background-color: #e6daf7;\">Step 3) Visually confirm that the data seem to follow a linear paggern.<\/span><\/p>\n<p><span style=\"background-color: #e6daf7;\">Step 4) Calculate and sketch the line of best fit in the plot and obtain the correlation coefficient [latex]r[\/latex]. Visually confirm that the line appears to pass through the data as closely as possible, minimizing how much the data points deviate from the line.<\/span><\/p>\n<p><span style=\"background-color: #e6daf7;\">You may not have listed a Step 5, which we will cover in [6E] later, during which you&#8217;ll interpret the coefficients (correlation and determination), assess the model accuracy and fit, and make appropriate predictions.<\/span><\/p>\n<p>Did you note any concerns about the data collection? Hopefully you discussed the need to make the data anonymous or de-identified since it involved student grades.<\/p>\n<\/div>\n<p>Continue to work in groups for the remainder of the activity. As you consider the opening question in Question 6, try not to spend too much time debating the issue. You might even take a moment to discuss the implications in Part A of answering both &#8220;yes&#8221; and &#8220;no.&#8221;<\/p>\n<h3>Line of Best Fit and The Correlation Coefficient<\/h3>\n<div class=\"textbox key-takeaways\">\n<h3>question 6<\/h3>\n<p>Now, consider the following question: \u201cCan steady driving speed be used to predict fuel efficiency?\u201d<\/p>\n<p>&nbsp;<\/p>\n<p>Part A: If you answered \u201cyes,\u201d do you think the relationship between driving speed and fuel efficiency would be positive or negative? If you answered \u201cno,\u201d explain.<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q427257\">Hint<\/span><\/p>\n<div id=\"q427257\" class=\"hidden-answer\" style=\"display: none\">Give your opinion to answer this question. There is no correct or incorrect response.<\/div>\n<\/div>\n<p>&nbsp;<\/p>\n<p>Part B: Identify the explanatory and response variables.<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q158870\">Hint<\/span><\/p>\n<div id=\"q158870\" class=\"hidden-answer\" style=\"display: none\">Which variable measures the outcome and which drives it?<span style=\"font-size: 1rem; text-align: initial;\"><\/div>\n<\/div>\n<p><\/span><\/p>\n<\/div>\n<div class=\"textbox key-takeaways\">\n<h3>question 7<\/h3>\n<p>Go to the <em>Linear Regression<\/em> tool at <a href=\"https:\/\/dcmathpathways.shinyapps.io\/LinearRegression\/\">https:\/\/dcmathpathways.shinyapps.io\/LinearRegression\/<\/a> and plot the data using the following inputs:<\/p>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\">Under \u201cEnter Data,\u201d select \u201cFrom Textbook.\u201d<\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\">Under \u201cChoose Dataset,\u201d select \u201cFuel Efficiency and Speed.\u201d<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<p>Part A: Is the relationship positive or negative?<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q725124\">Hint<\/span><\/p>\n<div id=\"q725124\" class=\"hidden-answer\" style=\"display: none\">Consider the shape of the entire plot before answering.<\/div>\n<\/div>\n<p>&nbsp;<\/p>\n<p>Part B: Find the correlation coefficient. Does this value indicate that the linear relationship between the two variables will be strong, moderate, or weak?<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q527012\">Hint<\/span><\/p>\n<div id=\"q527012\" class=\"hidden-answer\" style=\"display: none\">Locate the value in the tool beneath the plot.<\/div>\n<\/div>\n<\/div>\n<div class=\"textbox key-takeaways\">\n<h3>question 8<\/h3>\n<p>Is a least squares regression line a reasonable model for the relationship between driving speed and fuel efficiency?<\/p>\n<div class=\"qa-wrapper\" style=\"display: block\"><span class=\"show-answer collapsed\" style=\"cursor: pointer\" data-target=\"q850817\">Hint<\/span><\/p>\n<div id=\"q850817\" class=\"hidden-answer\" style=\"display: none\">What do <em>you\u00a0<\/em>think?\u00a0<\/div>\n<\/div>\n<\/div>\n<div class=\"textbox tryit\">\n<h3>Guidance<\/h3>\n<p><span style=\"background-color: #e6daf7;\">[Wrap-up: Did you clearly state and justify your conclusions to answer Question 8? If you found that the LSR line is not a reasonable model for the relationship, clearly state why you believe this using both a visual analysis and the value of [latex]r[\/latex]. In this case, all analysis indicates that a linear model would fail to make reasonable predictions for this dataset. There is no linear relationship. As you end the activity, take a look back at the objectives and point out the places where they appeared in the questions. ]<\/span><\/p>\n<\/div>\n<p>&nbsp;<\/p>\n","protected":false},"author":428269,"menu_order":4,"template":"","meta":{"_candela_citation":"[]","CANDELA_OUTCOMES_GUID":"","pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-3843","chapter","type-chapter","status-publish","hentry"],"part":4241,"_links":{"self":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters\/3843","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/users\/428269"}],"version-history":[{"count":10,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters\/3843\/revisions"}],"predecessor-version":[{"id":4847,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters\/3843\/revisions\/4847"}],"part":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/parts\/4241"}],"metadata":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters\/3843\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/media?parent=3843"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapter-type?post=3843"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/contributor?post=3843"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/license?post=3843"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}