{"id":5547,"date":"2022-09-21T17:28:50","date_gmt":"2022-09-21T17:28:50","guid":{"rendered":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/?post_type=chapter&#038;p=5547"},"modified":"2022-10-17T18:59:28","modified_gmt":"2022-10-17T18:59:28","slug":"17a-inclass","status":"publish","type":"chapter","link":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/chapter\/17a-inclass\/","title":{"raw":"17A InClass","rendered":"17A InClass"},"content":{"raw":"<div id=\"bp-page-1\" class=\"page\" data-page-number=\"1\" data-loaded=\"true\">\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 1<\/h3>\r\n1) What factors do you think determine high school students\u2019 science test scores?\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"textLayer\">\r\n\r\nThe dataset that we will be using in this in-class activity is called \u201cHigh School and Beyond\u201d and contains information about high school student achievement scores on math, science, reading, writing, and social studies tests. The dataset contains information about 200 high school students and 10 variables for each student. The data collected about each student includes the following: identification number, whether the student is male or female, race, socio-economic status, school type, program type, and scores from tests of reading, writing, math, science, and social studies. Descriptions of the variables are as follows:\r\n\r\n[caption id=\"attachment_2173\" align=\"alignnone\" width=\"944\"]<img class=\"wp-image-2173\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27014018\/Picture1141-300x200.jpg\" alt=\"A person in a wheelchair and wearing a disposable mask sitting at an outdoor table working on a laptop.\" width=\"944\" height=\"629\" \/> Credit: iStock\/Courtney Hale[\/caption]\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"bp-page-2\" class=\"page\" data-page-number=\"2\" data-loaded=\"true\">\r\n<div class=\"textLayer\">\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td><strong>Variable name<\/strong><\/td>\r\n<td><strong>Definition<\/strong><\/td>\r\n<\/tr>\r\n<tr>\r\n<td><em>id<\/em><\/td>\r\n<td>Identification number of the student<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><em>female<\/em><\/td>\r\n<td>Gender of the student (0 = male, 1 = female)<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><em>race<\/em><\/td>\r\n<td>Ethnic background of the student (1 = Hispanic, 2 = Asian, 3 = Black, 4 = White)<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><em>ses<\/em><\/td>\r\n<td>Socio-economic status of the student (1 = low, 2 = medium, 3 = high)<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><em>schtyp<\/em><\/td>\r\n<td>School type (1 = public, 2 = private)<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><em>prog<\/em><\/td>\r\n<td>Program type (1 = general, 2 = academic preparatory, 3 = vocational\/technical)<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><em>read<\/em><\/td>\r\n<td>Score from test of reading<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><em>write<\/em><\/td>\r\n<td>Score from test of writing<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><em>math<\/em><\/td>\r\n<td>Score from test of math<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><em>science<\/em><\/td>\r\n<td>Score from test of science<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><em>socst<\/em><\/td>\r\n<td>Score from test of social studies<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<div class=\"textLayer\">Questions 2\u20134: We are interested in answering the question,\u201cIs there a relationship between science scores for high school students and math and reading scores?\u201d<\/div>\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 2<\/h3>\r\n2) Based on the question, what is the response variable? Identify the variable name from the dataset.\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 3<\/h3>\r\n3) What are the explanatory variables? Identify the variable names from the dataset.\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 4<\/h3>\r\n4) In simple linear regression, you have one response variable and one explanatory variable. Explain what the purpose of the simple linear regression model is.\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 5<\/h3>\r\n5) Using the following scatterplot of math and science scores, what do you notice about the relationship?\r\n\r\n<img class=\"alignnone wp-image-2174\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27014023\/Picture115-300x212.png\" alt=\"A scatterplot titled \u201cScatterplot of Math and Science Scores for High School Students.\u201d The x-axis is labeled \u201cmath test score\u201d and the y-axis is labeled \u201cscience test score.\u201d Points with higher x-values also tend to have higher y-values, with moderate consistency.\" width=\"1250\" height=\"883\" \/>\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 6<\/h3>\r\n6) Using the scatterplot of reading and science scores, what do you notice about the relationship?\r\n\r\n<img class=\"alignnone wp-image-2175\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27014027\/Picture116-300x211.png\" alt=\"A scatterplot titled \u201cScatterplot of Reading and Science Scores for High School Students.\u201d The x-axis is labeled \u201creading test score\u201d and the y-axis is labeled \u201cscience test score.\u201d Points with higher x-values also tend to have higher y-values, with moderate consistency.\" width=\"938\" height=\"660\" \/>\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"bp-page-3\" class=\"page\" data-page-number=\"3\" data-loaded=\"true\">\r\n<div class=\"ba-Layer ba-Layer--highlight\" data-resin-fileid=\"910629704200\" data-resin-iscurrent=\"true\" data-resin-feature=\"annotations\" data-testid=\"ba-Layer--highlight\"><span style=\"font-size: 1em;\">A linear regression model with two or more explanatory variables is called a multiple linear regression model. Since there is more than one explanatory variable, the model is no longer a line. In fact, we can include \ud835\udc5d explanatory variables in our model. The equation for the estimated model that uses \ud835\udc5d variables is<\/span><\/div>\r\n<div class=\"ba-Layer ba-Layer--highlight\" data-resin-fileid=\"910629704200\" data-resin-iscurrent=\"true\" data-resin-feature=\"annotations\" data-testid=\"ba-Layer--highlight\"><span style=\"font-size: 1em;\"> \ud835\udc66\u0302=\ud835\udc4e+\ud835\udc4f<sub>1<\/sub>\u2219\ud835\udc65<sub>1<\/sub>+\ud835\udc4f<sub>2<\/sub>\u2219\ud835\udc65<sub>2<\/sub>+\u22ef+\ud835\udc4f<sub>\ud835\udc5d<\/sub>\u2219\ud835\udc65<sub>\ud835\udc5d <\/sub><\/span><\/div>\r\n<div class=\"ba-Layer ba-Layer--highlight\" data-resin-fileid=\"910629704200\" data-resin-iscurrent=\"true\" data-resin-feature=\"annotations\" data-testid=\"ba-Layer--highlight\"><span style=\"font-size: 1em;\">where \ud835\udc4f<sub>1<\/sub>, \ud835\udc4f<sub>2<\/sub>, ..., \ud835\udc4f<sub>\ud835\udc5d<\/sub> are the regression coefficients for explanatory variables \ud835\udc65<sub>1<\/sub>, \ud835\udc65<sub>2<\/sub>,..., \ud835\udc65<sub>\ud835\udc5d<\/sub>, respectively. In multiple linear regression, \ud835\udc4f<sub>1<\/sub>, \ud835\udc4f<sub>2<\/sub>, ..., \ud835\udc4f<sub>\ud835\udc5d<\/sub> are called partial slopes.<\/span><\/div>\r\n<\/div>\r\n<div id=\"bp-page-4\" class=\"page\" data-page-number=\"4\" data-loaded=\"true\">\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 7<\/h3>\r\n<div class=\"textLayer\">\r\n\r\n7) Using the following results, write the multiple linear regression equation for predicting science test scores using the explanatory variables of math and reading scores. Round the estimates to two decimal places.\r\n<table>\r\n<tbody>\r\n<tr>\r\n<td><\/td>\r\n<td><strong>Estimate<\/strong><\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong>Intercept<\/strong><\/td>\r\n<td>11.61550<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong><em>math<\/em><\/strong><\/td>\r\n<td>0.4172<\/td>\r\n<\/tr>\r\n<tr>\r\n<td><strong><em>read<\/em><\/strong><\/td>\r\n<td>0.36542<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div class=\"textLayer\">We can interpret the regression coefficients for each explanatory variable in the model in terms of the relationship with the response variable. The explanation is very similar to what we have seen in simple linear regression models. However, since it is a partial slope, we have to make sure that we hold any other explanatory variables constant in our interpretation. For example, for the following regression equation,<\/div>\r\n<div class=\"textLayer\">\ud835\udc66\u0302=\ud835\udc4e+\ud835\udc4f<sub>1<\/sub>\u2219\ud835\udc65<sub>1<\/sub>+\ud835\udc4f<sub>2<\/sub>\u2219\ud835\udc65<sub>2<\/sub>+\u22ef+\ud835\udc4f<sub>\ud835\udc5d<\/sub>\u2219\ud835\udc65<sub>\ud835\udc5d<\/sub><\/div>\r\n<div class=\"textLayer\">the partial slope, \ud835\udc4f<sub>1<\/sub>, represents the expected change in the response variable, \ud835\udc66, for every one unit increase in \ud835\udc65<sub>1<\/sub>, holding explanatory variables \ud835\udc65<sub>1<\/sub>, \ud835\udc65<sub>2<\/sub>, ..., \ud835\udc65<sub>\ud835\udc5d<\/sub> constant.<\/div>\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 8<\/h3>\r\n8) What is the interpretation of the coefficient for the explanatory variable of mathscoresin the context of the dataset?\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"bp-page-5\" class=\"page\" data-page-number=\"5\" data-loaded=\"true\">\r\n<div class=\"textLayer\">The coefficient of determination, \ud835\udc452, is used to determine the percentage of variability in the response variable that is accounted for by the explanatory variables. In this activity, we will call the value the unadjusted \ud835\udc452. In simple linear regression, we would interpret the \ud835\udc452 value as the percentage of the variation in the response variable that can be explained by the linear relationship with the explanatory variable. For multiple linear regression, the interpretation is similar, but now the variation in the response variable is explained by the linear relationship with multiple explanatory variables.<\/div>\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 9<\/h3>\r\n9) The unadjusted \ud835\udc452 value for this model is 0.4782. Interpret the unadjusted value of\ud835\udc452for this model.\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 10<\/h3>\r\n10) The simple linear regression model with math alone has an \ud835\udc452value of 39.8%.The simple linear regression model with reading alone has an \ud835\udc452 value of 39.7%. Explain why the total amount of variability explained by the model is not: 39.8% +39.7% =79.5%\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"textLayer\">We can assess whether or not it is reasonable to fit a linear regression model using residual plots, similar to simple linear regression. In multiple linear regression, the y-axis has the residual values and the x-axis has the explanatory variables and\/or the fitted values. For a multiple linear regression model, you create a residual plot for each continuous explanatory variable, as well as the fitted value.We would expect to see the residual values appear randomly scattered across the x-values with no clear patterns(e.g., residual plots that display a curvature violate the linearity condition). Residual plots that increase or decrease in magnitude (distance from zero) violate the constant variance condition. The residual plot of the residuals vs. predicted values account for all the variables in the model. Residual plots of the residuals vs. individual exploratory variables allow us to identify a potential source of a violation. The normality condition is beyond the scope of this course.<\/div>\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 11<\/h3>\r\n11) Looking atthethreeresidual plotsthat follow,is it reasonable to fit a linear regression model to thesedata? Explain.\r\n\r\n<img src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27014032\/Picture117-300x239.png\" alt=\"A residual plot titled \u201cResiduals vs. Fitted,\u201d with \u201cFitted Value\u201d on the x-axis and \u201cResidual\u201d on the y-axis. The points appear to have no pattern.\" \/>\r\n\r\n<img src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27014037\/Picture118-300x212.png\" alt=\"A residual plot titled \u201cResiduals vs. Math Test Scores,\u201d with \u201cMath Test Scores\u201d on the x-axis and \u201cResiduals\u201d on the y-axis. The points appear to have no pattern.\" \/>\r\n\r\n<img src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27014043\/Picture119-300x212.png\" alt=\"A residual plot titled \u201cResiduals vs. Reading Test Scores,\u201d with \u201cReading Test Scores\u201d on the x-axis and \u201cResidual\u201d on the y-axis. The points appear to have no pattern.\" \/>\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>","rendered":"<div id=\"bp-page-1\" class=\"page\" data-page-number=\"1\" data-loaded=\"true\">\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 1<\/h3>\n<p>1) What factors do you think determine high school students\u2019 science test scores?<\/p>\n<\/div>\n<\/div>\n<div class=\"textLayer\">\n<p>The dataset that we will be using in this in-class activity is called \u201cHigh School and Beyond\u201d and contains information about high school student achievement scores on math, science, reading, writing, and social studies tests. The dataset contains information about 200 high school students and 10 variables for each student. The data collected about each student includes the following: identification number, whether the student is male or female, race, socio-economic status, school type, program type, and scores from tests of reading, writing, math, science, and social studies. Descriptions of the variables are as follows:<\/p>\n<div id=\"attachment_2173\" style=\"width: 954px\" class=\"wp-caption alignnone\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-2173\" class=\"wp-image-2173\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27014018\/Picture1141-300x200.jpg\" alt=\"A person in a wheelchair and wearing a disposable mask sitting at an outdoor table working on a laptop.\" width=\"944\" height=\"629\" \/><\/p>\n<p id=\"caption-attachment-2173\" class=\"wp-caption-text\">Credit: iStock\/Courtney Hale<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"bp-page-2\" class=\"page\" data-page-number=\"2\" data-loaded=\"true\">\n<div class=\"textLayer\">\n<table>\n<tbody>\n<tr>\n<td><strong>Variable name<\/strong><\/td>\n<td><strong>Definition<\/strong><\/td>\n<\/tr>\n<tr>\n<td><em>id<\/em><\/td>\n<td>Identification number of the student<\/td>\n<\/tr>\n<tr>\n<td><em>female<\/em><\/td>\n<td>Gender of the student (0 = male, 1 = female)<\/td>\n<\/tr>\n<tr>\n<td><em>race<\/em><\/td>\n<td>Ethnic background of the student (1 = Hispanic, 2 = Asian, 3 = Black, 4 = White)<\/td>\n<\/tr>\n<tr>\n<td><em>ses<\/em><\/td>\n<td>Socio-economic status of the student (1 = low, 2 = medium, 3 = high)<\/td>\n<\/tr>\n<tr>\n<td><em>schtyp<\/em><\/td>\n<td>School type (1 = public, 2 = private)<\/td>\n<\/tr>\n<tr>\n<td><em>prog<\/em><\/td>\n<td>Program type (1 = general, 2 = academic preparatory, 3 = vocational\/technical)<\/td>\n<\/tr>\n<tr>\n<td><em>read<\/em><\/td>\n<td>Score from test of reading<\/td>\n<\/tr>\n<tr>\n<td><em>write<\/em><\/td>\n<td>Score from test of writing<\/td>\n<\/tr>\n<tr>\n<td><em>math<\/em><\/td>\n<td>Score from test of math<\/td>\n<\/tr>\n<tr>\n<td><em>science<\/em><\/td>\n<td>Score from test of science<\/td>\n<\/tr>\n<tr>\n<td><em>socst<\/em><\/td>\n<td>Score from test of social studies<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<div class=\"textLayer\">Questions 2\u20134: We are interested in answering the question,\u201cIs there a relationship between science scores for high school students and math and reading scores?\u201d<\/div>\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 2<\/h3>\n<p>2) Based on the question, what is the response variable? Identify the variable name from the dataset.<\/p>\n<\/div>\n<\/div>\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 3<\/h3>\n<p>3) What are the explanatory variables? Identify the variable names from the dataset.<\/p>\n<\/div>\n<\/div>\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 4<\/h3>\n<p>4) In simple linear regression, you have one response variable and one explanatory variable. Explain what the purpose of the simple linear regression model is.<\/p>\n<\/div>\n<\/div>\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 5<\/h3>\n<p>5) Using the following scatterplot of math and science scores, what do you notice about the relationship?<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-2174\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27014023\/Picture115-300x212.png\" alt=\"A scatterplot titled \u201cScatterplot of Math and Science Scores for High School Students.\u201d The x-axis is labeled \u201cmath test score\u201d and the y-axis is labeled \u201cscience test score.\u201d Points with higher x-values also tend to have higher y-values, with moderate consistency.\" width=\"1250\" height=\"883\" \/><\/p>\n<\/div>\n<\/div>\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 6<\/h3>\n<p>6) Using the scatterplot of reading and science scores, what do you notice about the relationship?<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-2175\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27014027\/Picture116-300x211.png\" alt=\"A scatterplot titled \u201cScatterplot of Reading and Science Scores for High School Students.\u201d The x-axis is labeled \u201creading test score\u201d and the y-axis is labeled \u201cscience test score.\u201d Points with higher x-values also tend to have higher y-values, with moderate consistency.\" width=\"938\" height=\"660\" \/><\/p>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"bp-page-3\" class=\"page\" data-page-number=\"3\" data-loaded=\"true\">\n<div class=\"ba-Layer ba-Layer--highlight\" data-resin-fileid=\"910629704200\" data-resin-iscurrent=\"true\" data-resin-feature=\"annotations\" data-testid=\"ba-Layer--highlight\"><span style=\"font-size: 1em;\">A linear regression model with two or more explanatory variables is called a multiple linear regression model. Since there is more than one explanatory variable, the model is no longer a line. In fact, we can include \ud835\udc5d explanatory variables in our model. The equation for the estimated model that uses \ud835\udc5d variables is<\/span><\/div>\n<div class=\"ba-Layer ba-Layer--highlight\" data-resin-fileid=\"910629704200\" data-resin-iscurrent=\"true\" data-resin-feature=\"annotations\" data-testid=\"ba-Layer--highlight\"><span style=\"font-size: 1em;\"> \ud835\udc66\u0302=\ud835\udc4e+\ud835\udc4f<sub>1<\/sub>\u2219\ud835\udc65<sub>1<\/sub>+\ud835\udc4f<sub>2<\/sub>\u2219\ud835\udc65<sub>2<\/sub>+\u22ef+\ud835\udc4f<sub>\ud835\udc5d<\/sub>\u2219\ud835\udc65<sub>\ud835\udc5d <\/sub><\/span><\/div>\n<div class=\"ba-Layer ba-Layer--highlight\" data-resin-fileid=\"910629704200\" data-resin-iscurrent=\"true\" data-resin-feature=\"annotations\" data-testid=\"ba-Layer--highlight\"><span style=\"font-size: 1em;\">where \ud835\udc4f<sub>1<\/sub>, \ud835\udc4f<sub>2<\/sub>, &#8230;, \ud835\udc4f<sub>\ud835\udc5d<\/sub> are the regression coefficients for explanatory variables \ud835\udc65<sub>1<\/sub>, \ud835\udc65<sub>2<\/sub>,&#8230;, \ud835\udc65<sub>\ud835\udc5d<\/sub>, respectively. In multiple linear regression, \ud835\udc4f<sub>1<\/sub>, \ud835\udc4f<sub>2<\/sub>, &#8230;, \ud835\udc4f<sub>\ud835\udc5d<\/sub> are called partial slopes.<\/span><\/div>\n<\/div>\n<div id=\"bp-page-4\" class=\"page\" data-page-number=\"4\" data-loaded=\"true\">\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 7<\/h3>\n<div class=\"textLayer\">\n<p>7) Using the following results, write the multiple linear regression equation for predicting science test scores using the explanatory variables of math and reading scores. Round the estimates to two decimal places.<\/p>\n<table>\n<tbody>\n<tr>\n<td><\/td>\n<td><strong>Estimate<\/strong><\/td>\n<\/tr>\n<tr>\n<td><strong>Intercept<\/strong><\/td>\n<td>11.61550<\/td>\n<\/tr>\n<tr>\n<td><strong><em>math<\/em><\/strong><\/td>\n<td>0.4172<\/td>\n<\/tr>\n<tr>\n<td><strong><em>read<\/em><\/strong><\/td>\n<td>0.36542<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<\/div>\n<div class=\"textLayer\">We can interpret the regression coefficients for each explanatory variable in the model in terms of the relationship with the response variable. The explanation is very similar to what we have seen in simple linear regression models. However, since it is a partial slope, we have to make sure that we hold any other explanatory variables constant in our interpretation. For example, for the following regression equation,<\/div>\n<div class=\"textLayer\">\ud835\udc66\u0302=\ud835\udc4e+\ud835\udc4f<sub>1<\/sub>\u2219\ud835\udc65<sub>1<\/sub>+\ud835\udc4f<sub>2<\/sub>\u2219\ud835\udc65<sub>2<\/sub>+\u22ef+\ud835\udc4f<sub>\ud835\udc5d<\/sub>\u2219\ud835\udc65<sub>\ud835\udc5d<\/sub><\/div>\n<div class=\"textLayer\">the partial slope, \ud835\udc4f<sub>1<\/sub>, represents the expected change in the response variable, \ud835\udc66, for every one unit increase in \ud835\udc65<sub>1<\/sub>, holding explanatory variables \ud835\udc65<sub>1<\/sub>, \ud835\udc65<sub>2<\/sub>, &#8230;, \ud835\udc65<sub>\ud835\udc5d<\/sub> constant.<\/div>\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 8<\/h3>\n<p>8) What is the interpretation of the coefficient for the explanatory variable of mathscoresin the context of the dataset?<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"bp-page-5\" class=\"page\" data-page-number=\"5\" data-loaded=\"true\">\n<div class=\"textLayer\">The coefficient of determination, \ud835\udc452, is used to determine the percentage of variability in the response variable that is accounted for by the explanatory variables. In this activity, we will call the value the unadjusted \ud835\udc452. In simple linear regression, we would interpret the \ud835\udc452 value as the percentage of the variation in the response variable that can be explained by the linear relationship with the explanatory variable. For multiple linear regression, the interpretation is similar, but now the variation in the response variable is explained by the linear relationship with multiple explanatory variables.<\/div>\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 9<\/h3>\n<p>9) The unadjusted \ud835\udc452 value for this model is 0.4782. Interpret the unadjusted value of\ud835\udc452for this model.<\/p>\n<\/div>\n<\/div>\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 10<\/h3>\n<p>10) The simple linear regression model with math alone has an \ud835\udc452value of 39.8%.The simple linear regression model with reading alone has an \ud835\udc452 value of 39.7%. Explain why the total amount of variability explained by the model is not: 39.8% +39.7% =79.5%<\/p>\n<\/div>\n<\/div>\n<div class=\"textLayer\">We can assess whether or not it is reasonable to fit a linear regression model using residual plots, similar to simple linear regression. In multiple linear regression, the y-axis has the residual values and the x-axis has the explanatory variables and\/or the fitted values. For a multiple linear regression model, you create a residual plot for each continuous explanatory variable, as well as the fitted value.We would expect to see the residual values appear randomly scattered across the x-values with no clear patterns(e.g., residual plots that display a curvature violate the linearity condition). Residual plots that increase or decrease in magnitude (distance from zero) violate the constant variance condition. The residual plot of the residuals vs. predicted values account for all the variables in the model. Residual plots of the residuals vs. individual exploratory variables allow us to identify a potential source of a violation. The normality condition is beyond the scope of this course.<\/div>\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 11<\/h3>\n<p>11) Looking atthethreeresidual plotsthat follow,is it reasonable to fit a linear regression model to thesedata? Explain.<\/p>\n<p><img decoding=\"async\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27014032\/Picture117-300x239.png\" alt=\"A residual plot titled \u201cResiduals vs. Fitted,\u201d with \u201cFitted Value\u201d on the x-axis and \u201cResidual\u201d on the y-axis. The points appear to have no pattern.\" \/><\/p>\n<p><img decoding=\"async\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27014037\/Picture118-300x212.png\" alt=\"A residual plot titled \u201cResiduals vs. Math Test Scores,\u201d with \u201cMath Test Scores\u201d on the x-axis and \u201cResiduals\u201d on the y-axis. The points appear to have no pattern.\" \/><\/p>\n<p><img decoding=\"async\" src=\"https:\/\/s3-us-west-2.amazonaws.com\/courses-images\/wp-content\/uploads\/sites\/5738\/2022\/01\/27014043\/Picture119-300x212.png\" alt=\"A residual plot titled \u201cResiduals vs. Reading Test Scores,\u201d with \u201cReading Test Scores\u201d on the x-axis and \u201cResidual\u201d on the y-axis. The points appear to have no pattern.\" \/><\/p>\n<\/div>\n<\/div>\n<\/div>\n","protected":false},"author":23592,"menu_order":68,"template":"","meta":{"_candela_citation":"[]","CANDELA_OUTCOMES_GUID":"","pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-5547","chapter","type-chapter","status-publish","hentry"],"part":5543,"_links":{"self":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters\/5547","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/users\/23592"}],"version-history":[{"count":4,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters\/5547\/revisions"}],"predecessor-version":[{"id":5652,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters\/5547\/revisions\/5652"}],"part":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/parts\/5543"}],"metadata":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters\/5547\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/media?parent=5547"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapter-type?post=5547"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/contributor?post=5547"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/license?post=5547"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}