{"id":5530,"date":"2022-09-21T12:24:07","date_gmt":"2022-09-21T12:24:07","guid":{"rendered":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/?post_type=chapter&#038;p=5530"},"modified":"2022-10-11T20:27:40","modified_gmt":"2022-10-11T20:27:40","slug":"16c-coreq","status":"publish","type":"chapter","link":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/chapter\/16c-coreq\/","title":{"raw":"16C Coreq","rendered":"16C Coreq"},"content":{"raw":"<div id=\"bp-page-1\" class=\"page\" data-page-number=\"1\" data-loaded=\"true\">\r\n<div class=\"textLayer\">In the next preview assignment and in the next class, you will need to use technology to calculate the linear regression equation and use that equation to calculate predicted values of the response variable. You will assess assumptions for regression and interpret confidence intervals.The objective of this analysis is to predict a coffee\u2019s aftertaste based on its acidity. We will use the \u201ccoffeeratings\u201d dataset. These data are originally from the Coffee Quality Database[footnote]coffee-quality-database. (2018, June 16). GitHub. Retrieved from https:\/\/github.com\/jldbc\/coffee-quality-database[\/footnote] compiled by James DeLoux, a data scientist at BuzzFeed. The dataset contains information about the origin, processing, and taste quality for a sample of 1,338 coffees.The taste quality characteristics are on a scale of 0\u201310, as determined by a panel of expert coffee tasters.<\/div>\r\n<div class=\"textLayer\">\u2022acidity: Measure of acidity (a sharp, tangy feeling, like when biting into an orange[footnote]Coffee cupping. (2006, July 26). In Wikipedia.https:\/\/en.wikipedia.org\/wiki\/Coffee_cupping[\/footnote]); higher values correspond to more acidic taste<\/div>\r\n<div class=\"textLayer\">\u2022aftertaste: Measure of taste after you take a sip of the coffee; higher values indicate better quality taste<\/div>\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 1<\/h3>\r\n1) What are the response and explanatory variables? Explain.\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 2<\/h3>\r\n<div class=\"textLayer\">2) Before using linear regression, let\u2019s look at the variable aftertaste. The mean of the sample of aftertaste values is 7.41, and the 95% confidence interval for the mean aftertaste value for all coffees is (7.389, 7.425). Which of the following statements istrue? Select all that apply.<\/div>\r\n<div class=\"textLayer\">a) We are 95% confident that the mean value of aftertaste for all coffees is between 7.389 and 7.425.<\/div>\r\n<div class=\"textLayer\">b) The population mean value of aftertaste is 7.41.<\/div>\r\n<div class=\"textLayer\">c) The sample mean value of aftertaste is 7.41.<\/div>\r\n<div class=\"textLayer\">d) If we take thousands of random samples of 1,338 coffees and calculate a confidence interval, we expect 95% of the intervals will contain the true mean value of aftertaste.<\/div>\r\n<\/div>\r\n<\/div>\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 3<\/h3>\r\n3) What is one reason we may want to use a linear regression model to predict thevalue ofaftertastebased on acidityrather than just using the sample meanfrom Question 1 as our best guess?\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"bp-page-2\" class=\"page\" data-page-number=\"2\" data-loaded=\"true\">\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 4<\/h3>\r\n<div class=\"textLayer\">4) Use spreadsheet DCMP_STAT_16C_Coffee_Ratings to answer the following questions.You will need the DCMP Linear Regressiontoolat https:\/\/dcmathpathways.shinyapps.io\/LinearRegression\/. Make sure to select the appropriate explanatoryvariable (\ud835\udc4b)and responsevariable (\ud835\udc4c).<\/div>\r\n<div class=\"textLayer\">Part A: Make a scatterplot to visualize the relationship between acidityand aftertaste.<\/div>\r\n<div class=\"textLayer\">Part B: Describe the relationship between the two variables.<\/div>\r\n<div class=\"textLayer\">Part C: Based on the scatterplot, is linear regression appropriate to describe the relationship between the variables? Explain.<\/div>\r\n<\/div>\r\n<\/div>\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 5<\/h3>\r\n<div class=\"textLayer\">5) Usethe DCMP Linear Regressiontoolto calculate the linear regression equation.<\/div>\r\n<div class=\"textLayer\">Part A: Write the linear regression equationusing customized variable names.<\/div>\r\n<div class=\"textLayer\">Part B: Interpret the slope in the context of the data.<\/div>\r\n<div class=\"textLayer\">Part C: Is it meaningful to interpret the intercept? If so, interpret the intercept in the context of the data. If not, explain why not.<\/div>\r\n<\/div>\r\n<\/div>\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 6<\/h3>\r\n<div class=\"textLayer\">6) Now let\u2019s use the model for prediction.<\/div>\r\n<div class=\"textLayer\">Part A: Whatis the predicted aftertastevaluefor a coffee withanacidityof 7.25?<\/div>\r\n<div class=\"textLayer\">Part B: How is the aftertastevalueexpected to change iftheaciditydecreases by 0.25 points?<\/div>\r\n<\/div>\r\n<\/div>\r\n<div class=\"textLayer\">\r\n<div class=\"textbox key-takeaways\">\r\n<h3>Question 7<\/h3>\r\n<div id=\"bp-page-2\" class=\"page\" data-page-number=\"2\" data-loaded=\"true\">\r\n<div class=\"textLayer\">7) Let\u2019s conclude by looking at plots of the residuals.<\/div>\r\n<div class=\"textLayer\">Part A: Make a scatterplot of the residuals vs.the predicted values by selecting the Fitted Values &amp; Residual Analysis tab.<\/div>\r\n<\/div>\r\n<div id=\"bp-page-3\" class=\"page\" data-page-number=\"3\" data-loaded=\"true\">\r\n<div class=\"textLayer\">Part B: Based on the plot, is the linear regression equation a good fit for the relationship between acidity and aftertaste? Explain.<\/div>\r\n<div class=\"textLayer\">Part C: Use the linear regression tool (i.e., data analysis tool) to make a histogram of the residuals. Select the option \u201cHistogram\/Boxplot of Residuals.\u201dIf desired, select the option \u201cSuperimpose the Normal Curve\u201d as well.<\/div>\r\n<div class=\"textLayer\">Part D: Describe the distribution of the residuals.<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>","rendered":"<div id=\"bp-page-1\" class=\"page\" data-page-number=\"1\" data-loaded=\"true\">\n<div class=\"textLayer\">In the next preview assignment and in the next class, you will need to use technology to calculate the linear regression equation and use that equation to calculate predicted values of the response variable. You will assess assumptions for regression and interpret confidence intervals.The objective of this analysis is to predict a coffee\u2019s aftertaste based on its acidity. We will use the \u201ccoffeeratings\u201d dataset. These data are originally from the Coffee Quality Database<a class=\"footnote\" title=\"coffee-quality-database. (2018, June 16). GitHub. Retrieved from https:\/\/github.com\/jldbc\/coffee-quality-database\" id=\"return-footnote-5530-1\" href=\"#footnote-5530-1\" aria-label=\"Footnote 1\"><sup class=\"footnote\">[1]<\/sup><\/a> compiled by James DeLoux, a data scientist at BuzzFeed. The dataset contains information about the origin, processing, and taste quality for a sample of 1,338 coffees.The taste quality characteristics are on a scale of 0\u201310, as determined by a panel of expert coffee tasters.<\/div>\n<div class=\"textLayer\">\u2022acidity: Measure of acidity (a sharp, tangy feeling, like when biting into an orange<a class=\"footnote\" title=\"Coffee cupping. (2006, July 26). In Wikipedia.https:\/\/en.wikipedia.org\/wiki\/Coffee_cupping\" id=\"return-footnote-5530-2\" href=\"#footnote-5530-2\" aria-label=\"Footnote 2\"><sup class=\"footnote\">[2]<\/sup><\/a>); higher values correspond to more acidic taste<\/div>\n<div class=\"textLayer\">\u2022aftertaste: Measure of taste after you take a sip of the coffee; higher values indicate better quality taste<\/div>\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 1<\/h3>\n<p>1) What are the response and explanatory variables? Explain.<\/p>\n<\/div>\n<\/div>\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 2<\/h3>\n<div class=\"textLayer\">2) Before using linear regression, let\u2019s look at the variable aftertaste. The mean of the sample of aftertaste values is 7.41, and the 95% confidence interval for the mean aftertaste value for all coffees is (7.389, 7.425). Which of the following statements istrue? Select all that apply.<\/div>\n<div class=\"textLayer\">a) We are 95% confident that the mean value of aftertaste for all coffees is between 7.389 and 7.425.<\/div>\n<div class=\"textLayer\">b) The population mean value of aftertaste is 7.41.<\/div>\n<div class=\"textLayer\">c) The sample mean value of aftertaste is 7.41.<\/div>\n<div class=\"textLayer\">d) If we take thousands of random samples of 1,338 coffees and calculate a confidence interval, we expect 95% of the intervals will contain the true mean value of aftertaste.<\/div>\n<\/div>\n<\/div>\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 3<\/h3>\n<p>3) What is one reason we may want to use a linear regression model to predict thevalue ofaftertastebased on acidityrather than just using the sample meanfrom Question 1 as our best guess?<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"bp-page-2\" class=\"page\" data-page-number=\"2\" data-loaded=\"true\">\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 4<\/h3>\n<div class=\"textLayer\">4) Use spreadsheet DCMP_STAT_16C_Coffee_Ratings to answer the following questions.You will need the DCMP Linear Regressiontoolat https:\/\/dcmathpathways.shinyapps.io\/LinearRegression\/. Make sure to select the appropriate explanatoryvariable (\ud835\udc4b)and responsevariable (\ud835\udc4c).<\/div>\n<div class=\"textLayer\">Part A: Make a scatterplot to visualize the relationship between acidityand aftertaste.<\/div>\n<div class=\"textLayer\">Part B: Describe the relationship between the two variables.<\/div>\n<div class=\"textLayer\">Part C: Based on the scatterplot, is linear regression appropriate to describe the relationship between the variables? Explain.<\/div>\n<\/div>\n<\/div>\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 5<\/h3>\n<div class=\"textLayer\">5) Usethe DCMP Linear Regressiontoolto calculate the linear regression equation.<\/div>\n<div class=\"textLayer\">Part A: Write the linear regression equationusing customized variable names.<\/div>\n<div class=\"textLayer\">Part B: Interpret the slope in the context of the data.<\/div>\n<div class=\"textLayer\">Part C: Is it meaningful to interpret the intercept? If so, interpret the intercept in the context of the data. If not, explain why not.<\/div>\n<\/div>\n<\/div>\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 6<\/h3>\n<div class=\"textLayer\">6) Now let\u2019s use the model for prediction.<\/div>\n<div class=\"textLayer\">Part A: Whatis the predicted aftertastevaluefor a coffee withanacidityof 7.25?<\/div>\n<div class=\"textLayer\">Part B: How is the aftertastevalueexpected to change iftheaciditydecreases by 0.25 points?<\/div>\n<\/div>\n<\/div>\n<div class=\"textLayer\">\n<div class=\"textbox key-takeaways\">\n<h3>Question 7<\/h3>\n<div id=\"bp-page-2\" class=\"page\" data-page-number=\"2\" data-loaded=\"true\">\n<div class=\"textLayer\">7) Let\u2019s conclude by looking at plots of the residuals.<\/div>\n<div class=\"textLayer\">Part A: Make a scatterplot of the residuals vs.the predicted values by selecting the Fitted Values &amp; Residual Analysis tab.<\/div>\n<\/div>\n<div id=\"bp-page-3\" class=\"page\" data-page-number=\"3\" data-loaded=\"true\">\n<div class=\"textLayer\">Part B: Based on the plot, is the linear regression equation a good fit for the relationship between acidity and aftertaste? Explain.<\/div>\n<div class=\"textLayer\">Part C: Use the linear regression tool (i.e., data analysis tool) to make a histogram of the residuals. Select the option \u201cHistogram\/Boxplot of Residuals.\u201dIf desired, select the option \u201cSuperimpose the Normal Curve\u201d as well.<\/div>\n<div class=\"textLayer\">Part D: Describe the distribution of the residuals.<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<hr class=\"before-footnotes clear\" \/><div class=\"footnotes\"><ol><li id=\"footnote-5530-1\">coffee-quality-database. (2018, June 16). GitHub. Retrieved from https:\/\/github.com\/jldbc\/coffee-quality-database <a href=\"#return-footnote-5530-1\" class=\"return-footnote\" aria-label=\"Return to footnote 1\">&crarr;<\/a><\/li><li id=\"footnote-5530-2\">Coffee cupping. (2006, July 26). In Wikipedia.https:\/\/en.wikipedia.org\/wiki\/Coffee_cupping <a href=\"#return-footnote-5530-2\" class=\"return-footnote\" aria-label=\"Return to footnote 2\">&crarr;<\/a><\/li><\/ol><\/div>","protected":false},"author":23592,"menu_order":68,"template":"","meta":{"_candela_citation":"[]","CANDELA_OUTCOMES_GUID":"","pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-5530","chapter","type-chapter","status-publish","hentry"],"part":5514,"_links":{"self":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters\/5530","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/users\/23592"}],"version-history":[{"count":4,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters\/5530\/revisions"}],"predecessor-version":[{"id":5639,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters\/5530\/revisions\/5639"}],"part":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/parts\/5514"}],"metadata":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapters\/5530\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/media?parent=5530"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/pressbooks\/v2\/chapter-type?post=5530"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/contributor?post=5530"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/courses.lumenlearning.com\/lumen-danacenter-statsmockup\/wp-json\/wp\/v2\/license?post=5530"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}