In this final project, you will use data collected by the U.S.National Center for Health Statistics through the National Health and Nutrition Examination Survey (NHANES) for2011–2012. The NHANES has been conducted every two years since the early 1960s. The data collected from the survey include demographics, various body and health measurements, and information about various lifestyle choices. The NHANES survey is unique in that the health measurements are collected through physical examinations. Data on all the other variables are self-reported. You will work with a dataset that includes 6,033 adults, age 20 and older. The dataset has a total of 11 variables.The data are contained in spreadsheet DCMP_STAT_15F_nhanes_project_all.
For this project, the dataset has the following variables:
Credit: iStock/sturti
| Variable Name | Description |
| Age | Age in years at screening of participant |
| AttendCollege | Did the participant attend college? (Yes, No) |
| BadPhysHlthDay | Did the participant report any number of days when their physical health was not good during the past 30 days? (Yes, at least 1 day; No, 0 days) |
| BadMentHlthDay | Did the participant report any number of days when their mental health was not good during the past 30 days? (Yes, at least 1 day; No, 0 days) |
| BMI | Body mass index (kg/m2) |
| SleepTrouble | Participant has told a doctor or other health professional that they have trouble sleeping (Yes, No) |
| SleepHrsNight | Number of hours of sleep the participant reported they usually get at night on weekdays or workdays |
| TotChol | Total HDL cholesterol in mmol/L |
| BPSysAve | Average of three systolic blood pressure readings in mmHg |
| HealthGen | Self-reported rating of participant’s health in general (Excellent, Vgood, Good, Fair, Poor) |
| PhysActive | Whether the participant does moderate or vigorous sports, fitness, or recreational activities (Yes, No) |
Discussion about the projectBefore you begin the analysis, we will discuss the data and expectations for the project.
1) Read Parts A and B of the project. Do you have any questions about those two parts? Do you have any concerns or questions about creating graphical displays? Do you have any concerns or questions about calculating confidence intervals or choosing the correct statistical tests?
2) Read Part C of the project. Do you have any questions about Part C? Do you have any concerns orquestions about choosing a quantitative variable and a categorical variable for the analyses?
3) Read Part D of the project. Do you have any questions about Part D? Do you have any concerns or questions about creating the flyer and using at least two of the analyses from Parts A to C?
4) Do you have any questions about the dataset?
5) Do you have any additional questions or comments that you feel would help you be more prepared in completing the project?
Discussion about norms for collaboration
Before you begin the analysis, we will discuss expectations for collaboration and working in groups.
1) Please review the norms for effective collaborations in groups below. Highlight any norm that resonates with you.
•Respect your fellow group members; critique ideas and not people
•Be an active listener; let people finish talking before you talk
•Help others in your group and ask for help when you need it
•Do your fair share of the work as agreed upon by the group
•Be prepared and on time for any meetings with your group
2) Is there a norm or aspect of working collaboratively in a group that you feel is missing from the norms? If so, what is it?
3) If a student in your group breaks one of the norms for effective collaborations, how do you think your group should address it?
Discussion about template and grading expectations
One objective of this project is to develop your statistical inference skills and your ability to write interpretations of statistical analyses in a way that can be understood by a general audience. Before we begin the analysis, we are going to discuss the expectations for grading the project based on the rubric.
1) Read Parts A to C (four separate sections) of the grading rubric. What are the factors that are needed for Part C of the project to receive full credit for the section with a categorical and quantitative variable?
2) Read Part D of the grading rubric. What are the main aspects of the flyer that need to be included in order to receive full credit?
3) What part of the grading rubric requires that the paper is neatly prepared with correct spelling and grammar?
4) Do you have any questions about the grading rubric or expectations for the project?
Final Project
Your college is hosting a Health Day fair, and you have reserved a booth to share your findings. In Parts A to C of this project, you will analyze data to learn about the relationship between physical health and other health and lifestyle factors. After you complete the analyses, you decide to create a flyer to display at your booth. An infographic or flyer is “a collection of imagery, charts, and minimal text that gives an easy-to-understand overview of a topic.”[1]
Part A-Importance of Sleep: The question we are interested in asking is, “Is there a difference in the mean sleep hours reported on a weekday between participants who reported no days when their physical health was not good and participants who did report days when their physical health was not good?”
1) Use spreadsheet DCMP_STAT_15F_nhanes_project_all to create a graphical display appropriate for answering the question and interpret the graph in one to two sentences.
2) Calculate a confidence interval for the mean difference in sleep hours between the two groups of participants. Interpret the confidence interval in the context of the data, and state your conclusion based on the interval.
Part B-Importance of Mental Health: Next, we’d like to explore if there is a difference in the proportion of participants who reported having at least one day when their mental health was not good based on whether or not they reported having at least one day when their physical health was not good.
1) First, create a contingency table and calculate the appropriate summary statistics to explore the relationship above. Then, create a graphical display to visualize this relationship. (Note: Use the data from the contingency table to create the graphical display in the data analysis tool.)
2) Write two to three sentences interpreting the graphical display and summary statistics.
3) Use an appropriate statistical test to investigate the research question of interest. In your response, write the hypotheses in words and mathematical notation, display the output from the test, and state your conclusion in the context of the data.
Part C-Importance of Other Lifestyle Factors:Let’s continue investigating questions around physical health and other lifestyle and health factors. Consider the following variables: Age, AttendCollege, BMI,and SleepTrouble.
1) Choose a quantitative variable from the previous list. Create a graphical display to visualize the relationship between this variable and physical health, and then interpret the graph in one to two sentences. Then, use statistical inference to further investigate the two variables. Include all relevant output and write your interpretations and conclusions in the context of the data.
2) Choose a categorical variable from the previous list. Create a contingency table and calculate the appropriate summary statistics. Also, create a graphical display to visualize the relationship between this variable and physical health, and then interpret the graph in one to two sentences. Then, use statistical inference to further investigate the two variables.Include all relevant output and write your interpretations and conclusions in the context of the data. (Note: Use the data from the contingency table to create the graphical display in the data analysis tool.)
3) What are one to two potential limitations to the analysis in Part C? What are one to two additional questions that you would explore as next steps in your analysis?
Part D-Share the Results: Create a one-page flyer to share the results from at least two analyses from Parts A to C. Here are more considerations as you make the flyer:
•You are welcome to use the graphical displays you created in Parts A to C or create new ones. If you make new graphical displays, the underlying data and conclusions should not change.
•Your flyer should include a brief summary of the limitations and additional questions from Part C.
- Nediger, M. (2021, October 19). What is an infographic? Examples, templates, & design tips. Venngage. https://venngage.com/blog/what-is-an-infographic/ ↵