Assignment Questions
In this assignment, you will be analyzing the data collected in the class survey. A copy of this survey is found on the last page of this assignment.
Note: All tables and graphs included in your answers for questions 1 through 7 must conform to the PUBH5018 presentation guidelines. Penalties will be applied where they do not conform.
Questions
1. (2.5 marks)
Using appropriate methods, individually examine the following variables: lang, eyewear, sex, age, employ, breath and sitting to determine if there are any missing or outlier values. If necessary, delete any values as appropriate. For each variable, comment very briefly on what analysis you used, report what you found, any data edits you performed (if any) and why. (Preferably present it as a table; for example, you could use the template below); otherwise it must be a maximum of one sentence per variable).
NB: You should use your cleaned data, i.e. the data with any edits performed, for the remainder of the assignment.
Table 1: Analysis of variables
Variable Analysis used Number missing Outlier/Implausible values description Action taken description
lang
eyewear
sex
age
employ
breath
sitting
2. (3 marks)Calculate a statistical anxiety score (SAS) for each participant using the formula:
πππ₯ππ32
ππ΄π = 10 + πππ₯ππ1 + πππ₯ππ2 −10
Display the distribution of SAS using a boxplot and describe the important features of this distribution with relevant summary statistics.
3. (2 marks)
Long sitting times every day are associated with poorer health outcomes. Group the variable sitting into three categories as follows: Low risk (less than 4 hours sitting); medium risk (at least 4 hours sitting, but less than 8 hours); high risk (at least 8 hours sitting). Produce a frequency table that includes the number and percentage of participants in each category.
4. (2 marks)
Produce the appropriate plot that shows a breakdown of eye wear (eyewear) used by participants.
5. (2 marks)
Produce a histogram of age (age). Write one to three sentences that describe the distribution and provide appropriate summary statistics.
6. (3 marks)
Create a two-way table with lang as the rows and employ as the columns. Within each employment category, include the percentage of participants that do and do not speak a language other than English at home. Summarise the table in 1 to 2 sentences.
7. (5 marks)
Perform a hypothesis test that investigates whether Introductory Biostatistics students have a mean breath holding time (breath) of 36 seconds. To this end, carry out the following steps:
• State the null hypothesis for this hypothesis test
• Carry out a one sample t-test
• Calculate a 95% confidence interval for the mean breath holding time (Note: this is not the same as the confidence interval for the mean difference)
• Write a conclusion (2 – 4 sentences long) that summarises the important results
8. (0.5 marks)
To receive the marks for this question, you need to have created your profile with Examity through Canvas by the assignment due date. To do this, follow the instructions below:
• Log onto the PUBH5018 Canvas site
• Navigate to Modules > Examity Mid-Semester Online Assessment 2020
• Read through the “Examity Registration Instructions for Students - 2020” document
• Create your profile with Examity following the instructions in the above document
CS 340 Milestone One Guidelines and Rubric Overview: For this assignment, you will implement the fundamental operations of create, read, update,
Retail Transaction Programming Project Project Requirements: Develop a program to emulate a purchase transaction at a retail store. This
7COM1028 Secure Systems Programming Referral Coursework: Secure
Create a GUI program that:Accepts the following from a user:Item NameItem QuantityItem PriceAllows the user to create a file to store the sales receip
CS 340 Final Project Guidelines and Rubric Overview The final project will encompass developing a web service using a software stack and impleme