● Q1: How many people reported their general health is excellent?
● Q2. What is the highest value for number of adult women in the household where someone has ever had a stroke? Summarise the value in a variable called max_numwomen.
● Q3: Compute the means and standard deviations for MENTHLTH comparing caregivers who managed personal care such as giving medications, feeding, dressing, or bathing and those who did not. The output should be a dataframe. It should be formatted like this:
CRGVPERS mean_health sd_health
1 5.23 xxx.xx
2 xxx.xx xxx.xx
● Q4: What is the median age when respondents were told they had diabetes for those living in Pennsylvania? Only calculate it for those who gave an age. The value should be called med.diab.age and be in a 1 x 1 dataframe.
● Q5: Predict number of days in the past 30 days mental health was not good from marital status. Assign the summary of the model to Q5.
● Q6: Use summarise to compare the mean number of days in the past 30 days mental health was not good by marital status and assign to Q6 as a dataframe. Round to two decimals. The mean should be called mean_mental. You should be able to confirm the results of Q5 after creating this.
● Q7: Calculate the means and standard deviations of MENTHLTH for those who have had a stroke and those who have not had a stroke only for those who do not have any kind of healthcare coverage. The output should be a dataframe assigned to Q7, and be formatted like this:
xxx mean_mental sd_mental
1 8.89 xxx.xx
2 xxx.xx xxx.xx
● Q8: Run an ANOVA comparing how many times per week respondents took part in the exercise they spent the most time doing in the past month by marital status, and assign the TukeyHSD post-hoc test to Q8. (You may need to look up how to do this in R. Note: The general instructions say to round all output but the TukeyHSD function output is not able to be rounded.)
● Q9: Calculate the variance in number of days a respondent drank alcohol in the past week for each type of physical activity or exercise spent in the past month for men. Note: Pay special attention to how values are coded in the Codebook.
○ Arrange in descending order, and include only the six with the highest variance in drinks.
○ The output should be a 6 X 2 dataframe, rounded to two decimals and look like:
EXRACT11 var_drinks
55 xxx.xx
xxx.xx xxx.xx
xxx.xx xxx.xx
xxx.xx 6.67
xxx.xx xxx.xx
xxx.xx xxx.xx
For the final section, you will choose four variables to explore in ways we have not. Complete the following:
● Q10: Address the values of any variables. For instance, is “none” equal to a value other than 0? Are there extra decimals implied?
● Q11: Remove any outliers. Briefly explain why you chose the method you used. Make sure to comment it out.
● Q12: Complete exploratory analyses doing appropriate visualizations with ggplot2.
● Q13: Run basic descriptive statistics.
● Q14: Finally, run an appropriate regression predicting one of those variables. Identify the best model.
Your answers must be clearly identifiable. Take time to tidy your code once you are finished. The easier it is for us to understand, the more partial credit you could receive.
CS 340 Milestone One Guidelines and Rubric Overview: For this assignment, you will implement the fundamental operations of create, read, update,
Retail Transaction Programming Project Project Requirements: Develop a program to emulate a purchase transaction at a retail store. This
7COM1028 Secure Systems Programming Referral Coursework: Secure
Create a GUI program that:Accepts the following from a user:Item NameItem QuantityItem PriceAllows the user to create a file to store the sales receip
CS 340 Final Project Guidelines and Rubric Overview The final project will encompass developing a web service using a software stack and impleme