R Programming
Randomly select 500 observations from the data. For this selection use your student ID as the seed

INSTRUCTIONS TO CANDIDATES
ANSWER ALL QUESTIONS

The final project will be done individually and must be typed and submitted by the stated deadline. The project needs to fulfill the following criteria:

To submit your results, you will be required to prepare a 5-minute presentation that you will need to record (using your computer, phone, etc.). You will be required to display your T-card alongside your face at the beginning of your video to verify your identity.

You will be required to submit the R code that you have created in a separate file. This helps us to check the reproducibility of your code.

You will need to display the results of your project in a logical way using slides (e.g. PowerPoint, LaTeX, R Markdown or other) and record yourself discussing these results, with a focus on why you chose to do certain things and on interpreting your results for non-statisticians.

A rubric will be provided within this week.

Presentations should be submitted on time (i.e. by the deadline).

There are no make-up final projects. A missed final project will be given a grade of 0.

For this problem you need to load the NHANES dataset using the following commands:

## If the package is not already installed then use
## install.packages('NHANES') ; install.packages('tidyverse')
library(tidyverse)
library(NHANES)

small.nhanes <- na.omit(NHANES[NHANES$SurveyYr == "2011_12" & NHANES$Age > 17,
                               c(1,3,4,8:11,13,17,20,21,25,46,50,51,52,61)])
small.nhanes <- as.data.frame(small.nhanes %>% group_by(ID) %>% filter(row_number() == 1))
nrow(small.nhanes)

## Checking whether there are any ID that was repeated. If not
## then length(unique(small.nhanes$ID)) and nrow(small.nhanes) are same
length(unique(small.nhanes$ID))

This is data collected by the US National Center for Health Statistics (NCHS). To check the variable descriptions, please type ?NHANES in R. The preceding code creates a small subset of the original NHANES dataset. The original dataset has 76 variables. The small.nhanes dataset has 17 variables. We have only selected data from people with age > 17 years.

With this dataset, answer the following questions. Randomly select 500 observations from the data. For this selection use your student ID as the seed (you can follow the next chunk of code for this). This is the training set. The rest of the data will be used as a test set. The test set should not be used for model fitting or validation at any point during the analysis of the project.

## Create training and test set ##
set.seed(1002656486)
train <- small.nhanes[sample(seq_len(nrow(small.nhanes)), size = 500),]
nrow(train)
length(which(small.nhanes$ID %in% train$ID))
test <- small.nhanes[!small.nhanes$ID %in% train$ID,]
nrow(test)

The combined systolic blood pressure reading (BPSysAve) is our outcome of interest. Every variable other than the ID can be considered as a predictor. We are mainly interested in the effect of smoking (SmokeNow) on the combined systolic blood pressure reading. However, we are also interested in predicting the combined systolic blood pressure reading and in identifying which variables are best for that prediction. Based on the data analysis techniques you learned in this course, perform a complete analysis of the dataset. Your analysis should include (but is not limited to):

Model Diagnostics

Checking for the variance inflation factor (VIF)

Variable selection

Shrinkage methods

Model Validation

Checking the prediction error on the test set after applying various model selection techniques

After selecting the best model interpret and explain the parameter estimates

Conclude on the effect of predictors on the combined systolic blood pressure reading

However, you have to justify the aforementioned methods and use them accurately.
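The checklist above can be sketched end to end in base R. This is a minimal sketch under stated assumptions, not a reference solution: it runs on simulated stand-in data (the columns bp, age, bmi, weight, smoke, and noise are hypothetical, not NHANES variables), computes VIFs by hand instead of through a package, uses stepwise AIC/BIC via step() as one possible selection technique, and fits ridge regression in closed form as one example of a shrinkage method. In the project itself, train, test, and BPSysAve would take the place of the simulated objects.

```r
## Stand-in data (simulated, NOT NHANES); all column names are hypothetical
set.seed(1)
n      <- 700
age    <- runif(n, 18, 80)
bmi    <- rnorm(n, 28, 5)
weight <- 2.9 * bmi + rnorm(n, 0, 4)   # deliberately collinear with bmi
smoke  <- factor(sample(c("No", "Yes"), n, replace = TRUE))
noise  <- rnorm(n)                     # pure-noise predictor
bp     <- 95 + 0.4 * age + 0.5 * bmi + 2 * (smoke == "Yes") + rnorm(n, 0, 10)
dat    <- data.frame(bp, age, bmi, weight, smoke, noise)

idx   <- sample(seq_len(n), size = 500)
train <- dat[idx, ]
test  <- dat[-idx, ]                   # held out until the final comparison

## Full model; plot(full) would show the standard diagnostic plots
full <- lm(bp ~ ., data = train)

## VIF by hand: VIF_j = 1 / (1 - R_j^2), with R_j^2 from regressing
## design column j on all other design columns
X   <- model.matrix(full)[, -1]
vif <- sapply(colnames(X), function(j) {
  r2 <- summary(lm(X[, j] ~ X[, colnames(X) != j]))$r.squared
  1 / (1 - r2)
})
print(round(vif, 2))

## Variable selection: stepwise search with AIC and BIC penalties
sel_aic <- step(full, direction = "both", trace = 0)
sel_bic <- step(full, direction = "both", trace = 0, k = log(nrow(train)))

## Shrinkage: closed-form ridge on standardized predictors
ridge_fit <- function(X, y, lambda) {
  mu <- colMeans(X)
  s  <- apply(X, 2, sd)
  Z  <- scale(X, center = mu, scale = s)
  b  <- solve(crossprod(Z) + lambda * diag(ncol(Z)), crossprod(Z, y - mean(y)))
  list(b = drop(b), mu = mu, s = s, y0 = mean(y))
}
ridge_pred <- function(fit, X) {
  drop(fit$y0 + scale(X, center = fit$mu, scale = fit$s) %*% fit$b)
}

## Choose lambda by 5-fold cross-validation inside the training set only
ytr     <- train$bp
folds   <- sample(rep(1:5, length.out = nrow(X)))
lambdas <- 10^seq(-2, 3, length.out = 30)
cv_mse  <- sapply(lambdas, function(l) {
  mean(sapply(1:5, function(k) {
    fit <- ridge_fit(X[folds != k, ], ytr[folds != k], l)
    mean((ytr[folds == k] - ridge_pred(fit, X[folds == k, ]))^2)
  }))
})
best_lambda <- lambdas[which.min(cv_mse)]
ridge       <- ridge_fit(X, ytr, best_lambda)

## Prediction error on the test set, computed once for each candidate
rmse <- function(obs, pred) sqrt(mean((obs - pred)^2))
Xte  <- model.matrix(bp ~ ., data = test)[, -1]
test_rmse <- c(
  full  = rmse(test$bp, predict(full, newdata = test)),
  aic   = rmse(test$bp, predict(sel_aic, newdata = test)),
  bic   = rmse(test$bp, predict(sel_bic, newdata = test)),
  ridge = rmse(test$bp, ridge_pred(ridge, Xte))
)
print(round(test_rmse, 2))

## Parameter estimates and confidence intervals for one selected model
print(coef(summary(sel_bic)))
print(confint(sel_bic))
```

The by-hand VIF is reported per design-matrix column, so a factor with more than two levels gets one value per dummy variable. The ridge coefficients in ridge$b are on the standardized scale and are not directly comparable to the lm() estimates.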

The final project will be submitted as a presentation. To structure your presentation, please present in the following order:

Introduction section: where you introduce the purpose and relevance of the project. You can also include some literature review on the NHANES dataset if applicable and if you have some time.

Methods section: Please describe and explain the methods, tools and techniques used to arrive at your final model here. You need to show some exploratory data analysis (graphs and tables).

Results section: here you present a description of your study sample, important results that led you to make crucial decisions in building your model, and the final model and any other important results.

Discussion section: here you interpret your final model and describe why it answers the research question and why it is important, as well as discuss any limitations that still exist based on your results.
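For the exploratory data analysis requested in the Methods section, a few base R summaries are often enough to start. The sketch below is an illustration on simulated data, not NHANES: the data frame eda and its columns bp, age, and smoke are hypothetical stand-ins for the training set, BPSysAve, Age, and SmokeNow.

```r
## Simulated stand-in for the training set (hypothetical names and values)
set.seed(2)
eda <- data.frame(
  age   = round(runif(500, 18, 80)),
  smoke = factor(sample(c("No", "Yes"), 500, replace = TRUE))
)
eda$bp <- 100 + 0.35 * eda$age + rnorm(500, 0, 12)

## Table: sample size, mean and SD of the outcome by smoking status
bp_by_smoke <- aggregate(
  bp ~ smoke, data = eda,
  FUN = function(x) c(n = length(x), mean = mean(x), sd = sd(x))
)
print(bp_by_smoke)

## Graphs: outcome distribution, outcome by group, outcome vs. a numeric predictor
op <- par(mfrow = c(1, 3))
hist(eda$bp, main = "Outcome", xlab = "bp")
boxplot(bp ~ smoke, data = eda, main = "Outcome by smoking status")
plot(bp ~ age, data = eda, main = "Outcome vs. age")
abline(lm(bp ~ age, data = eda), lty = 2)   # least-squares trend line
par(op)
```

Summaries like these describe only the training set, which is consistent with the requirement that the test set stays out of the analysis until the final prediction-error check.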
