logo Hurry, Grab up to 30% discount on the entire course
Order Now logo

Ask This Question To Be Solved By Our ExpertsGet A+ Grade Solution Guaranteed

expert
Herbert BarciaEducation
(5/5)

926 Answers

Hire Me
expert
Julian RichardsonComputer science
(5/5)

767 Answers

Hire Me
expert
Georgee BlackbridgeData mining
(5/5)

605 Answers

Hire Me
expert
StatAnalytica ExpertEngineering
(5/5)

990 Answers

Hire Me
R Programming
(5/5)

Classification goal is to predict whether the patient has a 10-year risk of future coronary heart disease (CHD)

INSTRUCTIONS TO CANDIDATES
ANSWER ALL QUESTIONS

World Health Organization has estimated 12 million deaths occur worldwide, every year due to heart diseases. Half the deaths in the United States and other developed countries are due to cardiovascular diseases. The early prognosis of cardiovascular diseases can aid in making decisions on lifestyle changes in high-risk patients and in turn reduce the complications. This assignment intends to pinpoint the most relevant/risk factors of heart disease as well as predict the overall risk using logistic regression.

The classification goal is to predict whether the patient has a 10-year risk of future coronary heart disease (CHD). The data is available on Canvas with the name Heart Disease.

 Table 4.1: Meta Data                                                          

Variable

Definition

Variable type

male

If sex is male (yes = 1)

Categorical

age

age of the patient

Continuous

education

education of the patient

Categorical

currentSmoker

whether or not the patient is a current smoker

Categorical

cigsPerDay

the number of cigarettes that the person smoked on average in one day

Continuous

BPMeds

whether or not the patient was on blood pressure medication

Categorical

prevalentStroke

whether or not the patient had previously had a stroke

Categorical

prevalentHyp

whether or not the patient was hypertensive

Categorical

diabetes

whether or not the patient had diabetes

Categorical

totChol:

total cholesterol level

Continuous

sysBP

systolic blood pressure

Continuous

diaBP

diastolic blood pressure

Continuous

BMI

Body Mass Index

Continuous

heartRate

Heart rate

Continuous

glucose

Glucose

Continuous

TenYearCHD

10 year risk of coronary heart disease (CHD)

Categorical

 Questions

1.    Import dataset as chdData. Specify the number of variables and the number of observations.

2.    Show first five observations.

3.    Create a new dataset (chdDataSelected ) by keeping the following variables:

•  male

•  age

•  sysBP

•  heartRate

•  glucose 

 •  TenYearCHD

4.    Investigate the type of variables in chdDataSelected. Change the variable to the type shown in Table 4.1.

5.    Produce a table tabulating the missing count of the aforementioned six variables.

6.    Impute missing values by the median for variables glucose and heart rate.

7.    Make a density plot for variable glucose, similar to the one shown in Figure 4.1. (Hint: apply function

Geom density ). 

8.    Remove observations that have missing values for other variables. Show your output after applying summary function. (Hint: you might want to use complete.cases function)

9.   Make a bar plot showing ten year risk of heart disease for each gender

10. In this dataset (chdDataSelected ), what is the proportion of the people with 10 year heart disease risk? (Hint:use function table)

11.  In dataset (chdDataSelected ), add another column TenYearCHD Encoded representing No if TenYearCHD

= 0, or Yes if TenYearCHD = 1.

12.Build three logistic regression models and report ROC. Your response variable is TenYearCHD Encoded. (The arguements inside trainControl will be same as for the exercise we did in the class)

(5/5)
Attachments:

Related Questions

. The fundamental operations of create, read, update, and delete (CRUD) in either Python or Java

CS 340 Milestone One Guidelines and Rubric  Overview: For this assignment, you will implement the fundamental operations of create, read, update,

. Develop a program to emulate a purchase transaction at a retail store. This  program will have two classes, a LineItem class and a Transaction class

Retail Transaction Programming Project  Project Requirements:  Develop a program to emulate a purchase transaction at a retail store. This

. The following program contains five errors. Identify the errors and fix them

7COM1028   Secure Systems Programming   Referral Coursework: Secure

. Accepts the following from a user: Item Name Item Quantity Item Price Allows the user to create a file to store the sales receipt contents

Create a GUI program that:Accepts the following from a user:Item NameItem QuantityItem PriceAllows the user to create a file to store the sales receip

. The final project will encompass developing a web service using a software stack and implementing an industry-standard interface. Regardless of whether you choose to pursue application development goals as a pure developer or as a software engineer

CS 340 Final Project Guidelines and Rubric  Overview The final project will encompass developing a web service using a software stack and impleme