ASSIGNMENT 2
SCENARIO
John is having an animated discussion with his colleague, Mary, about the importance of total Cholesterol level and its relationship with BMI. Mary emphasises that literature focuses on the level of LDL (low density lipoprotein) cholesterol level, not total cholesterol. John suggests analysing the HERS dataset that contains these variables to obtain a proper answer to this question. Consider providing your syntax and output. Round to three decimal places (unless otherwise stated) and assume a priori alpha for p<0.05.
In this exercise, you will be helping John and Mary to arrive at a reasonable conclusion.
Paste your full syntax at the bottom of this assignment.
DATA PREP
Open the HERS dataset. Find two variables “tchol1” labeled as “Year 1 total cholesterol (mg/dl)”, “LDL1”, labeled as” year 1 LDL cholesterol (mg/dl)”, and BMI (kg/m2).
________________________________________
Hint: Working with Datasets with Many Variables
Sometimes when working in datasets with many variables, it can be cumbersome to try to find the variables you need. There are a few things that can make your life easier.
First, if you need to keep all the variables: You can use the search box above the variables list to quickly jump to the variables you are looking for. Note, you can also search for parts of variable names.
Second, if you know ahead of time exactly which variables you will need and which you will not need, you can drop the variables you will not need. If you do this, be careful not to over-wright the original data file! You can either save as a new data set, or just keep the data prep syntax to configure the original dataset to your needs in future analyses.
________________________________________
QUESTIONS
Q1. CARRY OUT A SIMPLE LINEAR REGRESSION WITH BMI AS THE OUTCOME VARIABLE AND CHOLESTEROL LEVEL AS THE PREDICTOR.
Report the correlation coefficient between BMI and cholesterol level (“Year 1 total cholesterol (mg/dl)”) Round your response to three decimal places.
(HINT: Remember that the Coefficient of determination (R2) is simply the square of the correlation coefficient in a simple regression). Enter the correlation coefficient in the box.
Q2. WHAT PROPORTION OF BMI IS EXPLAINED BY CHOLESTEROL LEVEL? (EXPRESS AS A PERCENT, E.G., 0.129 WOULD BE EXPRESSED AS 12.9)
Q3. NOW CARRY ON A SIMPLE LINEAR REGRESSION WITH BMI AS THE OUTCOME VARIABLE AND LDL LEVEL AS THE PREDICTOR.
Report the correlations coefficient between BMI and LDL level (use the LDL variable LDL1, labeled as year 1 LDL cholesterol (mg/dl)). Round your response to three decimal places. Enter the correlation coefficient in the box.
Q4. WHAT PROPORTION OF BMI IS EXPLAINED BY LDL LEVEL? (EXPRESS AS A PERCENT, E.G., 0.129 WOULD BE EXPRESSED AS 12.9)
Q5. ASSUMING A PRIORI ALPHA OF P<0.05 WHICH OF THE FOLLOWING STATEMENTS IS CORRECT?
Select one
a. Cholesterol is a significant predictor of BMI, but LDL is not
b. LDL is a significant predictor of BMI, but Cholesterol is not
c. Neither LDL nor Cholesterol are statistically significant predictors of BMI
d. Both LDL and Cholesterol are statistically significant predictors of BMI
e. We do not have enough information to determine
Q6. BASED ON THE ABOVE CORRELATION COEFFICIENT AND COEFFICIENT OF DETERMINATION FOR THE RELATIONSHIP BETWEEN BMI AND CHOLESTEROL OR LDL, WHO WAS CORRECT (MARY OR JOHN) IN THEIR ASSERTION THAT CHOLESTEROL VERSUS LDL WAS A “MORE IMPORTANT” PREDICTOR OF BMI, AND WHY?
Hint: when reporting your answer, support any statements of fact with the appropriate statistical results.
Q7. NOW COME AT THIS FROM A DIFFERENT PERSPECTIVE: INTERPRET THE RELATIONSHIP BETWEEN BMI AND CHOLESTEROL LEVEL BASED ON THE REGRESSION COEFFICIENT.
For ten unit change in cholesterol level, how much change should we see in BMI? (round to two decimal places). Report the P value associated with the t-test in this relationship. State as a complete sentence with all critical information provided.
Hint: Sometimes a one unit change in the predictor variable is clinically meaningless (when is the last time you had a physician concerned over a 1-point change in your serum cholesterol?). So, it may be more clinically relevant to interpret the relationship between the change in the predictor (X) and the outcome (Y) in terms of a larger unit of change. E.g., to interpret the effect of a 100 unit change in X, simply multiply the coefficient by 100.
Q8. INTERPRET THE RELATIONSHIP BETWEEN BMI AND LDL LEVEL.
For ten unit change in LDL level, how much change should we see in BMI? (round to two decimal places). Report the P value associated with the t-test in this relationship.
Q9. GENERATE A SCATTERPLOT BETWEEN BMI AND LDL AND INCLUDE A LINEAR FIT LINE ON TOP OF THE SCATTERPLOT.
Q9.1. PASTE THE GRAPH BELOW
Q9.2. PASTE THE SYNTAX FOR THE ABOVE GRAPH BELOW
Q10. IN A SIMPLE LINEAR REGRESSION, WHICH ESTIMATE DETERMINES WHETHER THE PREDICTOR VARIABLE HAS A STATISTICALLY SIGNIFICANT RELATIONSHIP WITH THE OUTCOME VARIABLE?
CS 340 Milestone One Guidelines and Rubric Overview: For this assignment, you will implement the fundamental operations of create, read, update,
Retail Transaction Programming Project Project Requirements: Develop a program to emulate a purchase transaction at a retail store. This
7COM1028 Secure Systems Programming Referral Coursework: Secure
Create a GUI program that:Accepts the following from a user:Item NameItem QuantityItem PriceAllows the user to create a file to store the sales receip
CS 340 Final Project Guidelines and Rubric Overview The final project will encompass developing a web service using a software stack and impleme