For this assignment we will use the following definitions for the study variables:
-Dependent Continuous Variable: BMI. Notice that this variable does not appear in the original data set, instead a program such as AddHealthSamplePrePROC.sas or AddHealthSampleBasicPROC.sas (which are provided in the Complex Survey Module) must be ran before using it.
-Dependent Categorical Variable: Obese. Notice that this variable does not appear in the original data set, instead, after creating BMI, you will create it using the following rule: if BMI >=30 then Obese='Yes', else if 0<=BMI<30 then Obese='No'.
-Independent Categorical Variable: This a categorical variable that you can choose from the Add Health data set.
-Independent Continuous Variable: This a continuous variable that you can choose from the Add Health data set, e.g. H1DA3 "During the past week, how many times did you watch television or videos, or play video games?". Don't be surprise if you struggle a bit to find one that you like, there are surprisingly few of them!
________________________________________________________________________________
1. Create a data subset that includes only your study variables
2. Construct a new categorical variable (collapse categories) and rename or label. I'm expecting something simple, for example collapsing all the different categories that don't really provide information, e.g. "Refused"=6,"Don't know"=8, etc., into the single category of real missing values, would be enough.
3. For the dependent continuous-valued variable BMI (no survey proc's):
A. Show your 10 bottom and 10 top extreme values (± 3 SD)
B. Outliers (studentized residuals), leverage, and influential points. Notice that generating these measures is not enough to solve this part, you still have to identify which observations are considered outliers. For example generating Cook's D is no enough, in addition to this you have to identify which observations have an extraordinarily large value according to the rule of thumb provided in the lecture.
4. Describe your data subset:
A. Answer the following questions: How many observations? How many variables? What types of variables (character, numerical)?
B. Present your exploratory analysis results in a descriptive table format (Table 1):
i. Use proc surveyfreq for your categorical variables (Obese and your chosen categorical variable).
ii. Use proc surveymeans for your continuous variables (BMI and your chosen continuous variable).
Include your SAS code and output
5. Conduct a multicollinearity test on the continuous independent variable and age (no survey proc's).
6. Write a description of your outcome and exposure variables.
CS 340 Milestone One Guidelines and Rubric Overview: For this assignment, you will implement the fundamental operations of create, read, update,
Retail Transaction Programming Project Project Requirements: Develop a program to emulate a purchase transaction at a retail store. This
7COM1028 Secure Systems Programming Referral Coursework: Secure
Create a GUI program that:Accepts the following from a user:Item NameItem QuantityItem PriceAllows the user to create a file to store the sales receip
CS 340 Final Project Guidelines and Rubric Overview The final project will encompass developing a web service using a software stack and impleme