Problem
Part 1 must be submitted (pdf attachment preferred). Part 2 should be answered through Canvas in the quiz section. In the STATA/R exercises, please upload your do file/R-script with comments.
1 Part 1
1.1 Individual project
1. What is the data source you chose? Describe the variables: content, timing, location, etc.
2. Why do you think understanding these variables, and their association, could be interesting to your or to others?
3. Create a histogram and box-and-whiskers plot of your variables
4. Are there outliers? Do you think these outliers could be a problem in the future?
5. Summarize your data: mean, median, 25th percentile, 75th percentile, stan- dard deviation, min, max
6. Are there missing observations in any of the variables? do we know why?
7. Does the data look symmetric? if not, what type of skewness there is?
1.2 Proofs
1. Let x1, x2, ..., xn be observed values of a variable x. Let x be the average and let s be the standard deviation of these observation, so zi = xi−x¯is the ith value of x expressed in standard units. Show that the average x expressed in standard units is zero and the standard deviation is one.
2. Consider the following model of linear statistical association:
yi = b1xi + ui
Given observations (yi; xi) for i = 1, ..., n ; derive an expression for the slope coefficient that minimizes the average squared difference between the observed values of y and the values predicted by this regression
1.3 STATA/R
Import the Auto data.
Stata: webuse auto.dta, clear
R: install.packages(“ISLR”) library(ISLR)
1. Find the mean and the median of the variable weight.
2. Draw a histogram to show the distribution of the variable weight.
3. Draw a quantile function of the variable weight and find the 25%, 50%, 75% percentiles.
4. Draw a box plot for weight. Are there any outliers?
5. Draw a scatterplot for price and weight and find the correlation between these 2 variables
6. Draw the scatterplot again with a best-fit line. Does the relation look linear?
7. Regress price on weight. What is the coefficient on the x variable (βˆ1) What is the intercept in each equation (βˆ0)? How can we interpret the coefficient on weight?
8. Include variable length on the regression. What are the coefficients of a re-gression of price on length and weight (βˆ1 and (βˆ0)?βˆ2)? What is the intercept
9. Create a correlation matrix for the three variables
CS 340 Milestone One Guidelines and Rubric Overview: For this assignment, you will implement the fundamental operations of create, read, update,
Retail Transaction Programming Project Project Requirements: Develop a program to emulate a purchase transaction at a retail store. This
7COM1028 Secure Systems Programming Referral Coursework: Secure
Create a GUI program that:Accepts the following from a user:Item NameItem QuantityItem PriceAllows the user to create a file to store the sales receip
CS 340 Final Project Guidelines and Rubric Overview The final project will encompass developing a web service using a software stack and impleme