You are expected to write your assignment using R Markdown (see Lecture 6) or MS Word and submit a PDF. You need to submit your assignment via the provided submission link on iLearn.
You may discuss the assignment in the early stages with your fellow students. However, the assignment submitted should be your own individual work.
The R Markdown ‘Cheatsheet’ from the RStudio team is given here.
In your answers to the questions below, produce the appropriate R output and/or explanation of the steps and results. Don’t include any more R output than necessary and include only concise explanations.
Question 1 [21 marks]
Information was recorded on 376 sampled whisky and is available in the file whisky-chemistry.csv on iLearn.
Variable Description
Taste Taste score achieved for the Whisky Alcohol Level of Alcohol in % for the Whisky
Esters Level of Esters: Typically add fruity flavour
Lactones Level of Lactones: Found in the barrel the Whisky is aged in.
Contributes to woody flavours.
PhenComp Level of Phenolic Compounds: Typically giving a smokey flavour.
a) Consider first a full regression model with all the predictors used to explain the Taste response.
i) [4 marks] Write down the full statistical multiple regression model for quality explained by the other predictors. Carefully define all necessary parameters in your answer.
ii) [4 marks] Fit and validate the full regression model.
iii) [3 marks] Compute a 95% confidence interval for the regression coefficient (slope) for the Esters
variable. Explain what the confidence interval represents in the context of the data.
b) Consider now a reduced model that can explain the taste response with a reduced set of predictors.
i) [2 marks] Using the appropriate backward model selection method discussed in the course, determine the best regression model for the data.
ii) [4 marks] Write down the final model and interpret it in the context of the data.
c) For both the full model considered in a) and the reduced model in b), answer the questions below.
i) [2 marks] State the R2 and explain what it means in the context of the data.
ii) [2 marks] Explain why the adjusted R2 should be reported over the R2 for assessing the goodness of fit.
Question 2 [10 marks]
An internet service provider would like to know the levels of performance of its four competitors, which all offer a competitive product near Macquarie Park. A random sample of people that in that area and subscribe to one of those four providers’ plans were surveyed to see the average download speed achieved under a standard speed test. The achieved speed and service provider is recorded. The data is available in the file internet-speed.csv on iLearn.
Variable Description
TestedSpeed The measured download speed
Provider The four internet service providers with levels A, B, C and D
Define the general contrasts,
C1 = µA+µB +µC − µD C2 = µA+µB − µC +µD .
Conduct two significance tests on the contrasts C1 and C2 for an overall significance level of α = 0.05. In each of your answers:
a) [1 mark] State the appropriate hypotheses.
b) [1 mark] Calculate the observed value of the (raw) contrast.
c) [2 marks] Calculate the standard error of the contrast.
d) [1 mark] State the null distribution of the test statistic.
e) [2 marks] Compute or give the best approximate bound on the P-Value.
f) [3 marks] Draw your overall conclusions (both statistical and contextual).
CS 340 Milestone One Guidelines and Rubric Overview: For this assignment, you will implement the fundamental operations of create, read, update,
Retail Transaction Programming Project Project Requirements: Develop a program to emulate a purchase transaction at a retail store. This
7COM1028 Secure Systems Programming Referral Coursework: Secure
Create a GUI program that:Accepts the following from a user:Item NameItem QuantityItem PriceAllows the user to create a file to store the sales receip
CS 340 Final Project Guidelines and Rubric Overview The final project will encompass developing a web service using a software stack and impleme