logo Hurry, Grab up to 30% discount on the entire course
Order Now logo

Ask This Question To Be Solved By Our ExpertsGet A+ Grade Solution Guaranteed

expert
Abhishek MisraEconomics
(5/5)

589 Answers

Hire Me
expert
Feuer YaoData mining
(5/5)

645 Answers

Hire Me
expert
BIDESH DASMathematics
(/5)

930 Answers

Hire Me
expert
Neha SharmaaStatistics
(5/5)

1156 Answers

Hire Me
R Programming
(5/5)

Make a simple regression of any promising individual variables. Which ones stand out

INSTRUCTIONS TO CANDIDATES
ANSWER ALL QUESTIONS

Multiple Regression Statistics

Fundamentals of Data Science

We will use the other part of that wine dataset from the UC-Irvine Machine Learning Repository.

White Wine: (for variety, and to keep practicing importing data, we'll use white wine this time).

Import the dataset into Rstudio. (The file uses semicolons instead of commas as the delimiter, so using the RStudio import tool is probably easiest.) I don't think it needs to be cleaned, but document if you do anything. Remember to copy the code from the Import Tool to your RMarkdown window, so that you can knit it at the end.

The final column, "quality,” is a 1-10 scale, and will be your “y” response variable for everything

Predict the quality of a wine from other key variables.

1) (extra credit) Using set.seed(123), and probability =0.7, make a test and training data set. Put the test dataset away (for now), and just use the training set. It's fine to just use the whole dataset, but you'll get a small bonus for doing this extra step.

2) Using a panel ggplot and/or the pairs command, look at all the variables (peek at this and earlier labs to remember how). Anything promising?

3) Start by looking at each variable, alone and altogether (on your plots).

Make a simple regression of any promising individual variables. Which ones stand out?

4) Then, make a full regression predicting quality from density for all interesting variables.

Spoiler: Im(y~x1+x2+x4+...) or Im(y~.)

5) Explore a few more promising candidates, using lm and graphs.

6) Use step() to find a stepwise regression model.

7) Pick your 3 favorite multiple regression models, each with a different number of predictor variables.

8) Compare Models to find the most awesome model.

a) Use AICC) to evaluate and compare your 3 models.

b) Use cross-validation and cvFit() to evaluate and compare your 3 models.

9) Move everything into RMarkdown to knit into a .pdf file. Write some text around your charts and graphs to make this like a report to someone who was trying to help someone pick a great wine.

(5/5)
Attachments:

Related Questions

. The fundamental operations of create, read, update, and delete (CRUD) in either Python or Java

CS 340 Milestone One Guidelines and Rubric  Overview: For this assignment, you will implement the fundamental operations of create, read, update,

. Develop a program to emulate a purchase transaction at a retail store. This  program will have two classes, a LineItem class and a Transaction class

Retail Transaction Programming Project  Project Requirements:  Develop a program to emulate a purchase transaction at a retail store. This

. The following program contains five errors. Identify the errors and fix them

7COM1028   Secure Systems Programming   Referral Coursework: Secure

. Accepts the following from a user: Item Name Item Quantity Item Price Allows the user to create a file to store the sales receipt contents

Create a GUI program that:Accepts the following from a user:Item NameItem QuantityItem PriceAllows the user to create a file to store the sales receip

. The final project will encompass developing a web service using a software stack and implementing an industry-standard interface. Regardless of whether you choose to pursue application development goals as a pure developer or as a software engineer

CS 340 Final Project Guidelines and Rubric  Overview The final project will encompass developing a web service using a software stack and impleme