logo Hurry, Grab up to 30% discount on the entire course
Order Now logo

Ask This Question To Be Solved By Our ExpertsGet A+ Grade Solution Guaranteed

expert
Herbert BarciaEducation
(5/5)

998 Answers

Hire Me
expert
Ishrat KhanAccounting
(/5)

785 Answers

Hire Me
expert
Jeremy HardyEconomics
(5/5)

680 Answers

Hire Me
expert
Douglas RichardsPsychology
(5/5)

777 Answers

Hire Me
Others
(5/5)

The most basic prediction one can make for trip_time_in_secs is the average trip time in seconds.

INSTRUCTIONS TO CANDIDATES
ANSWER ALL QUESTIONS

For this assignment, you are going to build a model to predict the trip time in seconds of a taxi trip. Using the previously provided sample of the NYC taxi trip database.

1) Split the data into three sets: a training set, a validation set, and a test set. The test set should be about 10% of the data, the validation set should be another 10% of the data, and the training set should be the remaining 80% of the data.

2) The most basic prediction one can make for trip_time_in_secs is the average trip time in seconds. Calculate the mean_trip_time_in_secs on the training dataset, then use this value as your prediction for every observation in the validation dataset. Calculate the mean squared error on the validation set using these predictions. This is the baseline score that we want to beat.

3) Build a linear model using one predictor variable. Train the linear model on the training set, then use the model to make predictions on the validation set. Calculate the mean squared error and compare to the baseline score (it should be considerably better). 

4) Build several more linear models using different combinations of predictor variables (minimum of two additional models total). As in step 3, for each model, make predictions on the validation set, and calculate the mean squared error. 

5) Select your best performing model from step four. Use this model to make predictions on the test set. This is your final assessment of model performance. Calculate the test set mean squared error.

In steps 3 - 4, limit your choice of predictors to only features that would be available at the start of a taxi trip. For instance, dropoff_datetime is not a reasonable choice of predictor, nor is fare_amount, but pickup_longitude is a reasonable choice, as is medallion, or hack_license.

Do your work in a Jupyter notebook; when complete, export to html and submit here by the due date.

(5/5)
Attachments:

Related Questions

. The fundamental operations of create, read, update, and delete (CRUD) in either Python or Java

CS 340 Milestone One Guidelines and Rubric  Overview: For this assignment, you will implement the fundamental operations of create, read, update,

. Develop a program to emulate a purchase transaction at a retail store. This  program will have two classes, a LineItem class and a Transaction class

Retail Transaction Programming Project  Project Requirements:  Develop a program to emulate a purchase transaction at a retail store. This

. The following program contains five errors. Identify the errors and fix them

7COM1028   Secure Systems Programming   Referral Coursework: Secure

. Accepts the following from a user: Item Name Item Quantity Item Price Allows the user to create a file to store the sales receipt contents

Create a GUI program that:Accepts the following from a user:Item NameItem QuantityItem PriceAllows the user to create a file to store the sales receip

. The final project will encompass developing a web service using a software stack and implementing an industry-standard interface. Regardless of whether you choose to pursue application development goals as a pure developer or as a software engineer

CS 340 Final Project Guidelines and Rubric  Overview The final project will encompass developing a web service using a software stack and impleme