For this assignment, you will build a model to predict the trip time in seconds of a taxi trip, using the previously provided sample of the NYC taxi trip database.
1) Split the data into three sets: a training set, a validation set, and a test set. The test set should be about 10% of the data, the validation set should be another 10% of the data, and the training set should be the remaining 80% of the data.
2) The most basic prediction one can make for trip_time_in_secs is the average trip time in seconds. Calculate the mean_trip_time_in_secs on the training dataset, then use this value as your prediction for every observation in the validation dataset. Calculate the mean squared error on the validation set using these predictions. This is the baseline score that we want to beat.
3) Build a linear model using one predictor variable. Train the linear model on the training set, then use the model to make predictions on the validation set. Calculate the mean squared error and compare to the baseline score (it should be considerably better).
4) Build several more linear models using different combinations of predictor variables (minimum of two additional models total). As in step 3, for each model, make predictions on the validation set, and calculate the mean squared error.
5) Select your best-performing model from step 4. Use this model to make predictions on the test set and calculate the test set mean squared error. This is your final assessment of model performance.
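Steps 1 and 2 can be sketched roughly as follows. This is only an illustration under stated assumptions: the array below is synthetic stand-in data rather than the real NYC taxi sample, and the helper names split_indices and mean_squared_error are hypothetical. In the notebook, the trip_time_in_secs column of the provided sample would take the place of the synthetic array.

```python
import numpy as np

def split_indices(n_rows, val_frac=0.1, test_frac=0.1, seed=0):
    """Shuffle row indices and cut them into train/validation/test parts."""
    rng = np.random.default_rng(seed)
    order = rng.permutation(n_rows)
    n_test = int(round(n_rows * test_frac))
    n_val = int(round(n_rows * val_frac))
    test_idx = order[:n_test]
    val_idx = order[n_test:n_test + n_val]
    train_idx = order[n_test + n_val:]  # remaining ~80%
    return train_idx, val_idx, test_idx

def mean_squared_error(y_true, y_pred):
    """Average of the squared differences between truth and prediction."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.mean((y_true - y_pred) ** 2))

# ASSUMPTION: synthetic stand-in for the trip_time_in_secs column.
rng = np.random.default_rng(42)
trip_time_in_secs = rng.gamma(shape=2.0, scale=350.0, size=1000)

train_idx, val_idx, test_idx = split_indices(len(trip_time_in_secs))

# Step 2: the training-set mean is the prediction for every validation row.
mean_trip_time_in_secs = trip_time_in_secs[train_idx].mean()
baseline_pred = np.full(len(val_idx), mean_trip_time_in_secs)
baseline_mse = mean_squared_error(trip_time_in_secs[val_idx], baseline_pred)
print(f"baseline validation MSE: {baseline_mse:.1f}")
```

The mean is computed on the training rows only, so the validation and test sets play no part in choosing the baseline prediction.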
In steps 3 and 4, limit your choice of predictors to features that would be available at the start of a taxi trip. For instance, dropoff_datetime is not a reasonable choice of predictor, nor is fare_amount, because neither is known until the trip ends. By contrast, pickup_longitude is a reasonable choice, as are medallion and hack_license.
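One possible shape for steps 3 through 5 is sketched below, restricted to start-of-trip predictors. The data is synthetic, and several names are assumptions made for illustration: pickup_latitude and pickup_hour (an hour of day that would have to be derived from the pickup time) are hypothetical columns, and fit_linear, predict_linear, mse, and design are hypothetical helpers. Ordinary least squares via NumPy stands in for whichever linear-model library you prefer.

```python
import numpy as np

def fit_linear(X, y):
    """Ordinary least squares with an intercept; returns the coefficients."""
    A = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef

def predict_linear(coef, X):
    """Apply fitted coefficients (intercept first) to a feature matrix."""
    A = np.column_stack([np.ones(len(X)), X])
    return A @ coef

def mse(y_true, y_pred):
    return float(np.mean((y_true - y_pred) ** 2))

# ASSUMPTION: synthetic stand-in data, not the real taxi sample.
rng = np.random.default_rng(0)
n = 1000
columns = {
    "pickup_longitude": rng.normal(-73.97, 0.03, n),
    "pickup_latitude": rng.normal(40.75, 0.03, n),       # hypothetical column
    "pickup_hour": rng.integers(0, 24, n).astype(float),  # hypothetical column
}
y = (600 + 25 * columns["pickup_hour"]
     + 2000 * (columns["pickup_latitude"] - 40.75)
     + rng.normal(0, 120, n))

# 80/10/10 split of shuffled row indices.
order = rng.permutation(n)
train, val, test = order[:800], order[800:900], order[900:]

def design(names, rows):
    """Stack the named columns for the given rows into a feature matrix."""
    return np.column_stack([columns[name][rows] for name in names])

# Step 3 (one predictor) and step 4 (further combinations).
candidates = [
    ("pickup_hour",),
    ("pickup_hour", "pickup_latitude"),
    ("pickup_hour", "pickup_latitude", "pickup_longitude"),
]

models, val_scores = {}, {}
for names in candidates:
    models[names] = fit_linear(design(names, train), y[train])
    val_scores[names] = mse(y[val], predict_linear(models[names], design(names, val)))

# Baseline for comparison: training mean predicted for every validation row.
baseline_mse = mse(y[val], np.full(len(val), y[train].mean()))

# Step 5: choose on validation MSE, then score once on the test set.
best = min(val_scores, key=val_scores.get)
test_mse = mse(y[test], predict_linear(models[best], design(best, test)))
print(best, round(val_scores[best], 1), round(test_mse, 1))
```

Identifier columns such as medallion and hack_license are categorical, so they would need to be encoded numerically (for example as indicator columns) before they could enter a design matrix like this one. The test set is touched only once, after the model has been chosen on validation error.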
Do your work in a Jupyter notebook; when complete, export it to HTML and submit it here by the due date.