Ask This Question To Be Solved By Our ExpertsGet A+ Grade Solution Guaranteed

Drop Files Here Or Click to Upload

Or Get Complete Course Help

Roshan NehraPhilosophy

(5/5)

861 Answers

Hire Me

Cameron CollinssHistory

(5/5)

627 Answers

Hire Me

Joan DomettEngineering

(4/5)

623 Answers

Hire Me

StatAnalytica ExpertFinance

(5/5)

964 Answers

Hire Me

Others

(5/5)

perform an appropriate data type conversion for these variables so that they can be recognised as categorical variables

INSTRUCTIONS TO CANDIDATES

ANSWER ALL QUESTIONS

Question 1

Prepare the ship data stored in "ship.csv" for future analytics tasks using functions and methods of NumPy and pandas, respectively.

(a) Design your own Python program to carry out the following tasks:

(i) Read in "ship.csv" as pandas DataFrame called "ship". Since there are 6 observations where MS and Y are "." to indicate that they are missing values, declare this character as missing values in your program accordingly.

(ii) Since the variable names of this dataset are rather short and do not really describe the nature of the variables, rename the ship types to "types", construction years to "c_years", operation periods to "o_periods", the aggregated months of service to "s_months", and the number of incidents to "incidents".

(iii) For better understanding of the data, compute the average service months and the average number of incidents for the cross-products of every category in types and operation periods. The averages should be rounded to the nearest integers. Store the resulting table to an object named "shipgroup".

(iv) Replace the missing values in the variable "s_months" and "incidents" by the respective means of the other ships that share the same type AND the same operation period. Add comments to elaborate your Python program as well.

(v) Construct a Python program to save the target variable "incidents" in a pandas DataFrame named "Y". (33 marks)

(b) Except for the months of service and number of incidents, all the other variables, including "types", "c_years", and "o_periods" are actually nominal and not interval/ratio.

(i) Perform an appropriate data type conversion for these variables so that they can be recognised as categorical variables. ANL252 Copyright © 2021 Singapore University of Social Sciences (SUSS) Page 5 of 6 ECA – July Semester 2021

(ii) Construct Python code to convert all categorical variables to dummy variables and save the result as a pandas DataFrame named "X".

(iii) Researchers suggest that the aggregated months of service of each ship must be scaled down due to its wide range of values. Perform a log-transformation of this variable in the DataFrame and name the transformed variable "log_s_months". The transformed variable should be attached to both DataFrames "X" and "ship". (14 marks)

(c) Normally, we shall split the DataFrame into training and testing datasets to evaluate the predictive power of the model. Study the dataset carefully and explain why it is not sensible to split the DataFrame here, and we shall use the entire dataset for training purpose instead. (8 marks)

(d) We shall now save the prepared DataFrame "ship" as a new csv text file called "ship_prepared.csv". Furthermore, we shall also create a database called "ship.db" and export the DataFrame to the database as tables. Write a Python program to carry out these two tasks. (15 marks)

(5/5)

Hurry, Grab up to 30% discount on the entire course

Ask This Question To Be Solved By Our ExpertsGet A+ Grade Solution Guaranteed

Our Experts

Roshan NehraPhilosophy

Cameron CollinssHistory

Joan DomettEngineering

StatAnalytica ExpertFinance

Others

perform an appropriate data type conversion for these variables so that they can be recognised as categorical variables

ANSWER ALL QUESTIONS

Attachments:

Instructions Files

Related Questions

. The fundamental operations of create, read, update, and delete (CRUD) in either Python or Java

. Develop a program to emulate a purchase transaction at a retail store. This program will have two classes, a LineItem class and a Transaction class

. The following program contains five errors. Identify the errors and fix them

. Accepts the following from a user: Item Name Item Quantity Item Price Allows the user to create a file to store the sales receipt contents

. The final project will encompass developing a web service using a software stack and implementing an industry-standard interface. Regardless of whether you choose to pursue application development goals as a pure developer or as a software engineer

Other Services

Ask This Question To Be Solved By Our ExpertsGet A+ Grade Solution Guaranteed

Our Experts

Roshan NehraPhilosophy

Cameron CollinssHistory

Joan DomettEngineering

StatAnalytica ExpertFinance

Others

perform an appropriate data type conversion for these variables so that they can be recognised as categorical variables

ANSWER ALL QUESTIONS

Attachments:

Instructions Files

Related Questions

. The fundamental operations of create, read, update, and delete (CRUD) in either Python or Java

. Develop a program to emulate a purchase transaction at a retail store. This program will have two classes, a LineItem class and a Transaction class

. The following program contains five errors. Identify the errors and fix them

. Accepts the following from a user: Item Name Item Quantity Item Price Allows the user to create a file to store the sales receipt contents

. The final project will encompass developing a web service using a software stack and implementing an industry-standard interface. Regardless of whether you choose to pursue application development goals as a pure developer or as a software engineer