Python Programming

Write a Python code that creates two bar plots of average default rates ( Commercial_GBF_12m ) depending on whether a company was under external administration External_Admin

INSTRUCTIONS TO CANDIDATES

ANSWER ALL QUESTIONS

Foundations of Fintech - Assignment 5

This assignment builds on Lectures 10 to 12 and on Tutorials 8 to 11. You might want to consider using parts of the Python code discussed in those tutorials to answer some of the questions below.

Important: Do not change the type (markdown vs. code) of any cell, and do not copy, paste, or duplicate any cell. If the cell type is markdown, you are supposed to write text, not code, and vice versa. Provide your answer to each question in the allocated cell. Do not create additional cells. Answers provided in any other cell will not be marked. Do not rename the assignment files. All files in the assignment directory should be left as is.

Setting

Equifax Australia has provided us with synthetic loan application data from Australian proprietary companies. This data was generated to match the characteristics of actual lending proposals approved between February 2017 and March 2018. The Equifax data consists of two parts, which, to make it easier for you, we have merged together into one data set:

Company Business Trading History Data: This first part of the data set contains historical business trading data from 25,000 Australian proprietary companies that were granted a loan between February 2017 and March 2018.
Director Data: This second part of the data set contains information on up to four directors of each company. If a company has more than one director, the corresponding data has been averaged across directors at the company level.

Since this is proprietary data that belongs to Equifax, we are not allowed to give you direct access to it. However, thanks to Jupyter Hub, you are able to access it remotely. In particular, using your knowledge from Tutorial 9, you are able to analyze it at an aggregate level and to use it for the estimation of credit scoring models.

The file called Equifax_Data_Dictionary.xlsx provides you with the dictionary for both company and director level data.

Helpful commands

The merged Equifax dataset will be referred to by the name assignment5. Please see Tutorial 9 for the details of how to run functions on remote data. We have implemented additional functions to help you with this assignment. Details of these functions are below:

send_grouped_mean_request(): This function takes a dictionary containing the names of two columns and returns a Pandas data frame with the grouped means of one column with respect to the other. For example, calling it with {"data": "tutorial9.1", "var": "age", "y": "SeriousDlqin2yrs"} returns a data frame in which each entry holds a value of age and the mean of SeriousDlqin2yrs for applicants of that age (see Tutorial 10).
send_glm_request(): This function takes a reference to the remote data ("data") and the dependent variable ("y"), i.e., {"data": data, "y": y}, and outputs the detailed results of a full-fledged logistic regression model without feature selection.
send_logit_request(): This function is similar to the send_nn_request() method, but it performs the remote estimation of a customised logistic regression. It takes inputs in the form of a dictionary

{"data": data, "test": 0.2, "x": features, "y": y, "scale": "True"}

where:

"data": Reference to the remote data (see below)
"test": Fraction of the data used for testing
"x": List of features (independent variables) used by the model
"y": Target variable (dependent variable) of the model
"scale": Indicator ("True"/"False") for scaling
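
The input formats described above are plain Python dictionaries. The following is a minimal sketch that only assembles and checks such dictionaries; it does not call the remote functions, whose behaviour is defined in the tutorials. The helper names build_grouped_mean_request and build_logit_request are hypothetical, and the string "assignment5" stands in for the predefined data reference.

```python
def build_grouped_mean_request(data, var, y):
    """Payload for send_grouped_mean_request(): mean of `y` for each value of `var`."""
    return {"data": data, "var": var, "y": y}


def build_logit_request(data, features, y, test=0.2, scale=True):
    """Payload for send_logit_request(), using the keys listed above."""
    if not 0 < test < 1:
        raise ValueError("test must be a fraction strictly between 0 and 1")
    if not features:
        raise ValueError("at least one feature is required")
    return {
        "data": data,
        "test": test,
        "x": list(features),
        "y": y,
        # The assignment passes the scaling flag as the string "True" or "False".
        "scale": "True" if scale else "False",
    }


# "assignment5" is an assumed stand-in for the predefined `data` variable.
grouped = build_grouped_mean_request("assignment5", "External_Admin", "Commercial_GBF_12m")
logit = build_logit_request("assignment5", ["External_Admin"], "Commercial_GBF_12m")
print(grouped)
print(logit)
```

In the notebook itself, the predefined variables data and target would take the place of the literal strings used here.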

Predefined variables

For your convenience, we have predefined certain variables which you should reuse for this assignment:

data: The remote Equifax dataset that should be used in this assignment
target: The target variable (default indicator over 12 months)
all_features: The complete list of available features

Note: Please do not change the values of these variables.


Question 1 (2 marks)

Write Python code that creates two bar plots of average default rates (Commercial_GBF_12m) depending on whether a company (i) was under external administration (External_Admin) or (ii) had filed petitions. Make sure the axes of your plots are appropriately labeled.
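
As an illustration of the plotting step only, the sketch below draws one labeled bar plot from a small synthetic data frame. The data frame stands in for the output of send_grouped_mean_request(); its column names and numbers are made up, and the helper plot_default_rate is hypothetical.

```python
import matplotlib.pyplot as plt
import pandas as pd

# Synthetic stand-in for the grouped means returned by the remote function.
# Column names and values are assumptions made for this illustration.
grouped = pd.DataFrame(
    {"External_Admin": [0, 1], "Commercial_GBF_12m": [0.04, 0.15]}
)


def plot_default_rate(df, var, y, ax):
    """Draw one bar per value of `var`, with bar height equal to the mean of `y`."""
    ax.bar(df[var].astype(str), df[y])
    ax.set_xlabel(var)
    ax.set_ylabel(f"Average default rate ({y})")
    ax.set_title(f"Default rate by {var}")


fig, ax = plt.subplots(figsize=(4, 3))
plot_default_rate(grouped, "External_Admin", "Commercial_GBF_12m", ax)
fig.tight_layout()
```

The second plot would reuse the same helper with the petitions column, whose name is not given in the question text.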

Answer 1

"""Write your code in this cell"""

pass

Question 2 (2 marks)

Write Python code that creates two plots of average default rates (Commercial_GBF_12m) as a function of (i) the number of months since a director's last commercial default (ny7589_df_time_1) and (ii) the frequency of adverse commercial events over four years (48 months) prior to application (ny7601_adv_48_84m). Make sure the axes of your plots are appropriately labeled.

Answer 2

"Write your code in this cell"

pass

Question 3 (1 mark)

How do you interpret the above plots from Questions 1 and 2? What is your conclusion?

Answer 3

YOUR ANSWER HERE

Question 4 (2 marks)

Run a full-fledged logistic regression model without any ex-ante feature selection. Based on the estimation output, select and report all features that are significant at the 5%-level (or below).
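
The selection rule can be sketched independently of the remote estimation. The example below assumes the p-values have already been read from the estimation output into a dictionary; the feature names, the p-values, and the intercept label "const" are all made up for illustration.

```python
def significant_features(p_values, alpha=0.05, skip=("const",)):
    """Return features with p-value below `alpha`, most significant first.

    `skip` lists entries that are not features, such as an intercept;
    the label "const" is an assumption of this sketch.
    """
    selected = [
        (name, p) for name, p in p_values.items()
        if name not in skip and p < alpha
    ]
    return [name for name, _ in sorted(selected, key=lambda item: item[1])]


# Made-up p-values, not results from the Equifax data.
example_p_values = {
    "const": 0.001,
    "feature_a": 0.003,
    "feature_b": 0.200,
    "feature_c": 0.049,
}
selected = significant_features(example_p_values)
print(selected)  # ['feature_a', 'feature_c']
```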

Note: To increase the stability of the estimation, Python will automatically omit certain variables.

Answer 4 - Code

"Write your code in this cell"

pass

Answer 4 - Text

YOUR ANSWER HERE

Question 5 (3 marks)

Run a logit model using the function send_logit_request() and applying the following specifications:

Relative size of test data: 20%
Only use the features from Question 4 with a significance level below 5%
Scaling: "True"

Evaluate the testing performance of your logit model.
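
Two common ways to summarise testing performance are accuracy and the area under the ROC curve (AUC). The sketch below computes both from a handful of made-up test labels and predicted default probabilities; the format in which send_logit_request() reports its results is not described here, so the inputs are assumptions.

```python
def accuracy(y_true, y_prob, threshold=0.5):
    """Share of observations classified correctly at the given threshold."""
    correct = sum((p >= threshold) == bool(t) for t, p in zip(y_true, y_prob))
    return correct / len(y_true)


def roc_auc(y_true, y_prob):
    """Probability that a random defaulter is scored above a random
    non-defaulter, counting ties as one half."""
    pos = [p for t, p in zip(y_true, y_prob) if t == 1]
    neg = [p for t, p in zip(y_true, y_prob) if t == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos for n in neg
    )
    return wins / (len(pos) * len(neg))


# Made-up labels (1 = default) and predicted probabilities.
y_test = [0, 0, 1, 0, 1, 0]
p_test = [0.10, 0.40, 0.35, 0.20, 0.80, 0.05]

print(round(accuracy(y_test, p_test), 3))  # 0.833
print(roc_auc(y_test, p_test))             # 0.875
```

When defaults are rare, accuracy can look high even for a model that never predicts a default, which is one reason to report the AUC alongside it.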

Answer 5 - Code

Answer 5 - Text

YOUR ANSWER HERE

Question 6 (4 marks)

Write Python code that estimates a series of full-fledged neural networks with the following specifications:

Number of layers: 1
Number of units: 2, 4, 16, 64, 256
Relative size of test data: 20%
Scaling: "True"

Generate one plot that shows each model's ROC ("roc"), both for testing and training. What is your conclusion?
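
One way to organise the series of estimations is to build one request dictionary per network width and loop over them. The sketch below only constructs the dictionaries. The keys "layers" and "units" are assumptions modelled on the send_logit_request() input; the keys that send_nn_request() accepts are defined in the tutorials. The helper name build_nn_requests and the feature names are hypothetical.

```python
UNITS = [2, 4, 16, 64, 256]


def build_nn_requests(data, features, y, units=UNITS, layers=1, test=0.2, scale="True"):
    """Return one request dictionary per number of hidden units.

    The "layers" and "units" keys are assumed for this sketch and may
    differ from the names used by send_nn_request().
    """
    return [
        {
            "data": data,
            "test": test,
            "x": list(features),
            "y": y,
            "scale": scale,
            "layers": layers,
            "units": n,
        }
        for n in units
    ]


# "assignment5" and the feature names are placeholders.
nn_requests = build_nn_requests(
    "assignment5", ["feature_a", "feature_b"], "Commercial_GBF_12m"
)
for request in nn_requests:
    print(request["layers"], request["units"])
```

Each dictionary would then be passed to the remote function in turn, with the training and testing ROC values collected for plotting against the number of units.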
