logo Hurry, Grab up to 30% discount on the entire course
Order Now logo

Ask This Question To Be Solved By Our ExpertsGet A+ Grade Solution Guaranteed

expert
James BeckettBusiness
(5/5)

913 Answers

Hire Me
expert
Jason AlexanderHistory
(5/5)

561 Answers

Hire Me
expert
Oscar ColeEnglish
(5/5)

872 Answers

Hire Me
expert
Kate DuggannMathematics
(5/5)

722 Answers

Hire Me
R Programming
(5/5)

Create a binary column by combining categories A and B Low Risk into one category and all the remaining categories in another High Risk

INSTRUCTIONS TO CANDIDATES
ANSWER ALL QUESTIONS

We will revisit the Lending Club data for this week’s assignment. The company has existed since 2007 and has provided millions of personal loans since then. Lending Club announced IPO in December 2014, since when the company came in the limelight for negative publicity. Lending club officials were accused of taking aggressive risks by lending money to those with risky creditworthiness. You are asked to study this phenomenon and determine if data provides clues of the authenticity of the claim that Lending Club behaved irresponsibly.

You are given a single combined file of “approved” loan data from six years, which are supposedly the pre and post-periods of the controversy.

Step 1 (30 Points)

The first step is to create two new columns as follows:

a) Comb_Risk_One: Create a binary column by combining categories A and B (Low Risk) into one category and all the remaining categories in another (High Risk).

b) Comb_Risk_Two: Create a binary column by combining categories A, B and C (Low Risk) into one category and all the remaining categories in another (High Risk).

Now, break the file into two files filtering out data for 2012, 13, and 14 in one file and 2015, 16, and 17 in another file.

Step 2 (70 Points)

The primary objective is to use classification techniques learned so far. Each loan is graded (A to G) based on the risk, with A being least risky and G being the highest risk category. You are asked to predict Low and High-risk categories (for the two new response variables) using various modeling techniques like Naïve Bayes’, KNN, Logistic Regression, and CART model. Make sure to look for the following:

a. Outliers based on the independent columns (predictors)

b. Multicollinearity

c. Scaling and standardization of the predictors

d. Train-Test split for both files and compare the confusion matrices on the Test.

Produce a “well documented and explained” R Markdown knit file analyzing the data with findings on the model with the highest classification ability. Also describe the features of the categories that are not classified correctly. Create a confusion matrix to answer the last question and run descriptive statistics on the misclassified categories. Provide any necessary EDA and visuals to enhance understanding of your analysis.

(5/5)
Attachments:

Related Questions

. The fundamental operations of create, read, update, and delete (CRUD) in either Python or Java

CS 340 Milestone One Guidelines and Rubric  Overview: For this assignment, you will implement the fundamental operations of create, read, update,

. Develop a program to emulate a purchase transaction at a retail store. This  program will have two classes, a LineItem class and a Transaction class

Retail Transaction Programming Project  Project Requirements:  Develop a program to emulate a purchase transaction at a retail store. This

. The following program contains five errors. Identify the errors and fix them

7COM1028   Secure Systems Programming   Referral Coursework: Secure

. Accepts the following from a user: Item Name Item Quantity Item Price Allows the user to create a file to store the sales receipt contents

Create a GUI program that:Accepts the following from a user:Item NameItem QuantityItem PriceAllows the user to create a file to store the sales receip

. The final project will encompass developing a web service using a software stack and implementing an industry-standard interface. Regardless of whether you choose to pursue application development goals as a pure developer or as a software engineer

CS 340 Final Project Guidelines and Rubric  Overview The final project will encompass developing a web service using a software stack and impleme