logo Hurry, Grab up to 30% discount on the entire course
Order Now logo

Ask This Question To Be Solved By Our ExpertsGet A+ Grade Solution Guaranteed

expert
Rech ElleMathematics
(5/5)

913 Answers

Hire Me
expert
StatAnalytica ExpertComputer science
(5/5)

999 Answers

Hire Me
expert
Rex HuntGeneral article writing
(5/5)

770 Answers

Hire Me
expert
Georgee BlackbridgeData mining
(5/5)

678 Answers

Hire Me
R Programming
(5/5)

five basic analyses and make sure that each one says something significant about the dataset

INSTRUCTIONS TO CANDIDATES
ANSWER ALL QUESTIONS

Tasks

1. Find a dataset from any source – the dataset should have at least 500 rows and at least

two categorical attributes and at least two numerical attributes. You can search on the

web for datasets. You should not use any of the datasets that we have already used in

the course materials.

2. Run the dataset by me as soon as possible so that I can see that it is appropriate for the

project and provides enough scope for analysis.

3. Identify the categorical and numerical attributes in the dataset (categories and

measures).

4. Perform basic analysis on the dataset – like overall summaries, analysis of individual

columns, appropriate plots of individual columns depending on the nature of the

column, etc. For each analysis, you should show the R code and also include your

interpretation of the results. You should include at least five basic analyses and make

sure that each one says something significant about the dataset. Just listing the number

of rows and columns will not count towards the five, although you can include that in

your analysis.

5. Perform more advances analysis – reports and plots that relate multiple attributes. In

this section, you will first pose several (at least five) interesting questions that you

would like to answer and then provide your answers via reports and charts. You can use

the EDA framework that we studied to generate the questions as applicable to your

dataset. Generate as many questions as you can and list all questions you generated.

Then select those that you think are the interesting ones with reasons for selection.

Here are some general themes for generating questions:

a. two-category counts

b. distribution of measures with breakup by category

c. relationships between numeric variables

d. contribution to measures by individual categories

e. contribution to measures by two-categories

f. comparison of variation of measure by category

g. time series of measures, etc.

6. For each of the above you should show your R code, plots and provide your

interpretation. You should make good use of dplyr, pipes and ggplot. Use aesthetics to

bring multiple attributes into play and use facets where needed.

(5/5)
Attachments:

Related Questions

. The fundamental operations of create, read, update, and delete (CRUD) in either Python or Java

CS 340 Milestone One Guidelines and Rubric  Overview: For this assignment, you will implement the fundamental operations of create, read, update,

. Develop a program to emulate a purchase transaction at a retail store. This  program will have two classes, a LineItem class and a Transaction class

Retail Transaction Programming Project  Project Requirements:  Develop a program to emulate a purchase transaction at a retail store. This

. The following program contains five errors. Identify the errors and fix them

7COM1028   Secure Systems Programming   Referral Coursework: Secure

. Accepts the following from a user: Item Name Item Quantity Item Price Allows the user to create a file to store the sales receipt contents

Create a GUI program that:Accepts the following from a user:Item NameItem QuantityItem PriceAllows the user to create a file to store the sales receip

. The final project will encompass developing a web service using a software stack and implementing an industry-standard interface. Regardless of whether you choose to pursue application development goals as a pure developer or as a software engineer

CS 340 Final Project Guidelines and Rubric  Overview The final project will encompass developing a web service using a software stack and impleme