logo Hurry, Grab up to 30% discount on the entire course
Order Now logo
529 Times Downloaded

Ask This Question To Be Solved By Our ExpertsGet A+ Grade Solution Guaranteed

expert
Devanshu KamraMarketing
(5/5)

554 Answers

Hire Me
expert
Soni GiranComputer science
(5/5)

795 Answers

Hire Me
expert
Earle BirdsellBusiness
(5/5)

701 Answers

Hire Me
expert
Georgee BlackbridgeData mining
(5/5)

782 Answers

Hire Me
R Programming
(5/5)

R script has to contain all code. The R script should be running from top to bottom in one go once the working directory is adapted to the respective machine.

INSTRUCTIONS TO CANDIDATES
ANSWER ALL QUESTIONS

Please write up all your answers in an R script.

This R script has to contain all code. The R script should be running from top to bottom in one go once the working directory is adapted to the respective machine.

Any answers that you are adding in your own words should also go into the document. So you could for example alternate between the R script and substantive answers that you are providing.

Please make sure that you structure your script well so that it is easy to read.

In the scenario for this labsheet, you are working on data for the company LEGO. It is your task to provide a report with an overview over the data and the first analysis of some key questions. To generate the data, I scraped the website from https://www.lego.co.uk. If you are lacking some contextual knowledge, feel free to browse around there to get some ideas.

 

1 Reading Data and Project Management (1 point)

Set the working directory correctly to the folder path where your .r file and your data is located. Read the data. And make sure that your .r script ultimately runs in one go and produces all results without returning error messages.

 

2 Describing Data (1 point)

Describe one ordinal variable from the data set. Think about good measures that can help express the key characteristics of the variable:

Can you provide an overview over the data?

How could you offer an idea of a typical value in the data, e.g. a category that is chosen very often?

Can you say something about the distribution of the frequencies in each category? Show how you analyse the data using R code. Write 3-4 sentences to describe each variable.

3 Selecting from Data (2 points)

How would you select the following data? Show how you analyse the data with the R code.

From the variable piece_count, select those observations that have more than 30 pieces, but less than 400.

From the variable vip_points, select those observations that have less than 75 vip points or more than 920.

Get the piece counts for all sets from the theme ‘MINDSTORMS’.

Subset the data so that it only contains sets with more than 300 pieces.

 

4 Visualise Data

Piece Count Data (1 point)

Select the data for the Technic theme sets.

Visualise the variable piece_count of this subset of the data with a plot type of your choice.

 Show how you generate the figure using R code.

Boxplot (1 point)

Visualise one continuous variable of your choice with a boxplot.

Describe the visualisation in 3-4 sentences and report all statistical information that you can retrieve from a boxplot.

5 Data Analysis

Now analyse the data. To answer the questions you can use any tool you want, from looking at data in the spreadsheet, tabulating it, plotting it etc.. In all you do, it needs to be evident for the markers how you come to your conclusions, so document all the code that you are writing for your analysis.

 

Price per Piece (2 points)

Make a plot with the number of pieces in a set on the x-axis and the price of a set in the y-axis. What do you conclude?

Calculate the price per piece and analyse the most characteristic values (min, max, median, mean, standard deviation). What do you conclude?

Now take a closer look at all sets that have a price per piece larger than 11GBP. Also consider all values that have a price per piece smaller than 0.06 GBP. Tabulating the two resulting data vectors, what do you conclude?

Price of Superheroes (2 points)

Generate four subsets for the LEGO sets with Batman, Spider-Man, Harry Potter and Marvel.

Calculate the average price in each of the subsets.

What do you conclude?

Now generate a boxplot with the price per piece for the sets with the theme Batman, Spider-Man, Harry Potter and Marvel. What do you conclude?

(5/5)
Attachments:

Expert's Answer

529 Times Downloaded

Related Questions

. The fundamental operations of create, read, update, and delete (CRUD) in either Python or Java

CS 340 Milestone One Guidelines and Rubric  Overview: For this assignment, you will implement the fundamental operations of create, read, update,

. Develop a program to emulate a purchase transaction at a retail store. This  program will have two classes, a LineItem class and a Transaction class

Retail Transaction Programming Project  Project Requirements:  Develop a program to emulate a purchase transaction at a retail store. This

. The following program contains five errors. Identify the errors and fix them

7COM1028   Secure Systems Programming   Referral Coursework: Secure

. Accepts the following from a user: Item Name Item Quantity Item Price Allows the user to create a file to store the sales receipt contents

Create a GUI program that:Accepts the following from a user:Item NameItem QuantityItem PriceAllows the user to create a file to store the sales receip

. The final project will encompass developing a web service using a software stack and implementing an industry-standard interface. Regardless of whether you choose to pursue application development goals as a pure developer or as a software engineer

CS 340 Final Project Guidelines and Rubric  Overview The final project will encompass developing a web service using a software stack and impleme