Rapid Miner

INSTRUCTIONS TO CANDIDATES
ANSWER ALL QUESTIONS

Group Assignment:

This assignment is a group assignment consisting of 2 people. Students are responsible for the formation of groups.

 

Introduction & Overview:

The purpose of the assignment is to use RapidMiner to apply the concepts you have learned about the Data Analytics process to a public dataset of your choice, e.g. from the UCI repository or Kaggle. Students are gently reminded that, due to GDPR regulations and subtleties concerning “informed consent”, “retention”, “security”, “transfer”, “privacy” etc., it is unfortunately not possible to use private datasets.

 

Required Tasks:

You are required to produce a report detailing your work investigating the data, building a Data Analytics model, analysing the results and comparing your results with published findings in the area.

 

The first task you should complete is a data exploration exercise, where you will document the summary characteristics of the data set and further understanding that you have gained through visualisations.
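
As a rough illustration of what "summary characteristics" can mean for a single numeric attribute, the sketch below computes a few descriptive statistics in plain Python. It is not part of the assignment, which is to be carried out in RapidMiner; the function name `summarise` and the toy `ages` column are hypothetical and not taken from any real dataset.

```python
import statistics


def summarise(values):
    """Return basic descriptive statistics for one numeric column."""
    ordered = sorted(values)
    return {
        "count": len(ordered),
        "min": ordered[0],
        "max": ordered[-1],
        "mean": statistics.mean(ordered),
        "median": statistics.median(ordered),
        "stdev": statistics.stdev(ordered),  # sample standard deviation
    }


# Hypothetical toy column, invented purely for illustration.
ages = [23, 35, 31, 44, 52, 29, 38]
summary = summarise(ages)

for name, value in summary.items():
    print(f"{name}: {value}")
```

A table of such figures per attribute, together with a note on missing values and a few plots, is one common way to document the exploration step.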

 

Depending on the problem domain, develop a supervised or unsupervised model of the dataset, e.g. pick a candidate technique after researching standard approaches to the problem in the published literature. The modelling phase should be iterative, i.e. adjust the settings of the model until you are satisfied that the performance is acceptable.
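
The iterative idea can be sketched outside RapidMiner as a loop over candidate settings, scoring each one on a validation set. The example below is a minimal sketch only: it assumes a one-dimensional k-nearest-neighbour classifier, and the data, the helper names `knn_predict` and `accuracy`, and the candidate values of `k` are all hypothetical.

```python
from collections import Counter


def knn_predict(train, x, k):
    """Predict the label of x by majority vote among the k nearest training points."""
    neighbours = sorted(train, key=lambda row: abs(row[0] - x))[:k]
    votes = Counter(label for _, label in neighbours)
    return votes.most_common(1)[0][0]


def accuracy(train, data, k):
    """Fraction of (value, label) pairs in data that are classified correctly."""
    hits = sum(knn_predict(train, x, k) == y for x, y in data)
    return hits / len(data)


# Hypothetical toy data: (feature value, class label), with two noisy points.
train = [(1.0, "a"), (1.5, "a"), (2.0, "a"), (2.6, "b"),
         (6.0, "b"), (6.5, "b"), (7.0, "b"), (5.4, "a")]
validation = [(1.2, "a"), (2.4, "a"), (5.8, "b"), (6.8, "b")]

# One iteration per candidate setting; record each intermediate result.
results = {k: accuracy(train, validation, k) for k in (1, 3, 5)}
best_k = max(results, key=results.get)

for k, score in results.items():
    print(f"k={k}: validation accuracy {score:.2f}")
print("selected k:", best_k)
```

Keeping the intermediate scores, as the `results` dictionary does here, is what allows the report to show the rationale for each change of settings.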

 

You will need to independently evaluate the results of the model on a separate test set. You can then compare your results with published findings and discuss the outcomes.
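
A minimal sketch of a held-out evaluation, again in plain Python rather than RapidMiner, is given below. It assumes a simple random split and accuracy plus confusion counts as the performance measures. The helper names, the 25% test fraction, the seed, and the toy labels are illustrative assumptions, not requirements of the assignment.

```python
import random
from collections import Counter


def train_test_split(rows, test_fraction=0.25, seed=42):
    """Shuffle a copy of rows and hold out the final test_fraction as a test set."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * (1 - test_fraction))
    return shuffled[:cut], shuffled[cut:]


def confusion_counts(actual, predicted):
    """Count how often each (actual, predicted) label pair occurs."""
    return Counter(zip(actual, predicted))


# Hypothetical row identifiers standing in for dataset records.
rows = list(range(20))
train_rows, test_rows = train_test_split(rows)

# Hypothetical labels and model predictions for five held-out rows.
actual = ["yes", "yes", "no", "no", "yes"]
predicted = ["yes", "no", "no", "no", "yes"]

counts = confusion_counts(actual, predicted)
test_accuracy = sum(a == p for a, p in zip(actual, predicted)) / len(actual)

print("train size:", len(train_rows), "test size:", len(test_rows))
print("confusion counts:", dict(counts))
print("test accuracy:", test_accuracy)
```

The important property is that the test rows play no part in choosing the model settings, so the reported figure can fairly be compared with published results.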

 

Deliverables:

The report should be roughly 5 pages long and not longer than 8 pages. A separate index should be included at the end of the document for any references cited and an appendix for additional relevant material generated as part of the process. The complete document should not exceed 10 pages.

 

The report should clearly show your work in the following areas:

 

Definition of problem.

Data Exploration and Descriptive Analytics.

Identification of data insights from the previous step.

Details of the model building iterations conducted, e.g. intermediate results and the rationale for adjusting configuration settings.

Details of the evaluation strategy and performance measures for your data analytics model.

Discussion of the reasons underlying how your results compare and contrast with (1) your expectations after selecting the final model configuration settings and (2) further results for the dataset that are available in the online literature.

 

It is fine to include a selection of important tables and graphs in the main report (other tables and graphs created and referenced in the report should be captioned and put in an appendix). It is not required to include screenshots of RapidMiner processes in the report as these are readily available in the process files. The file SampleAssignmentDataAnalytics.pdf contains an example of a report previously submitted for this assignment.

 

Individual Work:

In my experience, students sometimes treat assignments as box-ticking exercises to be completed, submitted and forgotten about. Such an attitude lets slip the opportunity that project work represents to develop relevant skills and experience that can serve as a platform to build on. For example, the recent job descriptor, ExampleJobDescriptor.pdf, included in the zip file indicates that employers are usually very interested in the attitude of the applicant and in evidence of how this can be authentically demonstrated.

 

The last part of the assignment requires that each student individually defend their role in the realisation of the project. The format for this is 1 page of text per student, written in the style of answering the following question at a job interview: “So, could you tell us a little about project work that you were involved in as part of the Masters in Data Analytics?”
