logo Hurry, Grab up to 30% discount on the entire course
Order Now logo
576 Times Downloaded

Ask This Question To Be Solved By Our ExpertsGet A+ Grade Solution Guaranteed

expert
Elis PanaseykoEnglish
(5/5)

630 Answers

Hire Me
expert
Alan DuderMarketing
(5/5)

910 Answers

Hire Me
expert
Ashok MalhotraComputer science
(5/5)

825 Answers

Hire Me
expert
Pope AtkinsPhilosophy
(4/5)

852 Answers

Hire Me
SAS
(5/5)

machine learning pipeline involves several tasks before the development of a predictive/descriptive models.

INSTRUCTIONS TO CANDIDATES
ANSWER ALL QUESTIONS

DATA MANAGEMENT

The machine learning pipeline involves several tasks before the development of a predictive/descriptive models. The inevitable and vital process includes preparing and understanding the data. Moreover, the performance of the predictive/descriptive model depends on the choice of pre-processing techniques.

For the assignment, you are required to prepare and explore the given dataset. It is imperative to explain and justify the pre-processing, transformation, and feature engineering techniques that have been chosen. Your analysis should be deep and in detail, also it must go further than what has already been covered in this course. 

The assignment should involve a number of experiments, and a detailed exploration and analysis of the results using SAS Studio.

You need to do the following tasks: 

 

PART 1 

Feature Engineering 

Several Data Mining/Machine Learning algorithms are designed to work with qualitative or quantitative data and very few algorithms support mixed data. Hence, this task requires you to transform with an appropriate method(s) and proper justification to be provided.  In addition to that, the metadata should be created for each dataset.  Feature engineering itself can be divided in 2 steps:

Variable transformation.

Variable / Feature creation.

In this section , you need to summarize feature engineering task and provide the interpretation of  work related to feature engineering task that you have done in SAS Studio.[1000 words].

 

Documentation Format:

Typeface: Times New Roman. Boldface, italic & lines can be used for emphasizing and to enhance readability.

Font size: 12 (except titles and headings).

Margins: 1” from the left, right, top & bottom of the edges of the A4 paper.

Spacing: 1.5 lines between texts of a paragraph.

Alignment: Justify.

Headers and footers can be used all pages must be numbered accordingly.

Standard cover page as available in the learning management system

PART 2

1. Related Works

In this section, you are supposed to research and present the other works related to the application domain. 

Initial Data Exploration –

Data exploration is an approach similar to initial data analysis, whereby a data analyst uses visual exploration to understand what is in a dataset and the characteristics of the data, rather than through traditional data management systems.

This section should contain the following task.

Indicate the type of each attribute (nominal, ordinal, interval or ratio).

Identify the values of the summarising properties for each attribute including frequency and spread e.g. value ranges of the attributes, frequency of values, distributions, medians, means, variances, and percentiles. Wherever necessary, use proper visualisations for the corresponding statistics. Summary / descriptive stats

Using SAS explore your dataset and identify the variables any outliers, missing values, and outliers treatment.  

2. Data Pre-processing

Investigate the required method(s) to handle the incomplete, noisy and inconsistent data.

Report each of the applied techniques with detailed explanations. Show your results and justify your approach.

(5/5)
Attachments:

Expert's Answer

576 Times Downloaded

Related Questions

. The fundamental operations of create, read, update, and delete (CRUD) in either Python or Java

CS 340 Milestone One Guidelines and Rubric  Overview: For this assignment, you will implement the fundamental operations of create, read, update,

. Develop a program to emulate a purchase transaction at a retail store. This  program will have two classes, a LineItem class and a Transaction class

Retail Transaction Programming Project  Project Requirements:  Develop a program to emulate a purchase transaction at a retail store. This

. The following program contains five errors. Identify the errors and fix them

7COM1028   Secure Systems Programming   Referral Coursework: Secure

. Accepts the following from a user: Item Name Item Quantity Item Price Allows the user to create a file to store the sales receipt contents

Create a GUI program that:Accepts the following from a user:Item NameItem QuantityItem PriceAllows the user to create a file to store the sales receip

. The final project will encompass developing a web service using a software stack and implementing an industry-standard interface. Regardless of whether you choose to pursue application development goals as a pure developer or as a software engineer

CS 340 Final Project Guidelines and Rubric  Overview The final project will encompass developing a web service using a software stack and impleme