This assignment is based on a dataset titled as ‘BUSS613 Assignment Dataset’. The dataset provides
demographic and household information for 5000 customers of a telecom service provider.
Important information about the variables is presented in Annexure 1 through Annexure 4. This information will help complete the assignment.
A brief description of the annexures is presented below.
Annexure 1. This annexure provides information about the variable names (i.e., column heading) and their labels/ description.
Annexure 2. This annexure provides information about the categorical/ ordinal variables. It presents the labels/ descriptions of the codes of the categories within each categorical/ ordinal variable.
Annexure 3. This annexure provides a histogram and boxplot for each continuous variable.
Annexure 4. This annexure provides the distribution of frequencies for the categorical/ ordinal variables.
For this assignment, you are required to perform the following tasks:
1. Understand the assignment dataset by going through the information given in the annexures and the excel spreadsheet.
2. Use the information given in the annexures to identify the potential data quality issues in the variables. Suggest a mechanism to deal with these data quality issues.
3. Identify one appropriate dependent variable that you would want to predict. Your dependent variable should be either ordinal or categorical. Provide the reasons for your choice of the dependent variable.
4. From the remaining variables, shortlist independent variables (maximum ten variables) for the prediction of your dependent variable.
5. Suggest one analytical technique that you will use for predicting the dependent variable. What will be your major considerations while using this technique? Justify your reasons for the choice of the analytical technique.
The document (a word file) should not be more than 2000 words in length (font Times New Roman size 12; 1.5 line spacing; justified). Page margins are to be 2.5cm all around. The file should be submitted via Turnitin on the blackboard. The blackboard link for submission is Assignment Submission.
No. |
Question |
Answer |
1. |
Use the information given in the annexures to identify the potential data quality issues in the variables. |
|
2. |
Suggest a mechanism to deal with these data quality issues. |
|
3. |
Identify one appropriate dependent variable that you would want to predict. Your dependent variable should be either ordinal or categorical. Provide the reasons for your choice of the dependent variable. |
|
4. |
From the remaining variables, shortlist independent variables (maximum ten variables) for the prediction of your dependent variable. |
|
5. |
Suggest one analytical technique that you will use for predicting the dependent variable. What will be your major considerations while using this technique? |
|
6. |
Justify your reasons for the choice of the analytical technique. |
|
CS 340 Milestone One Guidelines and Rubric Overview: For this assignment, you will implement the fundamental operations of create, read, update,
Retail Transaction Programming Project Project Requirements: Develop a program to emulate a purchase transaction at a retail store. This
7COM1028 Secure Systems Programming Referral Coursework: Secure
Create a GUI program that:Accepts the following from a user:Item NameItem QuantityItem PriceAllows the user to create a file to store the sales receip
CS 340 Final Project Guidelines and Rubric Overview The final project will encompass developing a web service using a software stack and impleme