Assignment A2: Text + Clustering + Estimation
Aim
To clearly articulate your understanding of the business problem and to present its solution to management.
Expectation
Business Problem
LP3: A business problem (or question) is stated succinctly in business terms.
Solution to Business Problem
LP4: The business solution is succinctly described for executives and justified. Cross-references to the technical sections of the report are provided for support, e.g. to tables, charts and plots.
Extension
LP3: Business decisions and actions that the analytic solution could support are well explained.
LP4: Business decisions and actions are clearly supported by the analytic solution.
Any academic references cited throughout this report may be included in this section.
Hints
For clarity and to guide the markers, leave the words “Expectation” and “Extension” in your report.
Do not attempt the extension unless the main objective has been achieved.
If not attempting the extension then delete the extension section, together with the word “Extension”.
Use this font for your report, i.e. Arial 10 (and not MyTiniestFont 2).
Make sure your exec summary is very clear.
You can restate or rephrase the problem statement as you gain better understanding.
Do not invent your own problem – it has been given to you but may not be achievable in its current form.
Ensure that whatever problem you describe can be solved using the provided data.
Make sure the exec summary describes the problem from the business perspective and not a technical perspective.
Use business language and not computer / mathematical / statistical / data science language.
The problem statement should describe the high-level aims and not the methods for achieving them.
Think about and state the likely benefit of this project for the company and its management.
Think about and state who the company's clients are and what the likely benefits of this project are for them.
Ensure that your solution clearly matches the problem statement.
Ensure that the solution is formulated in terms of achieving the high-level aim.
Do not include any charts or tables in the problem statement section.
However, where appropriate, cross-reference your problem statement with tables or charts from the following section, e.g. you can refer to them as “… (see Figure 1)” or “As shown in Table 4…”.
If you need to support your statements / analysis / argument with references to any published materials, use Harvard citation style as described in:
As the executive summary should take less than one page, we suggest including your bibliographic references at the bottom of this page, immediately below the executive summary (or problem description).
All comments, such as this, which are not part of your submission should be deleted to save space.
Data exploration and relationships - Clustering in RapidMiner (one page)
Aim
To demonstrate your understanding of text processing and interpretation.
Expectation
Review texts parsed. Dimensionality reduced. Text clustered and analysed using segmentation analysis. Various cluster parameters (k) experimented with and used. Emerging data relationships described.
Answer to question (A) given and justified.
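For orientation only, the workflow listed above (parse the texts, reduce the number of terms, cluster for several values of k, compare the results) can be sketched outside RapidMiner. The following is a minimal Python sketch using only the standard library. The sample reviews, the stopword list, the function names, and the choice of TF-IDF weighting with document-frequency pruning and farthest-first k-means are all assumptions made for illustration; they are not the RapidMiner operators or the data set required for this assignment.

```python
import math
import re
from collections import Counter

# Hypothetical sample reviews, invented purely for illustration.
REVIEWS = [
    "great room and friendly staff",
    "friendly staff and a clean room",
    "clean room, great staff",
    "terrible food and slow service",
    "slow service and cold food",
    "cold food, terrible service",
]
STOPWORDS = {"and", "a", "the"}  # assumed, deliberately tiny


def tokenize(text):
    """Lower-case the text and keep alphabetic tokens that are not stopwords."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOPWORDS]


def tfidf_vectors(docs, min_df=2):
    """Build unit-length TF-IDF vectors, pruning terms found in fewer than min_df docs."""
    tokens = [tokenize(d) for d in docs]
    df = Counter(term for toks in tokens for term in set(toks))
    # Pruning rare terms is a simple form of dimensionality reduction.
    vocab = sorted(term for term, n in df.items() if n >= min_df)
    vectors = []
    for toks in tokens:
        tf = Counter(toks)
        vec = [tf[term] * math.log(len(docs) / df[term]) for term in vocab]
        norm = math.sqrt(sum(v * v for v in vec)) or 1.0
        vectors.append([v / norm for v in vec])
    return vocab, vectors


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(vectors, k, max_iter=100):
    """Plain k-means with deterministic farthest-first initialisation."""
    centroids = [vectors[0]]
    while len(centroids) < k:
        # Next centroid: the point farthest from its nearest existing centroid.
        centroids.append(max(vectors, key=lambda v: min(sq_dist(v, c) for c in centroids)))
    labels = None
    for _ in range(max_iter):
        new_labels = [min(range(k), key=lambda j: sq_dist(v, centroids[j])) for v in vectors]
        if new_labels == labels:
            break  # assignments are stable, so the centroids are final
        labels = new_labels
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            if members:  # keep the old centroid if a cluster ends up empty
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    # Within-cluster sum of squares: lower means tighter clusters.
    wcss = sum(sq_dist(v, centroids[lab]) for v, lab in zip(vectors, labels))
    return labels, centroids, wcss


def top_terms(vocab, centroid, n=3):
    """Return the n highest-weighted terms of a centroid, to help name the cluster."""
    ranked = sorted(zip(centroid, vocab), reverse=True)
    return [term for _, term in ranked[:n]]


vocab, vectors = tfidf_vectors(REVIEWS)
for k in (1, 2, 3):
    labels, centroids, wcss = kmeans(vectors, k)
    print(f"k={k} wcss={wcss:.3f} labels={labels}")
    for j, centroid in enumerate(centroids):
        print(f"  cluster {j}: {top_terms(vocab, centroid)}")
```

Comparing the within-cluster sum of squares and the top terms across several values of k is one way to judge which segmentation is easiest to interpret; in the report itself, the equivalent evidence should come from your RapidMiner process and be shown as labelled tables or charts.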
Extension
Text and selected structured data used in segmentation analysis. Clusters optimised. Diagnostic charts with PCA.
Answer to business question (B) given and justified.
Hints
For clarity and to guide the markers, leave the words “Expectation” and “Extension” in your report.
Do not attempt the extension unless the main objective has been achieved.
If not attempting the extension then delete the extension section, together with the word “Extension”.
Use this font for your report, i.e. Arial 10 (and not MyTiniestFont 2).
Make sure that any included chart is readable (so do not shrink it to a microscopic size).
Make sure that if you scale the included screenshots, you keep the image proportions (do not distort images).
The aim of this section is to report any insights emerging from the initial analysis and interpretation of text.
Include here the text of your analysis with tables and plots to be used as evidence to support the analysis.
If you include any charts or tables, describe their important features (use arrows / boxes to highlight them).
Most importantly describe what those data features mean and how important they are, and why.
For this submission, we expect you to perform analysis of mainly textual data.
You will need to use some nominal attributes to be able to answer the extension question.
You may include here parts of the RM process. If so, also include a brief description of the operators.
Do not include definitions of terms or a “textbook” description of operations – we already know this!
If your analysis or results could only be determined by inspecting the process and running it, the marks will be reduced – if it is not in the report, it does not exist for the marker!
Avoid indiscriminate “dumping” of tables, charts or code into this section – all content must have some purpose.
All included charts, tables or RM processes (or their parts) have to be described or used in the discussion.
Make sure that all charts, tables and important results are labelled for cross-referencing, e.g. “Figure 1 – Histogram of Overall Rating” or “Table 4 – Comparison of model performance”.
If analysis or results could only be determined by inspecting the process or running it, the marks will be reduced.