A. Identify data types:
1. (5 points) Login to LionPath to complete the following:
o (2 points) Identify 5 different data points that would be structured data.
o (2 points) Identify 5 different data points that would be unstructured data.
o Note: You can choose other pages than the main page to identify the data. Put the data into a table.
o (1 point) Identify the data points as (interval, nominal, ordinal, or ratio)
2. (5 points) Answer the following questions:
o (2 points) Capturing 5000 records from Expedia travel website, which includes zip codes, to determine where people from specific zip codes go on vacation. Is this a population or sample?
o (1.5 points) What would need to be done to the previous question for the correct answer to be the opposite of what you answered. Meaning how would you make it a population or sample, depending on what you answered.
o (1.5 points) An organization wants to know what consumers think about changing the color of their product. They have 100000 followers on social media. If they send a survey to all 100000 followers would this be a population or a sample?
B. This week you work for a Real Estate company and have exported the data from your ERP system to provide analysis for a meeting you have this week. Your boss wants you to provide analysis of the data as outlined below. Please use the Excel file (Week1_House_Price.xlsx) to complete the following.
3. (5 points) Identify missing and invalid data
o In R, use functions to find the missing information. Indicate the column and the row numbers that have the missing (NA) values
4. (7.5 points) Subset the data into the different colleges and universities
o Select 5 college universities and subset the data
o Create different dataframes in R and create csv files for each dataframe
Name the dataframes appropriately
Make sure to document what you are doing in the script using the # symbol
o For each subset, provide the count of sales and the total amount of sales in R, using functions
5. (7.5 points) Transform numeric variables
o In R –
Import Excel file into dataframe – HousePrices
Bin Sale_amount into bins (0, .33, .66, 1) use SaleBins for the vector name
Add the bin information to the HousePrices dataframe using the cut function
Label the bins with the cutoff amounts in the Bins (i.e. if bins were 10000, 20000, 30000 then label the bins 10k, 20k, 30k)
Create bins using the cut function for High, Medium and Low
Add the factors to the HousePrices dataframe
CS 340 Milestone One Guidelines and Rubric Overview: For this assignment, you will implement the fundamental operations of create, read, update,
Retail Transaction Programming Project Project Requirements: Develop a program to emulate a purchase transaction at a retail store. This
7COM1028 Secure Systems Programming Referral Coursework: Secure
Create a GUI program that:Accepts the following from a user:Item NameItem QuantityItem PriceAllows the user to create a file to store the sales receip
CS 340 Final Project Guidelines and Rubric Overview The final project will encompass developing a web service using a software stack and impleme