logo Hurry, Grab up to 30% discount on the entire course
Order Now logo

Ask This Question To Be Solved By Our ExpertsGet A+ Grade Solution Guaranteed

expert
Noel HaywardGeneral article writing
(5/5)

528 Answers

Hire Me
expert
StatAnalytica ExpertTechnical writing
(5/5)

512 Answers

Hire Me
expert
Connor EvanssEconomics
(5/5)

522 Answers

Hire Me
expert
Charles MorrissData mining
(5/5)

749 Answers

Hire Me
R Programming

goal of this mini project is for you to complete an exploratory data analysis by applying the concepts and tools you learned during the first month of class

INSTRUCTIONS TO CANDIDATES
ANSWER ALL QUESTIONS

Mini-Project

Overview: The goal of this mini project is for you to complete an exploratory data analysis by applying the concepts and tools you learned during the first month of class. We will soon be progressing to statistics and statistical machine learning, but the first step in working with data should always be EDA, as it can be a window offering powerful insights into your data.

1: Hate Crimes

The hate crimes dataset is compiled per state, and describes the prevalence of hate crimes. The data’s features include: - state: state name - me- dian_household_income: median household income - share_unemployed_seasonal: share of the population that is unemployed, seasonally adjusted - share_population_in_metro_areas: share of the population that lives in metropolitan areas - share_population_with_high_school_degree: share of adults 25 and older with a high-school degree - share_non_citizen: share of the population that are not U.S. citizens - share_white_poverty: share of white residents who are living in poverty - gini_index: Gini index - share_non_white: share of the population that is not white - share_voters_voted_trump: Share of 2016 U.S. presidential voters who voted for Donald Trump - hate_crimes_per_100k_splc: Hate crimes per 100,000 population, Southern Poverty Law Center, Nov. 9-18, 2016 - avg_hatecrimes_per_100k_fbi: Average annual hate crimes per 100,000 population, FBI, 2010-2015

2. Bike sharing

The bike sharing dataset list individual bike rides in the New York City Citibike system. The features include: - tripduration: length of trip (seconds) - starttime: start time and date - stoptime: end time and date - start station id: id of the start station - start station name: name of the start station - start station latitude: latitude of the start station - start station longitude: longitude of the start station - end station id: if of the end station - end station name: name of the end station - end station latitude: latitude of the end station - end station longitude: longitude of the end station - bikeid: the id of the bike used - usertype: type of rider, (Customer = 24-hour pass or 3-day pass user; Subscriber = Annual Member) - birth year: year of birth of rider - gender: gender of rider,

(Zero=unknown; 1=male; 2=female)

This data was already processed to remove trips taken by staff as they service and inspect the system, trips that are taken to/from “test” stations, and any trips that were below 60 seconds in length (potentially false starts or users trying to re-dock a bike to ensure it’s secure).

Exploratory Data Analysis

During your recent studio and homework assignments, you performed cursory explorations of several reasonably large data sets. Your goal in this mini-project is to spend some time doing a more in depth exploration of a dataset of your choice.

You should feel free to use all the R tools at your disposal so far to try to uncover an interesting story in the data, and to create informative visualizations. You should then write a short summary of your discoveries in R Markdown, suppressing code for, but displaying relevant visualizations. Your aim in this write up is to convince us that your conclusions are valid.

This mini-project is a pair programming project because collaboration will give you a chance to vet your ideas. Your partner can find holes in your thinking, and likewise you in theirs. We also encourage you to visit the course staff during their office hours, so you can run your ideas by them as well for further (constructive) criticism.

Related Questions

. The fundamental operations of create, read, update, and delete (CRUD) in either Python or Java

CS 340 Milestone One Guidelines and Rubric  Overview: For this assignment, you will implement the fundamental operations of create, read, update,

. Develop a program to emulate a purchase transaction at a retail store. This  program will have two classes, a LineItem class and a Transaction class

Retail Transaction Programming Project  Project Requirements:  Develop a program to emulate a purchase transaction at a retail store. This

. The following program contains five errors. Identify the errors and fix them

7COM1028   Secure Systems Programming   Referral Coursework: Secure

. Accepts the following from a user: Item Name Item Quantity Item Price Allows the user to create a file to store the sales receipt contents

Create a GUI program that:Accepts the following from a user:Item NameItem QuantityItem PriceAllows the user to create a file to store the sales receip

. The final project will encompass developing a web service using a software stack and implementing an industry-standard interface. Regardless of whether you choose to pursue application development goals as a pure developer or as a software engineer

CS 340 Final Project Guidelines and Rubric  Overview The final project will encompass developing a web service using a software stack and impleme