Identify the most appropriate hypothesis test for each of the questions below.
INSTRUCTIONS TO CANDIDATES
ANSWER ALL QUESTIONS
Objective – Statistical Analysis of Business Data. Explore the dataset and extract insights from the data. The idea is for you to get comfortable with doing statistical analysis in Python.
You are expected to do the following:
- Explore the dataset and extract insights using Exploratory Data Analysis.
- Prove (or disprove) that the medical claims made by people who smoke are greater than those made by people who don't. [Hint: Formulate a hypothesis and prove/disprove it]
- Prove (or disprove) with statistical evidence that the BMI of females is different from that of males.
- Is the proportion of smokers significantly different across different regions? [Hint: Create a contingency table/cross tab and use the function stats.chi2_contingency()]
- Is the mean BMI of women with no children, one child, and two children the same? Explain your answer with statistical evidence.
*Consider a significance level of 0.05 for all tests.
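One plausible mapping from the four questions above to tests is sketched below: a one-sided two-sample t-test for smoker charges, a two-sided two-sample t-test for BMI by sex, a chi-square test of independence for smoker by region, and a one-way ANOVA for BMI of women by number of children. This is only a sketch under stated assumptions. The lowercase column names, the helper function names, and the synthetic data from `make_demo_data` are all hypothetical stand-ins for the real dataset, and Welch's unequal-variance t-test is a choice made here, not something the assignment prescribes.

```python
import numpy as np
import pandas as pd
from scipy import stats

ALPHA = 0.05  # significance level stated in the assignment


def make_demo_data(n=400, seed=0):
    """Build SYNTHETIC stand-in data (hypothetical; not the real dataset)."""
    rng = np.random.default_rng(seed)
    smoker = rng.choice(["yes", "no"], size=n, p=[0.2, 0.8])
    # Toy assumption: smokers get a fixed upward shift in charges.
    charges = rng.gamma(shape=2.0, scale=4000.0, size=n)
    charges = charges + np.where(smoker == "yes", 20000.0, 0.0)
    return pd.DataFrame({
        "sex": rng.choice(["male", "female"], size=n),
        "bmi": rng.normal(30.0, 6.0, size=n),
        "children": rng.integers(0, 4, size=n),
        "smoker": smoker,
        "region": rng.choice(
            ["northeast", "southeast", "southwest", "northwest"], size=n
        ),
        "charges": charges,
    })


def smoker_charges_test(df):
    """One-sided Welch t-test. H0: mean charges of smokers <= non-smokers."""
    yes = df.loc[df["smoker"] == "yes", "charges"]
    no = df.loc[df["smoker"] == "no", "charges"]
    result = stats.ttest_ind(yes, no, equal_var=False, alternative="greater")
    return result.pvalue


def bmi_by_sex_test(df):
    """Two-sided Welch t-test. H0: mean BMI is equal for females and males."""
    female = df.loc[df["sex"] == "female", "bmi"]
    male = df.loc[df["sex"] == "male", "bmi"]
    return stats.ttest_ind(female, male, equal_var=False).pvalue


def smoker_region_test(df):
    """Chi-square test. H0: smoking status is independent of region."""
    table = pd.crosstab(df["region"], df["smoker"])
    chi2, p_value, dof, expected = stats.chi2_contingency(table)
    return p_value


def bmi_by_children_test(df):
    """One-way ANOVA. H0: women with 0, 1 and 2 children have equal mean BMI."""
    women = df[df["sex"] == "female"]
    groups = [women.loc[women["children"] == k, "bmi"] for k in (0, 1, 2)]
    return stats.f_oneway(*groups).pvalue


if __name__ == "__main__":
    demo = make_demo_data()
    tests = {
        "smoker charges": smoker_charges_test,
        "BMI by sex": bmi_by_sex_test,
        "smoker by region": smoker_region_test,
        "BMI of women by children": bmi_by_children_test,
    }
    for name, test in tests.items():
        p = test(demo)
        decision = "reject H0" if p < ALPHA else "fail to reject H0"
        print(f"{name}: p = {p:.4f} -> {decision}")
```

With the real data, replace `make_demo_data()` with the loaded DataFrame and check each test's assumptions (approximate normality, group sizes, expected cell counts) during EDA before relying on the p-values.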
Context - Leveraging customer information is of paramount importance for most businesses. In the case of an insurance company, attributes of customers like the ones mentioned below can be crucial in making business decisions. Hence, knowing how to explore and generate value out of such data can be an invaluable skill to have.
Data Dictionary –
- Age - This is an integer indicating the age of the primary beneficiary (excluding those above 64 years, since they are generally covered by the government).
- Sex - This is the policy holder's gender, either male or female.
- BMI - This is the body mass index (BMI), which provides a sense of how over or under-weight a person is relative to their height. BMI is equal to weight (in kilograms) divided by height (in meters) squared. An ideal BMI is within the range of 18.5 to 24.9.
- Children - This is an integer indicating the number of children / dependents covered by the insurance plan.
- Smoker - This is yes or no depending on whether the insured regularly smokes tobacco.
- Region - This is the beneficiary's place of residence in the U.S., divided into four geographic regions - northeast, southeast, southwest, or northwest.
- Charges - Individual medical costs billed to health insurance
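The BMI definition in the data dictionary (weight in kilograms divided by height in meters squared, with 18.5 to 24.9 as the ideal range) can be illustrated with a small helper. This is a minimal sketch; the function names `bmi` and `in_ideal_range` are hypothetical, and the dataset itself already provides BMI as a column, so this only serves to make the formula concrete.

```python
def bmi(weight_kg, height_m):
    """Return body mass index: weight (kg) divided by height (m) squared."""
    if weight_kg <= 0 or height_m <= 0:
        raise ValueError("weight and height must be positive")
    return weight_kg / height_m ** 2


def in_ideal_range(value, low=18.5, high=24.9):
    """Check a BMI value against the ideal range given in the data dictionary."""
    return low <= value <= high


if __name__ == "__main__":
    # Illustrative inputs only: 70 kg at 1.75 m.
    value = bmi(70.0, 1.75)
    print(f"BMI = {value:.2f}, ideal range: {in_ideal_range(value)}")
```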
Submission Guidelines:
Please note the following:
- There are two parts to the submission:
  - A well-commented Jupyter notebook [format - .ipynb]
  - A presentation as you would present to the top management [format - .ppt /.pptx]
- Any assignment found to be copied or plagiarized from other groups will not be graded and will be awarded zero marks
- Please ensure timely submission, as assignments submitted after the deadline will not be accepted
- Your submission will not be evaluated if it contains more than 2 files.
Happy Learning!!
Scoring guide (Rubric) - Axis Insurance Project
Criteria | Points
Explore the dataset and extract insights using Exploratory Data Analysis. | 20
Prove (or disprove) that the medical claims made by the people who smoke is greater than those who don't? | 10
Prove (or disprove) with statistical evidence that the BMI of females is different from that of males. | 10
Is the proportion of smokers significantly different across different regions? | 10
Is the mean BMI of women with no children, one child and two children the same? Explain your answer with statistical evidence. | 10
Total points | 60