For this exercise open the Excel file titled ‘HH Energy Data’ in UniHub.
This data comes from a study of annual household energy consumption in the UK (measured in Kilowatt hours per year). This data was collected from a random sample of houses from across the UK in 2018. The aim was to establish a baseline of energy consumption prior to the introduction of new energy efficiency regulations.
*Kindly generate ‘largehh’ variable in stata as it is missing by using command –
gen large hh=hhsize>2
The dataset contains the following variables:
Variable name |
Description of variable |
id |
Identifies each household in the study |
hhsize |
Number of people living in the household |
largehh |
1=household with 3 or more people living in it, 0=all other |
rural |
Whether a household is defined as rural (1=rural, 0=urban or suburban) |
kwh |
Household energy consumption per year (in Kilowatt hours, kWh) |
1. Please provide a brief description of the data including (10 marks)
a. A table with summary statistics
b. A histogram of energy consumption
c. A short textual description of the sample statistics
2. The national energy regulator claims that the average household in the UK consumes on average about 4000 kWh of energy per year. (15 marks)
a. Test this claim using a two-tailed t-test.
b. Report the 95% confidence interval for mean energy consumption.
c. Discuss your results. Clarify whether your hypothesis test confirms your confidence interval estimation.
3. Perform appropriate tests to identify: (10 marks)
a. Whether rural households consume more energy than urban/suburban ones.
b. Whether large households consume more energy than smaller ones
c. Whether rural households are larger than urban ones
4. A pilot study of the impact of proposed new energy efficiency technologies is being designed to be implemented next year. Based on results of the above study, do you have any suggestions for this pilot study might be most effective? Write a short paragraph explaining your reasoning. (10 marks)
The dataset ‘London_data.dat’ contains information on individuals’ budget share on clothing and food, their total expenditure, age and number of children, for a total of 850 people.
a) Provide a table of descriptive statistics for your data and discuss what this initial analysis tells you about the characteristics of your sample (9 marks)
b) Produce two scatter plots, the first one using food expenditure (Y) and total expenditure (X) and the second one using clothing expenditure (Y) and total expenditure (x). Include the fitted line on your scatter plot. What can you say about the relationship of the two budget shares with respect of total expenditure? (10 marks)
c) Estimate the following budget share equation for the food budget share and the clothing budget share:
Report and discuss your results. In your discussion comment on how total expenditure, age and number of children influence the various budget proportions. Interpret you R-squared and comment on the statistical significance of your coefficient estimates, using the 5% and the 1% significance level. (12 marks)
d) Re-estimate the food budget share but this time express TOTEXP in logarithms. Interpret your results following the instruction in question b) (12 marks)
e) Re-estimate the clothing budget share expressing the both the share and TOTEXP in logarithms. Interpret your results following the instruction in question b) (12 marks)
Produce professional looking tables (using the outreg2 command) and figures.
CS 340 Milestone One Guidelines and Rubric Overview: For this assignment, you will implement the fundamental operations of create, read, update,
Retail Transaction Programming Project Project Requirements: Develop a program to emulate a purchase transaction at a retail store. This
7COM1028 Secure Systems Programming Referral Coursework: Secure
Create a GUI program that:Accepts the following from a user:Item NameItem QuantityItem PriceAllows the user to create a file to store the sales receip
CS 340 Final Project Guidelines and Rubric Overview The final project will encompass developing a web service using a software stack and impleme