In this assignment, you will explore the correlates of earnings. You will use data on real individuals collected by the Office for National Statistics and published in the Quarterly Labour Force Survey. In the course of working on this assignment, you will:
• be introduced to an important source of data for research on the UK (the UK Data Service);
• become familiar with processing and analysing data using Stata;
• interpret regression output;
• develop critical thinking about economic phenomena and econometric analysis of them.
Practical details
Word limit: no more than 1000 words. Reasonable use of tables and figures does not count toward the word limit.
Submission file: one file (in .pdf or .doc format).
Figures and tables: should be numbered and titled appropriately. Tables should be formatted and presented as in standard economics journals (copy-pasted output from Stata is not acceptable).
Appendix: an appendix should be included at the end of the assignment, containing a copy of the Stata code used to obtain the results. This should be the exact code used to go from loading the dataset to generating the results presented (no more, no less). For convenience, you can simply copy and paste the .do file.
Preliminaries
You will have accessed the data and documentation you need to complete this assignment during Computer Workshop 5. Please refer to that workshop and the accompanying guide on how to access the dataset (“Computer Workshop 5 – UK Data Service Access Instructions”).
0. Loading data and defining the sample
a) Load the main dataset (lfsp jm19 eul pwt18.dta) in Stata.
b) We will focus on individuals reporting positive gross weekly earnings and not currently working towards a qualification. To keep only these observations, do:
keep if GRSSWK > 0 & QULNOW == 2
c) Check: the resulting dataset should contain 8949 observations. If this is not the case, something has gone wrong somewhere.
1. Degrees and earnings [20 marks]
a) Plot the distribution of weekly earnings (GRSSWK) in a histogram. What do you notice?
b) Consider instead taking the logarithm of earnings. Compare the distribution of log(GRSSWK) to the one above. When considering earnings in our regressions, why might it be preferable to take logarithms?
c) Now let us think about education, in particular the correlation between having attained a degree and earnings. Using the variable detailing the highest degree earned (HIQUL15D), create a dummy variable taking a value of one if the respondent has attained a degree or equivalent. Then, estimate the regression:
log(earnings)i = α + β Degreei + ϵi (1)
Report your findings in a table, and interpret the estimated coefficient on Degreei, both in terms of econometrics and economic theory.
2. Men and women [20 marks]
a) Create a dummy variable taking a value of one for female respondents (the relevant variable is
SEX).
b) Estimate equation 1 separately for men and women and report your findings in additional columns of the table you created above. What do you conclude about the returns to a degree for men and women?
c) Instead of estimating the regression separately for men and women, estimate instead: log(earnings)i = α + β1 Degreei + β2 Femalei + β3 Degreei × Femalei + ϵi (2)
How do the findings from this regression compare to the ones for men and women separately?
d) What do we learn, on the basis of these regressions, about the earnings of men and women?
3. Other factors [30 marks]
a) Pick an additional dimension on which information is available in the dataset (consult the user manual and codebook (lfs user guide vol3 variabledetails2019.pdf) for details) and construct variable(s) which allow you to explore the relationship between this dimension and earnings. Without running additional regressions (yet), why does economic theory tell you to expect this dimension to matter for earnings? Can we expect these dimensions to interact with the “degree effect” or “gender effect” we explored already?
b) Estimate additional regressions to test whether your theoretical predictions hold in the data. Present these in a second table and discuss.
4. Taking a step back [30 marks]
a) Thinking about the first regression we estimated (equation 1), do you expect the estimated βˆ to capture the true causal effect of attaining a degree on earnings? Why or why not?
b) In this assignment, you have used labour force data from 2019. What differences, if any, do you expect to find if you were to re-do the assignment using data from 2022?
CS 340 Milestone One Guidelines and Rubric Overview: For this assignment, you will implement the fundamental operations of create, read, update,
Retail Transaction Programming Project Project Requirements: Develop a program to emulate a purchase transaction at a retail store. This
7COM1028 Secure Systems Programming Referral Coursework: Secure
Create a GUI program that:Accepts the following from a user:Item NameItem QuantityItem PriceAllows the user to create a file to store the sales receip
CS 340 Final Project Guidelines and Rubric Overview The final project will encompass developing a web service using a software stack and impleme