Part A: Quarter of birth, returns to schooling, instrumental variables and more
This part is based on an extract from the data used by Angrist and Krueger in their seminal study “Does Compulsory School Attendance Affect Schooling and Earnings?” Μ§ Quarterly Journal of Economics, November 1991 (https://www.jstor.org/stable/2937954). The data can be found on the course website in Stata format, AngristKrueger_3039Cohort.dta.
As usual, make sure to present your results in a neat format.
1 a) Angrist and Krueger are interested in estimating the causal effect of schooling on log earnings.
i. Estimate an OLS regression of log weekly wages (lwklywage) on years of schooling (educ), without any additional controls.
ii. Repeat part (i), but now add controls for race, SMSA, marital status, regional dummies, and year of birth dummies.
Compare the coefficients in the two specifications, and comment on the results. Is the coefficient on years of schooling likely to be a consistent estimator of the causal effect of interest? Why or why not? What are the possible sources of bias?
2. b) Angrist and Krueger propose using quarter of birth as an instrument for years of schooling. Explain why this can be a valid instrument. Specifically, why does it satisfy the first stage (or “relevance” condition, Cov(Z,X) =ΜΈ 0)? Does it satisfy the exclusion restriction?
c) Define a new variable QOB1, equal to 1 if the individual is born in the first quarter of the year, and equal to zero otherwise.
i. Estimate the first stage regression: the regression of years of schooling on QOB1 (without any additional explanatory variables), and report the coefficient and standard error.
ii. Estimate the reduced form regression: the regression of lon weekly wage on QOB1 (without any additional explanatory variables), and report the coefficient and standard error.
iii. Estimate a 2SLS regression of log weekly wage on years of schooling, using QOB1 as an instrument. Use the ivregress 2sls command in Stata. Report the coefficient and standard error.
iv. What is the relationship between the estimates you obtained in parts i, ii and iii of this question?
4. d) Suppose that you have a faulty version of Stata, in which the ivregress command doesn’t work. Explain how you could obtain the 2SLS coefficient (of part c.iii) using only the regress command. Implement this estimate. Is the coefficient and the standard error the same as in part c.iii?
5. e) Suppose that you have an even faultier version of Stata, in which neither the ivregress nor the regress commands work. Explain how you could obtain the 2SLS coefficient of part c.iii using only the summarize command (and if statements). Implement this estimate.
6. f) Repeat parts c.i-c.iii but now use three dummies for quarter of birth as your instruments (QOB1, QOB2 and QOB3), and include all the explanatory variables (the race dummy, the SMSA dummy, the marital status dummy, the region dummies, and year of birth dummies). What is now the relationship between the first stage, the reduced form, and the 2SLS coefficient?
7. g) Compare the 2SLS estimate you obtained in part f) with the OLS coefficient you obtained in part a.ii. Comment on the results. Is the sign of the bias of the OLS coefficient what you expected?
8. h) Regress each one of the control variables you used in part f) on the three quarter of birth dummies. Report the F-statistic and the p-value for the test of the null hypothesis that all the coefficients are equal to zero. Is there any systematic pattern between the quarter of birth and the control variables? Why would one be interested in carrying out this test?
i) Define a new set of variables: 1 educ ≥ π
educ_gt_π = 0 otherwise , for π = {8,9, ... ,16}
Regress each one of these variables on the three quarter of birth dummies, and all of the control variables that you used in part f). Calculate the F-statistic for the test that all three coefficients on the quarter of birth dummies are equal to zero. For which values of ππ is the test statistic greater than 10? Why does this result support Angrist and Krueger’s identification strategy?
Part B: The earnings-experience profile
For this question, you will download the data set ps4_CPS-ASEC_1970-2020.dta. The data set contains information on earnings, wages, and demographic characteristics taken from the Annual Social and Economic Supplement of the CPS (the “March CPS”) for 1970, 1980, 1990, 2000, 2010 and 2020.
Use the describe and summarize commands in Stata to familiarize yourself with the data set. The key variable, lnearn, is the log of yearly earnings in the previous calendar year. The data includes only individuals aged 20 to 64, who were salaried workers in the previous year, and worked full-time (i.e., more than 35 hours a week) and year-round (more than 40 weeks a year). Values of earnings, weekly wages and hourly wages below the bottom 1st percentile and above the top 99th percentile were set to missing. Answer the following questions.
a) It is well known that the earnings-experience profile is concave: earnings rise with labor market experience, but at a declining rate.
i. How does the theory of human capital rationalize this finding? In your answer, make sure to explain the role of on-the-job training, and how it evolves over the life-cycle.
ii. Do you expect the earnings-experience profile steeper for college graduates or for high-school graduates? Why?
Do you expect the earnings-experience profile to be steeper for men or for women? Why?
CS 340 Milestone One Guidelines and Rubric Overview: For this assignment, you will implement the fundamental operations of create, read, update,
Retail Transaction Programming Project Project Requirements: Develop a program to emulate a purchase transaction at a retail store. This
7COM1028 Secure Systems Programming Referral Coursework: Secure
Create a GUI program that:Accepts the following from a user:Item NameItem QuantityItem PriceAllows the user to create a file to store the sales receip
CS 340 Final Project Guidelines and Rubric Overview The final project will encompass developing a web service using a software stack and impleme