代写辅导接单-ECON7320: Advanced Microeconometrics

欢迎使用51辅导,51作业君孵化低价透明的学长辅导平台,服务保持优质,平均费用压低50%以上! 51fudao.top

ECON7320: Advanced Microeconometrics

Problem Set 1

Fu Ouyang

March 18, 2025

Instruction

Answer all questions and clearly label your answers. For empirical questions, you should show

your R script(s) and outputs (e.g., screenshots for commands, tables, and figures, etc.). You

will lose2 points

whenever you fail to provide R commands and outputs. When you are asked

to explain or discuss something, your response should be brief and compact. You should upload

your assignment (in PDF or Word format) via the “Turnitin” submission link (in the “Problem

Set 1” folder under “Assessment”) by 16:59 on the due date April 1, 2025. Do not hand in a

hard copy. You are allowed to work on this assignment in groups; that is, you can discuss how

to answer these questions with your group members. However, this isnot

a group assignment,

which means that you must answer all the questions in your own words and submit your work

separately. The marking system will check the similarity, and UQ’s student integrity and

misconduct policies on plagiarism apply.

OLS (30 points)

Use thecps09mardataset described in Tutorial 1. Take the sub-sample of non-Hispanic women

to estimate the following wage equation:

log(wage) =β

0

1

education+β

2

experience+β

3

experience

2

/100 +u,(1)

wherewage=earnings/(hours×week) andexperience=age−education−6.

(a) Estimate equation (1) using the described sub-sample and compute theR

2

(5 points).

(b) Include a set of dummy variables for regions and marital status in (1) and estimate the

extended model. For regions, create dummy variables for Northeast, South, and West

so that Midwest is the excluded group. For marital status, create variables for married

(marital≤3), widowed or divorced, and separated, so that single (never married) is

the excluded group (6 points). Calculate standard errors using the HC0, HC1, and HC3

methods (3 points). Are they very different?

(c) In what follows, use the estimation results obtained from (a). Letθbe the ratio of the

return to one year of education to the return to one year of experience forexperience=

10. Writeθas a function of the regression coefficients and compute

ˆ

θfrom the estimated

model (3 points). Suppose the OLS estimator for model (1) is consistent. Is

ˆ

θconsistent

(3 points)? Hint: Apply continuous mapping theorem.

(d) Compute the regression function ateducation= 16 andexperience= 10 (2 points).

Compute a 95% confidence interval for the regression function at this point (3 points).

Hint: You can use theglht()function in themultcomppackage.

1

(e) Consider the same out-of-sample individual as in (d) (education= 16,experience= 10).

Construct an 90% forecast interval for their wage (5 points). Hint: To obtain the forecast

interval for the wage, apply the exponential function to both endpoints.

MLE: Tobit Regression (25 points)

Consider the Tobin (1958) regression model:

Y

=X

T

β+ewithe|X∼N(0,σ

2

),(2)

Y= max{Y

,0}.(3)

Tobin (1958) used this model to study household consumptionYof durable goods. He observed

that in survey data,Yis zero for a positive fraction of households. He proposed treating the

observedYas acensoredrealization from a latent continuous variableY

(like “willingness to

pay”); that is,Y=Y

whenY

>0 andY= 0 whenY

≤0. Here the observed variableYis

censoredY

from below at zero. After all, negative pay is infeasible. This model is known as

Tobit regression,censored regression, orType I Tobit model.

Sincee|X∼N(0,σ

2

) is assumed in (2), Tobit model (2)–(3) is parametric and its unknown

parameters (β,σ) can be estimated using the maximum likelihood method. By definition, it is

easy to write out the distribution function ofYconditional onX:

P(Y≤y|X=x) =P(Y

≤0|X=x)

1[y≤0]

·P(Y

≤y|X=x)

1[y>0]

=P(x

T

β+e≤0)

1[y≤0]

·P(x

T

β+e≤y)

1[y>0]

= Φ(−x

T

β/σ)

1[y≤0]

·Φ((y−x

T

β)/σ)

1[y>0]

,(4)

where1[·] is the indicator function

1

and Φ(·) is the CDF ofN(0,1). Taking derivative of (4)

with respect toyyields the likelihood function:

f

Y|X

(y|x) = Φ(−x

T

β/σ)

1[y≤0]

·[σ

−1

φ((y−x

T

β)/σ)]

1[y>0]

,(5)

whereφ(·) is the PDF ofN(0,1).

Use theCHJ2004dataset (see the data description file). The variablestinkindandincome

are household transfers received in-kind and household income, respectively. Divide both vari-

ables by 1000 to standardize.

(a) Estimate a linear regression oftinkindonincomeandincome

2

(5 points).

(b) Calculate the percentage of censored observations (tinkind= 0) (3 points). Estimate a

linear regression oftinkindonincomeandincome

2

by omitting the censored observations

(5 points).

(c) Estimate a Tobit regression oftinkindonincomeandincome

2

(6 points). Explain the

differences between your results in (a)–(c) (6 points). Hint: You can use thetobit()

function in theAERpackage. Alternatively, you can also use (5) to code up the maximum

likelihood estimation and inference by yourself and check if you can replicate the results

returned bytobit(). But this is optional.

1

1[A] = 1 if eventAoccurs and1[A] = 0, otherwise.

2

2SLS and GMM (25 points)

In an influential paper, David Card (1995) suggested that if a potential student lives close to a

college, this reduces the cost of attendance and raises the likelihood that the student will attend

college. However, college proximity does not directly affect a student’s skills or abilities, so it

should not affect their wage. These considerations suggest that college proximity can be a valid

IV for education in a wage regression.

Use theCard1995dataset to replicate the baseline analysis conducted in Card (1995). Con-

sider the following model:

log(wage) =β

0

1

education+β

2

experience+β

3

experience

2

/100

4

black+β

5

south+β

6

smsa+u,(6)

whereeducationis years of schooling andexperience=age−education−6. For all the

questions below, only use data for 1976. See the data description file for variable definitions.

(a) Estimate model (6) by OLS and 2SLS (usingnearc4as the instrument foreducation)

(8 points).

(b) Estimate model (6) by 2SLS using instruments{nearc4a,nearc4b}(3 points). What is

the impact on the structural estimate of model (6) (2 points)?

(c) Are the instruments used in (b) strong or weak? Test it (4 points).

(d) Use the 2SLS regression in (b) to test the exogeneity ofeducation(4 points).

(e) Use the 2SLS regression in (b) to test the exogeneity of instruments (4 points).

(f) (Optional) Re-estimate model (6) using the same set of instruments as in (a) by efficient

GMM. Do the results change meaningfully? Hint: Use thegmm()function in thegmm

package. This question has no points; it is here just because we don’t have room for it in

Tutorial 4.

Theoretical Questions (20 points)

1. Prove that the regression errors of LPM must be heteroskedastic (5 points).

2. Prove thatE[Y|X] = arg min

g

E[(Y−g(X))

2

]; i.e.,E[Y|X] has the smallestmean squared

error(MSE) among all functions ofX. Hint: Consider (Y−g(X))

2

= [(Y−E[Y|X]) +

(E[Y|X]−g(X))]

2

and apply the law of iterated expectation (LIE) (5 points).

3. Consider the following simple linear regression model:

y

i

=βx

i

i

, i= 1,...,n,(7)

where{x

i

,y

i

}

n

i=1

are i.i.d.,E[ε

i

|x

i

] = 0, and V[ε

i

|x

i

] =σ

2

. Supposex

i

>1 andE[x

2

i

]<∞

for alli= 1,...,n. We have the following three estimators ofβ.

ˆ

β

A

=

n

X

i=1

x

i

y

i

/

n

X

i=1

x

2

i

,

ˆ

β

B

=

n

X

i=1

y

i

/

n

X

i=1

x

i

,

ˆ

β

C

=

1

n

n

X

i=1

y

i

x

i

.

3

(a) Show that

ˆ

β

A

,

ˆ

β

B

and

ˆ

β

C

are all unbiased (6 points). Hint: Use (7) and LIE.

(b) Among

ˆ

β

A

,

ˆ

β

B

and

ˆ

β

C

, which one has the smallest variance? Why? (2 points) Hint:

Review the Gauss-Markov Theorem.

2

(c) Show that

ˆ

β

C

is consistent (2 points). Hint: Apply WLLN and LIE.

2

When we say an estimator is linear, we mean the estimator is a linear function ofy.

4

51作业君

Email:51zuoyejun

@gmail.com

添加客服微信: Fudaojun0228