# 程序代写案例-EC220

Summer 2021 Exam
EC220
Introduction to Econometrics
Suitable for all candidates
Instructions to candidates
This paper contains FOUR questi
ons, divided into two sections. Section A contains ONE question
related to Michaelmas Term and Section B contains THREE questions related to Lent Term. You
should answer ALL questions from Section A and ALL questions from Section B.
If at any point in this exam you feel that anything is unclear, please make additional assumptions that
you feel are necessary and state them clearly.
You could combine the typed document with scanned or photographed hand-drawn diagrams and
computations. The maximumword count is 1500 words, beyond which nothing will be marked. There
is no minimum word count and concise answers will be rewarded.
For Section B: Please use pen and paper and scan (or photograph) your answers. You could also use
clearly.
The answers must then be converted to pdf and uploaded to Moodle as ONE individual file together
with the Coversheet. Please make sure every single scanned page is legible and properly ordered.
The file will be run through Turnitin to ensure academic integrity.
Time Allowed Submit PDF with answers within 24 hours after official start of the exam
Expected effort Reading Time: 15 minutes
You are supplied with: Lindley & Scott Cambridge Statistical Tables
Table A5 Durbin-Watson d-statistic
You may also use: Open book examination
Calculators: Calculators are allowed in this exam
© LSE ST 2021/EC220 Page 1 of 7
Section A
Question 1
[33.34 marks]
A blowout of the BPDeepwater Horizon oil-well in April 2010 led to the largestmarine oil spill in history,
lasting until July of that year. Researchers would like to analyse whether consumers reacted to the
disaster by reducing their consumption of BP branded petrol during the oil spill. They collected data
on the prices and quantities sold at BP-branded and non-BP petrol stations across zip codes (postal
codes; small local areas) in the US. Either a zip code contains BP stations, in which case the average
price and average number of gallons sold for each of these BP stations is recorded (and the indicator
variableBP = 1), or a zip code contains no BP station, in which case the price and quantity at these
non-BP stations is recorded (and BP = 0). An observation is a particular petrol station. Non-BP
stations in BP zip codes are not used in the sample.
Prices and quantities are the averages either for the period January 2009 to March 2010 (before the
oil spill) in columns (1) and (2) or for April 2010 to July 2010 (during the oil spill) in columns (3) to (6)
of the table below. Prices are in US Dollars per gallon and coded as Price. Quantities are in logarithm
and coded as ln(sales). The researchers also constructed a variable called Green Index, which is
supposed to measure the environmental orientation of consumers in the zip code. TheGreen Index
is constructed by combining the share of hybrid vehicle registrations, per capita membership in the
Sierra Club, an environmental organisation, and per capita contributions to Green Party election funds
in the zip code, all measured prior to 2010. The Green index is then standardised to have mean 0
and standard deviation 1. Using either Price or ln(sales) as the dependent variable, the researchers
obtain the following results.
Jan 2009 - Mar 2010 Apr 2010 - Jul 2010
Price
(1)
ln(sales)
(2)
Price
(3)
ln(sales)
(4)
Price
(5)
ln(sales)
(6)
BP 0.003(0.002)
-0.004
(0.007)
-0.043
(0.002)
-0.036
(0.009)
-0.041
(0.002)
-0.033
(0.009)
Green Index 0.006(0.001)
-0.002
(0.002)
BP× Green Index -0.006(0.002)
0.013
(0.008)
Notes: All regressions also contain a constant. Robust standard errors are
in parentheses. The regressions have 5,686 observations
(a) Define the treatment, the outcome, and the counterfactuals implicit in the regression in column
(3)? What do the researchers use as the control group in this regression?
[3.34 marks]
(b) Why do the researchers run the regressions in columns (1) and (2) for the period before the oil
spill? What do you conclude from this exercise?
[6 marks]
(c) What is the average effect of the oil spill on BP prices in column (3)? Discuss whether this is
likely a causal effect.
[6 marks]
© LSE ST 2021/EC220 Page 2 of 7
(d) The researchers also have a variable available which measures the advertising expenditures by
BP in a particular zip code fromApril to July 2010. Would this variable be useful for the analysis?
Explain either how it could be used productively or why you think it is not useful.
[4 marks]
(e) What is the interpretation of the coefficient on the interaction term BP x Green Index in Column
(5)? How much lower is the BP price at a station in a zip code which is 2 standard deviations
above the mean in the Green Index compared to a similarly green control counties? Carefully
[6 marks]
(f) The average BP price during the sample period (2009-2010) is US\$2.70. Use the estimates in
columns (3) and (4) to construct a price elasticity for BP-branded petrol. Is this a demand or
[8 marks]
© LSE ST 2021/EC220 Page 3 of 7
Section B
Question 2
[22.33 marks]
Consider the bivariate regression model without intercept
yi = βxi + ui,
for i = 1, . . . , n. We impose the following assumptions.
SLR.1 The population model is y = βx+ u.
SLR.2We have a random sample of size n, {(yi, xi) : i = 1, . . . , n}, following the population model
in SLR.1.
SLR.3 The sample outcomes on {xi : i = 1, . . . , n} are not all the same value.
SLR.4 The error term u satisfies E(u|x) = 0 for any value of x.
(a) Let βˆ be the OLS estimator for the regression from y on x (without intercept). Show that βˆ is a
consistent estimator for β under SLR.1-4.
[3 marks]
(b) In addition to SLR.1-4, suppose we know that
V ar(u|x) = σ2x2.
Derive the (conditional) variance V ar(βˆ|X), whereX = (x1, . . . , xn).
[4 marks]
(c) Find an estimator for V ar(βˆ|X) derived in (b), and explain how to compute the 95% confidence
interval for β.
[3.33 marks]
(d) Now consider the weighted OLS estimator β˜ defined as
β˜ = arg min
b
n∑
i=1
w(xi) · (yi − bxi)2,
where w(x) > 0 is a positive weight function of x. Derive the expression for β˜.
[4 marks]
(e) Show that β˜ is an unbiased estimator for β under SLR.1-4.
[4 marks]
SLR.5 The error term u satisfies V ar(u|x) = σ2 for any value of x (homoskedasticity).
Under SLR.1-5, derive the (conditional) variance V ar(β˜|X).
[4 marks]
© LSE ST 2021/EC220 Page 4 of 7
Question 3
[22.33 marks]
(i) Consider the bivariate regression model yi = β0 + β1xi + ui for i = 1, . . . , n. If the t
statistic to test the null hypothesisH0 : β1 = 0 is zero, then R2 is also zero.
[4 marks]
(ii) Consider the bivariate regression model yi = β0 +β1xi+ui for i = 1, . . . , n. If one rejects
the null hypothesis
H0 : β1 = 10 in favor ofH1 : β1 > 10
at significance level α, then he/she would certainly reject the null hypothesis
H0 : β1 = 10 in favor ofH1 : β1 6= 10
at the same significance level.
[4 marks]
(iii) Consider a multiple regression model y = β0 +β1x1 +β2x2 +u, where u is independent of
(x1, x2) and u ∼ Normal(0, σ2) (i.e., Assumptions MLR.1-6 are satisfied). If the sample
correlation between x1 and x2 is extremely high (say, 0.99), then the t statistic for testing
the hypothesisH0 : β2 = 0 does not follow the t distribution underH0.
[3 marks]
© LSE ST 2021/EC220 Page 5 of 7
(b) We are interested in evaluating whether the decision by loan officials to deny a mortgage may
be racially motivated. Let the binary variable deny equal 1 if the application for a mortgage was
denied, and deny = 0 if an application for a mortgage was successful. minority is a dummy vari-
able which indicates if the applicant belongs to an ethnic minority group (1 = yes) or not (0 = no).
The data set contains information on a wide range of variables which a loan officermight legally
consider when deciding on amortgage application.Wewill restrict our attention to the variables:
pirat (ratio of total monthly debt payment to total monthly income), lvrat_med and lvrat_high
(dummy variables indicating whether the loan-to-value ratio is intermediate or high, with the ex-
cluded dummy being low), and a consumer credit score, chist (which ranges from 1 to 5, where 5
is the worst rating). We have a random sample of 2,380 observations. There are 285 denied ap-
plications, the average of pirat equals 0.33, and the average of chist equals 2.12. The following
table provides various regression results based on this data thatmay shed light on this question.
LPM logit (1) Logit (2) Logit (3)
(minority)
Logit (4)
(non-minority)
minority .113
(.024)
.777
(.160)
- - -
pirat .483
(.072)
5.039
(.752)
5.283
(.759)
5.165
(1.579)
5.013
(.860)
lvrat_med .053
(.014)
.643
(.145)
.748
(.143)
.190
(.285)
.788
(.168)
lvrat_high .251
(.051)
1.759
(.281)
1.904
(.277)
1.182
(.497)
1.973
(.336)
chist .041
(.005)
.332
(.035)
.366
(.034)
.318
(.065)
.340
(.041)
intercept − .171
(.024)
−5.144
(.305)
−5.209
(.307)
−4.039
(.643)
−5.238
(.352)
n 2, 380 2, 380 2, 380 339 2, 041
R2 .1427
logL −726.469 −737.605 −177.477 −547.051
The usual standard errors for the logit model and the robust standard errors for the LPM are
reported in parentheses.
(i) Let us consider the results that are based on the linear probability model (LPM). Provide
the interpretation of parameter estimate on minority, let us denote it by βˆminority, and test
whether the effect is statistically significant. Clearly indicate the assumptions you make
use of for your interpretation and for the test you conduct.
[4 marks]
(ii) Obtain the partial effect of being a minority on the mortgage denial rate using the logit
model, when evaluated at the mean values of the explanatory variables and given an inter-
mediate loan-to-value ratio. Briefly indicate whether you expect this effect to be significant
(no formal test expected), and should we expect this effect to be the same given a high
loan-to-value ratio?
[4.33 marks]
(iii) Let us define Pr (deny = 1|x) = Λ(α0 + α1pirat + α2lvrat_med + α3lvrat_high + α4chist),
where Λ(z) = exp(z)/(1 + exp(z)). You are interested in deciding whether there is evi-
dence that the decision rule taken by the loan officer is the same for minorities as it is for
non-minorities. Using the results provided, conduct this test. Clearly specify the null and
the alternative hypothesis, the test statistic and its distribution under the null, the rejection
[3 marks]
© LSE ST 2021/EC220 Page 6 of 7
Question 4
[22 marks]
Let us consider the following stationary, weakly dependent, time series model
yt = α + ρ1yt−1 + ρ2yt−2 + βxt + εt, t = 1, ..., T.
The error process hasmean zero and exhibits autocorrelation. Youmay assume that εt is independent
of xs for all s ≤ t, and y0 and y−1 are available and equal 0.
(a) Discuss how you would test the null hypothesis that the error process does not exhibit autocor-
relation against the alternative that the error can be represented by a stationary AR(2) process.
In your answer you should clearly indicate what an AR(2) process is.
[5 marks]
(b) Briefly discuss the properties of the OLS estimator when applied to the above model (unbiased-
suitable arguments.
[4 marks]
(c) What is the purpose of heteroskedasticity and autocorrelation robust (HAC) standard errors?
Discuss whether HAC standard errors can resolve themain problem associated with estimating
the model using OLS.
[3 marks]
(d) Describe in detail amethod to resolve themain problemwith usingOLS to estimate (α, ρ1, ρ2, β).
Clearly indicate the assumptions underlying this method. If you think there is no method that
can resolve the main problem, explain why.
[5 marks]
(e) What is the Long Run Propensity (long run effect that a permanent change in xt has on yt) in
this model and what is the Impact Propensity? Describe how you would conduct, using a single
linear hypothesis, a test for the hypothesis that the LRP is the same as the Impact Propensity.
[5 marks]
END OF PAPER
© LSE ST 2021/EC220 Page 7 of 7

Email:51zuoyejun

@gmail.com