Recap
Shrinkage Estimators
• LASSO (L1 penalty)
  • Shrinks some coefficient estimates all the way to 0
• Ridge (L2 penalty)
  • Shrinks coefficient estimates (not all the way to 0)
• Relaxed LASSO: decouples selection from shrinkage
• Many others
Linear Regression
$$y = X\beta + \epsilon, \qquad \epsilon \sim \text{MVN}(0, \sigma^2 I)$$
Assumptions:
1. Linearity
2. Independence
3. Normality
4. Equal variance (homoskedasticity)
Estimate via OLS:
$$\min_{\beta} \sum_i (y_i - x_i^T \beta)^2$$
yields $\hat{\beta} = (X^T X)^{-1} X^T y$, and we have shown
$$\hat{\beta} \sim N(\beta, \sigma^2 (X^T X)^{-1})$$
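As a quick numerical companion to these formulas, here is a minimal numpy sketch on simulated data (the design, coefficients, and seed below are arbitrary choices for illustration, not from the course):

```python
import numpy as np

rng = np.random.default_rng(331)  # arbitrary seed

# Simulated design: intercept plus two covariates
n = 200
X = np.column_stack([np.ones(n), rng.normal(size=n), rng.normal(size=n)])
beta = np.array([1.0, 2.0, -0.5])
y = X @ beta + rng.normal(scale=1.5, size=n)

# beta_hat = (X'X)^{-1} X'y; solve() is preferred over an explicit inverse
beta_hat = np.linalg.solve(X.T @ X, X.T @ y)

# Estimated covariance: sigma_hat^2 (X'X)^{-1}, with p = number of columns
resid = y - X @ beta_hat
sigma2_hat = resid @ resid / (n - X.shape[1])
cov_beta_hat = sigma2_hat * np.linalg.inv(X.T @ X)
print(beta_hat, np.sqrt(np.diag(cov_beta_hat)))
```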
What happens when our assumptions are broken?
Estimation
Minimizing $\sum_i (y_i - x_i^T \beta)^2$ is still a reasonable thing to do, so we can get our usual OLS estimator $\hat{\beta}$.

What about the appealing features of $\hat{\beta}$?

$$E[\hat{\beta}] = (X^T X)^{-1} X^T E[y]$$

Under linearity, $E[y] = X\beta$, hence

$$E[\hat{\beta}] = (X^T X)^{-1} X^T X \beta = \beta$$

If linearity does not hold, $E[\hat{\beta}] \neq \beta$.

Note the other three assumptions were not necessary for unbiased estimates!
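A small Monte Carlo sketch of this point, with made-up coefficients: when the true mean is linear in $x$, the OLS estimates average out to the truth; when the true mean has an $x^2$ term we omitted, the fitted intercept drifts away from its nominal value.

```python
import numpy as np

rng = np.random.default_rng(1)
n, reps = 100, 2000
x = rng.normal(size=n)
X = np.column_stack([np.ones(n), x])

def mean_beta_hat(mu):
    """Average OLS estimates over repeated error draws for a fixed mean vector."""
    est = np.zeros(2)
    for _ in range(reps):
        y = mu + rng.normal(size=n)
        est += np.linalg.solve(X.T @ X, X.T @ y)
    return est / reps

# Linear truth E[y] = 1 + 2x: estimates average near (1, 2)
print(mean_beta_hat(1 + 2 * x))

# Nonlinear truth E[y] = 1 + 2x + 3x^2, fitted with a line in x only:
# the intercept estimate drifts toward 1 + 3*mean(x^2), far from 1
print(mean_beta_hat(1 + 2 * x + 3 * x**2))
```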
Inference
What about SEs?
$$\text{Var}(\hat{\beta}) = \text{Var}((X^T X)^{-1} X^T y) = (X^T X)^{-1} X^T \text{Var}(y) \, X (X^T X)^{-1}$$

Under Independence and Homoskedasticity, $\text{Var}(y) = \sigma^2 I$:

$$\text{Var}(\hat{\beta}) = (X^T X)^{-1} X^T \sigma^2 I \, X (X^T X)^{-1} = \sigma^2 (X^T X)^{-1}$$
But if either assumption is not met, our variance estimates will be incorrect, and hence our SEs, CIs, etc. are invalid.
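A simulation sketch of how this failure looks in practice (error structure and seed invented for illustration): under heteroskedastic errors, the SE reported by the usual formula no longer matches the actual sampling variability of the slope.

```python
import numpy as np

rng = np.random.default_rng(7)
n, reps = 200, 2000
x = rng.normal(size=n)
X = np.column_stack([np.ones(n), x])
XtX_inv = np.linalg.inv(X.T @ X)

slopes, nominal_ses = [], []
for _ in range(reps):
    # Heteroskedastic errors: sd grows with |x|
    y = 1 + 2 * x + rng.normal(scale=1 + np.abs(x))
    beta_hat = XtX_inv @ X.T @ y
    resid = y - X @ beta_hat
    s2 = resid @ resid / (n - 2)
    slopes.append(beta_hat[1])
    nominal_ses.append(np.sqrt(s2 * XtX_inv[1, 1]))

# If the usual formula were valid, these two numbers would roughly agree
print("Monte Carlo SD of slope:", np.std(slopes))
print("Average nominal SE:     ", np.mean(nominal_ses))
```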
A Note on Normality
Without Normality, $\hat{\beta}$ is no longer a linear transformation of a MVN vector, hence it is no longer Normally distributed, so our CIs and tests are not necessarily valid.

However, in large samples, $\hat{\beta}$ is approximately Normally distributed due to the Central Limit Theorem:

$$\frac{\sqrt{n}\,(\bar{z} - E[z])}{\sqrt{\text{Var}(z)}} \xrightarrow{d} N(0, 1)$$

So we can get away with valid inference despite non-normal errors in "large enough" samples.

• Replace critical $t_{n-p-1,\,\alpha/2}$ values with $z_{\alpha/2}$ (compare the values below)
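A quick check (assuming scipy is available) that the t critical value approaches the Normal one as the residual degrees of freedom grow, which is why the swap is harmless in large samples:

```python
from scipy import stats

# Two-sided 95% critical values: t_{df, 0.025} vs z_{0.025}
for df in [10, 30, 100, 1000]:
    print(df, stats.t.ppf(0.975, df), stats.norm.ppf(0.975))
```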
Prediction
Prediction intervals explicitly require Normality: $y_{\text{new}} \sim N(x_{\text{new}}^T \beta, \sigma^2)$

Without Normality, our prediction intervals are invalid.

• Predictions are still unbiased, however (why?)

Prediction intervals are sensitive to all 4 assumptions
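For concreteness, a minimal statsmodels sketch of a prediction interval at a hypothetical new point (data simulated, point chosen arbitrarily):

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
x = rng.normal(size=100)
y = 1 + 2 * x + rng.normal(size=100)
res = sm.OLS(y, sm.add_constant(x)).fit()

# 95% intervals at a hypothetical new point x_new = 0.5;
# obs_ci_* columns are the prediction interval, mean_ci_* the CI for the mean
x_new = np.array([[1.0, 0.5]])  # [intercept, x]
print(res.get_prediction(x_new).summary_frame(alpha=0.05))
```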
Model Diagnostics
Each nice feature of regression relies on one of our assumptions (to varying degrees).

Once we have fit a model, we need some tools to diagnose whether our assumptions are broken.
Residuals
One of the best tools for diagnostics is to visualize residuals.
We can use ordinary residuals: $e_i = y_i - \hat{y}_i$

Could also use studentized residuals:
$$r_i = \frac{e_i}{\hat{\sigma}\sqrt{1 - h_i}}$$
where $h_i$ is the $i$th diagonal of $H$.
Intuition for studentized residuals

• Recall: $e = (I - H)y \sim N(0, \sigma^2(I - H))$
• Hence $e_i \sim N(0, \sigma^2(1 - h_i))$
• I.e., the $e_i$ have different variances, so it is difficult to learn anything about their distribution
• By contrast, $e_i / \sqrt{1 - h_i}$ has constant variance $\sigma^2$, so they should look normally distributed when plotted
• Note: in practice we estimate $\sigma$ with $\hat{\sigma}$, so the studentized residuals are really t-distributed
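A sketch of the formula by hand, checked against statsmodels (simulated data; statsmodels calls these "internally studentized" residuals):

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(3)
X = sm.add_constant(rng.normal(size=(50, 2)))
y = X @ np.array([1.0, 2.0, -1.0]) + rng.normal(size=50)
res = sm.OLS(y, X).fit()

# By hand: r_i = e_i / (sigma_hat * sqrt(1 - h_i)), with h_i the diagonal
# of the hat matrix H = X (X'X)^{-1} X'
H = X @ np.linalg.solve(X.T @ X, X.T)
h = np.diag(H)
sigma_hat = np.sqrt(res.mse_resid)
r_manual = res.resid / (sigma_hat * np.sqrt(1 - h))

r_sm = res.get_influence().resid_studentized_internal
print(np.allclose(r_manual, r_sm))  # the two computations agree
```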
Assessing Normality
Histogram of studentized residuals

• Should look like the density of N(0,1)

Normal QQ plots

• Compare sample quantiles to theoretical quantiles of N(0,1)
• Points should fall on the 45-degree line
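A sketch of both plots on a simulated fit (data and seed are arbitrary; any fitted OLS results object would do in place of `res`):

```python
import numpy as np
import matplotlib.pyplot as plt
import statsmodels.api as sm

rng = np.random.default_rng(4)
X = sm.add_constant(rng.normal(size=(200, 1)))
y = X @ [1.0, 2.0] + rng.normal(size=200)
res = sm.OLS(y, X).fit()
r = res.get_influence().resid_studentized_internal

fig, axes = plt.subplots(1, 2, figsize=(8, 3.5))
axes[0].hist(r, bins=20, density=True)   # should resemble the N(0,1) density
axes[0].set(title="Studentized residuals")
sm.qqplot(r, line="45", ax=axes[1])      # points should track the 45-degree line
plt.tight_layout()
plt.show()
```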
Assessing Normality

[Figure]
Assessing Heteroskedasticity
Plot residuals against fitted values

• Can detect mean-variance relationships, e.g., higher variance for larger fitted values (see the sketch below)
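A sketch on simulated data where the variance deliberately grows with the mean (the error structure below is invented for illustration):

```python
import numpy as np
import matplotlib.pyplot as plt
import statsmodels.api as sm

rng = np.random.default_rng(5)
x = rng.uniform(0, 3, size=200)
# Variance grows with the mean: a classic mean-variance relationship
y = 1 + 2 * x + rng.normal(scale=0.5 + x)
res = sm.OLS(y, sm.add_constant(x)).fit()

plt.scatter(res.fittedvalues, res.get_influence().resid_studentized_internal, s=10)
plt.axhline(0, linestyle="--")
plt.xlabel("Fitted values"); plt.ylabel("Studentized residuals")
plt.show()  # a funnel shape (wider spread at larger fits) suggests heteroskedasticity
```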
Assessing Heteroskedasticity

[Figure]
Assessing Independence
Difficult to visualize unless you have something like time-series data (a sketch follows below)

(Also: note that residuals are not independent even when the errors are!)

• Clear from $\sum_i e_i = 0$ (when the model includes an intercept)
Instead consider how data were collected:
• Observations on patients, clustered within hospitals
• Students within classes
• Following a person over time
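For the time-series case mentioned above, one simple visualization is residuals in observation order. A sketch with AR(1) errors (the dependence structure and coefficients below are invented for illustration):

```python
import numpy as np
import matplotlib.pyplot as plt
import statsmodels.api as sm

rng = np.random.default_rng(6)
n = 200
t = np.arange(n)
# AR(1) errors: each error carries over part of the previous one
eps = np.zeros(n)
for i in range(1, n):
    eps[i] = 0.8 * eps[i - 1] + rng.normal()
y = 1 + 0.05 * t + eps
res = sm.OLS(y, sm.add_constant(t.astype(float))).fit()

plt.plot(t, res.resid, marker=".", linestyle="-")
plt.xlabel("Observation order"); plt.ylabel("Residual")
plt.show()  # long runs above/below zero hint at serial dependence
```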
Assessing Linearity in SLR
Consider simple linear regression:
$$y_i = \beta_0 + \beta_1 x_i + \epsilon_i$$

The linearity assumption is
$$E[y_i] = \beta_0 + \beta_1 x_i$$

Residuals: $e_i = y_i - (\hat{\beta}_0 + \hat{\beta}_1 x_i)$

Plot $y_i$ against $x_i$

• Should look linear!

Plot residuals $e_i$ against $x_i$

• A conspicuous pattern may indicate non-linearity
• Should look fairly "random"
• Non-linearity is sometimes easier to spot here than in a plot of $y_i$ against $x_i$
Assessing Linearity

[Figure]
Assessing Linearity in MLR
Plotting $y_i$ against a single $x_i$ ignores the effect of all the other covariates!

Instead, visualize partial regression plots (added-variable plots).

To assess linearity in $x^*$:

1. Regress $y$ on the other covariates (all $x_j$ except $x^*$)
   • Get the fitted values from this model fit, and compute the residuals, $e_y$
2. Regress $x^*$ on the other covariates (all $x_j$ except $x^*$)
   • Get the fitted values from this model fit, and compute the residuals, $e_{x^*}$
3. Plot $e_y$ against $e_{x^*}$

Intuitively: we are isolating the $y \sim x^*$ relationship, after adjusting for the other covariates (a sketch follows below)
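A sketch of the three steps done by hand on simulated data (names like `x_star` and the coefficients are invented for the example):

```python
import numpy as np
import matplotlib.pyplot as plt
import statsmodels.api as sm

rng = np.random.default_rng(8)
n = 200
x_star = rng.normal(size=n)
x2 = rng.normal(size=n)
y = 1 + 2 * x_star + 0.5 * x2 + rng.normal(size=n)

others = sm.add_constant(x2)                 # all covariates except x*
e_y = sm.OLS(y, others).fit().resid          # step 1: residuals of y
e_x = sm.OLS(x_star, others).fit().resid     # step 2: residuals of x*
plt.scatter(e_x, e_y, s=10)                  # step 3: plot e_y vs e_x*
plt.xlabel("e_x*"); plt.ylabel("e_y")
plt.show()  # a straight-line pattern supports linearity in x*
```

statsmodels also provides these plots directly via `sm.graphics.plot_partregress_grid(res)` on a fitted results object.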
Assessing Linearity

[Figure]
Practice Q1
Load the HSB2.csv data, and consider only the first 100 observations.
Regress math on locus, concept, mot.
(a) Are you comfortable with the equal-variance (homoskedasticity) assumption?
(b) Are you comfortable with the normality assumption?
(c) Are you comfortable with the linearity assumption?
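A starter sketch for this question, assuming HSB2.csv sits in the working directory with the column names given above:

```python
import pandas as pd
import matplotlib.pyplot as plt
import statsmodels.api as sm
import statsmodels.formula.api as smf

df = pd.read_csv("HSB2.csv").head(100)   # first 100 observations
res = smf.ols("math ~ locus + concept + mot", data=df).fit()
r = res.get_influence().resid_studentized_internal

plt.scatter(res.fittedvalues, r, s=10)   # (a) look for a funnel shape
sm.qqplot(r, line="45")                  # (b) look for departures from the line
sm.graphics.plot_partregress_grid(res)   # (c) one added-variable plot per covariate
plt.show()
```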
Practice Q2
First generate a normally distributed covariate, $x_i$. Now simulate a dataset in each of the settings below (assuming all other assumptions hold as usual). Regress $y_i$ on $x_i$ as usual and examine the diagnostic plots to investigate the impact of these violations. (A starter sketch for one setting follows the list.)
(a) Generate $y_i = \beta_0 + \beta_1 x_i + \epsilon_i$ where $\epsilon_i \overset{iid}{\sim} N(0, 2)$
(b) Generate $y_i = \beta_0 + \beta_1 x_i + \epsilon_i$ where $\epsilon_i \overset{iid}{\sim} \text{Unif}(-2, 2)$
(c) Generate $y_i = \beta_0 + \beta_1 x_i + \epsilon_i$ where $\epsilon_i \overset{iid}{\sim} N(0, 2 + |x_i|)$
(d) Generate $y_i = \beta_0 + \beta_1 x_i + \epsilon_i$ where $\epsilon_i \overset{iid}{\sim} N(0, 1 \cdot I(x_i \le 0) + 3 \cdot I(x_i > 0))$
(e) Generate $y_i = \beta_0 + \beta_1 x_i^3 + \epsilon_i$ where $\epsilon_i \overset{iid}{\sim} N(0, 2)$
(f) Generate $y_i = \beta_0 + \beta_1 \exp(x_i) + \epsilon_i$ where $\epsilon_i \overset{iid}{\sim} N(0, 2)$
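A starter sketch for setting (c), reading $N(0, v)$ as variance $v$; the true coefficients and seed are arbitrary choices, and swapping the `eps` line adapts it to the other settings:

```python
import numpy as np
import matplotlib.pyplot as plt
import statsmodels.api as sm

rng = np.random.default_rng(331)
n = 200
x = rng.normal(size=n)

# Setting (c): heteroskedastic errors with sd = sqrt(2 + |x_i|)
eps = rng.normal(scale=np.sqrt(2 + np.abs(x)))
y = 1 + 2 * x + eps                      # beta_0 = 1, beta_1 = 2 (arbitrary)

res = sm.OLS(y, sm.add_constant(x)).fit()
plt.scatter(res.fittedvalues, res.get_influence().resid_studentized_internal, s=10)
plt.xlabel("Fitted values"); plt.ylabel("Studentized residuals")
plt.show()
```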
