Introduction to Econometrics (3rd Updated Edition)
by
James H. Stock and Mark W. Watson
Solutions to End-of-Chapter Exercises: Chapter 17*
(This version August 17, 2014)
*Limited distribution: For Instructors Only. Answers to all odd-numbered questions
are provided to students on the textbook website. If you find errors in the solutions,
please pass them along to us at [email protected].
©2015 Pearson Education, Inc.
17.1. (a) Suppose there are n observations. Let b1 be an arbitrary estimator of β1. Given the estimator b1, the sum of squared errors for the given regression model is

$$\sum_{i=1}^{n} (Y_i - b_1 X_i)^2.$$

β̂1RLS, the restricted least squares estimator of β1, minimizes the sum of squared errors. That is, β̂1RLS satisfies the first-order condition for the minimization, which requires that the derivative of the sum of squared errors with respect to b1 equal zero:

$$\sum_{i=1}^{n} 2(Y_i - b_1 X_i)(-X_i) = 0.$$

Solving for b1 from the first-order condition leads to the restricted least squares estimator

$$\hat{\beta}_1^{RLS} = \frac{\sum_{i=1}^{n} X_i Y_i}{\sum_{i=1}^{n} X_i^2}.$$
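As an added illustration (not part of the printed solution), a minimal Python sketch: it simulates data from Yi = β1Xi + ui and confirms that the closed-form restricted LS slope ΣXiYi/ΣXi² agrees with a generic numerical least-squares solver. All parameter values are arbitrary.

import numpy as np

rng = np.random.default_rng(0)
n, beta1 = 500, 2.0
X = rng.normal(size=n)
u = rng.normal(size=n)
Y = beta1 * X + u

# Closed-form restricted LS estimator (no intercept): sum(X*Y)/sum(X^2)
beta_rls = (X * Y).sum() / (X ** 2).sum()

# Same answer from a generic least-squares solve on the one-regressor model
beta_lstsq = np.linalg.lstsq(X[:, None], Y, rcond=None)[0][0]
assert np.isclose(beta_rls, beta_lstsq)
print(beta_rls)  # close to beta1 = 2.0 in large samples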
(b) We show first that β̂1RLS is unbiased. We can represent the restricted least squares estimator β̂1RLS in terms of the regressors and errors:

$$\hat{\beta}_1^{RLS} = \frac{\sum_{i=1}^{n} X_i Y_i}{\sum_{i=1}^{n} X_i^2} = \frac{\sum_{i=1}^{n} X_i (\beta_1 X_i + u_i)}{\sum_{i=1}^{n} X_i^2} = \beta_1 + \frac{\sum_{i=1}^{n} X_i u_i}{\sum_{i=1}^{n} X_i^2}.$$

Thus

$$E(\hat{\beta}_1^{RLS}) = \beta_1 + E\left(\frac{\sum_{i=1}^{n} X_i u_i}{\sum_{i=1}^{n} X_i^2}\right) = \beta_1 + E\left[\frac{\sum_{i=1}^{n} X_i E(u_i | X_1, \ldots, X_n)}{\sum_{i=1}^{n} X_i^2}\right] = \beta_1,$$

where the second equality follows by using the law of iterated expectations, and the third equality follows from

$$\frac{\sum_{i=1}^{n} X_i E(u_i | X_1, \ldots, X_n)}{\sum_{i=1}^{n} X_i^2} = 0$$
because the observations are i.i.d. and E(ui | Xi) = 0. (Note that E(ui | X1, …, Xn) = E(ui | Xi) because the observations are i.i.d.)
Under assumptions 1-3 of Key Concept 17.1, β̂1RLS is asymptotically normally distributed. The large-sample normal approximation to the limiting distribution of β̂1RLS follows from considering

$$\hat{\beta}_1^{RLS} - \beta_1 = \frac{\sum_{i=1}^{n} X_i u_i}{\sum_{i=1}^{n} X_i^2} = \frac{\frac{1}{n}\sum_{i=1}^{n} X_i u_i}{\frac{1}{n}\sum_{i=1}^{n} X_i^2}.$$
Consider first the numerator, which is the sample average of vi = Xiui. By assumption 1 of Key Concept 17.1, vi has mean zero: E(Xiui) = E[XiE(ui|Xi)] = 0. By assumption 2, vi is i.i.d. By assumption 3, var(vi) is finite. Let $\bar{v} = \frac{1}{n}\sum_{i=1}^{n} X_i u_i$; then $\sigma_{\bar{v}}^2 = \sigma_v^2/n$. Using the central limit theorem, the sample average satisfies

$$\frac{\bar{v}}{\sigma_{\bar{v}}} = \frac{1}{\sigma_v \sqrt{n}} \sum_{i=1}^{n} v_i \xrightarrow{d} N(0, 1),$$

or

$$\frac{1}{\sqrt{n}} \sum_{i=1}^{n} X_i u_i \xrightarrow{d} N(0, \sigma_v^2).$$
For the denominator, $X_i^2$ is i.i.d. with finite variance (because Xi has a finite fourth moment), so that by the law of large numbers

$$\frac{1}{n} \sum_{i=1}^{n} X_i^2 \xrightarrow{p} E(X_i^2).$$

Combining the results on the numerator and the denominator and applying Slutsky's theorem leads to
$$\sqrt{n}(\hat{\beta}_1^{RLS} - \beta_1) = \frac{\frac{1}{\sqrt{n}} \sum_{i=1}^{n} X_i u_i}{\frac{1}{n} \sum_{i=1}^{n} X_i^2} \xrightarrow{d} N\left(0, \frac{\mathrm{var}(X_i u_i)}{[E(X_i^2)]^2}\right).$$
(c) β̂1RLS is a linear estimator:

$$\hat{\beta}_1^{RLS} = \frac{\sum_{i=1}^{n} X_i Y_i}{\sum_{i=1}^{n} X_i^2} = \sum_{i=1}^{n} a_i Y_i, \quad \text{where } a_i = \frac{X_i}{\sum_{j=1}^{n} X_j^2}.$$

The weight ai (i = 1, …, n) depends on X1, …, Xn but not on Y1, …, Yn. Thus

$$\hat{\beta}_1^{RLS} = \beta_1 + \frac{\sum_{i=1}^{n} X_i u_i}{\sum_{i=1}^{n} X_i^2}.$$
β̂1RLS is conditionally unbiased because

$$E(\hat{\beta}_1^{RLS} | X_1, \ldots, X_n) = E\left(\beta_1 + \frac{\sum_{i=1}^{n} X_i u_i}{\sum_{i=1}^{n} X_i^2} \,\Big|\, X_1, \ldots, X_n\right) = \beta_1 + E\left(\frac{\sum_{i=1}^{n} X_i u_i}{\sum_{i=1}^{n} X_i^2} \,\Big|\, X_1, \ldots, X_n\right) = \beta_1.$$

The final equality uses the fact that

$$E\left(\frac{\sum_{i=1}^{n} X_i u_i}{\sum_{i=1}^{n} X_i^2} \,\Big|\, X_1, \ldots, X_n\right) = \frac{\sum_{i=1}^{n} X_i E(u_i | X_1, \ldots, X_n)}{\sum_{i=1}^{n} X_i^2} = 0$$

because the observations are i.i.d. and E(ui | Xi) = 0.
(d) The conditional variance of β̂1RLS, given X1, …, Xn, is

$$\mathrm{var}(\hat{\beta}_1^{RLS} | X_1, \ldots, X_n) = \mathrm{var}\left(\beta_1 + \frac{\sum_{i=1}^{n} X_i u_i}{\sum_{i=1}^{n} X_i^2} \,\Big|\, X_1, \ldots, X_n\right) = \frac{\sum_{i=1}^{n} X_i^2 \,\mathrm{var}(u_i | X_1, \ldots, X_n)}{\left(\sum_{i=1}^{n} X_i^2\right)^2} = \frac{\sigma_u^2 \sum_{i=1}^{n} X_i^2}{\left(\sum_{i=1}^{n} X_i^2\right)^2} = \frac{\sigma_u^2}{\sum_{i=1}^{n} X_i^2}.$$
(e) The conditional variance of the OLS estimator β̂1 is

$$\mathrm{var}(\hat{\beta}_1 | X_1, \ldots, X_n) = \frac{\sigma_u^2}{\sum_{i=1}^{n} (X_i - \bar{X})^2}.$$

Since

$$\sum_{i=1}^{n} (X_i - \bar{X})^2 = \sum_{i=1}^{n} X_i^2 - 2\bar{X}\sum_{i=1}^{n} X_i + n\bar{X}^2 = \sum_{i=1}^{n} X_i^2 - n\bar{X}^2 < \sum_{i=1}^{n} X_i^2,$$

the OLS estimator has a larger conditional variance:

$$\mathrm{var}(\hat{\beta}_1 | X_1, \ldots, X_n) > \mathrm{var}(\hat{\beta}_1^{RLS} | X_1, \ldots, X_n).$$

The restricted least squares estimator β̂1RLS is therefore more efficient.
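An illustrative Monte Carlo sketch in Python (added here, not from the printed solution; parameter values are arbitrary) comparing the sampling variability of the two slope estimators when the true intercept is zero:

import numpy as np

rng = np.random.default_rng(1)
n, beta1, reps = 100, 2.0, 5000
X = rng.normal(loc=1.0, size=n)  # nonzero mean so that n*Xbar^2 > 0

rls, ols = [], []
for _ in range(reps):
    u = rng.normal(size=n)
    Y = beta1 * X + u
    rls.append((X * Y).sum() / (X ** 2).sum())
    ols.append(np.polyfit(X, Y, 1)[0])  # OLS slope with an (unneeded) intercept

# RLS exploits the true zero-intercept restriction, so its variance is smaller
print(np.var(rls), np.var(ols))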
(f) Under assumption 5 of Key Concept 17.1, conditional on X1, …, Xn, β̂1RLS is normally distributed because it is a weighted average of the normally distributed variables ui:

$$\hat{\beta}_1^{RLS} = \beta_1 + \frac{\sum_{i=1}^{n} X_i u_i}{\sum_{i=1}^{n} X_i^2}.$$
Using the conditional mean and conditional variance of β̂1RLS derived in parts (c) and (d) respectively, the sampling distribution of β̂1RLS, conditional on X1, …, Xn, is

$$\hat{\beta}_1^{RLS} \sim N\left(\beta_1, \frac{\sigma_u^2}{\sum_{i=1}^{n} X_i^2}\right).$$
(g) The estimator satisfies

$$\tilde{\beta}_1 = \frac{\sum_{i=1}^{n} Y_i}{\sum_{i=1}^{n} X_i} = \frac{\sum_{i=1}^{n} (\beta_1 X_i + u_i)}{\sum_{i=1}^{n} X_i} = \beta_1 + \frac{\sum_{i=1}^{n} u_i}{\sum_{i=1}^{n} X_i}.$$
The conditional variance is

$$\mathrm{var}(\tilde{\beta}_1 | X_1, \ldots, X_n) = \mathrm{var}\left(\beta_1 + \frac{\sum_{i=1}^{n} u_i}{\sum_{i=1}^{n} X_i} \,\Big|\, X_1, \ldots, X_n\right) = \frac{\sum_{i=1}^{n} \mathrm{var}(u_i | X_1, \ldots, X_n)}{\left(\sum_{i=1}^{n} X_i\right)^2} = \frac{n\sigma_u^2}{\left(\sum_{i=1}^{n} X_i\right)^2}.$$
The difference between the conditional variances of β̃1 and β̂1RLS is

$$\mathrm{var}(\tilde{\beta}_1 | X_1, \ldots, X_n) - \mathrm{var}(\hat{\beta}_1^{RLS} | X_1, \ldots, X_n) = \frac{n\sigma_u^2}{\left(\sum_{i=1}^{n} X_i\right)^2} - \frac{\sigma_u^2}{\sum_{i=1}^{n} X_i^2}.$$

In order to prove var(β̃1 | X1, …, Xn) ≥ var(β̂1RLS | X1, …, Xn), we need to show

$$\frac{n}{\left(\sum_{i=1}^{n} X_i\right)^2} \ge \frac{1}{\sum_{i=1}^{n} X_i^2},$$

or equivalently
$$n \sum_{i=1}^{n} X_i^2 \ge \left(\sum_{i=1}^{n} X_i\right)^2.$$

This inequality follows directly from the Cauchy-Schwarz inequality

$$\left(\sum_{i=1}^{n} a_i b_i\right)^2 \le \left(\sum_{i=1}^{n} a_i^2\right)\left(\sum_{i=1}^{n} b_i^2\right),$$

which, with ai = 1 and bi = Xi, implies

$$\left(\sum_{i=1}^{n} X_i\right)^2 = \left(\sum_{i=1}^{n} 1 \cdot X_i\right)^2 \le \left(\sum_{i=1}^{n} 1^2\right)\left(\sum_{i=1}^{n} X_i^2\right) = n \sum_{i=1}^{n} X_i^2.$$

That is, $n \sum_{i=1}^{n} X_i^2 \ge \left(\sum_{i=1}^{n} X_i\right)^2$, or var(β̃1 | X1, …, Xn) ≥ var(β̂1RLS | X1, …, Xn).

Note: because β̃1 is linear and conditionally unbiased, the result var(β̃1 | X1, …, Xn) ≥ var(β̂1RLS | X1, …, Xn) follows directly from the Gauss-Markov theorem.
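As a quick sanity check (an added sketch, not part of the solution), Python code verifying the Cauchy-Schwarz inequality n ΣXi² ≥ (ΣXi)² on random draws, which is exactly what makes the ratio estimator β̃1 weakly less efficient than the restricted LS estimator:

import numpy as np

rng = np.random.default_rng(2)
for _ in range(5):
    X = rng.normal(size=50)
    n = len(X)
    assert n * (X ** 2).sum() >= X.sum() ** 2  # Cauchy-Schwarz, a_i = 1, b_i = X_i
    # implied ordering of the conditional variances (taking sigma_u^2 = 1)
    var_ratio_est = n / X.sum() ** 2
    var_rls = 1.0 / (X ** 2).sum()
    assert var_ratio_est >= var_rls
print("Cauchy-Schwarz ordering confirmed on all draws")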
17.2. The sample covariance is

$$s_{XY} = \frac{1}{n-1} \sum_{i=1}^{n} (X_i - \bar{X})(Y_i - \bar{Y})$$
$$= \frac{1}{n-1} \sum_{i=1}^{n} \left\{[(X_i - \mu_X) - (\bar{X} - \mu_X)][(Y_i - \mu_Y) - (\bar{Y} - \mu_Y)]\right\}$$
$$= \frac{1}{n-1} \left\{ \sum_{i=1}^{n} (X_i - \mu_X)(Y_i - \mu_Y) - \sum_{i=1}^{n} (\bar{X} - \mu_X)(Y_i - \mu_Y) - \sum_{i=1}^{n} (X_i - \mu_X)(\bar{Y} - \mu_Y) + \sum_{i=1}^{n} (\bar{X} - \mu_X)(\bar{Y} - \mu_Y) \right\}$$
$$= \frac{n}{n-1} \left[ \frac{1}{n} \sum_{i=1}^{n} (X_i - \mu_X)(Y_i - \mu_Y) \right] - \frac{n}{n-1} (\bar{X} - \mu_X)(\bar{Y} - \mu_Y),$$

where the final equality follows from the definitions of X̄ and Ȳ, which imply that $\sum_{i=1}^{n} (X_i - \mu_X) = n(\bar{X} - \mu_X)$ and $\sum_{i=1}^{n} (Y_i - \mu_Y) = n(\bar{Y} - \mu_Y)$, and by collecting terms.
We apply the law of large numbers to sXY to check its convergence in probability. It is easy to see that the second term converges in probability to zero because $\bar{X} \xrightarrow{p} \mu_X$ and $\bar{Y} \xrightarrow{p} \mu_Y$, so $(\bar{X} - \mu_X)(\bar{Y} - \mu_Y) \xrightarrow{p} 0$ by Slutsky's theorem. Now consider the first term. Since (Xi, Yi) are i.i.d., the random sequence (Xi − µX)(Yi − µY) is i.i.d. By the definition of covariance, E[(Xi − µX)(Yi − µY)] = σXY. To apply the law of large numbers to the first term, we need to have

$$\mathrm{var}[(X_i - \mu_X)(Y_i - \mu_Y)] < \infty,$$

which is satisfied since

$$\mathrm{var}[(X_i - \mu_X)(Y_i - \mu_Y)] \le E[(X_i - \mu_X)^2 (Y_i - \mu_Y)^2] \le \sqrt{E[(X_i - \mu_X)^4] \, E[(Y_i - \mu_Y)^4]} < \infty.$$

The second inequality follows by applying the Cauchy-Schwarz inequality, and the third inequality follows because of the finite fourth moments for (Xi, Yi). Applying the law of large numbers, we have
$$\frac{1}{n} \sum_{i=1}^{n} (X_i - \mu_X)(Y_i - \mu_Y) \xrightarrow{p} E[(X_i - \mu_X)(Y_i - \mu_Y)] = \sigma_{XY}.$$

Also, $\frac{n}{n-1} \to 1$, so the first term for sXY converges in probability to σXY. Combining the results on the two terms for sXY, we have $s_{XY} \xrightarrow{p} \sigma_{XY}$.
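An illustrative Monte Carlo sketch (added for this write-up; parameter values are arbitrary): the sample covariance approaches the population covariance as n grows, consistent with the result above.

import numpy as np

rng = np.random.default_rng(3)
sigma_xy = 0.8
cov = np.array([[1.0, sigma_xy], [sigma_xy, 1.0]])

for n in (50, 500, 50000):
    XY = rng.multivariate_normal(mean=[0.0, 0.0], cov=cov, size=n)
    s_xy = np.cov(XY[:, 0], XY[:, 1], ddof=1)[0, 1]  # uses the (n-1) divisor
    print(n, s_xy)  # tends toward 0.8 as n grows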
17.3. (a) Using Equation (17.19), we have

$$\sqrt{n}(\hat{\beta}_1 - \beta_1) = \frac{\frac{1}{\sqrt{n}} \sum_{i=1}^{n} (X_i - \bar{X}) u_i}{\frac{1}{n} \sum_{i=1}^{n} (X_i - \bar{X})^2}$$
$$= \frac{\frac{1}{\sqrt{n}} \sum_{i=1}^{n} [(X_i - \mu_X) - (\bar{X} - \mu_X)] u_i}{\frac{1}{n} \sum_{i=1}^{n} (X_i - \bar{X})^2}$$
$$= \frac{\frac{1}{\sqrt{n}} \sum_{i=1}^{n} (X_i - \mu_X) u_i}{\frac{1}{n} \sum_{i=1}^{n} (X_i - \bar{X})^2} - (\bar{X} - \mu_X) \frac{\frac{1}{\sqrt{n}} \sum_{i=1}^{n} u_i}{\frac{1}{n} \sum_{i=1}^{n} (X_i - \bar{X})^2}$$
$$= \frac{\frac{1}{\sqrt{n}} \sum_{i=1}^{n} v_i}{\frac{1}{n} \sum_{i=1}^{n} (X_i - \bar{X})^2} - (\bar{X} - \mu_X) \frac{\frac{1}{\sqrt{n}} \sum_{i=1}^{n} u_i}{\frac{1}{n} \sum_{i=1}^{n} (X_i - \bar{X})^2},$$

by defining vi = (Xi − µX)ui.
(b) The random variables u1, …, un are i.i.d. with mean µu = 0 and variance $0 < \sigma_u^2 < \infty$. By the central limit theorem,

$$\frac{\sqrt{n}(\bar{u} - \mu_u)}{\sigma_u} = \frac{\frac{1}{\sqrt{n}} \sum_{i=1}^{n} u_i}{\sigma_u} \xrightarrow{d} N(0, 1).$$

The law of large numbers implies $\bar{X} \xrightarrow{p} \mu_X$, or $\bar{X} - \mu_X \xrightarrow{p} 0$. By the consistency of the sample variance, $\frac{1}{n} \sum_{i=1}^{n} (X_i - \bar{X})^2$ converges in probability to the population variance, var(Xi), which is finite and non-zero. The result then follows from Slutsky's theorem.
(c) The random variable vi = (Xi − µX)ui has finite variance:

$$\mathrm{var}(v_i) = \mathrm{var}[(X_i - \mu_X) u_i] \le E[(X_i - \mu_X)^2 u_i^2] \le \sqrt{E[(X_i - \mu_X)^4] \, E(u_i^4)} < \infty.$$

The second inequality follows by applying the Cauchy-Schwarz inequality, and the final inequality follows because of the finite fourth moments for (Xi, ui). The finite variance, along with the fact that vi has mean zero (by assumption 1 of Key Concept 17.1) and vi is i.i.d. (by assumption 2), implies that the sample average $\bar{v}$ satisfies the requirements of the central limit theorem. Thus,
$$\frac{\bar{v}}{\sigma_{\bar{v}}} = \frac{\frac{1}{\sqrt{n}} \sum_{i=1}^{n} v_i}{\sigma_v}$$

satisfies the central limit theorem.
(d) Applying the central limit theorem, we have

$$\frac{\frac{1}{\sqrt{n}} \sum_{i=1}^{n} v_i}{\sigma_v} \xrightarrow{d} N(0, 1).$$

Because the sample variance is a consistent estimator of the population variance, we have

$$\frac{\frac{1}{n} \sum_{i=1}^{n} (X_i - \bar{X})^2}{\mathrm{var}(X_i)} \xrightarrow{p} 1.$$

Using Slutsky's theorem,

$$\frac{\frac{1}{\sqrt{n}} \sum_{i=1}^{n} v_i / \sigma_v}{\frac{1}{n} \sum_{i=1}^{n} (X_i - \bar{X})^2 / \sigma_X^2} \xrightarrow{d} N(0, 1),$$

or equivalently

$$\frac{\frac{1}{\sqrt{n}} \sum_{i=1}^{n} v_i}{\frac{1}{n} \sum_{i=1}^{n} (X_i - \bar{X})^2} \xrightarrow{d} N\left(0, \frac{\mathrm{var}(v_i)}{[\mathrm{var}(X_i)]^2}\right).$$

Thus

$$\sqrt{n}(\hat{\beta}_1 - \beta_1) = \frac{\frac{1}{\sqrt{n}} \sum_{i=1}^{n} v_i}{\frac{1}{n} \sum_{i=1}^{n} (X_i - \bar{X})^2} - (\bar{X} - \mu_X) \frac{\frac{1}{\sqrt{n}} \sum_{i=1}^{n} u_i}{\frac{1}{n} \sum_{i=1}^{n} (X_i - \bar{X})^2} \xrightarrow{d} N\left(0, \frac{\mathrm{var}(v_i)}{[\mathrm{var}(X_i)]^2}\right),$$

since the second term for $\sqrt{n}(\hat{\beta}_1 - \beta_1)$ converges in probability to zero, as shown in part (b).
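A hedged simulation sketch (not in the original): it checks that the Monte Carlo variance of √n(β̂1 − β1) is close to var(vi)/[var(Xi)]² for a simple heteroskedastic design. All parameter choices are arbitrary.

import numpy as np

rng = np.random.default_rng(4)
n, beta0, beta1, reps = 2000, 1.0, 2.0, 2000
stats = []
for _ in range(reps):
    X = rng.normal(size=n)
    u = rng.normal(size=n) * np.sqrt(1.0 + X ** 2)  # heteroskedastic errors
    Y = beta0 + beta1 * X + u
    b1 = np.polyfit(X, Y, 1)[0]
    stats.append(np.sqrt(n) * (b1 - beta1))

# Here mu_X = 0, so var(v_i) = E[X^2 (1 + X^2)] = E[X^2] + E[X^4] = 1 + 3 = 4
# and var(X_i) = 1, so the asymptotic variance is 4 / 1**2 = 4.
print(np.var(stats))  # should be near 4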
17.4. (a) Write $\hat{\beta}_1 - \beta_1 = a_n S_n$, where $a_n = \frac{1}{\sqrt{n}}$ and $S_n = \sqrt{n}(\hat{\beta}_1 - \beta_1)$. Now $a_n \to 0$ and $S_n \xrightarrow{d} S$, where S is distributed N(0, a²). By Slutsky's theorem, $a_n S_n \xrightarrow{d} 0 \times S = 0$. Thus $\Pr(|\hat{\beta}_1 - \beta_1| > \delta) \to 0$ for any δ > 0, so that $\hat{\beta}_1 - \beta_1 \xrightarrow{p} 0$ and β̂1 is consistent.

(b) We have (i) $\frac{s_u^2}{\sigma_u^2} \xrightarrow{p} 1$ and (ii) $g(x) = \sqrt{x}$ is a continuous function; thus, from the continuous mapping theorem,

$$\sqrt{\frac{s_u^2}{\sigma_u^2}} = \frac{s_u}{\sigma_u} \xrightarrow{p} 1.$$
17.5. Because E(W⁴) = [E(W²)]² + var(W²), we have [E(W²)]² ≤ E(W⁴) < ∞. Thus E(W²) < ∞.
17.6. Using the law of iterated expectations, we have

$$E(\hat{\beta}_1) = E[E(\hat{\beta}_1 | X_1, \ldots, X_n)] = E(\beta_1) = \beta_1.$$
17.7. (a) The joint probability distribution function of ui, uj, Xi, Xj is f(ui, uj, Xi, Xj). The conditional probability distribution function of ui and Xi given uj and Xj is f(ui, Xi | uj, Xj). Since (ui, Xi), i = 1, …, n, are i.i.d., f(ui, Xi | uj, Xj) = f(ui, Xi). By the definition of the conditional probability distribution function, we have

$$f(u_i, u_j, X_i, X_j) = f(u_i, X_i | u_j, X_j) f(u_j, X_j) = f(u_i, X_i) f(u_j, X_j).$$
(b) The conditional probability distribution function of ui and uj given Xi and Xj equals

$$f(u_i, u_j | X_i, X_j) = \frac{f(u_i, u_j, X_i, X_j)}{f(X_i, X_j)} = \frac{f(u_i, X_i) f(u_j, X_j)}{f(X_i) f(X_j)} = f(u_i | X_i) f(u_j | X_j).$$

The first and third equalities use the definition of the conditional probability distribution function. The second equality uses the conclusion from part (a) and the independence between Xi and Xj. Substituting $f(u_i, u_j | X_i, X_j) = f(u_i | X_i) f(u_j | X_j)$ into the definition of the conditional expectation, we have

$$E(u_i u_j | X_i, X_j) = \iint u_i u_j f(u_i, u_j | X_i, X_j) \, du_i \, du_j = \iint u_i u_j f(u_i | X_i) f(u_j | X_j) \, du_i \, du_j = \int u_i f(u_i | X_i) \, du_i \int u_j f(u_j | X_j) \, du_j = E(u_i | X_i) E(u_j | X_j).$$
(c) Let Q = (X1, X2, …, X_{i−1}, X_{i+1}, …, Xn), so that f(ui | X1, …, Xn) = f(ui | Xi, Q). Write
$$f(u_i | X_i, Q) = \frac{f(u_i, X_i, Q)}{f(X_i, Q)} = \frac{f(u_i, X_i) f(Q)}{f(X_i) f(Q)} = \frac{f(u_i, X_i)}{f(X_i)} = f(u_i | X_i),$$

where the first equality uses the definition of the conditional density, the second uses the fact that (ui, Xi) and Q are independent, and the final equality uses the definition of the conditional density. The result then follows directly.
(d) An argument like that used in (c) implies

$$f(u_i, u_j | X_1, \ldots, X_n) = f(u_i, u_j | X_i, X_j),$$

and the result then follows from part (b).
17.8. (a) Because the errors are heteroskedastic, the Gauss-Markov theorem does not apply. The OLS estimator of β1 is not BLUE.

(b) We obtain the BLUE estimator of β1 from OLS in the following weighted regression:

$$\tilde{Y}_i = \beta_0 \tilde{X}_{0i} + \beta_1 \tilde{X}_{1i} + \tilde{u}_i,$$

where

$$\tilde{Y}_i = \frac{Y_i}{\sqrt{\theta_0 + \theta_1 |X_i|}}, \quad \tilde{X}_{0i} = \frac{1}{\sqrt{\theta_0 + \theta_1 |X_i|}}, \quad \tilde{X}_{1i} = \frac{X_i}{\sqrt{\theta_0 + \theta_1 |X_i|}}, \quad \text{and} \quad \tilde{u}_i = \frac{u_i}{\sqrt{\theta_0 + \theta_1 |X_i|}}.$$
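An illustrative Python sketch of this weighting scheme (parameter values are hypothetical; the conditional standard deviation of ui is taken to be √(θ0 + θ1|Xi|), matching the variance function above):

import numpy as np

rng = np.random.default_rng(5)
n, beta0, beta1 = 400, 1.0, 2.0
theta0, theta1 = 1.0, 3.0

X = rng.normal(size=n)
sd = np.sqrt(theta0 + theta1 * np.abs(X))     # conditional std dev of u_i
Y = beta0 + beta1 * X + sd * rng.normal(size=n)

# Divide everything by the conditional standard deviation...
X0t, X1t, Yt = 1.0 / sd, X / sd, Y / sd
# ...then run OLS (with no further intercept) on the transformed variables
coef, *_ = np.linalg.lstsq(np.column_stack([X0t, X1t]), Yt, rcond=None)
print(coef)  # approximately (beta0, beta1), now efficient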
(c) Using equations (17.2) and (17.19), we know the OLS estimator, β̂1, is

$$\hat{\beta}_1 = \frac{\sum_{i=1}^{n} (X_i - \bar{X})(Y_i - \bar{Y})}{\sum_{i=1}^{n} (X_i - \bar{X})^2} = \beta_1 + \frac{\sum_{i=1}^{n} (X_i - \bar{X}) u_i}{\sum_{i=1}^{n} (X_i - \bar{X})^2}.$$

As a weighted average of the normally distributed variables ui, β̂1 is normally distributed with mean E(β̂1) = β1. The conditional variance of β̂1, given X1, …, Xn, is

$$\mathrm{var}(\hat{\beta}_1 | X_1, \ldots, X_n) = \mathrm{var}\left(\beta_1 + \frac{\sum_{i=1}^{n} (X_i - \bar{X}) u_i}{\sum_{i=1}^{n} (X_i - \bar{X})^2} \,\Big|\, X_1, \ldots, X_n\right)$$
$$= \frac{\sum_{i=1}^{n} (X_i - \bar{X})^2 \,\mathrm{var}(u_i | X_1, \ldots, X_n)}{\left[\sum_{i=1}^{n} (X_i - \bar{X})^2\right]^2} = \frac{\sum_{i=1}^{n} (X_i - \bar{X})^2 \,\mathrm{var}(u_i | X_i)}{\left[\sum_{i=1}^{n} (X_i - \bar{X})^2\right]^2} = \frac{\sum_{i=1}^{n} (X_i - \bar{X})^2 (\theta_0 + \theta_1 |X_i|)}{\left[\sum_{i=1}^{n} (X_i - \bar{X})^2\right]^2}.$$

Thus the exact sampling distribution of the OLS estimator, β̂1, conditional on X1, …, Xn, is
$$\hat{\beta}_1 \sim N\left(\beta_1, \frac{\sum_{i=1}^{n} (X_i - \bar{X})^2 (\theta_0 + \theta_1 |X_i|)}{\left[\sum_{i=1}^{n} (X_i - \bar{X})^2\right]^2}\right).$$
(d) The weighted least squares (WLS) estimators, β̂0WLS and β̂1WLS, are the solutions to

$$\min_{b_0, b_1} \sum_{i=1}^{n} (\tilde{Y}_i - b_0 \tilde{X}_{0i} - b_1 \tilde{X}_{1i})^2,$$

the minimization of the sum of squared errors of the weighted regression. The first-order conditions of the minimization with respect to b0 and b1 are

$$\sum_{i=1}^{n} 2(\tilde{Y}_i - b_0 \tilde{X}_{0i} - b_1 \tilde{X}_{1i})(-\tilde{X}_{0i}) = 0,$$
$$\sum_{i=1}^{n} 2(\tilde{Y}_i - b_0 \tilde{X}_{0i} - b_1 \tilde{X}_{1i})(-\tilde{X}_{1i}) = 0.$$

Solving for b1 gives the WLS estimator

$$\hat{\beta}_1^{WLS} = \frac{-Q_{01} S_0 + Q_{00} S_1}{Q_{00} Q_{11} - Q_{01}^2},$$

where

$$Q_{00} = \sum_{i=1}^{n} \tilde{X}_{0i} \tilde{X}_{0i}, \quad Q_{01} = \sum_{i=1}^{n} \tilde{X}_{0i} \tilde{X}_{1i}, \quad Q_{11} = \sum_{i=1}^{n} \tilde{X}_{1i} \tilde{X}_{1i}, \quad S_0 = \sum_{i=1}^{n} \tilde{X}_{0i} \tilde{Y}_i, \quad \text{and} \quad S_1 = \sum_{i=1}^{n} \tilde{X}_{1i} \tilde{Y}_i.$$

Substituting $\tilde{Y}_i = \beta_0 \tilde{X}_{0i} + \beta_1 \tilde{X}_{1i} + \tilde{u}_i$ yields

$$\hat{\beta}_1^{WLS} = \beta_1 + \frac{-Q_{01} Z_0 + Q_{00} Z_1}{Q_{00} Q_{11} - Q_{01}^2},$$

where $Z_0 = \sum_{i=1}^{n} \tilde{X}_{0i} \tilde{u}_i$ and $Z_1 = \sum_{i=1}^{n} \tilde{X}_{1i} \tilde{u}_i$, or

$$\hat{\beta}_1^{WLS} - \beta_1 = \frac{\sum_{i=1}^{n} (Q_{00} \tilde{X}_{1i} - Q_{01} \tilde{X}_{0i}) \tilde{u}_i}{Q_{00} Q_{11} - Q_{01}^2}.$$

From this we see that the distribution of $\hat{\beta}_1^{WLS} | X_1, \ldots, X_n$ is $N(\beta_1, \sigma^2_{\hat{\beta}_1^{WLS}})$, where
$$\sigma^2_{\hat{\beta}_1^{WLS}} = \frac{\sigma_{\tilde{u}}^2 \sum_{i=1}^{n} (Q_{00} \tilde{X}_{1i} - Q_{01} \tilde{X}_{0i})^2}{(Q_{00} Q_{11} - Q_{01}^2)^2} = \frac{Q_{00}^2 Q_{11} + Q_{01}^2 Q_{00} - 2 Q_{00} Q_{01}^2}{(Q_{00} Q_{11} - Q_{01}^2)^2} = \frac{Q_{00}}{Q_{00} Q_{11} - Q_{01}^2},$$

where the first equality uses the fact that the observations are independent, the second uses $\sigma_{\tilde{u}}^2 = 1$ and the definitions of Q00, Q11, and Q01, and the third is an algebraic simplification.
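As a sanity check on the algebra (an added sketch, not from the text): since σũ² = 1, the variance expression Q00/(Q00Q11 − Q01²) should equal the b1 diagonal element of (X̃′X̃)⁻¹.

import numpy as np

rng = np.random.default_rng(6)
theta0, theta1 = 1.0, 3.0
X = rng.normal(size=200)
sd = np.sqrt(theta0 + theta1 * np.abs(X))
X0t, X1t = 1.0 / sd, X / sd

Q00, Q01, Q11 = (X0t**2).sum(), (X0t * X1t).sum(), (X1t**2).sum()
closed_form = Q00 / (Q00 * Q11 - Q01**2)

Xt = np.column_stack([X0t, X1t])
matrix_form = np.linalg.inv(Xt.T @ Xt)[1, 1]  # var of b1 when sigma_u~^2 = 1
assert np.isclose(closed_form, matrix_form)
print(closed_form)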
17.9. We need to prove

$$\frac{1}{n} \sum_{i=1}^{n} \left[(X_i - \bar{X})^2 \hat{u}_i^2 - (X_i - \mu_X)^2 u_i^2\right] \xrightarrow{p} 0.$$

Using the identity $\bar{X} = \mu_X + (\bar{X} - \mu_X)$,

$$\frac{1}{n} \sum_{i=1}^{n} \left[(X_i - \bar{X})^2 \hat{u}_i^2 - (X_i - \mu_X)^2 u_i^2\right] = (\bar{X} - \mu_X)^2 \frac{1}{n} \sum_{i=1}^{n} \hat{u}_i^2 - 2(\bar{X} - \mu_X) \frac{1}{n} \sum_{i=1}^{n} (X_i - \mu_X) \hat{u}_i^2 + \frac{1}{n} \sum_{i=1}^{n} (X_i - \mu_X)^2 (\hat{u}_i^2 - u_i^2).$$

The definition of $\hat{u}_i$ implies

$$\hat{u}_i^2 = u_i^2 + (\hat{\beta}_0 - \beta_0)^2 + (\hat{\beta}_1 - \beta_1)^2 X_i^2 - 2 u_i (\hat{\beta}_0 - \beta_0) - 2 u_i (\hat{\beta}_1 - \beta_1) X_i + 2 (\hat{\beta}_0 - \beta_0)(\hat{\beta}_1 - \beta_1) X_i.$$

Substituting this into the expression for $\frac{1}{n} \sum_{i=1}^{n} [(X_i - \bar{X})^2 \hat{u}_i^2 - (X_i - \mu_X)^2 u_i^2]$ yields a series of terms, each of which can be written as $a_n b_n$, where $a_n \xrightarrow{p} 0$ and $b_n = \frac{1}{n} \sum_{i=1}^{n} X_i^r u_i^s$ for integers r and s. For example, $a_n = (\bar{X} - \mu_X)$, $a_n = (\hat{\beta}_1 - \beta_1)$, and so forth. The result then follows from Slutsky's theorem if $\frac{1}{n} \sum_{i=1}^{n} X_i^r u_i^s \xrightarrow{p} d$, where d is a finite constant. Let $w_i = X_i^r u_i^s$ and note that wi is i.i.d. The law of large numbers can then be used for the desired result if $E(w_i^2) < \infty$. There are two cases that need to be addressed. In the first, both r and s are non-zero. In this case write

$$E(w_i^2) = E(X_i^{2r} u_i^{2s}) \le \sqrt{E(X_i^{4r}) \, E(u_i^{4s})},$$

and this term is finite if r and s are less than 2. Inspection of the terms shows that this is true. In the second case, either r = 0 or s = 0. In this case the result follows directly if the non-zero exponent (r or s) is less than 4. Inspection of the terms shows that this is true.
17.10. Using (17.43) with $W = \hat{\theta} - \theta$ implies

$$\Pr(|\hat{\theta} - \theta| \ge \delta) \le \frac{E[(\hat{\theta} - \theta)^2]}{\delta^2}.$$

Since $E[(\hat{\theta} - \theta)^2] \to 0$, $\Pr(|\hat{\theta} - \theta| \ge \delta) \to 0$ for any δ > 0, so that $\hat{\theta} - \theta \xrightarrow{p} 0$.
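A small illustrative check of this Chebyshev-style bound in Python (the estimator here, a sample mean, is a hypothetical stand-in for θ̂):

import numpy as np

rng = np.random.default_rng(7)
theta, delta, reps = 0.0, 0.1, 2000
for n in (100, 1000, 10000):
    draws = rng.normal(loc=theta, size=(reps, n)).mean(axis=1)  # theta_hat
    mse = np.mean((draws - theta) ** 2)              # E[(theta_hat - theta)^2]
    freq = np.mean(np.abs(draws - theta) >= delta)   # Pr(|theta_hat - theta| >= delta)
    print(n, freq, "<=", mse / delta ** 2)           # bound holds; both tend to 0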
17.11. Note: in early printings of the third edition there was a typographical error in the expression for µY|X. The correct expression is $\mu_{Y|X} = \mu_Y + (\sigma_{XY}/\sigma_X^2)(x - \mu_X)$.

(a) Using the hint and equation (17.38),

$$f_{Y|X=x}(y) = \frac{1}{\sqrt{2\pi \sigma_Y^2 (1 - \rho_{XY}^2)}} \exp\left( \frac{-1}{2(1 - \rho_{XY}^2)} \left[ \left(\frac{x - \mu_X}{\sigma_X}\right)^2 - 2\rho_{XY} \left(\frac{x - \mu_X}{\sigma_X}\right)\left(\frac{y - \mu_Y}{\sigma_Y}\right) + \left(\frac{y - \mu_Y}{\sigma_Y}\right)^2 \right] + \frac{1}{2}\left(\frac{x - \mu_X}{\sigma_X}\right)^2 \right).$$

Simplifying yields the desired expression.
(b) The result follows by noting that fY|X=x(y) is a normal density (see equation (17.36)) with µ = µY|X and σ² = σ²Y|X.

(c) Let b = σXY/σX² and a = µY − bµX.
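An illustrative simulation (added; arbitrary parameters): for bivariate normal data, the OLS regression of Y on X recovers b = σXY/σX² and a = µY − bµX, consistent with part (c).

import numpy as np

rng = np.random.default_rng(8)
mu_x, mu_y, s_x, s_y, rho = 1.0, 2.0, 1.5, 0.5, 0.6
cov_xy = rho * s_x * s_y
cov = [[s_x**2, cov_xy], [cov_xy, s_y**2]]
X, Y = rng.multivariate_normal([mu_x, mu_y], cov, size=200000).T

b_true = cov_xy / s_x**2
a_true = mu_y - b_true * mu_x
b_hat, a_hat = np.polyfit(X, Y, 1)  # slope first, then intercept
print((a_true, b_true), (a_hat, b_hat))  # close in large samples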
17.12. (a)

$$E(e^u) = \int_{-\infty}^{\infty} \frac{1}{\sigma_u \sqrt{2\pi}} \exp\left(-\frac{u^2}{2\sigma_u^2} + u\right) du = \exp\left(\frac{\sigma_u^2}{2}\right) \int_{-\infty}^{\infty} \frac{1}{\sigma_u \sqrt{2\pi}} \exp\left(-\frac{1}{2\sigma_u^2}(u - \sigma_u^2)^2\right) du = \exp\left(\frac{\sigma_u^2}{2}\right),$$

where the final equality follows because the integrand is the density of a normal random variable with mean and variance both equal to $\sigma_u^2$. Because the integrand is a density, it integrates to 1.
(b) The result follows directly from (a).
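A quick numerical verification of (a) (an added sketch): simulate u ~ N(0, σu²) and compare the sample mean of e^u with exp(σu²/2).

import numpy as np

rng = np.random.default_rng(9)
sigma_u = 0.7
u = rng.normal(scale=sigma_u, size=2_000_000)
print(np.exp(u).mean(), np.exp(sigma_u**2 / 2))  # both approximately 1.278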
17.13. (a) The answer is provided by equation (13.10) and the discussion following the equation. The result was also shown in Exercise 13.10, and the approach used in that exercise is discussed in part (b).

(b) Write the regression model as Yi = β0 + β1Xi + vi, where β0 = E(β0i), β1 = E(β1i), and vi = ui + (β0i − β0) + (β1i − β1)Xi. Notice that

$$E(v_i | X_i) = E(u_i | X_i) + E(\beta_{0i} - \beta_0 | X_i) + X_i E(\beta_{1i} - \beta_1 | X_i) = 0$$

because β0i and β1i are independent of Xi. Because E(vi | Xi) = 0, the OLS regression of Yi on Xi will provide consistent estimates of β0 = E(β0i) and β1 = E(β1i).

Recall that the weighted least squares estimator is the OLS estimator of Yi/σi on 1/σi and Xi/σi, where $\sigma_i = \sqrt{\theta_0 + \theta_1 X_i^2}$. Write this regression as

$$Y_i / \sigma_i = \beta_0 (1/\sigma_i) + \beta_1 (X_i / \sigma_i) + v_i / \sigma_i.$$

This regression has two regressors, 1/σi and Xi/σi. Because these regressors depend only on Xi, E(vi | Xi) = 0 implies that E(vi/σi | (1/σi), Xi/σi) = 0. Thus, weighted least squares provides a consistent estimator of β0 = E(β0i) and β1 = E(β1i).
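A hedged simulation sketch (added; the weight function σi² = θ0 + θ1Xi² and all parameter values are assumptions for illustration): this WLS recovers E(β0i) and E(β1i). As the argument above shows, the weights need not match the true conditional variance of vi for consistency.

import numpy as np

rng = np.random.default_rng(10)
n, theta0, theta1 = 200000, 1.0, 2.0
X = rng.normal(size=n)
b0i = 1.0 + 0.5 * rng.normal(size=n)   # random intercepts, E(b0i) = 1.0
b1i = 2.0 + 0.5 * rng.normal(size=n)   # random slopes, E(b1i) = 2.0, indep. of X
u = rng.normal(size=n)
Y = b0i + b1i * X + u

sigma = np.sqrt(theta0 + theta1 * X**2)
coef, *_ = np.linalg.lstsq(
    np.column_stack([1.0 / sigma, X / sigma]), Y / sigma, rcond=None)
print(coef)  # approximately (1.0, 2.0) = (E(b0i), E(b1i))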
17.14. (a) Yi = (Yi − µ) + µ, so that Yi² = (Yi − µ)² + µ² + 2µ(Yi − µ). The result follows after taking the expected value of both sides of the equation.

(b) This follows from the law of large numbers because Yi is i.i.d. with mean E(Yi) = µ and finite variance.

(c) This follows from the law of large numbers because Yi² is i.i.d. with mean E(Yi²) = µ² + σ² (from (a)) and finite variance (because Yi has a finite fourth moment, Yi² has a finite second moment).
(d)

$$\frac{1}{n} \sum_{i=1}^{n} (Y_i - \bar{Y})^2 = \frac{1}{n} \sum_{i=1}^{n} (Y_i^2 + \bar{Y}^2 - 2\bar{Y} Y_i) = \frac{1}{n} \sum_{i=1}^{n} Y_i^2 + \bar{Y}^2 - 2\bar{Y} \cdot \frac{1}{n} \sum_{i=1}^{n} Y_i = \frac{1}{n} \sum_{i=1}^{n} Y_i^2 - \bar{Y}^2.$$

(e) This follows from (a)-(d) and $\bar{Y}^2 \xrightarrow{p} \mu^2$.

(f) This follows from (e) and n/(n − 1) → 1.
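A one-screen Python check of the identity in (d) and the consistency claim in (f) (illustrative values):

import numpy as np

rng = np.random.default_rng(11)
Y = rng.exponential(scale=2.0, size=10000)  # mu = 2, sigma^2 = 4

lhs = np.mean((Y - Y.mean()) ** 2)
rhs = np.mean(Y ** 2) - Y.mean() ** 2
assert np.isclose(lhs, rhs)                 # algebraic identity in (d)
print(Y.var(ddof=1))                        # (n/(n-1)) * lhs, near sigma^2 = 4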
17.15. (a) Write $W = \sum_{i=1}^{m} Z_i^2$, where the Zi are i.i.d. N(0, 1). From the law of large numbers, $W/m \xrightarrow{p} E(Z_i^2) = 1$.

(b) The numerator is N(0, 1) and the denominator converges in probability to 1. The result follows from Slutsky's theorem (equation (17.9)).

(c) V/m is distributed $\chi_m^2 / m$, and the denominator converges in probability to 1. The result follows from Slutsky's theorem (equation (17.9)).
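An illustrative simulation (added): for large m, the ratio Z/√(V/m) with V ~ χ²m behaves like a standard normal, as parts (b) and (c) assert.

import numpy as np

rng = np.random.default_rng(12)
m, reps = 200, 100000
Z = rng.normal(size=reps)
V = rng.chisquare(df=m, size=reps)   # sum of m squared standard normals
T = Z / np.sqrt(V / m)               # t-distributed with m degrees of freedom
print(T.mean(), T.std())             # near 0 and 1: approx N(0,1) for large m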