Page 208 -

P. 208

5.4 Heteroskedasticity and Homoskedasticity 207

will be referred to as the “homoskedasticity-only” formulas for the variance and
standard error of the OLS estimators. As the name suggests, if the errors are
heteroskedastic, then the homoskedasticity-only standard errors are inappropri-
ate. Specifically, if the errors are heteroskedastic, then the t-statistic computed
using the homoskedasticity-only standard error does not have a standard normal
distribution, even in large samples. In fact, the correct critical values to use for this
homoskedasticity-only t-statistic depend on the precise nature of the heteroskedas-
ticity, so those critical values cannot be tabulated. Similarly, if the errors are hetero-
skedastic but a confidence interval is constructed as {1.96 homoskedasticity-only
standard errors, in general the probability that this interval contains the true value
of the coefficient is not 95%, even in large samples.

In contrast, because homoskedasticity is a special case of heteroskedasticity,
the estimators snbn21 and snbn20 of the variances of bn1 and bn0 given in Equations (5.4)
and (5.26) produce valid statistical inferences whether the errors are heteroske-
dastic or homoskedastic. Thus hypothesis tests and confidence intervals based on
those standard errors are valid whether or not the errors are heteroskedastic.
Because the standard errors we have used so far [that is, those based on Equations
(5.4) and (5.26)] lead to statistical inferences that are valid whether or not the
errors are heteroskedastic, they are called heteroskedasticity-robust standard
errors. Because such formulas were proposed by Eicker (1967), Huber (1967), and
White (1980), they are also referred to as Eicker–Huber–White standard errors.

What Does This Mean in Practice?

Which is more realistic, heteroskedasticity or homoskedasticity? The answer to
this question depends on the application. However, the issues can be clarified by
returning to the example of the gender gap in earnings among college graduates.
Familiarity with how people are paid in the world around us gives some clues as to
which assumption is more sensible. For many years—and, to a lesser extent, today—
women were not found in the top-paying jobs: There have always been poorly paid
men, but there have rarely been highly paid women. This suggests that the distribu-
tion of earnings among women is tighter than among men (see the box in Chapter 3
“The Gender Gap in Earnings of College Graduates in the United States”). In
other words, the variance of the error term in Equation (5.20) for women is plausi-
bly less than the variance of the error term in Equation (5.21) for men. Thus the
presence of a “glass ceiling” for women’s jobs and pay suggests that the error term
in the binary variable regression model in Equation (5.19) is heteroskedastic. Unless
there are compelling reasons to the contrary—and we can think of none—it makes
sense to treat the error term in this example as heteroskedastic.

203 204 205 206 207 208 209 210 211 212 213