Page 306 -
P. 306

8.1    A General Strategy for Modeling Nonlinear Regression Functions	 305

Figure 8.2 		Scatterplot of Test Score vs. District Income with a Linear OLS Regression Function

There is a positive correlation        Test score
between test scores and district       740
income (correlation = 0.71),           720
but the linear OLS regression line
does not adequately describe the       700
relationship between these variables.
                                       680

                                       660

                                       640

                                       620

                                       600         10 20 30 40 50 60
                                            0                                           District income

                                                                                (thousands of dollars)

     In short, it seems that the relationship between district income and test scores
is not a straight line. Rather, it is nonlinear. A nonlinear function is a function
with a slope that is not constant: The function ƒ(X) is linear if the slope of ƒ(X) is
the same for all values of X, but if the slope depends on the value of X, then ƒ(X)
is nonlinear.

     If a straight line is not an adequate description of the relationship between
district income and test scores, what is? Imagine drawing a curve that fits the points
in Figure 8.2. This curve would be steep for low values of district income and
then would flatten out as district income gets higher. One way to approximate
such a curve mathematically is to model the relationship as a quadratic function.
That is, we could model test scores as a function of income and the square of
income.

     A quadrEatlieccptroopnuiclaPtiuobnlisrehginrgesSsieornvimceosdeInlcr.elating test scores and income is
written mathSemtoactki/cWalalytsaosn, Econometrics 1e

                STOC.ITEM.0022
	 FigT. 0e6st.S0c2orei = b0 + b1Incomei + b2Incomei2 + ui,	(8.1)

where  b0,  b1,                        and  1st  aPrreoocof efficien2tns,dIPncrooomfei  is  the3irndcPomroeofin  the  itFh indaisltrict,
                                            b2
Incomei2 is the square of income in the ith district, and ui is an error term that, as

usual, represents all the other factors that determine test scores. Equation (8.1) is

called the quadratic regression model because the population regression function,
   301   302   303   304   305   306   307   308   309   310   311