Content Frame
Skip Breadcrumb Navigation
Home  arrow Student Resources  arrow Interactive Exercise 2  arrow Section 3

Section 3

Heteroskedasticity and Serial Correlation in the Interactive Example

Now that you have chosen your "best" specification, we can discuss the possibility that pure heteroskedasticity or serial correlation might exist in your residuals.

Heteroskedasticity. This dependent variable is already per capita, is time series, and does not change substantially during the sample period. As a result, the possibility of pure heteroskedasticity is so low that most econometricians would not even bother testing for it. (If a Park test were to be run, however, a logical proportionality factor Z might be per capita disposable income.)

Serial Correlation. Serial correlation is quite another matter, however, since the data are time series and a number of the Durbin-Watson d statistics are below the critical dL value for positive serial correlation. In addition, it seems possible that short-run fads in the consumption of pork, or alternatively, supply shocks, might cause swings in consumption from year to year that would not be completely captured by the price variables or by coefficients estimated over the entire sample.

In particular, if your final equation was either regression run 6 or run 14, then there is a possibility of serial correlation. Their Durbin-Watson d's of 1.09 and 1.19 respectively are right at the edge of the critical dL for positive serial correlation for 40 observations and 6 explanatory variables.

The application of generalized least squares to regression runs 6 and 14 are contained in regression runs 2l and 22 respectively; we include these mainly for instructional purposes, since the inconclusive Durbin-Watson statistics do not justify GLS on their own.

Regression Run 21
Dependent Variable: CONPK
Method: Least Squares
Sample(adjusted): 1975:2 1984:4
Included observations: 39 after adjusting endpoints
Convergence achieved after 48 iterations
VariableCoefficientStd. Errort-StatisticProb.
C17.517670.94027118.630450.0000
PRIPK-0.0798470.008176-9.7663550.0000
PRIBF0.0405370.0055727.2746660.0000
YDUSP0.2443660.1194072.0464970.0493
D1-0.9385270.141149-6.6491900.0000
D2-1.6509330.163267-10.111870.0000
D3-1.5239300.137596-11.075370.0000
AR(1)0.4847430.1709592.8354290.0080

R-squared0.953597Mean dependent var14.92359
Adjusted R-squared0.943119S.D. dependent var1.589140
S.E. of regression0.379007Akaike info criterion1.078159
Sum squared resid4.453041Schwarz criterion1.419402
Log likelihood-13.02410F-statistic91.00801
Durbin-Watson stat1.940739Prob(F-statistic)0.000000

Inverted AR Roots.48

Regression Run 22
Dependent Variable: CONPK
Method: Least Squares
Sample(adjusted): 1975:2 1984:4
Included observations: 39 after adjusting endpoints
Convergence achieved after 32 iterations
VariableCoefficientStd. Errort-StatisticProb.
C15.116761.11090413.607620.0000
PRIPK-0.0783710.007517-10.425460.0000
PRIBF0.0383690.0055546.9084500.0000
LYDUSP2.2341940.9331252.3943140.0229
D1-0.9329560.144686-6.4481550.0000
D2-1.6359280.164427-9.9492410.0000
D3-1.5164720.141406-10.724240.0000
AR(1)0.4061620.1766482.2992710.0284

R-squared0.954604Mean dependent var14.92359
Adjusted R-squared0.944353S.D. dependent var1.589140
S.E. of regression0.374871Akaike info criterion1.056210
Sum squared resid4.356367Schwarz criterion1.397454
Log likelihood-12.59610F-statistic93.12587
Durbin-Watson stat1.811766Prob(F-statistic)0.000000

Inverted AR Roots.41

As can be seen by comparing the GLS result with the OLS result, the correction with a point estimate for a population did indeed decrease the significance of the price and income explanatory variables as would have been expected. The increase in significance of the seasonal dummies with GLS estimation raises the possibility that the seasonal pattern is not as simple as that implied by the three intercept dummies.

Note that most of the lowest Durbin-Watson statistics came with the seasonal dummy group included, supporting the hypothesis that this set of dummies did not properly capture the actual seasonality of the demand for pork; the introduction of the seasonal dummies might have deleted some but not all of the seasonal variation, leaving a serially correlated pattern in the residuals. While the equation was probably better off with the seasonal dummies than without, a better knowledge of the meat industry before estimation might have allowed a more sophisticated seasonal pattern to be chosen. Even using only intercept dummies, for example, a better overall fit might have been obtained by using only one seasonal dummy (D4, equal to one in the fourth quarter and zero otherwise). To make such a switch (or to drop one of the seasonal dummies while keeping the others) on the sole basis of the attached estimations would be a mistake, however, because the hypothesis would be tested on the same data set from which it was developed.

Finally, note that some of the Durbin-Watson statistics were greater than two when PROPK was included in the regression. This result almost surely occurred because the resulting equation included both demand-side and supply-side variables, and the residuals were no longer the residuals of just the demand-side equation. This result is one of many reasons that most econometricians go to great lengths to avoid including such a production variable in a demand-side equation. In a sense, production acts like a dominant variable; its relationship to the dependent variable is strong but is definitional with little economic content.




Pearson Copyright © 1995 - 2010 Pearson Education . All rights reserved. Pearson Addison Wesley is an imprint of Pearson .
Legal Notice | Privacy Policy | Permissions

Return to the Top of this Page