Frisch–Waugh–Lovell theorem
In econometrics, the Frisch–Waugh–Lovell[a] (FWL) theorem is named after the econometricians Ragnar Frisch, Frederick V. Waugh, and Michael C. Lovell.[1][2][3]
Background
The Frisch-Waugh-Lovell theorem is an algebraic result for regressions estimated by least squares, the most commonly used estimator in applied econometrics.[4] Least squares is a method of estimating coefficients in models which are linear in parameters. That is, the outcome variable is modeled as a linear combination of the input variables plus some error term. The least squares solution is that which sets the input variables' coefficients to minimize the sum of squared errors. Under a certain set of assumptions, the Gauss–Markov theorem, least squares estimation is the best linear unbiased estimator.
Let be any outcome variable and a set of predictor variables, such that , and suppose observations of are sampled. If is modeled as a linear function of , it can be written as . The least squares estimator sets the coefficients to minimize the sum of squared errors . With observations this involves minimizing across equations, and is typically written in matrix form as , where and are -dimensional column vectors and is an -by--dimensional matrix. Then, the least squares solution is .[5]
In regressions estimated by least squares, it is common to refer to a coefficient as the effect of that variable "holding constant" the other input variables.[6] For example, if wage is modeled as a function of education and work experience, the coefficient for education is interpreted as the difference in the expectation of wage for a unit difference in education, "holding constant" work experience. Econometrician Arthur Goldberger frames the Frisch-Waugh-Lovell theorem as "giving content to th[is] language".[7]
Definition and interpretation
The Frisch-Waugh-Lovell theorem states that in a least squares-estimated regression of the form
any coefficient can be estimated by the two-step process of:
- Regress on the set of other right-hand-side variables, obtaining residuals
- Regress on , obtaining
This two-step process is referred to as the residual regression or equivalently the regression anatomy theorem.[8][9][10]
The theorem shows that coefficients in a multiple regression reflect the relationship between the associated variable and the outcome variable after removing the part linearly explained by the other predictor variables.[11][9] This is the basis for understanding the contribution of each single variable to a multivariate regression (see, for instance, Ch. 13 in [12]).
Double residual regression
The double residual regression is the three-step process:
- Regress on the set of other right-hand-side variables, obtaining residuals
- Regress on the set of right-hand-side variables excluding , obtaining residuals
- Regress on , estimating and
This yields an identical coefficient to the two-step process.[7][13] It includes the additional feature that the residuals from the regression in step 3 equal the residuals in the full regression.[11]
Multivariate definition
Consider the regression , where and are -dimensional column vectors, is an -by- matrix, and is an -by- matrix. Then, the Frisch-Waugh-Lovell theorem states that
where , the residuals from the regression of on , and , the residuals from the regression of on . The first expression of is the residual regression, and the second the double residual regression.[7]
History
The origin of the theorem is uncertain, but it was well-established in the realm of linear regression before the Frisch and Waugh paper. George Udny Yule's comprehensive analysis of partial regressions, published in 1907, included the theorem in section 9 on page 184.[14]
Yule emphasized the theorem's importance for understanding multiple and partial regression and correlation coefficients, as mentioned in section 10 of the same paper.[14]
Yule 1907 also introduced the partial regression notation which is still in use today.
In 1962, Richard Stone generalized the theorem to apply to an arbitrary number of variables which may be chosen for special analysis in the same way that time was distinguished in Frisch's and Waugh's original formulation.[15]
In 1963, Lovell published a proof considered more straightforward and intuitive.[2] In recognition, people generally add his name to the theorem name.
Proof
Consider the linear regression and annihilator matrix . Premultiplying both sides of the regression equation by the annihilator matrix removes from and the component linearly explained by :
Then, by the least squares result, and . This concludes the proof.[16][7]
Extensions
Standard errors
The Frisch-Waugh-Lovell theorem applies to the standard errors of the partial and full regressions, where they differ (in the homoskedastic case) only by a degrees of freedom adjustment.[17]
See also
Notes
- ^ Pronounced /ˈfriʃˌwɔːˌlʌvəl/.
References
Sources
- ^ Frisch 1933.
- ^ a b Lovell 1963.
- ^ Lovell 2008.
- ^ Hansen 2022, p. 13.
- ^ Hansen, p. 73.
- ^ Hansen, p. 30.
- ^ a b c d Goldberger 1991, p. 186.
- ^ Hansen 2022, p. 81.
- ^ a b Goldberger 1991, p. 185.
- ^ Filoso 2013, p. 93.
- ^ a b Hansen 2022, p. 82.
- ^ Tukey 1977.
- ^ Goldberger 1968, p. 30.
- ^ a b Yule 1907.
- ^ Stone 1970.
- ^ Hayashi 2000.
- ^ Ding 2021.
Journal articles
- Yule, George Udny (1907-05-14). "On the theory of correlation for any number of variables, treated by a new system of notation". Proceedings of the Royal Society of London. Series A, Containing Papers of a Mathematical and Physical Character. 79 (529): 182–193. doi:10.1098/rspa.1907.0028. ISSN 0950-1207.
- Frisch, Ragnar; Waugh, Frederick V. (1933). "Partial Time Regressions as Compared with Individual Trends". Econometrica. 1 (4): 387–401. doi:10.2307/1907330. ISSN 0012-9682.
- Lovell, Michael C. (December 1963). "Seasonal Adjustment of Economic Time Series and Multiple Regression Analysis". Journal of the American Statistical Association. 58 (304): 993–1010. doi:10.1080/01621459.1963.10480682. ISSN 0162-1459.
- Lovell, Michael C. (January 2008). "A Simple Proof of the FWL Theorem". The Journal of Economic Education. 39 (1): 88–91. doi:10.3200/JECE.39.1.88-91. ISSN 0022-0485.
- Aldrich, John (1998). "Doing Least Squares: Perspectives from Gauss and Yule". International Statistical Review / Revue Internationale de Statistique. 66 (1): 61–81. doi:10.2307/1403657. ISSN 0306-7734.
- Sosa Escudero, Walter (March 2001). "A geometric representation of the Frisch-Waugh-Lovell theorem". Documentos de Trabajo. 29. ISSN 1853-3930.
- Filoso, Valerio (March 2013). "Regression Anatomy, Revealed". The Stata Journal: Promoting communications on statistics and Stata. 13 (1): 92–106. doi:10.1177/1536867X1301300107. ISSN 1536-867X.
- Ding, Peng (2021-01-01). "The Frisch–Waugh–Lovell theorem for standard errors". Statistics & Probability Letters. 168 108945. doi:10.1016/j.spl.2020.108945. ISSN 0167-7152.
- Basu, Deepankar (2024-10-01). "Frisch–Waugh–Lovell theorem-type results for the k-Class and 2SGMM estimators". Statistics & Probability Letters. 213 110188. doi:10.1016/j.spl.2024.110188. ISSN 0167-7152.
- Fiebig, Denzil G.; Bartels, Robert (January 1996). "The frisch-waugh theorem and generalized least squares". Econometric Reviews. 15 (4): 431–443. doi:10.1080/07474939608800365. ISSN 0747-4938.
- Yamada, Hiroshi (2017-11-02). "The Frisch–Waugh–Lovell theorem for the lasso and the ridge regression". Communications in Statistics - Theory and Methods. 46 (21): 10897–10902. doi:10.1080/03610926.2016.1252403. ISSN 0361-0926.
Books
- Hansen, Bruce E. (2022). Econometrics. Princeton: Princeton University Press. ISBN 978-0-691-23615-5.
- Ruud, Paul Arthur (2000). An Introduction to Classical Econometric Theory. New York: Oxford University Press. ISBN 978-0-19-511164-4.
- Goldberger, Arthur Stanley (1991). A Course in Econometrics. Cambridge, Mass.: Harvard Univ. Press. ISBN 978-0-674-17544-0.
- Goldberger, Arthur Stanley (1968). Topics in Regression Analysis. New York: MacMillan. LCCN 68-15265.
- Mosteller, Frederick; Tukey, John W. (1977). Data Analysis and Regression a Second Course in Statistics. Addison-Wesley. ISBN 0-201-04854-X.
- Hayashi, Fumio (2000). Econometrics. Princeton: Princeton University Press. pp. 18–19. ISBN 0-691-01018-8.
- Stone, Richard (1970). "A generalization of the theorem of Frisch and Waugh". Mathematical Models of the Economy and Other Essays. Chapman and Hall. pp. 73–74. ISBN 0-412-10030-4.