Real-time macroeconomic forecasting with leading indicators: An empirical comparison

https://doi.org/10.1016/j.ijforecast.2010.04.008Get rights and content

Abstract

This paper demonstrates that the Conference Board’s Composite Leading Index (CLI) has significant real-time predictive ability for Industrial Production (IP) growth rates at horizons from one to six months ahead over the period 1989–2009. A popular but unrealistic analysis, which combines real-time data for CLI and final vintage data for IP as predictor variables, obscures the actual predictive content of the CLI, in the sense that in that case, the improvements in forecast accuracy relative to a univariate AR model are not significant. The CLI appears to be less useful for forecasting growth rates of the Conference Board’s Composite Coincident Index (CCI) in real time, as a univariate AR model performs better. This result is mostly due to its disappointing performance during the first five years of the forecast period. The CLI may not be the best way of exploiting the information contained in the underlying individual leading indicator variables. The use of principal components instead of CLI leads to more accurate real-time forecasts for both IP and CCI growth rates.

Introduction

Statistical agencies face a tradeoff between accuracy and timeliness in the construction and publication of macroeconomic time series such as output and employment. The quality of measurements of these variables improves if they are based on more complete sample information. However, collecting the necessary data takes time, while both the government and the public benefit from having such key macro variables available as soon as possible for making their policy and consumption and investment decisions. Many statistical agencies have chosen a compromise solution by releasing a first estimate with a relatively short delay (usually one or two months after the end of the relevant calendar period), but then revising this initial value as more information becomes available. These data revisions are often substantial, and may still take place several months and even years after the initial releases, while their characteristics appear to be time-varying and subject to business cycle fluctuations; see Aruoba (2008) and Swanson and Van Dijk (2006), among others. Such revisions obviously affect various aspects of macroeconomic analysis, including policy evaluation and out-of-sample forecasting, see Croushore, 2006, Croushore, in press.

Data revisions may be particularly influential when it comes to out-of-sample forecasting, given that such forecasts typically rely upon the observations for the most recent periods, which may still be subject to substantial revisions. In a seminal contribution, Diebold and Rudebusch (1991) documented the possible effects of using real-time data for forecasting purposes. Based on monthly vintage data for the 1970s and 80s, they demonstrate that the Conference Board’s Composite Leading Index (CLI) helps substantially in forecasting Industrial Production (IP) if ‘ex post’ (that is, revised) data are used. However, the usefulness of CLI evaporates completely in real-time forecasting, and in many cases a simple autoregressive (AR) model involving only lags of IP performs best. An important feature of their analysis is that, while they use real-time data for the CLI, revised data obtained from the final available vintage are used for IP.

The main goal of our paper is to reconsider the questions posed by Diebold and Rudebusch. In particular, we intend to conduct a genuine real-time forecasting experiment, using the values that were available in real-time, at the moment the forecasts were actually made, for all variables. In addition, we address the question of whether the CLI provides the best way of extracting the predictive information available from individual leading indicators. Although the CLI is commonly used in forecasting, it is still of interest to consider alternative approaches, such as principal components.

Our main results are the following. We find that the Conference Board’s CLI does have significant real-time predictive ability for forecasting IP growth rates at horizons from one to six months ahead over the period 1989–2009. Using real-time data for both IP and CLI, statistically significant reductions in mean squared errors (MSE) of between 8% and 15%, relative to an AR benchmark, can be achieved. Combining real-time data for CLI with final vintage data for IP, as Diebold and Rudebusch (1991) did, masks the predictive content of the CLI, in the sense that, in this setting, the improvements in forecast accuracy are no longer significant at horizons of three and six months.

The CLI turns out to be less useful for forecasting growth rates of the Conference Board’s Composite Coincident Index (CCI), as no MSE gains relative to a univariate AR model are obtained. This result is mostly due to a disappointing performance during the first five years of the forecast period.

We also find that the CLI is not the best way of exploiting the information contained in the underlying individual leading indicator variables. While including them as separate regressors in a linear forecasting model actually worsens the forecast performance, constructing alternative diffusion indexes by means of principal components leads to more accurate real-time forecasts for both the IP and CCI growth rates.

Real-time forecasting is also discussed, for instance, by Clements and Galvo (2006), Croushore and Stark (2001), Pesaran and Timmermann (2005), Swanson (1996) and Swanson, Ghysels, and Callan (1999); see also the survey paper by Croushore (2006). Recently, the Conference Board has reported positive results for real-time forecasts of the Composite Coincident Index (CCI), as the CLI provides significant forecast gains over the AR benchmark model, see McGuckin, Ozyildirim, and Zarnowitz (2007). These and most other real-time forecast studies are concerned with the US, but some recent applications have dealt with the euro area, for instance Golinelli and Parigi (2008) and Ozyildirim, Schaitkin, and Zarnowitz (2009).

The remainder of the paper is organized as follows. The data set, consisting of vintage data of various coincident and leading indicators, is described in Section 2. Both the competing forecast methods and different data specifications are laid out in Section 3. The real-time forecast results for IP and CCI growth rates are discussed in Section 4 and Section 5 concludes.

Section snippets

Vintage data

Our data set consists of monthly observations for the ten leading indicators and four coincident indicators (including industrial production, IP) underlying the Conference Board’s Composite Leading Index (CLI) and Composite Coincident Index (CCI). The individual series are listed in Table 1. The Business Cycle Indicators Handbook of the Conference Board (2001) provides further background on these variables. This collection of ten leading indicators provides a strong predictor set, see Banerjee

AR and CLI methods

Our main research question is whether the CLI has real-time forecasting ability for future h-month IP and CCI growth rates. For this purpose, we compare CLI-based forecasts obtained from the model yth=α+j=0rβjytj+j=0qγjCLItj+εt, where CLIt denotes the one-month growth rate of CLI, with those provided by a benchmark AR model yth=α+j=0rβjytj+εt. For both IP and CCI we consider forecast horizons h of one, three, and six months. The model specification, as given in Eq. (1), which relates the h

Forecasts of actual growth rates

The methods discussed in Section 3 are applied in forecasting IP and CCI growth rates over horizons of one, three, and six months. Table 2 provides a summary of the main outcomes over the complete forecast period from December 1988 to April 2009 in terms of the mean squared forecast errors (MSEs), where the growth rates in the last available vintage of May 2009 are used as the actual (target) values. The MSEs for AR are in absolute terms, whereas the MSEs of the other three methods (CLI, OLS,

Conclusions

A realistic evaluation of the predictive ability of a forecasting method requires the use of real-time data, that is, data that were actually available at the time the forecasts were made. In this paper, we show that the Conference Board’s CLI does have such real-time predictive ability for forecasting IP growth rates at horizons from one to six months ahead over the period 1989–2009. Using real-time data for both variables, statistically significant reductions in MSE of between 8% and 15%

Acknowledgements

We thank Ataman Ozyildirim of The Conference Board for making the vintage data of the leading and coincident variables and composite indexes available to us. Furthermore, we thank the Associate Editor and two anonymous referees for their helpful suggestions and constructive comments on a preliminary version of this paper.

Christiaan Heij is Assistant Professor in Econometrics and Statistics at the Econometric Institute, Erasmus University Rotterdam. His current research interests lie in the areas of applied econometrics, statistics, and forecasting. He has published in journals in systems and control, econometrics, and statistics. He has(co-)authored four books, including the textbook ‘Econometric Methods with Applications in Business and Economics’, published in 2004 by Oxford University Press.

References (29)

  • M. Camacho et al.

    This is what the leading indicators lead

    Journal of Applied Econometrics

    (2002)
  • T.E. Clark et al.

    Tests of equal predictive ability with real-time data

    Journal of Business and Economic Statistics

    (2009)
  • Conference Board, The (2001). The business cycle indicators...
  • Croushore, D. (2008). Frontiers of real-time data analysis. Federal Reserve Bank of Philadelphia. Working paper 08-4....
  • Cited by (15)

    • Tracking world trade and GDP in real time

      2014, International Journal of Forecasting
      Citation Excerpt :

      Thirdly, the world trade and GDP forecasting ability of the WBM can alternatively be evaluated on the basis of the latest available observations (see column 6 of Table 4). As Diebold and Rudebusch observe in their seminal paper (Diebold & Rudebusch, 1991), models based on revised data are characterised by better forecasting ability (see also Heij, van Dijk, & Groenen, 2011). Unfortunately, however, this is not a realistic case, as revised information is not available at the time when the forecast is computed.

    • The D-model for GDP nowcasting

      2023, Swiss Journal of Economics and Statistics
    • The D-model for GDP nowcasting

      2022, Research Square
    View all citing articles on Scopus

    Christiaan Heij is Assistant Professor in Econometrics and Statistics at the Econometric Institute, Erasmus University Rotterdam. His current research interests lie in the areas of applied econometrics, statistics, and forecasting. He has published in journals in systems and control, econometrics, and statistics. He has(co-)authored four books, including the textbook ‘Econometric Methods with Applications in Business and Economics’, published in 2004 by Oxford University Press.

    Dick van Dijk is Professor in Financial Econometrics at the Econometric Institute, Erasmus University Rotterdam. His research interests include volatility modeling and forecasting, high-frequency data, business cycle analysis, and nonlinear time series analysis. He has published in the Journal of Applied Econometrics, Journal of Business and Economic Statistics, Journal of Econometrics, Journal of Empirical Finance, and Review of Economics and Statistics, among others. He also co-authored the book ‘Nonlinear Time Series Models in Empirical Finance’ (with Philip Hans Franses), published by Cambridge University Press in the year 2000.

    Patrick J.F. Groenen is full professor in statistics at the Econometric Institute, Erasmus University Rotterdam. His research interests lie in data modeling, multivariate analysis, visualization and optimization. He has written papers in journals such as Quantitative Finance, Journal of Empirical Finance, Journal of Marketing Research, Psychometrika, Computational Statistics and Data Analysis, Journal of Classification, and Multivariate Behavioural Research. He is also coauthor of a textbook on multidimensional scaling published in the Statistics Series by Springer.

    View full text