Microsoft Word - 00_tresc.docx


DYNAMIC ECONOMETRIC MODELS 
Vol. 9 – Nicolaus Copernicus University – Toruń – 2009 

Jarosław Krajewski 
Nicolaus Copernicus University in Toruń 

Estimating and Forecasting GDP in Poland  
with Dynamic Factor Model† 

A b s t r a c t. Presented paper concerns the dynamic factor models theory and application in the 
econometric analysis of GDP in Poland. DFMs are used for construction of the economic indica-
tors and in forecasting, in analyses of the monetary policy and international business cycles. In the 
article we compare the forecast accuracy of DFMs with the forecast accuracy of 2 competitive 
models: AR model and symptomatic model. We have used 41 quarterly time series from the 
Polish economy. The results are encouraging. The DFM outperforms other models. The best fitted 
to empirical data was model with 3 factors. 

K e y w o r d s: Dynamic factor models, principal components analysis, GDP. 

1. Introduction 
 In recent years, dynamic factor models have become popular in empirical 
macroeconomics. They are believed to have been pioneered by Geweke (1977) 
and Sims & Sargent (1977) who applied this type of models to the analysis of 
small sets of variables. DFMs have a very wide field of applications. Such mod-
els are widely used for forecasting, constructing leading indicators of business 
climate, monetary policy analysis or the analysis of international business cy-
cles.  
 The purpose of the article is to estimate a dynamic factor model of GDP in 
Poland in 1997 – 2008.  
 The second part of the article presents a concept of a dynamic factor model. 
In the third part the approach to estimating model parameters as well as com-
mon factors are discussed. The methods of specifying the number of factors in 

                                                 
† Scientific work is financing by European Social Fund and national budget of Poland in The 

Integrated Regional Operational Programme framework, Measure 2.6 "Regional Innovation Strat-
egies and transfer of knowledge" Kujawsko-Pomorskie voievodship own project "Candidate for 
doctor's degree grants 2008/2009 – IROP". 


Jarosław Krajewski 

 
140 

the model are also presented. The data used in the study and the empirical re-
sults have been described in the fourth part. Final part summarises the whole 
study. 

2. Dynamic Factor Model  
 The concept of factor models bases on the assumption that the behavior of 
most macroeconomic variables may be well described using a small number of 
unobserved common factors. These factors are often interpreted as the driving 
forces in the economy. The particular variables may be then expressed as linear 
combination of up-to-twenty common factors which usually make it possible to 
explain a major part of variability of those variables (Kotłowski, 2008).  
    Let ty  stand for a variable and let tX express the vector of N variables con-
taining information that can be useful in modeling and forecasting the future 
values of ty . In the dynamic factor model we assume that all variables itx  con-
tained in vector tX  may be expressed as a linear combination of current and 
lagged unobserved factors itf .  

,,...,1for ,)( NieLx itiit =+= tfλ  (1) 

where ]',...,,[ 21 trtt fff=tf stands for vector r of unobserved common factors at 

moment t, qiqiiii LLL λλλλλ ++++= ...)(
2

210  represent a lag polynomials and 

ite  express an idiosyncratic errors for variable itx  (see Stock, Watson, 1998). 

 In turn, ty  may be noted as the function of current and lagged common 
factors contained in vector tf  and the past values of variable ty , with the fol-
lowing formula 

.)()( tttt eyLLy ++= γβ f  (2) 

 The model described with equations (1) and (2) is a dynamic factor model. 

3. Model Estimation and Selection of the Factor’s Number 
 One of the most widely used methods of parameters and factors estimation 
in a factor models is the method of principal components. Let us emphasise that 
both: the factor matrix and the coefficient matrix are unknown. Model (1) is 
thus equivalent to the model in the matrix form of  

,'1 eΛFHHX += −  (3) 
where matrix H  is any non-singular matrix of dimension rr × . It is necessary 
to carry out the appropriate normalisation of matrix H . Stock & Watson (1998) 


Estimating and Forecasting GDP in Poland … 

 
141

suggest that for this purpose condition in the form of rIΛΛ =)/(
' N  may be 

imposed on the parameters of model which would render matrix H orthonormal.  
 The estimation of matricies F and Λ using the method of principal compo-
nent consist in finding such estimates of matrices F̂  and Λ̂  that would mini-
mise the residual sum of squares in equation (3) as expressed with the following 
formula 

.)(
1

)(
1 1

2'∑∑
= =

−=
N

i

T

t
itxNT

V ti FΛΛF,  (4) 

It is necessary, in the first step, to perform a minimisation of function (4) in 
respect to factor matrix F with the assumption that matrix Λ is known and fixed. 
Then we obtain estimate F̂ , as function Λ, which is  subsequently substituted in 
equation (4) for the true value of F. In the second step, we minimise function 
(4) in respect to matrix Λ with a normalisation condition ,)/( ' rIΛΛ =N  thus 
directly obtaining estimate .Λ̂  It should be emphasised that it is equivalent to 
maximisation of expression ].)([ '' ΛXXΛtr  

 Matrix Λ̂  is a matrix whose subsequent columns are eigenvectors of matrix  
XX' multiplied by N  corresponding to highest eigenvalues of the same ma-

trix. In turn, the estimate of matrix F is expressed by the formula 

./)ˆ(ˆ NΛXF =  (5) 

Stock & Watson (1998) emphasise that if the number of variables is higher than 
the number of observations, i. e. N > T, then from the computational point of 
view it is easier to apply a procedure which determine estimate F

~
 by minimiz-

ing concentrated function (4) in respect to matrix F with the condition 
./' rIFF =T  Matrix F

~
 will then contain eigenvectors of matrix XX'  corres-

ponding to r highest eigenvalues of this matrix and multiplied by T . In turn, 
the estimate of matrix Λ

~
 will assume the following form 

./)
~

(
~ '' TXFΛ =  (6) 

 In practice, the number of factors necessary to represent the correlation 
among the variables is usually unknown. To determine the number of factors 
empirically a number of criteria were suggested. Bai and Ng (2002) have sug-
gested information criteria to be used to estimate the number of factors. 

,ln))(ln()(1 ⎟⎟
⎠

⎞
⎜⎜
⎝

⎛
⎟
⎠
⎞

⎜
⎝
⎛

+
⎟
⎠
⎞

⎜
⎝
⎛ +

+=
TN

NT
NT

TN
kkVkIC

)
 (7) 


Jarosław Krajewski 

 
142 

,ln))(ln()( 22 ⎟⎟
⎠

⎞
⎜⎜
⎝

⎛
⎟
⎠
⎞

⎜
⎝
⎛ +

+= NTCNT
TN

kkVkIC
)

 (8) 

,
ln

))(ˆln()( 2
2

2 ⎟
⎟
⎠

⎞
⎜
⎜
⎝

⎛
+=

NT

NT

C
C

kkVkIC  (9) 

where )(ˆ kV  is residual sum of squares from k – factors model 

and { }.min TNCNT =  
4. Description of Data and Empirical Results 
 The data used in the study are macroeconomic quarterly data describing 
Polish economy and encompass the period from first quarter of 1997 to third 
quarter of 2008 (47 observations). As explained variable we used polish GDP. 
All of data were taken from polish Central Statistical Office1. Before embarking 
on the work on factor model specification, the date had to be appropriately 
modified. In the first step variables were adjusted for the impact of seasonal 
fluctuations. Next, the series were transformed by taking logarithms and/or dif-
ferencing so that the transformed series were stationary (Green, 2003).  In the 
final step, all variables were standardized. In total, 41 time series were consider-
ing, representing of following macroeconomic categories: output & sales, con-
struction, domestic and foreign trade, prices and labour market, budgetary and 
monetary policy.  
 After preliminaries principal components analysis were used to estimate 
factors. Next Bai and Ng informational criteria were calculated to specify num-
ber of factors. Table 1 includes eigenvalues and values of information criteria. 
The first two criteria reach minimum for the number of factors equal to 3, while 
the third criteria assumes its lowest value for 10 factors. Due to the fact that two 
out of three criteria display the same value, we arbitrarily assume that the num-
ber of factors in the model is 3. The first three factors explain almost 82% of 
total variance of GDP. 
 Some econometricians maintain that factors estimated using the principal 
component method do not have an economic interpretation. However, in this 
paper try-out were done. For this reason, it is possible to carry out a regression 
of particular variables against each of the estimated factors and check which 
factor explains the behavior of a given variable to the greatest extent. The 
R-squared values on the regression of particular variables suggest that the first 
factor primarily affects the variability of labour market and foreign trade. The 
second factor determines the prices and incomes. The third factor to the greatest 
extent influences the values of sales.  

                                                 
1 www.stat.gov.pl 


Estimating and Forecasting GDP in Poland … 

 
143

 The further stage of study BIC criterion was used to determine the number 
of GDP and factors delays. The BIC criterion indicated model only with current 
values of the first three factors. Model estimation results are presented in Ta-
ble 2. All coefficients are significant under 5% level. Factor model of GDP in 
Poland estimated in this way has R-squared over 70%. Real values of GDP and 
values based on the model are shown on the figure 1.    

Table 1. Selection of the number of factors in the model 
Number of 

factors Eigenvalues 
Contribution 
to variance 

Cumulative contribution to 
variance IC1 IC2 IC3 

1 76.949 0.721 0.721 -3.694 -3.491 -3.833 
2 6.249 0.059 0.779 -3.584 -3.177 -3.861 
3 4.278 0.040 0.819 -4.555 -3.945 -4.971 
4 2.921 0.027 0.846 -4.490 -3.677 -5.044 
5 2.460 0.023 0.870 -4.467 -3.451 -5.160 
6 1.976 0.019 0.888 -4.335 -3.115 -5.166 
7 1.551 0.015 0.903 -4.192 -2.769 -5.162 
8 1.429 0.013 0.916 -4.109 -2.482 -5.216 
9 1.282 0.012 0.928 -4.246 -2.417 -5.493 
10 1.066 0.010 0.938 -4.126 -2.093 -5.511 

  
Figure 1. Actual and fitted values of GDP in Poland in 1997 – 2008 

 Next stage of analysis was to check if lagging some of variables will influ-
ence the final result of estimation. This caused increasing the number of factors 
to four. The resulting model with four factors is presented in Table 3. It is not 

-3

-2

-1

0

1

2

3

1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35 37 39 41 43

Actual Fitted


Jarosław Krajewski 

 
144 

hard to see that in four factors model R-squared is higher and true significance 
level of coefficients is lower.  

Table 2. Dynamic factor model of GDP in Poland in 1997-2008 

Dependent Variable: GDP 
 Coefficient Std. Error T - statistic P - value 

F1 -0.0362 0.0095 -3.7985 0.0005 
F2 0.0683 0.0335 2.0404 0.0478 
F3 0.3707 0.0405 9.1610 0.0000 

R-squared 0.7142 Akaike Criterion 1.7264 
Adjusted R-squared 0.7003 Schwarz Criterion 1.8481 

Durbin-Watson 2.1454 Hannan-Quinn Criterion 1.7715 

Table 3. Dynamic factor model of GDP in Poland in 1997-2008 – after changes in data 
set 

Dependent Variable: GDP 
 Coefficient Std. Error. T - statistic P - value 

F21 0.1722 0.0273 6.3160 0.0000 
F22 0.1686 0.0329 5.1320 0.0000 
F23 -0.2548 0.0361 -7.0669 0.0000 
F24 0.2241 0.0400 5.6090 0.0000 

R-squared 0.7909 Akaike Criterion 1.4858 
Adjusted R-squared 0.7748 Schwarz Criterion 1.6497 

Durbin-Watson 2.2511 Hannan-Quinn Criterion 1.5463 

Table 4. Forecast errors 

 MAPE RMSE R-squared 
AR 90.8002 0.8116 0.1246 

DFM 1.7898 0.0160 0.7003 
DFM2 12.8691 0.1150 0.7748 

Causal Model 4.9267 0.0440 0.8694 

 In the last stage of our study we generated forecasts and forecast errors. The 
forecasting performance of the factor models was evaluated by comparing the 
accuracy of GDP forecasts obtained on the basis of the factor models with the 
accuracy of GDP forecasts derived from other competitive models. Two com-
petitive models were taken into consideration: an univariate autoregressive 
model and causal model with two variables. An univariate autoregressive model 
was adopted as the main benchmark model for evaluating the forecasting per-
formance of the factor models (see Marcellino, Stock, Watson, 2001). The BIC 
criterion indicated model AR(1). The causal model included industrial produc-
tion sales and average employment as explanatory variables. Forecasting mod-
els were estimated on a shorter sample (up to 4 quarter 2007). The forecast was 


Estimating and Forecasting GDP in Poland … 

 
145

produced on the one period ahead. First factor model had the best forecast accu-
racy.  Table 4 presents results.  

5. Summary 
 The principal component analysis reduced the number of explanatory va-
riables from 41 to 3 factors. The resulting dynamic factor model of GDP in 
Poland is satisfactory from the statistical point of view. 
 Changes in data set influenced the final result of model estimation. In this 
study it brought out increasing number of factors and improvement estimation 
performance. Unfortunately, it did not improve forecasting performance. 
 First dynamic factor model of GDP in Poland in 1997 – 2008 gave the best 
forecasting performance in comparison with three competitive models described 
above. 

References  
Bai, J., Ng, S. (2002), Determining the Number of Factors in Approximate Factor Models, Eco-

nometrica, 70, 191–221. 
Geweke, J. (1977), The Dynamic Factor Analysis of Economic Time Series, [in:] Aigner D. J., 

Goldberger A. S. (ed.), Latent Variables in Socio–Economic Models, Amsterdam, North 
Holland. 

Greene, W. H. (2003), Econometric Analysis, Pearson Education, New Jersey. 
Kotłowski, J. (2008), Forecasting Inflation With Dynamic Factor Model – the Case of Poland, 

Working Papers, 2-08, SGH, Warszawa.  
Marcellino, M., Stock, J. H., Watson, M. W. (2001), Macroeconomic Forecasting in the Euro 

Area: Country Specific versus Area–Wide Information, Working Paper, 201, Innocenzo 
Gasparini Institute for Economic Research. 

Sargent, T., Sims, C. (1977), Business Cycle Modelling Without Pretending to Have Too Much 
A-Priori Economic Theory, in Sims C.  (ed.), New Methods in Business Cycle Research, 
Minneapolis, Federal Reserve Bank of Minneapolis. 

Sims, C. A. (1980), Macroeconomics and Reality, Econometrica, 48, 1–48.  
Stock J., Watson, M. W. (1998), Diffusion Indexes, Working Paper, 6702, National Bureau of 

Economic Research. 

Zastosowanie dynamicznego modelu czynnikowego do modelowania 
i prognozowania PKB w Polsce 

Z a r y s  t r e ś c i. Referat traktuje o podstawach konstrukcji dynamicznych modeli czynniko-
wych i ich zastosowaniu empirycznym. DFM stosuje się do prognozowania, konstruowania 
głównych wskaźników koniunktury, analiz polityki monetarnej i badania międzynarodowych 
cykli koniunkturalnych. W referacie oszacowano dynamiczny model czynnikowy PKB w Polsce 
w latach 1997–2008, a także oceniono trafność uzyskanych na jego podstawie prognoz w porów-
naniu do modelu AR i modelu symptomatycznego. Zbiór danych wykorzystanych do badania 
zawiera 41 zmiennych makroekonomicznych. Najlepszym ze statystycznego punktu widzenia 
okazał się model z 3 czynnikami. 

S ł o w a  k l u c z o w e: dynamiczny model czynnikowy, metoda głównych składowych, PKB.