Time Series Analysis II

Univariate ARIMA Models

Table of Contents

The Theoretical Model

The Empirical Model

Extensions

Readings


The General Model

Yt = m + y1et-1 + y2et-2 + ... + et

or

Yt = m + (1+y1B+y2B2+...)et = m + y(B)et

where

y(B) = 1 + y1B + y2B2 + ..., and
et ~ ii(0,s2), t = 1,2,...,N.

Stationarity requirement for the process:
Mean m = E(Yt) < ¥
Variance g0 = s2åi=0,...,¥yi2 < ¥
Autocovariance gj = s2åi=0,...,¥yiyj+i < ¥
Autocorrelation fj = gj / g0

The plot of aucorrelation coefficients at each lag j = 0,1,2,... is called autocorrelation function. In general, the autocorrelation function of a stationary time series falls to 0 quickly. The autocorrelation function of a non-stationary time series does not converge to zero. However, by differencing the time series, the differenced series may become stationary.

A linear stochastic process called an integrated process of order d, I(d), if the d-th difference of the series is stationary. Note that d is the lowest number of differences required for the resulting series to be stationary. That is, Yt ~ I(d) if DdYt is stationary, where

DYt = Yt-Yt-1
D2Yt = DYt-DYt-1
...

Moving Average Process of Order q: MA(q)

Consider the special case of linear stochastic process with finite number of lags:

yj = -qj, j=1,2,...q
0 for j > q

then

Yt = m - q1et-1 - q2et-2 - ... - qqet-q + et
= m + (1 - q1B - q2B2 - ... - qqBq)et
= m + q(B) et

where

q(B) = 1 - q1B - q2B2 - ... - qqBq, and
et ~ ii(0,s2), t = 1,2,...,N.

Examples

Autoregressive Process of Order p: AR(p)

The autoregressive representation of the general linear stochastic process can be obtained by substituting out et-1, et-2, ... sequentially. Then,

Yt = d + r1Yt-1 + r2Yt-2 + r3Yt-3 + ... et

Consider only the case with finite number of autoregressive lags. That is,

Yt = d + r1Yt-1 + r2Yt-2 + ... + rpYt-p + et

or

r(B)Yt = d + et,

where

r(B) = 1 - r1B - r2B2 - ... - rpBp, and
et ~ ii(0,s2), t = 1,2,...,N.

Examples

Mixed Autoregressive and Moving Average Process: ARMA(p,q)

Yt = d + r1Yt-1 + r2Yt-2 + ... + rpYt-p - q1et-1 - q2et-2 - ... - qqet-q + et

or

r(B)Yt = d + q(B)et,

where

r(B) = 1 - r1B - r2B2 - ... - rpBp,
q(B) = 1 - q1B - q2B2 - ... - qqBq, and
et ~ ii(0,s2), t = 1,2,...,N.

Examples

Partial Autocorrelation Function

Deriving from the computation of autocorrelation function (Yule-Walker equations), we obtain the partial autocorrelation coefficients for different order of AR(p) process, p = 1,2,...

f11 = f1 (this is r1 of AR(1))
f22 = (f2-f12) / (1-f12) (this is r2 of AR(2)),
and for additional lags j = 3,4,...:
fjj =
fj-åk=1,...,j-1 fj-1,kfj-k

1-åk=1,...,j-1 fj-1,kfk
where fjk = fj-1,k - fjjfj-1,j-k for k = 1,2,...,j-1.

Plot of partial autocorrelation can reveal the correct order of AR(p) process. In other words, For AR(p), fjj = 0 for j > p. For MA(q), fjj is non-zeros for all j and exhibits a geometrically decaying pattern. For ARMA(p,q), the decay of partial autocorrelations fjj starts after the p-th lag.

The Empirical Model

Given a sample of time series observations Y1, Y2, ..., YN, sample statistics such as mean, variance, autocovariances, and autocorrelations can be used to identify the structure of the data generating process for Yt.

The structural identification of a time series includes (1) the minimum order d of differencing required on the sample to achieve stationarity; (2) the appropriate order q of a moving-average process; and (3) the appropriate order p of an autoregressive process for the stationary time series. First of all, a rapidly decline pattern of sample autocorrelation plot or correlogram is needed to ensure a stationary time series for further identification and analysis.

Model Identification

From the theoretical investigation of the linear stochastic process, autocorrelation function is useful to identify the order of a moving-average process (that is, MA(q)), while partial autocorrelation is useful to identify the order of an autoregressive process (that is, AR(p)).

Model Estimation

Forecasting

Time series forecasting is based on the estimated model:

r(B)Yt = d + q(B)et, t=1,2,...N

where

r(B) = 1-r1B-r2B2-...-rpBp,
q(B) = 1-q1B-q2B2-...-qqBq.

Because m = r(B)-1d and
y(B) = r(B)-1q(B) = 1 + y1B + y2B2 + ...

The forecasting model can be represented as:

Yt = m + y1et-1 + y2et-2 + ... + et

Example 1

To demonstrate the univariate time series analysis, BONDS.TXT is a data file consisting of 5 years of monthly average of the yields on a Moody's Aaa rated corporat bond (see also Greene [2002], Example 20.1). The original level series is nonstationary, but it can be shown as an integrated process of the first order or I(1) without augmented lags (we leave this as an exercise). In other words, the data generating process may be written as a stationary 2nd order autoregressive process as follows:

Yt = g0 + g1 Yt-1 + g2 Yt-2 + ut

The alternative is to express the model (mean-deviation) residuals in terms of AR(2) structure:

Yt = m + et
et = r1 et-1 + r2 et-2 + ut

where ut ~ nii(0,s2).

We investigate the data series Yt for autocorrelations and partial autocorrelations. For univariate analysis, the ACF and PACF of the time series is the same as those of residuals obtained from the mean-deviation regression. Up to the maximum number of lags of ACF and PACF specified, Box-Pierece and Ljung-Box test statistics will be useful for identifying the proper order of AR(p), MA(q), or ARMA(p,q) process. In addition, Breusch-Pagan test may be used to verify the higher order of autocorrelation (Program).


Extensions

Intervention Analysis

r(B)Yt = d + gZt + q(B)et

where Zt is a deterministic fixed variable (e.g., trend, dummy, or step variable). The interpretation of the intervention variable Zt may be of interest.

Seasonal ARMA and Mixed Model

Transfer Function or ARMAX Model

r(B)Yt = d + b(B)Xt + q(B)et

where b(B) = b0 + b1B + b2B2 + ... + bKBK. Model analysis including model identification, estimation, and forecasting is the same as (although more complicate than) the univariate ARMA analysis. Regression parameters bs and ARMA parameters rs and qs must be simultaneously estimated through iterations of nonlinear functional (sum-of-squares or log-likelihood) optimization. For statistical reference, the degrees of freedom must be adjusted.

Multivariate Vector Autocorrelation or VAR Model

r(1)(B)Yt = d(1) + b(1)(B)Xt + e1t
r(2)(B)Xt = d(2) + b(2)(B)Yt + e2t


Copyright © Kuan-Pin Lin
Last updated: 01/27/2005