or
Yt = m + (1+y1B+y2B2+...)et = m + y(B)et
where
y(B) = 1 + y1B + y2B2 + ..., and
et ~ ii(0,s2),
t = 1,2,...,N.
Stationarity requirement for the process:
| Mean | m = E(Yt) < ¥ |
| Variance | g0 = s2åi=0,...,¥yi2 < ¥ |
| Autocovariance | gj = s2åi=0,...,¥yiyj+i < ¥ |
| Autocorrelation | fj = gj / g0 |
The plot of aucorrelation coefficients at each lag j = 0,1,2,... is called autocorrelation function. In general, the autocorrelation function of a stationary time series falls to 0 quickly. The autocorrelation function of a non-stationary time series does not converge to zero. However, by differencing the time series, the differenced series may become stationary.
A linear stochastic process called an integrated process of order d, I(d), if the d-th difference of the series is stationary. Note that d is the lowest number of differences required for the resulting series to be stationary. That is, Yt ~ I(d) if DdYt is stationary, where
DYt = Yt-Yt-1
D2Yt = DYt-DYt-1
...
| yj = | -qj, j=1,2,...q |
| 0 for j > q |
then
| Yt | = m - q1et-1 - q2et-2 - ... - qqet-q + et |
| = m + (1 - q1B - q2B2 - ... - qqBq)et | |
| = m + q(B) et |
where
q(B) = 1 - q1B -
q2B2 - ... - qqBq, and
et ~ ii(0,s2),
t = 1,2,...,N.
or
| gj = | (-qj+q1qj+1+q2qj+2+...+qq-jqq)s2, j=1,2,...q |
| 0 otherwise |
or
| fj = |
| , j = 1,2,...q | |||
| 0 | otherwise |
Plot of autocorrelation coefficients of a stationary MA(q) process indicates a finite memory pattern up to the q-th lag. Beyond that the autocorrelation function is zero-valued.
Examples
Yt = d + r1Yt-1 + r2Yt-2 + r3Yt-3 + ... et
Consider only the case with finite number of autoregressive lags. That is,
Yt = d +
r1Yt-1 +
r2Yt-2 + ... +
rpYt-p +
et
or
r(B)Yt = d + et,
where
r(B) = 1 - r1B -
r2B2 - ... - rpBp, and
et ~ ii(0,s2),
t = 1,2,...,N.
| g0 = E(yt2) | = E[(r1yt-1 + r2yt-2 + ... + rpyt-p + et)2] |
| = r1g1 + r2g2 + ... + rpgp + s2 |
That is the Yule-Walker Equations of AR(p) Process:
f1 =
r1 +
r2f1 +
r3f2 + ... +
rpfp-1
f2 =
r1f1 +
r2 +
r3f1 + ... +
rpfp-2
...
fp =
r1fp-1 +
r2fp-2 +
r3fp-3 + ... +
rp
...
fj =
r1fj-1 +
r2fj-2 +
r3fj-3 + ... +
rpfj-p, for j > p
Plot of autocorrelation coefficients fj (j=1,2,...) of a stationary AR(p) process indicates an infinite but decay memory pattern.
Examples
or
r(B)Yt = d + q(B)et,
where
r(B) = 1 - r1B -
r2B2 - ... - rpBp,
q(B) = 1 - q1B -
q2B2 - ... - qqBq, and
et ~ ii(0,s2),
t = 1,2,...,N.
Therefore,
g0 =
E(yt2) =
åi=1,...,p
rigi +
[1-åi=1,...,q
qi(ri-qi)]s2.
That is,
| s2 = |
|
or,
for j = 1,2,...,q
gj =
åi=1,...,p
rig|i-j| -
[qj+åi=1,...,q-j
qi+j(ri-qi)]s2.
and, for j > p
gj =
åi=1,...,p
rig|i-j|
or,
for j = 1,2,...,q
| fj = | åi=1,...,p rif|i-j| - |
|
Plot of autocorrelation coefficients fj (j=1,2,...) of a stationary mixed ARMA(p,q) process reflects the combination of infinite memory AR and finite memory MA processes. However, after q lags, only the AR process continues.
Examples
| f1 = |
|
f11 = f1
(this is r1 of AR(1))
f22 =
(f2-f12) /
(1-f12)
(this is r2 of AR(2)),
and for additional lags j = 3,4,...:
| fjj = |
|
Plot of partial autocorrelation can reveal the correct order of AR(p) process. In other words, For AR(p), fjj = 0 for j > p. For MA(q), fjj is non-zeros for all j and exhibits a geometrically decaying pattern. For ARMA(p,q), the decay of partial autocorrelations fjj starts after the p-th lag.
The structural identification of a time series includes (1) the minimum order d of differencing required on the sample to achieve stationarity; (2) the appropriate order q of a moving-average process; and (3) the appropriate order p of an autoregressive process for the stationary time series. First of all, a rapidly decline pattern of sample autocorrelation plot or correlogram is needed to ensure a stationary time series for further identification and analysis.
Bartlett Test
Testing H0: fj = 0 for each j > q (no autocorrelation at each lag j longer than q)
based on the Bartlett distribution of the estimated fj.
That is, the estimated fj ~ normal(0,ÖVar(fj)) approximately, where
Var(fj) = 1/N
(1 + 2 åj=1,...,qfj2).
Box-Pierece Test and Ljung-Box Test
Testing H0: f1 =... = fk =0 (zero autocorrelation coefficients up to some k lags) based on Box-Pierece Q or Ljung-Box Q' Statistic defined as follows:
Q = N åj=1,...,kfj2
Q' = N(N+2) åj=1,...,kfj2/(N-j)
Both Q and Q' ~ Chi-Square(k).
Let fjj be the partial autocorrelation coefficient at the j-th lag. That is, fjj = rj obtained from the autoregressive regression of the AR(j) model:
Yt = d + r1Yt-1 + r2Yt-2 + ... + rjYt-j + et
If the sample series follows a AR(p) process, then the autoregressive coefficient for each lag j longer than p must be zero.
Testing H0: fjj = 0 for each j > p based on the approximated distribution for the estimated fjj ~ normal(0,1/ÖN). Alternatively, the standard error of fjj can be obtained from the corresponding estimated autoregressive regression equation for each lag.
In order to use all N data observations, initialization may be needed for the following:
The model may be written in the "inverted" form as
q(B)-1(-d+r(B)Yt) = et
where et ~ ii(0,s2). Conditional to the historical information (YN, ..., Y1), and data initialization (Y0, ..., Y-p+1), (e0, ..., e-q+1), the sum-of-squares is defined by
S = åt=1,2,...,Net2
Assuming et ~ nii(0,s2) for each observation i, the concentrated log-likelihood function is
ll = -N/2 (1+ln(2p)-ln(N)+ln(S))
The conditional maximum likelihood estimators of rs, qs, and d are obtained by maximizing the nonlinear function ll. A set of initial values for the parameters rs and qs are needed to start the iteration of nonlinear model estimation.
Further identification on the estimated residuals, with N-(p+q+1) of observations as the estimated model is an ARMA(p,q) process.
r(B)Yt = d + q(B)et, t=1,2,...N
where
r(B) = 1-r1B-r2B2-...-rpBp,
q(B) = 1-q1B-q2B2-...-qqBq.
Because m = r(B)-1d and
y(B)
The forecasting model can be represented as:
Yt = m + y1et-1 + y2et-2 + ... + et
Given the historical information available at the end of estimation period N,
HN = (Y-p+1,...,Y-1,Y0,Y1,...,YN; e-q+1,...,e-1,e0,e1,...,eN)
One-step ahead forecast is the conditional expectation of YN+1:
YN(1) = E(YN+1|HN) = m + y1eN + y2eN-1 + ...
Compared with the observed YN+1 which is:
YN+1 = m + eN+1 + y1eN + y2eN-1 + ...
One-step ahead forecast error is defined by:
eN(1) = YN+1-YN(1) = eN+1
E(eN(1)) = 0
sN2(1) =
Var(eN(1)) =
Var(eN+1) =
s2
Similarly, two-step ahead forecast is the conditional expectation of YN+2:
YN(2) = E(YN+2|HN) = m + y2eN + y3eN-1 + ...
Compared with the observed YN+2 which is:
YN+2 = m + eN+2 + y1eN+1 + y2eN + ...
Two-step ahead forecast error is defined by:
eN(2) = YN+2-YN(2) = eN+2 + y1eN+1 = (1 + y1B)eN+2
E(eN(2)) = 0
sN2(2) =
Var(eN(2)) =
Var((1+y1B)eN+2) =
(1+y12)s2 >
sN2(1)
f-step ahead forecast is the conditional expectation of YN+f:
| YN(f) = E(YN+f|HN) = | m + yfeN + yf+1eN-1 + ... |
| m, as f ® ¥ |
Compared with the observed YN+f which is:
YN+f = m + eN+f + y1eN+f-1 + y2eN+f-2 + ...
f-step ahead forecast error is defined by:
eN(f) = YN+f-YN(f) = (1+y1B+...+yf-1Bf-1)eN+f
E(eN(f)) = 0
sN2(f) =
Var(eN(f)) =
(1+y12+...+yf-12)s2
> ... > sN2(1)
In general, f+1-step ahead forecast is written as
YN(f+1) = E(YN+f+1|HN) = m + yf+1eN + yf+2eN-1 + ...
Compared with the f-step ahead forecast at N+1 (with the historical information HN+1 = (HN,YN+1,eN+1)):
YN+1(f) = E(YN+f+1|HN+1) = m + yfeN+1 + yf+1eN + ...
Then the forecast revision for YN+f+1 is defined by
YN+1(f) - YN(f+1) = yfeN+1 = yfeN(1)
Therefore, with additional information available at N+1, the f-step ahead forecast is just the (f+1)-step ahead forecast made at previous date N, adjusted for one-step forecast error eN(1) weighted by the error learning coefficient yf as follows:
YN+1(f) = YN(f+1) + yfeN(1)
Yt = g0 + g1 Yt-1 + g2 Yt-2 + ut
The alternative is to express the model (mean-deviation) residuals in terms of AR(2) structure:
Yt = m + et
et =
r1 et-1 +
r2 et-2 + ut
where ut ~ nii(0,s2).
We investigate the data series Yt for autocorrelations and partial autocorrelations. For univariate analysis, the ACF and PACF of the time series is the same as those of residuals obtained from the mean-deviation regression. Up to the maximum number of lags of ACF and PACF specified, Box-Pierece and Ljung-Box test statistics will be useful for identifying the proper order of AR(p), MA(q), or ARMA(p,q) process. In addition, Breusch-Pagan test may be used to verify the higher order of autocorrelation (Program).
where Zt is a deterministic fixed variable (e.g., trend, dummy, or step variable). The interpretation of the intervention variable Zt may be of interest.
Denote s the seasonal span (s=4 for quarterly data, s=12 for monthly data), then
Yt = (1-Bs)Zt
rs(Bs)Yt = d + qs(Bs)et
rs(Bs)r(B)Yt = d + qs(Bs)q(B)et
where b(B) = b0 + b1B + b2B2 + ... + bKBK. Model analysis including model identification, estimation, and forecasting is the same as (although more complicate than) the univariate ARMA analysis. Regression parameters bs and ARMA parameters rs and qs must be simultaneously estimated through iterations of nonlinear functional (sum-of-squares or log-likelihood) optimization. For statistical reference, the degrees of freedom must be adjusted.