f(u) = (1/(2ps2))N/2
exp(-½ u'u/s2)
The joint likelihood of Y=[Y1,Y2,...,YN]' is f(Y) = f(u)|du/dY|, where |du/dY| is the Jacobian transformation. Then the log-likelihood of Y is written as
L = -N/2 ln(2p) -N/2 ln(s2) -½ u'u/s2 + ln(|du/dY|)
By concentrating on the maximum likelihood estimate of s2 = u'u/N, the concentrated log-likelihood function is
L* = -N/2 [ln(2p)-ln(N)+1] - N/2 ln(u'u) + ln(|du/dY|)
The Jacobian term |du/dY| is defined by
L* = -N/2 [ln(2p)-ln(N)+1] - N/2 ln(u*'u*)
where u* = u/(|du/dY|1/N) in which u is scaled by 1/(|du/dY|1/N).
Maximizing the log-likelihood function is the same as minimizing the sum of squares of scaled errors: u*'u*. Since u* depends on the parameters of error structure (r,q), in addition to the regression parameters (b,l,g), spatial model estimation is a nonlinear optimization problem which involves the evaluation of large Jacobian in the form of |I-aW| where a is l, r and/or q depending on model specification. The determinant of (I-aW) can be computed from the products of eigenvalues of (I-aW). That is, |I-aW| = (1-aw1)(1-aw2)...(1-awN) where w1,w2,...,wN are the N eigenvalues of the spatial weights matrix W. Note that W is row-standardized but non-symmetry). In addition, the estimation requires that the parameter a associated with the spatial weights matrix W satisfies the range condition: 1/wmin < a < 1/wmax. Furthermore, for a row-standardized W, wmax = 1.
For a spatial MA(1) or ARMA(1,1), the computation of (I-qW)-1 posts a challenging task since it is large in size and the result is not necessary sparse.