Next: 4.2.6
Estimating exponential-decay time Up: 4.2
Applications Previous: 4.2.4
An example: Fitting
4.2.5 Maximum-likelihood estimation
in cases with correlated errors
Consider the Taylor-series expansion of W about the maximum set
of values for the parameters,
:
 |
(4.27) |
The second term on the right equals zero, because that is the condition
for the validity of the maximum-likelihood solution. The reason for expanding
W, rather than
,
is that the joint probability function can be approximated by a Gaussian
distribution function and (4.27) gives a generalized Gaussian form for
:
 |
(4.28) |
where A=exp
,
= (aj-aj*),
and
 |
(4.29) |
The matrix H is sometimes called the information matrix, and
its inverse is the covariance or error matrix discussed in Chapter 2.
For uncorrelated fluctuations, the mixed derivatives of the likelihood
function are zero because the variation of W with ak
is not a function of aj if the two parameters are independent.
In this case, the likelihood function reduces to the product of two independent
Gaussian functions. However, if the parameters are not independent, there
are terms in the resulting likelihood that depend on the mixed products
of the fluctuations; e.g.,
 |
(4.30) |
In all but degenerate cases, it is possible to transform the parameters
into new parameters that are independent (or to diagonalize the error matrix).
In the two-dimensional case, consider a rotation by an angle
so that new parameters y and z are formed from the old parameters
a1 and a2:
 |
(4.31) |
 |
(4.32) |
Contours of constant probability, in terms of the new variables, are
specified by the equation
![\begin{displaymath}~~~~~~~~~~ + (H_{12} + H_{21})[\cos\theta\sin\theta\deltay^2......\theta\delta y\delta z -\cos\theta\sin\theta\delta z^2] \ \ .\end{displaymath}](img155.gif) |
(4.33) |
This has the form of an ellipse, and an appropriate choice of
can give the standard form for an ellipse,
 |
(4.34) |
which does not involve cross-terms in y and z. The required
rotation can be found by setting the coefficient of the cross product in
(4.33)
to zero:
 |
(4.35) |
 |
(4.36) |
 |
(4.37) |
In the transformed variables,
 |
(4.38) |
and
 |
(4.39) |
 |
(4.40) |
The variables y and z are uncorrelated, so the appropriate
standard deviation in the initial parameters can be estimated from
 |
(4.41) |
 |
(4.42) |
 |
(4.43) |
The procedure can be generalized as follows. Write
= (
),
so that
 |
(4.44) |
Transform according to the rotation matrix U, to new parameters
:
 |
(4.45) |
Because the inverse of a rotation matrix is its transpose, (4.44)
can be written
 |
(4.46) |
 |
(4.47) |
where
 |
(4.48) |
For appropriate choice of U, h can be made a diagonal matrix. In that
case,
is the product of independent Gaussian distributions with variances specified
by the components h-1jj.
The variances in the initial variables
can then be found in terms of the (uncorrelated) variances in the variables
:
 |
(4.49) |
 |
(4.50) |
 |
(4.51) |
 |
(4.52) |
But, from (4.48),
 |
(4.53) |
so
 |
(4.54) |
This again shows the connection between H, the information matrix,
and the variances and covariances of the observations.
One approach to determining the errors in the multivariate case with
correlated errors is to determine the matrix H by finding the second
derivatives of W, then invert H to find the error matrix.
The elements of the error matrix then give the variances and covariances
for the parameters.
Next: 4.2.6
Estimating exponential-decay time Up: 4.2
Applications Previous: 4.2.4
An example: Fitting
NCAR Advanced Study Program
http://www.asp.ucar.edu