next previous
Up: Efficient implementations of

1. Introduction

 

The Backus-Gilbert (BG) method, Backus & Gilbert (1968), also known as the method of optimally localized averages, has been used successfully in seismology, Parker (1977), helioseismology, Christensen-Dalsgaard et al. (1990) and other areas that involve inverse problems, Craig & Brown (1986). The BG method belongs to a class of methods known as mollifier methods. The underlying idea in these methods is that if we are given m measurements tex2html_wrap_inline2477 which represent (noisy) values of m functionals tex2html_wrap_inline2481 on an unknown function f, i.e.,


 equation275
where tex2html_wrap_inline2485 represent the noise, then we estimate f in a given point tex2html_wrap_inline2489 by a linear combination of the measurements:
 equation279
Here, we have introduced the two m-vectors
eqnarray283
If the noise is unbiased and has covariance matrix tex2html_wrap_inline2493, then the variance tex2html_wrap_inline2495 of the error in tex2html_wrap_inline2497 due to the noise in tex2html_wrap_inline2499 is given by
equation285

The BG method is a particular technique for computing the coefficients tex2html_wrap_inline2501 explicitly. If we compare this approach with Tikhonov regularization of a discrete ill-posed problem tex2html_wrap_inline2503, whose solution is formally given by tex2html_wrap_inline2505 for some tex2html_wrap_inline2507, then we notice that tex2html_wrap_inline2509 in a mollifier method conceptually corresponds to a row in the matrix tex2html_wrap_inline2511.

If tex2html_wrap_inline2513 denotes the exact solution, then for any regularization method it is easy to see from Eq. (1 (click here)) that tex2html_wrap_inline2515 is given by
displaymath2517
The function
 equation293
is called the averaging kernel at tex2html_wrap_inline2519, and it shows how the noise-free part of the regularized estimate tex2html_wrap_inline2521 is obtained as an average of the exact solution tex2html_wrap_inline2523. Thus, tex2html_wrap_inline2525 determines the resolving power of the particular regularization method at tex2html_wrap_inline2527, and for tex2html_wrap_inline2529 to be meaningful the function tex2html_wrap_inline2531 should be peaked around tex2html_wrap_inline2533 and be normalized such that
 equation301
Hence, the averaging kernel tex2html_wrap_inline2535 often plays a role which is at least as important as the computed estimate tex2html_wrap_inline2537.

In the BG method the peakedness of tex2html_wrap_inline2539 is controlled by specifying a criteria function tex2html_wrap_inline2541 and requiring that the quantity
 equation308
be sufficiently small, subject to the constraint in Eq. (5 (click here)). Often, a regularization term is also added in Eq. (6 (click here)). A commonly used criteria function is tex2html_wrap_inline2543. Various other criteria functions are discussed in the original paper by Backus &\ Gilbert (1968), while the additional regularization term is not used there. See Hansen (1994) for an SVD-analysis and computational details.

The BG method is computationally very expensive: If n is the number of quadrature points used to evaluate the integrals then tex2html_wrap_inline2547 operations are required for each tex2html_wrap_inline2549. This is prohibitively expensive in large-scale applications. In, e.g., 2D rotational inversion in helioseismology m is of the order tex2html_wrap_inline2553 and n is of the order tex2html_wrap_inline2557.

Therefore, there is a need for computationally more efficient methods. One such method is subtractive optimally localized averages (SOLA), which has been discovered independently by several researchers; see, e.g., Louis & Maass (1990) and Pijpers &\ Thompson (1992). The idea in the SOLA method is, instead of minimizing (6 (click here)), to minimize the squared difference between the averaging kernel and some chosen target function tex2html_wrap_inline2559,
 equation324
and it is common practice to retain the constraint in Eq. (5 (click here)), see , Pijpers &\ Thompson (1992, 1994). The peakedness of tex2html_wrap_inline2561 is controlled by the choice of tex2html_wrap_inline2563, and therefore to compute tex2html_wrap_inline2565 one would choose a target function peaked around tex2html_wrap_inline2567. In practice it is often necessary to add a regularizing term, i.e., to replace Eq. (7 (click here)) with
 equation335
Here tex2html_wrap_inline2569 is the error covariance matrix of the measured data points tex2html_wrap_inline2571 and tex2html_wrap_inline2573 is a trade-off or regularization parameter whose value determines the relative importance of the two terms in Eq. (8 (click here)). Thus, tex2html_wrap_inline2575 controls the propagation of errors from the measurements to tex2html_wrap_inline2577.

The great advantage of the SOLA method compared to the BG method is that the tex2html_wrap_inline2579 coefficient matrix involved does not depend on tex2html_wrap_inline2581 and therefore its factorization, costing tex2html_wrap_inline2583 operations, is performed only once. The additional cost to compute tex2html_wrap_inline2585 is tex2html_wrap_inline2587 for each tex2html_wrap_inline2589.

In this paper we discuss two different approaches to implementing the SOLA method: The standard approach using a Lagrange multiplier, and a new approach that uses an orthogonal transformation to incorporate the constraint in Eq. (5 (click here)) and bidiagonalization algorithms to solve the resulting least squares problem. We demonstrate that the latter approach is more efficient when many values of the regularization parameter tex2html_wrap_inline2591 must be tried in order to find the optimal one. This is important, since the optimal tex2html_wrap_inline2593 is very rarely known in advance - instead it must be computed by means of parameter-choice methods, such as generalized cross validation or the L-curve criterion, that involve the optimization of a function that depends on tex2html_wrap_inline2595; see Hanke & Hansen (1993).

Our paper is organized as follows. In Sect. 2 (click here) we introduce our notation and present the algorithms. These algorithms are based on standard linear algebra routines which are available in most scientific software libraries, and in Sect. 3 (click here) we point to available Fortran software. The computational load in the algorithms, and the efficiency of our new approach, is discussed in Sect. 4 (click here). Finally, in Sect. 5 (click here) we illustrate our algorithms with an example from helioseismology.


next previous
Up: Efficient implementations of

Copyright by the European Southern Observatory (ESO)
web@ed-phys.fr