next previous
Up: Minimum-component baselines: Fourier

2. A solution

A solution lies in taking the one-dimensional scan to pieces in a Fourier analysis. Fourier theory (e.g. Bracewell 1965) indicates that any continuous function may be represented as the sum of sines and cosines, i.e.
equation229
where the function F representing the phased amplitudes of the sinusoidal components of f is known as the Fourier Transform (FT). If the function is sampled N times at uniform intervals tex2html_wrap_inline1216 in the spatial (observed) frame, the total length in the t- direction is tex2html_wrap_inline1220, and the result is the continuous function multiplied by the ``comb" function, producing a function f'(t) which (with the interval in spatial frequency as tex2html_wrap_inline1224) may be represented (e.g. Gaskill 1978) either as a sum of sines and cosines
equation238
or as a cosine series
equation242
where amplitudes tex2html_wrap_inline1226 and phases tex2html_wrap_inline1228 are given by
eqnarray246
In the latter formulation, obtaining the FT produces - by virtue of the tex2html_wrap_inline1230 cyclic nature of sine and cosine - a ``Fourier-Transform plane" for f'(t) which shows the amplitudes mirror-imaged about zero frequency, with a sampling in spatial frequency at intervals of tex2html_wrap_inline1234 and a repetition of the pattern at intervals of tex2html_wrap_inline1236.

There are three criteria for successful discrete-sampling.

  1. The Nyquist criterion or Nyquist limit guarantees that there is no information at spatial frequencies above tex2html_wrap_inline1238. (Consider the silly case of a signal which is a spatial sine wave of wavelength tex2html_wrap_inline1240: sampling at intervals of tex2html_wrap_inline1242 finds points of identical amplitude and thus does not carry information on amplitude or phase of this spatial frequency.) Thus the sampling interval tex2html_wrap_inline1244 sets the highest spatial frequency tex2html_wrap_inline1246 which can be present; if higher frequencies are present in the data, this sampling rate loses them.

    At the same time, the sampling theorem (Whittaker 1915; Shannon 1949) indicates that any bandwidth-limited function can be specified exactly by regularly-sampled values provided that the sample interval does not exceed a critical length (which corresponds approximately to half the FWHM resolution), i.e. if the instrumental half-width is B, tex2html_wrap_inline1250 if tex2html_wrap_inline1252.

  2. To avoid any ambiguity - aliasing - in the reconstruction of the scan from its FT, the sampling interval must be small enough for the amplitude coefficients of components at frequencies as high as tex2html_wrap_inline1254 to be effectively zero. If tex2html_wrap_inline1256 for components of frequency this high, the positive high-frequency tail of the repeating tex2html_wrap_inline1258 tangles up with the negative tail of the symmetric function repeating about tex2html_wrap_inline1260 to produce an indeterminate transform.
  3. Any physical system is indeed band-pass limited (although noise added by the subsequent detector is not necessarily so), and is a low-pass filter; it is in the accurate assessment of the lowest spatial frequencies, the slowly-varying continuum, where our interest lies. The lowest frequencies which harmonic analysis can delineate are at tex2html_wrap_inline1262. Such low-frequency spatial components may be real as in the case of a stellar spectrum, or may be instrumental in origin as for sky scans with a single-beam radio telescope. In either case, to have any chance of distinguishing these from those of the signal, the scan length must exceed the width of single resolved features by a factor preferably tex2html_wrap_inline1264.

In practice most data satisfy these properties. By design, sampling is frequent enough to maintain resolution, to obtain spatial frequencies beyond those present in signal, and to avoid aliasing. By design we take spectra over ranges substantially greater than the width of the features. Consider for instance the FT of the spectra in the upper panels of Figs. 1 (click here) and 2 (click here), whose tex2html_wrap_inline1266 are shown in upper panels of Fig. 3 (click here) and Fig. 4 (click here). Only half the interval of the FT is shown in these plots, up to tex2html_wrap_inline1268; the amplitude of the components has drained away very satisfactorily. It is therefore appropriate to consider what can be done to such scans via harmonic analysis, filtering in the Fourier (frequency) plane, and reconstruction.

  figure287
Figure 3: Upper panel: The amplitudes of the Fourier components of the spectrum in Fig. 1, top. Central panel: The amplitudes of the Fourier components of the patched spectrum (with the least-squares approximation subtracted) of Fig. 1, middle. Note the change in scale, due to the removal of the massive low-frequency signal present in the lines. Lower panel: The same amplitude plane after application of a half-Gaussian taper of scale tex2html_wrap_inline1270 channels. This is the set of amplitudes used in the reverse FT which produces the baseline shown in Fig. 1, lower panel

  figure292
Figure 4: The FT amplitudes of the complete scan (upper panel), the patched scan (central panel), and the filtered result from which the baseline is constructed (lower panel), as described in the caption for Fig. 3

To fit a continuum or remove a baseline, one could in principle consider an optimum filter for removing the prevalent low-frequency components. The difficulty is the well-known property for the FT of a Gaussian to be a Gaussian in the Fourier plane; even unresolved lines thus have very low frequency components. The result is that the harmonic components which govern the baseline are also required for the signal. Indeed simply building a baseline by reverse-transforming the few lowest-frequency components from the transform of the spectrum will produce baselines with gross distortion in regions of signal due to the low-frequency signal components.

However, it is straightforward to ``patch out" regions of signal, as shown in the central panels of Figs. 1 (click here) and 2 (click here). A simple linear interpolation across suspect or manifestly ``bad" (line) regions is easily selected and for the purpose interactive graphics can be used. Such a process is remarkably stable against the choice of the extent of the patches, except for extreme curvature of the baseline.

The next step is to take the FT of this ``patched" array, and then to filter the low-frequency components from this. A formal method can be used, such as the filtering described by Martin (1959); but a simple and effective approach is to apply a savage half-Gaussian taper to the amplitude plane. (Any filter applied needs a taper to prevent ringing effects.) Finally the continuum is reconstructed with the inverse transform of the patched/filtered data.

The FT (amplitude) of the patched arrays are shown in the central panels of Figs. 3 (click here) and 4 (click here). The heavy half-Gaussian filter applied in this instance has a tex2html_wrap_inline1272 wave-numbers. The lower panels of Figs. 3 (click here) and 4 (click here) show the savagery of the filtering which, when the inverse transform is applied, yields the highly satisfactory continua in the lower panels of Figs. 1 (click here) and 2 (click here).

The cyclic nature of the FT technique always means that the ends of the scan must match, so that an overall slope must be removed at the outset. A simple least-squares fitting technique, or very heavy smoothing to estimate a zero-order approximation to the continuum, can be used for this purpose, as shown in the central panels of Figs. 1 (click here) and 2 (click here).

To summarize, the baseline-finding technique advocated here consists of

  1. patching: forming a ``baseline array" from the original data-series by patching across regions of the scan where signal is evident;
  2. end-matching: subtracting from this baseline array a first approximation to the patched scan, obtained with a linear fit, a very low-order polynomial or a heavy-smoothing estimate;
  3. Fourier Transforming the resultant baseline array;
  4. Removal of the high frequencies by applying a heavy-tapered multiplicative filter to taper off the higher-frequency Fourier amplitudes;
  5. Reverse-transforming using these minimum remaining components; and
  6. Gradient-restoration, by adding back in the first approximation (step 2) to the baseline.

next previous
Up: Minimum-component baselines: Fourier

Copyright by the European Southern Observatory (ESO)
web@ed-phys.fr