3. Methods

3.1. Filtering and de-noising methods based on wavelets

Most of the wavelet coefficients of the transform of a noiseless signal are close to zero. Therefore the most obvious way of filtering in the wavelet domain is to identify those wavelet coefficients that are significantly non-zero against the noisy background (Donoho & Johnstone 1994). This motivates the method of hard thresholding according to which only the significantly non-zero coefficients are retained and all others rejected:

where the w_m are wavelet coefficients of the noisy signal, the W_m are the filtered wavelet coefficients, is the estimated standard deviation of the noise at the appropriate scale m, and k is a freely selectable factor which defines the level of confidence for the preserved coefficients. Non-orthogonal wavelets reduce the with increasing m, and hence we have to estimate it for each scale separately (Starck & Bijaoui 1994).

We now briefly introduce the principles of some more sophisticated de-noising methods with which we later compare our own method. The first method was developed by Donoho (1992a,b). It is based on the so-called wavelet shrinkage technique. In addition to the application of a threshold criterion, the wavelet coefficients are shrunk towards zero. Donoho & Johnstone (1992, 1993, 1994) provide several algorithms based on statistical arguments to determine the level of shrinkage.

Figure 1: Two-level decomposition of a Stokes V spectrum with a noise level of . The corrupted signal is plotted on the left, the coefficients of the first level of decomposition are plotted in the central column. The coefficients of the wavelet transform of each of the scales in the central column are plotted on the right (second level of decomposition). The two-level decomposition reveals some of the uncorrupted signal at scales 13 to 15 which is absent at scale 1. Hence truncating scales 11 to 15, instead of scale 1, can recover features of the uncorrupted signal which are lost if scale 1 is truncated

Another set of noise-reduction methods have been proposed by Starck & Bijaoui (1994). They employ the so-called à trous algorithm to perform the wavelet transformation based on linear or cubic spline functions. They apply different filter criteria which include information on wavelet coefficients of different scales (hierarchical filtering). In addition, they use an iterative reconstruction (adaptive filtering) to find a signal whose wavelet transform is as close as possible to the original set of thresholded coefficients (see also Starck & Murtagh 1994 and Starck et al. 1995 for further details).

The final noise reducing technique to which we compare our method was developed by Bury et al. (1996) and is also based on the à trous algorithm. They apply an iterative structure detection algorithm in the scale-space plane to extract the significant coefficients from the noisy wavelet transform.

In the next section we present a somewhat different approach to the problem of noise reduction using wavelet-based techniques. Instead of applying a suitable selection criterion to determining the significant coefficients of a certain scale we extract the underlying spectral features of a noisy scale by simply transforming it again using the same wavelet base (wavelet-packets analysis). The transformed signal is then de-noised by a simple hard thresholding criterion. Details are given in the next section.

3.2. De-noising using wavelet-packets

Noise is mainly concentrated at the finest scales of a signal transformed using non-orthogonal wavelets, i.e. the signal-to-noise ratio (SNR) decreases towards the finer scales. If the noise level of the original signal is sufficiently high, the finest scales are completely dominated by noise. As an example consider the noisy signal in the left part of Fig. 1 (click here) (details about the signal are given later). The wavelet coefficients of the noisy signal are plotted at the center, with frequency decreasing from top to bottom. The highest frequency coefficients, i.e. the top row, are completely dominated by noise. As we demonstrate later, however, there is a considerable signal hidden in that noise. Simply truncating the wavelet coefficients at these scales either rejects the real signal along with the noise or leaves so much noise that the signal is still not recognizable. This dilemma is resolved using the splitting algorithm of a wavelet-packets analysis (Wickerhauser 1991). Each scale (frequency) may itself be considered to be a noisy signal for which we wish to separate the noise from the signal. We do this by transforming (decomposing) the wavelet coefficients at each scale, again, exactly in the same manner as we transformed the original signal. The coefficients resulting from this second set of transformations are shown on the right side of Fig. 1 (click here): each scale of the singly transformed signal gives rise to (in this case) 5 scales in the doubly transformed or twice decomposed signal.

Figure 2: a) Finest scale of the non-orthogonal wavelet transform of a noisy signal (scale 1 of Fig. 1 (click here)). b) Same scale of the uncorrupted original signal. c) Reconstructed scale after a three-level-decomposition. Clearly there is little chance of recovering the uncorrupted pattern out of the noisy signal a) by simply truncating at a certain threshold value. In contrast the reconstructed scale c) is a good approximation of the uncorrupted scale b)

For greater clarity we number the scales of the different decompositions as follows: The scales after the first transformation are numbered 1, 2, ... 5 (from finest to coarsest). After the second level of decomposition we number them as follows: 11, 12, ... 15 (transforms of scale 1), ..., 51, 52, ... 55 (transforms of scale 5). The tree-like arrangement of the individual scales which results from a two-level decomposition of a Stokes V (i.e. circularly polarized) spectrum is illustrated in Fig. 1 (click here).

Consider scale 1 of Fig. 1 (click here). It is completely dominated by noise. Simple thresholding will set each coefficient to zero, because none lies significantly above the background noise. Consequently the restored signal will be considerably altered in the high frequency range, i.e. in the peaks of the spectral lines. After the second transformation the hidden signal has partially been separated from the noise and has become clearly visible at the scales 14 and 15 and can still be detected in scale 13, although just barely. If we now truncate all the wavelet coefficients of the twice decomposed signal under a hard threshold, then we can reduce the noise by almost the same amount as after the first transformation, while preserving much more of the uncorrupted signal, particularly at high frequencies. It is possible to recover even more of the original signal by transforming scales 11-55 once more (three-level decomposition). The degree to which a three-level decomposition can restore scale 1 is shown in Fig. 2 (click here). Note the remarkable similarity of the reconstructed coefficients (Fig. 2 (click here)c) to the original, uncorrupted coefficients (Fig. 2 (click here)b).

Due to its high redundancy the base of non-orthogonal wavelets has the highly desirable property that it allows the main features of the uncorrupted signal to be recognized even after several decompositions of the noisiest scales. So instead of truncating the coefficients of the finest scales directly it is worth transforming them once again and performing the truncation on the coefficients of the second (or third) decomposition level.

Generally, by truncating the coefficients of a higher level of decomposition, it is possible to restore features of the uncorrupted signal at scales of lower levels, which were originally drowned in the noise. This is especially effective for the finest scales whereas the coarser scales have a larger SNR and hence can be truncated directly at the first level of decomposition without affecting their information content significantly.

In the present paper we truncate the coefficients on the basis of a simple hard thresholding criterion. Since we are dealing with non-orthogonal wavelets, we need to know the standard deviation of noise at each scale . Due to the linearity of the wavelet transformation, i.e.
eqnarray330
this can be obtained from the transformation of a signal that consists purely of noise. Strictly speaking we do the following. First we determine the standard deviation of the noise of the corrupted signal (white Gaussian noise in the illustrated case). Then this value is used to create a signal of pure, e.g., white Gaussian noise to which our splitting scheme is applied. Finally, we get an estimate of the standard deviation of noise at scale m of the corrupted signal, called , by calculating the standard deviation of the pure noise, called , of that scale. Such a scheme has already been applied by Starck & Bijaoui (1994). We set the threshold value to . The factor k determines the confidence level of the preserved coefficients. In general we set k=3, which results in a level of confidence of 99.7%.

Figure 3: a) Uncorrupted Stokes V spectrum. b) Corrupted version with artificially added noise with . c) Fourier smoothed spectrum. It shows clearly the typical random oscillations within the low frequency parts of the spectrum. d) Wavelet-packets de-noised version, using a three-level decomposition. The continuum is now clean and even a number of small features of the original spectrum are at least qualitatively recovered

3.3. Data and noise model

We test the various methods on intensity (Stokes I) and net circular polarization spectra (Stokes V) of solar features recorded with the Fourier Transform Spectrometer (FTS) installed at the McMath Pierce facility at Kitt Peak. The data are ideal for testing various de-noising techniques for a number of reasons. They have a very low noise level (as low as in some parts of the spectrum; is the continuum intensity), so that de-noising techniques can be tested rigorously. They have a spectral resolving power of , so that the high frequencies present in the original solar spectrum have not been significantly suppressed by the instrument. They cover a broad wavelength range with different types of spectral features. Also, both Stokes I and Stokes V\ spectra were recorded. In the following we first concentrate on Stokes V since it shows higher frequencies and thus poses a larger challenge to de-noising techniques, particularly since the data are sampled close to the Nyquist frequency. The data are described in detail by Stenflo et al. (1984) and Solanki (1987).

We first corrupted the spectra by adding artificial white Gaussian noise whose standard deviation scales as the inverse of the square root of the reference spectrum intensity. Such a noise spectrum is a good representation of Poisson distributed photon noise in the limit of many photons (which is fulfilled by the present data). These noisy spectra then served as input data for the de-noising algorithms described in Sects. 3.1 (click here) and 3.2 (click here).

Up: Noise reduction in