In order to quantitatively test the proposed ansatz, we have calculated the complete likelihood function for several experiments. Our aim will be to compare the true likelihoods to the approximation. Figures 2-5 summarize our comparisons with the Saskatoon and MAX data sets. In all cases, the complete likelihood functions have been calculated as outlined in Sect. 2 above.
The first comparison will be made to
the Saskatoon Q-band 1995 4-point and 10-point
differences (experimental information can be
found in Netterfield et al. 1997;
all relevant information concerning the
experiment can be found on the group's web
page;
for useful and detailed information on a number of experiments, see Caltech's
web page
).
This particular choice of window functions was arbitrary.
The approximation, applied using the constraints
(19) and (20), is shown in Figs. 2 and 3
as the dashed (red) curve. We see that it provides
a good representation of the complete likelihood functions,
traced by the solid (black) curves in each figure;
in fact, the fit is truly spectacular for
the 10-point difference.
Taking as a benchmark the rule-of-thumb that
1, 2 and 3
confidence intervals may be
estimated by
,
4 and 9,
respectively, we see that the approximation
reproduces almost perfectly all of these, and
more.
Consider now setting
and 48, for
the 4-point and 10-point differences, respectively,
and then adjusting
to the 68% confidence interval. In so doing, we obtain
the dot-dashed (blue) curves, which in fact are not too
bad in both cases. These values of
should
be compared to the values of
and 41 found previously
by adjusting to two confidence intervals.
Thus, we see that the effective
number of degrees-of-freedom describing these Saskatoon
likelihood functions is indeed
,
as we expected from
the above discussion.
Finally, the 3-dot-dashed (green)
curves show "2-winged''
Gaussians with separate positive- and negative-
going variances, sometimes employed in
traditional -analyses. This
is also a fare representation of the
two likelihood functions, although the proposed
ansatz does perform slightly better. We will return to this
point, but we should not be too surprised that
the Gaussian works reasonably well when, as
here,
becomes large (all the same, notice
that the curves are not symmetric and that
a single Gaussian, with a single
,
would not fare particularly well).
Comparison to the MAX experiment is
shown in Fig. 4 for the region ID (experimental details
can be found in Tanaka et al. 1996); we have combined
all three frequency channels to construct the complete likelihood
function. The scan strategy consisted in taking
single
differences aligned along a unidimensional scan.
Once again, the approximation, applied using Eqs. (19, 20), supplies
an excellent representation of the likelihood function,
down to values well below "
'' (0.01 of the peak).
The effective number of degrees-of-freedom is
,
demonstrating
again that
.
Here, the
difference is rather large, due to the significant overlap
between adjacent pixels along the scan, and we see that
the ansatz with
does not produce a good approximation.
Could there a way to proceed if only one confidence
interval is given? This would require a choice for
one of the parameters, say ,
based on some knowledge of the
scan strategy. We have just seen that for MAX
leads to a bad representation of the likelihood function.
One might be tempted to try instead
(scan length)/(beam FWHM) =8.8, which is in fact very
close to the best value of
found from adjusting
to two confidence intervals. Although this is successful
in this case, it is nevertheless
guess-work, the problem being that
it is really not clear if there is a unique rule for
judiciously choosing
.
For Saskatoon,
worked reasonably well, while here it
does not, something much less being required because
of the significant redundancy in the scan. We have found
that it is difficult to justify a priori
a general rule for choosing
when lacking two
confidence intervals. The most sure way of finding
the effective number of degrees-of-freedom to be used
in the ansatz remains the use of two confidence intervals, via Eqs. (19, 20).
A noteworthy aspect of this MAX likelihood function
is its asymmetry, i.e., it is manifestly non-Gaussian.
Even a "2-winged'' Gaussian is clearly a very bad
representation. As
the number of statistically independent elements
entering the power estimation increases, we should
expect the likelihood function to approach a Gaussian
distribution. The question is, what is meant by
statistically independent elements? It is
obviously not something like 2l+1, for MAX
covers multipoles near 100; rather, as we have
argued above, it is really the parameter which measures this, what we have been calling
the effective number of degrees-of-freedom. The
fact that
tells
us that the number of generalized pixels is
an upper limit to this number degrees-of-freedom
determining the non-Gaussian nature of the likelihood
function. We make the connection to the familiar 2l+1-rule
only when we have full-sky coverage and bands consisting
of single multipoles; then, the number of generalized
pixels defining each (single multipole) band corresponds
to 2l+1. In the general case,
it is more useful and correct to reason with the number of pixels
(really,
).
We may also conclude from this that although experiments
with relatively large sky coverage should provide
Gaussian likelihood functions on scales much smaller than
the survey area, band-power estimates on scales
approaching the survey area will always be non-Gaussian.
The proposed ansatz represents a substantial improvement
over either a single or "2-winged'' Gaussian in such
cases.
These comparisons focus on simple cases where the power over a single band defined by the observing strategy is to be estimated, although in the MAX case the analysis did include three frequency channels simultaneously. A more subtle test of the approximation is its extension to a power estimate over several bands defined by different window functions. Such is the situation presented by the five standard Saskatoon power bins. Each bin comprises several bands, of the type considered above, and the bin power is estimated using the joint likelihood of the contributing bands, including all band-band correlations. One could worry that the information carried by several bands might not be adequately incorporated by the two parameters of the ansatz.
In Fig. 5 we compare the approximation
to the likelihood function of a combination of
10-, 11- and 12-point differences. Included
are the K-band 1994 and Q-band 1994 and 1995
CAP data. The true likelihood function for this
bin is calculated from the complete covariance matrix
accounting for all correlations, and the approximation
was fit using two confidence intervals.
Even in this more complicated situation
we see that the ansatz continues to work quite well, once the appropriate best power estimate and
errors for the complete bin are used to
find
and
.
It is on the basis of such comparisons
that we believe the proposed ansatz and
method of application produces acceptable
likelihood functions. Besides the comparisons shown
here, we have also tested the approximation against
11 other complete likelihood functions, all kindly
provided by K. Ganga; these comparisons may be viewed on our
web page.
The approximation works well in all cases.
We emphasize again that
the particular value of the proposed ansatz resides in its
simplicity - we obtain
very good approximations with little effort.
Copyright The European Southern Observatory (ESO)