next previous
Up: The ATCA/VLA OH

6. Summary

We have given the results of a survey of the region tex2html_wrap_inline1857 and tex2html_wrap_inline1859 in the OH 1612.231 MHz maser line. The survey is complete for sources brighter than 500 mJy and 80% complete for sources brighter than 300 mJy. The absolute flux density limit is 160 mJy. We have found 307 compact OH-maser sources, 145 of which are new detections. The sources are mainly OH/IR stars, with a few related sources, like PPNe and supergiants. The sources have positions accurate to tex2html_wrap_inline1861, velocities accurate to 1 tex2html_wrap_inline1863 and flux densities accurate to 5%. For 201 sources, an associated IRAS point source is found. A special CLEANing method was developed to search a very large data set for spectral-line point sources.

Acknowledgements

The authors thank ATNF staff for valuable discussions about observations and reduction, especially Ron Ekers, Wim Brouw and Jim Caswell. MS thanks the NFRA for financial support, the ATNF for infinite hospitality, Richard Arnold for all his useful ideas and Laurens Smulders for many a statistical eye-opener. ML is supported by an ESA external fellowship. This research has made use of the Simbad database, operated at CDS, Strasbourg, France. All FFT computing in the reduction proces was done on the Cray-C98 of the National Facility for Supercomputing in Amsterdam.

Appendix A: Reduction method

A.1. Considerations

The reduction of the acquired data calls for a special strategy for a number of reasons. What we describe here is the process of searching the data for sources after the calibration described in Sect. 3.

truecm In creating this strategy the following facts have to be dealt with :

a)
The size of the dataset of 539 fields consisting of 512 spectral channels and a spatial extent of 42tex2html_wrap1977 tex2html_wrap_inline1869 42tex2html_wrap1979 with a FWHM of the synthesized beam of 6tex2html_wrap1981 is of the order of a few hundred GBytes when fully imaged.

b)
The synthesized beam pattern resulting from taking several short cuts spread over 12 h with a one-dimensional array has almost point-like sidelobes. The strength of the first sidelobe can be tex2html_wrap_inline187550%. For strong sources, the sidelobes can be higher than the detection level up to tex2html_wrap_inline1877 away from the real position. These sidelobes do not only create the possibility of obtaining wrong positions for sources, but also mask out fainter sources that by chance are in the same velocity channel. (See for instance spectrum #109 with sidelobes of #98.)

c)
Even after the editing mentioned in Sect. 3, radio-frequency interference (RFI) has an effect on images that is not easy to predict. It confuses standard astronomical data processing routines because it does not satisfy the general assumption of stationary sources.

truecm

Corresponding requirements were made for a possible reduction strategy:

a)
It has to be highly automated, fast and efficient in its use of disk space. We cannot make and store full spectral-line cubes nor inspect them by eye.

b)
It has to deal with sidelobes of sources that are not in the current field. Simply correlating peaks with the synthesized beam pattern does not enable us to discriminate between primary and secondary peaks, because of the way the visibility plane is sampled and weighted.

c)
It must be robust in its ability to discriminate between astronomical sources and RFI.

A.2. Assumptions

There are three major assumptions we make to justify our strategy.

Firstly, we assume the sources we are looking for are point sources. Secondly, we assume the whole region of the survey is covered entirely within the width of the primary beam out to the offset where the primary-beam response in the main lobe equals the response of the first primary-beam sidelobe (see the mark max in Fig. 2 (click here)a). It then follows that if we see the same spectrum (save a factor in flux density) at different positions in different fields, the brightest represents the real star. Thirdly, we assume the flux density of most of the stars we expect to find is too low for detection in a vector-averaged visibility spectrum with zero phase offset, so we do have to Fourier transform the visibility data and search in the image domain.

These three assumptions are all justified for the data we are presently discussing; there is only one exception to the first assumption in that source #153 (Table 1) is probably a nearby object and slightly resolved.

A.3. Method

The stategy developed is as follows:

1) MPFND

We Fourier transform one channel at a time, keeping it in memory. This image is then searched for its one highest peak value. If the shape of the peak is approximately gaussian (i.e. a point source) and above a certain detection level, we write its velocity and fitted spatial position to an output file, otherwise we discard it. The image is then discarded. This process is repeated for all channels and for a number of neighbouring fields. Each field has a separate output file.

2) Model correlation

The output files of different fields are then individually searched for detections at the same position in a number of neighbouring channels. These detections are then marked as models. If there are models in two (or more) different fields at the same velocity within a certain distance of each other, the brightest is assigned real and the others (assumed to be sidelobes) are shifted in position to the real position.

3) UVSUB

The consolidated point source models for each field are subtracted from the visibility data and the process starts again at step (1), now with a lower detection level and the new visibility data.

The whole cycle from step (1) to (3) is called a pass. This way we build the stellar spectra in subsequent levels and find fainter sources in channels where bright sources were found in the first passes.

 figure399

Fig. A1. Schematic representation of the reduction method. Every main cycle through MPFND, the model correlation and UVSUB is called one pass. At the end of the first pass (P2) UVLIN is applied. The input parameters at (1) and (2) can be adapted according to spectral features of the sources looked for and several features of the visibility data (see Sect. A4)

A.4. Implementation

1) MPFND

The main routine of the method is MPFND which does the imaging and the searching. It is a derivative of the existing MIRIAD imaging routine INVERT (version 18 nov. 94). This is the most time-consuming part of the reduction and this routine as well as UVSUB and UVLIN was run on a Cray-C98. The less time-consuming part of correlating the model files was done on a local workstation since it requires more interaction and does not act directly on the visibility data.

When imaging we used a natural weighting scheme of the visibilities, without any tapering, to get the maximum SNR. This does increase the sidelobe levels, but the method deals with that. The visibility data from the shortest baseline were always excluded from the imaging because of RFI.

The input at label (1) in Fig. A1 (click here) consists of the cell size, image size, detection level and the width of the boundary region of each image to exclude from the searching.

The detection levels in subsequent passes were decreased by tex2html_wrap_inline1881 starting at tex2html_wrap_inline1883 (tex2html_wrap_inline1885 is the noise level as theoretically calculated from the visibilities). In the last pass, for homogeneity, the detection level was fixed to 120 mJy for all fields. This level equals 3 to 4 tex2html_wrap_inline1887 for 90% of the fields (see Sect. 5.1). The levels were chosen to ensure firstly that all sources bright enough to influence polynomial fitting (see Sect. A5) are subtracted in the first and second passes and secondly that the flux densities of the sources found in a pass are comparable to the detection level of that pass. It is important to find stars in as early a pass as possible, because at lower levels there are more and more noise detections and the process of correlation (see below) becomes more and more time-consuming. This is the reason for not making six passes all with a detection level of 120 mJy, which in principle would yield the same result.

To increase the speed of the transforms in the first two passes the images were made with a cell size of 5tex2html_wrap1985 tex2html_wrap_inline1891 10tex2html_wrap1987. This causes some loss in SNR, but, since we only search for very bright sources in those passes, the SNR is still higher than 20. In the following passes cell sizes of tex2html_wrap_inline1895 5tex2html_wrap1991 were used. gif The size of the images was always 42tex2html_wrap1997 to ensure ample overlap between them. (This corresponds to an image size of tex2html_wrap_inline1907 and tex2html_wrap_inline1909 cells respectively.) The cell size was not optimized for all individual pointings but we chose to make the procedure as uniform as possible. To avoid the detection of Fourier transform errors, that are strongest at the boundaries of the images, the outer five cells were excluded from searching in the first two passes and in later passes the outer 10 cells, because at lower detection levels the imaging errors become relatively more important. After finding a peak in an image an area of tex2html_wrap_inline1911 pixels around it was fitted with a two-dimensional parabola (see Sect. 5.2). If this fit indicates a position that is more than one cell size away from the peak pixel this indicates a highly non-gaussian shape of the peak under consideration, since the peak of a gaussian is roughly parabolic. This can be safely used as a criterion for interference or other unwanted detections and such peaks were discarded.

2) Model correlation

The inputs at (2) in Fig. A1 (click here) are relatively complicated to determine. In all passes, peaks at different velocities are identified as coming from the same source when positional coincidence is smaller than 0.35 cell size. This value was found empirically to be smaller than the typical difference in position between noise or interference peaks, correlated in neighbouring channels and bigger than the scatter expected in the positions of a source found in different channels (Sect. 5.2). Depending on the detection level and the strength of the highest peak found in the pass the distance out to which sidelobes can be expected has to be set. For the first pass this is as much as tex2html_wrap_inline1913 in declination. gif After subtracting the brightest sources the distance quickly decreases to about tex2html_wrap_inline1915, so that sidelobes are found only in directly neighbouring fields. All 539 fields of the Bulge region were processed through each pass simultaneously, so that all models could be correlated optimally. Only detections in the outermost fields surrounding the whole region of 539 fields had to be checked for sidelobes from unobserved regions, since there the sidelobe detection assumption of complete coverage broke down. This was done by verifying the presence of a point source pattern by eye in the image plane at the detected channels.

Neighbouring channels are correlated by 16% after Hanning smoothing, which is enough to smear out a large number of noise peaks over two channels. Therefore, we required that detections be present at the same position in at least three channels (either directly neighbouring or within a velocity range reasonably expected from outflow velocities) for a detection to be trusted. At lower detection levels statistical considerations put a lower limit on the number of channels for a detection. There can be false detections, that is a number of detections at the same position that meet the requirements mentioned above but are purely a statistical coincidence. At the 5tex2html_wrap_inline1917 level the expectation value of the total number of false detections in the whole data set is far less than one when the signal is demanded to be detected in three neighbouring channels at the same position. For the 4tex2html_wrap_inline1919 level the expected number is close to one. If we allow the third detected channel to be within a velocity range of tex2html_wrap_inline1921 100 tex2html_wrap_inline1923 from the other two neighbouring channels, the expected number is as much as one hundred. Such false detections could be identified as a double-peaked source. Therefore, at detection levels lower than 5tex2html_wrap_inline1925, at least four detections, either in one peak over four neighbouring channels or in two separate peaks, are demanded at the same position. This gives an expected number of false detections of tex2html_wrap_inline1927 for detection levels tex2html_wrap_inline19294tex2html_wrap_inline1931 up to a few for the lowest detection level of 3tex2html_wrap_inline1933.

3) UVSUB

The flux densities of all the point source models were determined with UVFLUX from the actual data at the velocities and the fitted (and for sidelobes shifted) positions found in the model correlation. They were then subtracted from the visibilities with UVSUB, via a direct Fourier transform.

A.5. Further implementations

To remove RFI from the visibilities, an 11tex2html_wrap_inline1935-order polynomial fit (UVLIN, Sault 1994) to the real and imaginary parts of the visibilities was subtracted from the data between the second and third pass. By doing it at this stage, fitting bright OH sources was avoided, because all sources brighter than tex2html_wrap_inline19371 Jy had been subtracted. Fitting spectra of brighter sources could cause serious problems over the whole spectral band and especially at the edges, because too much intensity would be put in the higher order terms of the polynomial.

It should be noted that it is possible for a source to be wrongly identified as a sidelobe of another star. This is not a problem, since the model that is subtracted from the visibility data is correct if the correct flux density level is determined at the position of that other star. If a detection reoccurs after identifying it as a sidelobe and subtracting a corresponding point source model in the previous pass, then obviously the identification as a sidelobe was wrong and it should now be modelled as a real star. Therefore, we never subtracted the same model from the data twice.

Finally, we checked the spectra by eye, extracting them from the original data with UVSPEC (shown in Fig. A2 (click here)). This is not necessary in principle with data that contain only sources and random noise. However, in our data, RFI introduces too much correlated signal that masquerades as sources. After the visual inspection, the limiting flux density, corrected for primary beam attenuation, is found to be 160 mJy (see Figs. 4 (click here)b, f) or, in other words, tex2html_wrap_inline19394tex2html_wrap_inline1941.

A.6. Discussion

Essentially, our method is a modification of the traditional method of CLEAN (Högbom 1974), in the sense that models are subtracted from the data at successive flux density levels. It is, however, stripped of every performance that is superfluous for processing the present data. Since in the present observations the sky does not feature bright extended emission and we want to find only point sources, in principle one cycle is sufficient to find the model. This model is then subtracted with loop gain 1 from the ungridded visibilities (similar to the Cotton-Schwab CLEAN algorithm, Schwab 1984). Therefore RFI and boundary imaging errors do not influence MPFND as much as they do standard implementations of CLEANing. MPFND is a fast method because it does not perform any convolution in the image plane, which is not necessary when one assumes that one iteration and loop gain 1 are best to find the models. Models are subtracted directly from the complex visibilities. Together with the fact that no disk I/O is needed to store cubes, this reduces the time needed for finding and subtracting models by a factor of four. In various fields we tested that the results of MPFND are exactly the same as those of other CLEAN methods with appropriate input values.

truecm

Table 1. Compact OH-maser sources in the galactic Bulge region.

The columns of Table 1 contain the following information:

1
Sequence number (coincident with spectra in Fig. A2 (click here)).

2
Name in the OHtex2html_wrap_inline1943 convention.

3
Type indication

tex2html_wrap_inline1945
D = double-peaked spectrum
tex2html_wrap_inline1947
S = single-peaked spectrum
tex2html_wrap_inline1949
I = irregular spectrum.

4
Right ascension of the brightest peak for epoch J2000 (typical error tex2html_wrap_inline1951).

5
Declination of the brightest peak for epoch J2000 (typical error tex2html_wrap_inline1953).

6
Mean scatter of the position of the star in all channels where it was detected, in arcseconds on the sky (Eq.(1), Sect. 5.2).

7
Radial offset of the source from pointing centre.

8
Line-of-sight velocity with respect to the LSR of the blue-shifted (L) peak. For S and I spectra the velocity of the peak is always given as blue-shifted for reasons of tabulation (typical error 1 tex2html_wrap_inline1955).

9
Same for the red-shifted (H) peak (typical error 1 tex2html_wrap_inline1957).

10
Stellar velocity (typical error 1 tex2html_wrap_inline1959).


displaymath1961

11
Outflow velocity; zero for S and I sources (typical error 1 tex2html_wrap_inline1963).


displaymath1965

12
Flux density, corrected for primary-beam attenuation, of the blue-shifted (L) peak (typical error 5%).

13
Same for the red-shifted (H) peak (typical error 5%).

14
Empirical noise in all empty planes for the present field (Fig. 3 (click here), Sect. 5.1).

15
Reference for previous OH maser detection (see Table 2 and Sect. 5.6).

16
Nearest IRAS PSC position (Sect. 5.5)

17
Ratio between the size of the error ellipse of the nearest IRAS point source and the distance to nearest IRAS PSC position in the direction of the OH position (Sect. 5.5).

(The table is available in electronic form via anonymous ftp (ftp 130.79.128.5) or through the World Wide Web at http://cdsweb.u-strasbg.fr/Abstract.html).

Table 2. References for previous OH maser detections (Table 1, Col. 15)

01 te Lintel Hekkert et al. 1991

02 te Lintel Hekkert et al. 1989

03 Becker et al. 1992

04 Lindqvist et al. 1992

05 Blommaert et al. 1994

06 Bowers & Knapp 1989

07 David et al. 1993

08 van Langevelde et al. 1992

09 Braz & Epchtein 1983

Fig.A2. On the following pages the spectra for all the sources in Table 1 are shown. They are displayed with 50 tex2html_wrap_inline1967 (except #008, #200 that have outflow velocities higher than 50 tex2html_wrap_inline1969) on either side of the stellar velocity (if possible; for sources at the edge of the observed velocity range the unobserved part of the spectra is left blank, e.g. #179). The channel width used in the spectra is 1.45 tex2html_wrap_inline1971\ which is equal to the velocity resolution. Along the upper border of each spectrum the entry number of the source in Table 1 is given, along with its usual OHtex2html_wrap_inline1973 name, its identification as a D, S or I source (see Table 1, Col. 3) and the number of the reference in the case of previously known sources. The spectra are extracted from the original visibilities by the MIRIAD routine UVSPEC and the baselines are fitted with polynomial of order up to three. This means that interference still shows in the spectra. Also sidelobes from neighbouring stars are present in some spectra. If there is possible ambiguity in the interpretation of the spectra the detected peaks are marked with an asterisk; other peaks in such spectra are not necessarily connected to the same source. The spectra were left relatively unprocessed in order to give a fair view of the data quality. (This figure is included in the electronic version of the paper, available at . The individual spectra are available upon request in more workable ascii-format (e-mail: sevenste@strw.leidenuniv.nl)).

 figure464 figure469 figure472

Fig. A2. The spectra for all the sources in Table 1.

figure475  figure478 figure482

figure485 figure488 figure491

figure494 figure497


next previous
Up: The ATCA/VLA OH

Copyright by the European Southern Observatory (ESO)
web@ed-phys.fr