Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

In this chapter, we introduce the full two-dimensional analysis of the interferometer response, without small-angle assumptions, and then investigate the small-field approximations that simplify the transformation from the measured visibility to the intensity distribution. There is a discussion of the relationship between the cross-correlation of the received signals and the cross power spectrum, which results from the Wiener–Khinchin relation and is fundamental to spectral line interferometry. An analysis of the basic response of the receiving system is also given. The appendix considers some approaches to the representation of noiselike signals, including the analytic signal, and truncation of the range of integration.

3.1 Fourier Transform Relationship Between Intensity and Visibility

3.1.1 General Case

We begin by deriving the relationship between intensity and visibility in a coordinate-free form and then show how the choice of a coordinate system results in an expression in the familiar form of the Fourier transform. Suppose that the antennas track the source under observation, which is the most common situation, and let the unit vector s 0 in Fig. 3.1 indicate the phase reference position introduced in Sect. 2.3 This position, sometimes also known as the phase-tracking center, becomes the center of the field to be imaged. For one polarization, an element of the source of solid angle d Ω at position \(\mathbf{s} = \mathbf{s}_{0} + \boldsymbol{\sigma }\) contributes a component of power \(\frac{1} {2}A(\boldsymbol{\sigma })I(\boldsymbol{\sigma })\varDelta \nu d\varOmega\) at each of the two antennas. Here, \(A(\boldsymbol{\sigma })\) is the effective collecting area of each antenna, \(I(\boldsymbol{\sigma })\) is the source intensity distribution as observed from the distance of the antennas, and Δ ν is the bandwidth of the receiving system. It is easily seen that this expression has the dimensions of power since the units of I are W m−2 Hz−1 sr−1. From the considerations outlined in the derivation of Eqs. (2.1) and (2.2), including the far-field condition for the source, the resulting component of the correlator output is proportional to the received power and to the fringe term cos(2π ν τ g ), where τ g is the geometric delay. The vector D λ will specify the baseline measured in wavelengths, and then \(\nu \tau _{g} = \mathbf{D}_{\lambda }\mathbf{\boldsymbol{\cdot }\,}\mathbf{s} = \mathbf{D}_{\lambda }\mathbf{\,\boldsymbol{\cdot }\,}(\mathbf{s}_{0} + \boldsymbol{\sigma }\)). Thus, the output from the correlator is represented by

Fig. 3.1
figure 1

Baseline and position vectors that specify the interferometer and the source. The source is represented by the outline on the celestial sphere.

$$\displaystyle\begin{array}{rcl} r(\mathbf{D}_{\lambda },\mathbf{s}_{0})& =& \varDelta \nu \int _{4\pi }A(\boldsymbol{\sigma })I(\boldsymbol{\sigma })\cos \left [2\pi \mathbf{D}_{\lambda }\mathbf{\,\boldsymbol{\cdot }\,}(\mathbf{s}_{0} + \boldsymbol{\sigma })\right ]d\varOmega \\ & =& \varDelta \nu \cos (2\pi \mathbf{D}_{\lambda }\mathbf{\,\boldsymbol{\cdot }\,}\mathbf{s}_{0})\int _{4\pi }A(\boldsymbol{\sigma })I(\boldsymbol{\sigma })\cos (2\pi \mathbf{D}_{\lambda }\mathbf{\,\boldsymbol{\cdot }\,}\boldsymbol{\sigma })\,d\varOmega \\ & & -\varDelta \nu \sin (2\pi \mathbf{D}_{\lambda }\mathbf{\,\boldsymbol{\cdot }\,}\mathbf{s}_{0})\int _{4\pi }A(\boldsymbol{\sigma })I(\boldsymbol{\sigma })\sin (2\pi \mathbf{D}_{\lambda }\mathbf{\,\boldsymbol{\cdot }\,}\boldsymbol{\sigma })\,d\varOmega \;.{}\end{array}$$
(3.1)

Note that the integration of the response to the element d Ω over the source in Eq. (3.1) requires the assumption that the source is spatially incoherent, that is, that the radiated waveforms from different elements d Ω are uncorrelated. This assumption is justified for essentially all cosmic radio sources. Spatial coherence is discussed further in Sect. 15.2 Let A 0 be the antenna collecting area in direction s 0 in which the beam is pointed. We introduce a normalized reception pattern \(A_{N}(\boldsymbol{\sigma }) = A(\boldsymbol{\sigma })/A_{0}\) and consider the modified intensity distribution \(A_{N}(\boldsymbol{\sigma })I(\boldsymbol{\sigma })\). Now we define the complex visibilityFootnote 1 as

$$\displaystyle{ \mathcal{V} = \vert \mathcal{V}\vert e^{\,j\phi _{v} } =\int _{4\pi }A_{N}(\boldsymbol{\sigma })I(\boldsymbol{\sigma })\,e^{-j2\pi \mathbf{D}_{\lambda }\mathbf{\,\boldsymbol{\cdot }\,}\boldsymbol{\sigma }}d\varOmega \;. }$$
(3.2)

Then by separating the real and imaginary parts, we obtain

$$\displaystyle\begin{array}{rcl} & & \int _{4\pi }A_{N}(\boldsymbol{\sigma })I(\boldsymbol{\sigma })\,\cos (2\pi \mathbf{D}_{\lambda }\mathbf{\,\boldsymbol{\cdot }\,}\boldsymbol{\sigma })\,d\varOmega = \vert \mathcal{V}\vert \cos \phi _{v}\;,{}\end{array}$$
(3.3)
$$\displaystyle\begin{array}{rcl} & & \int _{4\pi }A_{N}(\boldsymbol{\sigma })I(\boldsymbol{\sigma })\,\sin (2\pi \mathbf{D}_{\lambda }\mathbf{\,\boldsymbol{\cdot }\,}\boldsymbol{\sigma })\,d\varOmega = -\vert \mathcal{V}\vert \sin \phi _{v}\;,{}\end{array}$$
(3.4)

and from Eq. (3.1)

$$\displaystyle{ r(\mathbf{D}_{\lambda },\mathbf{s}_{0}) = A_{0}\varDelta \nu \vert \mathcal{V}\vert \cos (2\pi \mathbf{D}_{\lambda }\mathbf{\,\boldsymbol{\cdot }\,}\mathbf{s}_{0} -\phi _{v})\;. }$$
(3.5)

Thus, the output of the correlator can be expressed in terms of a fringe pattern corresponding to that for a hypothetical point source in the direction s 0, which is the phase reference position. As noted earlier, this is usually the center or nominal position of the source to be imaged. The modulus and phase of \(\mathcal{V}\) are equal to the amplitude and phase of the fringes; the phase is measured relative to the fringe phase for the hypothetical source. As defined above, \(\mathcal{V}\) has the dimensions of flux density (W m−2 Hz−1), which is consistent with its Fourier transform relationship with I. Some authors have defined visibility as a normalized, dimensionless quantity, in which case it is necessary to reintroduce the intensity scale in the resulting image. Note that the bandwidth has been assumed to be small compared with the center frequency in deriving Eq. (3.5).

In introducing a coordinate system, the geometry we now consider is illustrated in Fig. 3.2. The two antennas track the center of the field to be imaged. They are assumed to be identical, but if they differ, \(A_{N}(\boldsymbol{\sigma })\) is the geometric mean of the beam patterns of the two antennas. The magnitude of the baseline vector is measured in wavelengths at the center frequency of the observing band, and the baseline has components (u, v, w) in a right-handed coordinate system, where u and v are measured in a plane normal to the direction of the phase reference position. The spacing component v is measured toward the north as defined by the plane through the origin, the source, and the pole, and u is measured toward the east. The component w is measured in the direction s 0, which is the phase reference position. On Fourier transformation, the phase reference position becomes the origin of the derived intensity distribution I(l, m), where l and m are direction cosines measured with respect to the axes u and v. In terms of these coordinates, we find

$$\displaystyle\begin{array}{rcl} & & \mathbf{D}_{\lambda }\mathbf{\,\boldsymbol{\cdot }\,}\mathbf{s}_{0} = w \\ & & \mathbf{D}_{\lambda }\mathbf{\,\boldsymbol{\cdot }\,}\mathbf{s} = \left (ul + vm + w\sqrt{1 - l^{2 } - m^{2}}\right ) \\ & & d\varOmega = \frac{dl\,dm} {\sqrt{1 - l^{2 } - m^{2}}}\;, {}\end{array}$$
(3.6)

where \(\sqrt{1 - l^{2 } - m^{2}}\) is equal to the third direction cosine n measured with respect to the w axis.Footnote 2 Note also that \(\mathbf{D}_{\lambda }\mathbf{\,\boldsymbol{\cdot }\,}\boldsymbol{\sigma } = \mathbf{D}_{\lambda }\mathbf{\,\boldsymbol{\cdot }\,}\mathbf{s} -\mathbf{D}_{\lambda }\mathbf{\,\boldsymbol{\cdot }\,}\mathbf{s}_{0}\). Thus, from Eq. (3.2):

Fig. 3.2
figure 2

Geometric relationship between a source under observation I(l, m) and an interferometer or one antenna pair of an array. The antenna baseline vector, measured in wavelengths, has length D λ and components (u, v, w).

$$\displaystyle\begin{array}{rcl} \mathcal{V}(u,v,w)& =& \int _{-\infty }^{\infty }\int _{ -\infty }^{\infty }A_{ N}(l,m)I(l,m) \\ & & \quad \times \exp \left \{-j2\pi \left [ul\,+\,vm\,+\,w\left (\hspace{-2.0pt}\sqrt{1 - l^{2 } - m^{2}} - 1\right )\right ]\right \} \frac{dl\,dm} {\sqrt{1 - l^{2 } - m^{2}}}\;.{}\end{array}$$
(3.7)

A factor e j2π w on the right side in Eq. (3.7) results from the measurement of angular position with respect to the w axis. For a source on the w axis, l = m = 0, and the argument of the exponential term in Eq. (3.7) is zero. For any other source, the fringe phase is measured relative to that for a source on the w axis, which is the phase reference position, s 0 in Fig. 3.2. The function A N I in Eq. (3.7) is zero for l 2 + m 2 ≥ 1, and in practice, it usually falls to very low values for directions outside the field to be imaged, as a result of the antenna beam pattern, the bandwidth pattern, or the finite size of the source. Thus, we can extend the limits of integration to ±. Note, however, that Eq. (3.7) requires no small-angle assumptions. The reason why we use direction cosines rather than a linear measure of angle in interferometer theory is that they occur in the exponential term of this relationship.

The coordinate system (l, m) defined above is a convenient one in which to present an intensity distribution. It corresponds to the projection of the celestial sphere onto a plane that is a tangent at the field center, as shown in Fig. 3.3. The distance of any point in the image from the (l, m) origin is proportional to the sine of the corresponding angle on the sky, so for small fields, distances on the image are closely proportional to the corresponding angles. The same relationship usually applies to the field of an optical telescope. For a detailed discussion of relationships on the celestial sphere and tangent planes, see König (1962).

Fig. 3.3
figure 3

Mapping of the celestial sphere onto an image plane, shown in one dimension. The position of the point P is measured in terms of the direction cosine m with respect to the v axis. When projected onto a plane surface with a scale linear in m, P appears at P′ at a distance from the field center C proportional to sinψ.

If all the measurements could be made with the antennas in a plane normal to the w direction so that w = 0, Eq. (3.7) would reduce to an exact two-dimensional Fourier transform. In general, this is not possible, and we now consider ways in which the transform relationship can be applied. Recall first that the basis of the synthesis imaging process is the measurement of \(\mathcal{V}\) over a wide range of u and v. For a ground-based array, this can be achieved by varying the length and direction of the antenna spacing and also by tracking the field-center position as the Earth rotates. The rotation causes the projection of D λ to move across the (u, v) plane, and an observation may last for 6–12 h. As the Earth’s rotation carries the antennas through space, the baseline vector remains in a plane only if D λ has no component parallel to the rotation axis, that is, the baseline is an east–west line on the Earth’s surface. In the general case, there is a three-dimensional distribution of the measurements of \(\mathcal{V}\). The simplest form of the transform relationship that can then be used is based on an approximation that is valid so long as the synthesized field is not too large. If l and m are small enough that the term

$$\displaystyle{ \left (\sqrt{(1 - l^{2 } - m^{2 } )} - 1\right )w \simeq -\frac{1} {2}(l^{2} + m^{2})w }$$
(3.8)

can be neglected, Eq. (3.7) becomes

$$\displaystyle{ \mathcal{V}(u,v,w) \simeq \mathcal{V}(u,v,0) =\int _{ -\infty }^{\infty }\int _{ -\infty }^{\infty }\frac{A_{N}(l,m)I(l,m)} {\sqrt{1 - l^{2 } - m^{2}}} e^{-j2\pi (ul+vm)}\,dl\,dm\;. }$$
(3.9)

Thus, for a restricted range of l and \(m,\mathcal{V}(u,v,w)\) is approximately independent of w, and for the inverse transform, we can write

$$\displaystyle{ \frac{A_{N}(l,m)I(l,m)} {\sqrt{1 - l^{2 } - m^{2}}} =\int _{ -\infty }^{\infty }\int _{ -\infty }^{\infty }\mathcal{V}(u,v)\,e^{\,j2\pi (ul+vm)}\,du\,dv\;. }$$
(3.10)

With this approximation, it is usual to omit the w dependence and write the visibility as the two-dimensional function \(\mathcal{V}(u,v)\). Note that the factor \(\sqrt{1 - l^{2 } - m^{2}}\) in Eqs. (3.9) and (3.10) can be subsumed into the function A N (l, m). Equation (3.10) is a form of the van Cittert–Zernike theorem, which originated in optics and is discussed in Sect. 15.1.1

The approximation in Eq. (3.9) introduces a phase error equal to 2π times the neglected term, that is, π(l 2 + m 2)w. Limitation of this error to some tolerable value places a restriction on the size of the synthesized field, which can be estimated approximately as follows. If the antennas track the source under observation down to low elevation angles, the values of w can approach the maximum spacings (D λ )max in the array, as shown in Fig. 3.4. Also, if the spatial frequencies measured are evenly distributed out to the maximum spacing, the synthesized beamwidth θ b is approximately equal to (D λ )max −1. Thus, the maximum phase error is approximately

$$\displaystyle{ \pi \left (\frac{\theta _{f}} {2}\right )^{2}\theta _{ b}^{-1}\;, }$$
(3.11)

where θ f is the width of the synthesized field. The condition that no phase errors can exceed, say, 0.1 rad then requires that

$$\displaystyle{ \theta _{f} <\frac{1} {3}\sqrt{\theta _{b}}\;, }$$
(3.12)

where the angles are measured in radians. For example, if θ b  = 1′ ′, θ f  < 2. 5 arcmin. Much synthesis imaging in astronomy is performed within this restriction, and ways of imaging larger fields will be discussed later.

Fig. 3.4
figure 4

When observations are made at a low angle of elevation and at an azimuth close to that of the baseline, the spacing component w becomes comparable to the baseline length D λ , which is measured in wavelengths.

3.1.2 East–West Linear Arrays

We now turn to the case of arrays with east–west spacings only and discuss further the conditions for which we can put w = 0, and the resulting effects. Let us first rotate the (u, v, w) coordinate system about the u axis until the w axis points toward the pole, as shown in Fig. 3.5. We indicate by primes the quantities measured in the rotated system. The (u′, v′) axes lie in a plane parallel to the Earth’s equator. The east–west antenna spacings contain components in this plane only (i.e., w′ = 0), and as the Earth rotates, the spacing vectors sweep out circles concentric with the (u′, v′) origin. From Eq. (3.7), we can write

$$\displaystyle{ \mathcal{V}(u',v') =\int _{ -\infty }^{\infty }\int _{ -\infty }^{\infty }A_{ N}(l',m')I(l',m')\,e^{-j2\pi (u'l'+v'm')} \frac{dl'dm'} {\sqrt{1 - l'^{2 } - m'^{2}}}\;, }$$
(3.13)

where (l′, m′) are direction cosines measured with respect to (u′, v′). Equation (3.13) holds for the whole hemisphere above the equatorial plane. The inverse transformation yields

$$\displaystyle{ \frac{A_{N}(l',m')I(l',m')} {\sqrt{1 - l'^{2 } - m'^{2}}} =\int _{ -\infty }^{\infty }\int _{ -\infty }^{\infty }\mathcal{V}(u',v')\,e^{\,j2\pi (u'l'+v'm')}du'dv'\;. }$$
(3.14)

In this imaging, the hemisphere is projected onto the tangent plane at the pole, as shown in Fig. 3.5. In practice, however, an image may be confined to a small area within the antenna beams. In the vicinity of such an area, centered at right ascension and declination (α  0, δ  0), angular distances in the image are compressed by a factor sin δ  0 in the m′ dimension. Also, in imaging the (α  0, δ  0) vicinity, it is convenient if the origin of the angular position variables is shifted to (α  0, δ  0). Expansion of the scale and shift of the origin can be accomplished by the coordinate transformation

$$\displaystyle{ l = l',\qquad \qquad m'' = (m' -\cos \delta _{\,0})\,\text{cosec}\,\delta _{\,0}\;. }$$
(3.15)
Fig. 3.5
figure 5

The (u′, v′, w′) coordinate system for an east–west array. The (u′, v′) plane is the equatorial plane and the antenna spacing vectors trace out arcs of concentric circles as the Earth rotates. Note that the directions of the u′ and v′ axes are chosen so that the v′ axis lies in the plane containing the pole, the observer, and the point under observation (α  0, δ  0). In Fourier transformation from the (u′, v′) to the (l′, m′) planes, the celestial hemisphere is imaged as a projection onto the tangent plane at the pole. The (u, v, w) coordinates for observation in the direction (α  0, δ  0) are also shown.

If we write F(l′, m′) for the left side of Eq. (3.14), then

$$\displaystyle{ F(l',m')\longleftrightarrow \mathcal{V}(u',v')\;, }$$
(3.16)

and

$$\displaystyle{ F\left [l',(m' -\cos \delta _{\,0})\,\text{cosec}\,\delta _{\,0}\right ]\longleftrightarrow \left \vert \sin \delta _{\,0}\right \vert \mathcal{V}(u',v'\sin \delta _{\,0})e^{-j2\pi v'\cos \delta _{\,0} }\;, }$$
(3.17)

where ↔ indicates Fourier transformation. Equation (3.17) follows from the behavior of Fourier pairs with change of variable and involves the shift and similarity properties of Fourier transforms (see Appendix 2.1). The coordinates (u′, v′sinδ  0) on the right side of Eq. (3.17) represent the projection of the equatorial plane onto the (u, v) plane, which is normal to the direction (α  0, δ  0). In the (u, v, w) system, u = u′ and v = v′sinδ  0. The coordinate w shown in Fig. 3.5 is equal to − v′cosδ  0. Thus, \(e^{-j2\pi v'\cos \delta _{\,0}}\) in Eq. (3.17) is the same factor that occurs in Eq. (3.7) as a result of the measurement of visibility phase relative to that for a point source in the w direction. Equation (3.14) now becomes

$$\displaystyle\begin{array}{rcl} \frac{A_{N}(l,m'')I(l,m'')} {\sqrt{1 - l^{2 } - m''^{2}}} & =& \int _{-\infty }^{\infty }\int _{ -\infty }^{\infty }\mathcal{V}(u',v'\sin \delta _{\, 0})\left \vert \sin \delta _{\,0}\right \vert e^{-j2\pi v'\cos \delta _{\,0} } \\ & & \quad \quad \times e^{\,j2\pi (u'l'+v'm')}du'dv' \\ & =& \int _{-\infty }^{\infty }\int _{ -\infty }^{\infty }\mathcal{V}(u,v)\,e^{\,j2\pi (ul+vm'')}du\,dv\;.{}\end{array}$$
(3.18)

A similar analysis is given by Brouw (1971).

The derivation of Eq. (3.18) from Eq. (3.14) involves a redefinition of the m coordinate but no approximations. Equation (3.18) is of the same form as Eq. (3.10), in which the term in Eq. (3.8) was neglected. Thus, if we apply the imaging scheme of Eq. (3.10), which is based on omitting this term, to observations made with an east–west array, the phase errors introduced distort the image in a way that corresponds exactly to the change of definition of the m variable to m″. Since m″ is derived from a direction cosine measured from the v′ axis in the equatorial plane, there is a progressive change in the north–south angular scale over the image. The factor cosec δ  0 in Eq. (3.15) establishes the correct angular scale at the center of the image, but this simple correction is acceptable only for small fields. The crucial point to note here is that when visibility data measured in a plane are projected into (u, v, w) coordinates, w is a linear function of u and v (and a linear function of v alone for east–west baselines). Hence, the phase error π(l 2 + m 2)w is linear in u and v. Phase errors of this kind have the effect of introducing position shifts in the resulting image, but there remains a one-to-one correspondence between points in the image and on the sky. The effect is simply to produce a predictable, and hence correctable, distortion of the coordinates.

It is clear from Fig. 3.5 that if all the measurements lie in the (u′, v′) plane, then the values of v in the (u, v) plane become seriously foreshortened for directions close to the celestial equator. Obtaining two-dimensional resolution in such directions requires components of antenna spacing parallel to the Earth’s axis. The design of such arrays is discussed in Chap. 5 The effect of the Earth’s rotation is then to distribute the measurements in (u, v, w) space so that they no longer lie in a plane, unless the observation is of short time duration. In some cases, the restriction of the synthesized field in Eq. (3.12) is acceptable. In other cases, it may be necessary to image the entire beam to avoid source confusion, and several techniques are possible based on the following approaches:

  1. 1.

    Equation (3.7) can be written in the form of a three-dimensional Fourier transform. The resulting intensity distribution is then taken from the surface of a unit sphere in (l, m, n) space.

  2. 2.

    Large images can be constructed as mosaics of smaller ones that individually comply with the field restriction for two-dimensional transformation. The centers of the individual images must be taken at tangent points on the same unit sphere referred to in 1.

  3. 3.

    Since in most terrestrial arrays the antennas are mounted on an approximately plane area of ground, measurements taken over a short time interval lie close to a plane in (u, v, w) space. It is therefore possible to analyze an observation lasting several hours as a series of short duration images, which are subsequently combined after adjustment of the coordinate scales.

Practical implementation of the three approaches outlined above requires the nonlinear deconvolution techniques described in Chap. 11 A more detailed discussion of the resulting methods is given in Sect. 11.7

3.2 Cross-Correlation and the Wiener–Khinchin Relation

The Fourier transform relationship between the power spectrum of a waveform and its autocorrelation function, the Wiener–Khinchin relation, is expressed in Eqs. (2.6) and (2.7). It is also useful to examine the corresponding relation for the cross-correlation function of two different waveforms. The response of a correlator, as used in a radio interferometer, can be written as

$$\displaystyle{ r(\tau ) =\lim _{T\rightarrow \infty } \frac{1} {2T}\int _{-T}^{T}V _{ 1}(t)V _{2}^{{\ast}}(t-\tau )\,dt\;, }$$
(3.19)

where the superscript asterisk indicates the complex conjugate. In practice, the correlation is measured for a finite time period 2T, which is usually a few seconds or minutes but is long compared with both the period and the reciprocal bandwidth of the waveforms. The factor 1∕2T is sometimes omitted, but for the waveforms considered here, it is required to obtain convergence. Cross-correlation is represented by the pentagram symbol (⋆):

$$\displaystyle{ V _{1}(t) \star V _{2}(t) =\lim _{T\rightarrow \infty } \frac{1} {2T}\int _{-T}^{T}V _{ 1}(t)V _{2}^{{\ast}}(t-\tau )\,dt\;. }$$
(3.20)

This integral can be expressed as a convolution in the following way:

$$\displaystyle{ V _{1}(t) \star V _{2}(t) =\lim _{T\rightarrow \infty } \frac{1} {2T}\int _{-\infty }^{\infty }V _{ 1}(t)V _{2-}^{{\ast}}(\tau -t)\,dt = V _{ 1}(t) {\ast} V _{2-}^{{\ast}}(t)\;, }$$
(3.21)

where V 2−(t) = V 2(−t). Now the ν, t Fourier transforms are as followsFootnote 3: \(V _{1}(t)\longleftrightarrow \widehat{V }_{1}(\nu )\), \(V _{2}(t)\longleftrightarrow \widehat{V }_{2}(\nu )\), and \(V _{2-}^{{\ast}}(t)\longleftrightarrow \widehat{V }_{2}^{{\ast}}(\nu )\). Then from the convolution theorem,

$$\displaystyle{ V _{1}(t) \star V _{2}(t)\longleftrightarrow \widehat{V }_{1}(\nu )\widehat{V }_{2}^{{\ast}}(\nu )\;. }$$
(3.22)

The right side of Eq. (3.22) is known as the cross power spectrum of V 1(t) and V 2(t). The cross power spectrum is a function of frequency, and we see that it is the Fourier transform of the cross-correlation, which is a function of τ. This is a useful result, and in the case where V 1 = V 2, it becomes the Wiener–Khinchin relation. The relationship expressed in Eq. (3.22) is the basis of cross-correlation spectrometry, described in Sect. 8.8.2

3.3 Basic Response of the Receiving System

From a mathematical viewpoint, the basic components of the interferometer receiving system are the antennas that transform the incident electric fields into voltage waveforms, the filters that select the frequency components to be processed, and the correlator that forms the averaged product of the signals. In the filter and the correlator, the signals may be in either analog or digital form. These components are shown in Fig. 3.6. Most other effects can be represented by multiplicative gain constants, which we shall ignore here, or as variations of the frequency response that can be subsumed into the expressions for the filters. Thus, we assume that the frequency response of the antennas and the strength of the received signal are effectively constant over the filter passband, which is realistic for many continuum observations.

Fig. 3.6
figure 6

Basic components of the receiving system of a two-element interferometer.

3.3.1 Antennas

In order to consider the responses of the two antennas independently, we should introduce their voltage reception patterns, since the correlator responds to the product of the signal voltages. The voltage reception pattern of an antenna V A (l, m) has the dimension length and responds to the electric field specified in volts per meter. V A (l, m) is the Fourier transform of the field distribution in the aperture \(\overline{\mathcal{E}}(X,Y )\), as shown in Sect. 15.1.2 X and Y are coordinates of position within the antenna aperture. Omitting constant factors, we can write

$$\displaystyle{ V _{A}(l,m) \propto \int \int _{-\infty }^{\infty }\overline{\mathcal{E}}(X,Y )\,e^{\,j2\pi [(X/\lambda )l+(Y/\lambda )m]}dX\,dY \;, }$$
(3.23)

where λ is the wavelength. In applying Eq. (3.23), X and Y are measured from the center of each antenna aperture. The power reception pattern is proportional to the squared modulus of the voltage reception pattern. V A (l, m) is a complex quantity, and it represents the phase of the radio frequency voltage at the antenna terminals as well as the amplitude. For an interferometer (with antennas denoted by subscripts 1 and 2), the response is proportional to V A1 V A2 , which is purely real if the antennas are identical. For each antenna, the collecting area A(l, m) is a real quantity. In practice, it is usual to specify the antenna response in terms of A(l, m) and to replace V A (l, m) by \(\sqrt{ A(l,m)}\), which is proportional to the modulus of V A (l, m). Any phase introduced by differences between the antennas is ignored in this analysis but in effect is combined with the phase responses of the amplifiers, filters, transmission lines, and other elements that make up the signal path to the correlator input. The overall instrumental response of the interferometer in both phase and amplitude is calibrated by observing an unresolved source of known position and flux density.

For the case in which the antennas track the source, both the antenna beam center and the center of the source are at the (l, m) origin. If E(l, m) is the incident field, the output voltage of an antenna can be written (omitting constant gain factors) as

$$\displaystyle{ \widehat{V } =\int \int _{ -\infty }^{\infty }E(l,m)\sqrt{A(l, m)}\,dl\,dm\;. }$$
(3.24)

If the antennas do not track the source, a convolution relationship of the form shown in Eq. (2.15) applies.

3.3.2 Filters

The filters in Fig. 3.6 will be regarded as a representation of the overall effect of components that determine the frequency response of the receiving channels, including amplifiers, cables, filters, and other components. The frequency response of a filter will be represented by H(ν), which can also be called the bandpass function. The output of the filter \(\widehat{V }_{c}(\nu )\) is related to the input \(\widehat{V }(\nu )\) by

$$\displaystyle{ \widehat{V }_{c}(\nu ) = H(\nu )\widehat{V }(\nu )\;. }$$
(3.25)

The Fourier transform of H(ν) with respect to time and frequency is the impulse response of the filter h(t), which is the response to a voltage impulse δ(t) at the input. Thus, in the time domain, the corresponding expression to Eq. (3.25) is

$$\displaystyle{ V _{c}(t) =\int _{ -\infty }^{\infty }h(t')V (t - t')\,dt' = h(t) {\ast} V (t)\;, }$$
(3.26)

where the centerline asterisk represents convolution. In specifying filters, it is usual to use the frequency response rather than the impulse response because the former is more directly related to the properties of interest in a receiving system and is usually easier to measure.

3.3.3 Correlator

The correlatorFootnote 4 produces the cross-correlation of the two voltages fed to it. If V 1(t) and V 2(t) are the input voltages, the correlator output is

$$\displaystyle{ r(\tau ) =\lim _{T\rightarrow \infty } \frac{1} {2T}\int _{-T}^{T}V _{ 1}(t)V _{2}^{{\ast}}(t-\tau )\,dt\;, }$$
(3.27)

where τ is the time by which voltage V 2 is delayed with respect to voltage V 1. For continuum observations, τ is maintained small or zero. The functions V 1 and V 2 that represent the signals may be complex. The output of a single multiplying device is a real voltage or number. To obtain the complex cross-correlation, which represents both the amplitude and the phase of the visibility, one can record the fringe oscillations and measure their phase, or use a complex correlator that contains two multiplying circuits, as described in Sect. 6.1.7 As follows from Eqs. (3.20) and (3.22), the Fourier transform of r(τ) is the cross power spectrum, which is required in observations of spectral lines. This can be obtained by inserting a series of instrumental delays in the signal to determine the cross-correlation as a function of τ, as described in Sect. 8.8.3

3.3.4 Response to the Incident Radiation

We use subscripts 1 and 2 to indicate the two antennas and receiving channels as in Fig. 3.6. The response of antenna 1 to the signal field E(l, m) given by Eq. (3.24) is the voltage spectrum \(\widehat{V }(\nu )\). We multiply this by H(ν) to obtain the signal at the output of the filter, and then take the Fourier transform to go from the frequency to the time domain. Thus

$$\displaystyle{ V _{c1}(t) =\int _{ -\infty }^{\infty }\int _{ -\infty }^{\infty }\int _{ -\infty }^{\infty }E(l,m)\sqrt{A_{ 1}(l,m)}H_{1}(\nu )e^{\,j2\pi \nu t}\,dl\,dm\,d\nu \;. }$$
(3.28)

A similar expression can be written for the signal V c2(t) from antenna 2, and the output of the correlator is obtained from Eq. (3.27). Note also that if the radiation were to have some degree of spatial coherence, we should integrate over (l, m) independently for each antenna (Swenson and Mathur 1968), but here we make the usual assumption of incoherence. Thus, the correlator output is

$$\displaystyle\begin{array}{rcl} r(\tau )& =& \lim _{T\rightarrow \infty } \frac{1} {2T}\int _{-\infty }^{\infty }\int _{ -\infty }^{\infty }\int _{ -\infty }^{\infty }\int _{ -\infty }^{\infty }E(l,m)E^{{\ast}}(l,m)\sqrt{A_{ 1}(l,m)A_{2}(l,m)} \\ & & \times \,H_{1}(\nu )H_{2}^{{\ast}}(\nu )e^{\,j2\pi \nu t}\,e^{-j2\pi \nu (t-\tau )}\,dl\,dm\,dt\,d\nu \\ & =& \int _{-\infty }^{\infty }\int _{ -\infty }^{\infty }\int _{ -\infty }^{\infty }I(l,m)\sqrt{A_{ 1}(l,m)A_{2}(l,m)}H_{1}(\nu )H_{2}^{{\ast}}(\nu )\,e^{\,j2\pi \nu \tau }\,dl\,dm\,d\nu \;.{}\end{array}$$
(3.29)

Here, we have replaced the squared field amplitude by the intensity I. The result is a very general one since the use of separate response functions A 1 and A 2 for the two antennas can accommodate different antenna designs, or different pointing offset errors, or both. Also, different frequency responses H 1 and H 2 are used. In the case in which the antennas and filters are identical, Eq. (3.29) becomes

$$\displaystyle{ r(\tau ) =\int _{ -\infty }^{\infty }\int _{ -\infty }^{\infty }\int _{ -\infty }^{\infty }I(l,m)A(l,m)\vert H(\nu )\vert ^{2}e^{\,j2\pi \nu \tau }\,dl\,dm\,d\nu \;. }$$
(3.30)

The result is a function of the delay τ of the signal V c2(t) with respect to V c1(t). The geometric component of the delay is generally compensated by an adjustable instrumental delay (discussed in Chaps. 6 and 7), so that τ = 0 for radiation from the direction of the (l, m) origin. For a wavefront incident from the direction (l, m), the difference in propagation times through the two antennas to the correlator results from a difference in path lengths of (ul + vm) wavelengths, for the conditions indicated in Eqs. (3.8) and (3.9). The corresponding time difference is (ul + vm)∕ν. If we take as V 1 the signal from the antenna for which the path length is the greater (for positive l and m), then from Eq. (3.30), the correlator output becomes

$$\displaystyle{ r =\int _{ -\infty }^{\infty }\int _{ -\infty }^{\infty }\int _{ -\infty }^{\infty }I(l,m)A(l,m)\vert H(\nu )\vert ^{2}e^{-j2\pi (lu+mv)}dl\,dm\,d\nu \;. }$$
(3.31)

Equation (3.31) indicates that the correlator output measures the Fourier transform of the intensity distribution modified by the antenna pattern. Let us assume that, as is often the case, the intensity and the antenna pattern are constant over the bandpass range of the filters, and the width of the source is small compared with the antenna beam. The correlator output then becomes

$$\displaystyle\begin{array}{rcl} r& =& \int _{-\infty }^{\infty }\int _{ -\infty }^{\infty }I(l,m)A(l,m)e^{-j2\pi (lu+mv)}dl\,dm\int _{ -\infty }^{\infty }\vert H(\nu )\vert ^{2}d\nu \\ & =& A_{0}\mathcal{V}(u,v)\int _{-\infty }^{\infty }\vert H(\nu )\vert ^{2}d\nu \;, {}\end{array}$$
(3.32)

where A 0 is the collecting area of the antennas in the direction of the maximum beam response and \(\mathcal{V}\) is the visibility as in Eq. (3.2). The filter response H(ν) is a dimensionless (gain) quantity. If the filter response is essentially constant over a bandwidth Δ ν, Eq. (3.32) becomes

$$\displaystyle{ r = A_{0}\mathcal{V}(u,v)\varDelta \nu \;. }$$
(3.33)

\(\mathcal{V}(u,v)\) has units of W m−2 Hz−1, A 0 has units of m2, and Δ ν has units of Hz. This is consistent with r, the output of the correlator, which is proportional to the correlated component of the received power.