In this section, we formulate an implementation of conventional beamforming in the frequency domain. Next, we introduce the expression for cross-correlation beamforming and show that it is similar to cross-correlation and stacking of phase-adjusted recordings. We continue by stating the expressions for the array-response functions (ARFs). In the last subsection, these ARFs are used to explain the link between the spatial sampling characteristics of an array and its resolving power in terms of resolution and aliasing.
Conventional beamforming
We define d(x
(i), t) = [d(x
(1), t) d(x
(2), t)...d(x
(n), t)] as a two-dimensional data matrix with ns time samples and n receivers. Furthermore, (i) is a receiver index and x = [x
1
x
2
x
3] denotes a position vector of a receiver. Hence, d(x
(i),t) is the data vector recorded at location i. In the following, we assume the sensors to be placed on a flat surface, reducing the position vector to x = [x
1
x
2]. For further processing, we take the data to the temporal Fourier domain:
$$ d\left( \textbf{x}^{(i)},\omega\right)={\int}_{-\infty}^{\infty} d\left( \textbf{x}^{(i)},t\right) {e}^{-\iota \omega t} dt, $$
(1)
where ι denotes the imaginary unit, ω = 2 π
f is the angular frequency, and f is the frequency.
With conventional beamforming, the beampower P (as function of horizontal slowness p, backazimuth 𝜃, and angular frequency ω
o
) is computed using the following equation:
$$ P(p,\theta,\omega_{o})=\left|\sum\limits_{i=1}^{n} d\left( \textbf{x}^{(i)},\omega_{o}\right) {e}^{\iota \textbf{x}^{(i)} \textbf{k}^{T} }\right|^{2}, $$
(2)
where the wave vector k is defined as
$$ \textbf{k}=[k_{E}\,k_{N}]=\omega_{o} p [\sin(\theta)\,\cos(\theta)], $$
(3)
and k
E
and k
N
denote the wavenumber in the eastward and northward direction, respectively, and T denotes the transpose. Futhermore, ω
o
is the frequency for which the beampower is computed. Either (2) is computed for only this frequency sample or the beampower is averaged over a small frequency bin around ω
o
(see, e.g., Gal et al. 2014). Equation 3 states that a wavenumber vector is composed for each combination of slowness (here the reciprocal of horizontal apparent velocity) and backazimuth, which together define a plane wave. Equation 2 states that the delay times τ
(i) = x
(i)
k
T are computed for each wavenumber vector and each receiver. These delay times define—together with the frequency of interest—phase-shift terms that are applied to the data values d(x
(i),ω
o
). Finally, beampowers are obtained by summing the phase-shifted data over all receivers and by taking the squared norm of this sum.
Note that the sum in (2) is an approximation of the two-dimensional spatial Fourier Transform. For a regularly sampled array, the distance between the array elements dx determines the Nyquist wavenumber, i.e., the maximum wavenumber that can be mapped unambiguously: k
Nyq = π/(2d
x). For an irregularly sampled array of stations, the clear distinction between directly mapped wavenumbers and aliasing, vanishes.
Cross-correlation beamforming
With cross-correlation beamforming, the data are first cross-correlated for all possible receiver pairs. In the frequency domain, the cross-correlations can be written as
$$ c\left( \textbf{x}^{(i)},\textbf{x}^{(j)},\omega\right) = d\left( \textbf{x}^{(i)},\omega\right) \left\{d(\textbf{x}^{(j)},\omega)\right\}^{*}, $$
(4)
where ∗ denotes the complex conjugate. The correlation function c(x
(i),x
(j),ω) is only evaluated for i≠j, i.e., all auto-correlations are left out, since they do not contain directivity information. Consequently, for n receivers, we have q = n(n−1) receiver pairs (of which n(n−1)/2 are unique) and q corresponding cross-correlations remaining. These are concatenated to a two-dimensional matrix (in case of multiple frequencies): c(ω) = [c
(1)(ω) c
(2)(ω) ... c
(q)(ω)], where, e.g., c
(1)(ω) = d(x
(1),ω){d(x
(2),ω)}∗ and c
(q)(ω) = d(x
(n−1),ω){d(x
(n),ω)}∗. With CCBF, the beampower \(\check {P}\) is computed using the following equation:
$$ \check{P}(p,\theta,\omega_{o})=\left|\sum\limits_{k=1}^{q} c^{(k)}(\omega_{o}) {e}^{\iota \omega_{o} 2 {h}^{(k)} p\cos(\theta^{(k)}-\theta)}\right|, $$
(5)
where h
(k) are the half offsets (i.e., half of the receiver-pair separations), 𝜃
(k) are the receiver-pair azimuths, and (k) is a receiver-pair index. The derivation of the receiver-pair delay-time expression τ
(k)=2h
(k)(pcos(𝜃
(k)−𝜃)) can be found in the following subsection. The delay times are computed for each receiver-pair and for each plane-wave defined by p and 𝜃. These delay times define—together with the frequency of interest—phase-shift terms that are applied to the cross-correlated data. Finally, beampowers are obtained by summing the phase-shifted cross-correlations over all receivers and by taking the absolute value (complex modulus) of this sum.
Note that in (5), no squared norm is taken, like in (2), since the cross-correlation of the data already yields an energy measure. To obtain a power quantity, a division would need to be made by the duration of the input data. In (2) and (5), we do not explicitly write this division.
When (5) is averaged over a frequency band with varying signal strength over this bandwidth, it is advantageous to replace the cross-correlation by a spectrally normalized cross-correlation, e.g., the cross-coherence:
$$ h\left( \textbf{x}^{(i)},\textbf{x}^{(j)},\omega\right) = \frac{d\left( \textbf{x}^{(i)},\omega\right) \left\{d(\textbf{x}^{(j)},\omega)\right\}^{*}}{|d\left( \textbf{x}^{(i)},\omega\right)| |d\left( \textbf{x}^{(j)},\omega\right)|}. $$
(6)
CCBF (5) and the co-phase method (Posmentier and Herrmann 1971) are nearly identical. With co-phase, the exponential in (5) is replaced by a cosine. This cosine term is multiplied with a sum of the amplitude spectra at the different receivers. Equation 5, on the other hand, contains a multiplication of the amplitude spectra (which is hidden in the correlation function c
(k)(ω
o
)). The co-phase was written as a sum over a large frequency bandwidth, unlike a small frequency band, or a single frequency, like in (5). To assure equal contribution of the different frequencies, despite differences in amplitudes, with co-phase each monochromatic result is normalized with the sum of the amplitude spectra at the different receivers, instead of with the multiplication of the amplitude spectra, like in (6).
Correlation beamforming
Frankel et al. (1991) suggested to first apply time shifts to the data and subsequently to cross-correlate the data for all possible receiver pairs. If we additionally take the absolute value, their approach amounts—in the frequency domain—to
$$\begin{array}{@{}rcl@{}} \hat{P}(p,\theta,\omega)&=&\left|\sum\limits_{i=1}^{n} \sum\limits_{j=1}^{n} d\left( \textbf{x}^{(i)},\omega\right) {e}^{\iota (\textbf{x}^{(i)} \textbf{k}^{T}} \right.\\ &&\left. \left\{d(\textbf{x}^{(j)},\omega) {e}^{\iota (\textbf{x}^{(j)}) \textbf{k}^{T} } \right\}^{*}{\vphantom{\sum\limits_{i=1}^{n}}}\right|. \end{array} $$
(7)
Equation 7 can be re-organized to
$$\begin{array}{@{}rcl@{}} \hat{P}(p,\theta,\omega)&=&\left|\sum\limits_{i=1}^{n} \sum\limits_{j=1}^{n} d\left( \textbf{x}^{(i)},\omega\right) \left\{d\left( \textbf{x}^{(j)},\omega\right)\right\}^{*} \right. \\&&\left. {e}^{\iota \left( \textbf{x}^{(i)} - \textbf{x}^{(j)}\right) \textbf{k}^{T}} {\vphantom{\sum\limits_{j=1}^{n}}}\right|. \end{array} $$
(8)
Further, by rewriting the vector (x
(i) − x
(j)) to 2h
(ij)[ sin(𝜃
(ij)) cos(𝜃
(ij))], where 𝜃
(ij) is the receiver-pair azimuth and h
(ij) = |x
(i)−x
(j)|/2 is the half offset, we find
$$\begin{array}{@{}rcl@{}} \hat{P}(p,\theta,\omega) &=& \left|\sum\limits_{i=1}^{n} \sum\limits_{j=1}^{n} d\left( \textbf{x}^{(i)},\omega\right) \left\{d(\textbf{x}^{(j)},\omega)\right\}^{*} \right.\\&& \left. {e}^{\iota \omega 2{h}^{(ij)}p \left[ \sin\left( \theta^{(ij)}\right) \sin(\theta) + \cos\left( \theta^{(ij)}\right) \cos(\theta)\right]}{\vphantom{\sum\limits_{j=1}^{n}}} \right|. \\ \end{array} $$
(9)
Using the trigonometric product-to-sum identity:
$$ \sin(\theta^{(ij)}) \sin(\theta) + \cos(\theta^{(ij)}) \cos(\theta) = \cos(\theta^{(ij)}-\theta), $$
(10)
leaving out the auto-correlations and computing the beampower only for a single frequency or frequency bin, we find again (5). Hence, applying phase shifts prior to cross-correlation is similar to applying phase shifts after cross-correlation.
Applying (7), (8), or (9) for beamforming in small frequency bands, we call correlation beamforming (CBF). The only difference between CBF and CCBF is that auto-correlations are left out from the latter.
Note that Frankel et al. (1991) do not apply cross-correlations for all possible time lags. Instead, they multiply the time-shifted seismograms in the time domain, which operation corresponds to zero-lag cross-correlation. Hence, the name of their method: zero-lag cross-correlation (ZLCC). All dominant contributions are at the zero time lag. Therefore, in practice, it makes little difference whether the cross-correlation is extended to larger time lags or not. This is shown in Section 6.
Array response functions
To compare different beamforming approaches, we use the array-response functions (ARFs) (Birtill and Whiteway 1965). The ARF is the beampower for a given plane-wave model, array configuration and beamforming method. The ARFs can be computed prior to the actual installation of an array to assess the resolving power for the expected waveforms.
The directionality of a plane-wave wavefield is defined by k
S
= ω
o
p
S
[sin(𝜃
S
) cos(𝜃
S
)], where p
S
and 𝜃
S
are the slowness and backazimuth of the wavefield. The backazimuth is defined with respect to the center of gravity of the array. For an array of receivers located at x, the phase delay of an amplitude-normalized monochromatic plane wave at receiver i reads as \({e}^{-\iota \textbf {x}^{(i)} (\textbf {k}_{S})^{T} }\). Using the conventional beamforming (BF) beampower expression (2), the ARF is obtained by substituting the data values with the plane wavefield expression, yielding:
$$ P(p,\theta,\omega_{o};p_{S},\theta_{S})=\left|\sum\limits_{i=1}^{n} {e}^{\iota \textbf{x}^{(i)} (\textbf{k}-\textbf{k}_{S})^{T} }\right|^{2}. $$
(11)
For cross-correlated data, the (differential) amplitude-normalized monochromatic plane wavefield defined by p
S
and 𝜃
S
can be expressed as \({e}^{-\iota \omega _{o} 2 {h}^{(k)} p_{S}\cos (\theta ^{(k)} - \theta _{S})}\). By substituting this cross-correlated data model in (5), the cross-correlation-beamforming (CCBF) ARF is obtained:
$$\begin{array}{@{}rcl@{}} &&{\kern-.7pc}\check{P}(p,\theta,\omega_{o};p_{S},\theta_{S})=\\ &&{\kern.7pc} \left|\sum\limits_{k\,=\,1}^{q} {e}^{\iota \omega_{o} 2 {h}^{(k)}(p\cos(\theta^{(k)}\,-\,\theta)\! -\! p_{S}\cos(\theta^{(k)}\,-\,\theta_{S}) )}\right|. \end{array} $$
(12)
The ARF for CBF (9) is the same as (12), with the exception that the summation incorporates the auto-correlations. Hence, instead of summing over q = n(n−1) receiver pairs, the summation is extended to q = n
2 combinations.
Resolution and aliasing
In this section, we show an example of an ARF for cross-correlation beamforming. This ARF serves to illustrate the connection between the spatial element distribution and resolution and aliasing characteristics in the beampower domain.
From the ARCES array (Mykkeltveit et al. 1990) in northern Norway (Fig. 1), we select stations ARA1, ARA2, and ARB2. Figure 2a shows the station distribution. The array samples a distribution of (receiver-pair) azimuths and offsets, which determine the resolution and aliasing in the p−𝜃 domain for a given frequency. Each station pair samples one distance 2h
(k) in two directions 𝜃
(k) & 𝜃
(k)+180∘. Figure 2b displays the distribution of both parameters.
Figure 2c shows the CCBF ARF (12). As a source model, a plane wave is used with f=5 Hz, impinging the array from vertically below, that is with p
S
=0. The ARF is shown in a polar plot, where the slowness axis is along the radius and the backazimuth axis is along the circumference. In this and subsequent figures, the beampower is normalized using the maximum beampower within the plotted range. The ARF correcly maps a peak power at p=0. The power only slowly diminishes for larger slowness, which limits the resolution in the p−𝜃 plane.
The resolution is dependent on the maximum receiver offset (i.e., the aperture of the array). This offset is azimuth dependent (Fig. 2b) and hence also the resolution is azimuth dependent. This can be seen in Fig. 2c: the area where the power remains high is an ellipse rather than a circle, with the minor semi-axis corresponding to the direction with the largest offset. For simplicity, we approximate the resolution slowness as a backazimuth independent function:
$$ p_{Res}=1/(4 h_{max} f), $$
(13)
where 2h
m
a
x
is the largest offset within the array. With f=5 Hz and 2h
m
a
x
=0.3 km (Fig. 2b), this yields p
R
e
s
=0.33 s/km.
The aliasing is dependent on the minimum receiver offset, which also varies with azimuth. The aliasing is manifested in Fig. 2c by a repetition of beampower patterns: besides to the actual slowness, the plane wave is also erroneously mapped to higher slownesses. The Nyquist slowness, i.e., the slowness beyond which repetition occurs, we approximate with
$$ p_{Nyq}=1/(4 h_{min} f), $$
(14)
where h
m
i
n
is the smallest half-offset within the array. With f = 5 Hz and 2h
m
i
n
= 0.25 km we find p
N
y
q
= 0.4 s/km. In Fig. 2c, aliasing maxima occur at ∼2p
N
y
q
. As the aliasing, like the resolution, is in fact a direction dependent function, the aliasing artifacts appear precisely in between the azimuths that are sampled.
The Nyquist slowness is frequency dependent. Hence, if the signal is coherent over a band of frequencies, the mapping to the p−𝜃 domain can be improved by stacking the beampower over this band. Figure 2d shows the resulting beampower for stacking the ARFs over a frequency range from 3 to 7 Hz (with increments of 0.5 Hz). Over this band, the signal is assumed to be stable in directivity. For higher frequency, p
N
y
q
goes down (14) and the aliasing thus moves towards the center of the p−𝜃 domain. This frequency dependence of the aliasing leads to a smearing of the aliased beampowers. The maximum beampower related to the actual directivity of the signal is not frequency dependent. Consequently, stacking beampower over frequency leads to a power reduction of the aliasing artifacts and improves detectability of the actual directivity of the signal.
The p−𝜃 domain plots in Fig. 2c, d, and plots alike, could visually be improved by not plotting the power, but the power squared, or even higher powers of the power. Since \(10 \log _{10} \tilde {P}^{2}= 20 \log _{10} \tilde {P}\), the logarithm of the squared power difference between signal and artifacts becomes twice as big as that of the non-squared power difference. Figure 2e shows the power-squared version of 2e and the further reduction of artifacts can be appreciated here. Such non-linear improvement is worth considering when only the directivity of the dominant arrival is of interest. When the measurement contains wavefields with different directivity and amplitude, the non-linear enhancement will likewise amplify the primary beam with respect to the less energetic, but still physical, secondary beams. In the following, we will leave out such non-linear enhancements since they can equally be applied to enhance the output of BF, CBF, and CCBF.