Keywords

1 Introduction

Multiple-input multiple-output (MIMO) radar is a relatively new concept for radar system. By exploiting multiple antennas in both transmit and receive end, the extra visual antenna aperture is formed. The visual aperture makes the performance of MIMO radar better than the traditional phased array radar [1]. Generally speaking, the array geometry in MIMO radar can be divided into two categories, the uniform array and the nonuniform array. Elements in the uniform array geometry must be spaced at intervals no larger than half wavelength of the carrier signal thus to avoid phase ambiguity. Typical uniform arrays including linear arrays, uniform circular array and L-shape or rectangle array. The nonuniform array setup is much more flexible than the uniform array configuration [2]. The minimum redundancy linear array and the random array are belong to this kind of array.

Direction-of-arrival (DOA) estimation is a fundamental problem in MIMO radar that has aroused extensive attention. Existing estimation algorithms including Capon [3], multiple signal classification (MUSIC) [4], the estimation method of signal parameters via rotational invariance techniques (ESPRIT) [5–7], the parallel factor analysis (PARAFAC) [8, 9]. However, the majority of the above algorithms are effectiveless with the nonuniform arrays. Algorithm Capon and MUSIC are effective with nonuniform configuration, they only perform well with large number of snapshot. Besides, additional prior information is needed in this algorithm, such as the number of targets, the noise level, et al. Recently, compressive sensing (CS) theory has attracted extensive attention in the field of array signal processing [10–12]. In this paper, we focus on the compressive sensing based MIMO (CS-MIMO) radar [13]. In CS-MIMO radar, transmit and receive elements are randomly placed over a large aperture and spatial sampling is applied at sub-Nyquist rate. The random array setup would achieve similar resolution with significantly fewer elements. In fact, the targets in CS-MIMO can be viewed as sparse in the background. Therefore, DOA estimation can be regarded as a sparse inverse problem from multiple measurement vectors (MMV) in CS. Actually, there are many excellent algorithms for the MMV problem, such as Basis Pursuit (BP) [14], Orthogonal Matching Pursuit (OMP) [15], FOCal Underdetermined System Solver (FOCUSS) [16], Sparse Bayesian Learning (SBL) [17]. The sparse inverse problem makes the DOA estimation more accurate than the traditional methods. Unfortunately, the computational complexity of BP algorithm is too large to engineering implementation. Both OMP and FOCUSS are sensitivity to noise, and it is hard for FOCUSS to choose a proper regularization parameter. SBL is a blind recovery algorithm that always achieves the sparsest global minima. The statistical model that SBL based provides a flexible framework to exploit special structures in the model [18], which may significantly improve the recovery performance. A SBL based DOA estimation algorithm has been proposed for CS-MIMO radar [19], but it suffers from high computational complexity, making it unsuitable for the radar system.

In this paper, a structural correlated subspace SBL algorithm is derived for DOA estimation in CS-MIMO radar. The problem of DOA estimation is formulated as a sparse inverse problem. A statistical model is present which utilize the intra- and the extra- information of the data. DOA estimation is linked to parameters iteration in the SBL model. To speed up the learning process, the subspace-based sparse Bayesian learning (S-SBL) algorithm is developed. The proposed algorithm can reduce the complexity of the recovery process and enable the radar system keep good performance in lower signal-to-noise ratio (SNR), even with coherent sources. Experimental results show the proposed scheme performs better than the existing algorithms.

The paper outline is as follows. The data model for the CS-MIMO radar is presented in Sect. 2. The proposed S-SBL algorithm is derived in Sect. 3. Simulation results are given in Sect. 4. We end the paper by a brief concluding in Sect. 5.

Notation, capital letters and lower case in bold denote, respectively, matrices and vectors. The superscript \( (\varvec{X})^{T} ,(\varvec{X})^{H} ,(\varvec{X})^{ - 1} \,and\,(\varvec{X})^{\dag } \) represent the operations of transpose, Hermitian transpose, inverse and pseudo-inverse, respectively; \( \varvec{X}_{k \bullet } \) and \( \varvec{X}_{ \bullet k} \) represent the k-th row and k-th column of X. The subscript \( \left\| \varvec{X} \right\|_{F} \) denote the Frobenius norm of X; \( \otimes \) stands for the Kronecker product; The Khatri-Rao product (column-wise Kronecker product) is denoted by \( \odot \), i.e., \( \left[ {\varvec{a}_{1} , \cdots ,\varvec{a}_{K} } \right] \odot \left[ {\varvec{b}_{1} , \cdots ,\varvec{b}_{K} } \right] = \left[ {\varvec{a}_{1} \otimes \varvec{b}_{1} , \cdots ,\varvec{a}_{K} \otimes \varvec{b}_{K} } \right] \); The \( M \times M \) identity matrix is denoted by \( \varvec{I}_{M} \), and the \( M \times M \) inverse permutation matrix is denoted by \( \varvec{II}_{M} \).

2 Signal Model

Consider the model for monostatic CS-MIMO radar in Fig. 1 [13]. The radar system is consist of M transmit elements and N receive elements. Both transmit elements and receive elements are located in the x-axis. The total transmit aperture and the receive aperture is \( Z_{TX} \) time and \( Z_{RX} \) time of the carrier signal wavelength, respectively. The position of the m-th transmitter and the n-th receiver is denoted by \( \frac{Z}{2}\zeta_{m} \) and \( \frac{Z}{2}\xi_{n} \), respectively. Define \( Z = Z_{TX} + Z_{RX} \), with \( \zeta_{m} \) lies in the interval \( - \left[ {\frac{{Z_{TX} }}{Z},\frac{{Z_{TX} }}{Z}} \right] \) and \( \xi_{n} \) lies in the interval \( \left[ { - \frac{{Z_{RX} }}{Z},\frac{{Z_{RX} }}{Z}} \right] \). Suppose that the transmit elements’ locations \( \zeta \) are independent and identically distributed (i.i.d.) random variables governed by a distribution \( p\left( \zeta \right) \). Similarly, the positions of the receivers \( \xi \) be drawn i.i.d. from a distribution \( p\left( \xi \right) \). We assumed that there are K targets appearing in the far-field of the CS-MIMO radar system, the DOA of the k-th target is denoted by \( \theta_{k} \). Supposed that the transmit elements emit ideal orthogonal waveforms. In the receive end, the matched filters are used to separate the information from each visual transmitter-to-receiver path. The output of the matched filters with sampled L snapshots can be expressed as

Fig. 1.
figure 1

Signal model of DOA estimation for CS-MIMO radar

$$ \varvec{Y = }\left[ {\varvec{A}_{T} \left( \theta \right) \odot \varvec{A}_{R} \left( \theta \right)} \right]\varvec{S}\text{ + }\varvec{N}\varvec{ = }\varvec{AS}\text{ + }\varvec{N} $$
(1)

where \( \varvec{A}_{T} \left( \theta \right) = \left[ {\varvec{a}_{t} \left( {\theta_{1} } \right)\varvec{,a}_{t} \left( {\theta_{2} } \right), \cdots ,\varvec{a}_{t} \left( {\theta_{K} } \right)} \right] \in {\mathbb{C}}^{M \times K} \) is the transmit direction matrix, with the k-th (\( k = 1,2, \cdots ,K \)) transmit steering vector \( \varvec{a}_{t} \left( {\theta_{k} } \right) \) is denoted by \( \varvec{a}_{t} \left( {\theta_{k} } \right) = \left[ {\exp \left( {j\pi Z\zeta_{1} \sin \theta_{k} } \right),\exp \left( {j\pi Z\zeta_{2} \sin \theta_{k} } \right) \cdots ,\exp \left( {j\pi Z\zeta_{M} \sin \theta_{k} } \right)} \right]^{T} \in {\mathbb{C}}^{M \times 1} \). \( \varvec{A}_{R} \left( \theta \right) = \left[ {\varvec{a}_{r} \left( {\theta_{1} } \right)\varvec{,a}_{r} \left( {\theta_{2} } \right), \cdots ,\varvec{a}_{r} \left( {\theta_{K} } \right)} \right] \in {\mathbb{C}}^{N \times K} \) is the receive direction matrix, and the k-th (\( k = 1,2, \cdots ,K \)) receive steering vector \( \varvec{a}_{r} \left( {\theta_{k} } \right) \) is given by \( \varvec{a}_{r} \left( {\theta_{k} } \right) = \left[ {\text{exp}\left( {j\pi Z\xi_{1} \sin \theta_{k} } \right),\exp \left( {j\pi Z\xi_{2} \sin \theta_{k} } \right) \cdots ,\exp \left( {j\pi Z\xi_{N} \sin \theta_{k} } \right)} \right]^{T} \in {\mathbb{C}}^{N \times 1} \). The visual direction matrix is \( \varvec{A}\varvec{ = }\left[ {\varvec{A}_{T} \left( \theta \right) \odot \varvec{A}_{R} \left( \theta \right)} \right] \in {\mathbb{C}}^{MN \times K} \). \( \varvec{S} \in {\mathbb{C}}^{K \times L} \) is the source matrix and \( \varvec{N} \in {\mathbb{C}}^{MN \times L} \) represent an additive Gaussian white noise matrix with variance \( \sigma_{N}^{2} \).

Additional assumption is that the sources are located in the maximum unambiguous angles. By discretizing the possible angle on a fine uniform grid, i.e., \( \left[ {\varphi_{1} ,\varphi_{2} \cdots ,\varphi_{G} } \right] \)(\( K \ll G \)), we can obtain an visual overcomplete dictionary \( \varvec{B} = \left[ {\varvec{b}\left( {\varphi_{1} } \right),\varvec{b}\left( {\varphi_{2} } \right) \cdots ,\varvec{b}\left( {\varphi_{G} } \right)} \right] \in {\text{C}}^{MN \times G} \) for DOA estimation, where the g-th (\( g = 1,2 \cdots ,G \)) steering vector \( \varvec{b}\left( {\varphi_{g} } \right) = \varvec{a}_{t} \left( {\varphi_{g} } \right) \otimes \varvec{a}_{r} \left( {\varphi_{g} } \right) \), with \( \varvec{a}_{t} \left( {\varphi_{g} } \right) \) is \( \varvec{a}_{t} \left( {\varphi_{g} } \right) = \left[ {\exp \left( {j\pi Z\zeta_{1} \sin \varphi_{g} } \right),\exp \left( {j\pi Z\zeta_{2} \sin \varphi_{g} } \right) \cdots ,\exp \left( {j\pi Z\zeta_{M} \sin \varphi_{g} } \right)} \right]^{T} \in {\mathbb{C}}^{M \times 1} \), and \( \varvec{a}_{r} \left( {\varphi_{g} } \right) = \left[ {\exp \left( {j2\pi Z\xi_{1} \sin \varphi_{g} } \right),\exp \left( {j2\pi Z\xi_{2} \sin \varphi_{g} } \right) \cdots ,\exp \left( {j2\pi Z\xi_{M} \sin \varphi_{g} } \right)} \right]^{T} \in {\mathbb{C}}^{N \times 1} \). Therefore, the received data in (1) can be written as

$$ \varvec{Y} = \varvec{BX}\, + \,\varvec{N},\quad \varvec{X}_{g \bullet } = \left\{ {\begin{array}{*{20}l} {\varvec{S}_{k \bullet } ,} \hfill & {{\text{when }}\varphi_{g} \text{ = }\theta_{k} } \hfill \\ {{\mathbf{0}},} \hfill & {others} \hfill \\ \end{array} } \right., $$
(2)

Noting that \( K \ll MN < G \), the model in (2) is a MMV problem [16]. The DOA estimation problem can be expressed as to recover \( \varvec{\theta} \) from the given measurement matrix Y and the known matrix B, which is equal to recovery the support supp(X) (non-zero rows of X). The previous work in [13] has shown that once MN satisfies certain boundary condition, B would provide a uniform recovery guarantee for (2). Relying on a random array geometry, the model in (2) is linked to the CS framework.

3 Environment Awareness

The subspace that obtained from the covariance matrix decomposition can improve the DOA estimation accuracy in low SNR scene. The covariance matrix of the received data can be estimated by \( \hat{\varvec{R}}_{Y} = \varvec{YY}^{H} /L \). If the sources are noncoherent, the rank of \( \varvec{R}_{Y} \) is \( {\text{rank}}\left( {\varvec{R}_{Y} } \right)\text{ = }K \). But there is a rank loss if the sources are coherent. To maintain the rank of \( \varvec{R}_{Y} \), the spatial smooth method is applied to \( \varvec{R}_{Y} \). The estimated covariance matrix becomes

$$ \varvec{R} = \varvec{R}_{Y} \,{ + }\,\varvec{II}_{MN} \varvec{R}_{Y}^{ * } \varvec{II}_{MN} $$
(3)

The eigenvalue decomposition of R is

$$ \varvec{R}\,\varvec{ = }\,\varvec{U\varSigma U}^{H} \, = \,\sum\limits_{j = 1}^{MN} {\lambda_{j} \varvec{u}_{j} \varvec{u}_{j}^{H} } \,\varvec{ = }\,\varvec{U}_{s}\varvec{\varSigma}_{s} \varvec{U}_{s}^{H} \,{\mathbf{ + }}\,\varvec{U}_{n}\varvec{\varSigma}_{n} \varvec{U}_{n}^{H} $$
(4)

where \( {\varvec{\Sigma}} = diag(\lambda_{1} ,\lambda_{2} , \cdots ,\lambda_{MN} ) \), the eigenvalues are complied with \( \lambda_{1} \ge \cdots \ge \lambda_{K} > \) \( \lambda_{K + 1} = \cdots = \lambda_{MN} \) \( = \sigma_{N}^{2} \). The eigenvectors \( \varvec{u}_{1} ,\varvec{u}_{2} , \cdots ,\varvec{u}_{k} \) corresponding to the \( K \) larger eigenvalues \( \lambda_{1} ,\lambda_{2} , \cdots ,\lambda_{N} \) construct signal subspace \( \varvec{U}_{s} = \left[ {\varvec{u}_{1} ,\varvec{u}_{2} , \cdots ,\varvec{u}_{k} } \right] \), with \( \varvec{\varSigma}_{s} = \left[ {\lambda_{1} ,\lambda_{2} , \cdots ,\lambda_{K} } \right] \). Similarly, the later MN-K eigenvalue are depending on the noise and their numeric values are \( \sigma_{N}^{2} \). The eigenvectors \( \varvec{u}_{K + 1} ,\varvec{u}_{K + 2} , \cdots ,\varvec{u}_{MN} \) corresponding to \( \lambda_{K + 1} ,\lambda_{K + 2} , \cdots ,\lambda_{MN} \) construct noise subspace \( \varvec{U}_{n} = \left[ {\varvec{u}_{K + 1} ,\varvec{u}_{K + 2} , \cdots ,\varvec{u}_{N} } \right] \), and \( \varvec{\varSigma}_{n} = \left[ {\lambda_{K + 1} ,\lambda_{K + 2} , \cdots ,\lambda_{N} } \right] \). Generally, the number of the sources can be estimated by the distribution of the eigenvalue. Let \( \varvec{Z} = \varvec{U}_{s} \in {\mathbb{C}}^{MN \times K} \) denotes the estimated signal subspace, hence there exist a nonsingular matrix \( \varvec{T} \in {\mathbb{C}}^{MN \times MN} \) that

$$ \varvec{Z} = \varvec{BT} $$
(5)

Note that the dimension of Z is much smaller than Y, therefore the required iteration steps of SBL for Z will less than Y Let \( \lambda \) be the variance parameter of the noise, \( \gamma_{g} \) is a hyperparameter, R is the covariance matrix of \( {\mathbf{X}}_{g \bullet } \), which captures the intra-block correlation of the source. The mean and covariance parameters are given by \( \varvec{\mu} \) and \( \varvec{\varSigma} \). According to [18], the d + 1-th iteration process of the structural correlated SBL algorithm can be summarized as follows

$$ \left\{ {\begin{array}{*{20}l} {\left( {\gamma_{g} } \right)_{d + 1} \, = \,\frac{1}{L}\left( {\varvec{X}_{g \bullet } } \right)_{d} \varvec{R}_{d}^{ - 1} \left( {\varvec{X}_{g \bullet }^{T} } \right)_{d} \, + \,\left( {\left( {\varvec{\varSigma}_{{\mathbf{X}}} } \right)_{d} } \right)_{gg} ,\quad g = 1,2 \cdots ,G} \hfill \\ {\varvec{R}_{d + 1} \, = \,\tilde{\varvec{R}}_{d + 1} /\left\| {\tilde{\varvec{R}}_{d + 1} } \right\|_{F} ,\quad \tilde{\varvec{R}}_{d + 1} = \sum\nolimits_{g = 1}^{G} {\left( {\varvec{X}_{g \bullet }^{T} } \right)_{d} \left( {\varvec{X}_{g \bullet } } \right)_{d} + \eta \varvec{I}} } \hfill \\ {\lambda_{d + 1} \, = \,\frac{1}{MNL}\left\| {\varvec{Z} - \varvec{BX}_{d} } \right\|_{F}^{2} + \frac{{\lambda_{d} }}{G}Tr\left[ {\varvec{BH}_{d} } \right]} \hfill \\ {\left( {\varvec{\varSigma}_{{\mathbf{X}}} } \right)_{d + 1} \, = \,\left( {\varvec{\varLambda}_{d}^{ - 1} + \frac{1}{{\lambda_{d} }}\varvec{B}^{T}\varvec{\varLambda}_{d} } \right),\quad\varvec{\varLambda}= diag\left[ {\gamma_{1} ,\gamma_{2} , \cdots ,\gamma_{G} } \right]} \hfill \\ {\varvec{X}_{d + 1} \, = \,\varvec{H}_{d} \varvec{Z},\quad \varvec{H}_{d} =\varvec{\varLambda}_{d} \varvec{B}^{T} \left( {\lambda_{d} \varvec{I} + \varvec{B\varLambda }_{d} \varvec{B}^{T} } \right)^{ - 1} } \hfill \\ \end{array} } \right. $$
(6)

where \( \eta \) is a positive scalar. With the iteration of the SBL, the matrixes are trend to stable. The iteration will repeat until algorithm convergence thus we obtain the estimated \( \hat{\varvec{X}} \). With the non-zeros rows in \( \hat{\varvec{X}} \) we can get the estimated DOAs.

4 Simulation Results

In this section, 1000 Monte Carlo trails are used to assess the DOA estimation performance of the S-SBL algorithm. In our simulation, the successful-rate and average running time are used for performance assessment. A successful trial was recognized if the indexes of estimated sources with \( K \) largest \( l_{2} \)-norms were the support of \( \varvec{X} \). Parameters in our simulation are set to \( Z = 250 \), \( M = N = 6 \), \( \varvec{\theta}= \left[ {15,40,65} \right] \). The range of the angle in the dictionary is \( \left[ {0^{ \circ } ,90^{ \circ } } \right] \) with interval \( 0.1^{ \circ } \). The SNR in the simulation is defined as \( 10\log \left( {{{\left\| {\varvec{Y} - \varvec{N}} \right\|_{2}^{2} } \mathord{\left/ {\vphantom {{\left\| {\varvec{Y} - \varvec{N}} \right\|_{2}^{2} } {\left\| \varvec{N} \right\|_{2}^{2} }}} \right. \kern-0pt} {\left\| \varvec{N} \right\|_{2}^{2} }}} \right) \). We compared our algorithm with traditional MUSIC algorithm [4], the OMP method [15], the FOCUSS algorithm [16] and the SBL method [17].

Figures 2 and 3 depict the noncoherent sources estimation performance comparison of all the algorithms with different SNR, respectively. It can be seen from Fig. 2 that all algorithms would achieve better performance with the increasing SNR, while the S-SBL algorithm provides almost precise results. Another result is that the S-SBL runs faster than the SBL method.

Fig. 2.
figure 2

Successful Rate comparison for noncoherent sources with different SNR

Fig. 3.
figure 3

Average running time comparison for noncoherent sources with different SNR

Figures 4 and 5 depict the coherent sources estimation performance comparison of all the algorithms with different SNR, respectively. In the simulation, the second target and the third target are coherent. It can be seen from the results that the Bayesian algorithms are outperform the other algorithms in this situation. Besides, the S-SBL method perform better than the SBL approach, and the S-SBL method requires less computational load than the SBL method.

Fig. 4.
figure 4

Successful rate comparison for coherent sources with different SNR

Fig. 5.
figure 5

Average running time comparison for coherent sources with different SNR

5 Conclusions

In this paper, we have proposed a subspace-based SBL algorithm for DOA estimation in CS-MIMO radar. The subspace operation deduces the computational load and improves the estimation accuracy in low SNR scene. Our work links the estimation of angle to the parameters learning in the Bayesian framework. The S-SBL method does not require the prior information of the number of targets, and outperforms the traditional MUSIC algorithm and existing CS recovery methods.