1 Introduction

To guarantee reliable transmission, various diversity schemes have been proposed in three physical domains: space, time, and frequency. Previous works on spatial diversity assumed independent links between the transmitter and the receiver [13]. However, this assumption is not always valid due to insufficient antenna spaces or scarce scatterers during transmission. Firstly, the probability of error for two-dimensional signal constellations were analyzed in [4]. Then, the effect of space-time (ST) code on the performance of multiple-input multiple-output (MIMO) systems was studied over flat-fading spatial correlation channels in [5, 6]. But different mobile stations or a mobile moving through different geographical locations may experience channel variations in time. The performance of ST-coded MIMO systems was analyzed over temporal correlated Rayleigh fading channels in [7].

For the more interesting case of frequency-selective fading channels, orthogonal frequency division multiplexing (OFDM) has been recognized as an attractive approach to coping with the multipath effect [8].

Recently, several papers have studied that using space-time-frequency (STF) codes across multiple OFDM blocks obtains the full diversity in frequency-selective fading channels [911]. Others have analyzed the approaches of decoding for reducing the complexity at the receiver [12, 13]. However, limited attention has been devoted so far to the problem of the influence of correlated fading channels on diversity. The spatial correlation of the fading channels is always as a Kronecker model, in which the transmitter correlation is independent of the receiver correlation. This model requires few scatterers between the transmitter and the receiver. However, the channel measurements indicate that the Kronecker structure does not describe the multipath propagation channel correctly [14, 15]. The maximum diversity of SF-coded MIMO-OFDM systems with arbitrary spatial correlation was studied in [16]. Unfortunately, it considers no frequency correlation. However, in most practical situations, multipath delay could cause the channel correlation in frequency domain. An arbitrary spatial and temporal correlation model for STF-coded MIMO-OFDM was presented in [17]. The assumption in [17] is that the multipath delays are independent and the multipaths are separated. However, in a multipath channel environment, when the scatterers are located far from the transmit antenna arrays in a narrow angular range, multipath signals that bounce from these scatterers can be correlated temporally, causing path correlation among channels. Furthermore, path correlation can also be caused by using a pulse shaping filter at the transmitter or the receiver [18]. In fact, the frequency-selective fading channels could not avoid spatial, temporal, and frequency/path correlations.

Motivated by this problem, in our recent paper [19], considering general spatial, temporal, and frequency/path correlations in wireless communication, we studied the performance of STF-coded block-fading MIMO-OFDM systems. We went beyond the limitations of ideal assumptions such as quasi-static or rapid fading channels, unlimited antenna spaces or abundant scatterers, and separable multipaths between the transmitter and the receiver. Our spatial correlation is arbitrary and affected by multipaths; hence, it is not subjected to the constraint of the Kronecker model in [9]. We derived an upper bound on the maximum achievable diversity by Hadamard and tensor products. We also addressed STF code designs with maximum diversity. In this paper, based on rank properties of block matrices, we re-derive the upper bound on the maximum achievable diversity in greater details and discuss the physical meaning of each part in the expression that is conducive to analyze the influence of the individual correlation into the performance of block-fading MIMO-OFDM systems. For achieving the diversity of the system with the arbitrarily correlated channels, we give two examples of the STF codes: one traditional STF code and another quasi-SF code, which are designed for achieving maximum diversity. The decoding complexity is considered in the MIMO system with arbitrary correlated fading channels using the traditional STF code. We also identify the newly developed statistical channel models for MIMO Long-Term Evolution (LTE) [20] and 802.11n [21] as special cases of our STF-coded MIMO-OFDM system by showing that our theoretical diversity results match those simulated from these statistical channel models. Finally, we show that our general diversity result recovers various maximum diversity gains for different special correlation scenarios that have appeared in the literature.

The rest of this paper is organized as follows. Section 2 describes our system model. Section 3 gives an upper bound on the maximum diversity of block-fading MIMO-OFDM system with general spatial, temporal, and frequency/path correlations. Section 4 presents two STF code design criteria and two corresponding examples to show achievability of our upper bound. Section 5 provides the decoding complexity. Section 6 specializes our maximum diversity to several specific channel models, including MIMO LTE and 802.11n; it also shows that our result recovers existing maximum diversity gains in the existing literature. Section 7 concludes the paper.

Notation-wise, I N denotes the N×N identity matrix, 1 N×M and 0 N×M , are the all-one and all-zero N×M matrices, respectively; (.)T and (.)H represent transpose and conjugate transpose, respectively; and ⊙ and ⊗ signify Hadamard and tensor products, respectively. We use rank(R) and range(R) to denote the rank and range of matrix R, respectively, and null(R) the dimension of the null space of matrix R. In addition, \(\bar h\) and h denote symbols in the time and frequency domain, respectively. Finally, in the temporal domain, k(1≤kK) is the index of OFDM blocks; in the frequency domain, n(1≤nN) is the index of subcarriers; in the multipath domain, l(1≤lL) is the index of multipaths; in the spatial domain, i(1≤iN t ) and j(1≤jN r ) are the indexes of the transmit and receive antennas, respectively.

2 System model

We consider MIMO-OFDM systems with arbitrarily correlated frequency-selective block-fading channels. It is assumed that the channel is static within each OFDM block but different and correlated from one block to another. The system is provided with N t transmit antennas, N r receive antennas, N subcarriers, and K OFDM blocks. There are L correlated multipaths between each pair of transmit and receive antennas. The channel impulse response between transmit antenna i and receive antenna j in the kth OFDM block is given by

$${} {\small{\begin{aligned} {\bar h}_{i,j,k}\!\left(\!\tau\! \right) =\! \sum\limits_{l = 0}^{L - 1}\! {\alpha_{i,j,k}\left[l\right]\delta \left(\! {\tau - {\tau_{l}}} \right)}, 1\! \le i\! \le\! {N_{t}}, 1\! \le\! j\! \le\! {N_{r}}, 1\!\le k\! \le\! K, \end{aligned}}} $$
((1))

where τ l and \({\alpha _{i,j,k}[\!l]}\sim \mathcal {CN}\left (0, {\sigma _{l}^{2}}\right)\) are the delay and complex amplitude of the lth path between transmit antenna i and receive antenna j, respectively. The powers of the L paths are normalized such that \(\sum \limits _{l = 0}^{L - 1} {{\sigma _{l}^{2}} = 1}\). We assume that all path delays are located exactly at the sampling instances of the receiver. From (1), the frequency response of the channel is

$${} {\small{\begin{aligned} h_{i,j,k}\!\left(\,f \right) \,=\, \sum\limits_{l = 0}^{L - 1}\! {\alpha_{i,j,k}\left[ l \right]{e^{- \mathrm{j} 2\pi f{\tau_{l}}}}}, 1\! \le i \le\! {N_{t}}, 1 \le j \le {N_{r}}, 1 \!\le k \!\le K, \end{aligned}}} $$
((2))

where \(\mathrm {j} = \sqrt { - 1}\) and h i,j,k (f) is the Fourier transform of \({\bar h}_{i,j,k}\left (\tau \right) \) in (1).

To guarantee reliable transmission, at the transmitter, we employ STF encoding across N t transmit antennas, K OFDM blocks, and N subcarriers. Each STF codeword is a KN×N t KN matrix given by

$$\begin{array}{@{}rcl@{}} {\boldsymbol C} = \text{diag} \left[{ {C}_{1}}, {C}_{2}, \ldots, {{C}_{K}} \right] \end{array} $$
((3))

that satisfies the energy constraint \(E\left |\left |{\boldsymbol C} \right |\right |_{F}^{2} = KN{N_{t}}\), with ||.|| F signifying the Frobenius norm. For 1≤kK, C k is an N×N t N matrix that represents the transmitted codes in the kth OFDM block and is constructed by transmitting codes in N subcarriers. That is,

$$\begin{array}{@{}rcl@{}} {C}_{k} = \text{diag}\left[ {{\boldsymbol{c}}_{k}}[\!1]\!, {{\boldsymbol{c}}_{k}}[\!2]\!, \ldots, {{\boldsymbol{c}}_{k}}[N]\!\right]\!, 1 \le k \le {K}, \end{array} $$
((4))

where c k [ n] is a 1×N t vector, representing an STF code in the kth OFDM block and the nth subcarrier from N t transmit antennas, that can be written as

$${} {\small{\begin{aligned} {{\boldsymbol c}_{k}}[\!n] = \left[ {{ c}_{1,k}\left[ n \right]}, {{ c}_{2,k}\left[ n \right]}, \ldots, {{ c}_{N_{t},k}\left[ n \right]} \right], 1 \le k \le K, 1 \le n \le N, \end{aligned}}} $$
((5))

with c i,k [n], 1≤iN t , 1≤kK, 1≤nN, being the transmitted code from the ith transmit antenna in the kth OFDM block and the nth subcarrier.

After matched filtering and fast Fourier transform (FFT), the received signal of our STF-coded MIMO-OFDM system from the jth receive antenna in the kth OFDM block and the nth subcarrier is given by

$${} \begin{aligned} y_{j,k}[\!n] &= \sqrt {\frac{\rho }{{{N_{t}}}}} \sum\limits_{i = 1}^{{N_{t}}} {h_{i,j,k}[\!n]\!c_{i,k}[\!n]} \\ &\quad+ z_{j,k}[\!n]\!, 1\le j \le {N_{r}}, 1 \le k \le {K}, 1 \le n \le {N}, \end{aligned} $$
((6))

where \(z_{j,k}[\!n] \sim \mathcal {CN}(0, 1)\) is the complex AGWN and the scaling factor \(\sqrt {\frac {\rho }{N_{t}}}\) ensures that ρ is the average signal-to-noise ratio (SNR) at each receive antenna. The channel frequency response h i,j,k [ n] from transmit antenna i to receive antenna j in the kth OFDM block and the nth subcarrier is a uniformly sampled version of h i,j,k (f) in (2) and can be expressed as

$${} \begin{aligned} h_{i,j,k}[\!n] &= \sum\limits_{l = 0}^{L - 1} {\alpha_{i,j,k}[\!l]\!{(W)_{N}^{nl}}},1 \le i \le {N_{t}}, 1 \le j \\ &\quad\le {N_{r}}, 1 \le k \le K, 1 \le n \le N, \end{aligned} $$
((7))

where \({(W)_{N}} = \frac {1}{\sqrt N}{e^{-\frac {\mathrm {j}2\pi }{N} }}\).

The received signals in (6) can be represented in matrix form as

$$\begin{array}{@{}rcl@{}} {\boldsymbol Y} = \sqrt{\frac{\rho}{N_{t}}}{\boldsymbol C}\otimes{\boldsymbol {I}}_{N_{r}}{\boldsymbol {H}} +{\boldsymbol Z}, \end{array} $$
((8))

where Y and Z are the N r KN×1 vectors, representing the received signals and noises, respectively.

$$\begin{array}{@{}rcl@{}} {\boldsymbol H}= \left[{H}_{1}^{\mathrm{T}}, {H}_{2}^{\mathrm{T}}, \ldots, {H}_{K}^{\mathrm{T}} \right]^{\mathrm{T}} \end{array} $$
((9))

is an N r N t KN×1 vector, with H k , 1≤kK, representing the fading channels in the kth OFDM block. The N r N t N×1 vector

$$\begin{array}{@{}rcl@{}} {H}_{k} = \left[ {{\boldsymbol {h}}_{k}[\!1]^{\mathrm{T}}}, {{\boldsymbol {h}}_{k}[\!2]^{\mathrm{T}}}, \ldots, {{\boldsymbol {h}}_{k}[N]^{\mathrm{T}}}\right]^{\text{{T}}},1 \le k \le K, \end{array} $$
((10))

is composed of fading channel responses in N subcarriers, where the N r N t ×1 vector h k [ n], 1≤kK, 1≤nN, representing the fading channels H k in the nth subcarrier, can be written as

$${} {\fontsize{8.6}{8.6}{\begin{aligned} {{\boldsymbol {h}}_{k}[\!n]} \,=\, {\left[{h_{1,1,k}\left[{n}\right]}, \ldots, {h_{1,N_{r},k}\left[{n}\right]}, \ldots, {h_{N_{t},1,k}\left[{n}\right]}, \ldots, {h_{N_{t},N_{r},k}\left[{n}\right]}\right]^{\text{{T}}}}. \end{aligned}}} $$
((11))

Similarly, the complex amplitudes of the fading channel responses can be expressed in three domains: space, time, and path. For 1≤kK, 1≤lL, we denote the N r N t ×1 vector

$${} {\small{\begin{aligned} {{\boldsymbol {a}}_{k}[\!l]} = {\left[ {\alpha_{1,1,k}\left[{l}\right]}, \ldots, {\alpha_{1,N_{r},k}\left[{l}\right]}, \ldots, {\alpha_{N_{t},1,k}\left[{l}\right]}, \ldots, {\alpha_{N_{t},N_{r},k}\left[{l}\right]} \right]^{\text{{T}}}} \end{aligned}}} $$
((12))

as the complex amplitudes of the fading channels in the kth OFDM block and the lth path. We also denote the N r N t L×1 vector

$$\begin{array}{@{}rcl@{}} { {A}}_{k} = {\left[ {{\boldsymbol {a}}_{k}[\!1]^{\mathrm{T}}}, {{\boldsymbol {a}}_{k}[\!2]^{\mathrm{T}}}, \ldots, {{\boldsymbol {a}}_{k}[\!L]^{\mathrm{T}}}\right]^{\text{{T}}}},1 \le k \le K, \end{array} $$
((13))

as the complex amplitudes of the fading channels in the kth OFDM block through L paths. Finally, the N r N t KL×1 vector

$$\begin{array}{@{}rcl@{}} {\boldsymbol A}= \left[{A}_{1}^{\mathrm{T}}, {A}_{2}^{\mathrm{T}}, \ldots, {A}_{K}^{\mathrm{T}}\right]^{\mathrm{T}} \end{array} $$
((14))

represents the complex amplitudes of the frequency-selective block-fading channels in our system. Then from (7), the frequency-selective block-fading channels can be expressed as

$$\begin{array}{@{}rcl@{}} {\boldsymbol H} &=&\left({{\boldsymbol {I}}_{K}} \otimes {\boldsymbol T}_{N,L}\otimes {{\boldsymbol {I}}_{{N_{r}}{N_{t}}}}\right) {\boldsymbol A}, \end{array} $$
((15))

where

$$\begin{array}{@{}rcl@{}} {\boldsymbol T}_{N,L} = \left[ {\begin{array}{cccc} {{{(W)_{N}}}} & {{(W)_{N}^{2}}} & \ldots & {{(W)_{N}^{L}}} \\ {{(W)_{N}^{2}}} & {{(W)_{N}^{4}}}& \ldots & {{(W)_{N}^{2L}}}\\ \vdots & \vdots & \ddots & \vdots \\ {{(W)_{N}^{N}}} & {{(W)_{N}^{2N}}} & \ldots & {{(W)_{N}^{NL}}} \\ \end{array}} \right] \end{array} $$
((16))

is the N×L FFT matrix.

3 An upper bound on maximum diversity

We study the average pairwise error probability (PEP) before using it to help us determine the diversity gain of our system. We assume that the receiver has perfect channel state information and employ a maximum-likelihood decoder

$$\begin{array}{@{}rcl@{}} {\boldsymbol {\tilde C}} = \mathop {\arg }\limits_{\boldsymbol {C}} {\min }{\left|\left| {{\boldsymbol {Y}} - \sqrt {\frac{\rho }{{{N_{t}}}}} {{\boldsymbol C}\otimes{\boldsymbol {I}}_{N_{r}}{{\boldsymbol H}}}} \right|\right|^{2}}. \end{array} $$
((17))

We define a KN×N t KN matrix Δ as the difference of the transmitted codeword and its corresponding detected codeword, i.e., \({\boldsymbol \Delta } \buildrel \Delta \over = {\boldsymbol C}-{\boldsymbol {\tilde C}}\). This allows us to explore the difference of two codewords in all three domains, i.e., across N t transmit antennas, K OFDM blocks, and N subcarriers. Similar to the expression of an STF code in (3), we can write the difference of two STF codewords as

$$\begin{array}{@{}rcl@{}} {\boldsymbol \Delta} = \text{diag} \left[ \Delta_{1}, \Delta_{2}, \ldots \Delta_{K}\right], \end{array} $$
((18))

where Δ k , 1≤kK, represents the difference of two codewords in the kth OFDM block.

According to [22], the PEP between two codewords C and \({\boldsymbol {\tilde C}}\) for a given channel realization can be upper bounded by

$$\begin{array}{@{}rcl@{}} \text{Pr} ({\boldsymbol C} \to {\boldsymbol {\tilde C}}|{\boldsymbol H}) \le \frac{1}{2}\exp \left({ - \frac{\rho }{{4{N_{t}}}}{{\left|\left| { {\boldsymbol \Delta} \otimes {\boldsymbol I}_{N_{r}}{ \boldsymbol H}} \right|\right|}^{2}}} \right). \end{array} $$
((19))

Since the channel coefficients are jointly Gaussian, the N r KN×1 vector \({ \boldsymbol \Delta } \otimes {\boldsymbol I}_{N_{r}}{ \boldsymbol H }\) for a fixed code realization has a Gaussian distribution with zero mean and N r KN×N r KN covariance matrix

$$\begin{array}{@{}rcl@{}} {{\boldsymbol {R}}} = E\left\{ {\boldsymbol \Delta} \otimes {\boldsymbol I}_{N_{r}}{\boldsymbol H} {\boldsymbol H}^{\mathrm{H}} {\left({\boldsymbol \Delta} \otimes {\boldsymbol I}_{N_{r}} \right)}^{\mathrm{H}} \right\}. \end{array} $$
((20))

Averaging the PEP in (19) over all channel realizations, we have [23]

$$\begin{array}{@{}rcl@{}} \text{Pr} ({\boldsymbol C} \to {\boldsymbol {\tilde C}}) \le {\left({\frac{\rho }{{4{N_{t}}}}} \right)^{- \text{rank}({{\boldsymbol {R}} })}}\left({\prod\limits_{i = 1}^{\text{rank} ({{\boldsymbol {R}} })} {{\lambda_{i}}{{\left({{{\boldsymbol {R}}}} \right)}^{- 1}}}} \right), \end{array} $$
((21))

where rank(R) and λ i (R) are the rank and the ith eigenvalue of R, respectively. From (21), we see that the diversity order depends on rank(R).

The covariance matrix E{H H H} in (20) represents the correlation of frequency-selective block-fading channels in space, time, and frequency/path domains, and from (15), it can be rewritten as

$${} \begin{aligned} E \left\{{\boldsymbol H} {\boldsymbol H}^{\mathrm{H}}\right\} &= \left({{\boldsymbol {I}}_{K}} \otimes {\boldsymbol T}_{N,L}\otimes {{\boldsymbol {I}}_{{N_{r}}{N_{t}}}}\right) E \left\{{\boldsymbol A} {\boldsymbol A}^{\mathrm{H}}\right\}\\&\quad\times \left({{\boldsymbol {I}}_{K}} \otimes {\boldsymbol T}_{N,L}\otimes {{\boldsymbol {I}}_{{N_{r}}{N_{t}}}}\right)^{\mathrm{H}}. \end{aligned} $$
((22))

Then, from (20) and (22), the covariance matrix R can be expressed as

$$\begin{array}{@{}rcl@{}} {{\boldsymbol R}} = {\boldsymbol {\boldsymbol \varphi}} E \left\{{\boldsymbol A} {\boldsymbol A}^{\mathrm{H}}\right\} {\boldsymbol {\boldsymbol \varphi}}^{\mathrm{H}}, \end{array} $$
((23))

with

$$\begin{array}{@{}rcl@{}} {\boldsymbol \varphi} \buildrel \Delta \over = \left({\boldsymbol \Delta}{{\boldsymbol I}_{K}} \otimes {\boldsymbol T}_{N,L}\otimes{{\boldsymbol I}_{N_{t}}} \right)\otimes{{\boldsymbol I}_{N_{r}}}. \end{array} $$
((24))

According to the property of matrix product, pre-multiplying matrix E{A A H} by φ or post-multiplying it by φ H does not increases its rank, hence from (23), the rank of covariance matrix R satisfies

$$\begin{array}{@{}rcl@{}} {\text{rank}\left({{\boldsymbol R} }\right)} \le {\text{rank}\left(E \{{\boldsymbol A} {\boldsymbol A}^{\mathrm{H}}\} \right)}, \end{array} $$
((25))

with equality holding when matrix φ has full column rank.

For the sake of simplicity, in the sequel, we only consider the influence of the channel frequency/path correlation on the channel spatial correlation and assume that the channel temporal correlation is independent of the spatial and frequency/path correlations. The effect of the channel temporal correlation on the channel spatial and frequency/path correlations can be studied in a similar way, and hence is omitted here.

Under the assumption that the channel temporal correlation is independent of the channel spatial and frequency/path correlations, we have for 1≤i,pN t ,1≤j,qN r ,1≤k,mK,1≤l,dL,

$${} E\!\left\{{\alpha_{i,j,k}[\!l]}\left({\alpha_{p,q,m}[\!d]}\right)^{*}\right\} =r_{l,d}^{L} r_{k,m}^{K} E\left\{\!{\alpha_{i,j,k}[\!l]}\left({\alpha_{p,q,k}[\!d]}\right)^{*}\right\}, $$
((26))

where \(r_{l,d}^{L}\) and \(r_{k,m}^{K}\) denote the frequency/path correlation coefficient between the lth path and the dth path, and the temporal correlation coefficient between the kth OFDM block and the mth OFDM block, respectively.

Therefore, the covariance matrix E{A A H} can be expressed as

$$\begin{array}{@{}rcl@{}} E\! \left\{{\boldsymbol A} {\boldsymbol A}^{\mathrm{H}}\right\} &=& {{\boldsymbol R}^{\boldsymbol K}}\!\otimes\! \left(\!\! {{{\boldsymbol R}^{\boldsymbol L}} \otimes {{\boldsymbol 1}_{{N_{r}}{N_{t}}\times {N_{r}}{N_{t}}}}} \odot\! \left[\! {\begin{array}{ccc} {{\boldsymbol R}^{\boldsymbol {S}}_{1,1}} & \ldots & {{\boldsymbol R}^{\boldsymbol {S}}_{1,L}} \\ \vdots & \ddots & \vdots \\ {{\boldsymbol R}^{\boldsymbol {S}}_{L,1}} & \ldots & {{\boldsymbol R}^{\boldsymbol {S}}_{L,L}} \\ \end{array}} \right]\! \right)\\ &=&{{\boldsymbol R}^{\boldsymbol K}}\otimes \left[ {\begin{array}{ccc} r^{L}_{1,1}{{\boldsymbol R}^{\boldsymbol {S}}_{1,1}} & \ldots & r^{L}_{1,L}{{\boldsymbol R}^{\boldsymbol {S}}_{1,L}} \\ \vdots & \ddots & \vdots \\ r^{L}_{L,1}{{\boldsymbol R}^{\boldsymbol {S}}_{L,1}} & \ldots & r^{L}_{L,L}{{\boldsymbol R}^{\boldsymbol {S}}_{L,L}} \end{array}} \right], \end{array} $$
((27))

where the K×K Hermitian matrix R K and the L×L Hermitian matrix R L represent the channel temporal and path/frequency correlations, respectively, and they can be expressed as

$$\begin{array}{@{}rcl@{}} {{\boldsymbol{R}}^{\boldsymbol K}} = \left[ {\begin{array}{cccc} r^{K}_{1,1} & r^{K}_{1,2} & \ldots & r^{K}_{1,K} \\ r^{K}_{2,1} & r^{K}_{2,2} & \ldots & r^{K}_{2,K} \\ \vdots & \ddots & \ddots & \vdots \\ r^{K}_{K,1} & r^{K}_{K,2} & \ldots & r^{K}_{K,K} \\ \end{array}} \right] \end{array} $$
((28))

and

$$\begin{array}{@{}rcl@{}} {{\boldsymbol{R}}^{\boldsymbol L}} = \left[ {\begin{array}{cccc} r^{L}_{1,1} & r^{L}_{1,2} & \ldots & r^{L}_{1,L} \\ r^{L}_{2,1} & r^{L}_{2,2} & \ldots & r^{L}_{2,L} \\ \vdots & \ddots & \ddots & \vdots \\ r^{L}_{L,1} & r^{L}_{L,2} & \ldots & r^{L}_{L,L} \\ \end{array}} \right], \end{array} $$
((29))

respectively. The N r N t ×N r N t Hermitian matrix \({\boldsymbol R}^{\boldsymbol {S}}_{l,d}\) in (27) denotes the channel spatial correlation between the lth path and the dth path. For 1≤l,dL, it can be written as

$$\begin{array}{*{20}l} &{\boldsymbol R}^{\boldsymbol {S}}_{l,d} =\left[ {\begin{array}{ccccc} E\left\{{\alpha_{1,1,k}[\!l]}\left({\alpha_{1,1,k}[\!d]}\right)^{*}\right\} & \ldots & E\left\{{\alpha_{1,1,k}[\!l]}\left({\alpha_{1,N_{r},k}[\!d]}\right)^{*}\right\} & \ldots & E\left\{{\alpha_{1,1,k}[\!l]}\left({\alpha_{N_{t},N_{r},k}[\!d]}\right)^{*}\right\} \\ \vdots & \ddots & \vdots & \ddots & \vdots \\ E\left\{{\alpha_{1,N_{r},k}[\!l]}\left({\alpha_{1,1,k}[\!d]}\right)^{*}\right\} & \ldots & E\left\{{\alpha_{1,N_{r},k}[\!l]}\left({\alpha_{1,N_{r},k}[\!d]}\right)^{*}\right\} & \ldots & E\left\{{\alpha_{1,N_{r},k}[\!l]}\left({\alpha_{N_{t},N_{r},k}[\!d]}\right)^{*}\right\} \\ \vdots & \ddots & \vdots & \ddots & \vdots \\ E\left\{{\alpha_{N_{t},N_{r},k}[\!l]}\left({\alpha_{1,1,k}[\!l]}\right)^{*}\right\} & \ldots & E\left\{{\alpha_{N_{t},N_{r},k}[\!l]}\left({\alpha_{1,N_{r},k}[\!l]}\right)^{*}\right\} & \ldots & E\left\{{\alpha_{N_{t},N_{r},k}[\!l]}\left({\alpha_{N_{t},N_{r},k}[\!l]}\right)^{*}\right\} \\ \end{array}} \right]. \end{array} $$
((30))

To simplify the notation, we define an N r N t ×N r N t L block matrix

$$ {{\boldsymbol r}_{l} } \buildrel \Delta \over = \left[ {r^{L}_{l,1}{\boldsymbol R}^{\boldsymbol {S}}_{l,1}}, {r^{L}_{l,2}{\boldsymbol R}^{\boldsymbol {S}}_{l,2}}, \ldots, {r^{L}_{l,L}{\boldsymbol R}^{\boldsymbol {S}}_{l,L}}\right] $$

for each l with 1≤lL. Then, the covariance matrix E{A A H} in (27) can be rewritten as

$$\begin{array}{@{}rcl@{}} E \left\{{\boldsymbol A} {\boldsymbol A}^{\mathrm{H}}\right\} = {{\boldsymbol R}^{\boldsymbol K}}\otimes \left[ {{\boldsymbol r}_{1}^{\mathrm{T}} }, {{\boldsymbol r}_{2}^{\mathrm{T}} }, \ldots, {{\boldsymbol r}_{L}^{\mathrm{T}}} \right]^{\mathrm{T}}. \end{array} $$
((31))

To compute the rank of covariance matrix E{A A H}, we first give the following lemma that states the relationship between the rank of a block matrix and those of its submatrices.

Lemma 1.

For two given sets of matrices \(\{{\boldsymbol B}_{1} \in \mathbb {C}^{b \times {b_{1}}}\), \({\boldsymbol B}_{2} \in \mathbb {C}^{b \times {b_{2}}},\ldots, {\boldsymbol B}_{P} \in \mathbb {C}^{b \times {b_{P}}}\}\) and \(\{{\boldsymbol D}_{1} \in \mathbb {C}^{{d_{1}} \times d}, {\boldsymbol D}_{2} \in \mathbb {C}^{{d_{2}} \times d}, \ldots, {\boldsymbol D}_{Q} \in \mathbb {C}^{{d_{Q}} \times d}\}\), we have

$${} \begin{aligned} & \text{rank}\left[ {\boldsymbol B}_{1}, {\boldsymbol B}_{2}, \ldots, {\boldsymbol B}_{P}\right] \\ &=\! \sum_{p = 1}^{P}{\text{rank}\left({\boldsymbol B}_{p} \right)}+\sum_{p = 1}^{P}{{\text{{null}}}\left({\boldsymbol B}_{p} \right)} - {\text{{null}}} \left[ {\boldsymbol B}_{1}, {\boldsymbol B}_{2},\ldots, {\boldsymbol B}_{P} \right], \end{aligned} $$
((32))
$${} \begin{aligned} & \text{rank}\left[ {{\boldsymbol D}_{1}^{\mathrm{T}}}, {{\boldsymbol D}_{2}^{\mathrm{T}}}, \ldots, {{\boldsymbol D}_{Q}^{\mathrm{T}}} \right]^{\mathrm{T}} \\ &=\! \sum_{q = 1}^{Q}{\text{rank}\left({\boldsymbol D}_{q} \right)}+\!\sum_{q = 1}^{Q}\!{\text{{null}}\left(\!{\boldsymbol D}_{q}^{\mathrm{T}}\! \right)} - \text{{null}} \left[\! {{\boldsymbol D}_{1}^{\mathrm{T}}}\!, {{\boldsymbol D}_{2}^{\mathrm{T}}}\!, \ldots, {{\boldsymbol D}_{Q}^{\mathrm{T}}} \right]. \end{aligned} $$
((33))

Further, if \(\text {range}\left ({\boldsymbol B}_{p}\right)\cap \text {range}\left ({\boldsymbol B}_{p'}\right) = \{0\}\phantom {\dot {i}\!}\) and \(\text {range}\left ({\boldsymbol D}_{q}^{\mathrm {T}}\right)\cap \text {range}\left ({\boldsymbol D}_{q'}^{\mathrm {T}}\right) = \{0\}\phantom {\dot {i}\!}\), with 1≤p,p P, 1≤q,q Q, pp , qq , then

$$\begin{array}{*{20}l} & \text{rank}\left[ {\boldsymbol B}_{1}, {\boldsymbol B}_{2}, \ldots, {\boldsymbol B}_{P}\right] = \sum_{p = 1}^{P}{\text{rank}\left({\boldsymbol B}_{p} \right)}, \end{array} $$
((34a))
$$\begin{array}{*{20}l} & \text{rank}\left[ {{\boldsymbol D}_{1}^{\mathrm{T}}}, {{\boldsymbol D}_{2}^{\mathrm{T}}}, \ldots, {{\boldsymbol D}_{Q}^{\mathrm{T}}}\right]^{\mathrm{T}} = \sum_{q = 1}^{Q}{\text{rank}\left({\boldsymbol D}_{q} \right)}. \end{array} $$
((34b))

If \(\text {range}\left ({\boldsymbol B}_{p}\right) \subseteq \text {range}\left ({\boldsymbol B}_{P}\right)\phantom {\dot {i}\!}\) and \(\text {range}\left ({\boldsymbol D}_{q}^{\mathrm {T}}\right) \subseteq \text {range}\left ({\boldsymbol D}_{Q}^{\mathrm {T}}\right)\phantom {\dot {i}\!}\), 1≤pP−1, 1≤qQ−1 then

$$\begin{array}{*{20}l} &\text{rank}\left[ {\boldsymbol B}_{1}, {\boldsymbol B}_{2}, \ldots, {\boldsymbol B}_{P} \right] = \text{rank}\left({\boldsymbol B}_{P} \right), \end{array} $$
((35a))
$$\begin{array}{*{20}l} & \text{rank}\left[ {{\boldsymbol D}_{1}^{\mathrm{T}}}, {{\boldsymbol D}_{2}^{\mathrm{T}}}, \ldots, {{\boldsymbol D}_{Q}^{\mathrm{T}}} \right]^{\mathrm{T}} = \text{rank}\left({\boldsymbol D}_{Q} \right). \end{array} $$
((35b))

Proof.

See Appendix A.

Then, according to Lemma 1, the rank of covariance matrix E{A A H} in (31) can be computed as

$${\kern-16.4pt} {\small{\begin{aligned} &\text{rank}\left(E \{{\boldsymbol A} {\boldsymbol A}^{\mathrm{H}}\} \right)\\ &= \text{rank}\left(\!{\boldsymbol R}^{\boldsymbol K}\!\right)\!\left(\sum_{l = 1}^{L}{\left(\!\text{rank}\left({{\boldsymbol r}_{l}} \right)+\text{{null}}\left(\!{\boldsymbol r}_{l}^{\mathrm{T}} \right)\!\right)} - \text{{null}}\!\left[\! {\boldsymbol r}_{1}^{\mathrm{T}}\!, {\boldsymbol r}_{2}^{\mathrm{T}}\!, \ldots, {\boldsymbol r}_{L}^{\mathrm{T}}\! \right]\!\right)\\ &=\text{rank}\left({\boldsymbol R}^{\boldsymbol K}\right) \left(\sum_{l = 1}^{L}\sum_{d=1}^{L}{\left(\text{rank}\left(r^{L}_{l,d} {\boldsymbol R}^{\boldsymbol{S}}_{l,d} \right) +\text{{null}}\left(r^{L}_{l,d} {\boldsymbol R}^{\boldsymbol{S}}_{l,d} \right)\right)} \right.\\ &\quad\left.+\sum_{l = 1}^{L}{\left(\text{{null}}\left({\boldsymbol r}_{l}^{\mathrm{T}} \right)-\text{{null}}\left({{\boldsymbol r}_{l}} \right)\right)} - \text{{null}}{\left[ {\boldsymbol r}_{1}^{\mathrm{T}}, \ldots, {\boldsymbol r}_{L}^{\mathrm{T}} \right]}\right). \end{aligned}}} $$
((36))

Thus, from (25) and (36), we have the following theorem.

Theorem 1.

An upper bound on the maximum achievable diversity order of STF-coded MIMO-OFDM systems over frequency-selective block-fading channels with arbitrary spatial, temporal, and path/frequency correlations is

$${} {\small{\begin{aligned} {\text{rank}\left({{\boldsymbol R}}\right)} &\le\text{rank}\left({\boldsymbol R}^{\boldsymbol K}\right)\left(\sum_{l = 1}^{L}\sum_{d=1}^{L}\left(\text{rank}\left(r^{L}_{l,d} {\boldsymbol R}^{\boldsymbol{S}}_{l,d} \right)\right.\right.\\ &\left.\left. \quad +~\text{{null}}\left(r^{L}_{l,d} {\boldsymbol R}^{\boldsymbol{S}}_{l,d} \right)\!\right) +\sum_{l = 1}^{L}\!{\left(\!\text{{null}}\left(\!{\boldsymbol r}_{l}^{\mathrm{T}}\! \right)-\text{{null}}\left({{\boldsymbol r}_{l}} \right)\!\right)}\!\!\right.\\ &~\quad\left.-~\text{{null}}\left({\left[ {\boldsymbol r}_{1}^{\mathrm{T}}, \ldots, {\boldsymbol r}_{L}^{\mathrm{T}} \right]}\right)\!{\vphantom{\sum_{l = 1}^{L}}}\right). \end{aligned}}} $$
((37))

Furthermore, under the assumption of NN t L, equality in (37) holds if the matrix φ in (24) has full column rank, i.e., rank(φ)=N r N t KL.

In (37), the \(\sum _{l = 1}^{L}\text {{null}}\left ({\boldsymbol r}_{l}^{\mathrm {T}} \right) - \text {{null}}\left ({\left [ {\boldsymbol r}_{1}^{\mathrm {T}}, \ldots, {\boldsymbol r}_{L}^{\mathrm {T}} \right ]}\right)\) part represents the effect of the frequency/path correlation on the performance, while the \(\sum _{l = 1}^{L}\sum _{d=1}^{L}{\text {{null}}\left (r^{L}_{l,d} {\boldsymbol R}^{\boldsymbol {S}}_{l,d} \right)} - \sum _{l = 1}^{L}{\text {{null}}\left ({{\boldsymbol r}_{l}} \right)}\) part signifies the influence of multipaths on the spatial correlation.

Theorem 1 indicates that our upper bound on the maximum diversity of MIMO-OFDM systems depends on the correlation of fading channels, and the STF coding structure does not affect the upper bound. In addition, the maximum diversity in multiple domains is the product of the maximum diversities in each separated domain. Finally, the channel correlations in each domain, the correlation between the spatially correlated channels, and the frequency/path correlated channels affect our upper bound.

In particular, if the effects of different paths on our system are independent, i.e., \(\text {range} \left ({\boldsymbol r}_{l}\right) \cap \text {range} \left ({\boldsymbol r}_{l'}\right) = \{0\}\phantom {\dot {i}\!}\), 1≤l,l L, ll , then according to Lemma 1, our upper bound on the maximum achievable diversity in Theorem 1 can be simplified as

$$ \begin{aligned} {\text{rank}\left({{\boldsymbol R}}\right)} &\le\sum_{l = 1}^{L}{\text{rank}\left({{\boldsymbol r}_{l}} \right)} \\ &=\text{rank}\left({\boldsymbol R}^{\boldsymbol K}\right) \left(\sum_{l = 1}^{L}\sum_{d=1}^{L}{\text{rank}\left(r^{L}_{l,d} {\boldsymbol R}^{\boldsymbol{S}}_{l,d} \right)}\right.\\ &\quad\left.{ +\, \text{{null}}\left(r^{L}_{l,d} {\boldsymbol R}^{\boldsymbol{S}}_{l,d} \right)} - {\text{{null}}\left({{\boldsymbol r}_{l}} \right)}{\vphantom{\sum_{d=1}^{L}}} \right). \end{aligned} $$
((38))

This result implies that an upper bound on the maximum diversity is the sum of the maximum diversities in each separated path, and the maximum diversities in each path are affected by the spatial correlation of the links between the transmitter and the receiver across different paths and the channel temporal correlation.

Under the condition that the influences of different paths on the system are independent, if the regions where the channel spatial correlation occur in different paths are similar, i.e., \(\text {range}\left ({r^{L}_{l,d}{\boldsymbol R}^{\boldsymbol S}_{l,d}}\right) \subseteq \text {range}\left ({\mathop {\arg } \limits _{{r^{L}_{l,d}}{{\boldsymbol R}^{\boldsymbol S}_{l,d}}} {\mathop {\max } \limits _{1 \le d\le {L}}{\left \{\text {rank}\left (r^{L}_{l,d}{\boldsymbol R}^{\boldsymbol S}_{l,d}\right)\right \}}}}\right)\), 1≤lL, according to Lemma 1, we have

$$\begin{array}{@{}rcl@{}} \text{rank} \left({\boldsymbol r}_{l}\right) = {\mathop {\max} \limits_{1 \le d \le {L}}{\left\{\text{rank}\left(r^{L}_{l,d}{\boldsymbol R}^{\boldsymbol S}_{l,d}\right)\right\}}}, 1 \le l \le {L}. \end{array} $$
((39))

Then, our upper bound on the maximum diversity of MIMO-OFDM system in Theorem 1 can be further simplified as

$$\begin{array}{@{}rcl@{}} \text{rank} \left({\boldsymbol R}\right) \le \text{rank}\left({\boldsymbol R}^{\boldsymbol K}\right)\sum_{l = 1}^{L}{{\mathop {\max} \limits_{1 \le d \le {L}}{\left\{\text{rank}\left(r^{L}_{l,d}{\boldsymbol R}^{\boldsymbol S}_{l,d}\right)\right\}}}}. \end{array} $$
((40))

In this case, the maximum diversity of block-fading MIMO-OFDM system is the product of the time diversity and the sum of maximum space diversities in each path.

4 STF code designs with maximum diversity

From (23), if matrix φ has full column rank, i.e., rank(φ)=N r N t KL, equality in (37) holds, which means that our upper bound on the maximum diversity in Theorem 1 could be achieved. This prompts us to study the STF code design criteria for maximum diversity. We start with addressing the rank of matrix φ.

From (24), we have

$$\begin{array}{@{}rcl@{}} \text{rank}\left({\boldsymbol \varphi}\right) = {N_{r}}\, \text{rank} \left({\boldsymbol \Delta}{{\boldsymbol I}_{K}} \otimes {\boldsymbol T}_{N,L}\otimes {{\boldsymbol I}_{N_{t}}} \right). \end{array} $$
((41))

Owing to the expression of Δ in (18), we can write \({\boldsymbol \Delta }{{\boldsymbol I}_{K}} \otimes {\boldsymbol T}_{N,L}\otimes {{\boldsymbol I}_{N_{t}}}\phantom {\dot {i}\!}\) as

$${} \begin{aligned} &{\boldsymbol \Delta}{{\boldsymbol I}_{K}} \otimes {\boldsymbol T}_{N,L}\otimes {{\boldsymbol I}_{N_{t}}} =\\ &\quad\ \left[ {\begin{array}{cccc} {\Delta}_{1} {\boldsymbol T}_{N,L}\otimes {{\boldsymbol I}_{N_{t}}} & 0 & \ldots & 0 \\ 0 & {\Delta}_{2} {\boldsymbol T}_{N,L}\otimes {{\boldsymbol I}_{N_{t}}} & \ddots & \vdots \\ \vdots & \ddots & \ddots & 0\\ 0 & \ldots & 0 & {\Delta}_{K} {\boldsymbol T}_{N,L}\otimes {{\boldsymbol I}_{N_{t}}} \end{array}} \right]. \end{aligned} $$
((42))

According to the property of matrix product, for 1≤kK, \({\Delta }_{k} {\boldsymbol T}_{N,L}\otimes {{\boldsymbol I}_{N_{t}}}\phantom {\dot {i}\!}\) can be rewritten as

$$ \begin{aligned} {\Delta}_{k} {\boldsymbol T}_{N,L}\otimes {{\boldsymbol I}_{N_{t}}} &=\left[ {\Delta}_{k} {\boldsymbol W}_{N}^{1}\otimes{\boldsymbol I}_{N_{t}}, {\Delta}_{k} {\boldsymbol W}_{N}^{2}\right.\\ &\quad\left.\otimes\,{\boldsymbol I}_{N_{t}}, \ldots, {\Delta}_{k} {\boldsymbol W}_{N}^{L}\otimes{\boldsymbol I}_{N_{t}} \right], \end{aligned} $$
((43))

where the N×1 vector \({\boldsymbol W}_{N}^{l}\phantom {\dot {i}\!}\) is the lth column of the FFT matrix T N,L in (16). That is, for 1≤lL, \({\boldsymbol W}_{N}^{l} \buildrel \Delta \over = \left [ {(W)}_{N}^{l}, { (W)}_{N}^{2l}, \ldots, { (W)}_{N}^{Nl} \right ]^{\mathrm {T}}\).

Because \({\boldsymbol W}_{N}^{l}\) and \({\boldsymbol W}_{N}^{l'}\) are the lth and the l th columns of the FFT matrix, respectively, then due to the property of the FFT matrix, \(\text {range}\left ({\boldsymbol W}_{N}^{l}\right) \cap \text {range}\left ({\boldsymbol W}_{N}^{l'}\right) = \{0\}\phantom {\dot {i}\!}\), 1≤l,l L, ll . According to Lemma 1, from (41) to (43), we have

$$\begin{array}{@{}rcl@{}} \text{rank}\left(\boldsymbol \varphi\right) = N_{r} \sum_{k =1}^{K}{\sum_{l =1}^{L}{\text{rank}\left({{\Delta}_{k} {\boldsymbol W}_{N}^{l}\otimes{\boldsymbol I}_{N_{t}}}\right)}}. \end{array} $$
((44))

Thus, to guarantee that rank(φ)=N r N t KL, we have the following proposition.

Proposition 1.

Under the assumption of NN t L, in each OFDM block, if the differences of two codewords are independent from different transmit antennas across N subcarriers, and if the differences from N t antennas in each subcarrier are not all zeros, then the upper bound on the maximum diversity in Theorem 1 can be achieved.

Proof.

According to the property of matrix product, the matrix \({{\Delta }_{k} {\boldsymbol W}_{N}^{l}\otimes {\boldsymbol I}_{N_{t}}}\phantom {\dot {i}\!}\) in (43) can be rewritten as

$$\begin{array}{@{}rcl@{}} {{\Delta}_{k} {\boldsymbol W}_{N}^{l}\otimes{\boldsymbol I}_{N_{t}}} = \text{diag}\left({\boldsymbol W}_{N}^{l}\right)\left({\Delta}_{k} {\boldsymbol 1}_{N \times 1}\otimes {\boldsymbol I}_{N_{t}}\right), \end{array} $$
((45))

where \({\Delta }_{k} {\boldsymbol 1}_{N \times 1}\otimes {\boldsymbol I}_{N_{t}}\phantom {\dot {i}\!}\) is an N×N t matrix, and the elements in its ith column represent the differences of two codewords from the ith antenna across N subcarriers in the kth OFDM block.

If in each OFDM block, the differences of two codewords are independent from different transmit antennas across N subcarriers (\({\text {range}\left ({\Delta }_{k}[\!n]\!\right)}\bigcap {\text {range}\left ({\Delta }_{k}[m]\!\right)} =0,\) 1≤n,mN,nm) and are not all zeros from N t antennas in each subcarrier (the elements in each row of matrix Δ k are not all zero), then matrix \({\Delta }_{k} {\boldsymbol 1}_{N \times 1}\otimes {\boldsymbol I}_{N_{t}}\phantom {\dot {i}\!}\) will have full column rank, i.e., \(\text {rank}\left ({\Delta }_{k} {\boldsymbol 1}_{N \times 1}\otimes {\boldsymbol I}_{N_{t}}\right) = N_{t}\phantom {\dot {i}\!}\). Hence, from (44) matrix φ has full column rank matrix with rank(φ)=N r N t KL.

Proposition 1 does not put any constraint on the difference of two codewords in one OFDM block and that in another OFDM block, leaving much freedom in code designs. In particular, one can design an STF code in one OFDM block as a scaled version of an STF code in another OFDM block. We call this special STF code a quasi-SF code, which can be expressed as

$$\begin{array}{@{}rcl@{}} {\boldsymbol C}^{\text{SF}} = {\mathbb{D}} \otimes {C^{\text{SF}}}, \end{array} $$
((46))

where \(\mathbb {D}\) is a K×K diagonal matrix, with non-zero elements, satisfying the energy constraint \(E||{\mathbb {D}}||_{F}^{2} = K\). Similar to a traditional SF code, the N×N t N matrix

$$\begin{array}{@{}rcl@{}} {C^{\text{SF}}}= \text{diag} \left[ {{\boldsymbol {c}}^{\text{SF}}[\!1]}, {{\boldsymbol {c}}^{\text{SF}}[\!2]}, \ldots, {{\boldsymbol {c}}^{\text{SF}}[N]} \right] \end{array} $$
((47))

denotes an STF code in one OFDM block, with the N t ×1 vector

$$\begin{array}{@{}rcl@{}} {{\boldsymbol {c}}^{\text{SF}}[\!n]}= \left[ {{c_{1}^{\text{SF}}[\!n]}}, {{c_{2}^{\text{SF}}[\!n]}}, \ldots, {{c_{N_{t}}^{\text{SF}}[\!n]}} \right], 1 \le n \le N, \end{array} $$
((48))

denoting the transmitted codes in the nth subcarriers across N t antennas.

In particular, the transmitted codes can be the same for all different OFDM blocks, i.e., \({\mathbb D} = {\boldsymbol I}_{K}\), leading to a special quasi-SF code that satisfies

$$\begin{array}{@{}rcl@{}} {\boldsymbol C}^{\text{SF}} = {{\boldsymbol {I}_{K}}} \otimes {C}^{\text{SF}}. \end{array} $$
((49))

From (49), we see that the resulting STF code is constructed by repeating an SF code K times over K OFDM blocks.

Because of special structure of quasi-SF code in (46), we have the following proposition in terms of designing quasi-SF code with maximum diversity.

Proposition 2.

Given NN t L, if the differences of two quasi-SF codewords are independent from different transmit antennas across N subcarriers, and if the differences are not all zeros from N t transmit antennas in each subcarrier, then the upper bound on the maximum diversity of quasi-SF-coded MIMO-OFDM system in Theorem 1 can be achieved.

Proof.

Because of the quasi-SF coding scheme in (46), we have

$$ {} \text{rank}\left({\boldsymbol \Delta }{{\boldsymbol I}_{K}} \otimes {\boldsymbol T}_{N,L}\otimes {{\boldsymbol I}_{N_{t}}}\right) = K\, \text{rank}\left({\Delta^{\text{SF}}}{\boldsymbol T}_{N,L}\otimes {{\boldsymbol I}_{N_{t}}}\right), $$
((50))

where \({\Delta ^{\text {SF}}} = {C^{\text {SF}}} - {{\tilde C}}^{\text {SF}}\).

Similar to (43) and (45), we have

$$ \begin{aligned} \text{rank}\left({\Delta^{\text{SF}}} {\boldsymbol T}_{N,L}\otimes {{\boldsymbol I}_{N_{t}}}\right) &= \sum_{l = 1}^{L}\text{rank}\left({\text{diag}\left({\boldsymbol W}_{N}^{l}\right)}\right.\\ &\left.{\quad\times\left({\Delta}^{\text{SF}} {\boldsymbol 1}_{N \times 1}\otimes {\boldsymbol I}_{N_{t}}\right)}\right). \end{aligned} $$
((51))

Given NN t L, if the differences of two quasi-SF codewords are independent from different transmit antennas across N subcarriers (\({\text {range}\left ({\Delta }^{\text {SF}}[\!n]\!\right)}\bigcap {\text {range}\left ({\Delta }^{\text {SF}}[\!m]\!\right)} = 0,\) 1≤n,mN,nm) and are not all zeros from N t transmit antennas in each subcarrier (the elements in each row of matrix Δ k are not all zero), \({\Delta }^{\text {SF}} {\boldsymbol 1}_{N \times 1}\otimes {\boldsymbol I}_{N_{t}}\) has full column rank, i.e., \(\text {rank}\left ({\Delta }^{\text {SF}} {\boldsymbol 1}_{N \times 1}\otimes {\boldsymbol I}_{N_{t}}\right) = N_{t}\). Hence, from (44), matrix φ has full column rank as well, i.e., rank(φ)=N r N t KL. Consequently, the upper bound on the matrix diversity in Theorem 1 is achieved.

We thus see that achievability of our upper bound on the maximum diversity of MIMO-OFDM system in Theorem 1 is independent of the fading channel correlation. By employing an appropriate STF code design, we can achieve the maximum diversity of our MIMO-OFDM systems. There are two design requirements: (1) In each OFDM block, the differences of two codewords should be independent from different transmit antennas and (2) in each subcarrier, the differences of two codewords are not all zeros. In addition, compared with a quasi-SF code satisfying the conditions of Proposition 2, to achieve our upper bound on the maximum diversity in Theorem 1, STF code schemes in general can be different across OFDM blocks.

4.1 STF code examples

We now give two STF code examples: one STF code and another quasi-SF code that meet the conditions of our Propositions 1 and 2, respectively, to prove the achievability of our upper bound on the maximum diversity of block-fading MIMO-OFDM system with arbitrary correlations.

For simplicity, we assume that there are two OFDM blocks with two subcarriers in a 2×2 MIMO system. The number of path between the transmitter and the receiver is only one to guarantee NN t L. The modulation scheme is BPSK.

4.2 Example 1: a traditional STF code

In this case, an STF code scheme in one OFDM block is designed as an Alamouti code [24]. And in these two OFDM blocks, different codewords are transmitted, i.e.,

$$\begin{array}{@{}rcl@{}} C_{1} {\boldsymbol 1}_{2 \times 1} \otimes {\boldsymbol I}_{2} = \left[ {\begin{array}{cc} {x_{1}} & {x_{2}} \\ -x_{2}^{*} & x_{1}^{*} \end{array}} \right] \end{array} $$
((52))

and

$$\begin{array}{@{}rcl@{}} C_{2} {\boldsymbol 1}_{2 \times 1} \otimes {\boldsymbol I}_{2} = \left[ {\begin{array}{cc} {x_{3}} & {x_{4}} \\ -x_{4}^{*} & x_{3}^{*} \end{array}} \right], \end{array} $$
((53))

with C k , k=1,2, given in (3). Because the structure of this STF code satisfies the conditions of Proposition 1, rank(Δ k 1 2×1I 2)=N t =2, k=1,2.

Thus, one STF codeword in (3) can be expressed as

$$\begin{array}{@{}rcl@{}} {\boldsymbol C} = \left[ {\begin{array}{cccccccc} {x_{1}} & {x_{2}} & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & -x_{2}^{*} & x_{1}^{*} & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & {x_{3}} & {x_{4}} & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & -x_{4}^{*} & x_{3}^{*} \\ \end{array}} \right]. \end{array} $$
((54))

The corresponding matrix φ can be rewritten as

$${\kern-16.3pt} {\fontsize{7.2}{7.2}{\begin{aligned} {\boldsymbol \varphi} &=\\ &\left[\!\! {\begin{array}{cccc} {(W)_{2}}(x_{1} - {\tilde x}_{1}) & {(W)_{2}}(x_{2} - {\tilde x}_{2}) & 0 & 0 \\ -{(W)_{2}^{2}}(x_{2}^{*}- {\tilde x}_{2}^{*}) & {(W)_{2}^{2}}(x_{1}^{*}- {\tilde x}_{1}^{*}) & 0 & 0 \\ 0 & 0 & {(W)_{2}}(x_{3} - {\tilde x}_{3}) & {(W)_{2}}(x_{4} - {\tilde x}_{4}) \\ 0 & 0 & -{(W)_{2}^{2}}(x_{4}^{*}- {\tilde x}_{4}^{*}) & {(W)_{2}^{2}}(x_{3}^{*}- {\tilde x}_{3}^{*}) \\ \end{array}}\! \right]\otimes {{\boldsymbol I}_{2}}, \end{aligned}}} $$
((55))

where \({\tilde x}_{i}\), 1≤i≤4, denotes the detected code. The φ matrix has full column rank, i.e., rank(φ)=N r N t KL=8. So using this STF code, the upper bound on maximum diversity in Theorem 1 can be achieved. In addition, the code rate in this case is 1 bits/s/Hz.

4.3 Example 2: a quasi-SF code

In this case, an STF code is constructed by repeating an SF code twice over two OFDM blocks, and the SF code again is based on the Alamouti scheme, i.e.,

$$\begin{array}{@{}rcl@{}} C^{\text{SF}} {\boldsymbol 1}_{2 \times 1} \otimes {\boldsymbol I}_{2} = \left[ {\begin{array}{cc} {x_{1}} & {x_{2}} \\ -{x_{2}^{*}} & {x_{1}^{*}} \end{array}} \right], \end{array} $$
((56))

with C SF given in (49). This quasi-SF code satisfies the conditions of Proposition 2, i.e., rank(Δ SF 1 2×1I 2)=N t =2.

Hence, a quasi-SF code C SF in (49) can be written as

$$\begin{array}{@{}rcl@{}} {\boldsymbol C} = \left[ {\begin{array}{cccccccc} {x_{1}} & {x_{2}} & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & -{x_{2}^{*}} & {x_{1}^{*}} & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & {x_{1}} & {x_{2}} & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & -{x_{2}^{*}} & {x_{1}^{*}} \\ \end{array}} \right], \end{array} $$
((57))

and the corresponding φ matrix is

$${} {\fontsize{7.2}{7.2}{\begin{aligned} {\boldsymbol \varphi} &=\\ &\left[\!\!\! {\begin{array}{cccc} {(W)_{2}}(x_{1} - {\tilde x}_{1}) & {(W)_{2}}(x_{2} - {\tilde x}_{2}) & 0 & 0 \\ -{(W)_{2}^{2}}(x_{2}^{*}- {\tilde x}_{2}^{*}) & {(W)_{2}^{2}}(x_{1}^{*}- {\tilde x}_{1}^{*}) & 0 & 0 \\ 0 & 0 & {(W)_{2}}(x_{1} - {\tilde x}_{1}) & {(W)_{2}}(x_{2} - {\tilde x}_{2}) \\ 0 & 0 & -{(W)_{2}^{2}}(x_{2}^{*}- {\tilde x}_{2}^{*}) & {(W)_{2}^{2}}(x_{1}^{*}- {\tilde x}_{1}^{*}) \\ \end{array}}\! \right]\otimes {{\boldsymbol I}_{2}}, \end{aligned}}} $$
((58))

where \({\tilde x}_{1}\) and \({\tilde x}_{2}\) are the detected codes of x 1 and x 2, respectively. The matrix φ has full column rank with rank(φ)=N r N t KL=8. Therefore, the upper bound on the maximum diversity of MIMO-OFDM system with general correlation in Theorem 1 can also be achieved by using this special quasi-SF code, whose rate is 1/2 bits/s/Hz. Compared to the STF code in Example 1, because this quasi-SF code is constructed by repeating an SF code over different OFDM blocks, it does not utilize the temporal resource, hence has lower code rate.

Note that this quasi-SF code was used to achieve an upper bound on the maximum diversity of MIMO-OFDM system with independent fading channels in the spatial domain in [25]. Here, we show that the same code achieves the maximum diversity for MIMO-OFDM systems with general spatial, temporal, and frequency/path correlations.

5 Decoding complexity

We know that if the ML decoder at the receiver chooses K complex information symbols x k and the modulation scheme is q-PSK, the decoding complexity cannot exceed q K metric computations.

With quasistatic and frequency-flat i.i.d fading channels, using orthogonal space-time block code (OSTBC), the decoding complexity is linear. However, on time-varying and frequency-selective channels, the STF block codes lose their reduced complexity decoding. We assume the fading channels are independent identically distributed with K OFDM blocks and N subcarriers. If we use STF block codes (such as example 1), the decoding complexity in each group is linear and the decoding complexity of these KN groups is q KL, in which L is the number of separated multipaths. If the fading channels are correlated, the decoding complexity will be further increased.

In one OFDM block and one path, the received signal could be expressed as follows:

$$\begin{array}{@{}rcl@{}} {\bar{\boldsymbol y}} = {\bar{\boldsymbol H}} {\bar{\boldsymbol x}}, \end{array} $$
((59))

where N r ×N t matrix \({\bar {\boldsymbol H}}\) denotes the fading channels, N t ×1 vector \({\bar {\boldsymbol x}}\) denotes the transmitted signals, and N r ×1 vector \({\bar {\boldsymbol y}}\) is the received signals.

A QR decomposition of the matrix \({\bar {\boldsymbol H}}\) can be obtained by applying the Gram-Schmidt procedure to the columns of \({\bar {\boldsymbol H}} =\, [{\bar {\boldsymbol h}}_{1}, \ldots, {\bar {\boldsymbol h}}_{N_{t}}]\) to obtain \({\bar {\boldsymbol H}} = {\bar {\boldsymbol Q}}{\bar {\boldsymbol R}}\), where the columns of \({\bar {\boldsymbol Q}} =\, [{\bar {\boldsymbol q}}_{1}, \ldots, {\bar {\boldsymbol q}}_{N_{t}}]\) are an orthonormal basis for the subspace spanned by \({\bar {\boldsymbol H}}\), and \(\bar {\boldsymbol R}\) is upper triangular with nonnegative real diagonal elements. In [26], using OSTBC, the decoding complexity is q M, where \(M = \frac {N_{R} + 2N_{C}}{4}\), N R is the number of off-diagonal elements of \(\bar {\boldsymbol R}\) which are equal to a real number but not zeros, and N C is the number of off-diagonal elements of \(\bar {\boldsymbol R}\) which are equal to a complex number but not zeros.

Then, in one OFDM block with frequency-selective correlated fading channels, the received signal could be expressed as follows:

$$\begin{array}{@{}rcl@{}} \left[ {\begin{array}{c} {{{\bar {\boldsymbol y}}_{1}}}\\ {{{\bar {\boldsymbol y}}_{2}}}\\ \vdots \\ {{{\bar {\boldsymbol y}}_{N}}} \end{array}} \right] = \left[ {\begin{array}{cccc} {{{\bar {\boldsymbol H}}_{11}}}&{{{\bar {\boldsymbol H}}_{12}}}& \ldots &{{{\bar {\boldsymbol H}}_{1N}}}\\ {{{\bar {\boldsymbol H}}_{21}}}&{{{\bar {\boldsymbol H}}_{22}}}& \ldots &{{{\bar {\boldsymbol H}}_{2N}}}\\ \vdots & \vdots & \ddots & \vdots \\ {{{\bar {\boldsymbol H}}_{N1}}}&{{{\bar {\boldsymbol H}}_{N2}}}& \ldots &{{{\bar {\boldsymbol H}}_{NN}}} \end{array}} \right]\left[ {\begin{array}{c} {{{\bar {\boldsymbol x}}_{1}}}\\ {{{\bar {\boldsymbol x}}_{2}}}\\ \vdots \\ {{{\bar {\boldsymbol x}}_{N}}} \end{array}} \right]. \end{array} $$
((60))

Similarly, the fading channel matrix could be decomposed to QR, i.e.,

$${} {\fontsize{8}{8}{\begin{aligned} \left[\! {\begin{array}{cccc} {{{\bar {\boldsymbol H}}_{11}}}&{{{\bar {\boldsymbol H}}_{12}}}& \ldots &{{{\bar {\boldsymbol H}}_{1N}}}\\ {{{\bar {\boldsymbol H}}_{21}}}&{{{\bar {\boldsymbol H}}_{22}}}& \ldots &{{{\bar {\boldsymbol H}}_{2N}}}\\ \vdots & \vdots & \ddots & \vdots \\ {{{\bar {\boldsymbol H}}_{N1}}}&{{{\bar {\boldsymbol H}}_{N2}}}& \ldots &{{{\bar {\boldsymbol H}}_{NN}}} \end{array}}\! \right] = \left[\! {\begin{array}{cccc} {{{\bar {\boldsymbol Q}}_{11}}}&{{{\bar {\boldsymbol Q}}_{12}}}& \ldots &{{{\bar {\boldsymbol Q}}_{1N}}}\\ {{{\bar {\boldsymbol Q}}_{21}}}&{{{\bar {\boldsymbol Q}}_{22}}}& \ldots &{{{\bar {\boldsymbol Q}}_{2N}}}\\ \vdots & \vdots & \ddots & \vdots \\ {{{\bar {\boldsymbol Q}}_{N1}}}&{{{\bar {\boldsymbol Q}}_{N2}}}& \ldots &{{{\bar {\boldsymbol Q}}_{NN}}} \end{array}}\! \right]\!\!\!\left[\! {\begin{array}{cccc} {{{\bar {\boldsymbol R}}_{11}}}& \ldots &{{{\bar {\boldsymbol R}}_{1L}}}&0\\ 0& \ddots & \vdots &0\\ \vdots & \ddots &{{{\bar {\boldsymbol R}}_{LL}}}& \vdots \\ 0& \ldots &0&0 \end{array}}\! \right] \end{aligned}}} $$
((61))

Using STF block codes, if the L multipaths are independent, the decoding complexity is \(q^{\sum \limits _{i = 1}^{L} M_{i}}\), where \(M_{i} = \frac {N_{R}(i) + 2N_{C}(i)}{4}\), N R (i) is the number of off-diagonal elements of \(\bar {\boldsymbol R}_{\textit {ii}}\) which are equal to a real number but not zeros, and N C (i) is the number of off-diagonal elements of \(\bar {\boldsymbol R}_{\textit {ii}}\) which are equal to a complex number but not zeros. Otherwise, if there exists the path correlation, the decoding complexity is \(q^{\sum \limits _{i,j = 1}^{i,j = L}{M_{\textit {ij}}}}\), where \(q^{M_{\textit {ij}}}\phantom {\dot {i}\!}\) is the complexity through the fading channels corresponding to one sub-matrix \({{{\bar {\boldsymbol H}}_{\textit {ij}}}}\).

Further, in the correlated frequency-selective fading channels with the K independent OFDM blocks, the fading channels and the transmitted codes are divided into these K groups and in each groups using STF block codes, the decoding complexity is \(q^{\sum \limits _{i,j = 1}^{i,j = L}{M_{\textit {ij}}}}\). Because the codes of different groups are not orthometric, the decoding complexity of the K independent OFDM blocks should be \(q^{K\sum \limits _{i,j = 1}^{i,j = L}{M_{\textit {ij}}}}\). If there exists temporal correlation of the fading channels, the complexity will be increased and the calculation is similar to the one of correlated paths.

6 Special cases

We simplify our general diversity result for some special correlation scenarios of practical interests. In the process, we identify the newly developed statistical channel models for MIMO LTE [20] and 802.11n [21] as special cases by showing that our theoretical diversity results match those simulated from these statistical channel models; we also recover several existing results in the literature. Note that in this section, the bit error rate (BER) performance of uncoded Raleigh channel with the diversity is depicted by the function berfading in Matlab. Expanding the Alamouti code to a standard STF code by tensor product, we use these STF codes in the simulation as the one in example 1 last section.

6.1 Case 1: STF-coded MIMO-OFDM system over temporal, frequency, and separable spatial correlation channels

If the scatters between the transmitter and the receiver are scarce, we can consider that the spatial correlation follows the Kronecker model [27], i.e., the transmitted correlation is independent of the received correlation. In the lth path, the spatial correlation can be expressed as

$$\begin{array}{@{}rcl@{}} {{\boldsymbol R}^{\boldsymbol S}_{l,l}} = {{\boldsymbol R}^{\boldsymbol {N_{t}}}_{l,l} }\otimes {{{\boldsymbol R}^{\boldsymbol {N_{r}}}_{l,l} }}, \end{array} $$
((62))

with transmitted correlation \({\boldsymbol R}^{\boldsymbol {N_{t}}}_{l,l}\) and received correlation \({\boldsymbol R}^{\boldsymbol {N_{r}}}_{l,l}\). If the temporal correlation of the fading channels are independent of the spatial correlation and the multipath correlation, according to Theorem 1, the maximum achievable diversity of MIMO-OFDM system in this case is

$${} {\small{\begin{aligned} {\text{rank}\left({{\boldsymbol R}}\right)} &=\text{rank}\left({\boldsymbol R}^{\boldsymbol K}\right) \left(\sum_{l = 1}^{L}\sum_{d=1}^{L}\left(\text{rank}\left(r^{L}_{l,d} {\boldsymbol R}^{\boldsymbol{N_{t}}}_{l,d} \right)\text{rank}\left(r^{L}_{l,d} {\boldsymbol R}^{\boldsymbol{N_{r}}}_{l,d} \right) \right.\right. \\[-4pt] &\left.\quad\!\!+~\text{{null}}\left(r^{L}_{l,d} {\boldsymbol R}^{\boldsymbol{N_{t}}}_{l,d}\otimes {\boldsymbol R}^{\boldsymbol{N_{r}}}_{l,d} \right)\!\right)\!+\sum_{l = 1}^{L}{\left(\!\text{{null}}\left(\!{\boldsymbol r}_{l}^{\mathrm{T}} \right)-\text{{null}}\left({{\boldsymbol r}_{l}} \right)\!\right)} \\[-4pt] &\quad\left.\!\!-~\text{{null}}\left({\left[\! {\boldsymbol r}_{1}^{\mathrm{T}}, \ldots, {\boldsymbol r}_{L}^{\mathrm{T}}\right]}\right){\vphantom{\sum_{l = 1}^{L}}}\!\right)\!. \end{aligned}}} $$
((63))

For illustrating the influence of the space, time, frequency dimensions, and their correlations into the system performance, Fig. 1 depicts the BER performances of seven different systems.

Fig. 1
figure 1

BER performances of seven different systems

From Fig. 1, the BER performance of 2×2 MIMO system with two blocks and three paths are better than the ones of 2×2 MIMO system, 2×2 MIMO system with three paths, and 2×2 MIMO system with two blocks. It illustrates that the effect of reducing one dimension on the BER performance is higher than the effect of correlation of one dimension. Furthermore, compared to the system with two separated blocks or with three separated paths, the diversities of the system with two correlated blocks or with three correlated paths are decreased, respectively. That explains the effect of temporal and multipath correlations on the performance. In addition, the BER performance of 2×2 MIMO system with two blocks, three paths, and separated multipath and temporal correlations is better than the one with the completely correlated fading channels. That illustrates that the correlation between two dimensions could lead to reduce the performance.

Now, we show that this model is consistent with that in MIMO LTE systems [20]. Take LTE Extended Vehicular A (EVA) model for example [20]. It is a 2×2 LTE MIMO system with nine paths. And we use high spatial correlation, i.e.,

$$\begin{array}{@{}rcl@{}} {\boldsymbol R}^{\boldsymbol {N_{r}}}_{l,d} = {\boldsymbol R}^{\boldsymbol {N_{t}}}_{l,d} = \left[ {\begin{array}{cc} 1 & 0.9 \\ 0.9 & 1 \end{array}} \right]. \end{array} $$
((64))

Figure 2 plots the simulated BER performance of the STF-coded 2×2 LTE-EVA system with high spatial correlation. In the simulation, there is one OFDM block and the modulation scheme is quadrature phase shift keying (QPSK). Because there exists the path correlation in LTE-EVA model, though matrices \({\boldsymbol R}^{\boldsymbol {N_{t}}}_{l,d}\) and \({\boldsymbol R}^{\boldsymbol {N_{r}}}_{l,d}\) have full ranks, from Fig. 2, the diversity of this model is less than the number of nine multipaths that is caused by the path correlation and the correlation between the space and the path, i.e. \(\sum _{l = 1}^{L}{\left (\text {{null}}\left ({\boldsymbol r}_{l}^{\mathrm {T}} \right)-\text {{null}}\left ({{\boldsymbol r}_{l}} \right)\right)} - \text {{null}}\left ({\left [ {\boldsymbol r}_{1}^{\mathrm {T}}, \ldots, {\boldsymbol r}_{L}^{\mathrm {T}}\right ]}\right)\) in (63).

Fig. 2
figure 2

BER comparisons between the STF-coded 2×2 LTE-EVA system with high spatial correlation

We further let these nine multipaths be separated, and other parameters be the same with the one of LTE-EVA model. Without multipath correlation, from (63), the achievable maximum diversity can be simplified as

$$\begin{array}{@{}rcl@{}} \text{rank}\left({\boldsymbol R} \right) = \sum_{l=1}^{L}\left(\text{rank}\left({{{\boldsymbol R}^{\boldsymbol {N_{t}}}_{l,l}}} \right)\text{rank}\left({{{\boldsymbol R}^{\boldsymbol {N_{r}}}_{l,l}}} \right)\right). \end{array} $$
((65))

Both covariance matrices \({\boldsymbol R}^{\boldsymbol {N_{t}}}_{l,l}\) and \({\boldsymbol R}^{\boldsymbol {N_{r}}}_{l,l}\) have rank two. From (65), the theoretical achievable maximum diversity in this case is 36. It is seen from Fig. 2 that when compared to the BER performance of uncoded Rayleigh fading channels with diversity order 36, the slope of the curve depicting the performance of the 2×2 LTE system with nine separable paths is the same (when SNR is larger than 8, the BER of the LTE system with nine separated multipaths is too small to obtain by Matlab). Thus, the diversity of the 2×2 LTE system is also 36, which matches the theoretical result.

In the same way, in 802.11n criterion, the model is still followed this case. Compared with the LTE system, in 802.11n, spatial correlation of fading channels is caused by a mean angle of arrival (AoA), a mean angle of departure (AoD), and so forth. Furthermore, the IEEE 802.11n channel models make the following assumptions: (1) each tap is modeled independently; (2) the spatial correlation and temporal correlation for each tap are modeled independently; and (3) each tap is modeled using the Kronecker model for the fading channels [21]. In this paper, we only focus on the rank of spatial correlated matrix, but not the influenced factors of the spatial correlation. Thus, the maximum achievable diversity of MIMO-OFDM system in this case is similar to case 1,

$${} {\small{\begin{aligned} {\text{rank}\left({{\boldsymbol R}}\right)} &=\text{rank}\left({\boldsymbol R}^{\boldsymbol K}\right) \left(\sum_{l = 1}^{L}\sum_{d=1}^{L}\left(\text{rank}\left(r^{L}_{l,d} {\boldsymbol R}^{\boldsymbol{N_{t}}}_{l,d} \right)\text{rank}\left(r^{L}_{l,d} {\boldsymbol R}^{\boldsymbol{N_{r}}}_{l,d} \right) \right.\right.\\[-2pt] &\;\;\left. +~\text{{null}}\left(r^{L}_{l,d} {\boldsymbol R}^{\boldsymbol{N_{t}}}_{l,d}\otimes {\boldsymbol R}^{\boldsymbol{N_{r}}}_{l,d} \right)\!{\vphantom{\sum_{d=1}^{L}}}\!\right)+\!\sum_{l = 1}^{L}{\left(\!\text{{null}}\left(\!{\boldsymbol r}_{l}^{\mathrm{T}} \right)-\text{{null}}\left({{\boldsymbol r}_{l}} \right)\!\right)} \\[-2pt] &\;\;\left.-~\text{{null}}\left({\left[ {\boldsymbol r}_{1}^{\mathrm{T}}, \ldots, {\boldsymbol r}_{L}^{\mathrm{T}} \right]}\right)\!\!{\vphantom{\sum_{l = 1}^{L}}}\right). \end{aligned}}} $$
((66))

Take 802.11n channel model B for example [21]. It is a 2×2 MIMO system with nine Rayleigh-fading paths which have a bell Doppler spectrum.

Figure 3 depicts the simulated BER performance of this model. In this simulation, there is one OFDM block and the modulation scheme is QPSK. The diversity of 802.11n with channel model B is a little larger than 4. This result is caused by the path correlation, spatial correlation, and the correlation between these two dimensions.

Fig. 3
figure 3

BER comparisons between the STF-coded 2×2 LTE system with four separable paths

Further, we let these nine multipaths be separable and the transmitted correlation \({\boldsymbol R}^{\boldsymbol N_{t}}_{1,1}\) and the received correlation \({\boldsymbol R}^{\boldsymbol N_{r}}_{1,1}\) be the same with the standard 802.11n channel model B.

From Fig. 3, the BER performance is much better than the one of 802.11n channel model B. The reason is that under the conditions of separable multipaths, the achievable diversity is linearly increased by the number of multipaths. That is consistent with the theoretical result.

6.2 Case 2: STF-coded MIMO-OFDM system over frequency and temporal correlation channels

If the antenna elements at both the transmitter and the receiver are well separated and scatterers are abundant, then no spatial correlation exists between the different transmit and receive antenna pairs (i.e., independent spatial fading channels). Hence,

$${} {\small{\begin{aligned} & \text{rank} \left({\boldsymbol R}^{\boldsymbol {S}}[\!l,d]\right) = {N_{r}} {N_{t}}, 1 \le l, d \le L,\\ & \text{{null}}{\left[\! {{\boldsymbol r}_{1}}^{\mathrm{T}}, {{\boldsymbol r}_{2}}^{\mathrm{T}}, \ldots, {{\boldsymbol r}_{L}}^{\mathrm{T}}\right]} -\! \sum_{l = 1}^{L}{\text{{null}}\!\left(\!{{\boldsymbol r}_{l}}^{\mathrm{T}} \right)} = {N_{r}} {N_{t}} \left(L -\text{rank}\!\left(\!{\boldsymbol R^{L}}\right)\!\right),\\ &\text{range}\left(r^{L}_{l,d}{\boldsymbol R}^{\boldsymbol {S}}_{l,d}\right) \subseteq \text{range}\left({\boldsymbol R}^{\boldsymbol {S}}_{l,l}\right), 1 \le l, d \le L. \end{aligned}}} $$
((67))

Therefore, the maximum achievable diversity of MIMO-OFDM system without the spatial correlation is

$$\begin{array}{@{}rcl@{}} \text{rank}\left({{{\boldsymbol R} }} \right) = {N_{r}}{N_{t}}\text{rank}\left({{{\boldsymbol R}^{\boldsymbol K}}} \right)\text{rank}\left({{{\boldsymbol R}^{\boldsymbol L} }} \right), \end{array} $$
((68))

which agrees with [25].

6.3 Case 3: SF-coded MIMO-OFDM system over only spatial correlation channels

SF coding, as a special case of STF coding, is done across multiple antennas and all subcarriers within one OFDM block period. In this case, the fading channels are independent in temporal and frequency domains, leading to

$${} \begin{aligned} & {\boldsymbol R}^{\boldsymbol K} = 1,\\ & \text{rank}\left(r^{L}_{l,d}{\boldsymbol R}^{\boldsymbol {S}}_{l,d}\right) = 0, 1 \le l, d \le L, l \ne d,\\ & \text{range}\left({{\boldsymbol r}_{l}}^{\mathrm{T}}\right) \cap \text{range} \left({{\boldsymbol r}_{l'}}^{\mathrm{T}}\right) = \{0\}, 1 \le l, l' \le L, l \ne l', \\ & \text{range} \left(r^{L}_{l,l}{\boldsymbol R}^{\boldsymbol {S}}_{l,l}\right) \cap \text{range} \left(r^{L}_{l,d}{\boldsymbol R}^{\boldsymbol {S}}_{l,d}\!\right) = \left\{ 0 \right\}, 1 \!\le l, d\! \le L, l\! \ne d. \end{aligned} $$
((69))

We substitute rank(R K)=1 into the result of case 1 and have

$$\begin{array}{@{}rcl@{}} \text{rank}\left({{{\boldsymbol R} }} \right) = \sum \limits_{l=1}^{L}\text{rank}\left({{{\boldsymbol R}^{\boldsymbol S}{[\!l,l]} }} \right), \end{array} $$
((70))

which is consistent with the result in [16]. It indicates that the diversity order of the system is equal to the number of degrees of freedom offered by independent scatterers.

Figure 4 depicts the BER performances of 2×2 MIMO system with five separable multipaths under different spaces of transmit and receive antennas. From Fig. 4, with the space increasing, the more spatial diversities are obtained. When the spaces of transmit and receive antennas are one (normalized by the wavelength), the diversity is 20 which illustrates that the full space diversity 4 is achieved in each path.

Fig. 4
figure 4

BER performance of 2×2 system with five paths under different antenna spaces

6.4 Case 4: SF-coded MIMO-OFDM system over only frequency correlation channels

In this case, there is only one OFDM block period (K = 1) and the fading channels in the spatial domain are independent. Hence,

$${} {\small{\begin{aligned} & {\boldsymbol R}^{\boldsymbol K} = 1,\\ & \text{rank} \left({\boldsymbol R}^{\boldsymbol {S}}[\!l,d]\right) = {N_{r}} {N_{t}}, 1 \le l, d \le L,\\ & \text{{null}}{\left[ {{\boldsymbol r}_{1}}^{\mathrm{T}}, {{\boldsymbol r}_{2}}^{\mathrm{T}}, \ldots, {{\boldsymbol r}_{L}}^{\mathrm{T}} \right]} -\! \sum_{l = 1}^{L}{\!\text{{null}} \left(\!{{\boldsymbol r}_{l}}^{\mathrm{T}}\! \right)} = {N_{r}} {N_{t}}\! \left(L -\text{rank}\left({\boldsymbol R^{L}}\right)\!\right),\\ &\text{range}\left(r^{L}_{l,d}{\boldsymbol R}^{\boldsymbol {S}}_{l,d}\right) \subseteq \text{range}\left({\boldsymbol R}^{\boldsymbol {S}}_{l,l}\right), 1 \le l, d \le L. \end{aligned}}} $$
((71))

The maximum achievable diversity of SF-coded MIMO-OFDM system over frequency correlation channels is

$$\begin{array}{@{}rcl@{}} \text{rank}\left({{{\boldsymbol R} }} \right)= {N_{t}}{N_{r}}{\text{rank}\left({{{\boldsymbol R}^{\boldsymbol L} }} \right)}, \end{array} $$
((72))

which agrees with [28].

6.5 Case 5: ST-coded MIMO system over only temporal correlation channels

Without considering the multipath effect and the spatial correlation, the system simplifies to ST-coded MIMO system over temporally correlated channels. Consequently, we have

$$\begin{array}{*{20}l} & {\boldsymbol R}^{\boldsymbol L} = 1,\\ & \text{rank}\left({\boldsymbol R}^{\boldsymbol S}{[\!l,d]}\right) = {N_{r}}{N_{t}}, l = d =1, \end{array} $$
((73))

Because of only one path during the transmitter and the receiver, the covariance matrix E{A A H} in (27) degrades to a tensor product of a temporal channel covariance matrix R K and a spatial channel covariance matrix \({{{\boldsymbol R}_{1,1}^{\boldsymbol S} }}\).

Hence, the maximum achievable diversity for ST-coded MIMO system with temporal correlation is given by

$$\begin{array}{@{}rcl@{}} \text{rank}\left({{{\boldsymbol R}}} \right) = {N_{r}}{N_{t}}{\text{rank}\left({{{\boldsymbol R}^{\boldsymbol K} }} \right)}, \end{array} $$
((74))

which was given in [7].

Figure 5 plots the BER performances of 2×2 MIMO system with five correlated OFDM blocks under different Doppler shifts. From Fig. 5, with the Doppler shift decreasing, the more path diversities are obtained. Without Doppler shift, the diversity is 20 which explains that the full time diversity 5 is achieved.

Fig. 5
figure 5

BER performance of 2×2 system with five OFDM blocks under different Doppler shifts

6.6 Case 6: ST-coded MIMO system over spatial correlation channels with Kronecker model

In this case, the spatial correlation of flat-fading channels follows a Kronecker model [27]. Since the multipath effect and Doppler shift are not considered in this case, both the temporal and frequency covariance matrices R K and R L degenerate, leading to

$$\begin{array}{*{20}l} & {\boldsymbol R}^{\boldsymbol L} = 1,\\ & {\boldsymbol R}^{\boldsymbol K} = 1,\\ & E\left\{{\boldsymbol A}{\boldsymbol A }^{\mathrm{H}}\right\} = {\boldsymbol R}^{\boldsymbol S}_{1,1} = {{\boldsymbol R}^{\boldsymbol {N_{t}}} }\otimes {{{\boldsymbol R}^{\boldsymbol {N_{r}}} }}. \end{array} $$
((75))

The maximum diversity of MIMO fading channels with separable spatial correlation is hence given by

$$\begin{array}{@{}rcl@{}} \text{rank}\left({{{\boldsymbol R}}} \right) = \text{rank}\left({{{\boldsymbol R}^{\boldsymbol {N_{t}}} }} \right)\text{rank}\left({{{\boldsymbol R}^{\boldsymbol {N_{r}}} }} \right), \end{array} $$
((76))

which is the result obtained in [27].

6.7 Case 7: independent fading channels

In this case, there is no correlation among channel fades in the space, time, and frequency/path domains. Consequently, we have

$${} {\small{\begin{aligned} & \text{rank}\left({\boldsymbol R}^{\boldsymbol S}_{l,d}\right) = {N_{r}}{N_{t}}, 1 \le l, d \le L,\\ & \text{rank}\left({\boldsymbol R}^{\boldsymbol K}\right) = K, \\ & \text{rank}\left({\boldsymbol R}^{\boldsymbol L}\right) = L,\\ & \text{rank}\left(r^{L}_{l,d}{\boldsymbol R}^{\boldsymbol {S}}_{l,d}\right) = 0, 1 \le l, d \le L, l \ne d,\\ & \text{range}\left({{\boldsymbol r}_{l}}^{\mathrm{T}}\right) \cap \text{range} \left({{\boldsymbol r}_{l'}}^{\mathrm{T}}\right) = \{0\}, 1 \le l, l' \le L, l \ne l', \\ & \text{range} \left(r^{L}_{l,l}{\boldsymbol R}^{\boldsymbol {S}}_{l,l}\right) \cap \text{range} \left(r^{L}_{l,d}{\boldsymbol R}^{\boldsymbol {S}}_{l,d}\right) = \left\{ 0 \right\}, 1 \le l, d \le L, l \ne d. \end{aligned}}} $$
((77))

According to Theorem 1, we are able to achieve the total number of independent degrees of freedom inherent in the physical structure of the system, which is given by

$$\begin{array}{@{}rcl@{}} \text{rank}\left({{{\boldsymbol R} }} \right) = {N_{t}}{N_{r}}KL, \end{array} $$
((78))

which is consistent with the result in [29].

7 Conclusions

In this paper, we have studied the performance of STF-coded MIMO-OFDM system with arbitrary spatial, temporal, and frequency/path correlations. Our analysis is based on a general transmitted correlation model that goes beyond limitations of ideal assumptions such as quasi-static or rapid fading channels, channel independence in different antennas, and separable multipaths between the transmitter and the receiver. Our channel spatial correlation covers both the Kronecker and non-Kronecker models. We derive an upper bound on the maximum achievable diversity of this system using Hadamard and tensor products. Based on rank properties of block matrices, we also analyze the effect of the general channel correlation on the performance of block-fading MIMO-OFDM systems. Furthermore, achievability of our upper bound is proved via two code design examples: one traditional STF code and another quasi-SF code. The decoding complexity is considered in the MIMO system with arbitrary correlated fading channels using the traditional STF code. By identifying the newly developed statistical channel models for MIMO LTE and 802.11n as special cases of our STF-coded MIMO-OFDM system, we can directly use our theoretical diversity results without resorting to simulations. Finally, our theoretical result for general correlation scenarios subsumes those in the existing literature that only deal with different special cases.

8 Appendix A: proof of Lemma 1

Proof.

According to the property of the rank and nullity of a matrix [30], we have

$${} \begin{aligned} &\text{rank}\left({\boldsymbol B}_{p}\right) + \text{{null}} \left({\boldsymbol B}_{p}\right) = b_{p}, 1\le p \le P,\\ &\text{rank}\left({\boldsymbol D}_{q}^{\mathrm{T}}\right) + \text{{null}} \left({\boldsymbol D}_{q}^{\mathrm{T}}\right) = d_{q}, 1 \le q \le Q,\\ &\text{rank}\left[ {\boldsymbol B}_{1}, {\boldsymbol B}_{2}, \ldots, {\boldsymbol B}_{P}\right] + \text{{null}} \left[ {\boldsymbol B}_{1}, {\boldsymbol B}_{2}, \ldots, {\boldsymbol B}_{P} \right] = \sum_{p = 1}^{P} {b_{p}}.\\ &\text{rank} \left[\! {{\boldsymbol D}_{1}^{\mathrm{T}}}, {{\boldsymbol D}_{2}^{\mathrm{T}}}, \ldots, {{\boldsymbol D}_{Q}^{\mathrm{T}}} \right] + \text{{null}} \left[ {{\boldsymbol D}_{1}^{\mathrm{T}}}, {{\boldsymbol D}_{2}^{\mathrm{T}}}, \ldots, {{\boldsymbol D}_{Q}^{\mathrm{T}}} \right] =\! \sum_{q = 1}^{Q}\!{d_{q}}. \end{aligned} $$
((79))

Thus, the rank of block matrices [B 1,B 2,…,B P ] and \(\left [ {{\boldsymbol D}_{1}^{\mathrm {T}}}, {{\boldsymbol D}_{2}^{\mathrm {T}}}, \ldots, {{\boldsymbol D}_{Q}^{\mathrm {T}}} \right ]^{\mathrm {T}}\) can be calculated, respectively, as

$$ \begin{aligned} &\text{rank}\left[ {\boldsymbol B}_{1}, {\boldsymbol B}_{2}, \ldots, {\boldsymbol B}_{P} \right] \\ &= \sum_{p = 1}^{P} {b_{p}}- \text{{null}} \left[ {\boldsymbol B}_{1}, {\boldsymbol B}_{2}, \ldots, {\boldsymbol B}_{P}\right]\\ &=\sum_{p = 1}^{P}\left({\text{rank}\left({\boldsymbol B}_{p} \right)}+{\text{{null}}\left({\boldsymbol B}_{p} \right)}\right) - \text{{null}} \left[ {\boldsymbol B}_{1}, {\boldsymbol B}_{2}, \ldots, {\boldsymbol B}_{P}\right] \end{aligned} $$
((80))

and

$${} \begin{aligned} &\text{rank}\left[ {{\boldsymbol D}_{1}^{\mathrm{T}}}, {{\boldsymbol D}_{2}^{\mathrm{T}}}, \ldots, {{\boldsymbol D}_{Q}^{\mathrm{T}}} \right]^{\mathrm{T}} = \text{rank} \left[ {{\boldsymbol D}_{1}^{\mathrm{T}}}, {{\boldsymbol D}_{2}^{\mathrm{T}}}, \ldots, {{\boldsymbol D}_{Q}^{\mathrm{T}}} \right] \\ &=\sum_{q = 1}^{Q}{d_{q}} - \text{{null}} \left[ {{\boldsymbol D}_{1}^{\mathrm{T}}}, {{\boldsymbol D}_{2}^{\mathrm{T}}}, \ldots, {{\boldsymbol D}_{Q}^{\mathrm{T}}}\right]\\ &= \sum_{q = 1}^{Q}\left(\!{\text{rank}\left({\boldsymbol D}_{q} \right)}+{\text{{null}}\left(\!{\boldsymbol D}_{q}^{\mathrm{T}} \right)}\!\right) - \text{{null}} \left[ {{\boldsymbol D}_{1}^{\mathrm{T}}}, {{\boldsymbol D}_{2}^{\mathrm{T}}}, \ldots, {{\boldsymbol D}_{Q}^{\mathrm{T}}} \right]. \end{aligned} $$
((81))

Moreover, according to the property of the nullity of a matrix in [30], if \(\text {range}\left ({\boldsymbol B}_{p}\right)\cap \text {range}\left ({\boldsymbol B}_{p'}\right) = \{0\}\phantom {\dot {i}\!}\), 1≤p,p P, pp , and \(\text {range}\left ({\boldsymbol D}_{q}^{\mathrm {T}}\right)\cap \text {range}\left ({\boldsymbol D}_{q'}^{\mathrm {T}}\right) = \{0\}\), 1≤q,q Q, qq , then

$$\begin{array}{*{20}l} \sum_{p = 1}^{P}{\text{{null}}\left({\boldsymbol B}_{p} \right)} &= \text{{null}} \left[ {\boldsymbol B}_{1}, {\boldsymbol B}_{2}, \ldots, {\boldsymbol B}_{P} \right]\\ \sum_{q = 1}^{Q}\text{{null}}\left({\boldsymbol D}_{q}^{\mathrm{T}} \right) &= \text{{null}} \left[ {{\boldsymbol D}_{1}^{\mathrm{T}}}, {{\boldsymbol D}_{2}^{\mathrm{T}}}, \ldots, {{\boldsymbol D}_{Q}^{\mathrm{T}}}\right], \end{array} $$
((82))

hence (34) holds.

If range(B p )⊆range(B P ) and \(\text {range}\left ({\boldsymbol D}_{q}^{\mathrm {T}}\right) \subseteq \text {range}\left ({\boldsymbol D}_{Q}^{\mathrm {T}}\right) \), 1≤pP−1, 1≤qQ−1, we have

$$\begin{array}{*{20}l} &\text{range}\left[ {\boldsymbol B}_{1}, {\boldsymbol B}_{2}, \ldots, {\boldsymbol B}_{P-1}\right] \subseteq \text{range}\left({\boldsymbol B}_{P}\right), \end{array} $$
((83a))
$$\begin{array}{*{20}l} &\text{range}\left[ {{\boldsymbol D}_{1}^{\mathrm{T}}}, {{\boldsymbol D}_{2}^{\mathrm{T}}}, \ldots, {{\boldsymbol D}_{Q-1}^{\mathrm{T}}} \right] \subseteq \text{range}\left({\boldsymbol D}_{Q}^{\mathrm{T}} \right), \end{array} $$
((83b))

then (35) follows from (83) according to the rank properties of block matrices [31].