Introduction

Conventional single-input single-output (SISO) system, which is a wireless communication system with a single antenna at the transmitter and receiver, is vulnerable to multipath fading effect. Multipath is the arrival of the multiple copies of transmitting signal at the receiver through different angles, time delay or differing frequency (Doppler) shifts due to the scattering of electromagnetic waves. Each copy of the transmitted signal will experience differences in attenuation, delay and phase shift while travelling from the transmitter to the receiver. As a result, constructive or destructive interference is experienced at the receiver. The random fluctuation in signal level, known as fading [1, 2], can severely affect the quality and reliability of wireless communication. Strong destructive interference will cause a deep fade and temporary failure of communication due to severe signal power attenuation. Moreover, the constraints posed by limited power, capacity and scarce spectrum make the design of SISO with high data rate and reliability extremely challenging.

The use of multiple antennas at the receiver and transmitter in a wireless network is rapidly superseding SISO to provide higher data rates at longer ranges especially for Long Term Evolution (LTE) systems [3] without consuming extra bandwidth or power. It is also a solution to the capacity limitation of the current wireless systems. The idea of multiple antennas has evolved into multiple-input multiple-output (MIMO) system, which provides transmit and receive diversities. It increases robustness of the effect of multi-path fading in wireless channels, besides yielding higher capacity, spectral efficiency and better bit error rate (BER) performance over conventional SISO systems in multipath fading environments [1, 4]. However, the revolution of SISO to MIMO causes the computation complexity to be increased.

Vertical Bell Laboratories Layered Space-Time (V-BLAST) uses antenna arrays at both the transmitter and receiver to achieve spatial multiplexing gain. Independent data streams that share both frequency bands and time slots are transmitted from multiple antennas and jointly detected at the receiver. The theoretical capacity of V-BLAST increases linearly with the number of antennas in rich scattering environments [5]. The spatial diversity gain is obtained by transmitting or receiving multiple copies of a signal through different antennas. This scheme is designed to combat fading and improves the system BER performance. Space-time coding (STC) technique such as space-time trellis code (STTC) that combines coding, modulation and signal processing has been used to achieve spatial diversity [6]. It achieves maximum diversity and coding gain but the system computational complexity increases exponentially with transmission rate. Alamouti’s space-time block code (STBC) [7] is another technique used to reduce the computational complexity in STTC. It supports linear decoding complexity for maximum-likelihood (ML) decoding. Orthogonal space-time block code (O-STBC), which is a generalization of the Alamouti’s scheme to an arbitrary number of transmit antennas, was introduced in [8].

However, it was shown in [9] that there is a trade-off between spatial diversity gain and spatial multiplexing gain of MIMO systems. For instance, while the V-BLAST scheme increases spatial multiplexing gain, but it does not provide any spatial diversity gain. The V-BLAST scheme is more susceptible to multipath fading and noise compared to Alamouti’s STBC scheme as there is no redundant information. Besides, the error propagation in V-BLAST detection causes BER performance degradation and limits the potential capacity of the V-BLAST scheme [10]. Although Alamouti’s STBC provides full transmit and receive antenna diversity, the maximum code rate of one can be achieved for two transmit antennas only. For more than two antennas, the maximum possible code rate is 3/4 [11], thus Alamouti’s STBC could not satisfy the demand of the desired high system capacity in real time system with good quality of service (QoS) [12, 13]. Therefore, in order to achieve spatial multiplexing and diversity gains simultaneously, the hybrid MIMO system has been introduced [1416]. One of the hybrid MIMO systems is V-BLAST/STBC scheme. However, this hybrid scheme will further induce inevitably higher computational complexity in designing the system.

The V-BLAST/STBC scheme, which was introduced in [14] is a combination of the Alamouti’s STBC and V-BLAST schemes. A number of research efforts on V-BLAST/STBC scheme have been carried for MIMO system with the goal of maximizing the system capacity and reducing its computational complexity. The V-BLAST/STBC scheme improves the performance of MIMO by combining spatial multiplexing and diversity techniques together [17]. However, the spatially-multiplexed V-BLAST and STBC layers in the V-BLAST/STBC scheme assume each other as an interferer. Therefore, the transmitted symbols must be decoded with well-known detection mechanisms such as zero-forcing (ZF), minimum mean-squared error (MMSE) and QR decomposition which are employed in V-BLAST scheme [18]. Thus, the lowest computation complexity detection mechanism will be preferred. In this paper, a new detection mechanism based on QR decomposition, denoted as low complexity QR (LC-QR) decomposition, is presented. The QR decomposition of A × B channel matrix H is a factorization H = QR, where Q is A × B unitary matrix and R is B × B upper triangular matrix. The computational implementation of QR decomposition is less than ZF and MMSE [19], thus the computational complexity of V-BLAST/STBC scheme can be further reduced by using the proposed LC-QR decomposition detection mechanism.

The performance of V-BLAST/STBC transceiver scheme with proposed LC-QR decomposition mechanism is compared with V-BLAST and Alamouti’s STBC schemes. It is shown that the BER performance of V-BLAST/STBC scheme is better than V-BLAST scheme while the system capacity of V-BLAST/STBC scheme is higher than STBC scheme when the LC-QR decomposition mechanism is exploited. Moreover, the computational complexity of proposed LC-QR decomposition mechanism is significantly lower than traditional ZF, MMSE and QR decomposition detection mechanisms. Since MIMO scheme is considered as the latest multiple access technique for the next generation human computer interaction (HCI) [20] or mobile computing devices, especially in LTE-Advanced system, higher computational complexity inherited with it is inevitable. Any computational complexity reduction mechanism can further reduce the computational cost and power consumption. Therefore, by using the proposed LC-QR decomposition detection mechanism in V-BLAST/STBC MIMO scheme the system performance is not only significantly improved but the computational complexity of the overall system is also significantly reduced.

The rest of the paper is organized as follows. In Section System models, an overview of the hybrid V-BLAST/STBC system model is presented with its traditional ZF, MMSE and QR decomposition decoder mechanisms in the sub-sections. Then, the proposed new LC-QR decomposition detection mechanism is introduced in Section LC-QR Decomposition Mechanism. In Section The Computational Complexity of LC-QR Decomposition Compared with ZF, MMSE and QR Decomposition Detection Mechanisms, the computational complexity comparison of LC-QR decomposition with other detection mechanisms is discussed. The system capacity and probability of error in V-BLAST/STBC scheme with LC-QR decomposition are examined in Sections System Capacity of V-BLAST/STBC Scheme with LC-QR Decomposition and Probability of Error in V-BLAST/STBC Scheme with LC-QR Decomposition respectively. Section Performance Evaluation of LC-QR Decomposition in V-BLAST/STBC Scheme illustrates the system performance of proposed LC-QR decomposition in V-BLAST/STBC scheme. Finally, this paper concludes in Section Conclusions.

System models

The V-BLAST/STBC scheme, which was introduced in [14, 17], is a combination of the Alamouti’s STBC and V-BLAST schemes. It provides spatial diversity gain for high priority data and spatial multiplexing gain for low priority data simultaneously by partitioning a single data stream into two parallel sub-streams according to the data priority. The high priority data (e.g. frame header, I-frame, P-frame) is assigned to the STBC layer for extra protection while low priority data (e.g. B-frame, best-effort data) is sent to V-BLAST layer with higher capacity. Since high priority data is more important than low priority data, the corruption of high priority data will severely affect the real time service quality. For instance, error of any missing data in the first enhancement of P-frame layer is propagated to the subsequent P-frames, and it significantly degrades the perceived MPEG video quality. In contrast, any data loss in the B-frame layer affects only the corresponding frame, as it is not referred by other frames for decoding.

The block diagram of a V-BLAST/STBC transceiver model with M (M ≥ 3) transmit and N (N ≥ M − 1) receive antennas is shown in Figure 1. A single main data stream is de-multiplexed into two sub-data streams according to the data priority. The high priority data is assigned to the STBC layer for extra gain while low priority data is sent to V-BLAST layer with higher capacity. As the Alamouti’s STBC layer spans two symbol intervals, the V-BLAST/STBC scheme transmission matrix over two consecutive symbol periods is given as

x = v 1 , 1 v 1 , 2 v M 2 , 1 v M 2 , 2 s M 1 s M s M s M 1 V BLAST Layer STBC Layer
(1)

where * denotes the complex conjugate operation. The transmitted symbols are assumed to go through a Naftali channel coefficient matrix H where H is the N × M matrix as

H = h 1 , 1 h 1 , M 1 h 1 , M h 2 , 1 h 2 , M 1 h 2 , M h N , 1 h N , M 1 h N , M
(2)
Figure 1
figure 1

The proposed V-BLAST/STBC scheme consisting of a transmitter and a LC-QR decomposition receiver.

The complex channel coefficient of the Naftali channel model using random uniformly distributed phase and Rayleigh distributed magnitude can be described as

h k = N 0 , σ k 2 2 + jN 0 , σ k 2 2
(3)

where σ k 2 = σ 0 2 exp k T s / T rms , σ 0 2 = 1 exp T s / T rms , and N 0 , σ k 2 2 is a zero mean Gaussian random variable with variance σ k 2 2 produced by generating an N(0,1) Gaussian random number and multiplying it by σ k 2 . The σ 0 2 = 1 exp T s / T rms is chosen so that the condition k = 0 k max σ k 2 = 1 is satisfied to ensure the same average received power. The parameters T s and T rms represent sampling period and delay spread of the channel respectively. The number of samples to be taken in the impulse response should ensure sufficient decay of the impulse response to avoid inter symbol interference.

At the receiver, the received matrix over two consecutive symbol periods can be expressed as y = Hx + z or

y 1 , 1 y 1 , 2 y 2 , 1 y 2 , 2 y N , 1 y N , 2 = h 1 , 1 h 1 , M 1 h 1 , M h 2 , 1 h 2 , M 1 h 2 , M h N , 1 h N , M 1 h N , M v 1 , 1 v 1 , 2 v M 2 , 1 v M 2 , 2 s M 1 s M s M s M 1 + z 1 , 1 z 1 , 2 z 2 , 1 z 2 , 2 z N , 1 z N , 2
(4)

where y is the N × 2 received signal matrix and x is the V-BLAST/STBC M × 2 transmission matrix. z is the additive white Gaussian noise (AWGN) N × 2 complex matrix with unit variance σ2 and zero mean. For simplicity, it is assumed that the Naftali channel is constant across two consecutive symbol transmission periods, and hence the entries of H are average Naftali channel coefficients. The received matrix over two time intervals in (4) is re-arranged into a single vector to facilitate formulating the detection mechanism resulting as V = H S + N or

y 1 , 1 y 2 , 1 y N , 1 y 1 , 2 * y 2 , 2 * y N , 2 * = h 1 , 1 h 1 , M 2 0 0 h 1 , M 1 h 1 , M h 2 , 1 h 2 , M 2 0 0 h 2 , M 1 h 2 , M h N , 1 h N , M 2 0 0 h N , M 1 h N , M 0 0 h 1 , 1 * h 1 , M 2 * h 1 , M * h 1 , M 1 * 0 0 h 2 , 1 * h 2 , M 2 * h 2 , M * h 2 , M 1 * 0 0 h N , 1 * h N , M 2 * h N , M * h N , M 1 * v 1 , 1 v M 2 , 1 v 1 , 2 v M 2 , 2 s M 1 s M + z 1 , 1 z 2 , 1 z N , 1 z 1 , 2 * z 2 , 2 * z N , 2 *
(5)

From (5), it can be easily seen that the obtained system is equivalent to a spatial multiplexing scheme. The spatially multiplexed V-BLAST and STBC layers in the V-BLAST/STBC scheme assume each other as interferer. Therefore, the transmitted symbols can be decoded with well-known detection techniques such as ZF, MMSE and QR decomposition which are employed in V-BLAST scheme [18]. The ZF and MMSE techniques involve the computation of Moore-Penrose pseudo-inverse of a matrix with cubic computational complexity. Beside MMSE and ZF, QR decomposition is also a common signal processing technique for MIMO detection [21]. The QR decomposition of A × B channel matrix H is a factorization H = QR, where Q is A × B unitary matrix and R is B × B upper triangular matrix. The computational implementation of QR decomposition is less than ZF and MMSE [19], thus the computational complexity of V-BLAST/STBC scheme can be reduced using QR decomposition. The brief overview of ZF, MMSE and QR decomposition decoder are presented in following sub-sections.

Zero-forcing (ZF) decoder

A ZF V-BLAST/STBC receiver architecture is given in [18, 22] where all the undesired sub-data streams are nulled by linear weighting receive vector V at each detecting step. The ZF decoded symbol S ^ i ZF of the i-th sub-stream is calculated by multiplying the i-th row of the equalizer filter matrix W with the receive vector, and S ^ i ZF is given by

S ^ i ZF = W i V
(6)

where (W) i represents the i-th row of matrix W corresponds the criterion in use. The equalizer filter matrix W is then given by

W = H H H 1 H H
(7)

In an orthogonal channel matrix, ZF is identical to ML. However, in general ZF leads to noise amplification, which is especially observed in systems with the same number of transmit and receive antennas.

Minimum mean squared error (MMSE) decoder

In a MMSE decoder, each received sub-data stream is the superposition of the desired signal [22]. The undesired sub-data streams are considered as interference. At each decoding step, all undesired sub-streams are nulled by linear weighting receive vector V. The MMSE decoded symbol S ^ i MMSE of the i-th sub-stream is calculated by multiplying the i-th row of the equalizer filter matrix D with the receive vector. Hence, S ^ i MMSE for MMSE criterion is given as

S ^ i MMSE = D i V
(8)

where (D) i represents the i-th row of matrix D corresponding to the criterion in use. The equalizer filter matrix D is then given by

D = H H H + σ 2 I 2 M 1 1 H H
(9)

where I2(M− 1) is the 2(M − 1) × 2(M − 1) identity matrix. The MMSE detector takes the noise term into account and leads to performance enhancement compared to ZF consequently.

QR decomposition decoder

The QR decomposition decoder is applied to the equivalent channel matrix H to start the V-BLAST/STBC detection mechanism. QR decomposition is performed with H = Q R which is given as

H = q 1 , 1 q 1 , 2 M 2 q 2 , 1 q 2 , 2 M 2 q 2 N 1 , 1 q 2 N 1 , 2 M 2 q 2 N , 1 q 2 N , 2 M 2 r 1 , 1 r 1 , 2 r 1 , 2 M 3 r 1 , 2 M 2 0 r 2 , 2 r 2 , 2 M 3 r 2 , 2 M 2 0 0 0 0 0 0 r 2 M 3 , 2 M 3 r 2 M 3 , 2 M 2 0 0 0 r 2 M 2 , 2 M 2 Q R
(10)

where Q is a unit norm orthogonal columns 2 N × 2(M − 1) matrix and R is a 2(M − 1) × 2(M − 1) upper triangular square matrix. The QR factorization can be calculated with the Householder reflection [23]. Left-multiplying (5) by Q H results V = R S + N or

y 1 , 1 y 2 , 1 y 3 , 1 y 4 , 1 y 5 , 1 y 6 , 1 = r 1 , 1 r 1 , 2 r 1 , 2 M 3 r 1 , 2 M 2 0 r 2 , 2 r 2 , 2 M 3 r 2 , 2 M 2 0 0 0 0 0 0 r 2 M 3 , 2 M 3 r 2 M 3 , 2 M 2 0 0 0 r 2 M 2 , 2 M 2 v 1 , 1 v M 2 , 1 v 1 , 2 v M 2 , 2 s M 1 s M + z 1 , 1 z 2 , 1 z 3 , 1 z 4 , 1 z 5 , 1 z 6 , 1
(11)

where V = Q H V and N = Q H N . From (11), s M is decoded first, followed by sM − 1,…, and finally v1,1.

LC-QR decomposition mechanism

The computational complexity of ZF, MMSE and QR decomposition with H , which have been discussed in the previous section, is high because of the equivalent channel matrix H with dimension 2 N × 2(M − 1) in (5). Therefore, a low complexity V-BLAST/STBC detection mechanism with QR decomposition, denoted as LC-QR decomposition, is proposed. Instead of decoding the transmitted symbols with the equivalent channel matrix H with dimension 2 N × 2(M − 1), the original channel matrix H in (2) with dimension N × M is utilized. If N < M – 1, the LC-QR decomposition mechanism is invalid and not applicable. The flow chart of LC-QR decomposition mechanism is shown in Figure 2.

Figure 2
figure 2

The flow chart of LC-QR decomposition mechanism.

Case A: N receive antenna is greater than or equal to M transmit antenna (N ≥ M)

The five steps of the LC-QR decomposition mechanism for case A (N ≥ M) are described as follows.

Step 1: Application of QR decomposition to the channel matrix H

The QR decomposition is applied to the channel matrix H with dimension N × M to start the V-BLAST/STBC detection mechanism. The QR decomposition is performed with H = QR by

H = Q 1 , 1 Q 1 , 2 Q 1 , M Q 2 , 1 Q 2 , 2 Q 2 , M Q N , 1 Q N , 2 Q N , M R 1 , 1 R 1 , 2 R 1 , M 0 R 2 , 2 R 2 , M 0 0 0 0 0 R M , M Q R
(12)

where Q is a unit norm orthogonal columns N × M matrix and R is a M × M upper triangular square matrix.

Left-multiplying (4) by QH, we get y ˜ = R x + z ˜ or

y ˜ 1 , 1 y ˜ 1 , 2 y ˜ 2 , 1 y ˜ 2 , 2 y ˜ M , 1 y ˜ M , 2 = R 1 , 1 R 1 , 2 R 1 , M 0 R 2 , 2 R 2 , M 0 0 0 0 0 R M , M v 1 , 1 v 1 , 2 v M 2 , 1 v M 2 , 2 s M 1 s M s M s M 1 + z ˜ 1 , 1 z ˜ 1 , 2 z ˜ 2 , 1 z ˜ 2 , 2 z ˜ M , 1 z ˜ M , 2
(13)

where y ˜ = Q H y and z ˜ = Q H z .

Step 2: Calculate the first estimated symbols of STBC layer with STBC 2 × 2 decoder

In QR decomposition, the triangular properties of R allow the sequence of transmitted symbols to be recovered by simple backward substitution with cancellation. If decision error exists in the first detected sub-data streams, the decision feedbacks will significantly increase the probability of error in decoding the subsequent sub-data streams. Therefore, the first decoding decision is made based on STBC layer first so as to decrease the probability of error in decoding the V-BLAST layer. In step 2, RM − 1,M − 1, RM − 1,M and RM,M of upper triangular square matrix R are utilized to calculate the first estimated symbols of the STBC layer with STBC 2 × 2 decoder. These parameters are derived as

s ˜ M 1 = Q R M 1 , M 1 * y ˜ M 1 , 1 + R M 1 , M y ˜ M 1 , 2 * + R M , M y ˜ M , 2 *
(14)
s ˜ M = Q R M 1 , M * y ˜ M 1 , 1 R M 1 , M 1 y ˜ M 1 , 2 * + R M , M * y ˜ M , 1
(15)

where Q(·) denotes the quantization (slicing) operation appropriate to the constellation in use.

Step 3: Decode the data symbols of V-BLAST layer based on the first estimated symbols of STBC layer

The V-BLAST layer data symbols are decoded with the properties of upper triangular square matrix R in (13) and first estimated symbols of STBC layer. These layer symbols in the first and second time slot are then derived in a successive way as

v ^ i , 1 = Q y ˜ i , 1 j = i + 1 M 2 R i , j v ^ j , 1 + R i , M 1 s ˜ M 1 + R i , M s ˜ M R i , i
(16)
v ^ i , 2 = Q y ˜ i , 2 j = i + 1 M 2 R i , j v ^ j , 2 + R i , M 1 s ˜ M + R i , M s ˜ M 1 R i , i
(17)

Step 4: Interference cancellation

A new modified received N × 2 matrix, s is created by subtracting the data symbols of V-BLAST layer from the original received matrix y in this step, which results in the following matrix

s 1 , 1 s 1 , 2 s 2 , 1 s 2 , 2 s N , 1 s N , 2 = y 1 , 1 y 1 , 2 y 2 , 1 y 2 , 2 y N , 1 y N , 2 h 1 , 1 h 1 , M 2 h 2 , 1 h 2 , M 2 h N , 1 h N , M 2 v ^ 1 , 1 v ^ 1 , 2 v ^ M 2 , 1 v ^ M 2 , 2
(18)

Step 5: Decode the data symbols of STBC layer from the new modified received matrix with STBC 2 × N decoder

The data symbols of STBC layer are decoded from the new modified received matrix s with STBC 2 × N decoder and are given by

s ^ M 1 = Q k = 1 N h k , M 1 * s k , 1 + h k , M s k , 2 *
(19)
s ^ M = Q k = 1 N h k , M * s k , 1 h k , M 1 s k , 2 *
(20)

Case B: N receive antenna is less than M transmit antenna by 1 (N = M − 1)

The five steps of the LC-QR mechanism for case B (N = M − 1) are described as follows.

Step 1: Application of QR decomposition to the channel matrix H

The QR decomposition is applied to the channel matrix H with dimension (M − 1) × M to start the V-BLAST/STBC detection mechanism. The QR decomposition is performed with H = QR by

H = Q 1 , 1 Q 1 , 2 Q 1 , M 1 Q 2 , 1 Q 2 , 2 Q 2 , M 1 Q M 1 , 1 Q M 1 , 2 Q M 1 , M 1 R 1 , 1 R 1 , 2 R 1 , M 1 R 1 , M 0 R 2 , 2 R 2 , M 1 R 2 , M 0 0 0 0 0 R M 1 , M 1 R M 1 , M Q R
(21)

where Q is a unit norm orthogonal columns (M − 1) × (M − 1) matrix and R is a (M − 1) × M upper triangular square matrix. Left-multiplying (4) by QH, we get y ˜ = Rx + z ˜ or

y ˜ 1 , 1 y ˜ 1 , 2 y ˜ 2 , 1 y ˜ 2 , 2 y ˜ M 1 , 1 y ˜ M 1 , 2 = R 1 , 1 R 1 , 2 R 1 , M 1 R 1 , M 0 R 2 , 2 R 2 , M 1 R 2 , M 0 0 0 0 0 R M 1 , M 1 R M 1 , M v 1 , 1 v 1 , 2 v M 2 , 1 v M 2 , 2 s M 1 s M s M s M 1 + z ˜ 1 , 1 z ˜ 1 , 2 z ˜ 2 , 1 z ˜ 2 , 2 z ˜ M 1 , 1 z ˜ M 1 , 2
(22)

where y ˜ = Q H y and z ˜ = Q H z .

Step 2: Calculate the first estimated symbols of the STBC layer with STBC 2 × 1 decoder

In QR decomposition, the triangular properties of R allow the sequence of transmitted symbols to be recovered by simple backward substitution with cancellation. If decision error exists in the first detected sub-data streams, the decision feedbacks will significantly increase the probability of error in decoding the subsequent sub-data streams. Therefore, the first decoding decision is made based on STBC layer first so as to decrease the probability of error in decoding the V-BLAST layer. In step 2, RM − 1,M − 1 and RM − 1,M of upper triangular square matrix R are utilized to calculate the first estimated symbols of the STBC layer with STBC 2 × 1 decoder. These parameters are derived as

s ˜ M 1 = Q R M 1 , M 1 * y ˜ M 1 , 1 + R M 1 , M y ˜ M 1 , 2 *
(23)
s ˜ M = Q R M 1 , M * y ˜ M 1 , 1 R M 1 , M 1 y ˜ M 1 , 2 *
(24)

where Q(·) denotes the quantization (slicing) operation appropriate to the constellation in use.

Step 3: Decode the data symbols of V-BLAST layer based on the first estimated symbols of STBC layer

The V-BLAST layer data symbols are decoded with the properties of upper triangular square matrix R in (22) and first estimated symbols of STBC layer. These layer symbols in the first and second time slot are then derived in a successive way as

v ^ i , 1 = Q y ˜ i , 1 j = i + 1 M 2 R i , j v ^ j , 1 + R i , M 1 s ˜ M 1 + R i , M s ˜ M R i , i
(25)
v ^ i , 2 = Q y ˜ i , 2 j = i + 1 M 2 R i , j v ^ j , 2 + R i , M 1 s ˜ M + R i , M s ˜ M 1 R i , i
(26)

Step 4: Interference Cancellation

A new modified received (M − 1) × 2 matrix, s is created by subtracting the data symbols of V-BLAST layer from the original received matrix y in this step, which results in the following matrix

s 1 , 1 s 1 , 2 s 2 , 1 s 2 , 2 s M 1 , 1 s M 1 , 2 = y 1 , 1 y 1 , 2 y 2 , 1 y 2 , 2 y M 1 , 1 y M 1 , 2 h 1 , 1 h 1 , M 2 h 2 , 1 h 2 , M 2 h M 1 , 1 h M 1 , M 2 v ^ 1 , 1 v ^ 1 , 2 v ^ M 2 , 1 v ^ M 2 , 2
(27)

Step 5: Decode the data symbols of STBC layer from the new modified received matrix with STBC 2 × (M − 1) decoder

The data symbols of STBC layer are decoded from the new modified received matrix s with STBC 2 × (M − 1) decoder and are given by

s ^ M 1 = Q k = 1 M 1 h k , M 1 * s k , 1 + h k , M s k , 2 *
(28)
s ^ M = Q k = 1 M 1 h k , M * s k , 1 h k , M 1 s k , 2 *
(29)

The computational complexity of LC-QR decomposition compared with ZF, MMSE and QR decomposition detection mechanisms

The channel matrix H with dimension N × M is used to analyze the computational complexity of the LC-QR decomposition. The equivalent channel matrix H with dimension 2 N × 2(M − 1) is used to analyze the computational complexity of ZF, MMSE and QR decomposition. It is observed that there are zeros in channel matrix H , therefore the multiplication and addition with zero are not taken into account in ZF and MMSE complexity calculation. The detail of computational complexity of each mechanism is presented in the following sub-sections.

Zero-Forcing (ZF) with H

From (7), the process to calculate the ZF equalizer filter matrix W is divided into four steps.

Step 1 involves multiplication of H H with H , it requires 8 N(M − 1)2 multiplications and 4(2 N − 1)(M − 1)2 additions, so the number of complex arithmetic operations in step 1 is 4(4 N − 1)(M − 1)2.

Step 2 involves Gaussian elimination matrix inversion of H H H . According to [23], the computational complexity of Gaussian elimination matrix inversion with dimension a × b matrix is O(ab2). Since the dimension of matrix H H H is 2(M − 1) × 2(M − 1), thus the number of complex arithmetic operations in step 2 is 8(M − 1)3.

Step 3 performs multiplication of H H H 1 with H H , it requires 8 N(M − 1)2 multiplications and 8 N(M − 1)2 4 N(M − 1) additions, then the number of complex arithmetic operations in step 3 is 16 N(M − 1)2 4 N(M − 1).

In step 4, the decoded symbols with ZF mechanism are calculated by multiplying equalizer filter matrix W with the receive vector V and it needs 4 N(M − 1) multiplications and 2(2 N − 1)(M − 1) additions. So, the number of complex arithmetic operations in step 4 is 2 (4 N − 1) (M − 1).

Finally, the total complex arithmetic operation of ZF V-BLAST/STBC is 8(M − 1)3 + 4(8 N − 1)(M − 1)2 + 2(2 N − 1)(M − 1).

Minimum mean-squared error (MMSE) with H

From (9), the process to calculate the MMSE equalizer filter matrix D is divided into five steps.

Step 1 involves multiplication of H H with H , it requires 8 N(M − 1)2 multiplications and 4(2 N − 1)(M − 1)2 additions, so the number of complex arithmetic operations in step 1 is 4(4 N − 1)(M − 1)2.

Step 2 performs the addition of H H H . with σ2I2(M − 1), it requires I2(M − 1) additions, and then the number of complex arithmetic operations in step 2 is I2(M − 1).

Step 3 involves Gaussian elimination matrix inversion of H H H + σ 2 I 2 M 1 . Since the dimension of matrix H H H + σ 2 I 2 M 1 is 2(M − 1) × 2(M − 1), thus the number of complex arithmetic operations in step 3 is 8(M − 1)3[23].

Step 4 performs multiplication of H H H + σ 2 I 2 M 1 1 with H H , it requires 8 N(M − 1)2 multiplications and 8 N(M − 1)2 4 N(M − 1) additions, then the number of complex arithmetic operations in step 4 is 16 N(M − 1)2 4 N(M − 1).

In step 5, the decoded symbols with MMSE mechanism are calculated by multiplying equalizer filter matrix D with the receive vector V and it needs 4 N(M − 1) multiplications and 2(2 N − 1)(M − 1) additions. So, the number of complex arithmetic operations in step 5 is 2 (4 N − 1) (M − 1).

Finally, the total complex arithmetic operation of MMSE V-BLAST/STBC is 8(M − 1)3 + 4(8 N − 1)(M − 1)2 + 4 N(M − 1).

Conventional QR decomposition with H

Step 1 involves application of QR decomposition to the channel matrix H and multiplication of Q H with V. According to [23], the computational complexity of QR decomposition by Householder reflection with size a × b matrix is O(ab2 − b3/3). Since the dimension of channel matrix H is 2 N × 2(M − 1), thus the number of complex arithmetic operations for the QR decomposition of H being 8 N(M − 1)2 (8/3)(M − 1)3. Besides, for the multiplication of Q H with V, it needs 2 N × 2 (M − 1) multiplications and (2 N − 1) × 2(M − 1) additions. Therefore, the number of complex arithmetic operations in step 1 is 8 N (M − 1)2 (8/3)(M − 1)3 + 2(4 N − 1) × (M − 1).

Step 2 decodes the transmitted symbols with backward substitution with cancellation; it requires (M − 2) × (2 M − 3) additions, 2(M − 1)  1 subtractions, (2 M − 3) × (M − 1) multiplications and 2(M − 1) divisions.

Finally, the total complex arithmetic operation of conventional QR decomposition is 8 N(M − 1)2 (8/3)(M − 1)3 + (2 M − 3)2 + 2(M − 1)(4 N + 1)  1.

LC-QR decomposition with H

Step 1 of LC-QR decomposition involves application of QR decomposition to the channel matrix H and multiplication of QH with y. Since the dimension of channel matrix H is N × M, thus the number of complex arithmetic operations for the QR decomposition of H is NM2 − M3/ 3. Besides, for the multiplication of QH with you, it needs N × M multiplications and (N − 1) × M additions. Therefore, the number of complex arithmetic operations in step 1 is NM2 − M3/ 3 + M(2 N − 1).

Step 2 of LC-QR decomposition calculates the first estimate symbols of the STBC layer with STBC 2 × 2 decoder. It needs six multiplications, three additions and one subtraction. So, the number of complex arithmetic operations in step 2 is ten.

Step 3 of the LC-QR decomposition decodes the data symbols of V-BLAST layer based on the first estimate symbols of STBC layer, so it requires (M − 2)(M − 1) additions, 2(M − 2) subtractions, (M − 2)(M + 1) multiplications and 2(M − 2) divisions. Therefore, the number of complex arithmetic operations in step 3 is 2 (M − 2) (M + 2).

Step 4 of LC-QR decomposition involves interference cancellation, thus it requires 2 N(M − 2) multiplications, 2 N(M − 3) additions and 2 N subtractions. Therefore, the number of complex arithmetic operations in step 4 is 4 N (M − 2).

In Step 5, it decodes the data symbols of the STBC layer from the new modified received matrix s with STBC 2 × N decoder, it requires 4 N multiplications, N + 2(N − 1) additions and N subtractions. Therefore, the number of complex arithmetic operations in step 5 is 2 (4 N − 1).

Finally, the total complex arithmetic operation of LC-QR is NM2 − M3/ 3 + (M + 2)(2 N − 1) + 2(M − 2)(M + 2) + 4 N (M − 1) + 10.

The comparison of number of complex arithmetic operations and reduction of computational complexity for ZF, MMSE, QR decomposition with H and LC-QR decomposition detection mechanism with H is shown in Tables 1 and 2 respectively. From Table 2, it can be observed that the proposed LC-QR decomposition detection mechanism significantly reduces the arithmetic complexity compared to ZF, MMSE and QR decomposition mechanisms.

Table 1 Comparison of number of complex arithmetic operations for ZF, MMSE, QR decomposition and LC-QR decomposition detection mechanisms
Table 2 Reduction of computational complexity for ZF, MMSE and QR decomposition compared to LC-QR decomposition detection mechanisms

System capacity of V-BLAST/STBC scheme with LC-QR decomposition

The instantaneous capacity of an orthogonal STBC of rate r c and M transmit antennas, denoted as C STBC , is given by [24]

C STBC = r c lo g 2 1 + ρ M H 2
(30)

where ρ is the signal-to-noise ratio (SNR) per receive antenna. According to (14) and (15), only RM − 1,M − 1, RM − 1,M and RM,M of upper triangular square matrix R are required to decode the STBC layer, thus the instantaneous system capacity of the STBC layer of the LC-QR with r c  = 1 and M = 4, denoted as CP _ STBC, is [25]:

C P _ STBC = lo g 2 1 + ρ 4 R M 1 , M 1 2 + R M 1 , M 2 + R M , M 2
(31)

From the procedure of decoding V-BLAST layer symbols in (16) and (17), the SNR of every V-BLAST channel can be determined. The received sub-data streams of V-BLAST layer in the first time slot are given by

y ˜ i , 1 = R i , i v i , 1 + j = i + 1 M 2 R i , j v ^ j , 1 + R i , M 1 s ˜ M 1 + R i , M s ˜ M + z ˜ i , 1
(32)

while the received sub-data streams of V-BLAST layer in the second time slot are given by

y ˜ i , 2 = R i , i v i , 2 + j = i + 1 M 2 R i , j v ^ j , 2 + R i , M 1 s ˜ M + R i , M s ˜ M 1 + z ˜ i , 2
(33)

where i = 1, …, M – 2. Here, it is assumed that when the V-BLAST layer symbols are decoded, the previously decoded first estimated symbols of STBC layer have been cancelled properly. Hence, decoding of v ^ i , 1 and v ^ i , 2 become

v ^ i , 1 = v i , 1 + z ˜ i , 1 R i , i
(34)
v ^ i , 2 = v i , 2 + z ˜ i , 2 R i , i
(35)

It can be observed from (25) and (26) that the SNR of i-th detected sub-data stream of V-BLAST layer over two time slots is determined by the diagonal element |Ri,i|2 of R where i = 1, 2. So, the SNR of i-th detected sub-data stream, denoted as ρ i , becomes

ρ i = ρ R i , i 2 M
(36)

Since the V-BLAST layer of V-BLAST/STBC scheme transmits four data symbols over two consecutive symbol transmission periods, the instantaneous system capacity of the V-BLAST layer of LC-QR decomposition, denoted as CP _ V − BLAST, is [26]

C P _ V BLAST = 4 lo g 2 1 + min i 1 , , M 2 ρ i 2 = 2 lo g 2 1 + min k 1 , , M 2 ρ R i , i 2 4
(37)

Therefore, the total capacity of V-BLAST/STBC Scheme with LC-QR decomposition, denoted as CP _ LC ‐ QR, is the summation of CP _ STBC and CP _ V ‐ BLAST as

C P _ LC QR = C P _ STBC + C P _ V BLAST = lo g 2 1 + ρ 4 R M 1 , M 1 2 + R M 1 , M 2 + R M , M 2 + 2 lo g 2 1 + min k 1 , , M 2 ρ R i , i 2 4
(38)

Probability of error in V-BLAST/STBC scheme with LC-QR decomposition

A MIMO communication system with M transmit and N receive antennas is considered in a Rayleigh frequency flat-fading channel. The received signal is given by

y = Hx + z
(39)

where y is the N × 2 received signal matrix and H is the N × M Rayleigh flat fading channel matrix. z is the AWGN N × 2 complex matrix with unit variance σ z 2 and zero mean. It is assumed that the entries of H are circularly symmetric, independent and identically distributed (i.i.d.) with zero-mean and unit variance σ h 2 . It is also assumed that the transmission matrix and with uniform power is independent. The covariance matrix of x is E x x = σ x 2 I , where E [•]  denotes the expected value and  (•)* is the conjugate transpose. The SNR, denoted as γ, is defined as below

γ = σ x 2 σ z 2
(40)

Analytical model of V-BLAST layer

Denote the QR decomposition of H as H = QR. The matrix R is upper triangular matrix with diagonal real number elements. The entries of R are independent of each other. The off diagonal elements R i j , for 1 ≤ i < j ≤ M, are zero-mean complex Gaussian with unit variance. The square of the i-th diagonal element of R, denoted as R i i 2 , is a chi-square distribution with the 2 k degrees of freedom, where k is determined by the diversity gain of V-BLAST scheme. Therefore, the probability density function (PDF) of post-detection SNR of V-BLAST scheme with chi-square distribution and 2 k degrees of freedom, denoted as f V (x, 2k), is given by [27]

f V x , 2 k = 1 σ 2 k 2 k Γ k x k 1 e x / 2 σ 2
(41)

where Γ(k) is the gamma function given by

Γ k = 0 t k 1 e t dt , k > 0
(42)

If k is a positive integer, then Γ(k) = (k − 1) !. The received V-BLAST/STBC matrix is given by y ˜ = R x + z ˜ as in (13). Hence, the received sub-data streams of V-BLAST layer in the first and second time slot are given by

y ˜ i , 1 = R i , i v i , 1 + j = i + 1 M 2 R i , j v ^ j , 1 + R i , M 1 s ˜ M 1 + R i , M s ˜ M + z ˜ i , 1
(43)
y ˜ i , 1 = R i , i v i , 1 + j = i + 1 M 2 R i , j v ^ j , 1 + R i , M 1 s ˜ M 1 + R i , M s ˜ M + z ˜ i , 1
(44)

where i = 1, …, M – 2. It is assumed that the propagation error effect is cancelled properly, then (42) and (43) are simplified to

y ˜ i , 1 = R i , i v i , 1 + z ˜ i , 1
(45)
y ˜ i , 2 = R i , i v i , 2 + z ˜ i , 2
(46)

The instantaneous post-detection SNR is determined by the R i i 2 . It is shown in [28] that the diversity gain of i-th detected sub-data stream layer of V-BLAST scheme is (N − i + 1). Thus, R i i 2 is a chi-square distribution with the degree of freedom 2(N − i + 1), which is denoted as χ 2 N i + 1 2 . It can be observed that the when the i is larger, the diversity gain of the i-th layer becomes smaller. Consequently, the largest or M-th layer limits the overall performance of V-BLAST scheme at high SNR. Therefore, the overall diversity gain of V-BLAST scheme is N − M + 1 [29].

After that, M V is defined as the number of transmit antenna of V-BLAST layer in V-BLAST/STBC scheme. Then, the effective diversity gain of V-BLAST layer in V-BLAST/STBC scheme becomes NM V  + 1. Thus, substituting k = NM V  + 1 into (41), the PDF of instantaneous post-detection SNR of V-BLAST layer in V-BLAST/STBC scheme with chi-square distribution and 2(NM V  + 1) degrees of freedom, denoted as f VS V x , 2 N M V + 1 , is given by

f VS V x , 2 N M V + 1 = 1 σ 2 N M V + 1 2 N M V + 1 Γ N M V + 1 x N M V e x / 2 σ 2
(47)

It is shown in [30] that BER of M-ary quadrature amplitude modulation (M-QAM) over AWGN noise is given by

P e , M QAM γ = 4 log 2 M 1 1 M × i = 1 M / 2 Q 2 i 1 3 γ log 2 M M 1
(48)

where  Q(•) is Gaussian Q-Function given by

Q x = 1 2 1 erf x 2
(49)

and the error function  erf(•) is defined as

erf x = 2 π 0 x e t 2 dt
(50)

It is shown in [31] that the average error probability can be obtained by averaging the conditional error probability of modulation scheme over the PDF of instantaneous post-detection SNR of MIMO scheme. Therefore, the BER performance of V-BLAST layer in V-BLAST/STBC scheme with M-QAM modulation, M V and N, denoted as P e , M QAM , VS V M V , N , γ , can be computed by integrating (46) and (47) to become

P e , M QAM , VS V M V , N , γ = 0 P e , M QAM γ f VS V x , 2 N M V + 1 dx = 0 4 log 2 M 1 1 M × i = 1 M / 2 Q 2 i 1 3 log 2 M γ x M 1 × 1 σ 2 N M V + 1 2 N M V + 1 Γ N M V + 1 x N M V e x / 2 σ 2 dx
(51)

Analytical model of STBC layer

From (18), interference cancellation is performed by subtracting the data symbols of V-BLAST layer. Before interference cancellation, the received i-th sub-data stream in l-th time slot is given by

y i , l = h i , M 1 s M 1 + h i , M s M STBC layer symbols + j = i M 2 h i , j v j , l + z i , l Interference AWGN noise
(52)

It can be seen from (51) that the received i-th sub-data stream is composed of STBC layer symbols, AWGN noise and the potential propagation error from V-BLAST layer. The equivalent noise is the combination of the last two parts.

In the ideal case where there is no propagation error from V-BLAST layer, the BER analysis of STBC layer is the same as BER analysis of the Alamouti’s STBC scheme with two transmit and N receive antennas. The instantaneous post-detection SNR of Alamouti’s STBC scheme with two transmit and N receive antennas, denoted as γ S , is given by

γ S = γ i = 1 N h i , 1 2 + h i , 2 2
(53)

Since hi,1 and hi,2 are circularly symmetric, i.i.d. Gaussian random variables with zero-mean and unit variance, the γ S has a chi-square distribution with 2 × 2 × N degrees of freedom.

The diversity gain of O-STBC scheme is given by M × N[32]. So, the PDF of instantaneous post-detection SNR of O-STBC scheme with chi-square distribution and 2 × M × N degrees of freedom, denoted as f S (x, 2MN), is given by

f S x , 2 MN = 1 σ 2 MN 2 MN Γ MN x MN 1 e x / 2 σ 2
(54)

The BER of O-STBC scheme with M transmit and N receive antennas and M-QAM modulation, denoted as P e , M QAM S M , N , γ , can be calculated by integrating (47) and (53) as follows

P e , M QAM S M , N , γ = 0 P e , M QAM γ f S x , 2 MN dx = 0 4 log 2 M 1 1 M × i = 1 M / 2 Q 2 i 1 3 log 2 M γ x M 1 × 1 σ 2 MN 2 MN Γ MN x MN 1 e x / 2 σ 2 dx
(55)

From (19) and (20), the STBC layer of V-BLAST/STBC scheme is decoded with STBC 2 × N decoder after the V-BLAST layer is decoded. Therefore, the BER performance of V-BLAST layer dominates the BER performance of STBC layer. The statistical properties of BER performance of STBC layer are said to be conditional.

M S is defined as the number of transmit antenna of Alamouti’s STBC layer in V-BLAST/STBC scheme, where M S  = 2. On the other hand, an event with k symbol errors detection in V-BLAST layer, denoted as A k , is defined where k ≤ 2M V . Then, P e , M QAM S M S , N , γ | A k is defined as the P e , M QAM S M S , N , γ after the event A k happens. On the other hand, the symbol error rate (SER) of V-BLAST layer with M V transmit, N receive antennas and M-QAM modulation over AWGN noise, denoted as P SER , M QAM V M V , N , γ , can be expressed as [33]

P SER , M QAM V M V , N , γ = 2 1 1 M erfc 3 γ 2 M 1
(56)

where erfc(x) = 1 – erf(x).

Since there is no symbol errors detection in V-BLAST layer, the A k becomes A0. Hence, the BER performance of STBC layer in V-BLAST/STBC scheme in ideal case is the same as the BER performance of Alamouti’s STBC scheme. Therefore, the BER performance of STBC layer in V-BLAST/STBC scheme in the ideal case with M-QAM modulation, M S and N, denoted as P e , M QAM , VS ideal M S , N , γ , can be computed as

P e , M QAM , VS ideal M S , N , γ = P e , M QAM S M S , N , γ | A 0 × 1 P SER , M QAM V M V , N , γ
(57)

where P e , M QAM S M S , N , γ | A 0 = P e , M QAM S M S , N , γ .

In the non-ideal case, there is a propagation error from V-BLAST layer as more than or equal to one symbols of V-BLAST layer are decoded wrongly. The σ V 2 and σ S 2 are defined as unit variance of V-BLAST and STBC layer respectively, where σ V 2 = σ S 2 = σ x 2 . From (51), the combination of interference and AWGN noise with k V-BLAST symbol errors detection can be written as σ z 2 + k σ V 2 . Then, the effective SNR for STBC layer with k V-BLAST symbol errors detection, denoted as γ ˜ k , can be written as

γ ˜ k = σ S 2 σ z 2 + k σ V 2
(58)

where σ z 2 0 . Dividing (57) by σ z 2 , will get

γ ˜ k = γ S 1 + k γ V
(59)

where γ V  = γ S  = γ. Figure 3 shows the block diagram of analytical model of STBC layer in V-BLAST/STBC scheme.

Figure 3
figure 3

Block diagram of analytical model of STBC layer in V-BLAST/STBC scheme.

The BER performance of STBC layer in V-BLAST/STBC scheme in non-ideal case with k-th detection V-BLAST symbol errors, M-QAM modulation, M S and N, denoted as P e , M QAM , VS non ideal M S , N , γ ˜ k , can be computed as

P e , M QAM , VS non ideal M S , N , γ ˜ k = P e , M QAM S M S , N , γ ˜ k | A k × P SER , M QAM V M V , N , γ k = k = 1 2 M V P e , M QAM S M S , N , γ ˜ k × 2 M V k × P SER , M QAM V M V , N , γ k × 1 P SER , M QAM V M V , N , γ 2 M V k
(60)

where

2 M V k = 2 M V ! k ! 2 M V k !
(61)

which is the number of k permutations of an 2M V -element set. Meanwhile, P SER , M QAM V M V , N , γ k × 1 P SER , M QAM V M V , N , γ 2 M V k denotes the probability of error of V-BLAST layer with event A k .

It is known that P SER , M QAM V M V , N , γ 0 for all SNR when σ z 2 0 . As long as P SER , M QAM V M V , N , γ 0 , k exists for 1, …, 2M V . So, it can deduced that σ z 2 is uncorrelated with k when σ z 2 0 . Finally, the BER performance of STBC layer in V-BLAST/STBC scheme with M-QAM modulation, M V and N, denoted as P e , M QAM , VS S M V , N , γ , can be computed as

P e , M QAM , VS S M V , N , γ = P e , M QAM , VS ideal M S , N , γ + P e , M QAM , VS non ideal M S , N , γ ˜ k
(62)

Analytical model of V-BLAST/STBC scheme

By combining (50) and (61), the final overall BER performance P e , M QAM , VS O M , N for V-BLAST/STBC scheme with M-QAM modulation, M transmit and N receive antennas can be computed as

P e , M QAM , VS O M , N = m × P e , M QAM , VS V M V , N , γ + n × P e , M QAM , VS S M V , N , γ m + n
(63)

where M S  = 2 and M S  + M V  = M. The m and n are the symbol transmission rates of V-BLAST and STBC layer respectively.

Performance evaluation of LC-QR decomposition in V-BLAST/STBC scheme

The simulations have been performed on the MIMO system using MATLAB to evaluate the performance of the V-BLAST/STBC scheme with ZF, MMSE and QR decomposition with H and LC-QR decomposition with H. The 4-ary quadrature amplitude modulation (4-QAM) constellation is used in these simulations with Rayleigh flat-fading channel as well as Naftali channel. The entries of Rayleigh flat-fading channel matrix are circularly symmetric, i.i.d. Gaussian random variables with zero-mean and unit variance. The symbol rate of ZF V-BLAST 4 × 4, O-STBC 4 × 4 and V-BLAST/STBC 4 × 4 is shown in Table 3.

Table 3 Symbol rate of various 4 × 4 MIMO schemes

Figure 4 depicts the 10% and 1% outage capacity for MIMO (theoretical limit), ZF V-BLAST, O-STBC and V-BLAST/STBC with LC-QR decomposition in 4 × 4 system. The spectral efficiency of ZF V-BLAST changes a lot with different outage probability. For instance, ZF V-BLAST requires 8 dB to maintain the capacity of 15 bps/Hz when it proceeds from 10% to 1% outage probability. This is caused by lack of diversity of ZF V-BLAST. In contrast, the V-BLAST/STBC with LC-QR decomposition just requires 3 dB to maintain at the capacity of 15 bps/Hz. Last but not least, the O-STBC is the most stable one, as the curve of 10% outage capacity is very close to the curve of 1% outage capacity. Besides, it can be observed that the capacity of ZF V-BLAST with 10% outage probability is the highest among the considered schemes for SNR higher than 37 dB. Meanwhile, the capacity of V-BLAST/STBC with 10% outage probability is close to MIMO capacity at low SNR.

Figure 4
figure 4

The 10% and 1% outage capacity comparison for various schemes 4 × 4.

Figure 5 shows the BER performance comparison of ZF V-BLAST 4 × 4, O-STBC 4 × 4 with rate ¾ and V-BLAST/STBC 4 × 4 with LC-QR decomposition in Rayleigh flat-fading channel environment, which is constant across four consecutive symbol transmission periods. It could be seen that the BER performance of O-STBC 4 × 4 with symbol rate ¾ is the best among the considered schemes as it is a pure spatial diversity scheme with full diversity gain. Moreover, O-STBC does not suffer from inter-symbol interference (ISI) as the transmitted symbols are orthogonal to one another. In contrast, ZF V-BLAST 4 × 4 with symbol rate four is the worst among the schemes because it is a pure spatial multiplexing scheme which suffers from poor diversity gain. Besides, interference between transmitted symbols in ZF V-BLAST scheme greatly reduces the BER performance. It could be seen that the V-BLAST/STBC 4 × 4 with symbol rate three shows a compromise of BER performance with respect to pure spatial multiplexing or diversity scheme.

Figure 5
figure 5

BER performance of various schemes 4 × 4.

It can be concluded that Figures 4 and 5 present the tradeoffs among ZF V-BLAST, O-STBC and V-BLAST/STBC with LC-QR decomposition in 4 × 4 MIMO system. The O-STBC achieves the best BER performance, but the system capacity is the lowest among the considered schemes. Meanwhile, the system capacity of ZF V-BLAST with 10% outage capacity is the highest for SNR above 37 dB, but the BER performance is the worst among the considered schemes. On the other hand, the system capacity of V-BLAST/STBC with LC-QR decomposition is close to MIMO and better than ZF V-BLAST for SNR below 37 dB. Moreover, the BER performance of V-BLAST/STBC with LC-QR decomposition is significantly better than ZF V-BLAST as V-BLAST/STBC achieves spatial multiplexing and diversity gain simultaneously.

Figure 6 shows the BER performance of various mechanisms in V-BLAST/STBC scheme with Rayleigh flat-fading channel model, which is constant across two consecutive symbol transmission periods. The LC-QR decomposition 3 × 2 with H outperforms ZF 3 × 2 and MMSE 3 × 2 with H by more than 2 dB gain at BER of 10-3. At the same time, the LC-QR decomposition 3 × 2 with H outperforms QR 3 × 2 with H by approximately 1.5 dB gain at BER of 10-3. This is because the estimated candidate of STBC layer, which is more robust than V-BLAST layer, is decoded first. After decoding the estimated candidate of STBC layer, interference cancellation activity is performed to produce a new modified received signal matrix s with less interference. For ZF and MMSE mechanisms, there is no interference cancellation activity.

Figure 6
figure 6

BER performance of various mechanisms in V-BLAST/STBC scheme with Rayleigh flat-fading channel model.

It is clear that V-BLAST layer performance dominates over final decisions of STBC layer in (25) and (26). With increasing SNR, the probability of error in decoding the V-BLAST layer is reduced, the probability of correct decoding is increased at the STBC layer. As the V-BLAST layer transmits four data symbols while STBC layer transmits two data symbols over two consecutive symbol transmission periods, thus a better BER performance of V-BLAST layer with LC-QR decomposition leads to overall V-BLAST/STBC system performance improvement.

Figure 7 illustrates the BER performance comparison of various mechanisms in V-BLAST/STBC scheme with a Naftali channel model under different maximum delay spread environment. The LC-QR decomposition 4 × 4 outperforms ZF 4 × 4 and MMSE 4 × 4 by approximately 2 dB gain at BER of 10-3 for both indoor (200 ns) and outdoor (1.6 μs) environment. At the same time, the LC-QR decomposition 4 × 4 outperforms QR decomposition 4 × 4 by approximately 1 dB gain at BER of 10-3 for both indoor and outdoor environment. In order to maintain the BER of 10-3, the LC-QR decomposition with a maximum delay spread = 1.6 μs requires 6 dB gain compared to indoor environment with a maximum delay spread = 200 ns.

Figure 7
figure 7

BER performance of various mechanisms in V-BLAST/STBC scheme under different maximum delay spread environment.

Figure 8 depicts BER performance of various mechanisms in V-BLAST/STBC scheme under different maximum delay spread environment at 16 dB. From Figures 7 and 8, it can be observed that performance BER of all considered mechanism degrades when the maximum delay spread is increasing because the delayed signal from multi-path overlaps the direct signal for the next symbol, thus ISI occurs. Moreover, it is well known that the delay spread causes frequency selective fading in the channel, which acts like a finite impulse response (FIR) filter. If the delay spread is comparable or larger than the symbol duration, the frequency-selective channel will distort the signal and not all frequency components fade simultaneously. As a result, the orthogonality of V-BLAST/STBC data symbols could not be maintained, thus the BER performance declines.

Figure 8
figure 8

BER performance of various mechanisms in V-BLAST/STBC scheme under different maximum delay spread environment at 16 dB.

Figure 9 plots the BER performance of LC-QR decomposition with different standard deviations of channel estimation error σE and delay spread = 200 ns. On the other hand, Figure 10 shows the BER performance of various mechanisms in V-BLAST/STBC scheme with channel estimation error and delay spread = 200 ns at 16 dB. It can be observed that the increase of standard deviation of channel estimation error decreases the BER of all considered mechanisms as channel estimation error is considered as an additional noise in the receiver. Consequently, irreducible error floor occurs when SNR increases in Figure 9. According to Figure 10, it can be deduced that all the considered mechanisms suffer from the same amount of noise variance from channel estimation error, because the equivalent channel matrix for ZF, MMSE and QR decomposition does not reduce the effect of channel estimation error.

Figure 9
figure 9

BER performance of LC-QR 4 × 4 with different standard deviations of channel estimation error σ E and delay spread = 200 ns.

Figure 10
figure 10

BER performance of various mechanisms in V-BLAST/STBC scheme 4 × 4 with different standard deviations of channel estimation error σ E and delay spread = 200 ms at 16 dB.

Figure 11 illustrates the percentage of computational complexity reduction of the LC-QR decomposition with H compared to ZF, MMSE and QR decomposition with H for N ≥ M. It can be observed that the LC-QR decomposition reduces the arithmetic operation complexity by at least 80% compared to ZF and MMSE with H as well as 35% compared to QR decomposition with H . Thus, the LC-QR decomposition shows significant computational complexity improvement. The main reason is that the LC-QR decomposition utilizes channel matrix H with smaller dimension instead of an equivalent channel matrix H . It also can be seen that the percentage of computational complexity reduction of the LC-QR decomposition greatly increases when M is increasing and N is constant. This is due to the calculation of Gaussian elimination matrix inversion of H H H and H H H + σ 2 I 2 M 1 , which requires 8(M − 1)3 complex arithmetic operation, for ZF and MMSE with H respectively. Note that the calculation of Gaussian elimination matrix inversion of H H H and H H H + σ 2 I 2 M 1 does not need the information of N and it increases significantly when M becomes bigger. On the other hand, the calculation of QR decomposition with Householder reflection for H is 8 N(M − 1)2 (8/3)(M − 1)3. Since N is equal to or greater than M, the arithmetic operation of 8 N(M − 1)2 (8/3)(M − 1)3 becomes significant when M is increasing. The performance evaluations from these figures show that by using the proposed LC-QR decomposition detection mechanism in V-BLAST/STBC MIMO scheme the system performance is not only significantly improved but the computational complexity of the overall system is also significantly reduced. Hence, the computational cost and power consumption will be reduced for the next generation MIMO mobile computing devices can be reduced significantly from the reduction of computational complexity.

Figure 11
figure 11

Percentage of computational complexity reduction of LC-QR decomposition compared to ZF, MMSE and QR decomposition with different M and N for NM .

Conclusions

In this paper, it is illustrated that V-BLAST/STBC scheme, which achieves spatial multiplexing and diversity gains simultaneously, increases system capacity and maintains reliable BER performance to accommodate the ever growing demand for real time system with tolerably lower QoS. It is also shown that the system capacity of LC-QR decomposition V-BLAST/STBC scheme is close to the ideal MIMO system and better than ZF V-BLAST for SNR below 37 dB. Moreover, the BER performance of LC-QR decomposition V-BLAST/STBC is significantly better than ZF-VBLAST. The LC-QR decomposition mechanism has also significantly reduced the arithmetic operation complexity and remains a satisfactory BER performance compared to ZF, MMSE and QR decomposition mechanisms. The reduction of computational complexity in V-BLAST/STBC MIMO scheme will see a significant reduction in computational cost and power consumption for next generation MIMO mobile computing devices.