1 Introduction

Differential modulation has been regarded as an attractive solution to improve the spectral efficiency with pilot signal elimination at the transmitter. The potential goal of differential modulation is achieved in fast-varying fading channels. It is difficult to obtain the channel state information (CSI) between transceivers in those channels but differential modulation can be adopted without channel estimations. Due to this property, differential modulation is used for timing and frequency synchronization in order to initiate the frame decoding [1].

Recently, multiple-input multiple-output (MIMO) wireless systems have been developed to increase total reliability and spectral efficiency [2]. In particular, the advantages of MIMO systems, such as capacity and diversity improvements are compared with single-input single-output (SISO) system adopting the combination of the differential modulation and an orthogonal space time block code (STBC) [35]. Unfortunately, however, employing a large number of antennas has been considered as an impractical direction for small mobile terminals due to its limited physical hardware size. Thus, the distributed diversity (or relay-assisted diversity) topology has aroused great interests for an alternative solution to overcome such shortcomings [6, 7]. The diversity gain is organizationally obtained by joint processing using received signals from the distributed relays and transmitter. However, we note that most of prior works for cooperative system have been built upon perfect CSI transmission (coherent transmission). Furthermore, to the best of our knowledge, distributed differential modulation with quasi-orthogonal space-time block coded (D-QOSTBC) system has not achieved full diversity.

In this paper, we present a distributed D-QOSTBC (DD-QOSTBC) system with an amplify-and-forward (AF) mode for multiple half-duplex relaying (HDR) system. Especially, we consider a single frequency relaying system shown in Fig. 1a which assumes that the source (S) has two antennas and each of N relays (R) has one antenna and the destination (D) has a single antenna. Here, we firstly propose a novel robust STBC-like subsystems which preserve orthogonality on both effective channel and symbol matrices to reduce receiver complexity without performance degradations. With the robust subsystems, we propose the efficient joint suboptimal differential decoding (JSDD) utilizing a maximum likelihood (ML) criterion. This JSDD rapidly updates both error distances of the detected DD-QOSTBC symbols and additional information to calculate weighting factor. The weighting factor is combined with additional signals efficiently. Finally, we evaluate the performance of the proposed DD-QOSTBC system to compare with that of the distributed coherent QOSTBC (DC-QOSTBC) system, and verify the system’s validity.

Fig. 1
figure 1

DD-QOSTBC system consisted of source (S), relays (R), and destination (D); a transceiver topology, b DD-QOSTBC transmission format

2 A Signal Model for Efficient DD-QOSTBC System

In this section, we address a fundamental framework of multiple HDR system to realize the efficient differential encoding and decoding for the DD-QOSTBC system. First, we briefly describe Protocol I and III to achieve STBC diversity [7]: In Protocol I, the source communicates with the relays and destination during Time-slot 1. And the relays and source communicate with the destination during Time-slot 2. Protocol III has similar process with Protocol I but \(S \rightarrow D\) link during Time-slot 1 is not considered. In the followings, Fig. 1a shows that the overall block diagram of DD-QOSTBC for multiple relay networks consists of one S, N distributed HDRs, and single D. The modulated data passes QOSTBC generator and four symbols of the kth block time \(s^k_1, s^k_2, s^k_3\), and \(s^k_4\) construct an QOSTBC matrix pattern as shown in (8). Then, \(4 \times 4\) QOSTBC is divided into two blocks and differential modulation is performed with elements separator for symbol transmission. The constructed transmission blocks from each antenna are illustrated in Fig. 1b. It is noted that we omit the process of symbol-based differential modulation in this section because Sect. 4 will discuss the differential process at the destination side. As in Fig. 1b, differentially modulated symbols for the kth block time in each antennas consists of eight symbols that 16 time slots are required to transmit QOSTBC codes successfully. Throughout this paper, we assume that all links have a Rayleigh distribution of \({\mathcal{C}}{\mathcal{N}}\sim (0,\sigma ^2_{h})\).

2.1 Transmission Procedures with Protocol III

(1) During Time-slot 1 As shown in Fig. 1a, the S transmits the symbols \(s_1^k\) and \(s_2^k\) to the nth relay. The received signal \(r^{k}_{n}\) from two antennas of S can be expressed as

$$r^{k}_{n} = {\sqrt{E_{sr}}}h^{k}_{sr,1n}s^{k}_1+{\sqrt{E_{sr}}}h^{k}_{sr,2n}s^{k}_2+n^{k}_{R,n},$$
(1)

where \(E_{sr}\) denotes the average power available at R and \(h^{k}_{sr,in}\) denotes a channel gain between the ith transmit antenna of S and the nth R. It is assumed that the additive noise of nth R, \(n^{k}_{R,n}\) has the Gaussian distribution of \({\mathcal{C}}{\mathcal{N}}\sim (0,\sigma _{n}^{2})\) where \(E[n^{k}_{R,n}n^{k\dag }_{R,n}] = \sigma _{n}^{2}\). \((\cdot )^{\dag }\) denotes hermitian operation. The received signal of (1) is normalized at each relay for the AF relaying transmission as

$$\tilde{r}^{k}_{n} = {\frac{1}{\eta}}\left( {\sqrt{E_{sr}}}h^{k}_{sr,1n}s^{k}_1+{\sqrt{E_{sr}}}h^{k}_{sr,2n}s^{k}_2+n^{k}_{R,n}\right),$$
(2)

where the factor \(\eta ={\sqrt{{2E_{sr}\sigma _{s}^{2}+\sigma _{n}^{2}}}}\) is a unit power at the nth R, and \(\sigma _{s}^{2}\) is an approximated symbol variance for differential transmission [8].

(2) During Time-slot 2 S transmits symbols \(s^k_3\) and \(s^k_4\) to D while each relays forwards the signal (2) to D. As a result, the received signal from both S and R can be expressed as

$$y^{k}_{1}= \sum ^N_{n=1}\tilde{r}^{k}_{n}h^{k}_{rd,n}+{\sqrt{E_{sd}}}\left( h^{k}_{sd,1}s^{k}_3+h^{k}_{sd,2}s^{k}_4\right) +n^{k}_{D2},$$
(3)

where N is the number of distributed relays, and \(h^{k}_{rd,n}\) denotes a channel gain between the nth R and D. \(n^{k}_{D2}\) is an additive white Gaussian noise (AWGN) with \({\mathcal{C}}{\mathcal{N}}\sim (0,\sigma _{n}^{2})\). (3) can be rewritten as

$$y^{k}_{1} =\zeta ^{k}_{1}s^k_1+\zeta ^{k}_{2}s^k_2+\zeta ^{k}_{3}s^k_3+\zeta ^{k}_{4}s^k_4+n^k_1,$$
(4)

where the effective channels \(\zeta ^k_i\) (for \(i=1,2,3,4\)) are specifically defined in (5), and the effective noise \(n^k_1\) is specified as \(n^k_i={\sqrt{E_{rd}}}/\eta \left( \sum ^N_{n=1}n^{(k+i-1)}_{R,1}h^{k}_{rd,n}\right) +n^{(k+i-1)}_{D2}\) (for \(i=1\)).

$${\mathbf{h}}^{k} =\left[ \begin{array}{cccc} \underbrace{{\frac{\sqrt{E_{sr}E_{rd}}}{\eta}}\sum ^N_{n=1}\left( h^{k}_{sr,1n}h^{k}_{rd,n}\right) }_{\zeta ^k_1} &{}\underbrace{{\frac{\sqrt{E_{sr}E_{rd}}}{\eta}}\sum ^N_{n=1}\left( h^{k}_{sr,2n}h^{k}_{rd,n}\right) }_{\zeta ^k_2}&{}\underbrace{{\sqrt{E_{sd}}}h^{k}_{sd,1}}_{\zeta ^k_3}&{}\underbrace{{\sqrt{E_{sd}}}h^{k}_{sd,2}}_{\zeta ^k_4} \end{array} \right]$$
(5)

For the destination signal modeling, let us assume that the perfect D-QOSTBC is transmitted referring to Fig. 1b. In the followings, we can easily obtain the received signals at D during the kth block time as

$$\begin{aligned}y^{k}_{1}&=\zeta ^{k}_{1}s^k_1+\zeta ^{k}_{2}s^k_2+\zeta ^{k}_{3}s^k_3+\zeta ^{k}_{4}s^k_4+n^k_1\\ y^{k}_{2}&=-\zeta ^{k}_{1}s^{k*}_2+\zeta ^{k}_{2}s^{k*}_1-\zeta ^{k}_{3}s^{k*}_4+\zeta ^{k}_{4}s^{k*}_3+n^k_2\\ y^{k}_{3}&=\zeta ^{k}_{1}s^k_3+\zeta ^{k}_{2}s^k_4+\zeta ^{k}_{3}s^k_1+\zeta ^{k}_{4}s^k_2+n^k_3\\ y^{k}_{4}&=-\zeta ^{k}_{1}s^{k*}_4+\zeta ^{k}_{2}s^{k*}_3-\zeta ^{k}_{3}s^{k*}_2+\zeta ^{k}_{4}s^{k*}_1+n^k_4. \end{aligned}$$
(6)

Furthermore, (6) can be rewritten in matrix form as

$${\mathbf{y}}^{k}={\mathbf{h}}^{k}{\mathbf{S}}^{k}+{\mathbf{n}}^{k}$$
(7)

where \({\mathbf{y}}^{k} = [y^{k}_{1}, y^{k}_{2}, y^{k}_{3}, y^{k}_{4}]\) and \({\mathbf{h}}^{k}\) is the effective channel vector defined in (5). The effective noise is \({\mathbf{n}}^{k} = [n^{k}_{1},n^{k}_{2},n^{k}_{3},n^{k}_{4}]\), and the symbol matrix \({\mathbf{S}}^{k}\) is

$$\begin{aligned} {\mathbf{S}}^{k} = \left[ \begin{array}{cccc} s^{k}_1&{}-s^{k*}_2&{}s^{k}_3&{}-s^{k*}_4\\ s^{k}_2&{}s^{k*}_1&{}s^{k}_4&{}s^{k*}_3\\ s^{k}_3&{}-s^{k*}_4&{}s^{k}_1&{}-s^{k*}_2\\ s^{k}_4&{}s^{k*}_3&{}s^{k}_2&{}s^{k*}_1 \end{array}\right] , \end{aligned}$$
(8)

where \((\cdot )^{*}\) denotes conjugate operation.

From (6), it is noted that the traditional D-QOSTBC transmission can be readily extended to a distributed fashion similar to \(4 \times 1\) multi-input single-output point to point communication in single frequency network.

2.2 Transmission Procedures with Protocol I

The received signals during Time-slot 2 of Protocol I are same as (6). On the other hands, Protocol I provides additional information in every Time-slot 1. We will investigate those additional signal by exploiting the proposed robust STBC-like subsystems in the next section. Let us consider the additional received signal \(y^k_{d}\) which is transmitted from S to D during Time-slot 1:

$$y^{k}_{d,1} = {\sqrt{E_{sd}}}h^{k}_{sd,1}s^{k}_1+{\sqrt{E_{sd}}}h^{k}_{sd,2}s^{k}_2+n^{k}_{D1,1}$$
(9)

where \(E_{sd}\) denotes the average power available at D. Similarly to (1), \(h^{k}_{sd,i}\) is the channel gain between the ith transmit antenna of S and D, and \(n^{k}_{D1,1}\) is an AWGN with \({\mathcal{C}}{\mathcal{N}}\sim (0,\sigma _{n}^{2})\). Assuming that the complete D-QOSTBC codes are transmitted, the additional signals of (9) can be expressed as

$$\begin{aligned}y^{k}_{d,1}&={\sqrt{E_{sd}}}h^{k}_{sd,1}s^{k}_1+{\sqrt{E_{sd}}}h^{k}_{sd,2}s^{k}_2+n^{k}_{D1,1}\\ y^{k}_{d,2}&=-{\sqrt{E_{sd}}}h^{k}_{sd,1}s^{k*}_{2}+{\sqrt{E_{sd}}}h^{k}_{sd,2}s^{k*}_1+n^{k}_{D1,2}\\ y^{k}_{d,3}&={\sqrt{E_{sd}}}h^{k}_{sd,1}s^{k}_3+{\sqrt{E_{sd}}}h^{k}_{sd,2}s^{k}_4+n^{k}_{D1,3}\\ y^{k}_{d,4}&=-{\sqrt{E_{sd}}}h^{k}_{sd,1}s^{k*}_4+{\sqrt{E_{sd}}}h^{k}_{sd,2}s^{k*}_3+n^{k}_{D1,4} \end{aligned}$$
(10)

We can see that (10) does not have all elements of QOSTBC block, and it merely have path coefficients \(h^{k}_{sd,1}\) and \(h^{k}_{sd,2}\).

3 Destination Structure for Efficient DD-QOSTBC System

In the following, we discuss differential decoding based on the conventional subsystem based on [10]. With simple manipulations of \(y_1^k, y_2^k, y_3^k\) and \(y_4^k\), we obtain

$$\begin{aligned}y_1^k+y_3^k =&(\zeta ^k_1+\zeta ^k_3)(s^k_1+s^k_3)\\&+(\zeta ^k_2+\zeta ^k_4)(s^k_2+s^k_4)+n_1^k+n_3^k\\ y_2^k+y_4^k =&-(\zeta ^k_1+\zeta ^k_3)(s^{k*}_2+s^{k*}_4)\\&+(\zeta ^k_2+\zeta ^k_4)(s^{k*}_1+s^{k*}_3)+n_2^k+n_4^k\\ y_1^k-y_3^k =&(\zeta ^k_1-\zeta ^k_3)(s^k_1-s^k_3)\\&+(\zeta ^k_2-\zeta ^k_4)(s^k_2-s^k_4)+n_1^k-n_3^k\\ y_2^k-y_4^k =&-(\zeta ^k_1-\zeta ^k_3)(s^{k*}_2-s^{k*}_4)\\&+(\zeta ^k_2-\zeta ^k_4)(s^{k*}_1-s^{k*}_3)+n_2^k-n_4^k. \end{aligned}$$
(11)

From (11), we can make two equivalent subsystems as follows

$$\begin{aligned} {\mathbf{y}}^{k}_{C1}&={\mathbf{h}}^{k}_{1}{\mathbf{S}}^{k}_{1}+{\mathbf{n}}^{k}_{1}\\ {\mathbf{y}}^{k}_{C2}&={\mathbf{h}}^{k}_{2}{\mathbf{S}}^{k}_{2}+{\mathbf{n}}^{k}_{2}, \end{aligned}$$
(12)

where we define subscripts C1 and C2 as the first and second conventional subsystems, respectively. Effective vectors are represented as \({\mathbf{y}}^{k}_{C1} = [y_1^k+y_3^k, y_2^k+y_4^k]\), \({\mathbf{y}}^{k}_{C2} = [y_1^k-y_3^k, y_2^k-y_4^k]\), \({\mathbf{h}}^{k}_{1} = [\zeta _1^k+\zeta _3^k, \zeta _2^k+\zeta _4^k]\), \({\mathbf{h}}^{k}_{2} = [\zeta _1^k-\zeta _3^k, \zeta _2^k-\zeta _4^k]\), \({\mathbf{n}}^{k}_{1} = [n_1^k+n_3^k, n_2^k+n_4^k]\), and \({\mathbf{n}}^{k}_{2} = [n_1^k-n_3^k, n_2^k-n_4^k]\). \({\mathbf{S}}^{k}_{1}\) and \({\mathbf{S}}^{k}_{2}\) are defined as

$$\begin{aligned} {\mathbf{S}}^{k}_{1}&= \left[ \begin{array}{cc} s^{k}_1+s^{k}_3&{}-(s^{k*}_2+s^{k*}_4)\\ s^{k}_2+s^{k}_4&{}s^{k*}_1+s^{k*}_3 \end{array}\right] \\ {\mathbf{S}}^{k}_{2}&= \left[ \begin{array}{cc} s^{k}_1-s^{k}_3&{}-(s^{k*}_2-s^{k*}_4)\\ s^{k}_2-s^{k}_4&{}s^{k*}_1-s^{k*}_3 \end{array}\right]. \end{aligned}$$
(13)

3.1 Proposed Robust STBC-Like Subsystems

3.1.1 Robust STBC-Like Subsystems for Protocol III

Let us define the proposed robust STBC-like subsystems by mapping \({\mathbf{y}}^{k}_{C1}\) to STBC-like format [9]:

$${\mathbf{Y}}^{k}_{1}={\mathbf{H}}^{k}_{1}{\mathbf{S}}^{k}_{1}+{\mathbf{N}}^{k}_{1},$$
(14)

where \({\mathbf{S}}^{k}_{1}\) is defined in (13). The matrices of \({\mathbf{Y}}^{k}_{1}, {\mathbf{H}}^{k}_{1}\), and \({\mathbf{N}}^{k}_{1}\) are

$$\begin{aligned} {\mathbf{y}}^{k}_{C1}\, \mapsto \, {\mathbf{Y}}^{k}_{1}&= \left[ \begin{array}{cc} y^{k}_{1}+y^{k}_{3}&{}y^{k}_{2}+y^{k}_{4}\\ -(y^{k*}_{2}+y^{k*}_{4})&{}y^{k*}_{1}+y^{k*}_{3} \end{array}\right] \\ {\mathbf{h}}^{k}_{1} \, \mapsto \, {\mathbf{H}}^{k}_{1}&= \left[ \begin{array}{cc} \zeta ^{k}_1+\zeta ^{k}_3&{}\zeta ^{k}_2+\zeta ^{k}_4\\ -(\zeta ^{k*}_2+\zeta ^{k*}_4)&{}\zeta ^{k*}_1+\zeta ^{k*}_3 \end{array}\right] \\ {\mathbf{n}}^{k}_{1} \, \mapsto \, {\mathbf{N}}^{k}_{1}&= \left[ \begin{array}{cc} n^{k}_1+n^{k}_3&{}n^{k}_2+n^{k}_4\\ -(n^{k*}_2+n^{k*}_4)&{}n^{k*}_1+n^{k*}_3 \end{array}\right], \end{aligned}$$
(15)

where \(\mapsto\) stands for mapping function. Similarly, the second robust STBC-like subsystem can be formulated by reformulating (11) with (\(y^{k}_{1}-y^{k}_{3}\)) and (\(y^{k}_{2}-y^{k}_{4}\)) of \({\mathbf{y}}^{k}_{C2}\) as follows

$${\mathbf{Y}}^{k}_{2}={\mathbf{H}}^{k}_{2}{\mathbf{S}}^{k}_{2}+{\mathbf{N}}^{k}_{2},$$
(16)

where \({\mathbf{S}}^{k}_{2}\) is defined in (13), the matrices of \({\mathbf{Y}}^{k}_{2}, {\mathbf{H}}^{k}_{2}\), and \({\mathbf{N}}^{k}_{2}\) are

$$\begin{aligned}{\mathbf{y}}^{k}_{C2} \, \mapsto \, {\mathbf{Y}}^{k}_{2}&= \left[ \begin{array}{cc} y^{k}_{1}-y^{k}_{3}&{}y^{k}_{2}-y^{k}_{4}\\ -(y^{k*}_{2}-y^{k*}_{4})&{}y^{k*}_{1}-y^{k*}_{3} \end{array}\right] \\ {\mathbf{h}}^{k}_{2} \, \mapsto \, {\mathbf{H}}^{k}_{2}&= \left[ \begin{array}{cc} \zeta ^{k}_1-\zeta ^{k}_3&{}\zeta ^{k}_2-\zeta ^{k}_4\\ -(\zeta ^{k*}_2-\zeta ^{k*}_4)&{}\zeta ^{k*}_1-\zeta ^{k*}_3 \end{array}\right] \\ {\mathbf{n}}^{k}_{2} \, \mapsto \, {\mathbf{N}}^{k}_{2}&= \left[ \begin{array}{cc} n^{k}_1-n^{k}_3&{}n^{k}_2-n^{k}_4\\ -(n^{k*}_2-n^{k*}_4)&{}n^{k*}_1-n^{k*}_3 \end{array}\right]. \end{aligned}$$
(17)

From (14) and (16), it is noticed that both (\({\mathbf{H}}^{k}_1\), \({\mathbf{H}}^{k}_2\)) and (\({\mathbf{S}}^{k}_1\), \({\mathbf{S}}^{k}_2\)) matrices preserve the orthogonal property, which is important for the efficient differential decoding of (29) and (30). The equivalent property between the DD-QOSTBC system and robust STBC-like subsystems are readily proven, by extracting sequentially (1,1) and (1,2) elements of \({\mathbf{B}}^{k}_1\) and \({\mathbf{B}}^{k}_2\) which are defined as \({\mathbf{B}}^{k}_1=({\mathbf{Y}}^{k}_{1}+{\mathbf{Y}}^{k}_{2})/2\) and \({\mathbf{B}}^{k}_2=({\mathbf{Y}}^{k}_{1}-{\mathbf{Y}}^{k}_{2})/2\).

3.1.2 Robust STBC-Like Subsystems for Protocol I

Here, it is assumed that manipulation for robust STBC-like subsystem is also performed with the additional information of (10). We can also observe the equivalent property from the signal (10) in every Time-slot 1. With manipulations of \(y^{k}_{d,1}, y^{k}_{d,2}, y^{k}_{d,3}\) and \(y^{k}_{d,4}\), the first additional robust STBC-like subsystem can be expressed as

$${\mathbf{Y}}^{k}_{a,1}={\mathbf{H}}^{k}_{a}{\mathbf{S}}^{k}_{1}+{\mathbf{N}}^{k}_{a,1},$$
(18)

where \({\mathbf{S}}^{k}_{1}\) is expressed in (13). The matrices of \({\mathbf{Y}}^{k}_{a,1}\), \({\mathbf{H}}^{k}_{a}\), and \({\mathbf{N}}^{k}_{a,1}\) are defined, respectively, as

$$\begin{aligned} {\mathbf{Y}}^{k}_{a,1}&= \begin{bmatrix}{} y^{k}_{d,1}+y^{k}_{d,3}&y^{k}_{d,2}+y^{k}_{d,4}\\ -(y^{k*}_{d,2}+y^{k*}_{d,4})&y^{k*}_{d,1}+y^{k*}_{d,3} \end{bmatrix} \\ {\mathbf{H}}^{k}_{a}&= \begin{bmatrix}{} {\sqrt{E_{sd}}}h^{k}_{sd,1}&{\sqrt{E_{sd}}}h^{k}_{sd,2}\\ -{\sqrt{E_{sd}}}h^{k*}_{sd,2}&{\sqrt{E_{sd}}}h^{k*}_{sd,1} \end{bmatrix} \\ {\mathbf{N}}^{k}_{a,1}&= \begin{bmatrix}{} n^{k}_{D1,1}+n^{k}_{D1,3}&n^{k}_{D1,2}+n^{k}_{D1,4}\\ -(n^{k*}_{D1,2}+n^{k*}_{D1,4})&n^{k*}_{D1,1}+n^{k*}_{D1,3} \end{bmatrix}. \end{aligned}$$
(19)

The second robust STBC-like subsystem is similarly given by

$${\mathbf{Y}}^{k}_{a,2}={\mathbf{H}}^{k}_{a}{\mathbf{S}}^{k}_{2}+{\mathbf{N}}^{k}_{a,2},$$
(20)

where \({\mathbf{S}}^{k}_{2}\) is expressed in (13). It is noticed that \({\mathbf{H}}^{k}_{a}\) is identically defined as in (19) while \({\mathbf{Y}}^{k}_{a,2}\) and \({\mathbf{N}}^{k}_{a,2}\) are

$$\begin{aligned} {\mathbf{Y}}^{k}_{a,2}&= \begin{bmatrix}{} y^{k}_{d,1}-y^{k}_{d,3}&y^{k}_{d,2}-y^{k}_{d,4}\\ -(y^{k*}_{d,2}-y^{k*}_{d,4})&y^{k*}_{d,1}-y^{k*}_{d,3} \end{bmatrix} \\ {\mathbf{N}}^{k}_{a,2}&= \begin{bmatrix}{} n^{k}_{D1,1}-n^{k}_{D1,3}&n^{k}_{D1,2}-n^{k}_{D1,4}\\ -(n^{k*}_{D1,2}-n^{k*}_{D1,4})&n^{k*}_{D1,1}-n^{k*}_{D1,3} \end{bmatrix}. \end{aligned}$$
(21)

From (18) to (21), the effective channel matrix \({\mathbf{H}}^{k}_{a}\) only contains the channels of \(S \rightarrow D\) link, while effective channel matrices (\({\mathbf{H}}^{k}_{1}\), \({\mathbf{H}}^{k}_{2}\)) contain the channels of both \(S \rightarrow D\) and \(S \rightarrow R \rightarrow D\) links. Therefore, \({\mathbf{H}}^{k}_{a}\) can not provide cooperative diversity. However, the advantage of our subsystem is that the surplus signals \({\mathbf{Y}}^{k}_{a,1}\) and \({\mathbf{Y}}^{k}_{a,2}\) can be combined with \({\mathbf{Y}}^{k}_{1}\) and \({\mathbf{Y}}^{k}_{2}\) by multiplying weighting factors, since \(({\mathbf{Y}}^{k}_{1}, {\mathbf{Y}}^{k}_{2})\) and \(({\mathbf{Y}}^{k}_{a,1}, {\mathbf{Y}}^{k}_{a,2})\) have the same shapes. We define those shapes as matrix-wise sets.

4 Differential Encoding and Decoding for DD-QOSTBC System

In this section, we present the efficient DD-QOSTBC encoding and decoding procedures for the aforementioned robust STBC-like subsystems.

4.1 DD-QOSTBC Encoding

In Fig. 1a, differential encoding for each symbol is equivalently treated in the first robust STBC-like subsystems of (14) and (16). Therefore, we can apply the general D-STBC encoding rule to these two subsystems as

$${\mathbf{S}}^{k}_{1}={\frac{1}{u^{k-1}_{1}}}{\mathbf{S}}^{k-1}_{1}{\mathbf{X}}^{k}_{1},$$
(22)

where \(u^{k-1}_{1}={\sqrt{|x^{k-1}_1+x^{k-1}_3|^2+|x^{k-1}_2+x^{k-1}_4|^2}}\) is a normalization factor. The transmit symbol matrix of \({\mathbf{X}}^{k}_{1}\) is defined as

$$\begin{aligned} {\mathbf{X}}^{k}_{1} = \left[ \begin{array}{cc} x^{k}_{1}+x^{k}_{3}&{} \ \ -(x^{k*}_{2}+x^{k*}_{4})\\ x^{k}_{2}+x^{k}_{4}&{} \ \ \ x^{k*}_{1}+x^{k*}_{3} \end{array}\right], \end{aligned}$$
(23)

where the linear combination rule of data symbols \(x^{k}_{k}\) (for \(k=1,2,3,4\)) follows the formulation rule of (11). (\(x^k_1\), \(x^k_2\)) are the cardinality of phase-shift-keying (PSK) constellation set \({\mathcal{V}}_1\), i.e., \({x_1,x_2}\in {\mathcal{V}}_1\). In contrast, (\(x^k_3\), \(x^k_4\)) are the rotated (offset) cardinality of the PSK constellation set \({\mathcal{V}}_2\), i.e., \({x_3,x_4}\in {\mathcal{V}}_2\), and these two constellation sets are defined to prevent the sum of elements of \({\mathbf{X}}^{k}_1\) becoming a zero matrix [10]. Similarly with (22), the differential encoding for the second robust STBC-like subsystem for (18) and (20) can be expressed as

$${\mathbf{S}}^{k}_{2}={\frac{1}{u^{k-1}_{2}}}{\mathbf{S}}^{k-1}_{2}{\mathbf{X}}^{k}_{2}$$
(24)

where \(u^{k-1}_{2}={\sqrt{|x^{k-1}_1-x^{k-1}_3|^2+|x^{k-1}_2-x^{k-1}_4|^2}}\) and the transmitted symbol matrix \({\mathbf{X}}^{k}_{2}\) is given by

$$\begin{aligned} {\mathbf{X}}^{k}_{2} = \left[ \begin{array}{cc} x^{k}_{1}-x^{k}_{3}&{} \ -(x^{k*}_{2}-x^{k*}_{4})\\ x^{k}_{2}-x^{k}_{4}&{} \ x^{k*}_{1}-x^{k*}_{3} \end{array}\right]. \end{aligned}$$
(25)

4.2 DD-QOSTBC Decoding for Protocol III

We present the proposed ML-based differential decoding scheme with no CSI. It is assumed that the channels are constant over the transmission of the two consecutive block time, i.e., \({\mathbf{H}}^{k-1}_{1}={\mathbf{H}}^{k}_{1}\), \({\mathbf{H}}^{k-1}_{2}={\mathbf{H}}^{k}_{2}\) and \({\mathbf{H}}^{k-1}_{a}={\mathbf{H}}^{k}_{a}\). For the comprehensive understanding, we first consider the differential decoding for \({\mathbf{Y}}^{k}_{1}\) and \({\mathbf{Y}}^{k}_{2}\) (see (14) and (16)), where \({\mathbf{S}}^{k}_{1}\) and \({\mathbf{S}}^{k}_{2}\) are encoded according to the rules of (22) and (24). With the equivalent property between the DD-QOSTBC system and two robust STBC-like subsystems, the proposed ML-based differential decoding which eliminates the necessity of CSI is defined as

$$\begin{aligned}\hat{\mathbf{X}}^{k}_{P3,12}= \, &\underset{{x_1,x_2}\in {\mathcal{V}}_1}{{\text {argmin}}}\left\| {\mathbf{V}}^{k}_1-{\frac{1}{2}}\left( {\mathbf{Y}}^{(k-1)\dag }_{1}{\mathbf{Y}}^{k}_{1} +{\mathbf{Y}}^{(k-1)\dag }_{2}{\mathbf{Y}}^{k}_{2}\right) \right\| ^2\\ \hat{\mathbf{X}}^{k}_{P3,34}= \, &\underset{{x_3,x_4}\in {\mathcal{V}}_2}{{\text {argmin}}}\left\| {\mathbf{V}}^{k}_2-{\frac{1}{2}}\left( {\mathbf{Y}}^{(k-1)\dag }_{1}{\mathbf{Y}}^{k}_{1} -{\mathbf{Y}}^{(k-1)\dag }_{2}{\mathbf{Y}}^{k}_{2}\right) \right\| ^2, \end{aligned}$$
(26)

where the reference symbol matrices \({\mathbf{V}}^{k}_1\) and \({\mathbf{V}}^{k}_2\) are

$$\begin{aligned} {\mathbf{V}}^{k}_1=\left[ \begin{array}{cc} c_{1}^k &{} -c_{2}^{k*} \\ c_{2}^k &{} c_{1}^{k*} \end{array}\right] ,\quad {\mathbf{V}}^{k}_2=\left[ \begin{array}{cc} c_{3}^k &{} -c_{4}^{k*} \\ c_{4}^k &{} c_{3}^{k*} \end{array}\right]. \end{aligned}$$
(27)

Here, it is noticed that the correlation between \({\mathbf{H}}^{(k-1)}_{i}\) and \({\mathbf{H}}^{k}_{i}\) can be perfectly decoupled from the orthogonal properties as

$$\begin{aligned}\Vert {\mathbf{H}}^{k}_{1}\Vert ^2&= {\mathbf{H}}^{(k-1)\dag }_{1}{\mathbf{H}}^{k}_{1}\\&=\left( \left( \zeta ^{k}_1+\zeta ^{k}_3\right) ^2+\left( \zeta ^{k}_2+\zeta ^{k}_4\right) ^2\right) {\mathbf{I}}_2 = \left( \beta _1^k\right) ^2{\mathbf{I}}_2\\ \Vert {\mathbf{H}}^{k}_{2}\Vert ^2&= {\mathbf{H}}^{(k-1)\dag }_{2}{\mathbf{H}}^{k}_{2}\\&=\left( \left( \zeta ^{k}_1-\zeta ^{k}_3\right) ^2+\left( \zeta ^{k}_2-\zeta ^{k}_4\right) ^2\right) {\mathbf{I}}_2 = \left( \beta _2^{k}\right) ^2{\mathbf{I}}_2. \end{aligned}$$
(28)

The channel gains in (28) become real value, thus (26) can be rewritten as

$$\begin{aligned} \hat{\mathbf{X}}^{k}_{P3,12}=\underset{{x_1,x_2}\in {\mathcal{V}}_1}{{\text {argmin}}}\left[ \begin{array}{cc} \left| c_{1}^k -\left( \lambda ^k x_{1}^k+\dot{n}^k_1 \right) \right| ^2 &{} \star \\ \left| c_{2}^k -\left( \lambda ^k x_{2}^k+\dot{n}^k_2 \right) \right| ^2 &{} \star \end{array}\right] \end{aligned}$$
(29)
$$\begin{aligned} \hat{\mathbf{X}}^{k}_{P3,34}=\underset{{x_3,x_4}\in {\mathcal{V}}_2}{{\text {argmin}}}\left[ \begin{array}{cc} \left| c_{3}^k -\left( \lambda ^k x_{3}^k+\dot{n}^k_3 \right) \right| ^2 &{} \star \\ \left| c_{4}^k -\left( \lambda ^k x_{4}^k+\dot{n}^k_4 \right) \right| ^2 &{} \star \end{array}\right], \end{aligned}$$
(30)

where \(\lambda ^k={\frac{(\beta ^k_1)^2u_2^{k-1}+(\beta ^k_2)^2u_1^{k-1}}{2u_1^{k-1}u_2^{k-1}}}\). ‘\(\star\)’ means ‘this element does not need to be involved in decoding’, that this notation is caused by the manipulation of left columnFootnote 1. The noise terms \(\dot{n}^k_1, \dot{n}^k_2, \dot{n}^k_3\), and \(\dot{n}^k_4\) are \(\dot{n}^k_1={\frac{1}{2}}\left( s_1^{k-1}\left( \beta ^{k-1}_1 \right) ^2n_1^k+n_1^{(k-1)*}\left( \beta ^{k}_1\right) ^2s_1^k+\sigma _n^2 \right)\), \(\dot{n}^k_2={\frac{1}{2}}\left( s_2^{k-1}\left( \beta ^{k-1}_1 \right) ^2n_2^k+n_2^{(k-1)*}\left( \beta ^{k}_1\right) ^2s_2^k+\sigma _n^2 \right)\), \(\dot{n}^k_3={\frac{1}{2}}\left( s_3^{k-1}\left( \beta ^{k-1}_2 \right) ^2n_3^k+n_3^{(k-1)*}\left( \beta ^{k}_2\right) ^2s_3^k+\sigma _n^2 \right)\), and \(\dot{n}^k_4={\frac{1}{2}}\left( s_4^{k-1}\left( \beta ^{k-1}_2 \right) ^2n_4^k+n_4^{(k-1)*}\left( \beta ^{k}_2\right) ^2s_4^k+\sigma _n^2 \right)\), respectively.

$$\begin{aligned} \hat{\mathbf{X}}^{k}_{P1,12}= \, & {} \underset{{x_1,x_2}\in {\mathcal{V}}_1}{{\text {argmin}}}\left\| {\mathbf{V}}^{k}_1-{\frac{1}{2}}\left( (1-\omega ^k) \left( {\mathbf{Y}}^{(k-1)\dag }_{1}{\mathbf{Y}}^{k}_{1}+{\mathbf{Y}}^{(k-1)\dag }_{2} {\mathbf{Y}}^{k}_{2}\right) \right. \right. \\&\left. \left. + \, \omega ^k \left( {\mathbf{Y}}^{(k-1)\dag }_{a,1}{\mathbf{Y}}^{k}_{a,1}+{\mathbf{Y}}^{(k-1)\dag }_{a,2}{\mathbf{Y}}^{k}_{a,2}\right) \right) \right\| ^2 \\= \, & {} \underset{{x_1,x_2}\in {\mathcal{V}}_1}{{\text {argmin}}}\left[ \begin{array}{cc} \left| c_{1}^k -\left( {\frac{(1-w^k)\left( \left( \beta ^k_1\right) ^2+\left( \beta ^k_3\right) ^2\right) u_2^{k-1}+w^k\left( \left( \beta ^k_2 \right) ^2+\left( \beta ^k_3 \right) ^2\right) u_1^{k-1}}{2u_1^{k-1}u_2^{k-1}}}x_{1}^k+\ddot{n}^k_1 \right) \right| ^2 &{} \star \\ \left| c_{2}^k -\left( {\frac{(1-w^k)\left( \left( \beta ^k_1\right) ^2+\left( \beta ^k_3\right) ^2\right) u_2^{k-1}+w^k\left( \left( \beta ^k_2\right) ^2+\left( \beta ^k_3\right) ^2\right) u_1^{k-1}}{2u_1^{k-1}u_2^{k-1}}}x_{2}^k+\ddot{n}^k_2 \right) \right| ^2 &{} \star \end{array}\right] \end{aligned}$$
(31)
$$\begin{aligned} \hat{\mathbf{X}}^{k}_{P1,34}= \, & {} \underset{{x_3,x_4}\in {\mathcal{V}}_2}{{\text {argmin}}}\left\| {\mathbf{V}}^{k}_2-{\frac{1}{2}}\left( (1-\omega ^k) \left( {\mathbf{Y}}^{(k-1)\dag }_{1}{\mathbf{Y}}^{k}_{1}-{\mathbf{Y}}^{(k-1)\dag }_{2} {\mathbf{Y}}^{k}_{2} \right) \right. \right. \\&\left. \left. + \, \omega ^k \left( {\mathbf{Y}}^{(k-1)\dag }_{a,1}{\mathbf{Y}}^{k}_{a,1}-{\mathbf{Y}}^{(k-1)\dag }_{a,2}{\mathbf{Y}}^{k}_{a,2}\right) \right) \right\| ^2 \\= \, & {} \underset{{x_3,x_4}\in {\mathcal{V}}_2}{{\text {argmin}}}\left[ \begin{array}{cc} \left| c_{3}^k -\left( {\frac{(1-w^k)\left( \left( \beta ^k_1\right) ^2+\left( \beta ^k_3\right) ^2\right) u_2^{k-1}+w^k\left( \left( \beta ^k_2\right) ^2+(\beta ^k_3)^2\right) u_1^{k-1}}{2u_1^{k-1}u_2^{k-1}}}x_{3}^k+\ddot{n}^k_3 \right) \right| ^2 &{} \star \\ \left| c_{4}^k -\left( {\frac{(1-w^k)\left( \left( \beta ^k_1\right) ^2+\left( \beta ^k_3\right) ^2\right) u_2^{k-1}+w^k\left( \left( \beta ^k_2\right) ^2+\left( \beta ^k_3\right) ^2\right) u_1^{k-1}}{2u_1^{k-1}u_2^{k-1}}}x_{4}^k+\ddot{n}^k_4 \right) \right| ^2 &{} \star \end{array}\right] \end{aligned}$$
(32)

4.3 DD-QOSTBC Decoding for Protocol I (Joint Suboptimal Differential Decoding)

From (29) and (30), we present the proposed JSDD which combines the matrix-wise outputs (\({\mathbf{Y}}^{k}_{1}\), \({\mathbf{Y}}^{k}_{2}\)) and (\({\mathbf{Y}}^{k}_{a,1}\), \({\mathbf{Y}}^{k}_{a,2}\)) based on robust STBC-like subsystems. The overall JSDD procedures associated with the detection of \(x_1^k, x_2^k, x_3^k\) and \(x_4^k\) symbols, are represented as following steps.

  1. 1.

    The original symbols are detected from the outputs \({\mathbf{Y}}^{k}_{1}\) and \({\mathbf{Y}}^{k}_{2}\) by using (29) and (30).

  2. 2.

    Through a similar approach with step 1), the same symbols are also detected from the outputs \({\mathbf{Y}}^{k}_{a,1}\) and \({\mathbf{Y}}^{k}_{a,2}\) by using (29) and (30). Here, we obtain channel gains \(\Vert {\mathbf{H}}^{k}_{a}\Vert ^2 = E_{sd}\left( (h_{sd,1}^k)^2+(h_{sd,2}^k)^2\right) {\mathbf{I}}_2=(\beta ^k_3)^2{\mathbf{I}}_2\).

  3. 3.

    Calculate the sum of error distances in terms of \(x_1^k, x_2^k, x_3^k\) and \(x_4^k\) from \(\left( {\mathbf{Y}}^{k}_{1}, {\mathbf{Y}}^{k}_{2}\right)\) and \(\left( {\mathbf{Y}}^{k}_{a,1}, {\mathbf{Y}}^{k}_{a,2}\right)\). We can define relative weighting factor in (33).

  4. 4.

    As a result, the original symbols experiencing different links can be combined to improve the system performance. As a suboptimal approach, we consider the ML-based JSDD scheme as (31) and (32).

In (32), weighting factor is defined as

$$w^k = {\frac{\sum \nolimits _{i=1}^4\left| c^k_i-2y^{(k-1)\dagger }_{d,i}y^{k}_{d,i}\right| ^2}{\sum \nolimits _{i=1}^4\left( \left| c^k_i-2y^{(k-1)\dagger }_{d,i}y^{k}_{d,i}\right| ^2+\left| c^k_i-2y^{(k-1)\dagger }_iy^{k}_i\right| ^2\right)}},$$
(33)

and effective noise terms are \(\ddot{n}^k_1=(1-w^k)\dot{n}^k_1+{\frac{w^k}{2}}\left( s_1^{k-1}\left( \beta ^{k-1}_3\right) ^2n_{D1,1}^k+n_{D1,1}^{(k-1)*}\left( \beta ^{k}_3\right) ^2s_1^k+\sigma _n^2 \right)\), \(\ddot{n}^k_2=(1-w^k)\dot{n}^k_2+{\frac{w^k}{2}}\left( s_2^{k-1}\left( \beta ^{k-1}_3\right) ^2n_{D1,2}^k+n_{D1,2}^{(k-1)*}\left( \beta ^{k}_3\right) ^2s_2^k+\sigma _n^2 \right)\), \(\ddot{n}^k_3=(1-w^k)\dot{n}^k_3+{\frac{w^k}{2}}\left( s_3^{k-1}\left( \beta ^{k-1}_3\right) ^2n_{D1,3}^k+n_{D1,3}^{(k-1)*}\left( \beta ^{k}_3\right) ^2s_3^k+\sigma _n^2 \right)\), and \(\ddot{n}^k_4=(1-w^k)\dot{n}^k_4+{\frac{w^k}{2}}\left( s_4^{k-1}\left( \beta ^{k-1}_3\right) ^2n_{D1,4}^k+n_{D1,4}^{(k-1)*}\left( \beta ^{k}_3\right) ^2s_4^k+\sigma _n^2 \right)\), respectively.

5 Performance Evaluations

In this section, we accomplish the performance evaluations for the proposed DD-QOSTBC systems. For verifying the gain of the proposed system, we have considered various QOSTBC systems:

  • DD-QOSTBC (P3): Protocol III equipped with the ML-based differential decoding of (26)

  • DD-QOSTBC-JSDD (P1): Protocol I equipped with the proposed JSDD of (31) and (32)

  • DD-QOSTBC-MRC (P1): Protocol I equipped with a (differential) maximum ratio combining (MRC) scheme

  • DD-QOSTBC (Direct): Protocol I only considering the received differential signals during Time-slot 1

  • DC-QOSTBC (P3)

  • DC-QOSTBC-MRC (P1)

  • DC-QOSTBC (Direct)

It is assumed that an identity matrix \({\mathbf{I}}_2\) is transmitted to initiate the DD-QOSTBC decoding in the beginning of transmission. It is also assumed that all of associated links are modeled as a Rayleigh channel. The frames consist of QPSK symbols, and a offset for \({\mathcal{V}}_2\) is \(\pi /4\). The number of R, N is 2, and ML detection is performed. The symbol matrices \({\mathbf{S}}^{k}_{1}\) and \({\mathbf{S}}^{k}_{2}\) satisfy the power constraint \(E[|s^{k}_{1}+s^{k}_{3}|^2+|s^{k}_{2}+s^{k}_{4}|^2]=E[|s^{k}_{1}-s^{k}_{3}|^2+|s^{k}_{2}-s^{k}_{4}|^2]=2\) and each symbol has unit variance.

Figure 2 depicts the bit-error-rate (BER) performances of various QOSTBC systems as a function of signal-to-noise (SNR), when \(E_{sr}/\sigma _{n, R}=20\) dB. It is firstly observed that the DC-QOSTBC systems outperform the DD-QOSTBC systems, of which result can be expected from a traditional communication system [3]. It is found that the DD-QOSTBC (P1) systems outperform the DD-QOSTBC (P3) systems as SNR increases because Protocol I provides additional information during Time-slot 1. In addition, it is also found that the proposed DD-QOSTBC-JSDD (P1) outperforms than the DD-QOSTBC-MRC (P1), and this result reveals the robustness of the proposed JSDD scheme as well as its higher efficiency.

Fig. 2
figure 2

BER performances of various DD-QOSTBC systems as a function of SNR when \(E_{sr}/\sigma _{n, R}=20\) dB

Figure 3 depicts the BER performances of QOSTBC systems as a function of signal-to-noise (SNR), when \(E_{sr}/\sigma _{n, R}=5\) dB. The error floor effect is observed due to the limited SNR at R. As \(S\rightarrow D\) does not affect from SNR of R, the performance of DC-QOSTBC (Direct) is same in both Figs. 2 and 3. This tendency is also observed in the DD-QOSTBC (Direct) case.

Fig. 3
figure 3

BER performances of various DD-QOSTBC systems as a function of SNR when \(E_{sr}/\sigma _{n, R}=5\) dB

6 Conclusions

In this paper, we have presented the efficient robust DD-QOSTBC for multiple relay networks, especially the JSDD scheme for Protocol I. In addition, it is shown that the DD-QOSTBC system can be readily applied to Protocol III. In the performance evaluations, we have verified the benefits of the proposed DD-QOSTBC systems by comparing with the DC-QOSTBC systems.