1 Introduction

Next generation mobile communication systems will face new challenges that will require more than increasing the throughput [1]. Tactile Internet [2] demands for low latency. Multitude of devices will be connected in what is being called the Internet of Things [3]. Robustness will be required to deal with large coverage areas. Low out-of-band (OOB) emission is mandatory for opportunistic use of vacant spectrum and fragmented spectrum allocation [4].

Several new multicarrier waveforms are being proposed to deal with these new requirements [1]. Generalized frequency division multiplexing (GFDM) [5] is one candidate that has the flexibility to address the upcoming demands of future networks. This modulation scheme employs K individual circularly filtered subcarriers, each one carrying M subsymbols of data. The circular filtering process keeps the overall block length of MK samples, where cyclic prefix and suffix can be applied to allow for simple frequency domain equalization [6] on the receiver side.

It is clear that any waveform for the next generation of mobile communication systems need to be compatible with multiple-input multiple-output (MIMO). Also, it needs to be easily combined with multiple access (MA) algorithms to share the physical layer (PHY) resources among several users and devices [1]. Recently, a simple approach for space-time coding GFDM has been published in [7]. However, the applied linear GFDM demodulator cannot decouple the subcarriers and subsymbols under the influence of multipath channels. Hence, when space-time coding (STC) is applied directly to the data symbols, residual multipath inter-symbol interference (ISI) leads to a severe performance loss, depending on the channel impulse response (CIR).

To overcome this problem, GFDM can be combined with time-reversal space-time coding (TR-STC) [8], to achieve full diversity gain while keeping the benefits of non-orthogonal waveforms [5]. With the application of TR-STC, the error floor due to multipath ISI is completely removed because space-time combining is performed in the signal domain rather than in the data domain. Also, the system complexity is kept low compared to more elaborate equalization techniques in the data domain [9].

The aim of this paper is to show that the resources of the TR-STC-GFDM can be shared among multiple users in the wireless up-link to achieve a TR-STC generalized frequency division multiple access (GFDMA) scheme. We show that the only requirement for this scheme is to have a band-guard subcarrier between the subchannels for each user. In fact, users with one or two transmit antennas can share the resources from the same TR-STC-GFDMA block. Also, we demonstrate that TR-STC-GFDMA tolerates timing misalignment between users with no penalty. Further, a scheduling mechanism to allocate users on certain subchannels based on their channel quality is presented, showing that the overall system performance can benefit from the knowledge of the channel state information (CSI). Additionally, the application of least-squares (LS) channel estimation for the presented multi-user scenario is contributed. Symbol error rate (SER) performance is simulated with perfect and estimated CSI, where we show that full diversity gain is achieved in both cases. The channel estimation error results in a constant performance loss; however, no other degradation is observed, showing that the channel and the timing misalignment between users can be successfully estimated and compensated with the proposed scheme.

The remainder of this paper is organized as follows: Section 2 presents the background of GFDM modulation, while Section 3 shows the principles of TR-STC. Section 4 presents GFDMA in combination with TR-STC. The performance of the proposed scheme is analyzed in Section 5 and conclusions are drawn in Section 6.

2 GFDM Background

GFDM is a block-based filtered multicarrier system where in each block N=K M complex valued data symbols are distributed in time and frequency across M subsymbols and K subcarriers. d km denotes the data symbol, taken from a -QAM constellation, that is to be transmitted on the kth subcarrier and mth subsymbol. Each d km is transmitted with a specific waveform g km [n] that is derived by a circular time-frequency shift of a prototype waveform g[n] according to

$$\begin{array}{@{}rcl@{}} g_{km}[n] = g\left[\left<n-m K\right>_{N}\right]\exp\left(-j2\pi \frac{kn}{K}\right), \end{array} $$
((1))

where n=0,1,…,N−1 is the time index of one GFDM block and <·> N is the modulo operator. g[n] is typically a raised cosine (RC) filter with roll-off α [5], although other waveforms can be used. The transmit signal x[n] of one GFDM block is the superposition of all constituting waveforms according to

$$\begin{array}{@{}rcl@{}} \vec{x} = \sum_{k=0}^{K-1}\sum_{m=0}^{M-1}\vec{g}_{km}d_{km}=\mathbf{A}\vec{d}, \end{array} $$
((2))

where \(\vec {x}\) is a column vector containing the samples of x[n], \(\vec {g}_{\textit {km}}\) is a column vector containing the samples of g km [n], and \(\vec {d}\) is a column vector containing d km as its (m K+k)th element. A is a N×N matrix, containing \(\vec {g}_{\textit {km}}\) as its (m K+k)th column.

When \(\vec {x}\) is transmitted over a multipath channel, with the help of a cyclic prefix (CP), the convolution with the (CIR) h[n] becomes circular and hence low-complex zero-forcing (ZF) frequency domain equalization (FDE) can be carried out [5], leading to the equalized signal \(\vec {\bar {y}}=\vec {x}+\vec {\bar {w}}\) at the receiver, where \(\bar {w}[n]\) is the colored noise due to channel equalization.

After equalization, convolution with a ZF receiver filter is carried out to detect the transmitted symbols. This operation can be expressed by

$$\begin{array}{@{}rcl@{}} \hat{\vec{d}}=\mathbf{B}\vec{\bar{y}}=\vec{d}+\mathbf{B}\vec{\bar{w}}, \end{array} $$
((3))

where B=A −1 and \(\hat {\vec {d}}\) contains the estimated data symbols \(\hat {d}_{\textit {km}}\) that are sent to the slicer. Note that B cancels self-interference occurring from the non-orthogonality of the transmit filter, but in turn introduces a noise-enhancement factor (NEF) ξ given by

$$\begin{array}{@{}rcl@{}} \xi = \frac{1}{N}\text{tr}(\mathbf{B}^{\text{H}}\mathbf{B}). \end{array} $$
((4))

3 Time-reversal space-time coding

TR-STC has been proposed by [8] to allow the use of STC for single carrier transmission over frequency-selective channels. The proposed approach operates on two subsequent data blocks \(\vec {x}_{i}\) and \(\vec {x}_{i+1}\) of length N which are separated by a CP. Their corresponding discrete Fourier transforms are \(\vec {X}_{(\cdot)}=\mathbf {F}\vec {x}_{(\cdot)}\), where F denotes the unitary Fourier matrix. The transmit signals on both antennas for two subsequent time slots are given by

((5))

where i is an even number and (·)H is the transpose-conjugate operator. Note that the property

$$\begin{array}{@{}rcl@{}} (\mathbf{F}\vec{X}_{i}^{*})_{n} =x_{i}^{*}[\left<-n\right>_{N}] \end{array} $$
((6))

of the discrete Fourier transform reasons the name ‘time-reversal space-time coding’.

At the receiver, after removing the CP, the transmit signals appear circularly convolved with the CIR \(\vec {h}_{j,l}\), where \(\vec {h}_{j,l}\) contains the channel taps between the jth transmit and lth receiving antenna, zero-padded to the block length. Both received blocks are transformed to the frequency domain. Accordingly, assuming the channel remains constant during the transmission of two subsequent blocks, the received blocks in the frequency domain are given by

$$\begin{array}{*{20}l} \vec{Y}_{i,l} &= \mathbf{H}_{1,l}\vec{X}_{i}-\mathbf{H}_{2,l}\vec{X}_{i+1}^{*}+\vec{W}_{1,l} \end{array} $$
((7))
$$\begin{array}{*{20}l} \vec{Y}_{i+1,l} &= \mathbf{H}_{1,l}\vec{X}_{i+1}+\mathbf{H}_{2,l}\vec{X}_{i}^{*}+\vec{W}_{2,l}, \end{array} $$
((8))

where \(\mathbf {H}_{j,l}=\text {diag}(\vec {H}_{j,i})\) with \(\vec {H}_{j,i}=\mathbf {F}\vec {h}_{j,l}\). The received signals can be combined in the frequency by

$$\begin{array}{*{20}l} \hat{\vec{X}}_{i} &= \mathbf{H}_{eq}^{-1}\sum_{l=1}^{L}\mathbf{H}_{1,l}^{*}\vec{Y}_{i,l}+\mathbf{H}_{2,l}\vec{Y}_{i+1,l}^{*}\\ \hat{\vec{X}}_{i+1} &= \mathbf{H}_{eq}^{-1}\sum_{l=1}^{L}\mathbf{H}_{1,l}^{*}\vec{Y}_{i+1,l}-\mathbf{H}_{2,l}\vec{Y}_{i,l}^{*}, \end{array} $$
((9))
$$\begin{array}{*{20}l} \text{where }\mathbf{H}_{eq} &= \sum_{j=1}^{2}\sum_{l=1}^{L}\mathbf{H}_{j,l}^{*}\mathbf{H}_{j,l} \end{array} $$
((10))

and L is the number of receiving antennas. Finally, the estimates of the transmitted blocks are acquired by inverse Fourier transform

$$\begin{array}{@{}rcl@{}} \hat{\vec{x}}_{i} =\mathbf{F}^{\text{H}}\hat{\vec{X}}_{i}. \end{array} $$
((11))

4 Multi-user STC-GFDMA

TR-STC can be directly applied to GFDM. Consider two data vectors \(\vec d_{i}\) that generate two consecutive GFDM frames

$$\begin{array}{@{}rcl@{}} \vec x_{i}=\mathbf{A}\vec d_{i}. \end{array} $$
((12))

The GFDM signals \(\vec {x}_{i}\) and \(\vec {x}_{i+1}\) can be space-time encoded as described in (5) and (9) can be used to recover the signals on the receiver side. Then, conventional GFDM demodulation with ZF receiver is carried out by

$$\begin{array}{@{}rcl@{}} \hat {\vec d}_{i} = \mathbf{B}\hat {\vec x}_{i}. \end{array} $$
((13))

In this section, we combine TR-STC GFDM with a frequency division multiple access (FDMA) technique to serve multiple users with one GFDM system. The K subcarriers are equally divided between U users, i.e., each user allocates K u =K/U adjacent subcarriers. Further, resource allocation algorithms [10] can be used to distribute the subchannels between the users. Two situations are considered in this paper. First, we assume that CSI is not available at the base station to allow for subchannel scheduling and, second, CSI is considered to allocate the subchannels to the users.

Figure 1 depicts the block diagram of the proposed TR-STC-GFDMA system.

Figure 1
figure 1

Simplified block diagram of the TR-STC-GFDMA system.

Each user u generates a GFDM signal based on two successive data vectors \(\vec {d}^{\,(u)}_{i}\), where elements corresponding to non-allocated subcarriers are set to zero. The data is modulated by A for each user and the blocks are space-time encoded as described in (5). In order to combat time misalignment between users, in addition to the CP, a cyclic suffix (CS) is added to the blocks before they are transmitted through independent frequency-selective fading channels. When using a non-orthogonal transmit filter, adjacent subcarriers of two different users interfere with each other. Since the channels for the users are independent, these boundary subcarriers cannot be equalized and high inter-carrier-interference (ICI) occurs. Therefore, one guard subcarrier is used between users to avoid mutual interference. The guard subcarrier are unnecessary when an orthogonal pulse is used because, in this case, there is no ICI.

4.1 Timing misalignment and channel estimation

When all users are synchronized with the base station (BS) clock rate and all signals arrive within the CP and CS duration at the receive antennas, space-time combining according to (9) can be carried out per user, where only the users’ allocated frequency samples are considered. However, CSI including misalignment information needs to be available at the BS. Note that timing misalignment within the CP/CS length only results in a phase rotation of the circulant channel and hence channel equalization can compensate the misalignment. The CSI can be estimated at the BS by sending a separate pilot sequence per user that is transmitted twice in the TR-STC-GFDMA codeword. The sequences are modulated using an orthogonal pulse to avoid self-interference, with K u subcarriers per user and M=1 subsymbols. The signals are time-reversal space-time encoded and CP and CS are appended before transmission.

At the BS, the received signal in the frequency domain is given by

$$\begin{array}{*{20}l} \vec{P}_{i,l} &= \mathbf{P}\vec{H}_{1,l}-\mathbf{P}^{*}\vec{H}_{2,l}+\vec{W}_{i,l} \end{array} $$
((14))
$$\begin{array}{*{20}l} \vec{P}_{i+1,l} &= \mathbf{P}\vec{H}_{1,l}+\mathbf{P}^{*}\vec{H}_{2,l}+\vec{W}_{i+1,l}, \end{array} $$
((15))

where \(\mathbf {P}=\text {diag}\left (\sum _{u=1}^{U}\vec {P}^{(u)}\right)\) with \(\vec {P}^{(u)}\) containing the discrete Fourier transform (DFT) of the pilot sequence of the uth user which is non-zero only at the allocated subcarriers and \(\vec {H}_{j,l}=\left [\vec {H}_{j,l}^{(1)^{T}}\cdots \vec {H}_{j,l}^{(U)^{T}}\right ]^{T}\) denotes the frequency response of the users’ channels including time misalignment seen at the BS. Then, the CIR is estimated by

$$\begin{array}{*{20}l} \vec{H}_{1,l} &= \frac{1}{2}\mathbf{P}^{-1}(\vec{P}_{i,l}+\vec{P}_{i+1,l}), \end{array} $$
((16))
$$\begin{array}{*{20}l} \vec{H}_{2,l} &= \frac{1}{2}(\mathbf{P}^{*})^{-1}(\vec{P}_{i+1,l}-\vec{P}_{i,l}). \end{array} $$
((17))

Note that the proposed scheme does not require all users to have two transmit antennas. Instead, using only the first transmit antenna is equivalent to have \(\vec {h}_{2,l}=\vec {0}\). In this case, the transmitted signal can still be recovered; however, according to (9), no transmit diversity gain is achieved. For example, users with cheap devices or very good channel conditions would not carry out the space-time encoding but can still be correctly received by the BS.

4.2 User scheduling

When it comes to multi-user scenarios, significant gains can be acquired by allocating each user to the subchannel that provides the best channel frequency response available. Let the available bandwidth be distributed into U subchannels, where each subchannel contains K/U subcarriers. Let the set of subcarriers of the κth subchannel for the uth user be denoted by \(\mathcal {K}_{u}\). Consider the scenario where time division duplexing (TDD) is employed and the BS is able to estimate the channel frequency response for each user. In this case, the BS can measure the quality Q u,κ of the κth subchannel of the uth user as

$$\begin{array}{@{}rcl@{}} Q_{u,\kappa} =\sum_{k\in\mathcal{K}_{u}} {~}^{(u)}\mathbf{H}_{eq,k}, \end{array} $$
((18))

where (u) H e q,k is the kth element on the diagonal of the equivalent channel of the uth user, given by (10). Notice that the upper index (u) has been used to denote the equivalent channel between the uth user and the BS.

Based on Q u,κ , the BS can schedule the best channel for the users using a priority-based scheduling algorithm. There, users are sorted according to their priorities and the user with highest priority receives the subchannel with highest quality. This subchannel is marked as unavailable for the other users and the procedure continues until the last user receives the last available subchannel.

In this paper, two simple approaches for the definition of the priority list are considered. In the first one, named ‘random selection priority’, the priorities of the users are randomly defined for each transmission, leading to a fair distribution between the users. For the second approach, named ‘higher gain priority’, the priority list is built in accordance with the channel quality from (18). In this case, the user that can experience the best channel condition is the first to receive a subchannel, which is marked as unavailable for the other users. The procedure repeats again until all subchannels are assigned to every user.

The main advantage of the higher gain priority over the random selection priority is that the former allows for the best use of the communication channel. However, higher gain priority reduces the chance of users with a single antenna to choose the subchannel first. Hence, when higher gain priority is employed, users with a single transmit antenna are expected to receive a smaller gain compared with the gain observed by users with two transmit antennas.

5 Performance analysis

The parameters used to evaluate the SER performance of the proposed system are presented in Table 1. The power delay profile of the wireless channel is shown in Table 2, which is based on the Extended Pedestrian A (EPA) model [11]. The channel taps are multiplied by i.i.d. complex Gaussian variables with variance σ 2=1, creating independent Rayleigh fading for each user and antenna. The channel remains constant during the transmission of two GFDM blocks. All users are synchronized with the base station’s master clock, i.e., no frequency misalignment occurs. The timing misalignment between users and base station is uniformly distributed between [ −3, 3] samples. Notice that the CP and CS are large enough to accommodate the channel impulse response and the time misalignment.

Table 1 Simulation parameters
Table 2 Channel power delay profile used in the simulations

An approximation of the TR-STC-GFDM SER performance under a frequency-selective fading channel can be derived from an upper bound of symbol error probability for orthogonal maximum ratio combiner ([12], Ch. 13), but considering the NEF of GFDM. The approximation is given by

$$\begin{array}{@{}rcl@{}} p_{e}\approx 4\mu\,\,\sum_{i=0}^{JL-1} \left(\begin{array}{c} JL-1+i \\ i \end{array} \right)\left(\frac{1+\eta}{2}\right)^{i}, \end{array} $$
((19))

where

$$\begin{array}{*{20}l} \mu &= \left(\frac{\sqrt{\mathcal{J}}-1}{\sqrt{\mathcal{J}}}\right)\left(\frac{1-\eta}{2}\right)^{JL}, \end{array} $$
((20))
$$\begin{array}{*{20}l} \eta &= \sqrt{\frac{\frac{3{\sigma_{e}^{2}}}{\mathcal{J}-1}\frac{E_{s}}{\xi_{0} N_{0}}}{2+\frac{3{\sigma_{e}^{2}}}{\mathcal{J}-1}\frac{E_{s}}{\xi_{0} N_{0}}}}, \end{array} $$
((21))

with E S and N 0 denoting the average symbol energy and the noise power, respectively. denotes the size of the digital constellation and

$$\begin{array}{@{}rcl@{}} {\sigma_{e}^{2}}=\sum_{n} \text{E}\left[|h_{n}|^{2}\right]. \end{array} $$
((22))

When considering the overhead of the CP of GFDM and orthogonal frequency division multiplexing (OFDM) in the effective noise calculation, we have

$$\begin{array}{@{}rcl@{}} \xi_{0,\text{OFDM}}&=& \frac{K+N_{CP}+N_{CS}}{K}\\ \xi_{0,\text{GFDM}}&=& \xi\cdot \frac{KM+N_{CP}+N_{CS}}{KM}, \end{array} $$
((23))

which shows that GFDM, depending on the NEF ξ from (4), can achieve higher spectral efficiency compared to OFDM. This is due to the fact that GFDM requires only one CP for M subsymbols, whereas OFDM uses one CP per OFDM symbol. As in [12], (19) becomes a tighter upper bound if J L≥2 and the channel frequency response is flat per subcarrier.

Figure 2 compares the SER performance of conventional STC-OFDM [13] and TR-STC-GFDMA with perfect CSI at the BS. It shows that TR-STC-GFDMA users with two transmit antennas (users 4 to 8) achieve the same diversity gain as STC-OFDM; however, due to the more efficient use of the CP, the SER curve of GFDM is shifted 0.25 dB to the left. As expected, users with one transmit antenna (users 1 to 3) do not benefit from transmit diversity gain. Note that the benefit of employing GFDM here is not the SER performance gain compared with OFDM. As described in [5], GFDM has advantage of reducing the OOB emission, increasing the flexibility by covering other orthogonal waveforms as corner cases and better use of the time-frequency resource grid. Also, following (23), the TR-STC-GFDM SER performance can be improved by reducing the impact of the CP and CS in the overall overhead. This can be achieved by increasing the number of subsymbols.

Figure 2
figure 2

SER performance of STC-OFDM and TR-STC-GFDMA.

Under the assumption of perfect CSI and knowledge of the time delay between users, it is evident that the TR-STC-GFDMA achieves full diversity. In a more realistic scenario, the BS needs to estimate the CSI and delay between users. However, if the time misalignment between users is smaller than the CP and CS length, the resulting effect is only the introduction of a phase rotation in the spectrum of the received signals. Under this condition, the BS only needs to estimate the overall equivalent channels for each user.

Figure 3 shows the SER performance with the proposed channel estimation algorithm. When a 3-dB pilot boost is used, the performance is approximately 1.8 dB worse compared to perfect CSI but no further performance degradation in diversity is apparent.

Figure 3
figure 3

SER performance TR-STC-GFDMA considering the channel estimation.

The knowledge of the CSI at the BS side can improve the overall system performance by allocating the best available channel to a given user, in accordance with the channel quality presented in (18). Figure 4 presents the SER performance when random selection priority is employed to define the priority list. The performance gain for all users is clear. Users with single transmit antenna can observe a performance improvement of approximately 6 dB, while users with two transmit antennas can benefit from an approximately 4-dB gain, when compared with the performance achieved with fixed subchannel allocation (Figure 2).

Figure 4
figure 4

SER performance TR-STC-GFDMA assuming random selection priority.

Figure 5 presents the achieved SER performance when the higher gain priority is used to build the priority list. As expected, users with two transmit antennas can benefit more from the subchannel allocation algorithm, resulting in a 5-dB performance gain, compared to the fixed subchannel allocation. However, users with a single transmit antenna are more likely to have lower priority, meaning they have a smaller probability of choosing a high-quality subchannel. This results in a 3-dB gain when compared with the performance achieved with fixed subchannel allocation.

Figure 5
figure 5

SER performance TR-STC-GFDMA assuming higher gain priority.

6 Conclusions

A multiple access MIMO scheme is a mandatory requirement for the next generation of mobile communication systems. GFDM, which is a recent filtered multicarrier scheme, is flexible and can address the main requirements imposed by future networks. In this paper, it has been shown that GFDM can be easily integrated with TR-STC to achieve full diversity gain under frequency-selective fading channels. The paper also proposes an FDMA scheme, in order to share the subcarriers of the TR-STC-GFDM signal among multiple users. Additionally, we present the application of a simple scheduling algorithm to distribute the available subchannels among the users. Two approaches have been considered, and the SER performance has been analyzed. The use of the CSI to assign the available subchannels to the users significantly improves the overall SER performance; however, the approach used to define the priority list impacts the observed gain. Timing misalignment between the signals received from multiple users can be compensated at the BS by a frequency domain equalizer when the overall misalignment and the channel length are within the CP and CS. When considering channel estimation, performance loss only occurs due to imperfect CSI at the BS, but no diversity loss can be observed. Finally, it is shown that the resources of the TR-STC-GFDMA block can be shared between users having one or two transmit antennas. Besides the loss of the diversity gain, no other consequence is observed for the users with only a single transmit antenna.