1 Introduction

In next-generation wireless communication networks, ubiquitous coverage and high data rate are strongly required. To achieve this goal, incorporating orthogonal frequency division multiple access (OFDMA) technology with emerging relaying technologies is highly investigated by both academia and industry. Specifically, OFDMA is able to combat the frequency-selective multi-path fading and flexible in applying dynamic radio resource allocation (RA) for performance improvement. On the other hand, the relaying technology has an attractive feature of coverage extension and data rate improvement [1].

For relay-aided OFDM(A) systems, the authors in [2] and [3] have proposed two efficient types of relaying, namely amplify and forward (AF) and decode and forward (DF). Recently, the DF relaying is receiving a lot of interest due to its simple processing at the relay. With DF relaying, symbols are transmitted in two time slots (TSs). During the first TS, the source broadcasts symbols on all subcarriers with the relay keeping quiet. During the second TS, except from the relay, the source might also broadcast symbols on subcarriers not used by the relay, as will be elaborated later. Adopting DF relaying, the authors in [423] have studied the RA problems for downlink OFDMA intensively.

In particular, the works in [47] have considered RA in OFDM systems with fixed relaying (FR) when the destination cannot or hardly hear from the source, meaning the source-to-destination (S-D) link is unavailable. Considering the case where the S-D link is available, the authors in [823] have studied RA in systems with opportunistic relaying (OR), sometimes termed as selection relaying. To start with, a low spectrum efficiency (LSE) protocol was studied in [813], when only the relay broadcasts symbols during the second TS. Specifically, each transmission is carried out in two TSs using either the relay-aided mode or the direct mode. As for a relay-aided transmission, during the first TS, a symbol is first broadcast by the base station (BS) at a subcarrier k. Both the relay station (RS) and the targeted mobile station (MS) receive this symbol. Then, during the second TS, the RS decodes the received symbol and retransmits it to the targeted MS at a subcarrier l. Note that subcarrier l might not be the same as subcarrier k, which means subcarrier pairing (SP) can be implemented [17, 24]. Finally, the MS chooses to decode the source symbol from either the received symbols during both TSs using some combination methods or the received symbol during the second TS only. For a direct mode transmission, the MS only uses what is received from the BS. Note that adopting this LSE protocol, the BS keeps quiet with unpaired subcarriers unused during the second TS, which causes a waste of the limited spectrum resource. To address this issue, the authors in [1422] have proposed and studied improved high spectrum efficiency (HSE) protocols, which allow new symbols to be transmitted on the unpaired subcarriers during the second TS. In this paper, the HSE protocol will be considered.

Note that all these papers model the cochannel interference (CCI) as additive background noise and only consider RA in single-cell situations. This is reasonable only when the frequency reuse factor 1/W is low, where W denotes the number of cell clusters that cannot use the same frequencies for transmission. However, in next-generation cellular systems, aggressive frequency reuse is recommended due to its ability to achieve higher system capacity [25]. Here, the CCI becomes a key factor affecting the system performance and thus cannot be ignored [26].

Considering the CCI in multi-cell DF relay-aided OFDMA systems, RA algorithms have been proposed in [27] and [28] when powers are uniformly allocated to all stations. Considering subcarrier allocation and power allocation jointly, the papers [26] and [29] have recently formulated and solved the sum rate maximized RA problem in multi-cell OFDMA downlink systems aided by DF RSs in each cell. However, fairness was not considered there. Recently in [30], the maximization of the weighted sum of per cell min-rate (WSMR) was considered for multi-cell OFDMA systems without relaying. The corresponding optimum RA leads to per-cell maximum fairness and different priorities in different cells. The authors only illustrate the per-cell fairness by experiments without mathematical proof.

Compared with the above existing works, the contributions of the current paper are listed below:

  • We formulate the WSMR maximized joint RA problem in multi-cell OFDMA downlink systems aided by a RS in each cell. To start with, the DF protocol with OR and HSE is considered without signal combining. It will be shown that the system performance can be enhanced largely by using OR and the HSE protocol. Note that when modeling the inter-cell CCI of a subcarrier in a selected cell, instead of using an additional integer variable to indicate whether a node in an interfering cell transmits data on this subcarrier or not, we use the corresponding power value to do it. This choice is motivated to simplify the system sum rate expression and facilitate the algorithm design.

  • We unveil and prove the per-cell maximum fairness feature of the formulated RA problem. Specifically, with any feasible mode selection and subcarrier assignment (MSSA) result, the optimum power allocation results in the same transmission rates for all users in the same cell.

  • We propose an iterative RA algorithm to optimize the MSSA and the power allocation (PA) alternatively with the WSMR keeping increasing. Each iteration is composed of the MSSA stage and the PA stage. During the MSSA stage, the original problem is decoupled into mixed integer linear programs (MILPs), which can be solved by typical MILP solvers. To solve the MILPs more efficiently in polynomial time, we further propose the randomized rounding-based MSSA (RR-MSSA) algorithm and the direct rounding-based MSSA (DR-MSSA) algorithm at this stage. During the PA stage, an algorithm based on single condensation and geometric programming PA (SCGP) is used to optimize PA with the tentative MSSA results. The convergence and the per-cell user fairness of the proposed RA algorithm are proven.

The rest of this paper is organized as follows: First, the considered system model is presented in the next section. Then, the RA problem is formulated and its fairness property is discussed in Section 3. After that, the proposed algorithms are described in Section 4. Furthermore, the effectiveness and convergence of the proposed RA algorithms as well as the benefits of using OR and the HSE protocol are illustrated by numerical experiments in Section 5. Finally, some conclusions are drawn in Section 6.

2 System description

Let us consider a cellular downlink OFDMA system with N cells coordinated by a central controller for RA. In each cell, the data is transmitted from a source to U destinations with the help of a DF relay. For each link, the frequency-selective channel is transformed into K parallel subchannels by using OFDM with sufficiently long cyclic prefix. The data transmission is carried out in two TSs choosing either relay-aided mode or direct mode. During the first TS, a symbol is first broadcast by the BS at a subcarrier and received by the RS and the MS. In relay-aided mode, the RS decodes the received symbol and relays it to the targeted MS over this subcarrier with the BS keeping quiet at the subcarrier during the second TS. The destination only decodes the symbol received during the second TS. If direct transmission is used, the targeted destination decodes the symbol received during the first TS. Also, another symbol is broadcast by the source at this subcarrier during the second TS, which is received and decoded by this destination.

With OFDMA, each subcarrier can be allocated to only one destination in each cell. In this paper, we first assume that the coherence time of each link is sufficiently long for implementing the RA in the central controller, which has perfect knowledge of the system channel state information. Moreover, we assume that the optimized RA can be correctly distributed to all nodes. Lastly, perfect timing and carrier synchronization is assumed. Note that an upper bound on the system performances is obtained by assuming the above idealities.

2.1 Transmission procedure

Let us describe the data transmission procedure inside one selected cell n, which is impaired by cochannel interference from the other cells. The transmission procedures in other cells are the same. Specifically in cell n, symbols are transmitted in either relay-aided mode or direct mode, as will be elaborated in the following.

We first consider the relay-aided data transmission. The source s n first produces a symbol P s n , t 1 k x s n , t 1 k at subcarrier k during the first TS, while the transmitter of the relay r n remains idle. Here, x s n , t 1 k denotes the normalized symbol (meaning E | x s n , t 1 k | 2 =1) transmitted by s n at subcarrier k during the first TS, and P s n , t 1 k denotes the corresponding transmit power. Simultaneously in an interfering cell n, a symbol P s n , t 1 k x s n , t 1 k is also produced from the interfering source s n at the same subcarrier. Note that instead of using an additional integer variable to indicate whether s n transmits data on subcarrier k or not, we use P s n , t 1 k to do so. Specifically, P s n , t 1 k >0 means that s n uses subcarrier k for data transmission during the first TS, and P s n , t 1 k =0 means that s n does not transmit at the subcarrier k during the first TS. This choice is motivated to simplify the system sum rate expression and facilitate the algorithm design. At the end of the first TS, the signal received by r n for subcarrier k can be expressed as

y r n k = P s n , t 1 k h s n , r n k x s n , t 1 k + v r n k + n = 1 , n n N P s n , t 1 k h s n , r n k x s n , t 1 k
(1)

where v r n k denotes the additive white Gaussian noise (AWGN) at subcarrier k and r n during the first TS. h s n , r n k denotes the channel frequency response (CFR) for subcarrier k from s n to r n .

During the second TS, the relay r n reencodes the decoded symbol and forwards P r n k x r n k at a subcarrier k. Here, x r n k = x s n , t 1 k . The source s n transmits nothing on this subcarrier, meaning that P s n , t 2 k =0. Here, P r n k and P s n , t 2 k denote the transmit power allocated to r n and s n , respectively, at subcarrier k during the second TS. At the same time, in an interfering cell n, r n and s n also transmit P r n k x r n k and P s n , t 2 k x s n , t 2 k at subcarrier k. Remembering that at most only one power value out of P r n k and P s n , t 2 k can be non-zero. More specifically, when subcarrier k of cell n is assigned, the relay mode for data transmission P s n , t 2 k =0. Otherwise, P r n k =0. Here again E | x r n k | 2 =1 and E | x s n , t 2 k | 2 =1. At the end of the second TS, the signal received by the targeted destination d u n at subcarrier k can be expressed as

y d un , t 2 k = P r n k h r n , d un k x r n k + v d un , t 2 k + n = 1 , n n N P s n , t 2 k h s n , d un k x s n , t 2 k + n = 1 , n n N P r n k h r n , d un k x r n k
(2)

where v d un , t 2 k denotes the AWGN corrupting d u n at subcarrier k during the second TS, h r n , d un k denotes the CFR for subcarrier k from r n to d u n , and h s n , d un k denotes the CFR of subcarrier k from s n to d u n .

Let us now describe the direct data transmission in cell n. During the first TS, s n broadcasts P s n , t 1 k x s n , t 1 k at subcarrier k. The targeted destination d u n receives signals from all sources. The signal received by d u n at subcarrier k can be expressed as

z d un , t 1 k = P s n , t 1 k h s n , d un k x s n , t 1 k + v d un , t 1 k + n = 1 , n n N P s n , t 1 k h s n , d un k x s n , t 1 k
(3)

where v d un , t 1 k denotes the AWGN corrupting d u n at subcarrier k during the first TS.

During the second TS, another symbol P s n , t 2 k x s n , t 2 k is broadcast by s n at subcarrier k and received by the destination d u n . The received signal can be expressed as

z d un , t 2 k = P s n , t 2 k h s n , d un k x s n , t 2 k + v d un , t 2 k + n = 1 , n n N P s n , t 2 k h s n , d un k x s n , t 2 k + n = 1 , n n N P r n k h r n , d un k x r n k .
(4)

2.2 Weighted sum of per cell minimum rate

In order to formulate the WSMR, we now introduce binary variables a un k and b un k to describe the mode selection and subcarrier assignment in both TSs. To be more specific, a un k =1 indicates that subcarrier k is allocated for data transmission to d u n in direct mode during the first TS, and so is subcarrier k to d u n during the second TS. b un k =1 indicates that subcarrier k is allocated for data transmission to d u n aided by r n .

We assume v r n k , v d un , t 2 k , v d un , t 1 k are independent zero-mean circular Gaussian random variables with the same variance σ2. After some mathematical calculations, the sum rate achievable for the direct mode subcarrier k is given by

R un , 1 k = ln 1 + Υ d un , t 1 k + ln 1 + Υ d un , t 2 k
(5)

in nats/two-TSs, where

Υ d un , t 1 k = P s n , t 1 k G s n , d un k f d un , t 1 k
(6)

denotes the SINR associated with the decoding of x s n , t 1 k from y d un , t 1 k at d u n during the first TS. f d un , t 1 k = σ 2 + n = 1 , n n N P s n , t 1 k G s n , d un k denotes the sum power of the AWGN and the interference received by d u n at subcarrier k during the first TS. G s n , d un k =| h s n , d un k | 2 denotes the channel gain of subcarrier k from s n to d u n .

Υ d un , t 2 k = P s n , t 2 k G s n , d un k f d un , t 2 k
(7)

denotes the SINR associated with the decoding of x s n , t 2 k from y d un , t 2 k at d u n during the second TS. f d un , t 2 k = σ 2 + n = 1 , n n N P s n , t 2 k G s n , d un k + n = 1 , n n N P r n k G r n , d un k denotes the sum power of the AWGN and the interference received by d u n at subcarrier k during the second TS. G r n , d un k =| h r n , d un k | 2 denotes the channel gain of subcarrier k from r n to d u n .

Also, the maximum achievable rate for subcarrier k when allocated to d u n in relay-aided mode is given by [3]

R un , 2 k = min ln 1 + Γ r n k , ln 1 + Γ d un , t 2 k
(8)

in nats/two-TSs, where

Γ r n k = P s n , t 1 k G s n , r n k f r n k
(9)

denotes the signal-to-interference-plus-noise ratio (SINR) associated with decoding x s n , t 1 k from y r n k at r n during the first TS, f r n k = σ 2 + n , n n P s n , t 1 k G s n , r n k denotes the sum power of the AWGN and the interference received by r n at subcarrier k during the first TS, and G s n , r n k =| h s n , r n k | 2 denotes the channel gain of subcarrier k from s n to r n .

Γ d un , t 2 k = P r n k G r n , d un k f d un , t 2 k
(10)

denotes the SINR associated with decoding x r n k , which equals x s n , t 1 k , from y d un , r n k at d u n during the second TS.

Thus, the WSMR of the considered system is denoted as

R P , A , B = n = 1 N ω n min u U n k = 1 K a un k R un , 1 k P k + k = 1 K b un k R un , 2 k P k ,
(11)

where P k = P 1 k T , , P N k T T , P n k = P s n , t 1 k , P s n , t 2 k , P r n k T , A= a un k , and B= b un k . ω n denotes the weight of the minimal rate in cell n, and U n denotes the MS set in cell n.

3 Problem formulation and its property

3.1 Problem formulation

We consider maximizing the WSMR under per-cell total power constraints. The optimization variables are the transmission mode for each subcarrier, the subcarrier assignments, and the power allocations at the sources and the relays. According to the system model, the considered RA problem can be formulated as

max P , A , B R P , A , B s.t. C 1 : u = 1 U a un k + u = 1 U b un k 1 , n , k , C 2 : a un k { 0 , 1 } , b un k { 0 , 1 } , u , n , k , C 3 : k = 1 K P s n , t 1 k + k = 1 K P s n , t 2 k + k = 1 K P r n k P t , n , n , C 4 : P s n , t 1 k 0 , P s n , t 2 k 0 , P r n k 0 , n , k , C 5 : P s n , t 1 k 1 u = 1 U a un k u = 1 U b un k = 0 , n , k , C 6 : P s n , t 2 k 1 u = 1 U a un k = 0 , n , k , C 7 : P r n k 1 u = 1 U b un k = 0 , n , k.
(12)

Here, C 1 and C 2 ensure that each subcarrier k during the first TS can select only one mode (direct/relay-aided) to transmit data towards only one destination d u n . Moreover, C 3 and C 4 ensure that the consumed sum power for each cell is less than its available sum power. This type of power constraints give an upper bound of the system performance. In practice, each node (source, relay) in each cell will have an individual power constraint. Finally, C 5, C 6, and C 7 guarantee that no data is transmitted on an unused subcarrier, and subcarrier k is used by only one node (either the source or the relay) in each cell during the second TS.

3.2 Fairness property

To investigate the fairness property of the formulated problem, let us denote by R u n the sum rate of each user u in each cell n, U n the set of users in cell n, U ~ n U n the set of users with the minimum rate in cell n, V ~ n the set of other users (the complimentary set of U ~ n ), and S ũn ( S v ~ n ) the set of subcarriers allocated to MS d ũn ( d v ~ n ), where ũ U ~ n ( v ~ V ~ n ). Note that V ~ n means that

min v ~ V ~ n R v ~ n > min ũ U ~ n R ũn = min u U n R un .
(13)

Then, for each subcarrier k, there exists three states considering which one of the two user sets does the allocated user belong to in each cell. Specifically, state 1 corresponds to k S ũn ,n, meaning that in each cell, subcarrier k is allocated to a user with the minimum rate of that cell. State 2 corresponds to k S v ~ n ,n, meaning that in each cell, subcarrier k is allocated to a user with a rate larger than the minimum rate of that cell. State 3 corresponds to ∃ n1,n2 such that k S ũ n 1 and k S v ~ n 2 , meaning other possible allocation results for subcarrier k in all cells. Note that when all subcarriers are in state 1, per-cell user fairness is reached, meaning all users inside the same cell obtain the same rate.

We first consider the following lemma, which will be used later in this section.

Lemma 1. Given any feasible values of variables A, B and P, if there exists n and k where V ~ n and k S v ~ n (meaning that there exists subcarrier k in state 2 or state 3), then part of the feasible powers at this subcarrier k in each cell( P n k = P s n , t 1 k , P s n , t 2 k , P r n k and P n k , n n) can be saved while keeping the WSMR of the system non-decreased.

Proof. Considering the subcarrier k in cell n where V ~ n and k S v ~ n , we now introduce two sets of cells N 1 k ={n|k S ũn } and N 2 k ={n| V ~ n ,k S v ~ n }. Note that when k is in state 2, N 1 k = and N 2 k . Also, when k is in state 3, N 1 k and N 2 k .

Let us consider the case when k is in state 3. We define

P ~ n k ( s k ) = s k P n k n N 2 k P n k n N 1 k
(14)

where sk∈(0,1], R ~ un k P ~ k ( s k ) = a un k R un , 1 k P ~ k s k + b un k R un , 2 k P ~ k s k , and P ~ k s k = P ~ 1 k s k T , , P ~ N k s k T T . Please note that sk is the same for all cells.

n N 2 k , Υ d v ~ n , t 1 k can be denoted as

s k P s n , t 1 k G s n , d v ~ n k σ 2 + n N 1 k P s n , t 1 k G s n , d v ~ n k + n n , n N 2 k s k P s n , t 1 k G s n , d v ~ n k ,
(15)

which can be further written as

P s n , t 1 k G s n , d v ~ n k σ 2 s k + n N 1 k P s n , t 1 k G s n , d v ~ n k s k + n n , n N 2 k P s n , t 1 k G s n , d v ~ n k .
(16)

It is obvious that Υ d v ~ n , t 1 k is an increasing function of sk. Similarly, Υ d v ~ n , t 2 k , Γ r n k , and Γ d v ~ n , t 2 k are also increasing functions of sk. Thus, R ~ v ~ n k P ~ k s k is an increasing function of sk.

n N 1 k , Υ d ũn , t 1 k can be denoted as

P s n , t 1 k G s n , d ũn k σ 2 + n n , n N 1 k P s n , t 1 k G s n , d ũn k + n N 2 k s k P s n , t 1 k G s n , d ũn k .
(17)

It is obvious that Υ d ũn , t 1 k is a decreasing function of sk. Similarly, Υ d ũn , t 2 k , Γ r n k , and Γ d ũn , t 2 k are also decreasing functions of sk. Thus, R ~ ũn k P ~ k s k is a decreasing function of sk.

Note that P ~ k s k P ~ k (1)= P k , where the inequality should be understood as elementwise. Thus, s 0 k (0,1), ε n k >0, η n k >0, such that n N 2 k , R ~ v ~ n k P ~ k s 0 k = R ~ v ~ n k P ~ k ( 1 ) ε n k = R v ~ n k P k ε n k , R ~ v ~ n = R v ~ n ε n k > min u U n R un and n N 1 k , R ~ ũ n k P ~ k s 0 k = R ũ n k P ~ k ( 1 ) + η n k = R ũ n k P k + η n k , R ~ ũ n = R ũ n + η n k > min u U n R u n . Note that the WSMR of the system is non-decreased, while the power 1 s 0 k P n k is saved from subcarrier k in cell n N 2 k .

Let us introduce R n min = min u U n R un ,n. After substituting P n k with P ~ n k ( s 0 k ), we now have n N 2 k ,k S v ~ n , R ~ v ~ n > R n min and n N 1 k ,k S ũn , R ~ ũn > R n min . Let us define P ̆ n k c k = c k P n k where ck∈(0,1). Thus, Υ d un , t 1 k can be denoted as

P s n , t 1 k G s n , d un k σ 2 c k + n n P s n , t 1 k G s n , d un k .
(18)

It is obvious that Υ d un , t 1 k is an increasing function of ck. Similarly, Υ d un , t 2 k , Γ r n k , and Γ d un , t 2 k are also increasing functions of ck. Thus, R ̆ un k P ̆ k c k is an increasing function of ck. There exists a c 0 k (0,1), α n k >0 such that n, R ̆ un k P ̆ k c 0 k = R ̆ un k P ̆ k ( 1 ) α n k = R un k P k α n k = R ~ un k P ~ k s 0 k α n k , R ̆ un = R ~ un α n k > R n min . The power 1 c 0 k P ~ n k s 0 k can be saved from subcarrier k in each cell n while keeping the WSMR of the system non-decreased.

We now consider the case when k is in state 2, meaning n N 2 k ,n. By using the previous derivations, we can still find s 1 k (0,1), such that the power 1 s 1 k P n k can be saved from subcarrier k in each cell n while keeping the WSMR of the system non-decreased. This concludes the proof of Lemma 1.

We now state the main result of this section. Based on Lemma 1, the following theorem is proposed, which sheds light on the fairness property of the formulated problem.

Theorem 1. With any feasible values A=A0 and B=B0, the optimum power allocation Popt of the formulated problem results in the same transmission rates for all users in the same cell.

Proof. When A=A0 and B=B0, we assume at the optimum, ∃ n0 where V ~ n 0 and a subcarrier k 0 S v ~ n in state 2 or state 3. Then, n,k S ũn where ũ U ~ n , subcarrier k can be in either state 3 or state 1.

When subcarrier k is in state 3, as discussed in the proof of Lemma 1, we can still improve R ũn k by decreasing the interfering powers of subcarrier k in cell n while keeping the minimum user rate of cell n non-decreased.

When subcarrier k is in state 1, as at the optimum ∃ k0k that is in state 2 or state 3, we can save power from P n k 0 in each cell n while keeping the WSMR of the system non-decreased. Let us denote the saved power in each cell n as Δ P n and define P ̂ n k ( t k )= t k P n k where tk>1. Thus, Υ d ũn , t 1 k can be denoted as

P s n , t 1 k G s n , d ũn k σ 2 t k + n n , n N 1 k P s n , t 1 k G s n , d ũn k .
(19)

It is obvious that Υ d ũn , t 1 k is an increasing function of tk. Similarly, Υ d ũn , t 2 k , Γ r n k , and Γ d ũn , t 2 k are also increasing functions of tk. Thus, for the subcarrier k in state 1, R ~ ũn k P ̂ k t k is an increasing function of tk. Thus, there exists a t 0 k >1 such that R ~ ũn k P ̂ k t 0 k > R ũn k and k K ̂ t 0 k 1 P n k Δ P n . Here, K ̂ denotes the set of subcarriers in state 1.

Therefore, k S ũn in cell n, R ũn k can still be improved. Thus, the minimum user rate of each cell can still be improved by adjusting power allocation. Specifically, we can find matrices S,C,T,P1=S C T Popt, according to the previous discussions, such that R(P1,A0,B0)>R(Popt,A0,B0). Here, each element of S,C, and T takes value from (0,1],(0,1], and [1,), respectively. This obviously contradicts the optimum assumption and concludes the proof of Theorem 1.

4 Algorithm development

In this section, we will propose an iterative RA algorithm, which obtains a local optimum of the formulated problem after convergence. The proposed algorithm is based on the coordinate ascent (CA) approach, which divides the whole set of optimization variables into several sets of variables and iteratively optimizes each set of variables with other sets of variables fixed.

Specifically, variables {P,A,B} are divided into two sets: the integral variable set {A,B} and the continuous variable set {P}. During each iteration, the MSSA stage is first carried out followed by the PA stage. Let us introduce integer m to indicate the iteration number. At the MSSA stage of iteration m, the integral variables {A,B} are optimized with P=Pm−1. While at the PA stage of iteration m, the continuous variables {P} are optimized with A=Am and B=Bm. Here, a variable with the superscript m denotes the value obtained at the end of iteration m. In the following, we first design algorithms for both stages, based on which the iterative RA algorithm will then be stated. Finally, several characteristics of the proposed RA algorithm will be discussed.

4.1 MSSA optimization

In this subsection, the MSSA stage for iteration m is considered. After setting P to Pm−1, the formulated problem can be rewritten as

max A , B R P m 1 , A , B s.t. C 1 , C 2 , C 5 , C 6 , C 7 .
(20)

Note that each cell has independent constraints for its local integral variables. Therefore problem (20) can be decoupled into N subproblems. As for subproblem n0, the minimum rate of all users in the cell n0 is maximized subject to the local constraints. Specifically, subproblem n0 is formulated as

max A n 0 , B n 0 R n 0 P m 1 , A n 0 , B n 0 s.t. C 1 : u = 1 U a u n 0 k + u = 1 U b u n 0 k 1 , k , C 2 : a u n 0 k { 0 , 1 } , b u n 0 k { 0 , 1 } , u , k , C 5 : P s n 0 , t 1 k , m 1 1 u = 1 U a u n 0 k u = 1 U b u n 0 k = 0 , k , C 6 : P s n 0 , t 2 k , m 1 1 u = 1 U a u n 0 k = 0 , k , C 7 : P r n 0 k , m 1 1 u = 1 U b u n 0 k = 0 , k.
(21)

where A n 0 = a un k | n = n 0 , B n 0 = b un k | n = n 0 , and

R n 0 = min u U n 0 k = 1 K a u n 0 k R u n 0 , 1 k P m 1 + k = 1 K b u n 0 k R u n 0 , 2 k P m 1 .

By introducing a slack variable ξ n 0 , problem (21) can be reformulated as

max A n 0 , B n 0 , ξ n 0 ξ n 0 s.t. C 1 , C 2 , C 5 , C 6 , C 7 and C 3 : k = 1 K a u n 0 k R u n 0 , 1 k + b u n 0 k R u n 0 , 2 k ξ n 0 , u U n .
(22)

Note that (22) is actually a mixed integer linear program (MILP). Hence, it can be solved by means of typical MILP solvers, like MOSEK and TOMLAB. In order to solve it much faster in polynomial time, we now propose a RR-MSSA algorithm and a DR-MSSA algorithm. Specifically, both algorithms consist of three steps: the relaxation step, the rounding step, and the decision step.

During the relaxation step, we first relax constraint C 2 as C 2 ′′ : a u n 0 k [0,1], b u n 0 k [0,1],u,k. Then, the MILP is relaxed as a linear program (LP), which can easily be solved by any standard LP algorithm in polynomial time (time that is bounded by a fixed polynomial of the length of the input). Note that as the feasible set of the LP is larger than that of the MILP, the optimal objective function value of the LP is an upper bound of that of the MILP.

During the rounding step, we round the fractional optimal solution A n 0 , B n 0 of the LP into an integer solution A n 0 , B n 0 . Specifically, each value of the fractional solution is viewed as the probability that this value should be rounded to 1. Then, a direct rounding (DR) algorithm is ∀k,

a u n 0 k , = 1 a u n 0 k , = max u a u n 0 k , , b u n 0 k , 0 a u n 0 k , max u a u n 0 k , , b u n 0 k , ,
b u n 0 k , = 1 b u n 0 k , = max u a u n 0 k , , b u n 0 k , 0 b u n 0 k , max u a u n 0 k , , b u n 0 k , .

After this rounding, the R n 0 calculated with A n 0 , B n 0 is decreased, compared to that calculated with A n 0 , B n 0 .

In order to make R n 0 P m 1 , A n 0 , B n 0 closer to R n 0 P m 1 , A n 0 , B n 0 , we now use the randomized rounding (RR) algorithm, where N s samples of binary values are generated for variables A n 0 , B n 0 by a randomized process with probability A n 0 , B n 0 . Each sample is denoted as A n 0 ( s ) , B n 0 ( s ) . Finally, the feasible sample with the maximum R n 0 is assigned to A n 0 , B n 0 .

During the decision step, we assign A n 0 m , B n 0 m with

argmax A n 0 , B n 0 R n 0 P m 1 , A n 0 , B n 0 , R n 0 P m 1 , A n 0 m 1 , B n 0 m 1 .

4.2 PA optimization

In this subsection, the PA stage for iteration m is considered. After setting the indicators to {Am,Bm}, the transmission modes are fixed in all cells. Let us denote by S un (d) ( S un (r)) the set of subcarriers allocated to MS d u n in direct (relay-aided) mode. Then, the objective function of problem (12) can be rewritten as R(P,Am,Bm), given by

n ω n min u U n k S un ( d ) ln 1 + Υ d un , t 1 k + ln 1 + Υ d un , t 2 k + k S un ( r ) ln min 1 + Γ r n k , 1 + Γ d un , t 2 k .
(23)

Note that R is a non-convex function due to the presence of interfering power terms in the denominators of Υ d un , t 1 k , Υ d un , t 2 k , Γ r n k , and Γ d un , t 2 k . To solve problem (12), we first replace it with an equivalent complementary geometric program (CGP; see Additional file 1) that is then addressed by means of the algorithm based on SCGP. Note that this methodology has also been used in [26] to tackle another optimization problem.

The equivalent CGP is obtained by formulating problem (12) as a minimization problem. Problem (12) is first converted into an equivalent one given by

min P e R P , A m , B m s.t. C 3 C 7 ,
(24)

where e R P , A m , B m is given by

n max u U n k S un ( d ) 1 1 + Υ d un , t 1 k 1 1 + Υ d un , t 2 k k S un ( r ) max 1 1 + Γ r n k , 1 1 + Γ d un , t 2 k ω n ,
(25)

and

1 1 + Υ d un , t 1 k = f d un , t 1 k f d un , t 1 k + P s n , t 1 k G s n , d un k ,
(26)
1 1 + Υ d un , t 2 k = f d un , t 2 k f d un , t 2 k + P s n , t 2 k G s n , d un k ,
(27)
1 1 + Γ d un , t 2 k = f d un , t 2 k f d un , t 2 k + P r n k G r n , d un k ,
(28)
1 1 + Γ r n k = f r n k f r n k + P s n , t 1 k G s n , r n k .
(29)

Second, another equivalent formulation is obtained by introducing slack variables { R n } and Ψ r = Ψ r unk , u , n , k S un ( r ) . Then, problem (24) can be formulated as

min P , { R n } , Ψ r n R n s.t. C 3 C 7 , C 8 : 1 R n k S un ( d ) f d un , t 1 k g d un , t 1 k f d un , t 1 k g d un , t 2 k k S un ( r ) Ψ r unk ω n 1 , u , n , C 9 : f r n k g r n k Ψ r unk 1 , u , n , k S un ( r ) , C 10 : f d un , t 2 k g d un , t 2 k Ψ r unk 1 , u , n , k S un ( r ) ,
(30)

where

g d un , t 1 k = f d un , t 1 k + P s n , t 1 k G s n , d un k ,
(31)
g d un , t 2 k = f d un , t 2 k + P s n , t 2 k G s n , d un k ,
(32)
g d un , t 2 k = f d un , t 2 k + P r n k G r n , d un k ,
(33)
g r n k = f r n k + P s n , t 1 k G s n , r n k .
(34)

Problem (30) consists of an objective function and bounding constraints which all are ratios of two posynomials, making the problem belong to the class of CGP [31]. Problem (30) cannot be made convex and is NP-hard [32]. In order to solve it, the SCGP algorithm is now proposed. The proposed SCGP algorithm approximates the non-convex problem (30) into a series of standard GPs. Therefore, it belongs to the class of successive convex approximation methods [33]. The SCGP is described in Algorithm 1. Integer m indicates the current iteration number, and a variable with superscripts m and m denotes the value obtained at the end of inner iteration m in outer iteration m.

Specifically in the proposed SCGP algorithm, we first set m=0 and initialize the PA vector Pm,0 with Pini. Each iteration m contains an SC stage followed by a standard geometric programming (SGP) stage.

During the SC stage, a GP approximation of problem (30) is constructed by using the method of Lemma 1 in [31] to condense all denominator posynomials g d un , t 1 k , g d un , t 2 k , g r n k , g d un , t 2 k into monomials g ~ d un , t 1 k , g ~ d un , t 2 k , g ~ r n k , g ~ d un , t 2 k . Specifically, the exact expressions of the monomials are given in Additional file 2. Functions h 1 n , k = f r n k g r n k Ψ r unk , h 2 n , k = f d un , t 2 k g d un , t 2 k Ψ r unk and h 3 n , k = 1 R n k S un ( d ) f d un , t 1 k g d un , t 1 k f d un , t 1 k g d un , t 2 k k S un ( r ) Ψ r unk ω n are approximated by, respectively, h ~ 1 n , k = f r n k g ~ r n k Ψ r unk , h ~ 2 n , k = f d un , t 2 k g ~ d un , t 2 k Ψ r unk and h ~ 3 n , k = 1 R n k S un ( d ) f d un , t 1 k g ~ d un , t 1 k f d un , t 1 k g ~ d un , t 2 k k S un ( r ) Ψ r unk ω n . According to proposition 3 in [31], all the approximations satisfy the three conditions proposed in [33] for the convergence of the successive approximation method. By denoting Θ= P , R n , Ψ r , the three conditions are listed as follows:

  1. 1.

    Bounding condition: ∀Θ,

    h i n , k ( Θ ) h ~ i n , k ( Θ ) , i = 1 , 2 , 3 .
  2. 2.

    Tightness condition: At the beginning of iteration m ,

    h i n , k Θ m , m 1 = h ~ i n , k Θ m , m 1 , i = 1 , 2 , 3 .
  3. 3.

    Differential condition: At the beginning of iteration m , ∀θΘ,

    h i n , k Θ m , m 1 ∂θ = h ~ i n , k Θ m , m 1 ∂θ , i = 1 , 2 , 3 .

After the SC stage, problem (30) is formulated by a standard GP. The SGP stage amounts to solving the GP by means of a standard GP solver, e.g., the software provided in [34]. The output provided by this stage corresponds to P m , m .

Thanks to the three conditions fulfilled during the SC stage, our proposed SCGP algorithm is a general inner approximation algorithm [33], which will converge to a local optimum satisfying the KKT conditions of problem (30) according to corollary 1 of [33]. In practice, the iterations of the SCGP algorithm will be stopped when || P m , m P m , m 1 || ε 1 or when m exceeds a prescribed value M.

4.3 The proposed RA algorithm and its characteristics

Based on the algorithms proposed at both the MSSA stage and the PA stage, we now propose the overall RA algorithm as follows:

Specifically in the proposed RA algorithm, we first set m=0 and initialize the power P0 by uniform power allocation (UPA). Each iteration consists of the MSSA stage and the PA stage. During the MSSA stage of iteration m, we set P=Pm−1 and decouple problem (12) into n MILPs, which can be solved by either typical MILP solvers (named the MSSA algorithm) or the RR-MSSA algorithm and the DR-MSSA algorithm proposed in Section 4.1. Note that both the RR-MSSA algorithm and the DR-MSSA algorithm can be solved in polynomial time. The output delivered is denoted as Am and Bm. Finally, we have R(Pm−1,Am−1,Bm−1)≤R(Pm−1,Am,Bm).

During the PA stage of iteration m, we set A=Am, B=Bm and solve problem (12) using the SCGP algorithm proposed in Section 4.1. The output delivered is denoted as Pm. As shown in Section 4.2, the solutions of the approximated GP problems converge to a local optimum satisfying the Karush-Kuhn-Tucker (KKT) conditions of the non-convex problem. Note that as each standard GP can be solved in polynomial time [32], Pm will be obtained in polynomial time after the convergence of the SCGP algorithm. Finally, we have R(Pm−1,Am,Bm)≤R(Pm,Am,Bm).

Considering the proposed RA algorithm, we now have

R P m 1 , A m 1 , B m 1 E 1 : R P m 1 , A m , B m E 2 : R P m , A m , B m ,

where E1 and E2 are due to effects of the MSSA stage and the PA stage, respectively. This means that Algorithm 2 yields non-decreasing sum rates with iterations. Moreover, the optimum sum rate is upper bounded due to the total power constraint in each cell. Thus, the sum rate values will not increase indefinitely with iterations, meaning that the iterations will eventually converge. Algorithm 2 will stop when the sum rate increase is below a prescribed value ε2 or when m reaches a prescribed value M.

At the end of iteration m, a local optimum of the formulated problem is obtained at the SCGP stage, which is then improved at the MSSA stage of the next iteration. After that, a better local optimum can be calculated at the SCGP stage of iteration m+1. Finally, a good local optimum can be obtained after convergence. As both problems at two stages can be solved by the proposed algorithms in polynomial time, the formulated problem can be solved by Algorithm 2 in polynomial time.

We now propose Theorem 2 as follows:

Theorem 2. After the convergence of Algorithm 2, per-cell user fairness is reached, meaning that every user in the same cell obtains the same transmission rate.

Proof. As mentioned before, after the convergence of Algorithm 2, a local optimum of problem (12) is obtained. Let us denote the optimal solutions as A, B, and P. Thus, for the local optimum, there exists r>0, such that ∀PB r (P),R(P,A,B)≤R(P,A,B). Here, B r (P)={P | d(P,P)<r} is the open ball with center P and radius r, where d(P,P) denotes the Euclidean distance between P and P. Let us assume ∃n where V ~ n . Following the methodology similar to the proof of Theorem 1, there exists matrices S,C,T and P1=S C T PB r (P), such that R(P1,A,B)>R(P,A,B). Here, each element of S,C, and T takes the value from (0,1],(0,1], and [ 1,), respectively. This obviously contradicts the previous optimality statement, and V ~ n =, which concludes the proof of Theorem 2.

5 Numerical experiments

In this section, the performance of our proposed RA algorithm as well as the benefits of using OR and the HSE protocol will be discussed. To illustrate the convergence and effectiveness of the algorithm, the rates of all users and the WSMRs are first presented for one particular channel realization. Then, the results are provided and discussed for the performance of the algorithms averaged over many channel realizations. Note that we introduce the MSSA-based algorithm, the RR-MSSA-based algorithm, and the DR-MSSA-based algorithm to denote the proposed iterative RA algorithms which adopt the MSSA algorithm, the RR-MSSA algorithm, and the DR-MSSA algorithm, respectively, at the MSSA stage.

5.1 System setup

For the purpose of illustration, a multi-cell OFDMA system with N=3 coordinated cells and K=32 available subcarriers is considered. Each cell contains U=4 MSs. Figure 1 shows the positions of the BSs, the RSs, and the MSs expressed in meters. Specifically, the coordinates of BS 1, 2, and 3 are 100 3 , 100 , 100 3 , 100 , and (0,200), respectively. Also, the coordinates of RS 1, 2, and 3 are 100 3 , 60 , 100 3 , 60 , and (0,160), respectively. Each member of the MS group U n in cell n is located randomly and uniformly inside a confined region of that cell, which is depicted as a blue box in Figure 1.

Figure 1
figure 1

A random system setup.

The channel impulse response (CIR) of each link is drawn randomly from an 8-tap delay line model, where each tap has a circular complex Gaussian distribution with zero mean and variance as σ i 2 . We further assume σ i 2 σ i + 1 2 = e 3 , meaning that the tap power decreases exponentially with a coefficient 3. Moreover, i σ i 2 = d 3 , meaning that the received power decreases exponentially with distance d and the propagation exponent equals 3. Finally, the CFR of each link is computed from its CIR using the K-point FFT. Each cell is assumed to have the same total power constraint, and we set ω n =1,Pt,n=P t ,∀n, σ2=−70 dBm, ε1=Rini/100, and ε2=Rini/100. Here, Rini denotes the sum rate calculated with the initial RA results at the beginning of each iteration.

5.2 Results for a random realization of channels

In order to illustrate the convergence and effectiveness of three RA algorithms, we set P t =20 dBm and U=4. The positions of MSs and a set of channels are randomly generated. Specifically, the coordinates of d11, d12, d13, d14, d21, d22, d23, d24, d31, d32, d33, and d34 are (−110.9,−29.6), (−57,−34.9), (−76.9,−53.9), (−122.9,−73.6), (72.5,−35), (121.4,−0.1), (66.9,−49.5), (110.6,−73.4), (−3.2,130.8), (−32.6,78.9), (14.6,115.1), and (15.2,72.2), respectively.

As shown in Figure 2, using the three proposed algorithms, the WSMRs keep increasing continuously and converge smoothly to the final rate. After convergence, the WSMRs are increased by around 74%, 73%, and 55% using the MSSA-based algorithm, the RR-MSSA-based algorithm, and the DR-MSSA-based algorithm, respectively. Compared to the MSSA-based algorithm, the RR-MSSA-based algorithm has similar performance and less complexity. Thus, it is preferred for practical implementation.

Figure 2
figure 2

Convergence of the WSMR.

Figure 3 shows the optimized user rates in three cells using three RA algorithms. After the convergence of each RA algorithm, the minimal user rate in each cell is enhanced and all users in the same cell obtain the same rates. Therefore, the fairness of our algorithms is illustrated.

Figure 3
figure 3

Rate results for the random realization of channels. Rates for users in (A) cell 1, (B) cell 2, and (C) cell 3.

5.3 Results averaged over channel distribution

In order to illustrate the average performance of our proposed RA algorithms, 100 random realizations of channels are generated with U=8. We set up different total power constraints ranging from 0 to 50 dBm. To show the benefit of using OR and the HSE protocol, we now compare the proposed protocol with a FR protocol and a LSE protocol. Specifically, the FR protocol is the same as the proposed protocol except that all subcarriers are forced to be in relay-aided mode. The LSE protocol is also the same as the proposed protocol except that all direct mode subcarriers are activated only during the first TS. We also compare the performances of the MSSA-based algorithm, the RR-MSSA-based algorithm, and the DR-MSSA-based algorithm.

Figure 4 adopts the MSSA-based algorithm. It appears that the average WSMR is increased significantly after optimization, especially when P t takes high values. This confirms the effectiveness of our proposed algorithm. Figure 5 provides the average WSMRs for the FR protocol, the LSE protocol, and the proposed protocol. We can see that the proposed protocol performs much better than the other two protocols. Thus, the effectiveness of using OR and the HSE protocol is illustrated. Figure 6 provides the average WSMRs of using the MSSA-based algorithm, the RR-MSSA-based algorithm, and the DR-MSSA-based algorithm. We can see that the RR-MSSA-based algorithm performs similarly as the MSSA-based algorithm and better than the DR-MSSA-based algorithm. Note that the RR-MSSA-based algorithm can be solved in polynomial time. Thus, it offers a good trade-off between the performance and the complexity.

Figure 4
figure 4

Average WSMRs over 100 random realizations of channels.

Figure 5
figure 5

Comparison of average WSMRs for three protocols.

Figure 6
figure 6

Comparison of average WSMRs with three RA algorithms.

6 Conclusion

We have considered a multi-cell OFDMA downlink system where the BS transmissions are aided by a DF RS with OR and the HSE protocol. Assuming a central controller, we have formulated the WSMR maximization problem with per-cell total power constraints. Three iterative RA algorithms have been proposed, and their convergence as well as fairness characteristics were proven. Through numerical experiments, the convergence and effectiveness of the proposed algorithms as well as the benefit of using OR and the HSE protocol have been illustrated. Future work will be devoted to considering signal combining schemes at the destinations and AF/compress-and-forward (CF) relaying protocols with subcarrier pairing. Moreover, the case of multiple RSs and individual power constraints will also be investigated.