1 Introduction

Modern cellular networks need to provide ubiquitous coverage and high data rates with low infrastructure deployment cost [1]. The incorporation of two-hop fixed relays, which are connected to base-stations via wireless backhaul, provides a convenient solution toward such a goal. In particular, cooperative relaying in wireless networks is an attractive solution for extending base station coverage and filling coverage gaps that affect user equipment’s performance in the strongly shadowed urban environment. There are two main types of relaying schemes, namely half-duplex (HD) relaying and full-duplex (FD) relaying.

In literature, a large number of works have been done on HD relaying which focused on spectral efficiency [2], channel capacity [3], outage probability [4], etc., as it enables a low-complexity relay design. Nevertheless, HD relaying systems require additional resources (frequency bands or time slots) to transmit data in a multi-hop manner which results in a loss of spectral efficiency.

On the contrary, although FD relaying suffers from inherent self-interference (SI) which was considered impractical in the past, it has regained the attention of both industry [5, 6] and academia [79]. Recent research shows that FD relaying is feasible by using interference cancellation techniques and transmit/receive antenna isolation [1013]. Some works on FD relaying in cellular systems have been done. For example, in [14], the authors focused on designing FD cellular networks with co-channel femtocells. Because wireless virtualization can provide more flexibility, diversity, and other benefits in wireless network design, the authors in [15] introduced the idea of wireless virtualization into FD cellular networks. Because the FD relaying can achieve higher spectral efficiency than HD relaying, a new device-to-device (D2D) communication scheme was proposed that allows D2D links to underlay cellular downlink by assigning D2D transmitters as FD relays to assist cellular downlink transmission [16]. Although these excellent works have been done for FD relaying in cellular networks, the energy efficiency aspect of FD relay systems did not receive much attention.

With the exponential growth of data traffic, energy consumption of wireless networks has been rapidly increasing, and energy-efficient design of wireless networks has become the focus of both academia and industry [17, 18]. However, the existing works on energy efficiency analysis of relay systems mainly focused on the HD mode, which cannot be directly applied in FD relay systems because of SI. The spectral-energy efficiency tradeoff in FD two-way relay networks has been studied [19], in which an optimization problem was formulated to maximize the energy efficiency under the spectral efficiency requirement and the transmit power constraint. A shared FD relay deployment scenario, where a FD relay is deployed at the intersection of several adjacent cell sectors, was considered in [20]. Energy efficiency was also studied in FD massive multi-input multi-output (MIMO) systems [21, 22].

Some researchers also considered hybrid HD/FD transmission schemes, which combine the advantages of both HD and FD. In [23], an optimal transmission scheduling scheme for a hybrid HD/FD relaying system was proposed, which can attain higher spectral efficiency than the HD mode or FD mode. In [24], duplex mode selection and transmit power adaptation were integrated to maximize the spectral efficiency. In [25], the network interference from FD mode cells was characterized in a heterogeneous network operating in the FD or HD mode. In [26], a unified model was proposed to represent four hybrid relay modes and a joint relay mode selection and power allocation algorithm was developed.

Motivated by the above-mentioned works in FD relaying, a hybrid HD/FD transmission scheme in a relay-aided cellular network is studied in this paper. The system energy efficiency in the hybrid scheme is evaluated under the system spectral efficiency constraint and is compared with those of the HD and FD modes. This hybrid scheme not only utilizes the throughput advantage of FD but also weakens the negative effect of SI. The authors in [24] also studied hybrid HD/FD transmission and transmit power optimization. They considered a two-hop relay system with direct link, where the flexible switching can be done between HD and FD as well as between relaying and direct transmission. However, the switching between HD and FD is based on the end-to-end SINR, which requires global channel state information compared to the proposed hybrid scheme. Besides, they only considered maximizing the system spectral efficiency but ignored the system energy efficiency, which is very important for energy-efficient implementation of the hybrid scheme in the 5G era.

The rest of this paper is organized as follows. Section 2 introduces the relay-aided cellular network model and the downlink transmission for HD, FD, and hybrid modes. Then, the energy-efficient power allocation problem is formulated and solved in Section 3. Section 4 presents the simulation results. Finally, some concluding remarks are made in Section 5.

1.1 System model

We consider a downlink, two-hop, relay-aided wireless cellular network as depicted in Fig. 1. We focus on a single cell where the base station (BS) is located at the center of the cell. The coverage radius of the cell is denoted by R. We consider the case where there is no direct link between the BS and the mobile users (MUs) due to shadowing, which may happen when the BS and the MUs are far away from each other. Moreover, coordinated multi-point transmission among the relay stations (RSs) is not considered. There is one BS, a set of K RSs, and a set of K MUs staying within the cell. The BS and the MUs are equipped with a single antenna while it is beneficial to equip the RSs with isolated receive and transmit antennas. Each MU connects with the BS through the RS and each RS is allowed to connect only one MU for simplicity. The decode-and-forward relaying protocol is adopted.

Fig. 1
figure 1

A relay-aided cellular network

The network comprises three kinds of wireless links: feeder link (from the BS to the RS), access link (from the RS to the MU), and residual SI link (from a RS’s transmit antenna to its receive antenna). Their channel coefficients are denoted by H SR, H RM, and H L, respectively. Moreover, the channels H SRH RM are assumed to experience Rayleigh block fading. We denote H SRH RM ~ CN(0, d − ν), where v is the path-loss exponent and d is the distance between the respective transmitter and receiver. The transmit power of the BS and the kth RS are denoted by P B and \( {P}_{{\mathrm{R}}_{\mathrm{k}}} \), respectively. The unit-variance transmitted signal from the BS and the RS are denoted by X S and X R, respectively.

1.2 Downlink transmission

1.2.1 HD mode

In the HD mode, each transmission frame is divided into two stages. The downlink transmission can be split into two parts: feeder link and access link.

Signal transmission in feeder links

In the first stage, the BS transmits a signal to the associated RSs in a broadcast manner. The signals received at the RSs in the cell can be written as

$$ {y}_{{\mathrm{S}\mathrm{R}}_k}=\sqrt{P_{\mathrm{B}}}{H}_{{\mathrm{S}\mathrm{R}}_k}{X}_{\mathrm{S}}+{U}_k,\kern1em k=1,\ 2,\cdots,\ K, $$
(1)

where \( {H}_{{\mathrm{SR}}_k} \) is the channel coefficient from the BS to the kth RS and U k is an additive white Gaussian noise with zero mean and variance σ 2. The SINRs at the RSs can be calculated as

$$ {\mathrm{SINR}}_{1,k}^{\mathrm{HD}}=\frac{P_B{\left|{H}_{{\mathrm{SR}}_k}\right|}^2}{\sigma^2},\kern1em k=1,2,\dots, K $$
(2)

The information rate of the kth feeder link can be expressed as

$$ {R}_{\mathrm{Feeder}}^{\mathrm{HD},k}=\frac{1}{2}W{ \log}_2\left(1+{\mathrm{SINR}}_{1,k}^{\mathrm{HD}}\right), $$
(3)

where W is the cell bandwidth.

Signal transmission in access links

In the second stage, each RS forwards the decoded signal to its associated MU. It is assumed that the decoding is always successful. The received signals at the MUs in the cell can be written as

$$ {y}_{{\mathrm{R}\mathrm{M}}_k}=\sqrt{P_{{\mathrm{R}}_k}}{H}_{{\mathrm{R}\mathrm{M}}_k}{X}_{\mathrm{R}}+{I}_k+{U}_k,\kern1.25em k=1,2,\cdots, K, $$
(4)

where \( {H}_{{\mathrm{RM}}_k} \) is the channel coefficient from the kth RS to its associated MU, U k is an additive white Gaussian noise with variance σ 2, and I k is the co-channel interference among the RSs. The SINRs at the MUs can be calculated as

$$ {\mathrm{SINR}}_{2,k}^{\mathrm{HD}}=\frac{P_{{\mathrm{R}}_k}{\left|{H}_{{\mathrm{R}\mathrm{M}}_k}\right|}^2}{{\displaystyle {\sum}_{j=1,j\ne k}^K{P}_{R_j}{\left|{H}_{{\mathrm{R}}_j{\mathrm{M}}_k}\right|}^2}+{\sigma}^2},\kern0.5em k=1,2,\cdots, K. $$
(5)

The information rate of the kth access link can be expressed as

$$ {R}_{\mathrm{Access}}^{\mathrm{HD},k}=\frac{1}{2}W{ \log}_2\left(1+{\mathrm{SINR}}_{2,k}^{\mathrm{HD}}\right) $$
(6)

1.2.2 FD mode

The FD relaying allows simultaneous transmission and reception over the same frequency band. Thus, the communication quality is degraded by SI from a RS transmit antenna to its receive antenna. For the convenience of description, the downlink transmission is also split into two parts: feeder link and access link.

Signal transmission in feeder links

The BS transmits a signal to the associated RSs also in a broadcast manner. The signal received at a RS includes both SI and co-channel interference among the RSs. Therefore, after SI cancellation (by using antenna isolation, SI cancellation techniques, etc.), the signal received at the kth RS in the cell can be written as

$$ {y}_{{\mathrm{S}\mathrm{R}}_k}=\sqrt{P_{\mathrm{B}}}{H}_{{\mathrm{S}\mathrm{R}}_k}{X}_{\mathrm{S}}+\sqrt{P_{{\mathrm{R}}_k}}{H}_{{\mathrm{L}}_k}^{\hbox{'}}{X}_{\mathrm{R}}+{N}_k+{U}_k,\kern0.75em k=1,2,\cdots, K, $$
(7)

where \( {H}_{{\mathrm{L}}_k}^{\hbox{'}}=\beta {\mathrm{H}}_{{\mathrm{L}}_k} \), \( {H}_{{\mathrm{L}}_k} \) is the SI channel coefficient from the transmit antenna to the receive antenna of the kth RS, β(0 ≤ β ≤ 1) is the interference cancellation factor at the RSs (obviously, β = 0 indicates that the SI is perfectly cancelled, but it is impossible in practical environments), \( G={\left|{H}_{{\mathrm{L}}_k}^{\hbox{'}}\right|}^2 \) is the SI channel gain, which is caused by imperfect cancellation of SI, and N k is the co-channel interference from the other RS transmissions. The SINRs at the kth RS can be calculated as

$$ {\mathrm{SINR}}_{1,k}^{\mathrm{FD}}=\frac{P_{\mathrm{B}}{\left|{H}_{{\mathrm{SR}}_k}\right|}^2}{P_{{\mathrm{R}}_k}{\left|{H}_{{\mathrm{L}}_k}^{\hbox{'}}\right|}^2+{\displaystyle {\sum}_{l=1,l\ne k}^K{P}_{{\mathrm{R}}_l}{\left|{H}_{{\mathrm{R}}_{l,k}}\right|}^2}+{\sigma}^2}. $$
(8)

The information rate of the kth feeder link can be expressed as

$$ {R}_{\mathrm{Feeder}}^{\mathrm{FD},k}=W{ \log}_2\left(1+{\mathrm{SINR}}_{1,k}^{\mathrm{FD}}\right). $$
(9)

From this expression, we find that a larger power of residual SI will result in a lower rate of the feeder link. That is to say, the residual SI couples the feeder link and the access link, which makes power allocation at RSs quite important and different from that of HD relay systems.

Signal transmission in access links

Each RS also forwards the decoded signal to its associated MU. It is assumed that the decoding is always successful. So the signals received at the kth MU in the cell can be written as

$$ {y}_{{\mathrm{R}\mathrm{M}}_k}=\sqrt{P_{{\mathrm{R}}_k}}{H}_{{\mathrm{R}\mathrm{M}}_k}{X}_{\mathrm{R}}+{I}_k+{U}_k,\kern0.5em k=1,2,\cdots, K, $$
(10)

where \( {H}_{{\mathrm{RM}}_k} \) is the channel coefficient from the kth RS to its associated MU and I k is the co-channel interference among the RSs. The SINRs at the kth MU can be calculated as

$$ {\mathrm{SINR}}_{2,k}^{\mathrm{FD}}=\frac{P_{{\mathrm{R}}_k}{\left|{H}_{{\mathrm{R}\mathrm{M}}_k}\right|}^2}{{\displaystyle {\sum}_{j=1,j\ne k}^K{P}_{{\mathrm{R}}_j}{\left|{H}_{{\mathrm{R}}_j{\mathrm{M}}_k}\right|}^2+{\sigma}^2}},\kern1em k=1,2,\cdots, K. $$
(11)

The information rate of the kth access link can be expressed as

$$ {R}_{\mathrm{Access}}^{\mathrm{FD},k}=W{ \log}_2\left(1+{\mathrm{SINR}}_{2,k}^{\mathrm{FD}}\right). $$
(12)

To guarantee the throughput, the information rate of the feeder link should be no lower than that of the access link [20], i.e., \( {R}_{{}_{\mathrm{Feeder}}}^k\ge {R}_{\mathrm{Access}}^k \). Here, the information rate \( {R}_{\mathrm{Access}}^k \) can be viewed as the effective service rate for each MU. From the above signal transmissions, the cell throughput can be expressed as

$$ {R}_{\mathrm{HD}}=\frac{1}{2}W{\displaystyle \sum_{k=1}^K{ \log}_2\left(1+{\mathrm{SINR}}_{2,k}^{\mathrm{HD}}\right)} $$
(13)
$$ {R}_{\mathrm{FD}}=W{\displaystyle \sum_{k=1}^K{ \log}_2\left(1+{\mathrm{SINR}}_{2,k}^{\mathrm{FD}}\right)} $$
(14)

1.2.3 Hybrid mode

In the hybrid mode, a RS switches between the HD mode and the FD mode with a certain probability. For the ease of analysis, suppose now the kth RS is working on the HD mode. In the first stage, if \( {\mathrm{SINR}}_{1,k}^{\mathrm{HD}}\le {I}_{th} \), where I th is the preset SINR threshold that the access link requires, in order to improve the information rate of the access link, the kth RS will switch to the FD mode; otherwise, the kth RS will stay in the HD mode. So, the cell throughput can be expressed as

$$ \begin{array}{l}{R}_{\mathrm{Hybrid}}=p\left\{{\mathrm{SINR}}_{1,k}^{\mathrm{HD}}\ge {I}_{th}\right\}{R}_{\mathrm{HD}}+p\left\{{\mathrm{SINR}}_{1,k}^{\mathrm{HD}}<{I}_{th}\right\}{R}_{\mathrm{FD}},\\ {}\kern3.25em =x{R}_{\mathrm{HD}}+\left(1-x\right){R}_{\mathrm{FD}}\end{array} $$
(15)

where \( x={e}^{-\frac{a}{P_B}},\ a=\frac{I_{th}.{\sigma}^2}{d^{\hbox{-} \nu }} \).

1.3 Power consumption model

The power consumption of the stations can be expressed as a linear form similar to [27, 28]:

$$ {P}_{i- tot}={a}_i{\displaystyle {P}_i}+{b}_i\begin{array}{cc}\hfill, \hfill & \hfill i=B,R\hfill \end{array} $$
(16)

where the coefficients a B and a R, respectively, account for power consumption that scales with the average radiated power, and the terms b B and b R model the static power consumed by signal processing, battery backup, and cooling. Therefore, if the RSs work on the FD mode, the total power consumption is written as

$$ {P}_{\mathrm{full}}={P}_{\mathrm{C}}+{a}_{\mathrm{B}}{P}_{\mathrm{B}}+{a}_{\mathrm{R}}{\displaystyle \sum_{k=1}^K{P}_{{\mathrm{R}}_k}}, $$
(17)

where P C = b B + Kb R is the static power consumed by signal processing, battery backup, and cooling. When the RSs work on the HD mode, the static power P C is the same as that in FD, but the average radiated power is half of that in FD. The total power consumption is written as

$$ {P}_{\mathrm{half}}={P}_{\mathrm{C}}+\frac{1}{2}\left({a}_{\mathrm{B}}{P}_{\mathrm{B}}+{a}_{\mathrm{R}}{\displaystyle \sum_{k=1}^K{P}_{{\mathrm{R}}_k}}\right) $$
(18)

According to the total probability formula, if the RSs work on the hybrid mode, the total power consumption can be written as

$$ {P}_{\mathrm{T}}=x{P}_{\mathrm{half}}+\left(1-x\right){P}_{\mathrm{full}}. $$
(19)

The system spectral efficiency and system energy efficiency can be respectively defined as

$$ {\eta}_{\mathrm{SE}}=\frac{R}{W},\ {\eta}_{\mathrm{EE}}=\frac{R}{P} $$
(20)

2 Energy-efficient power allocation

The energy-efficient power allocation problem is formulated to maximize the system energy efficiency under the system spectral efficiency and the transmit power constraints, which is

$$ \begin{array}{l}\underset{P_{\mathrm{B}},{P}_{{\mathrm{R}}_k}}{ \max }\ {\eta}_{\mathrm{EE}}\\ {}\mathrm{s}.\mathrm{t}.\kern0.5em {\eta}_{\mathrm{SE}}\ge \overline{\eta_{\mathrm{SE}}}\\ {}\kern1.75em 0\kern0.5em \le {P}_{\mathrm{B}}\le \overline{P_{\mathrm{B}}}\\ {}\kern1.75em 0\le {P}_{{\mathrm{R}}_k}\le \overline{P_{{\mathrm{R}}_k}},\kern1em k=1,2,\dots, K,\end{array} $$
(P)

where \( \overline{\eta_{\mathrm{SE}}} \) is the minimum spectral efficiency requirement, \( {\displaystyle {\overline{P}}_{\mathrm{B}}} \) and \( {\displaystyle {\overline{P}}_{{\displaystyle {\mathrm{R}}_{\mathrm{k}}}}} \) are the maximum allowed transmit powers of the BS and the RSs, respectively. Here for simplicity it is assumed that the transmit powers of the RSs are the same. According to the signal transmissions described above, the cell throughput only includes the effective service rate of the MUs, and the information rate of feeder links is not included. We further consider the information rate of feeder links as a constraint in the following, which guarantees that the information rate of a feeder link is no lower than that of a corresponding access link. The above optimization problem is not concave in P B and \( {P}_{{\mathrm{R}}_k} \) jointly. To overcome this difficulty, we adopt the alternating optimization method (i.e., first fix the BS power and optimize the power of each RS; then fix the RS power and optimize the BS power) to maximize the system energy efficiency under the constraints below. This method can reach a near-optimal solution with affordable computational complexity, compared to other optimization methods.

2.1 Optimizing over P B for a fixed \( {P}_{{\mathrm{R}}_k} \)

Given a fixed \( {P}_{{\mathrm{R}}_k} \), our target is to seek the optimal transmit power P B to maximize the system energy efficiency while satisfying the above-mentioned constraints. Thus, the problem (P) can be rewritten as

$$ \begin{array}{l}\kern1.75em \underset{P_{\mathrm{B}},{P}_{{\mathrm{R}}_k}}{ \max}\frac{\eta_{\mathrm{SE}}}{P_{\mathrm{T}}}\\ {}\kern0.1em \mathrm{s}.\mathrm{t}.\kern0.5em {\eta}_{\mathrm{SE}}\ge \overline{\eta_{\mathrm{SE}}}>0\\ {}\kern3.25em 0\le {P}_{\mathrm{B}}\le \overline{P_{\mathrm{B}}}\\ {}\kern0.5em {\displaystyle {R}_{\mathrm{Feeder}}^k}\ge {\displaystyle {R}_{\mathrm{Access}}^k}.\end{array} $$
(21)

Obviously, with a fixed \( {P}_{{\mathrm{R}}_k} \), η SE is a strictly decreasing function of P B. However, the objective function is a fractional function and it is hard to find the optimal solution. To overcome this issue, we apply the fractional programming technique [29] to transform the objective function to an integral expression. The optimal value \( {q}^{*}=\frac{\eta_{\mathrm{SE}}\left({P}_{\mathrm{B}}^{*},{P}_{{\mathrm{R}}_k}^{*}\right)}{P_{\mathrm{T}}\left({P}_{\mathrm{B}}^{*},{P}_{{\mathrm{R}}_k}^{*}\right)} \) can be achieved as long as \( {\eta}_{\mathrm{SE}}\left({P}_{\mathrm{B}}^{*},{P}_{{\mathrm{R}}_k}^{*}\right)-{q}^{*}{P}_{\mathrm{T}}\left({P}_{\mathrm{B}}^{*},{P}_{{\mathrm{R}}_k}^{*}\right)=0 \). Hence, we can conclude that the solution of sub-problem Eq. (21) is equivalent to \( F\left({q}^{*}\right)={\eta}_{\mathrm{SE}}\left({P}_{\mathrm{B}},{P}_{{\mathrm{R}}_k}\right)-{q}^{*}{P}_{\mathrm{T}}\left({P}_{\mathrm{B}},{P}_{{\mathrm{R}}_k}\right)=0 \). Searching q* can be done by the Dinkelbach method which can converge to the optimal value at a superlinear rate [29]. Therefore, for any feasible q, the sub-problem Eq. (21) can be equivalently transformed into a sub-problem Eq. (22), as follows:

$$ \begin{array}{l} \max \kern0.75em {\eta}_{\mathrm{SE}}-q{P}_{\mathrm{T}}\\ {}\mathrm{s}.\mathrm{t}.\ {\eta}_{\mathrm{SE}}\ge \overline{\eta_{\mathrm{SE}}}\\ {}\kern0.45em 0\kern0.5em \le {P}_{\mathrm{B}}\le \overline{P_{\mathrm{B}}}\\ {}\kern0.65em {\mathrm{R}}_{\mathrm{Feeder}}^k\ge {R}_{\mathrm{Access}}^k\end{array} $$
(22)

The objective function can be calculated as

F q (x) = x(R HD − R FD) + R FD − q[x(P half − P full) + P full], where x ~ [max(b1, b2), min(b3, b4)] ⊂ (0, 1), b1 is taken from \( {R}_{\mathrm{Feeder}}^{\mathrm{HD},k}\ge {R}_{\mathrm{Access}}^{\mathrm{HD},k} \), b2 is got from \( {R}_{\mathrm{Feeder}}^{\mathrm{FD},k}\ge {R}_{\mathrm{Access}}^{\mathrm{FD},k} \) (obviously, b 2 = max(b 1, b 2)), \( b3=\frac{\overline{\eta_{\mathrm{SE}}}-{R}_{\mathrm{FD}}}{R_{\mathrm{HD}}-{R}_{\mathrm{FD}}} \), and \( b4={e}^{-\frac{a}{\overline{P_{\mathrm{B}}}}} \). From the above description, the objective function can be expressed as

$$ {F}_q(x)=bx+\frac{2c-cx}{ \ln x}+d, $$
(23)

where \( b={R}_{\mathrm{HD}}-{R}_{\mathrm{FD}}+\frac{1}{2}q{a}_{\mathrm{R}}{\displaystyle {\sum}_{k=1}^K{P}_{{\mathrm{R}}_k}},\ c=\frac{1}{2}aq{a}_{\mathrm{B}},\ d={R}_{\mathrm{FD}}-q\left({P}_{\mathrm{C}}+{a}_{\mathrm{R}}{\displaystyle {\sum}_{k=1}^K{P}_{{\mathrm{R}}_k}}\right) \). The first-order derivative of the objective function is \( {F}_q^{\hbox{'}}(x)=b+\frac{g(x)}{x{\left( \ln x\right)}^2} \), where g(x) = − cx ln x + cx − 2c, g '(x) = − c ln x > 0. Obviously, g(x) < 0.

In order to determine whether the first-order derivative of the objective function is positive or negative, we discuss it in the following two cases.

Case 1: b < 0

The first-order derivative of the objective function \( {F}_q^{\hbox{'}}(x) \) will always be less than zero. Hence,

$$ {F}_q{(x)}_{\max }={F}_q(b2),{P}_{\mathrm{B}}^{\hat{\mkern6mu} }=\frac{P_{{\mathrm{R}}_k}.k1.\left({P}_{{\mathrm{R}}_k}{\left|{H}_{{\mathrm{L}}_k}^{\hbox{'}}\right|}^2+{\displaystyle {\sum}_{l=1,l\ne k}^K{P}_{{\mathrm{R}}_l}{\left|{H}_{{\mathrm{R}}_{l,k}}\right|}^2}+{\sigma}^2\right)}{{\left|{H}_{{\mathrm{SR}}_k}\right|}^2}, $$
(24)

where \( k1=\frac{{\left|{H}_{{\mathrm{R}\mathrm{M}}_k}\right|}^2}{{\displaystyle {\sum}_{j=1,j\ne k}^K{P}_{{\mathrm{R}}_j}{\left|{H}_{{\mathrm{R}}_j{\mathrm{M}}_k}\right|}^2+{\sigma}^2}} \).

Case 2: b > 0

The first-order derivative of the objective function is \( {F}_q^{\hbox{'}}(x)=\frac{bx{\left( \ln x\right)}^2-cx \ln x+cx-2c}{x{\left( \ln x\right)}^2} \). Let y(x) = − cx ln x + cx + bx(ln x)2 − 2c, y '(x) = b(ln x)2 + (2b − c)ln x. We can get y '(e t) = bt 2 + (2b − c)t = 0, t = ln x, t < 0. The solution to it is \( {t}_1=0,\kern0.01em {t}_2=\frac{c-2b}{b}. \). Next, we will discuss it under two different conditions:

  1. 1)

    t 2 > 0. We have y '(x) > 0 and \( \underset{x\to 1}{ \lim }y(x)=-c<0 \). Hence, F '(x) < 0, F q (x)max = F q (b2).

  2. 2)

    \( {t}_2<0.\ y{(x)}_{\max }=\left(4b-c\right){e}^{\frac{c-2b}{b}}-2c \).

The above discussion of the positive/negative property of y(x)max is summarized below:

$$ \begin{array}{l}y{(x)}_{\max}\le 0,{F}_q{(x)}_{\max }={F}_q(b2)\\ {}y{(x)}_{\max }>0,\underset{x\to 0}{ \lim }y(x)=-2c,\underset{x\to 1}{ \lim }y(x)=-c.\ let\ y\left({x}_0\right)=0\\ {}\kern4.75em \left\{\begin{array}{l}{F}_q{(x)}_{\max }= \max \left({F}_q\left({x}_0\right),{F}_q(b2)\right),{P}_{\mathrm{B}}^{*}= \min \left({P}_{\mathrm{B}}^{\hat{\mkern6mu} },-\frac{a}{ \ln {x}_0}\right)\\ {}{F}_q{(x)}_{\max }= \max \left({F}_q\left( \min \left(b3,b4\right)\right),{F}_q(b2)\right),{P}_{\mathrm{B}}^{*}= \min \left({P}_{\mathrm{B}}^{\hat{\mkern6mu} },-\frac{a}{\operatorname{lnmin}\left(b3,b4\right)}\right)\end{array}\right.\end{array} $$
(25)

2.2 Optimizing over \( {P}_{{\mathrm{R}}_k} \) with a fixed P B

Given a fixed P B, our target is to seek the optimal transmission power \( {P}_{{\mathrm{R}}_k} \) to maximize the system energy efficiency while satisfying the above-mentioned constraints. The problem (P) can be rewritten as

$$ \begin{array}{l} \max \kern0.75em {\eta}_{\mathrm{SE}}-q{P}_{\mathrm{T}}\\ {}\kern0.5em \mathrm{s}.\mathrm{t}.\kern0.75em {\eta}_{\mathrm{SE}}\ge \overline{\eta_{\mathrm{SE}}}\\ {}\ 0\le {P}_{{\mathrm{R}}_k}\le \overline{P_{{\mathrm{R}}_k}},\kern1em k=1,2,\dots, K\\ {}\kern1.0em {R}_{\mathrm{Feeder}}^k\ge {R}_{\mathrm{Access}}^k\end{array} $$
(26)

With a fixed P B, the sub-problem Eq. (26) is a convex optimization problem for a given q. We now prove the convexity. For a given q, the transformed subtractive form can be written as η SE − qP T. The first part of the subtractive form is η SE which is a concave function in \( {P}_{{\mathrm{R}}_k} \) and the second part is an affine function. Since the sum of a concave function and an affine function is also a concave function, the proof of convexity is complete. Therefore, we can derive the optimal solution in an effective way. The partial Lagrangian function of the sub-problem Eq. (26) can be expressed as

$$ {L}_{\mathrm{EE}}\left({P}_{{\mathrm{R}}_k}\right)={\eta}_{\mathrm{SE}}-q{P}_{\mathrm{T}}+{\lambda}_1\left({\eta}_{\mathrm{SE}}-\overline{\eta_{\mathrm{SE}}}\right), $$
(27)

where λ 1 is the Lagrangian multiplier associated with the inequality constraints and is required to be non-negative. By using the Karush-Kuhn-Tucker (KKT) conditions, it is required that the first-order derivative of L EE with respect to \( {P}_{{\mathrm{R}}_k} \) be zero, i.e.,

$$ \frac{\partial {L}_{\mathrm{EE}}}{\partial {P}_{{\mathrm{R}}_k}}=\left(1+{\lambda}_1\right)\frac{\partial {\eta}_{\mathrm{SE}}}{\partial {P}_{{\mathrm{R}}_k}}-q\frac{\partial {P}_{\mathrm{T}}}{\partial {P}_{{\mathrm{R}}_k}}=0. $$

We can obtain

$$ {P}_{{\mathrm{R}}_k}^{\hat{\mkern6mu} }={\left[\frac{1}{ \ln 2*\theta }-\frac{1}{k1}\right]}^{+} $$
(28)

where \( \theta =\frac{q{a}_{\mathrm{R}}}{1+{\lambda}_1} \), []+ guarantees \( {P}_{{\mathrm{R}}_k}\ge 0 \).

Note that the sub-optimal transmit power of \( {P}_{{\mathrm{R}}_k} \) given by Eq. (28) is similar to customized water-filling solutions, where the heights of the pool are defined as 1/k1 and the water level is partially determined by λ 1 and q. Because \( {P}_{{\mathrm{R}}_k} \) involves the Lagrangian multiplier λ 1 and the KKT conditions require λ 1 to be non-negative, we derive λ 1 in the following two cases:

Case 1: λ 1 > 0

In this case, from the Karush-Kuhn-Tucker (KKT) conditions, we have \( {\eta}_{\mathrm{SE}}=\overline{\eta_{\mathrm{SE}}} \). We can get \( {\lambda}_1=\frac{ \ln 2*q*{a}_{\mathrm{R}}}{2^a}-1 \), where \( a=\frac{{\displaystyle \sum_{k=1}^K{ \log}_2k1-\frac{2\overline{\eta_{\mathrm{SE}}}}{2-x}}}{K} \). Substituting this positive root back yields the sub-optimal solution of the sub-problem Eq. (26) as

$$ {P}_{{\mathrm{R}}_k}^{\hat{\mkern6mu} }={\left[\frac{1}{2^a}-\frac{1}{k1}\right]}^{+} $$
(29)

Case 2: λ 1 = 0

In this case,

$$ {P}_{{\mathrm{R}}_k}^{\hat{\mkern6mu} }={\left[\frac{1}{ \ln 2*q*{a}_{\mathrm{R}}}-\frac{1}{k1}\right]}^{+}. $$
(30)

Note that, this case may break the SE constraint. If this happens, the solution Eq. (29) should be adopted. Otherwise, both Eqs. (29) and (30) are feasible and the one leading to a greater EE is the final solution. In addition, we need to guarantee that the information rate of the feeder link is no lower than that of the access link, i.e., \( {R}_{\mathrm{Feeder}}^k\ge {R}_{\mathrm{Access}}^k \). Obviously, the condition to guarantee it for FD is more strict than that for HD. Hence, we only consider the condition \( {R}_{\mathrm{Feeder}}^{\mathrm{FD},k}\ge {R}_{\mathrm{Access}}^{\mathrm{FD},k} \). In this case,

$$ {P}_{{\mathrm{R}}_k}\le \frac{-{k}_1c+\sqrt{{k_1}^2{c}^2+4{k}_1{P}_{\mathrm{B}}{\left|{H}_{{\mathrm{L}}_k}\right|}^2{\left|{H}_{{\mathrm{SR}}_k}\right|}^2}}{2{k}_1{\left|{H}_{{\mathrm{L}}_k}\right|}^2}={P}_{{\mathrm{R}}_k}^{\hat{\mkern6mu} }, $$
(31)

where \( c={\displaystyle {\sum}_{l=1,l\ne k}^K{P}_{{\mathrm{R}}_l}{\left|{H}_{R_{l,k}}\right|}^2}+{\sigma}^2 \). Considering the transmit power constraints, the final solution can be expressed as

$$ {P}_{{\mathrm{R}}_k}^{*}= \min \left({P}_{{\mathrm{R}}_k}^{\hat{\mkern6mu} },\overline{P_{{\mathrm{R}}_k}}\right). $$
(32)

2.3 Iterative optimization algorithm

In the alternate optimization method, the basic idea is that the sub-problem Eq. (22) and the sub-problem Eq. (26) are solved in turn to reach the solution to the primal problem (P). For clarity, the detailed procedure of the iterative optimization algorithm is listed in Table 1. From Table 1, in each iteration l, we first set the optimal solution to the sub-problem Eq. (26) in the previous iteration l − 1 as the initial value of the sub-problem Eq. (22). Then, we solve the sub-problem Eq. (22) and output the optimal solution P B in Eq. (22) as the initial value of the sub-problem Eq. (26). The process is repeated until convergence is attained, i.e., |q (k) − q (k − 1)| is smaller than a predefined threshold ε 0.

Table 1 Iterative optimization algorithm

3 Simulation results

The MATLAB tool is adopted to simulate the energy efficiency of the relay-aided cellular network under various system parameters and verify the effectiveness of the hybrid scheme. Here, the cell bandwidth has been normalized to 1 Hz. Some simulation parameters are specified in Table 2.

Table 2 Simulation parameters

The distance between the BS and a RS is set to d RS = 0.7R in the cell. The distance between the RS and a MU is set to d ∼ U[0, 0.3R], and the interference distance among the RSs is set to d ∼ U[0.5R, 1.7R], where U[] represents the uniform distribution.

As shown in Fig. 2a, under perfect SI cancellation, the hybrid scheme approaches to the FD mode. That is to say, the probability of hybrid transmission selecting HD will be very low. With the increase of SI, the performance of FD begins to deteriorate, and the hybrid scheme moves toward HD. Under poor SI cancellation, the hybrid scheme tends to the HD mode. Hence, the probability of hybrid transmission selecting HD will be close to 1.

Fig. 2
figure 2

Probability of hybrid transmission select HD and energy efficiency versus self-interference channel gain when b B = 120 W, b R = 22 W

Figure 2b shows the effect of SI channel gain on energy efficiency. It is clear that energy efficiency in the HD mode is a fixed value due to no SI. Note that energy efficiency is defined as the ratio of spectral efficiency to total power consumption. Under perfect SI cancellation, because the throughput in the FD mode is twice of that in the HD mode, the FD mode can achieve higher energy efficiency than the HD mode. Combining Figs. 2a, b, under perfect SI cancellation, the hybrid schemes adopts the FD mode basically, and takes the throughput advantage of FD to achieve higher energy efficiency than the HD mode, which is close to that of the FD mode. With the increase of SI, the performance of FD starts to deteriorate, but the energy efficiency in the hybrid scheme is between those of the HD and the FD schemes. Nearly in G = −45 dB, the energy efficiency in the FD mode starts to be less than the energy efficiency in the HD mode. At this point, because the probability of hybrid transmission selecting HD is greater than that selecting FD, the throughput of the system reduces. However, the total power consumption does not reduce obviously, which leads to the fact that the energy efficiency in the hybrid scheme is less than that in the HD or FD mode. With the increase of SI channel gain, the energy efficiency in the FD mode is lower than that in the hybrid scheme. While the SI becomes large, due to the strong SI of FD, it cannot work normally. Hence, the energy efficiency in the FD mode is close to 0, and the probability of hybrid transmission selecting HD is close to 1. Even in the case of strong SI, the energy efficiency in the hybrid scheme can be much better than that in the FD mode.

In Fig. 2a, b, the static power consumption of the BS and the RSs (b B and b R) are set to 120 and 22 W, respectively. Next, we will show the effect of SI channel gain on energy efficiency under the condition of b B = 10 W and b R = 1 W, and the other parameters are the same as before, as shown in Figs. 3a, b.

Fig. 3
figure 3

Probability of hybrid transmission select HD and energy efficiency versus self-interference channel gain when b B = 10 W, b R = 1 W

Obviously, the static power consumption reduces and the energy efficiency of the system increases. However, the trend of energy efficiency versus SI channel gain does not change.

Figure 4a, b shows the influence of the cell radius on the energy efficiency under the conditions of G = − 20 dB and G = − 60 dB, respectively. Taking the actual scenario into account, the cell radius varies from 700 to 1500 m. From Figs. 4a, b we can see: in the case of G = − 20 dB, the energy efficiency in the FD mode is lower than that in the HD mode. In the case of G = − 60 dB, the energy efficiency in the FD mode is higher than that in the HD mode. The energy efficiency in the hybrid scheme is between those in the HD and the FD modes. Comparing Fig. 4a and Fig. 4b, in the same cell radius, since there is no SI in HD, the energy efficiency is the same; in the case of G = − 60 dB, the energy efficiency in the FD mode and the hybrid scheme are higher than those in the case of G = − 20 dB, respectively. In addition, with the increase of cell radius, the energy efficiency of the system gradually reduces in the three transmission modes.

Fig. 4
figure 4

Energy efficiency versus the radius of the cell

Figure 5 illustrates the convergence of the iterative optimization algorithm in the condition of G = − 60 dB and the other parameters are the same as before. It can be seen that the proposed algorithm can converge quickly.

Fig. 5
figure 5

Energy efficiency in iteration

4 Conclusions

In this paper, a hybrid HD/FD transmission scheme in a relay-aided cellular network has been presented, taking SI cancellation into account. An optimization problem was formulated to maximize the system energy efficiency under the system spectral efficiency constraint as well as the transmit power constraints of BS and RSs. The alternate optimization method was used to solve the optimization problem and obtain the optimal transmit powers. Simulation results showed that the hybrid scheme not only takes the throughput advantage of FD, but also weakens the negative effect of SI.