1 Introduction

1.1 Background and motivation

Cognitive radio (CR) [1] has been widely recognized as a critical technique to mitigate the spectrum scarcity problem and enhance the overall efficiency of spectrum usage, aiming to accommodate for the evolution of wireless systems towards 5G [2]. In a CR network (CRN), unlicensed secondary users (SUs) are allowed to opportunistically access the spectrum allocated to licensed or primary users (PUs) without interfering with the coexisting PUs. That is, the SUs do not violate the quality of service (QoS) requirements of the PUs. Most of the existing research efforts in CRNs mainly focus on the issues of the physical and media access control (MAC) layers for an infrastructure-based single-hop scenario, including spectrum sensing, spectrum access, and sharing techniques [3,4,5]. In addition, SUs can also form a multi-hop decentralized ad hoc network without the support of infrastructure. In a multi-hop cognitive radio ad hoc network (CRANET) [6], SU can access the licensed spectrum by seeking to overlay, underlay, or interweave its signal with the existing PUs’ signals [7]. For the underlay approach, SUs are permitted to concurrently share the licensed spectrum with PUs while guaranteeing the power of interference and noise at the PU not beyond the interference temperature limit. In this context, the interference caused by the SUs should be controlled and mitigated through effective power control strategies. Many studies on power control for CRNs have been reported from different perspectives, such as imperfect channel knowledge [8], arbitrary input distributions [9], and social utility maximization [10].

In comparison with the lower layer solutions as stated before, recent work indicates that there are many new challenges towards routing problem at the network layer in multi-hop CRANETs, aiming to give more insights into the impact of spectrum uncertainty on routing strategies [11, 12]. However, the constraints and challenges with regard to SUs including random mobility, low deployment density, and limited resource along with discontinuous spectrum availability will give rise to intermittent connectivity of links among SUs in a decentralized CRANET [13]. Clearly, stochastic link outage further has a bearing on the successful transmission of data packets between a pair of SUs. To describe effective continuous transmission of SUs, the paradigm contact has been presented from different types [14], e.g., persistent contact, on-demand contact, and scheduled contact. Conceptually, a contact can be defined as a communication opportunity during which two adjacent SUs can communicate with each other. In a scheduled contact-based CRANET, multiple contacts or the set of communication opportunities can be easily derived from the statistical data of a priori available contact. In this case, the scheduled contact can be predicted and calculated accurately.

Similar to a wireline Internet or most other traditional wireless networks, network congestion in CRNs will also occur when the offered data load exceeds the available capacity of SU due to buffer overflow caused by accumulated data packets injected from upstream SUs. This therefore leads to aggressive retransmission, queuing delay, and blocking of new flows from upstream SUs. Indubitably, congestion control policy in the transport layer is essential to balance resource load and to avoid excessive congestion. However, the conventional Transmission Control Protocol (TCP) as a congestion control mechanism via window-based or acknowledgement-triggered methods is initially designed and optimized to perform in reliable wired links with constrained bit error rates (BERs) and round trip times (RTTs) [15]. Recent study by [16] has reported that the performance of HTTP download deteriorates as much as 40% under the TCP window control in an IEEE P1900.4-based cognitive wireless system using User Datagram Protocol (UDP) and TCP transport protocols. Alternatively, to accommodate for the challenging multi-hop wireless environments, some other research efforts about congestion control techniques have been conducted from the perspective of finding methods to modify TCP protocol [17]. Unfortunately, it has been also shown that these schemes of TCP modifications and extensions cannot be applied into CRANETs because of sudden large-scale bandwidth fluctuation and periodic interruption caused by spectrum sensing and channel switching [18, 19].

It is also noted that the TCP congestion control is targeted to regulate the data rate of upstream SUs so that the total accumulated data load does not exceed the available capacity of SU. In principle, the link capacity between a pair of SUs depend strongly on transmit power of SU coupled with wireless channel conditions [20]. By leveraging the congestion control technique, on the one hand, the attainable data rate on a wireless link between a pair of SUs depends on the interference level, which in turn rests on power control policy. On the other hand, each SU is expected to increase its transmit power in order to obtain as much link capacity that each flow requires [21]. However, increasing the link capacity on one link may reduce the link capacities on other links owing to mutual interference of SUs. From the above discussions, we can see that jointly optimizing transmit power in the physical layer and data rate in the transport layer for attaining the optimal link capacity becomes highly valuable. With a joint cross-layer design, the physical layer is able to share its information and configuration about optimal transmit power with the transport layer without breaking the hierarchical structure of the traditional layered architecture [22]. This motivates us to reinvestigate the cross-layer coupling between capacity supply by power control and rate demand by rate control.

1.2 Related works

Congestion control in wireless multi-hop networks has been widely discussed via the NUM optimization problem maximizing the total utility, subject to some different constraints including the efficiency and fairness of resource allocation [22], heterogeneous traffic [22], lossy link [23], and multipath transmission [24]. Under the condition of outage probability caused by lossy links, another work [25] investigates the rate-effective network utility maximization problem to meet with delay-constrained data traffic requirement. However, although all of the aforementioned studies consider some realistic constraints, they apply to the traditional wireless multi-hop networks only and do not consider the spectrum uncertainty in CRNs. To the best of our knowledge, some studies on congestion control for CRNs have been reported recently, although the mainstream research effort is aimed at the problems of the physical and MAC layers. In [26], Xiao et al. developed a robust active queue management scheme to stabilize the TCP queue length at base station in an infrastructure-based CRN. By using the multiple model predictive control, the proposed scheme absorbed the disturbances caused by busty background traffic and capacity variation. It is found that [26] is not suitable for decentralized CRANET scenario due to a lack of centralized control and global information. Unlike the condition of infrastructure-based CRN, other studies in [18, 27, 28] have been undertaken in multi-hop CRN scenarios. In [27], Song et al. proposed an end-to-end congestion control framework without the aid of common control channel by taking into account the non-uniform channel availability. The explicit feedback mechanism without timeouts and the timeout mechanism were also investigated. In [28], Zhong et al. presented a TCP network coding dynamic generation size adjust scheme by jointly considering network coding gain and delay. The proposed scheme can significantly reduce the retransmissions and guarantee the QoS and enhance the TCP performance. In [18], Al-Ali et al. proposed an end-to-end equation-based TCP friendly rate control mechanism, which achieves rate adjustment by identifying network congestion. However, the end-to-end control policy in [18, 27] is ill suited for operation over wireless transmission links characterized by higher RTTs, particularly if the links present the feature of intermittent connectivity in CRANET under predictable contact. On the contrary, the hop-by-hop control reacts to congestion faster where the rates are adjusted at upstream nodes by feedback information about the congestion state of intermediate nodes.

Other recent schemes that exploit the cross-layer interaction information try to deal with congestion control problem in decentralized CRNs from a cross-layer design perspective. The objective of these schemes is to improve the overall network utility while protecting active PUs’ communications from excessive interference introduced by SUs. In [29], Cammarano et al. presented a distributed cross-layer framework for joint optimization of MAC, scheduling, routing, and congestion control in CRAHNs, by maximizing the throughput of a set of multi-hop end-to-end packet flows. However, similar to [18, 27], it is not clear how good the performance of the end-to-end rate control is compared under a wireless transmission environment with higher RTTs. In [30], Nguyen et al. proposed a cross-layer framework to jointly attain both congestion and power control in OFDM-based CRNs through nonconvex optimization method. By means of the adaptation of dual decomposition technique also used by [20], the distributed algorithm was developed to obtain the global optimization. In [31], Nguyen et al. further devised an optimization framework achieving trade-off between energy efficiency and network utility maximization for CRAHNs. By adjusting transmit power, persistence probability, and data rate simultaneously via the interaction between MAC and other layers, the proposed framework can jointly balance interference, collision, and congestion among SUs However, both of the frameworks in [30, 31] fail to take into account the impact of predictable contact or priori available contact between any pair of SUs on overall cross-layer performance.

1.3 Our approach and contributions

Our work in this paper mainly focuses on a decentralized CRANET under predictable contact in that it is easy to obtain the set of communication opportunities derived from the statistical available contacts among SUs. Owing to a lack of global information to achieve the centralized schedule, the cross-layer coupling between capacity supply by power control and rate demand by rate control needs to be carried out distributively by each SU via local information. Distributed implementation for power control and rate control depends on interactive processes among competitive SUs to figure out the cross-layer coupling relationships. Moreover, the objectives of SUs to maximize their utility functions are conflicting and their decisions are interactive. Apparently, it will be far more realistic to dynamically adjust transmit power and data rate according to the current instant time in the practical dynamic environment. The reason for adopting a differential game model rather than other decentralized optimization approaches is that the differential game is a continuous time dynamic game to investigate interactive decision making over time. In a differential game, the interactions among individual players are characterized by time dependency. This is in line with the nature of dynamic spectrum environment in practical CRANET scenario. Therefore, motivated by cross-layer coupling between capacity supply by power control and rate demand by rate control, we present a cross-layer optimization framework for CRANET under predictable contact by achieving the joint congestion and power control using a differential game theoretic approach. The main contributions of this paper are summarized as follows:

  • To measure the uncertainty of contact between a pair of SUs, a predictable contact model is presented by deriving the probability distribution of contact via a mathematical statistics theory. By using Shannon entropy theory, we further devise an entropy paradigm to characterize quantitatively the probability distribution of contact.

  • We propose a distributed cross-layer optimization framework for hop-by-hop congestion control (HHCC) and per-link power control (PLPC) for upstream SUs. The HHCC and the PLPC problems are formulated as two noncooperative differential game models, by taking into account the utility function maximization and linear differential equation constraint with regard to the aggregate power interference to PUs and congestion bid for bottleneck SU.

  • We convert the noncooperative differential game models for the PLPC and the HHCC problems into two dynamic optimization problems. By adopting dynamic programming and maximum principle, we obtain the optimal transmit power and the optimal data rate of upstream SUs, respectively. The cross-layer optimization framework is implemented in a distributed manner through the cross-layer coordination mechanism between capacity supply by power controller and rate demand by rate controller.

1.4 Organization and notation

The rest of this paper is organized as follows. We firstly describe the system model in Section 2. Then, the problem formulation is presented in Section 3. In Section 4, we derive the optimal solutions to the proposed noncooperative differential game models and propose the distributed implementation approach to construct the cross-layer optimization framework. Simulation results are provided in Section 5, followed by the conclusions in Section 6.

Notation: \( \mathcal{A} \) denotes a set, and \( \left|\mathcal{A}\right| \) denotes the cardinality for any set \( \mathcal{A} \). We use a boldface capital to denote vector A to discriminate vectors from scalar quantities. | ⋅ | and ‖⋅‖ represent the absolute value of a polynomial function and the Euclidean distance between the pair of variables, respectively. \( \mathbb{E}\left[\cdot \right] \) stands for the statistical expectation operator.

2 System model

2.1 Network model

Consider an underlay multi-hop CRANET coexisting with a cellular primary network as depicted in Fig. 1, wherein PUs can send their data traffic to a primary base station (PBS) via the licensed uplink channels. We denote the set of uplink channels by ℋ = {ch1, ch2, ⋯, ch ϕ } where ϕ is the number of uplink channels. The uplink channel is either occupied by PUs or unoccupied. We employ the independent and identically distributed alternating ON-OFF process to model the occupation time length of PUs in uplink channels. Specifically, the OFF state indicates the idle state where the uplink channels can be freely occupied by SUs. By performing spectrum sensing on all the uplink channels periodically, S SUs leverage the OFF state to access the unoccupied uplink channels by PUs. Let \( \mathcal{V}=\left\{{v}_1,{v}_2,\cdots, {v}_S\right\} \) refer to the set of S SUs. Each SU is equipped with two radio transceivers. One with a cognitive radio is used to opportunistically access the uplink channels for transmissions of data packets. The other is used for exchange of control signaling. Due to the randomness of data traffic and the dynamic behavior of PUs, we assume that the licensed uplink channels are available for usage by SU v i with a probability of δ i , for \( {v}_i\in \mathcal{V} \). Based on the aforementioned ON-OFF process, the occupancy probability of uplink channel ch ξ by PUs is defined as α ξ /(α ξ  + β ξ ) [32], where α ξ is a probability that uplink channel ξ transits from OFF to ON state, and β ξ is probability that uplink channel ξ transits from ON to OFF state, for ξ = 1, 2, ⋯, ϕ. It is assumed that SUs can determine the occupancy probability of uplink channels by PUs through a priori knowledge of PUs’ activities and local spectrum sensing. Owing to the mutually independent occupancy probability of uplink channel ch ξ , the probability δ i of uplink channels used by SU v i can be expressed as:

$$ {\delta}_i=\prod \limits_{\xi =1}^{\phi}\left(1-\frac{\alpha_{\xi }}{\alpha_{\xi }+{\beta}_{\xi }}\right). $$
(1)
Fig. 1
figure 1

Network model of the underlay multi-hop CRANET where SUs share uplink channels with PUs

Different from the assumption that time is divided into fixed time slots in a discrete way, we exploit the continuous time model to represent the operation duration of the CRANET. The continuous-time operation is confined to a predefined time interval [t0, T]. Use ϑ i (t) and z(t) to denote the positions of SU v i , PU z at time t ∈ [t0, T], respectively. Let R T and R I stand for the maximum transmission range of SU and the interference range of PU, respectively. Without interference with PUs, a pair of SU v i and SU v j can successfully communicate with each other on channel ch ξ at time t only if the Euclidean distance between SU v i and SU v j satisfies ‖ϑ i (t) − ϑ j (t)‖ ≤ R T and when there is no any PU z on channel ch ξ , i.e., ‖ϑ i (t) − z(t)‖ > R I and ‖ϑ j (t) − z(t)‖ > R I , for \( {v}_i,{v}_j\in \mathcal{V} \) and ξ = 1, 2, ⋯, ϕ. In this context, there exists a successful transmission link denoted by l(i, j) from SU v i to SU v j on channel ch ξ at time t. For the sake of conciseness, instant time t will be restricted to the time interval [t0, T] henceforth.

Under the constraint of successful transmission links, we assume that there are multiple different sessions from source SUs to destination SUs. Each session is associated with a route from a source SU to a destination SU. Figure 2 illustrates an example of logical topology of the underlay multi-hop CRANET, where a series of red solid line denote a session along a route from source SU1 to destination SU5, which is one of the different routes. It is assumed that a session consists of several per-link flows with elastic traffic. We use the term per-link flow to describe a sequence of data packets with elastic traffic transmitted along a successful transmission link. With regard to the route from source SU1 to destination SU5, data packets of a flow enter upstream SU2, travel via single hop, then converge at bottleneck SU3 and, finally, move to downstream SU4. We focus on a scenario that multiple different sessions converge at bottleneck SU, aiming to reinvestigate the cross-layer coupling between capacity supply by power control and rate demand by rate control at upstream SUs. From Fig. 2, the convergence of multiple flows from upstream SU2, SU6, and SU8 via single hop may result in a possible congestion at bottleneck SU3 when the offered data load exceeds the available capacity of SU3 due to a buffer overflow, although the amount of data packets has been delivered to downstream SU4, SU7, and SU9. We assume that there are N flows of elastic traffic along the successful transmission links from N upstream SUs to bottleneck SU v b via single hop, for \( {v}_b\in \mathcal{V} \) and N < S. Let \( {\mathcal{V}}_{UP} \) and \( \mathcal{N}=\left\{1,2,\cdots, N\right\} \) represent the set of N upstream SUs and the set of flows of elastic traffic from N upstream SUs to bottleneck SU v b , for \( {\mathcal{V}}_{UP}\subset \mathcal{V} \). For notational simplicity, the flow of elastic traffic along link l(i, b) from upstream SU v i to bottleneck SU v b is described by flow i, for \( i\in \mathcal{N} \) and \( {v}_i\in {\mathcal{V}}_{UP} \). Assuming that flow i of elastic traffic along link l(i, b) arrives as a Poisson process of flow arrival intensity λ i with a size drawn independently from a common distribution of mean \( \mathbb{E}\left[{\lambda}_i\right] \) [33]. When Ψ(i, b) < 1, the transmission link load, denoted by Ψ(i, b), induced by elastic traffic along link l(i, b) is equal to [33]:

Fig. 2
figure 2

Logical topology of the underlay multi-hop CRANET

$$ {\varPsi}_{\left(i,b\right)}=\frac{\lambda_i\times \mathbb{E}\left[{\lambda}_i\right]}{C_{\left(i,b\right)}\left(\mathbf{P}\right)}, $$
(2)

where C(i, b)(P) denotes the capacity of link l(i, b), and P = {p1(t), p2(t), ⋯, p N (t)} corresponds to the transmit power vector of N upstream SUs at time t. Here, we use p i (t) to represent the instant transmit power of upstream SU v i at time t. Noticing that the transmit power p i (t) of upstream SU v i can be adjusted in a continuous way but is also limited by a maximum transmit power threshold denoted by \( {\overline{p}}_i \), i.e., \( {p}_i(t)\in \left[0,{\overline{p}}_i\right) \). Based on elastic traffic model for each flow, the expected duration D i of flow i with size \( \mathbb{E}\left[{\lambda}_i\right] \) is given by [33]:

$$ {D}_i=\frac{\mathbb{E}\left[{\lambda}_i\right]}{C_{\left(i,b\right)}\left(\mathbf{P}\right)\times \left(1-{\varPsi}_{\left(i,b\right)}\right)}. $$
(3)

Under spectrum underlay scenario, SUs can simultaneously transmit with PUs but have to strictly control their transmit power to avoid interfering with coexisting PUs. Note that the simultaneous transmissions among SUs along a successful transmission link must be undertaken on the same channel, which will further incur the co-channel multiple access interference. We assume that the simultaneous transmissions among N SUs along a successful transmission link on channel ch ξ can be undertaken under the CDMA-based medium access in the physical layer [34]. The reason for adopting the CDMA-based medium access model is that transmit power of upstream SU can be controlled to induce a different signal-to-interference-plus-noise ratio (SINR) of successful transmission link due to a co-channel multiple-access interference [20, 35, 36]. In principle, link capacity under this scenario cannot remain fixed but depends on SINR of successful transmission link between a pair of SUs. Let SINR(i, b)(P) be the received SINR of bottleneck SU v b along link l(i, b) on channel ch ξ . Therefore, the capacity of link l(i, b) from upstream SU v i to bottleneck SU v b can be characterized by a global and nonlinear nonconvex function of the transmit power vector and channel conditions as follows [20]:

$$ {C}_{\left(i,b\right)}\left(\mathbf{P}\right)=\frac{1}{T_s}{\log}_2\left(1+\chi \cdot {\mathrm{SINR}}_{\left(i,b\right)}\left(\mathbf{P}\right)\right), $$
(4)

where Τ s is a symbol period and χ =  − φ1/log2(φ2 ⋅ BER) is a constant processing gain factor with φ1 and φ2 depending upon an acceptable BER along with the specific modulation and coding scheme. We assume that a large-scale slow-fading channel model is adopted to describe the line-of-sight wireless transmission environment. In this case, channel gain is subject to distance-dependent power attenuation or log-normal shadowing. As for the practical non-line-of-sight scenario, we use a Rayleigh fading model, in which the channel gain is assumed to be independent exponentially distributed random variables with unit mean [37]. Let G(i, b) and F(i, b) denote the large-scale slow-fading and the Rayleigh fading channel gain of link l(i, b) from upstream SU v i to bottleneck SU v b , respectively. Thus, we have the normalized Rayleigh fading channel gain \( \mathbb{E}\left[{F}_{\left(i,b\right)}\right]=1 \). By using the certainty-equivalent transmit power and interference power [34, 37], the received SINR of link l(i, b) at bottleneck SU v b can be expressed as:

$$ {\mathrm{SINR}}_{\left(i,b\right)}\left(\mathbf{P}\right)=\frac{p_i(t){G}_{\left(i,b\right)}}{I_i+{I}_p+{n}_0}, $$
(5)

where n0 is the thermal noise power at bottleneck SU v b , I p is the interference caused by the PBS, and I i is the aggregate power interference introduced by other upstream SUs except upstream SU v i . The aggregate power interference is given by \( {I}_i={\sum}_{j\in \mathcal{N}\backslash i}{p}_j(t){G}_{\left(j,b\right)} \). In what follows, we are targeted at the line-of-sight wireless transmission environment with the large-scale slow-fading channel gain. How to apply the dynamic fast-fading or Rayleigh fading model under the non-line-of-sight scenario into the cross-layer optimization framework for CRANET will be our further work in the future. Under spectrum underlay scenario, the interference power constraint shall be imposed to protect active PUs’ communications from harmful interference caused by all the upstream SUs. We assume that the interference measurement point is located at bottleneck SU v b for convenience. Hence, the total interference caused by all the upstream SUs should be kept below the interference temperature limit ϖPBS at the interference measurement point of PBS:

$$ \sum \limits_{i\in \mathcal{N}}{p}_i(s){G}_{\left(i,b\right)}\le {\varpi}_{\mathrm{PBS}}. $$
(6)

2.2 Predictable contact model

Considering that a contact is viewed as a communication opportunity during which two adjacent SUs can communicate with each other, we move on to model the predictable contact between a pair of SUs from a priori available contact perspective. Based on the insight into successful transmission link l(i, j) as noticed earlier, an encounter e(i, j) is defined as an effective continuous transmission between SU v i and SU v j with a certain duration, for \( {v}_i,{v}_j\in \mathcal{V} \). It is worth pointing out that an encounter rests on the time of incidence and the duration of an effective continuous transmission between a pair of SUs [38]. Let t0, (i, j) and Δt(i, j) represent the time of incidence and the duration of an encounter e(i, j), respectively, for 0 < Δt(i, j) < T − t0. Therefore, an encounter e(i, j) between SU v i and SU v j can be formulated as:

$$ {e}^{\left(i,j\right)}=\left\{{v}_i,{v}_j,{t}^{0,\left(i,j\right)},\Delta {t}^{\left(i,j\right)}\right\}. $$
(7)

Suppose that there exist K encounters between SU v i and SU v j within a predefined time interval [t0, T]. In particular, the ℓth encounter \( {e}_{\mathrm{\ell}}^{\left(i,j\right)} \) between SU v i and SU v j with a duration \( \Delta {t}_{\mathrm{\ell}}^{\left(i,j\right)} \), for ℓ = 1, 2, ⋯, K, is given as:

$$ {e}_{\mathrm{\ell}}^{\left(i,j\right)}=\left\{{v}_i,{v}_j,{t}_{\mathrm{\ell}}^{0,\left(i,j\right)},\Delta {t}_{\mathrm{\ell}}^{\left(i,j\right)}\right\}. $$
(8)

where \( {t}_{\mathrm{\ell}}^{0,\left(i,j\right)} \) refers to the time of incidence of the ℓth encounter \( {e}_{\mathrm{\ell}}^{\left(i,j\right)} \). For mathematical tractability, we use the duration \( \Delta {t}_{\mathrm{\ell}}^{\left(i,j\right)} \) in (8) to characterize the ℓth encounter, i.e., \( {e}_{\mathrm{\ell}}^{\left(i,j\right)}\triangleq \Delta {t}_{\mathrm{\ell}}^{\left(i,j\right)} \). Thus, within time interval [t0, T], contact \( {\mathcal{C}}^{\left(i,j\right)} \) between SU v i and SU v j can be rigorously regarded as the set of all encounters, i.e., \( {\mathcal{C}}^{\left(i,j\right)}=\left\{{e}_1^{\left(i,j\right)},{e}_2^{\left(i,j\right)},\cdots, {e}_K^{\left(i,j\right)}\right\} \) and \( \left|{\mathcal{C}}^{\left(i,j\right)}\right|=K \). Note that the ℓth encounter \( {e}_{\mathrm{\ell}}^{\left(i,j\right)} \) in contact \( {\mathcal{C}}^{\left(i,j\right)} \) can be referred to a random variable due to the uncertainty of communication opportunity between SU v i and SU v j . In this way, we turn to employ a mathematical statistics theory to attain the probability distribution \( {\varUpsilon}^{\left(i,j\right)}=\left\{{\rho}_1^{\left(i,j\right)},{\rho}_2^{\left(i,j\right)},\cdots, {\rho}_M^{\left(i,j\right)}\right\} \) of contact \( {\mathcal{C}}^{\left(i,j\right)} \), which has been derived from Algorithm 1. In Algorithm 1, we introduce a coefficient M to denote the number of the subintervals, which is obtained by dividing interval [a, b] equally. According to the approximate derivation of the sample distribution in mathematical statistics, coefficient M should be reasonably assigned, depending on the number of encounters of K. That is, when K ≤ 100, coefficient M can range from 5 to 12. Obviously, it will be possible to measure the uncertainty of contact \( {\mathcal{C}}^{\left(i,j\right)} \) between SU v i and SU v j by the aid of the probability distribution ϒ(i, j). Thus, by analyzing the statistical data of a priori available contact or all encounters between a pair of SUs, it is implicitly understood that the contact can be in a sense predicted very accurately.

Table 1 summarizes the constrained relationship between the number of encounters within subinterval (dl − 1, d l ] and contact probability \( {\rho}_l^{\left(i,j\right)} \) in Algorithm 1, for l = 1, 2, ⋯, M. It is important to emphasize that the probability distribution ϒ(i, j) belongs to a complete probability distribution, i.e., \( {\sum}_{l=1}^M{\rho}_l^{\left(i,j\right)}=1 \). Technically, the entropy paradigm is widely used for a measure of the uncertainty or randomness associated with a random variable in information theory [39]. In order to characterize quantitatively the probability distribution ϒ(i, j), we put forward an entropy paradigm by using Shannon entropy theory to measure the uncertainty of contact \( {\mathcal{C}}^{\left(i,j\right)} \). Specifically, the entropy H(ϒ(i, j)) of the probability distribution ϒ(i, j) can be given as:

$$ H\left({\varUpsilon}^{\left(i,j\right)}\right)=-\sum \limits_{l=1}^M{\rho}_l^{\left(i,j\right)}{\log}_2{\rho}_l^{\left(i,j\right)}. $$
(9)
Table 1 Constrained relationship from algorithm 1

Based on Algorithm 1, it is obvious to find that coefficient M impacts the structure of the probability distribution ϒ(i, j). Thereby, the entropy H(ϒ(i, j)) will depend on the selection of coefficient M. Recall that the probability δ i of uplink channels used by SU v i determines the stability of successful transmission link l(i, j) or even the contact \( {\mathcal{C}}^{\left(i,j\right)} \) between SU v i and SU v j due to the impact of PUs’ activities on the licensed uplink channels. As such, we formally devise a contact affinity metric to describe the stability of the contact between a pair of SUs. Without losing generality, the contact affinity metric A(i, j) between SU v i and SU v j within time interval [t0, T] is formally expressed as:

figure a
$$ {A}_{\left(i,j\right)}=H\left({\varUpsilon}^{\left(i,j\right)}\right)\times {\delta}_i\times {\delta}_j. $$
(10)

3 Problem formulation

In this section, we intend to employ the differential game theoretic approach to formulate the PLPC problem in the physical layer and the HHCC problem in the transport layer. Clearly, the distributed strategy needs to be used to design the cross-layer optimization framework for congestion and power control due to the lack of centralized control and global information under an underlay CRANET scenario. As depicted in Fig. 3, each upstream SU will serve as power and rate controller in charge of joint optimized allocation of transmit power in the physical layer and data rate in the transport layer. Note that the change of power and rate will be continuous in time due to the fact that dynamic congestion and power control will be more realistic in a practical environment.

Fig. 3
figure 3

Illustration of cross-layer optimization framework for hop-by-hop congestion control and per-link power control

3.1 Per-link power control in the physical layer

Given the channel conditions, the capacity of successful transmission link between a pair of SUs is a nonconvex function of transmit power vector P. In fact, increasing the link capacity on one link may reduce the link capacities on other links because of the mutual interference caused by SUs [20]. Instead, each SU is expected to increase its transmit power to provide as much link capacity that per-link flow requires [21]. However, this adjustment of power will generate extra interference to other SUs. It is necessary to achieve an optimal per-link power allocation in the physical layer for upstream SUs to meet link capacity supply for all the flows. By letting the transmit power of upstream SU v i equals the maximum transmit power threshold, we can easily obtain the maximum transmit power vector \( \overline{\mathbf{P}}=\left\{{\overline{p}}_1,{\overline{p}}_2,\cdots, {\overline{p}}_N\right\} \). The transmission loss along link l(i, b) from upstream SU v i to bottleneck SU v b on channel ch ξ is denoted by η(i, b). Due to the line-of-sight wireless transmission environment with slow-fading channel model, the transmission loss along link l(i, b) is represented by η(i, b) = [c/(4πf ξ  ⋅ ‖ϑ i (t) − ϑ b (t)‖)]2, where f ξ is the carrier frequency operating on channel ch ξ and c is the speed of light. Therefore, the maximum transmit power threshold \( {\overline{p}}_i \) of upstream SU v i along link l(i, b) from upstream SU v i to bottleneck SU v b can be formulated as:

$$ {\overline{p}}_i=\frac{{\mathrm{p}}_{\left(i,b\right)}^{\mathrm{ref}}}{\eta_{\left(i,b\right)}}, $$
(11)

where \( {\mathrm{p}}_{\left(i,b\right)}^{\mathrm{ref}} \) is the received reference power at bottleneck SU v b along link l(i, b). Given the maximum transmit power threshold \( {\overline{p}}_i \), the capacity of link l(i, b) from upstream SU v i to bottleneck SU v b can be denoted by \( {C}_{\left(i,b\right)}^{\ast}\left(\overline{\mathbf{P}}\right) \). Owing to maximum transmit power threshold, the value of power reduction for upstream SU v i is equal to \( {\overline{p}}_i-{p}_i(t) \). Recall that bits-per-Joule capacity usually serves as a metric to measure the energy efficiency of a communication system [40]. Considering the impact of power reduction on energy efficiency with a link capacity constraint, energy efficiency for power reduction is formally defined as the power reduction achieved per capacity obtained under the maximum transmit power threshold. Thus, we plan to use the energy efficiency for power reduction to characterize a pricing factor of energy-per-capacity, aiming to design the revenue function of power reduction for upstream SU. Given the maximum transmit power vector \( \overline{\mathbf{P}} \), the pricing factor Φ i (t) of energy-per-capacity of upstream SU v i at time t can be formally defined as:

$$ {\displaystyle \begin{array}{l}{\varPhi}_i(t)=\frac{{\overline{p}}_i-{p}_i(t)}{C_{\left(i,b\right)}^{\ast}\left(\overline{\mathbf{P}}\right)}\\ {}\kern4em =\frac{{\overline{p}}_i-{p}_i(t)}{\frac{1}{T_s}{\log}_2\left(1+\chi \cdot \frac{{\overline{p}}_i{G}_{\left(i,b\right)}}{\sum \limits_{j\in \mathcal{N}\backslash i}{\overline{p}}_j{G}_{\left(j,b\right)}+{n}_b+{n}_0}\right)}.\end{array}} $$
(12)

Revisiting the pricing factor of energy-per-capacity of upstream SU v i , we define the revenue function of power reduction for upstream SU v i at time t by attaining the product of the pricing factor together with power reduction value, i.e., \( {\varPhi}_i(t)\left({\overline{p}}_i-{p}_i(t)\right) \). Let ω denote the pricing factor announced by upstream SU v i to measure the cost of the amount of aggregate power interference to PUs. The amount of aggregate power interference to PUs is denoted by I(t). The cost function of aggregate power interference to PUs for upstream SU v i at time t is given by ωI(t). Note that I(t) is a dynamic variable influenced by transmit power p i (t) of upstream SU v i and instant level I(t) within time interval [t0, T]. Thus, the aggregate power interference I(t) can be characterized as a linear differential equation given as:

$$ \left\{\begin{array}{l}\frac{\mathrm{d}I(t)}{\mathrm{d}t}=\sum \limits_{i\in \mathcal{N}}{p}_i(t)-\gamma I(t)\\ {}I\left({t}_0\right)={I}_{t_0}\end{array}\right., $$
(13)

where γ > 0 is a penalty factor of the amount of aggregate power interference and \( {I}_{t_0} \) is an initial aggregate power interference to PUs at time t0. Therefore, based on both revenue and cost functions as mentioned, the utility function \( {U}_1^i(t) \) of upstream SU v i at time t is given as:

$$ {U}_1^i(t)=\frac{{\overline{p}}_i-{p}_i(t)}{C_{\left(i,b\right)}^{\ast}\left(\overline{\mathbf{P}}\right)}\left({\overline{p}}_i-{p}_i(t)\right)-\omega I(t). $$
(14)

Note that utility function \( {U}_1^i(t) \) is a continuously differentiable function of p i (t) and I(t). We can find that utility function in (14) mainly relies on pricing factor of energy-per-capacity in that the marginal effect on utility function stems from aggregate power interference. As shown in Fig. 3, we figuratively define pricing factor Φ i (t) as a shadow price i which is a function of transmit power of upstream SU v i . Our optimization objective is to maximize utility function \( {U}_1^i(t) \) by choosing optimal transmit power \( {p}_i^{\#}(t) \) of upstream SU v i according to Φ i (t) and ω:

$$ \underset{p_i(t)}{\operatorname{Maximize}}:\kern1em {\int}_{t_0}^T\left(\frac{{\overline{p}}_i-{p}_i(t)}{C_{\left(i,b\right)}^{\ast}\left(\overline{\mathbf{P}}\right)}\left({\overline{p}}_i-{p}_i(t)\right)-\omega I(t)\right){e}^{-a\left(t-{t}_0\right)}\mathrm{d}t, $$
(15)

where a is a discount factor, for 0 < a < 1. Note that discount factor a is an exponential factor between 0 and 1 by which the future utility must be multiplied in order to obtain the present value under the underlying structure of differential game. Hence, utility function \( {U}_1^i(t) \) has to be discounted by the factor \( {e}^{-a\left(t-{t}_0\right)} \). Formally, the PLPC problem in the physical layer is formulated as a differential game model Γ PLPC :

$$ {\varGamma}_{PLPC}=\left\{\mathcal{N},{\left\{{p}_i(t)\right\}}_{i\in \mathcal{N}},I(t),{\left\{{U}_1^i(t)\right\}}_{i\in \mathcal{N}}\right\}, $$
(16)

where

  • Player set \( \mathcal{N} \): \( \mathcal{N}=\left\{1,2,\cdots, N\right\} \) is the set of all the upstream SUs in the PLPC problem as power controllers playing the game. Note that upstream SU v i stands for the ith player which is a rational policy maker and acts throughout time interval [t0, T].

  • Set of strategies \( {\left\{{p}_i(t)\right\}}_{i\in \mathcal{N}} \): The strategy of the ith player refers to its instant transmit power limited by the maximum transmit power threshold, i.e., \( {p}_i(t)\in \left[0,{\overline{p}}_i\right) \).

  • State variable I(t): The state variable of the ith player corresponds to the amount of aggregate power interference to PUs.

  • Set of utility functions \( {\left\{{U}_1^i(t)\right\}}_{i\in \mathcal{N}} \): \( {U}_1^i(t) \) is the utility function of the ith player. The objective of the ith player is to maximize its utility function by rationally selecting optimal strategy \( {p}_i^{\#}(t) \) and optimal state I#(t).

3.2 Hop-by-hop congestion control in the transport layer

Under the scenario that multiple flows from upstream SU2, SU6, and SU8 via single hop converge at bottleneck SU3 in Fig. 2, bottleneck SU3 is a little more inclined to be a congested SU when offered data load exceeds available capacity of SU3 due to buffer overflow. The amount of data packets with elastic traffic in the buffer of bottleneck SU v b at time t is denoted by φ b (t). Given time t, t ∈ [t0, T] and t < t, the amount of data packets φ b (t) of bottleneck SU v b within the time interval [t, t] satisfies the following iterative equation:

$$ {\varphi}_b\left({t}^{\prime}\right)=\max \left\{\min \left\{{\varphi}_b(t)+{\varphi}_b^{t\to {t}^{\prime }}\left({t}^{\prime}\right),{\kappa}_b\right\}-{\varLambda}_b^{t\to {t}^{\prime }}\left({t}^{\prime}\right),0\right\}, $$
(17)

where κ b is the buffer size of bottleneck SU v b , and \( {\varphi}_b^{t\to {t}^{\prime }}\left({t}^{\prime}\right) \) and \( {\varLambda}_b^{t\to {t}^{\prime }}\left({t}^{\prime}\right) \) are the amount of data packets accumulated in the buffer of bottleneck SU v b and the amount of data packets that could be delivered successfully to downstream SUs, respectively. Considering the constraint of the saturation value \( {\widehat{L}}_b \) of the buffer of bottleneck SU v b , we have the buffer constraint \( {\varphi}_b(t)\le {\widehat{L}}_b \) to guarantee that bottleneck SU v b will not become the real congested SU.

In the end-to-end congestion control, the congestion detection information is piggybacked over data packets to the destination and then sent to the source through the acknowledgement packet from the destination. However, to feed the congestion detection information back to the upstream SU2, SU6, and SU8, bottleneck SU3 will generate a backpressure signal to notify that the congestion occurs as shown in Fig. 2. Compared with the end-to-end mechanism, the backpressure signal is directly sent back to the corresponding upstream SUs from the bottleneck SU3 via single hop. Instead of passing the congestion detection information sent to the source in an end-to-end approach, the idea of hop-by-hop congestion control policy in this paper is that upstream SU v i directly adjusts the data rate r i (t) according to the backpressure signal of bottleneck SU v b when \( {\varphi}_b(t)>{\widehat{L}}_b \). Based on the radio transceiver equipped by each SU, the backpressure signal is assumed to be transmitted through a common control channel.

Given the received SINR of link l(i, b), we proceed to derive the required bandwidth of upstream SU v i for transmissions of data packets with elastic traffic. According to Shannon’s capacity formula, the required bandwidth of upstream SU v i can be expressed by r i (t)/log2(1 + χ ⋅ SINR(i, b)(P)). We further assume that bottleneck SU v b acts as a bidder and pays for upstream SU v i to accommodate the consumption of its network resources while regulating the data rate r i (t). We use x(t) to represent the congestion bid that bottleneck SU v b is willing to pay. We then characterize a bandwidth-to-bid ratio aiming to describe the efficiency of bid with regard to the required bandwidth, i.e., [r i (t)/log2(1 + χ ⋅ SINR(i, b)(P))]/x(t). Hence, the cost function C i (t) of upstream SU v i at time t can be defined as:

$$ {C}_i(t)={r}_i(t)\times \frac{r_i(t)/{\log}_2\left(1+\chi \cdot {\mathrm{SINR}}_{\left(i,b\right)}\left(\mathbf{P}\right)\right)}{x(t)}\times {D}_i. $$
(18)

We also use the bandwidth-to-bid ratio to figuratively express the shadow price i which is a function of the data rate of upstream SU v i . Note that the shadow price i depends on an optimal per-link power allocation in the PLPC problem due to the constraint of the received SINR of link l(i, b) in bandwidth-to-bid ratio. In this way, we turn our attention to the cross-layer coordination mechanism between capacity supply by power controller and rate demand by rate controller based on shadow prices i and i as shown in Fig. 3. By taking into account the stability of the contact between a pair of SUs, we conclude that A(i, b)x(t) is the accumulated revenue obtained by upstream SU v i that bottleneck SU v b needs to pay. We also remark that the congestion bid x(t) is a dynamic variable influenced by rate r i (t) as well as by instant level of x(t) within time interval [t0, T]. Accordingly, the congestion bid x(t) can be formulated as a linear differential equation:

$$ \left\{\begin{array}{l}\frac{\mathrm{d}x(t)}{\mathrm{d}t}=x(t)-\sum \limits_{i\in \mathcal{N}}\upsilon {r}_i(t)\\ {}x\left({t}_0\right)={x}_{t_0}\end{array}\right., $$
(19)

where \( {x}_{t_0} \) is an initial congestion bid that bottleneck SU v b needs to pay at time t0 and υ is an average bid per rate, which is assumed to be a unit value for all the upstream SUs, i.e., υ = 1. Formally, based on both revenue and cost functions as stated before, the utility function \( {U}_2^i(t) \) of upstream SU v i at time t can be expressed as:

$$ {U}_2^i(t)={A}_{\left(i,b\right)}x(t)-{C}_i(t). $$
(20)

Noticing that utility function \( {U}_2^i(t) \) is also a continuously differentiable function of r i (t) and x(t). Our optimization objective is to maximize utility function \( {U}_2^i(t) \) by choosing optimal data rate \( {r}_i^{\#}(t) \) of upstream SU v i while satisfying the buffer constraint at the same time:

$$ \underset{r_i(t)}{\operatorname{Maximize}:}\kern1em {\int}_{t_0}^T\left({A}_{\left(i,b\right)}x(t)-{C}_i(t)\right){e}^{-\tau \left(t-{t}_0\right)}\mathrm{d}t, $$
(21)

where τ is a discount factor, for 0 < τ < 1. Similarly, \( {U}_2^i(t) \) will also be discounted by the factor \( {e}^{-\tau \left(t-{t}_0\right)} \). Correspondingly, the HHCC problem in the transport layer can be also defined as a differential game model Γ HHCC :

$$ {\varGamma}_{HHCC}=\left\{\mathcal{N},{\left\{{r}_i(t)\right\}}_{i\in \mathcal{N}},x(t),{\left\{{U}_2^i(t)\right\}}_{i\in \mathcal{N}}\right\}, $$
(22)

where

  • Player set \( \mathcal{N} \): \( \mathcal{N}=\left\{1,2,\cdots, N\right\} \) is the set of all the upstream SUs in the HHCC problem as rate controllers playing the game. Upstream SU v i is also known as the ith player which is a rational policy maker and act throughout time interval [t0, T].

  • Set of strategies \( {\left\{{r}_i(t)\right\}}_{i\in \mathcal{N}} \): The strategy of the ith player corresponds to its instant data rate r i (t).

  • State variable x(t): The state variable of the ith player refers to the congestion bid that bottleneck SU v b is willing to pay.

  • Set of utility functions \( {\left\{{U}_2^i(t)\right\}}_{i\in \mathcal{N}} \): \( {U}_2^i(t) \) is the utility function of the ith player. The objective of the ith player is to maximize its utility function by rationally choosing optimal strategy \( {r}_i^{\#}(t) \) and optimal state x#(t).

4 Optimal solution and distributed implementation

Conventionally, upstream SUs as players of the game are expected to act cooperatively and maximize their joint utility functions with fairness for players by constituting the collaborative coalition. As a result, the global optimization of transmit power and data rate will be attained through cooperation among players with group rationality, which has been recently reported in a cooperative bargaining game [41]. However, each upstream SU is unwilling to jointly adjust the power and rate because of the selfish behavior in forwarding data packets. This is a natural idea due to the fact that the transmissions lead to the consumption of network resources of upstream SUs, such as energy and spectrum. Therefore, the cross-layer optimization framework for congestion and power control will be restricted to noncooperation scenario. In the noncooperative differential game models Γ PLPC and Γ HHCC , the ith player competes to maximize the present value of its utility function derived over time interval [t0, T]. For mathematical tractability, we define the starting time of the differential game models Γ PLPC and Γ HHCC as t0 = 0 hereinafter, but the results can be easily extended to more general cases.

4.1 Optimal solution to Γ PLPC

For the noncooperation scenario, we formulate a dynamic optimization problem ℙ1 to derive the optimal solution to the noncooperative differential game model Γ PLPC by taking into account the utility function maximization problem coupled with the linear differential equation constraint in (13):

$$ {\displaystyle \begin{array}{l}\mathrm{\mathbb{P}}1\kern0.5em \underset{p_i(t)}{\operatorname{Maximize}}:\kern1em {\int}_0^T\left(\frac{{\overline{p}}_i-{p}_i(t)}{C_{\left(i,b\right)}^{\ast}\left(\overline{\mathbf{P}}\right)}\left({\overline{p}}_i-{p}_i(t)\right)-\omega I(t)\right){e}^{- at}\mathrm{d}t\\ {}\kern2.00em \mathrm{Subject}\kern0.5em \mathrm{to}:\frac{\mathrm{d}I(t)}{\mathrm{d}t}=\sum \limits_{i\in \mathcal{N}}{p}_i(t)-\gamma I(t),\\ {}\kern13em I\left({t}_0=0\right)={I}_0.\end{array}} $$
(23)

We aim at deriving an optimal solution to ℙ1 by employing the theory of dynamic programming developed by Bellman [42]. Remark that the optimal solution is also viewed as a Nash equilibrium solution to ℙ1 if all the players play noncooperatively. Here, we relax the terminal time of Γ PLPC to explore when T approaches ∞ (i.e., T → ∞) as an infinite time horizon. It is more realistic to obtain the long-term optimal power allocation for upstream SUs due to spectrum underlay strategy with cellular primary network. We use \( {p}_i^{\#}(t) \) to represent the optimal solution to ℙ1 and assume that there exists a continuously differentiable function Vi(p i , I) satisfying the following partial differential equation:

$$ \kern1.5em {aV}^i\left({p}_i,I\right)=\underset{p_i(t)}{\operatorname{Maximize}}:\left\{\frac{{\left({\overline{p}}_i-{p}_i(t)\right)}^2}{C_{\left(i,b\right)}^{\ast}\left(\overline{\mathbf{P}}\right)}-\omega I(t)+\frac{\partial {V}^i\left({p}_i,I\right)}{\partial I}\left(\sum \limits_{j\in \mathcal{N}\backslash i}{p}_j^{\#}(t)+{p}_i(t)-\gamma I(t)\right)\right\}. $$
(24)

Theorem 1

A vector of optimal transmit power \( {\mathbf{P}}^{\#}=\left\{{p}_1^{\#}(t),{p}_2^{\#}(t),\cdots, {p}_N^{\#}(t)\right\} \) of upstream SUs constitutes a Nash equilibrium solution to ℙ1 if and only if the optimal transmit power \( {p}_i^{\#}(t) \) of the ith player and the continuously differentiable function Vi(p i , I) can be formulated as follows:

$$ {p}_i^{\#}(t)={\overline{p}}_i-\frac{\omega {C}_{\left(i,b\right)}^{\ast}\left(\overline{\mathbf{P}}\right)}{2\left(a+\gamma \right)}, $$
(25)
$$ {V}^i\left({p}_i^{\#},I\right)=\frac{\omega }{a\left(a+\gamma \right)}\left(\frac{\omega {C}_{\left(i,b\right)}^{\ast}\left(\overline{\mathbf{P}}\right)\left(1+2N\right)}{4\left(a+\gamma \right)}- aI-\sum \limits_{i\in \mathcal{N}}{\overline{p}}_i\right). $$
(26)

Proof: See Appendix 1. ■.

From Theorem 1, we can observe that the existence and uniqueness of the Nash equilibrium point to ℙ1 are guaranteed under the constraint of analytical solution in (25) and (26). It is also revealed that the optimal transmit power \( {p}_i^{\#}(t) \) has been characterized by a fixed and unique value in (25). Evidently, Theorem 1 mathematically ensures the convergence of \( {p}_i^{\#}(t) \) to a Nash equilibrium point. The key point to derive the optimal solution to the differential game model Γ PLPC is illustrated with a block diagram shown in Fig. 4a.

Fig. 4
figure 4

The block diagram of the dynamic optimization problems ℙ1 and ℙ2 coupled with their optimal solutions: a optimal solution to Γ PLPC and b optimal solution to Γ HHCC

figure b

Proposition 1

For the given large-scale slow-fading channel model, by letting G1 = ϖPBS/(106g0)N, the optimal transmit power \( {p}_i^{\#}(t) \) of the ith player should follow the interference power constraint:

$$ \prod \limits_{i\in \mathcal{N}}{p}_i^{\#}(t)\le {G}_1\prod \limits_{i\in \mathcal{N}}{\left\Vert {\vartheta}_i(t)-{\vartheta}_b(t)\right\Vert}^4. $$
(27)

Proof: See Appendix 2. ■.

Note that the optimal transmit power \( {p}_i^{\#}(t) \) of the ith player is fully constrained by the Euclidean distance between upstream SU v i and bottleneck SU v b under the given channel model. Substituting for \( {p}_i^{\#}(t) \) with its expression from Theorem 1 and taking into account the previous expression of shadow price i , we can easily rewrite i as:

$$ {\aleph}_i=\frac{\omega }{2\left(a+\gamma \right)}. $$
(28)

Apparently, shadow price i tends to be a constant value for all the upstream SUs. Although (25) and (26) offer an analytical solution to ℙ1, it still remains to design an algorithm to ensure fast convergence of the update of optimal transmit power. Therefore, we devise a distributed optimal transmit power update (OTPU) strategy given in Algorithm 2 to update the optimal transmit power vector P# for upstream SUs. Similar to [43], shadow price i in Algorithm 2 needs to carefully be chosen to ensure fast convergence of the update of instant transmit power \( {p}_i^{\odot }(t) \). It is also noted that the update of instant transmit power \( {p}_i^{\odot }(t) \) for upstream SU v i can be made locally according to its optimal transmit power \( {p}_i^{\#}(t) \) along with interference power constraint.

4.2 Optimal solution to Γ HHCC

For notational simplicity, we begin by defining a notation B(i, b) ≜  − D i /log2(1 + χ ⋅ SINR(i, b)(P)). For the noncooperation scenario, we formulate a dynamic optimization problem ℙ2 to derive the optimal solution to the noncooperative differential game model Γ HHCC by taking into account both the utility function maximization problem and the linear differential equation constraint in (19):

$$ {\displaystyle \begin{array}{l}\mathrm{\mathbb{P}}2\kern0.5em \underset{r_i(t)}{\operatorname{Maximize}}:\kern1em {\int}_0^T\left({A}_{\left(i,b\right)}x(t)+{B}_{\left(i,b\right)}\frac{r_i^2(t)}{x(t)}\right){e}^{-\tau t}\mathrm{d}t\\ {}\kern3.25em \mathrm{Subject}\kern0.5em \mathrm{to}:\frac{\mathrm{d}x(t)}{\mathrm{d}t}=x(t)-\sum \limits_{i\in \mathcal{N}}{r}_i(t),\\ {}\kern13em x\left({t}_0=0\right)={x}_0.\end{array}} $$
(29)

We turn to take advantage of the theory of maximum principle developed by Pontryagin [42] to derive an optimal solution or a Nash equilibrium solution to ℙ2. We further use \( {r}_i^{\#}(t) \) to represent the optimal solution to ℙ2 and assume that there exists a continuously differentiable function Wi(r i , x) satisfying the partial differential equation as follows:

$$ -\frac{\partial {W}^i\left({r}_i,x\right)}{\partial s}=\underset{r_i(t)}{\operatorname{Maximize}}:\left\{\left({A}_{\left(i,b\right)}x(t)+{B}_{\left(i,b\right)}\frac{r_i^2(t)}{x(t)}\right){e}^{-\tau t}+\frac{\partial {W}^i\left({r}_i,x\right)}{\partial x}\left(x(t)-\sum \limits_{j\in \mathcal{N}\backslash i}{r}_j^{\#}(t)-{r}_i(t)\right)\right\}. $$
(30)

For tractability, we introduce two extra introduced auxiliary variables Y i (t) and J i (t) to characterize Wi(r i , x). Specifically, we define Wi(r i , x) ≜ (Y i (t)x(t) + J i (t))eτt.

Theorem 2

A vector of optimal data rate \( {\mathbf{R}}^{\#}=\left\{{r}_1^{\#}(t),{r}_2^{\#}(t),\cdots, {r}_N^{\#}(t)\right\} \) of upstream SUs constitutes a Nash equilibrium solution to ℙ2 if and only if the optimal data rate \( {r}_i^{\#}(t) \) of the ith player can be expressed as:

$$ {r}_i^{\#}(t)=\frac{Y_i(t)x(t)}{2{B}_{\left(i,b\right)}}, $$
(31)

where Y i (t) and J i (t) satisfy the following differential equations:

$$ \frac{\mathrm{d}{Y}_i(t)}{\mathrm{d}t}=\frac{1}{4{B}_{\left(i,b\right)}}{Y}_i^2(t)+\left(\tau -1\right){Y}_i(t)-{A}_{\left(i,b\right)}+\frac{Y_i(t)}{2}\sum \limits_{j\in \mathcal{N}\backslash i}\frac{Y_j(t)}{B_{\left(j,b\right)}}, $$
(32)
$$ \frac{\mathrm{d}{J}_i(t)}{\mathrm{d}t}=\tau \cdot {J}_i(t). $$
(33)

Proof: See Appendix 3. ■.

For notational simplicity, we set \( {\varOmega}_{\left(i,b\right)}=1/\left(4{B}_{\left(i,b\right)}\right)+0.5{\sum}_{j\in \mathcal{N}\backslash i}1/{B}_{\left(j,b\right)} \) and \( \varepsilon =\sqrt{{\left(\tau -1\right)}^2+4{\varOmega}_{\left(i,b\right)}{A}_{\left(i,b\right)}} \), for 4Ω(i, b)A(i, b) + (τ − 1)2 > 0. We also denote G2 as a constant number. Substituting Ω(i, b) into ε, we can rewrite ε as follows:

$$ \varepsilon =\sqrt{{\left(\tau -1\right)}^2+\frac{A_{\left(i,b\right)}}{B_{\left(i,b\right)}}+2{A}_{\left(i,b\right)}\sum \limits_{j\in \mathcal{N}\backslash i}\frac{1}{B_{\left(j,b\right)}}}. $$
(34)

Proposition 2

The auxiliary variable Y i (t) in the Nash equilibrium solution \( {r}_i^{\#}(t) \) to ℙ2 can be further given as:

$$ {Y}_i(t)=\frac{\left(\left(\tau -1\right)+\varepsilon \right){e}^{\left(t-{G}_2\right)\varepsilon }+\varepsilon -\tau +1}{2{\varOmega}_{\left(i,b\right)}\left(1-{e}^{\left(t-{G}_2\right)\varepsilon}\right)}. $$
(35)

Proof: See Appendix 4. ■.

Combining Y i (t) in (35) and ε in (34) yields the expression for Y i (t) as:

$$ {Y}_i(t)=\frac{\left(\tau -1+\sqrt{{\left(\tau -1\right)}^2+\frac{A_{\left(i,b\right)}}{B_{\left(i,b\right)}}+2{A}_{\left(i,b\right)}\sum \limits_{j\in \mathcal{N}\backslash i}\frac{1}{B_{\left(j,b\right)}}}\right){e}^{\left(t-{G}_2\right)\sqrt{{\left(\tau -1\right)}^2+\frac{A_{\left(i,b\right)}}{B_{\left(i,b\right)}}+2{A}_{\left(i,b\right)}\sum \limits_{j\in \mathcal{N}\backslash i}\frac{1}{B_{\left(j,b\right)}}}}+\sqrt{{\left(\tau -1\right)}^2+\frac{A_{\left(i,b\right)}}{B_{\left(i,b\right)}}+2{A}_{\left(i,b\right)}\sum \limits_{j\in \mathcal{N}\backslash i}\frac{1}{B_{\left(j,b\right)}}}-\tau +1}{2\left(\frac{1}{4{B}_{\left(i,b\right)}}+0.5\sum \limits_{j\in \mathcal{N}\backslash i}\frac{1}{B_{\left(j,b\right)}}\right)\left(1-{e}^{\left(t-{G}_2\right)\sqrt{{\left(\tau -1\right)}^2+\frac{A_{\left(i,b\right)}}{B_{\left(i,b\right)}}+2{A}_{\left(i,b\right)}\sum \limits_{j\in \mathcal{N}\backslash i}\frac{1}{B_{\left(j,b\right)}}}}\right)}. $$
(36)

Substituting B(i, b) ≜  − D i /log2(1 + χ ⋅ SINR(i, b)(P)) into (36), we can rewrite Y i (t) as follows:

$$ {\displaystyle \begin{array}{l}{Y}_i(t)=\frac{\left(\tau -1+\sqrt{{\left(\tau -1\right)}^2-\frac{A_{\left(i,b\right)}{\log}_2\left(1+\chi \cdot {\mathrm{SINR}}_{\left(i,b\right)}\left(\mathbf{P}\right)\right)}{D_i}-2{A}_{\left(i,b\right)}\sum \limits_{j\in \mathcal{N}\backslash i}\frac{\log_2\left(1+\chi \cdot {\mathrm{SINR}}_{\left(j,b\right)}\left(\mathbf{P}\right)\right)}{D_j}}\right){e}^{\left(t-{G}_2\right)\sqrt{{\left(\tau -1\right)}^2-\frac{A_{\left(i,b\right)}{\log}_2\left(1+\chi \cdot {\mathrm{SINR}}_{\left(i,b\right)}\left(\mathbf{P}\right)\right)}{D_i}-2{A}_{\left(i,b\right)}\sum \limits_{j\in \mathcal{N}\backslash i}\frac{\log_2\left(1+\chi \cdot {\mathrm{SINR}}_{\left(j,b\right)}\left(\mathbf{P}\right)\right)}{D_j}}}}{2\left(\frac{\log_2\left(1+\chi \cdot {\mathrm{SINR}}_{\left(i,b\right)}\left(\mathbf{P}\right)\right)}{4{D}_i}+0.5\sum \limits_{j\in \mathcal{N}\backslash i}\frac{\log_2\left(1+\chi \cdot {\mathrm{SINR}}_{\left(j,b\right)}\left(\mathbf{P}\right)\right)}{D_j}\right)\left({e}^{\left(t-{G}_2\right)\sqrt{{\left(\tau -1\right)}^2-\frac{A_{\left(i,b\right)}{\log}_2\left(1+\chi \cdot {\mathrm{SINR}}_{\left(i,b\right)}\left(\mathbf{P}\right)\right)}{D_i}-2{A}_{\left(i,b\right)}\sum \limits_{j\in \mathcal{N}\backslash i}\frac{\log_2\left(1+\chi \cdot {\mathrm{SINR}}_{\left(j,b\right)}\left(\mathbf{P}\right)\right)}{D_j}}}-1\right)}.\\ {}\kern5em +\frac{\sqrt{{\left(\tau -1\right)}^2-\frac{A_{\left(i,b\right)}{\log}_2\left(1+\chi \cdot {\mathrm{SINR}}_{\left(i,b\right)}\left(\mathbf{P}\right)\right)}{D_i}-2{A}_{\left(i,b\right)}\sum \limits_{j\in \mathcal{N}\backslash i}\frac{\log_2\left(1+\chi \cdot {\mathrm{SINR}}_{\left(j,b\right)}\left(\mathbf{P}\right)\right)}{D_j}}-\tau +1}{2\left(\frac{\log_2\left(1+\chi \cdot {\mathrm{SINR}}_{\left(i,b\right)}\left(\mathbf{P}\right)\right)}{4{D}_i}+0.5\sum \limits_{j\in \mathcal{N}\backslash i}\frac{\log_2\left(1+\chi \cdot {\mathrm{SINR}}_{\left(j,b\right)}\left(\mathbf{P}\right)\right)}{D_j}\right)\left({e}^{\left(t-{G}_2\right)\sqrt{{\left(\tau -1\right)}^2-\frac{A_{\left(i,b\right)}{\log}_2\left(1+\chi \cdot {\mathrm{SINR}}_{\left(i,b\right)}\left(\mathbf{P}\right)\right)}{D_i}-2{A}_{\left(i,b\right)}\sum \limits_{j\in \mathcal{N}\backslash i}\frac{\log_2\left(1+\chi \cdot {\mathrm{SINR}}_{\left(j,b\right)}\left(\mathbf{P}\right)\right)}{D_j}}}-1\right)}\end{array}} $$
(37)

Let G3 be a constant number. Based on (31) and (29), the optimal data rate \( {r}_i^{\#}(t) \) and the optimal state variable x#(t) associated with \( {r}_i^{\#}(t) \) can be described as:

$$ {r}_i^{\#}(t)=\frac{Y_i(t)}{2{B}_{\left(i,b\right)}}\left({e}^{\left(1-0.5\sum \limits_{i=1}^N\frac{Y_i(t)}{B_{\left(i,b\right)}}\right)t}+{G}_3\right), $$
(38)
$$ {x}^{\#}(t)={e}^{\left(1-0.5\sum \limits_{i=1}^N\frac{Y_i(t)}{B_{\left(i,b\right)}}\right)t}+{G}_3. $$
(39)

From (38), we can see that the optimal data rate \( {r}_i^{\#}(t) \) is determined by both B(i, b) and auxiliary variable Y i (t) under the received SINR SINR(i, b)(P) of link l(i, b). Unfortunately, it is difficult to directly obtain the relationship between SINR(i, b)(P) and \( {r}_i^{\#}(t) \) through an analytical derivation because Y i (t) cannot be further simplified into a concise structure. Thereby, given the channel gain G(i, b) of link l(i, b) under the large-scale slow-fading channel model, we use the numerical simulations to validate the effectiveness of the optimal data rate \( {r}_i^{\#}(t) \) of upstream SU v i . We further remark that Theorem 2 and Proposition 2 characterize the existence of the Nash equilibrium point to ℙ2. It should be also admitted that the optimal data rate \( {r}_i^{\#}(t) \) has been formulated as a fixed and unique value in (38) by using auxiliary variable Y i (t) in (37). Correspondingly, Theorem 2 mathematically ensures the convergence of \( {r}_i^{\#}(t) \) to a Nash equilibrium point. The key point to derive the optimal solution to the differential game model Γ HHCC is also illustrated with a block diagram depicted in Fig. 4b. With the help of vectors R# and P#, shadow price i can be calculated as:

$$ {\aleph}_i=\left(\frac{1}{\log_2\left(1+\chi \cdot {\mathrm{SINR}}_{\left(i,b\right)}\left({\mathbf{P}}^{\#}\right)\right)}\right)\frac{r_i^{\#}(t)}{x^{\#}(t)}. $$
(40)

Proposition 3

For the given vector R# and vector P#, a strict lower bound of shadow price ℵ i can be approximately calculated as follows:

$$ {\aleph}_i\ge {\left({\log}_2\left(\chi \cdot \frac{p_i^{\#}(t)}{\sum \limits_{j\in \mathcal{N}\backslash i}{p}_j^{\#}(t)}\right)\right)}^{-1}\frac{r_i^{\#}(t)}{x^{\#}(t)}. $$
(41)

Proof: See Appendix 5. ■.

Let \( {\mathrm{r}}_{\left(i,b\right)}^{\mathrm{base}} \) represent the baseline rate along link l(i, b) from upstream SU v i to bottleneck SU v b . To guarantee the constraint of the saturation value of the buffer of bottleneck SU v b , we design a distributed algorithm to obtain the optimal data rate update (ODRU) for upstream SUs as summarized in Algorithm 3, where R = {r1(t), r2(t), ⋯, r N (t)} denotes the data rate vector of N upstream SUs at time t. A simple yet effective way to locally adjust the instant data rate \( {r}_i^{\odot }(t) \) of upstream SU v i is to employ the optimal data rate \( {r}_i^{\#}(t) \) under the condition of buffer constraint \( {\varphi}_b(t)\le {\widehat{L}}_b \). Also, baseline rate \( {\mathrm{r}}_{\left(i,b\right)}^{\mathrm{base}} \) in Algorithm 3 should be carefully chosen to ensure the effectiveness of instant data rate.

figure c

4.3 Distributed implementation

So far, we have devised Algorithms 2 and 3 to satisfy the interference power constraint along with buffer constraint by locally adjusting the optimal transmit power and the optimal data rate, respectively. In what follows, we would like to describe the distributed implementation strategy to realize the cross-layer optimization framework for congestion and power control by jointly optimizing PLPC-HHCC simultaneously. In conclusion, the cross-layer optimization scheme for joint PLPC-HHCC design is implemented in a distributed manner as follows:

4.3.1 Shadow price

Update shadow price i using (28) and shadow price i using (40), respectively.

4.3.2 Power controller in the physical layer

For each upstream SU, we initially assign optimal transmit power \( {p}_i^{\#}(t) \) using (25) to update instant transmit power \( {p}_i^{\odot }(t) \) at power controller. Due to the interference power constraint in (6) to protect PUs, \( {p}_i^{\odot }(t) \) should satisfy the following distributed power-update function when \( {\sum}_{i\in \mathcal{N}}{p}_i(t){G}_{\left(i,b\right)}>{\varpi}_{\mathrm{PBS}} \) via OTPU algorithm:

$$ {p}_i^{\odot }(t)=\left\{\begin{array}{l}\left|{p}_i^{\odot }(t)-{\aleph}_i\times {\overline{p}}_i\right|,\kern4.5em \mathrm{for}\kern1em 0<{\aleph}_i<1\\ {}\left|{p}_i^{\odot }(t)-{\left({\aleph}_i\right)}^{-1}\times {\overline{p}}_i\right|,\kern2em \mathrm{for}\kern1em {\aleph}_i\ge 1\end{array}\right.. $$
(42)

4.3.3 Rate controller in the transport layer

For each upstream SU, we also initially assign optimal data rate \( {r}_i^{\#}(t) \) using (38) to update instant data rate \( {r}_i^{\odot }(t) \) at rate controller. Owing to the buffer constraint \( {\varphi}_b(t)\le {\widehat{L}}_b \) to guarantee that bottleneck SU v b will not become congested, \( {r}_i^{\odot }(t) \) should be subject to the following distributed rate update function when \( {\varphi}_b(t)>{\widehat{L}}_b \) via ODRU algorithm:

$$ {r}_i^{\odot }(t)\leftarrow {r}_i^{\odot }(t)-{\aleph}_i\times {\mathrm{r}}_{\left(i,b\right)}^{\mathrm{base}}. $$
(43)

4.3.4 Cross-layer coordination mechanism

With the aid of the updated power \( {p}_i^{\odot }(t) \), the link capacity supply C(i, b)(P) with respect to each upstream SU is regulated by power controller by using (4) as shown in Fig. 3. The rate demand depends on instant data rate \( {r}_i^{\odot }(t) \) regulated by rate controller, which is nonlinear function of instant transmit power vector P according to (37) and (38).

5 Simulation results

5.1 Simulation settings

The simulation scenario is shown in Fig. 5, which consists of one bottleneck SU and N = 6 randomly distributed upstream SUs transmitting data packets with elastic traffic towards bottleneck SU within a range of 100 m × 100 m. The scenario is easily extendable to a general case which involves much more randomly distributed upstream SUs. Our simulations pay more attention to evaluate the effect of the cross-layer optimization framework for congestion and power control on both optimal data rate and optimal transmit power of six different per-link flows. Different from the channel with carrier frequency of 890.4 MHz used by SUs, we assume that active PUs in simulation scenario occupy other uplink channels from set ℋ.This can make possible the successful transmissions of data packets from upstream SUs to bottleneck SU. The probability δ i of uplink channels used by each SU is assumed to be 0.65. The channel gain of link l(i, b) is defined with large-scale slow-fading model, given by G(i, b) = 100g0ϑ i (t) − ϑ b (t)‖−4 [34], where the reference channel gain g0 is set to 9.7 × 10−4 [44]. We adopt a processing gain factor χ =  − 1.5/log2(5BER) where the target bit error rate is BER = 10−3 for multiple quadrature amplitude modulation with a symbol period of Τ s  = 52.5 μs. The thermal noise power at bottleneck SU and interference caused by PBS are assumed to be n0 =  − 50 dBm and I P  = 10 dBm, respectively. In addition, the receiving reference power at bottleneck SU is chosen as \( {p}_{\left(i,b\right)}^{ref}=-37\kern0.5em \mathrm{dBm} \) for each upstream SU. With regard to the elastic traffic modeled by Poisson process [32], the flow arrival intensity is set to a normalized value λ i  = 125 bps for each upstream SU, and the mean of flow size is given as \( \mathbb{E}\left[{\lambda}_i\right]=2\kern0.5em \mathrm{Mbits} \). Under our differential game models Γ PLPC and Γ HHCC , we choose the pricing factor ω = 22, the penalty factor γ = 0.7, two constant numbers G2 = 5.5 and G3 = 360.

Fig. 5
figure 5

Simulation scenario: N = 6 upstream SUs transmitting data packets with elastic traffic towards bottleneck SU within a range of 100 m × 100 m. The dotted lines correspond to the successful transmission links from upstream SUs to bottleneck SU

Due to the lack of empirical data about available contact or all encounters between a pair of SUs, to evaluate the uncertainty of contact \( {\mathcal{C}}^{\left(i,b\right)} \) along link l(i, b), we assume that the minimum and maximum values of encounter duration for all encounters within contact \( {\mathcal{C}}^{\left(i,b\right)} \) is offered in Fig. 6. Note that Flow i in Fig. 6 corresponds to contact \( {\mathcal{C}}^{\left(i,b\right)} \) along link l(i, b), for i = 1, 2, ⋯, 6. We set the number of subintervals in Algorithm 1 to M = 8 for all upstream SUs in that the number of encounters K seems to be a lower value because of the short time interval in the simulations. In fact, game time or time interval is just set to [0, 5] s in the following simulations. Under this setting, the probability distribution ϒ(i, b) of contact \( {\mathcal{C}}^{\left(i,b\right)} \) derived from Algorithm 1 is assumed to comply with the contact distribution as provided by Fig. 7.

Fig. 6
figure 6

Comparison between minimum value and maximum value of encounter duration among different per-link flows

Fig. 7
figure 7

Probability distribution of contact between upstream SU and bottleneck SU under different per-link flows when the number of subintervals M = 8

The proposed OTPU algorithm for the PLPC problem is compared with the existing classical distributed constrained power control (DCPC) algorithm in [45]. The DCPC algorithm is a SINR-constrained power control algorithm which distributively and iteratively searches for transmit power updated from the ς th iteration to the (ς + 1) th iteration. Let \( {\mathrm{SINR}}_i^{tar} \) denote the target SINR for upstream SU v i to maintain a certain QoS requirement. In the simulations, the target SINR can be set as \( {\mathrm{SINR}}_i^{tar}=8\kern0.5em dB \). Therefore, the iterative function of transmit power update in the DCPC algorithm with number of iteration ς = 0, 1, 2, ⋯ is specifically given as [45]:

$$ {p}_i^{\left(\varsigma +1\right)}=\min \left\{{\overline{p}}_i,\frac{{\mathrm{SINR}}_i^{tar}}{{\mathrm{SINR}}_{\left(i,b\right)}^{\left(\varsigma \right)}\left(\mathbf{P}\right)}\cdot {p}_i^{\left(\varsigma \right)}\right\}. $$
(44)

6 Results

6.1 Performance of OTPU algorithm

Figure 8 shows the optimal transmit power comparison for six per-link flows between the OTPU algorithm with the evolution of discount factor a and the DCPC algorithm with ς = 300 iterations. This figure clearly depicts that an increased discount factor from 0.1 to 0.9 will increase the optimal transmit power of each flow under the OTPU algorithm. Apparently, this is a direct consequence of discount factor a on the optimal transmit power according to (25). However, it is observed that the optimal transmit power of each flow via the DCPC algorithm presents a fixed constant value. This is due to the fact that the optimal transmit power of each flow via the DCPC algorithm converges to an expected equilibrium point after 300 iterations. It is worth mentioning that Theorem 1 mathematically makes the optimal transmit power of each upstream SU converge to a Nash equilibrium point distributively. From the results, we can also see that the optimal transmit power of each flow by the OTPU algorithm is obviously lower than that of the DCPC algorithm. This can be explained by the fact that DCPC algorithm gives rise to more power consumption for maintaining a certain SINR for each upstream SU. However, the optimal transmit power of each flow based on the OTPU algorithm mainly depends upon the maximum transmit power threshold of upstream SU. On the other hand, the instant power level can be further reduced via the change of discount factor a.

Fig. 8
figure 8

Optimal transmit power comparison between our proposed OTPU algorithm and DCPC algorithm with 300 iterations under different per-link flows: a Flows 1 and 5, b flows 2 and 4, and c flows 3 and 6

6.2 The impact of discount factor on optimal transmit power

Figure 9 illustrates the optimal transmit power comparison for six per-link flows via the OTPU algorithm under different discount factors. It is noted that the total interference caused by six upstream SUs satisfies the interference temperature limit ϖPBS =  − 10 dBm according to the constraint of (6). As the discount factor increases, the optimal transmit power of six flows obtained by the OTPU algorithm will raise as well. As expected, the optimal transmit power of flow 6 can achieve the minimum transmit power level with approximately 50 mW, and the optimal transmit power of flow 2 can obtain higher transmit power level with the maximum value nearly 570 mW. However, the increasing rate of the optimal transmit power in regard to flows 6, 3, and 1 flattens out after discount factor a = 0.6. The reason is as follows. Firstly, based on (25), the discount factor effect is in direct proportion to the optimal transmit power. One the other hand, with even higher Euclidean distance between upstream SU and bottleneck SU, the maximum transmit power threshold will increase as well according to (11). Under the simulation scenario, the higher Euclidean distance of the successful transmission link of flow 2 leads to a higher transmit power level accordingly.

Fig. 9
figure 9

Optimal transmit power comparison among six per-link flows through our proposed OTPU algorithm under interference temperature limit ϖPBS =  − 10 dbm

6.3 Optimal data rate performance of ODRU algorithm

Figure 10 exhibits the evolution of the optimal data rate for six per-link flows obtained by the ODRU algorithm versus game time t ∈ [0, 5]s under the condition of discount factor τ = 0.2 and saturation value \( {\widehat{L}}_b=1\kern0.5em Mbps \). From the results, we can see the optimal data rate for six flows gradually increase with the growth of game time t. Meanwhile, the optimal data rate levels of six flows are very close from 0 to 4 s. When game time t is more than 4 s, the gaps among the optimal data rate levels will be enlarged. This demonstrates that the optimal data rate has large values during the game time of the end interval of the game. Under discount factor τ = 0.2, the optimal data rate value of flow 2 is much larger than those of other flows with maximum value of 300 kbps, and the optimal rate of flow 6 has the lowest level within 50 kbps. It can also be observed that the optimal data rate of flow 2 yields significant performance gains than other flows under the condition of the fixed discount factor. According to saturation value \( {\widehat{L}}_b=1\kern0.5em Mbps \), we can observe that the total data rate generated by six upstream SUs is subject to the buffer constraint \( {\varphi}_b(t)\le {\widehat{L}}_b \) such that the instant data rate levels should not be adjusted through the ODRU algorithm.

Fig. 10
figure 10

Optimal data rate comparison among six per-link flows through our proposed ODRU algorithm under saturation value \( {\widehat{L}}_b=1\kern0.5em \mathrm{Mbps} \)

Figure 11 depicts the optimal data rate update comparison for six per-link flows with the aid of the ODRU algorithm on the condition of discount factor τ = 0.2 and saturation value \( {\widehat{L}}_b=490\kern0.5em kbps \). It is implicitly revealed that the total data rate caused by six upstream SUs fail to guarantee the buffer constraint \( {\varphi}_b(t)\le {\widehat{L}}_b \) such that the instant data rate levels must be updated according to the ODRU algorithm. Hence, the evolution of the optimal data rate levels of six upstream SUs will enter the rate update zone (i.e., shadow area in Fig. 11) when the constraint \( {\varphi}_b(t)>{\widehat{L}}_b \). From the results, we can see that the large values of the optimal data rate have been considerably dwindled according to the distributed rate update function in (41) when instant game time t = 4.3 s in order to meet the buffer constraint of bottleneck SU.

Fig. 11
figure 11

Optimal data rate comparison among six per-link flows through our proposed ODRU algorithm under saturation value \( {\widehat{L}}_b=490\kern0.5em \mathrm{kbps} \)

6.4 The impact of discount factor on optimal data rate

Figure 12 displays the evolution of the optimal data rate for six per-link flows via the ODRU algorithm versus discount factor τ under the condition of two fixed instant game time point t (i.e., t = 3 and t = 4) and saturation value \( {\widehat{L}}_b=950\kern0.5em \mathrm{kbps} \). From the results, we can see the total data rate generated by six upstream SUs accommodates for the buffer constraint \( {\varphi}_b(t)\le {\widehat{L}}_b \). It can be also observed that as the discount factor increases from 0.1 to 0.9, the optimal data rate of six flows obtained by the ODRU algorithm will decrease accordingly. The reason for this is that the utility function of each upstream SU must be discounted by the factor eτt at time t under the differential game structure Γ HHCC . As we expected, the optimal data rate of flow 6 can obtain the minimum rate level within approximately interval [6, 26] kbps, and the optimal data rate of flow 2 can gain the maximum value of data rate with nearly interval [30, 140] kbps. This can be explained by the fact that the higher Euclidean distance of the successful transmission link of flow 2 will result in a higher transmit power level accordingly. This result of higher transmit power level of flow 2 will lead to the more link capacity supply. It implies that the upstream SU has the enough link capacity supply to achieve higher data rate in the proposed cross-layer optimization framework. Essentially, this signifies the importance of cross-layer coordination mechanism on the coupling between rate demand regulated by rate controller and capacity supply regulated by power controller.

Fig. 12
figure 12

Optimal data rate comparison among six per-link flows through our proposed ODRU algorithm for different discount factors under saturation value \( {\widehat{L}}_b=950\kern0.5em \mathrm{kbps} \)

7 Conclusions

In this paper, a distributed cross-layer optimization framework for congestion and power control for CRANETs under predictable contact has been proposed. Particularly, we introduced a predictable contact model by achieving the probability distribution of contact between any pair of SUs, aiming to measure the uncertainty of contact. Also, an entropy paradigm was presented to characterize quantitatively the probability distribution of contact. We employed a differential game theoretic approach to formulate the PLPC problem and the HHCC problem, and obtained the optimal transmit power and the optimal data rate of upstream SUs via dynamic programming and maximum principle. To guarantee the interference power constraint for active PUs and the buffer constraint of bottleneck SU, we developed two distributed update algorithms to locally adjust optimal transmit power and optimal data rate of upstream SUs. Finally, we presented a distributed implementation strategy to construct the cross-layer optimization framework for congestion and power control by jointly optimizing PLPC-HHCC simultaneously and validated its performance with simulations. What we have discussed in this paper is the portion of foundation for the cross-layer optimization framework in CRANETs. In the future work, a joint objective function to achieve congestion and power control will be considered. Moreover, it will be interesting and important to investigate a trade-off parameter as a whole to reflect the benefits of the proposed framework.