1 Introduction

The radio spectrum available for wireless communication is extremely scarce due to the widely deployed wireless devices and services. On the other hand, recent studies and measurements have shown that most of the allocated bands are used inefficiently or underutilized [13]. In order to address the aforementioned challenges, a cognitive radio (CR) technology has recently been proposed for the purpose of dramatically improving spectrum utilization and supporting more new services [4]. In cognitive radio networks (CRNs), the licensed spectrum can be shared with secondary users (SUs), provided that they do not cause harmful interference to primary users (PUs). In order to support this spectrum reuse functionality, SUs are permitted to transmit once they detect a spectrum hole [5, 6]. Such schemes usually work when the spectrum is severely underutilized, or otherwise, SUs might not have sufficient opportunities to get channel access. Once PUs are found to be active, SUs must vacate the channels. Therefore, the secondary throughput would be significantly constrained, and the secondary system would suffer from a long latency.

As a spectral-efficient technology, multiple-input multiple-output (MIMO) is capable of providing extra spatial dimensions for signal transmission. MIMO technology can be employed by CRNs for the purpose of reducing interference at the PU and satisfying the demand of high data rate at the SU through carefully designing transmit/receive beamforming [7, 8]. As a result, SUs may access the licensed spectrum without causing harmful interference at the PUs, even if the PUs are also using the same spectrum at the same time. However, an additional spatial resource inherent in MIMO systems becomes a challenging task in the design of efficient spectrum-sharing, although this technology offers several advantages to enhance the system performances. Furthermore, CR spectrum-sharing imposes several new challenging issues on MIMO systems [9]. First, the idea of spectrum-sharing allows simultaneous transmissions of PUs and SUs, provided that the quality of service (QoS) of PUs is guaranteed. Secondly, the primary systems cannot deliberately provide their channel estimation to the secondary systems [10]. The aforementioned challenges may impose difficulty on pre-interference cancelation at the SU’s transmitter side.

CR MIMO broadcast channels (CR MIMO-BCs) have become a topic of increasing research interest for CRNs in recent years [1113], with secondary systems coexisting with the primary systems. Unlike conventional MIMO-BCs, in CR MIMO-BCs, there exist interference between the PU link and the SU link as well as the multiple SUs’ interference. Furthermore, in order to protect the primary transmission, the total-power constraint and the individual-interference-power constraint for each primary receiver are considered. Since beamforming and power control techniques play an important role in interference suppression and power constraint in CR MIMO-BCs, a joint beamforming and power control scheme over CR MIMO-BCs was proposed to minimize the transmit power while satisfying signal-to-interference-plus-noise ratio (SINR) targets for the SUs and maintaining an acceptable interference level to the PUs [14]. The development of the scheme relies crucially on the BC multiple-access channel (BC-MAC) duality result [15, 16], which is only valid for the problem with a single-sum-power constraint. In [17], the problem of joint transmit beamforming and power control was considered in CR MIMO-BCs, with the cognitive multi-antenna base station (BS) being assumed to satisfy the QoS constraints of the served SUs while protecting one primary receiver from interference. It is also assumed that the number of single-antenna users is less than that deployed at the BS. Consequently, SUs may access the primary spectrum without causing harmful interference to the PUs.

For the case of non-zero-interference-power constraint in CR MIMO-BCs, both the SUs’ SNR constraints and the PUs’ interference power constraints usually result in quadratically constrained quadratic programming problems, which may not be directly solved by convex tools, especially when there is a rank constraint. Semidefinite programming (SDP) relaxation can be used to convert the aforementioned problem into a convex optimization problem by dropping the rank constraint, consequently generating a local optimum [18]. It is shown in [19] that under certain conditions, a new solution can be generated from the one obtained by SDP relaxation without ruining the constraints or changing the objection function. Actually, most of the resulting problems of joint beamforming and power control are inherently non-convex, and consequently, no global optimality of an efficient solution can be guaranteed theoretically [20]. Nonetheless, in [20], sufficient conditions were presented to constrain some design parameters, making the joint beamforming and power control problem become convex. Furthermore, in [21], a semi-distributed algorithm was proposed to obtain a local optimal solution to this problem. However, in general scenarios, the obtained local optimum may not be feasible for the original problem because its rank usually does not meet the solution’s requirement. As a result, approximation approaches may be used to generate a feasible solution [2226].

In this paper, we model the problem of joint power control and beamforming in CR MIMO-BCs as a noncooperative game. Note that the noncooperative game theory for economists has been extensively investigated in terms of how rational players do not cooperate and interact in order to reach their goals. Lately, noncooperative game theory has also been widely applied to communications system. For example, in [27], joint Code Division Multiple Access (CDMA) codeword and power adaptation was formulated as a separable game in MIMO CR networks, with two corresponding subgames, i.e., the power control subgame and the codeword control subgame, being played. Furthermore, we also formulate separable games1 to solve the problem of joint power control and beamforming. Based on the proposed noncooperative separate game model, separable cost functions for beamforming and power control parameters are also derived, showing that these cost functions are convex. Furthermore, we use the convex theory of the noncooperative game to investigate the best response strategies as well as existence of the Nash equilibrium solutions. The best response strategies of users are then obtained by constrained minimization of the user cost function subject to constraints on user SINR and beamforming vector. The corresponding algorithm is derived using a game-theoretic approach in which separable cost functions with respect to beamforming and power are defined, such that joint beamforming and power adaptation is formulated as a separable game.

The rest of this paper is organized as follows: In Section 2, we describe the system model and problem statement. In Section 3, we present joint beamforming and power control as a noncooperative game. Section 4 provides numerical simulation results and discussions. Finally, conclusions are drawn in Section 5.

Notations: Scalar is denoted by a lowercase letter. Vector and matrix are denoted by a boldface lowercase letter and a boldface uppercase letter, respectively. I p denotes the p×p identity matrix. ∥·∥ represents the Euclidean norm of a vector. sgn[·] denotes sign function. For a matrix S, tr(S), rank(S), S H, and S T denote its trace, rank, Hermitian matrix, and transpose matrix, respectively. diag(s 1,s 2,⋯,s n ) denotes a diagonal matrix with diagonal elements given by s 1,s 2,⋯,s n . For a matrix S, S≥0 denotes the S is positively semidefined. {·} denotes the subset.

2 System model and problem statement

In this paper, we present an alternative approach to dealing with a variable number of active users and/or QoS requirements in a CDMA system. More specifically, we consider the system model of CR MIMO-BCs, as shown in Fig. 1, where the cognitive base station (CBS) and K SUs share the spectrum with N PUs, with the N t -antenna CBS sending independent information signals to K-different SUs. Without loss of generality, each SU is assumed to have N r (N r N t ) antennas. Without loss of generality, both the primary base station (PBS) and PUs are assumed to be equipped with a single antenna. We assume that there is only one PU (N=1) and that it is enough to demonstrate the key aspects of spectrum-sharing while avoiding unnecessary complications [28, 29], while it is pertinent to investigate the general scenario where multiple PU and SU links coexist [30].

Fig. 1
figure 1

The system model for CR MIMO-BCs. The CBS has N t antennas, and each SU is equipped with N r antennas. The PBS and PU are equipped with a single antenna, respectively

In CR MIMO-BCs, due to the coupled structure of the transmitted signals, the BC optimization problems are usually non-convex and thus cannot be solved directly. To tackle this difficulty, the non-convex BC problem is transformed into a convex MAC problem via a so-called BC-MAC duality relationship [31]. Under a single sum-power constraint or a set of a linear power constraint, the problem of BC can be solved as a minimax optimization problem in its dual MAC setting [32]. The block diagram of BC-MAC duality relationship for the transmission process is shown in Fig. 2.

Fig. 2
figure 2

Block diagram of BC-MAC duality relationship for the transmission process

The signal \({\mathbf {x}}\!\in \!{\mathbb {C}}^{M \!\times \!1}\) transmitted by the CBS is given by [33]

$$ {\mathbf{x}} = \sum\limits_{k = 1}^{K} {{s_{k}}\sqrt {{p_{k}}} {{\mathbf{u}}_{k}}} = {\mathbf{UPs}}, $$
(1)

where s=[s 1,s 2,⋯s K ]T, with s k denoting the data stream for the kth SU (SUk) and satisfying E[|s k |2]=1, \({\mathbf {P}} = \text {diag}\{ \sqrt {{p_{1}}},\sqrt {{p_{2}}}, \cdots \sqrt {{p_{K}}} \}\), with p k denoting the transmitted power allocated to SUk, \({\mathbf {U}} = [{{\mathbf {u}}_{1}},{{\mathbf {u}}_{2}}, \cdots {{\mathbf {u}}_{K}}]\in {\mathbb {C}}^{N_{t} \times K}\) is the downlink beamformer matrix, with u k standing for the beamformer vector of the SUk with ∥u k 2=1. Furthermore, we assume that a perfect CSI of primary and secondary links is available at the CBS. The received signal \({{\mathbf {r}}_{k}}\in {\mathbb {C}}^{N_{r} \times 1}\) at the SUk is given by

$$ \begin{aligned} {{\mathbf{r}}_{k}} &= {{\mathbf{H}}_{k}}{\mathbf{x}} + {{\mathbf{g}}_{k}}{s_{0}}\sqrt {{p_{p}}} + {{\mathbf{n}}_{k}}\\ &= \underbrace {{{\mathbf{H}}_{k}}{s_{k}}\sqrt {{p_{k}}} {{\mathbf{u}}_{k}}}_{{\mathrm{desired ~signal}}} +\underbrace {{{\mathbf{H}}_{k}}\left({\sum\limits_{l = 1,l \ne k}^{K} {{s_{l}}\sqrt {{p_{l}}} {{\mathbf{u}}_{l}}}} \right) + {s_{0}}\sqrt {{p_{p}}} {{\mathbf{g}}_{k}} + {{\mathbf{n}}_{k}}}_{{\mathrm{interference + noise}}}, \end{aligned} $$
(2)

where \({\mathbf {H_{k}}}\in {\mathbb {C}}^{N_{r} \times N_{t}}\) denotes the channel between the CBS and SUk. The vector \({\mathbf {n_{k}}}\in {\mathbb {C}}^{N_{r} \times 1}\) is the additive white Gaussian noise (AWGN), where n k CN(0,σ 2 I). Parameters p p and s 0 denote the PBS transmission power and transmission data stream, respectively. Letting g k represent the channel power gain vector from PBS to the SUk, the right-hand side of (2) can be decomposed into three terms: the first term is the desired signal from CBS to the SUk, whereas the second and third terms denote the interference plus noise from the PBS and noise, respectively. Similarly, the receive signal at the PU from PBS to PU can be written as

$$ \begin{aligned} {r_{p}} &= \sqrt {{p_{p}}} {g_{p}}{s_{0}} + {\mathbf{h}}_{p}^{T}{\mathbf{x}} + {n_{p}}\\ &=\underbrace {\sqrt {{p_{p}}} {g_{p}}{s_{0}}}_{{\mathrm{desired~signal }}} + \underbrace {{\mathbf{h}}_{p}^{T}\sum\limits_{k = 1}^{K} {{s_{k}}\sqrt {{p_{k}}} {{\mathbf{u}}_{k}}} + {n_{p}}}_{{\mathrm{Interference + noise}}}, \end{aligned} $$
(3)

where h p denotes the channel power gain vector from the CBS to the PU and n p is an AWGN item.

In (2), the covariance matrix Z k of the interference plus noise is given by

$$ {\begin{aligned} {{\mathbf{Z}}_{k}} = E\left[ {{{\mathbf{z}}_{k}}{\mathbf{z}}_{k}^{T}} \right] = \sum\limits_{l = 1,l \ne k}^{K} {{p_{l}}{{\mathbf{H}}_{k}}{{\mathbf{u}}_{l}}} {\mathbf{u}}_{l}^{T}{\mathbf{H}}_{k}^{T} + {p_{p}}{{\mathbf{g}}_{k}}{\mathbf{g}}_{k}^{T} + {{\mathbf{n}}_{k}}{\mathbf{n}}_{k}^{T}, \end{aligned}} $$
(4)

where \({{\mathbf {z}}_{k}} = {{\mathbf {H}}_{k}}\left ({\sum \limits _{l = 1,l \ne k}^{K} {{s_{l}}\sqrt {{p_{l}}} {{\mathbf {u}}_{l}}}} \right) + {s_{0}}\sqrt {{p_{p}}} {{\mathbf {g}}_{k}} + {{\mathbf {n}}_{k}}\). Since the combined interference and noise is not white anymore (its covariance is shown in (4)), the optimal maximum likelihood (ML) detector is equivalent to whitening the received signal, followed by applying the ML detector designed for white noise [33, 34]. We can thus whiten the received signal at the SUk by multiplying \({\mathbf {z}}_{k}^{- {1/2}}\) with r k :

$$ \begin{aligned} {\tilde {\mathbf{r}}_{k}} &= {\mathbf{Z}}_{_{k}}^{- 1/2}{{\mathbf{r}}_{k}} = {\mathbf{Z}}_{_{k}}^{- 1/2}{{\mathbf{H}}_{k}}{b_{k}}\sqrt {{p_{k}}} {{\mathbf{u}}_{k}} \\ &\quad + {\mathbf{Z}}_{_{k}}^{{ - 1/2}} \cdot {{\mathbf{z}}_{k}} = {\tilde {\mathbf{H}}_{k}}{b_{k}}\sqrt {{p_{k}}} {{\mathbf{u}}_{k}} + {\tilde {\mathbf{z}}_{k}}, \end{aligned} $$
(5)

where \({\tilde {\mathbf {H}}_{k}}\) and \({\tilde {\mathbf {z}}_{k}}\) are a transformed MIMO channel matrix and Gaussian white noise after the whitening transformation, respectively. Note that the noise-whitening filter should satisfy the condition of \(E\left [ {{{\tilde {\mathbf {z}}}_{k}}\tilde {\mathbf {z}}_{k}^{T}} \right ] = E\left [ {{\mathbf {Z}}_{_{k}}^{- {1/2}}{{\mathbf {z}}_{k}} \cdot {{\left ({{\mathbf {Z}}_{_{k}}^{- {1/2}}{{\mathbf {z}}_{k}}} \right)}^{T}}} \right ] = {{\mathbf {I}}_{{N_{r}}}}\). The singular value decomposition (SVD) [35] of \({\tilde {\mathbf {H}}_{k}}\) is defined as

$$ {\tilde {\mathbf{H}}_{k}} = {{\mathbf{U}}_{k}}{{\mathbf{\Lambda }}_{k}}{\mathbf{V}}_{k}^{H}, $$
(6)

where \({{\mathbf {\Lambda }}_{k}}\in {\mathbb {C}}^{N_{r} \times N_{t}}\) is a real-valued diagonal matrix and can be partitioned into

$$ {{\mathbf{\Lambda }}_{k}} = \left[ \begin{array}{l} {{\hat {\mathbf{\Lambda }}}_{{\rho_{k}} \times {\rho_{k}}}}\quad \quad {{\mathbf{0}}_{{\rho_{k}} \times \left({{N_{t}} - {\rho_{k}}} \right)}}\\ {{\mathbf{0}}_{\left({{N_{r}} - {\rho_{k}}} \right) \times {\rho_{k}}}}\quad {{\mathbf{0}}_{\left({{N_{r}} - {\rho_{k}}} \right) \times \left({{N_{t}} - {\rho_{k}}} \right)}} \end{array} \right]. $$
(7)

Furthermore, \({\rho _{k}} = \text {rank}\left ({{{\tilde {\mathbf {H}}}_{k}}} \right)\) denotes the rank of the SUk-transformed MIMO channel matrix, which is equivalent to the number of non-zero singular values and satisfies ρ k ≤ min(N t ,N r ). Substituting (7) into (5), we can obtain

$$ {\bar {\mathbf{r}}_{k}} = {{\mathbf{\Lambda }}_{k}}{b_{k}}\sqrt {{p_{k}}} {\bar {\mathbf{u}}_{k}} + {\bar {\mathbf{z}}_{k}}, $$
(8)

Taking the partitions in (7) into account, the last (N r ρ k ) zero components in the received signal \({\bar {\mathbf {r}}_{k}}\) can be ignored. Furthermore, the last (N t ρ k ) components of the transformed beamforming vector are also set to zero for the purpose of avoiding wasting transmit power on those dimensions of zero singular values.

Thus, we focus only on those dimensions corresponding to strictly positive singular values of \({\tilde {\mathbf {H}}_{k}}\). To reduce dimensionality to the rank of \({\tilde {\mathbf {H}}_{k}}\), we take the first ρ k elements in \({\bar {\mathbf {r}}_{k}}\) and obtain

$$ {\begin{aligned} {\hat {\mathbf{r}}_{k}} &= \left[ {{{\mathbf{I}}_{{\rho_{k}}}}{{\mathbf{0}}_{{\rho_{k}} \times \left({{N_{r}} - {\rho_{k}}} \right)}}} \right]{\bar {\mathbf{r}}_{k}} = {\hat {\mathbf{\Lambda }}_{k}}{s_{k}}\sqrt {{p_{k}}} \left[ {{{\mathbf{I}}_{{\rho_{k}}}}{{\mathbf{0}}_{{\rho_{k}} \times \left({{N_{t}} - {\rho_{k}}} \right)}}} \right]{\bar {\mathbf{u}}_{k}}\\ &\quad+ \left[ {{{\mathbf{I}}_{{\rho_{k}}}}{{\mathbf{0}}_{{\rho_{k}} \times \left({{N_{r}} - {\rho_{k}}} \right)}}} \right]{\bar {\mathbf{z}}_{k}} = {\hat {\mathbf{\Lambda }}_{k}}{s_{k}}\sqrt {{p_{k}}} {\hat {\mathbf{u}}_{k}} + {\hat {\mathbf{z}}_{k}}, \end{aligned}} $$
(9)

where \({\hat {\mathbf {u}}_{k}} = \left [\! \mathbf {I}_{\rho _{k}}\!\mathbf {0}_{\rho _{k} \!\times \! \left (N_{t} - \rho _{k} \right)} \right ]{\bar {\mathbf {u}}_{k}}\) and \({\hat {\mathbf {z}}_{k}} = \left [ {{\mathbf {I}}_{{\rho _{k}}}}{\mathbf {0}}_{{\rho _{k}} \times \left ({{N_{r}} - {\rho _{k}}} \right)} \right ]{\bar {\mathbf {z}}_{k}}\). We can invert the channel in (9) to obtain the equivalent expression [36]

$$ {\hat {\mathbf{r}}_{k,\text{inv}}} = \hat {\mathbf{\Lambda }}_{k}^{- 1}{\hat {\mathbf{r}}_{k}} = {b_{k}}\sqrt {{p_{k}}} {\hat {\mathbf{u}}_{k}} + \hat {\mathbf{\Lambda }}_{k}^{- 1}{\hat {\mathbf{z}}_{k}}. $$
(10)

The received signal \({\hat {\mathbf {r}}_{k,\text {inv}}}\) by matching the filtering is then given by

$$ {{\mathbf{y}}_{k}} = \hat {\mathbf{u}}_{k}^{T} \cdot {\hat {\mathbf{r}}_{k,\text{inv}}} = \underbrace {{b_{k}}\sqrt {{p_{k}}} }_{{\mathrm{desire~dsignal}}} + \underbrace {\hat {\mathbf{u}}_{k}^{T}\hat {\mathbf{\Lambda }}_{k}^{- 1}{{\hat {\mathbf{z}}}_{k}}}_{{\mathrm{interference + noise}}{\kern 1pt}}, $$
(11)

where the right-hand side of (11) comprises two terms: the first term is the desired signal and the second term denotes interference plus noise. The SINR at SUk can be derived as

$$ {\gamma_{k}}({{\mathbf{u}}_{k}},{\mathbf{p}}) = \frac{{{p_{k}}}}{{\hat {\mathbf{u}}_{k}^{T}\hat {\mathbf{\Lambda }}_{k}^{- 1}{{\hat {\mathbf{z}}}_{k}}\hat {\mathbf{z}}_{k}^{T}\hat {\mathbf{\Lambda }}_{k}^{- T}{{\hat {\mathbf{u}}}_{k}}}} = \frac{{{p_{k}}}}{{\hat {\mathbf{u}}_{k}^{T}\hat {\mathbf{\Lambda }}_{k}^{- 2}{{\hat {\mathbf{u}}}_{k}}}} = \frac{{{p_{k}}}}{{{i_{k}}}}, $$
(12)

where \({i_{k}} = \hat {\mathbf {u}}_{k}^{T}\hat {\mathbf {\Lambda }}_{k}^{- 2}{\hat {\mathbf {u}}_{k}}\) is the interference function that depends explicitly on the SUk beamformer \({\hat {\mathbf {u}}_{k}}\) as well as all the other users’ beamformer \({\hat {\mathbf {u}}_{l}}\) and power p l , ∀lk, but does not depend on SUk power.

In this setup, individual SUs may adjust their beamforming vector and powers in order to meet a set of specified target SINRs \(\{ \gamma _{1}^{*},\gamma _{2}^{*}, \cdots,\gamma _{K}^{*}\}\) with minimum transmitted power. The target SINRs must be admissible and satisfy [37, 38]

$$ \sum\limits_{k = 1}^{K} {\frac{{\gamma_{k}^{*}}}{{1 + \gamma_{k}^{*}}}} < {N_{t}}, $$
(13)

where N t is the CBS antenna numbers and denotes the signal space of dimension.

Game theory provides a powerful framework for analyzing the problem of competitive utility maximization in wireless communication systems and could indicate whether a stable point, i.e., NE point, exists. A noncooperative game is2 formally defined by using the set of players, the sets of strategies (or actions) that each player may take, and the individual player utility or cost functions.

In general, cost functions for a wireless system depend on the transmission power as well as QoS desired by a given user in the system. In CR MIMO-BCs, due to interference from other SUs and PU, the SU’s transmission power needs to pay off a higher price for the purpose of achieving the abovementioned goal. On the other hand, the SU’s transmission at optimal power will minimize the amount of interference. Therefore, the cost function of the SUk can be defined as

$$ J_{k}\left(p_{k}, \overset{\frown}{\mathbf{u}} \right) = {p_{k}}\left({{i_{k}} + \lambda {{\left| {{\mathbf{h}}_{p}^{T}{{\mathbf{u}}_{k}}} \right|}^{2}}} \right), $$
(14)

where λ is the pricing factor. Note that a higher λ implies that the SUs will pay off a higher price. Due to the condition of \({{\mathbf {u}}_{k}} \,=\, {{\mathbf {V}}_{k}}{\bar {\mathbf {u}}_{k}}\! =\! {{\mathbf {V}}_{k}}{\left [\! {{{\hat {\mathbf {u}}}_{k}}{{\mathbf {0}}_{\left (\! {{N_{t}}\! -\! {\rho _{k}}}\! \right) \times 1}}} \!\right ]^{T}} \,=\, \left [ {{{\hat {\mathbf {V}}}_{{N_{t}} \times {\rho _{k}}}}\!{{\mathbf {V}}_{{N_{t}} \times \left ({{N_{t}} - {\rho _{k}}} \right)}}} \right ]{\left [ {{{\hat {\mathbf {u}}}_{k}}{{\mathbf {0}}_{\left ({{N_{t}} - {\rho _{k}}} \right) \times 1}}} \right ]^{T}} = {\hat {\mathbf {V}}_{{N_{t}} \times {\rho _{k}}}}{\hat {\mathbf {u}}_{k}}\), the cost function of the SUk in (14) can be rewritten as

$$ \begin{aligned} {J_{k}}\left({{p_{k}},{{\hat {\mathbf{u}}}_{k}}} \right) &= {p_{k}} \cdot \left({{i_{k}} + \lambda \cdot {{\left| {{\mathbf{h}}_{p}^{T}\hat {\mathbf{V}}{{\hat {\mathbf{u}}}_{k}}} \right|}^{2}}} \right)\\ &= {p_{k}}\left({\hat {\mathbf{u}}_{k}^{T}\hat {\mathbf{\Lambda }}_{k}^{- 2}{{\hat {\mathbf{u}}}_{k}} + \lambda \cdot \hat {\mathbf{u}}_{k}^{T}{{\hat {\mathbf{V}}}^{T}}{{\mathbf{h}}_{p}}{\mathbf{h}}_{p}^{T}\hat {\mathbf{V}}{{\hat {\mathbf{u}}}_{k}}} \right)\\ &= {p_{k}}\hat {\mathbf{u}}_{k}^{T}\left({\hat {\mathbf{\Lambda }}_{k}^{- 2} + \lambda \cdot {{\hat {\mathbf{V}}}^{T}}{{\mathbf{h}}_{p}}{\mathbf{h}}_{p}^{T}\hat {\mathbf{V}}} \right){\hat {\mathbf{u}}_{k}} = \underbrace {{p_{k}}}_{{f_{k}}({p_{k}})}\underbrace {\hat {\mathbf{u}}_{k}^{T}{{\mathbf{S}}_{k}}{{\hat {\mathbf{u}}}_{k}}}_{{g_{k}}\left({{{\hat {\mathbf{u}}}_{k}}} \right)},~~ \end{aligned} $$
(15)

where \({{\mathbf {S}}_{k}} = \hat {\mathbf {\Lambda }}_{k}^{- 2} + \lambda \cdot {\hat {\mathbf {V}}^{T}}{{\mathbf {h}}_{p}}{\mathbf {h}}_{p}^{T}\hat {\mathbf {V}}\) is the correlation matrix of the interference plus noise.

3 Joint beamforming and power control as a noncooperative separable game

According to the separable game definition [39, ch. 11], the cost function in (15) is separable with respect to the parameters beamforming and power control, leading to a separable game with two separate subgames: beamforming subgame and power control subgame.

3.1 Noncooperative beamforming subgame

In the proposed noncooperative game, the SUs’ power are fixed and individual SUs are capable of adjusting their beamforming in their corresponding strategy spaces to minimize their corresponding cost function. The noncooperative beamforming subgame (NPBS) can be modeled as

$$ \text{NPBS} = \left[\Omega,\left\{{\overset{\frown}{\mathbf{u}}_{k}}\right\},\left\{{J_{k}}\left(\cdot \right)\right\} \right], $$
(16)

where the components of NBPS are given as follows.

  1. 1.

    Players SUk: kΩ={1,2,⋯,K}.

  2. 2.

    Action spaces: \(\left \{ {{\overset {\frown }{\mathbf {u}}_{k}}} \right \}\) is the set of beamforming strategies for SUk.

  3. 3.

    Cost functions: {J k (·)} is the cost function that maps the SUk beamforming spaces for fixed power control.

The cost function of NPBS is defined as

$$ \min {J_{k}}\left({{p_{k}},{{\hat {\mathbf{u}}}_{k}}} \right)\left| {{~}_{{\mathbf{P}} = {\text{fixed}}}} \right.,s.t.\hat {\mathbf{u}}_{k}^{T}{\hat {\mathbf{u}}_{k}} = 1. $$
(17)

Here, our aim is to investigate the existence of a Nash equilibrium for NPBS as well as identify the best response strategies for SUs. Before establishing the uniqueness of the Nash equilibrium for NPBS, we state the following formal definitions [40, 41]:

Definition 1 (Nash equilibrium of NPBS): The beamforming matrix \(\overset {\frown }{\mathbf {U}} = [{\overset {\frown }{\mathbf {u}}_{1}},{\overset {\frown }{\mathbf {u}}_{2}}, \cdots {\overset {\frown }{\mathbf {u}}_{K}}]\) is a Nash equilibrium of NPBS if, for every SUk, we have

$$ {\begin{aligned} &\ {J_{k}}\left({\overset{\frown}{\mathbf{u}}_{1}}, \cdots,{\overset{\frown}{\mathbf{u}}_{k-1}},{\overset{\frown}{\mathbf{u}}_{k}},{\overset{\frown}{\mathbf{u}}_{k + 1}}, \cdots {\overset{\frown}{\mathbf{u}}_{K}}\right) \\ \le &\ {J_{k}}\left({\overset{\frown}{\mathbf{u}}_{1}}, \cdots,{\overset{\frown}{\mathbf{u}}_{k - 1}},\overset{\frown}{\mathbf{u}}_{k}^{\prime},{\overset{\frown}{\mathbf{u}}_{k + 1}}, \cdots {\overset{\frown}{\mathbf{u}}_{K}}\right),~~\forall \overset{\frown}{\mathbf{u}}_{k}^{\prime} \in \left\{{\overset{\frown}{\mathbf{u}}_{k}}\right\}. \end{aligned}} $$
(18)

Definition 2 (Best response for NPBS): The best response strategy of SUk beamforming to the other SUs is the set

$$ {\begin{aligned} B_{k}^{{\hat {\mathbf{u}}}_{k}} &= \left\{{\overset{\frown}{\mathbf{u}}_{k}} \in \left\{ {\overset{\frown}{\mathbf{u}}_{k}}\right\} \left| {J_{k}}\left({\overset{\frown}{\mathbf{u}}_{1}}, \cdots,{\overset{\frown}{\mathbf{u}}_{k - 1}},{\overset{\frown}{\mathbf{u}}_{k}},{\overset{\frown}{\mathbf{u}}_{k + 1}}, \cdots {\overset{\frown}{\mathbf{u}}_{K}}\right) \right.\right.\\ &\left.\le{J_{k}}\left(\!{\overset{\frown}{\mathbf{u}}_{1}},\! \cdots \!,{\overset{\frown}{\mathbf{u}}_{k-1}},\overset{\frown}{\mathbf{u}}_{k}^{\prime},{\overset{\frown}{\mathbf{u}}_{k + 1}}, \!\cdots\! {\overset{\frown}{\mathbf{u}}_{K}}\right),~~\forall \overset{\frown}{\mathbf{u}}_{k}^{\prime} \in \left\{{\overset{\frown}{\mathbf{u}}_{k}}\right\} \right\}\!. \end{aligned}} $$
(19)

Definition 3 (Convex game): If the best response function of SUk is a standard function, then NE in this game will be unique. The corresponding game is convex for non-empty, closed, and bounded convex set \(\{ {\overset {\frown }{\mathbf {u}}_{k}}\}\), if the cost function of each SUk is in \({\overset {\frown }{\mathbf {u}}_{k}}\) for every fixed \({\overset {\frown }{\mathbf {u}}_{l}}\), where lk.

For a fixed SU’s power, the cost function in (17) is a quadratic form in the beamforming vector \({\overset {\frown }{\mathbf {u}}_{k}}\). Taking the second-order derivative of the cost function with respect to \({\overset {\frown }{\mathbf {u}}_{k}}\), we get

$$ \frac{{{\partial^{2}}{J_{k}}}}{{\partial \hat {\mathbf{u}}_{k}^{2}}} = 2{p_{k}}{{\mathbf{S}}_{k}}, $$
(20)

where S k is a positive definite matrix. Evidently, the cost function of NPBS is also convex. According to the results proved in Theorem 1 in [42], for concave games, we can extend in a straightforward way to prove the existence of a Nash equilibrium point for convex games [38]. As a consequence, we establish the existence of the equilibrium point for NPBS. In order to solve the best response of the NPBS Nash equilibrium, we defined the SUk Lagrangian function as

$$ \begin{aligned} L_{k}^{{{\hat {\mathbf{u}}}_{k}}}\left({{{\hat {\mathbf{u}}}_{k}},{\alpha_{k}}}\right) &= {J_{k}} + {\alpha_{k}}\left({\hat {\mathbf{u}}_{k}^{T}{{\hat {\mathbf{u}}}_{k}} - 1} \right)\\ &= {p_{k}}\hat {\mathbf{u}}_{k}^{T}{{\mathbf{S}}_{k}}{\hat {\mathbf{u}}_{k}} + {\alpha_{k}}\left({\hat {\mathbf{u}}_{k}^{T}{{\hat {\mathbf{u}}}_{k}} - 1} \right), \end{aligned} $$
(21)

where α k is the Lagrange multipliers. Taking the first-order derivative with respect to the beamformer \({\overset {\frown }{\mathbf {u}}_{k}}\), it leads to the eigenvector equation

$$ \frac{{\partial L_{k}^{{{\hat {\mathbf{u}}}_{k}}}}}{{\partial {{\hat {\mathbf{u}}}_{k}}}} = 2{p_{k}}{{\mathbf{S}}_{k}}{\hat {\mathbf{u}}_{k}} + 2{\alpha_{k}}{\hat {\mathbf{u}}_{k}}. $$
(22)

Let (22) equal to zero, the best response function of the NPBS Nash equilibrium is given by

$$ {{\mathbf{S}}_{k}}{\hat {\mathbf{u}}_{k}} = - \frac{{{\alpha_{k}}}}{{{p_{k}}}}{\hat {\mathbf{u}}_{k}}, $$
(23)

where the best response strategy of NPBS is the eigenvector \({\overset {\frown }{\mathbf {\nu }}_{k}}\), corresponding to the minimum eigenvalue of S k . Thus, at the Nash equilibrium, all SUs beamforming will be the minimum eigenvectors of S k . In order to investigate whether the minimum eigenvector strategy is optimal for the SUk cost function or not, we use the Taylor series to expand the Lagrangian function around the point which satisfies the necessary Karush-Kuhn-Tucker (KKT) conditions [38, 43]. In this expansion, the term containing the first derivative is equal to zero, provided that the higher order terms are neglected. Therefore, we only prove that the second-order term of (21) in the Taylor expansion is positive and satisfies the KKT conditions. The second derivative of (21) is given by

$$\begin{array}{*{20}l} D_{k}^{{{\hat {\mathbf{u}}}_{k}}} &= \left({ - 1} \right)\left| \begin{array}{l} \quad \frac{{{\partial^{2}}L_{k}^{{{\hat {\mathbf{u}}}_{k}}}\left({{{\hat {\mathbf{u}}}_{k}},{\alpha_{k}}} \right)}}{{\partial \hat {\mathbf{u}}_{k}^{2}}}\quad \quad \frac{{{\partial^{2}}L_{k}^{{{\hat {\mathbf{u}}}_{k}}}\left({{{\hat {\mathbf{u}}}_{k}},{\alpha_{k}}} \right)}}{{\partial {{\hat {\mathbf{u}}}_{k}}\partial {\alpha_{k}}}}\\ {\left({\frac{{{\partial^{2}}L_{k}^{{{\hat {\mathbf{u}}}_{k}}}\left({{{\hat {\mathbf{u}}}_{k}},{\alpha_{k}}} \right)}}{{\partial {{\hat {\mathbf{u}}}_{k}}\partial {\alpha_{k}}}}} \right)^{T}}\quad \frac{{{\partial^{2}}L_{k}^{{{\hat {\mathbf{u}}}_{k}}}\left({{{\hat {\mathbf{u}}}_{k}},{\alpha_{k}}} \right)}}{{\partial \alpha_{k}^{2}}} \end{array} \right|\\ &= \left({ - 1} \right)\left| \begin{array}{l} 2{p_{k}}{{\mathbf{S}}_{k}} + 2{\alpha_{k}}\quad 2{{\hat {\mathbf{u}}}_{k}}\\ \quad \quad 2\hat {\mathbf{u}}_{k}^{T}\quad \;\;\;\;\;\;\;0 \end{array} \right| = 4 > 0, \end{array} $$
(24)

which is positive and satisfies the KKT conditions [32, 34]. Hence, the Nash equilibrium point with respect to the constrained minimization of the SUk cost function is optimum.

3.2 Noncooperative power control subgame

In the proposed noncooperative game, the SUs’ beamforming vectors are assumed to be fixed, and individual SUs adjust their power in their corresponding strategy spaces for the purpose of minimizing their corresponding cost function. The noncooperative beamforming subgame (NPCS) can be modelled as

$$ \text{NPCS} = \left[\Omega,\{ {p_{k}}\},\{ {J_{k}}\left(\cdot \right)\} \right], $$
(25)

where the components of the NPCS are given in the list as follows.

  1. 1.

    Players SUk: lΩ={1,2,⋯,K}.

  2. 2.

    Action spaces: {p k } is the set of power control strategies for SUk.

  3. 3.

    Cost functions: {J k (·)} is the cost function that maps the SUk power control spaces for fixed beamforming vectors.

The cost functions of NPCS is defined as

$$ \min \;{J_{k}}\left({{p_{k}},{{\hat {\mathbf{u}}}_{k}}} \right)\left| {{~}_{{{\hat {\mathbf{u}}}_{k}} = {\text{fixed}}}} \right.\;\;s.\ t. \;\; {p_{k}} = \gamma_{k}^{*} \cdot {i_{k}}, $$
(26)

where \(\gamma _{k}^{*}\) is the SUk target SINR. In order to investigate the existence of a Nash equilibrium for NPCS and identify the best response strategies for SUs, we can define the SUk Lagrangian function as

$$ \begin{aligned} L_{k}^{{p_{k}}}\left({{p_{k}},{\beta_{k}}} \right) &= {J_{k}} + {\beta_{k}}\left({{p_{k}} - \gamma_{k}^{*} \cdot {i_{k}}} \right)\\ &={p_{k}}\hat {\mathbf{u}}_{k}^{T}{{\mathbf{S}}_{k}}{\hat {\mathbf{u}}_{k}} + {\beta_{k}}\left({{p_{k}} - \gamma_{k}^{*} \cdot {i_{k}}} \right), \end{aligned} $$
(27)

where β k is a Lagrange multiplier. Taking the first-order derivative with respect to the multiplier β k , we obtain

$$ \frac{{\partial L_{k}^{{p_{k}}}}}{{\partial {\beta_{k}}}} = {p_{k}} - \gamma_{k}^{*} \cdot {i_{k}} = 0, $$
(28)

which is a necessary condition for the constrained optimization problem (26), indicating that the SUk-transmitted power should match its target SINR by giving the interference function i k , i.e.,

$$ {p_{k}} = \gamma_{k}^{*} \cdot {i_{k}}, $$
(29)

where the best response function of NPCS is obtained from (29), and then \(\gamma _{k}^{*} \cdot {i_{k}}\) is the Nash equilibrium point. Similar to (24), we can use the Taylor series to expand the Lagrangian function around the Nash equilibrium point as [34]

$$ \begin{aligned} D_{k}^{{p_{k}}} &= \left({ - 1} \right)\left| \begin{array}{l} \quad \frac{{{\partial^{2}}L_{k}^{{p_{k}}}\left({{p_{k}},{\beta_{k}}} \right)}}{{\partial p_{k}^{2}}}\quad \quad \frac{{{\partial^{2}}L_{k}^{{p_{k}}}\left({{p_{k}},{\beta_{k}}} \right)}}{{\partial {p_{k}}\partial {\beta_{k}}}}\\ {\left({\frac{{{\partial^{2}}L_{k}^{{p_{k}}}\left({{p_{k}},{\beta_{k}}} \right)}}{{\partial {p_{k}}\partial {\beta_{k}}}}} \right)^{T}}\quad \;\frac{{{\partial^{2}}L_{k}^{{p_{k}}}\left({{p_{k}},{\beta_{k}}} \right)}}{{\partial \beta_{k}^{2}}} \end{array} \right| \\&= \left({ - 1} \right)\left| \begin{array}{l} 0\quad 1\\ 1\quad 0 \end{array} \right| = 1 > 0. \end{aligned} $$
(30)

In this case, the cost function of NPCS is a convex function, which is linear in p k . Thus, following [44], there exists a Nash equilibrium. This is also optimal corresponding to the best response strategy, which is capable of updating power to match the target SINR.

3.3 Iterative algorithm

In this section, our proposed algorithm employs incremental updates for beamforming vectors and transmitted powers, which are designed to reduce interference at downlink receivers subject to specified SINR and beamforming constraints. We firstly model the joint beamforming and power control subgame (JBPS) as

$$ \text{JBPS} = \left[\Omega,\left\{{\overset{\frown}{\mathbf{u}}_{k}} \times {p_{k}}\right\},\left\{ {J_{k}}\left(\cdot \right)\right\} \right]. $$
(31)

To find a Nash equilibrium for JBPS, we must consider the two subgames (NPBS and NPCS) to assure their corresponding best response strategies. However, direct application of the best response strategies is not guaranteed to converge to the optimal Nash equilibrium, although multiple Nash equilibrium points for the JBPS are possible to be achieved. Moreover, these best response strategies may cause large-angle deviation during updating of beamforming vectors and/or abrupt power control changes to meet the target SINRs. It is thus not desirable for a practical system, because it may increase error probability at the receiver or even lead to connection loss from the transmitter to the receiver. From a practical perspective, a more desirable approach can be employed to vary the SUs’ beamforming and power control in small increments.

In order to avoid the minimum eigenvector being far away in signal space from the current beamforming vector, we use an incremental update that adapts the user beamforming vector in the direction of the minimum eigenvector. Hence, the beamforming vectors update of SUk at step n of the algorithm can be given by

$$ {\hat {\mathbf{u}}_{k}}\left({n + 1} \right) = \frac{{{{\hat {\mathbf{u}}}_{k}}\left(n \right) + m\beta {{\boldsymbol{\nu }}_{k}}\left(n \right)}}{{\left\| {{{\hat {\mathbf{u}}}_{k}}\left(n \right) + m\beta {{\boldsymbol{\nu }}_{k}}\left(n \right)} \right\|}}, $$
(32)

where β (called beamforming pricing factor) is a constant and \(m = {\mathop {\text {sgn}}} [\hat {\mathbf {u}}_{k}^{T}\left (n \right){{\boldsymbol {\nu }}_{k}}(n)]\).

From (29), the transmitted power corresponding to user k should match its target SINR. After the beamforming update in (32), the power value matching the desired target SINR is given by

$$ {p_{k}}(n + 1)=\gamma_{k}^{*}{\hat {\mathbf{u}}_{k}}\left({n + 1} \right)^{T}{{\mathbf{S}}_{k}}^{-2}{\hat {\mathbf{u}}_{k}}\left({n + 1} \right), $$
(33)

In order to avoid abrupt variations, the power control updates of SUk at step n of the algorithm using a gradient-based approach and the corresponding power control algorithm in small increments are given by

$$ {p_{k}}(n + 1)-{p_{k}}\left(n \right)= \mu \left(\gamma_{k}^{*}{i_{k}}\left(n \right)- {p_{k}}\left(n \right)\right), $$
(34)

where μ (0<μ<1) (called power pricing factor) is a constant. The joint beamforming and power control update algorithm is given in Table 1.

Table 1 The joint beamforming and power control update algorithm

4 Numerical results and simulations

In this section, numerical simulation results are provided to validate the proposed theoretical analysis and examine the performance of our proposed algorithms for various scenarios simultaneously. In CR MIMO-BCs, the channels are subject to Rayleigh fading with zero mean and unity variance. Furthermore, 10,000 channel matrices are generated with Monte Carlo simulations. We consider the CBS with six antennas (N t =6) and the five SUs each equipped with four antennas (N r =4), and the noise covariance matrix is defined as W k =0.1I 4. In addition, the power control matrix and the target SINRs are initialized as P=0.1I 5 and γ ={2,1.9,1.8,1.7,1.5}, respectively, where the target SINRs for SUs are set to satisfy the admissibility condition in (13). Note that the following parameter values are used in all numerical simulations unless stated otherwise: μ=0.5, β=0.5, λ=0.1, and ε=10−6.

In Fig. 3, we examine the convergence speed of the proposed algorithm. It is well known the optimal beamforming signal can be obtained through the zero-forcing algorithm[45]. As observed from Fig. 3, after several iterations, the proposed algorithm converges to the optimal SINRs, and it can achieve better performance than zero-forcing algorithm. Evidently, the proposed algorithm has a fast converge speed.

Fig. 3
figure 3

The comparison of the proposed algorithm and zero-forcing algorithm for the target SINRs

Figures 4, 5, and 6 show the variations of SUs’ power, SINRs, and cost functions for variable target SINRs. These features are useful in CR MIMO-BCs, where different QoS requirements may lead to the target SINR variation. We assume that the system starts with P=0.1I 5 and γ ={2,1.9,1.8,1.7,1.5}, making a Nash equilibrium (NE) configuration be achieved at iteration 10. We then vary the first SU’s target SINR \(\gamma _{1}^{*} = 2\) to new value \(\gamma _{1}^{*} = 1\), leading the algorithm to reach a new NE configuration at iteration 35. Finally, we change SINR \(\gamma _{1}^{*} = 1\) back to its old value \(\gamma _{1}^{*} = 2\), and the system transitions back to the original set of target values between 30 iterations and 50 iterations. As a consequence, we can observe the system transitions from one optimal configuration to another one for variable target SINRs. The algorithm can also track variable target SINRs or a variable number of active users in the system and is therefore useful for dynamic wireless systems with varying QoS requirements. The beamforming matrices of the proposed algorithm are respectively obtained as follows:

$$ \begin{aligned} {{{\mathbf{S}}_{1}}= \left[ {\begin{array}{ccccc} { - 0.7585}&{0.0837}&{ - 0.3258}&{0.5741}&{ - 0.2577}\\ { - 0.3080}&{0.2390}&{ - 0.6237}&{ - 0.5845}&{0.0622}\\ {0.2925}&{ - 0.0904}&{ - 0.4672}&{0.1468}&{ - 0.5583}\\ {0.3001}&{0.3993}&{0.1157}&{0.3617}&{ - 0.3332}\\ {0.3601}&{ - 0.8310}&{ - 0.4280}&{0.3427}&{0.5017}\\ { - 0.1564}&{0.2789}&{ - 0.3000}&{0.2428}&{0.5052} \end{array}} \right]},\\ {{{\mathbf{S}}_{2}} = \left[ {\begin{array}{ccccc} { - 0.7557}&{0.0850}&{ - 0.3626}&{0.5656}&{ - 0.1272}\\ { - 0.3072}&{0.2521}&{ - 0.6666}&{ - 0.5906}&{ - 0.0184}\\ {0.3024}&{ - 0.1541}&{ - 0.4034}&{0.1056}&{ - 0.6227}\\ {0.2922}&{0.3103}&{0.1750}&{0.3596}&{ - 0.3516}\\ {0.3706}&{ - 0.8515}&{ - 0.4002}&{0.3415}&{0.5217}\\ { - 0.1428}&{0.2892}&{ - 0.2659}&{0.2724}&{0.4470} \end{array}} \right]}. \end{aligned} $$
(35)
Fig. 4
figure 4

The variations of SUs’ power versus the number of iterations for variable target SINRs

Fig. 5
figure 5

The variations of SINRs versus the number of iterations for variable target SINRs

Fig. 6
figure 6

The variations of cost functions versus the number of iterations for variable target SINRs

Figures 7, 8, and 9 show the variations of SUs’ power, SINRs, and cost functions for a variable number of SUs. We start with five SUs, with different target SINRs γ ={2,1.9,1.8,1.7,1.5} being considered. The SUs’ beamforming matrix is initialized randomly, with the SUs’ power matrix being taken as P=0.1I 5. We then assume SU5 to be inactive. The remaining active SUs keep the same target SINR as γ ={2,1.9,1.8,1.7}, which satisfies the admissibility condition in (13). After that, we assume SU5 to be active in the system, so that the number of SUs becomes five again, and the same target SINR as before γ ={2,1.9,1.8,1.7,1.5} can be obtained. We note that the system remains in a optimal configuration until iteration 28 when user 5 is dropped from the system. This implies an increase of user SINRs above their corresponding targets due to decreasing multiuser interference. Moreover, we can observe that the proposed algorithm finally converges towards the unique NE configuration for a variable number of SUs. The proposed algorithm is illustrated with numerical examples obtained from simulations which illustrate convergence and tracking properties of the algorithm for different scenarios This change in system configuration triggers the adaptation stage of the algorithm, which yields beamforming matrix S 3.

$$\begin{array}{@{}rcl@{}} {{{\mathbf{S}}_{3}} = \left[ {\begin{array}{cccc} { - 0.7229}&{0.0788}&{ - 0.3522}&{0.5833}\\ { - 0.3171}&{0.1763}&{ - 0.6329}&{ - 0.5777}\\ {0.2716}&{ - 0.0232}&{ - 0.4429}&{0.2020}\\ {0.2871}&{0.5249}&{0.1137}&{0.3563}\\ {0.4435}&{ - 0.7910}&{ - 0.4277}&{0.3389}\\ { - 0.1548}&{0.2468}&{ - 0.2889}&{0.2083} \end{array}} \right]}. \end{array} $$
(36)
Fig. 7
figure 7

The variations of SUs’ power versus the number of iterations for a variable number of SUs

Fig. 8
figure 8

The variations of SINRs versus the number of iterations for a variable number of SUs

Fig. 9
figure 9

The variations of cost functions versus the number of iterations for a variable number of SUs

As is the case with incremental algorithms, the convergence speed of the algorithm depends on the values of the corresponding increments specified by the algorithm constants β and μ. We performed extensive simulations of the proposed algorithm to study convergence to the optimal Nash equilibrium. Figures 10 and 11 illustrate the average number of ensemble iterations for the purpose of converging to the optimal NE with varying β and μ. We ran 1000 trials of the algorithm and recorded the number of ensemble iterations for convergence. It is observed that convergence to the optimal NE is mostly determined by the μ rather than β. Moreover, it can be seen that the optimal NE is reached in less than 20 ensemble iterations under constants of β=0.5 and μ=0.5 in all considered scenarios.

Fig. 10
figure 10

The average number of ensemble iterations for convergence to the optimal NE for varying β and fixed μ=0.5

Fig. 11
figure 11

The average number of ensemble iterations for convergence to the optimal NE for varying μ and fixed β=0.5

Figure 12 shows the variations of the total interference to PU versus the pricing factor λ. As expected, the total interference to PU is a monotonically decreasing function of λ. This is because SU will pay off a higher price to improve its QoS due to the interference from the other SUs and PU. Furthermore, we can also observe that the total interference to PU is minimized at λ=0.06 and the minimum interference of PU will remain unchanged while λ increases.

Fig. 12
figure 12

The variations of the total interference to PU versus the power pricing factor λ

5 Conclusions

In this paper, the problem of joint beamforming and power control in CR MIMO-BCs has been formulated as a noncooperative separable game, which can be further divided into a beamforming subgame and power control subgame. We have investigated the best response strategies of NE solutions in terms of its corresponding subgames: in NPBS, players’ power control strategies are fixed and they are subject to KKT conditions to update their beamforming strategies. In NPCS, on the other hand, players’ beamforming strategies are fixed and they update only their power control strategies subject to target SINRs. It is shown that the best response function of NPBS can be formulated, with SU k beamforming vector corresponding to the minimum eigenvector of the matrix S k , whereas the best response function of NPCS is derived to enable the transmission with a power control that corresponds to the desired target SINR. Furthermore, we have proposed an iterative algorithm to achieve the optimal Nash equilibrium of JBPS. The proposed algorithm is capable of tracking the variation of target SINRs and/or number of SUs in the system and is therefore useful for dynamic wireless systems with varying QoS requirement. Finally, the proposed algorithm may achieve a faster convergence speed by setting different parameter β and μ values.

6 Endnotes

1 Separable games are a specific class of noncooperative games where the player’s cost is a separable function of their strategic choices [34, 39].

2 The game is noncooperative in the sense that a given player is interested only in minimization of its individual cost function, without paying attention to how its actions affect the other players.