1 Introduction

Cognitive radio (CR) has been recently proposed as a promising technology to improve spectrum utilization by enabling secondary access to unused licensed bands. A prerequisite to this secondary access is having no interference to the primary system. This requirement makes spectrum sensing a key function in cognitive radio systems. Among common spectrum sensing techniques, energy detection is an engaging method due to its simplicity and efficiency. However, the major disadvantage of energy detection is the hidden node problem, in which the sensing node cannot distinguish between an idle and a deeply faded or shadowed band [1]. Cooperative spectrum sensing (CSS) which uses a distributed detection model has been considered to overcome that problem [212].

Cooperation among CR users (CUs) is usually coordinated by a fusion center (FC). For each sensing interval, CUs will send their sensing data to the FC. In the FC, all local sensing data will be combined to make a final decision on whether the primary signal is present or absent. An optimal data fusion rule was firstly considered by Chair and Varshney in [13]. Despite a good performance, the requirement for knowledge of detection and false alarm probabilities at each local node is still a barrier to the optimal fusion rule.

CSS schemes require a large communication resource including sensing time delay, control channel overhead, and consumption energy for reporting sensing data to the FC, especially when the network size is large. There are some previous works [39] that considered this problem. In our previous work [3], we proposed an ordered sequential reporting mechanism based on sensing data quality to reduce communication resources. A similar sequential ordered report transmission approach was considered for reducing the reporting time in [4]. However, the reporting time of these methods is still unpredictably long. In [5], the authors proposed to use a censored truncated sequential spectrum sensing technique for saving energy. On the other hand, cluster-based CSS schemes are considered for reducing the energy of CSS [6] and for minimizing the bandwidth requirements by reducing the number of terminals reporting to the fusion center [7]. In [8], Chen et al. proposed a cluster-based CSS scheme to optimize the cooperation overhead along with the sensing reliability. In fact, these proposed cluster schemes can reduce the amount of direct cooperation with the FC but cannot reduce the communication overhead between CUs and the cluster header. A similar problem can be observed in the cluster scheme in [9], though the optimal cluster size to maximize the throughput used for negotiation is identified. Another consideration of the cluster scheme is to enhance sensing performance when the reporting channel suffers from a severe fading environment [10, 11].

In this paper, we propose a cluster-based selective CSS scheme which utilizes an efficient selective method for the best quality sensing data and a parallel reporting mechanism. The selective method, which is usually adopted in cooperative communications [14, 15], is applied in each cluster to implicitly select the best sensing node during each sensing interval as the cluster header without additional collaboration among CUs. The parallel reporting mechanism based on frequency division is considered to strongly reduce the reporting time of the cluster decision. In the FC, the optimal Chair-Vashney rule (CV rule) is utilized to obtain a high sensing performance based on the available cluster’s signal-to-noise ratio (SNR). In this way, the proposed cooperative sensing will be performed with an extremely low cooperation resource while a certain high level of sensing performance is ensured.

The remainder of this paper is organized as follows. In Section 2, some background on spectrum sensing and optimal fusion rule is described. In Section 3, we present system descriptions. The proposed system model and detailed descriptions of the proposed cluster-based selective CSS scheme are also given in Section 4. Simulation results are shown in Section 5. Finally, the conclusions are drawn in Section 6.

2 Preliminaries

2.1 Local spectrum sensing

Each CU conducts a spectrum sensing process, which is called local spectrum sensing in distributed scenario for detecting the primary user (PU) signal. Local spectrum sensing at the i th CU is essentially a binary hypotheses testing problem:

H 0 : x i t = n i t , H 1 : x i t = h i s t + n i t ,
(1)

where H0 and H1 correspond, respectively, to hypotheses of absence and presence of the PU signal, x i (t) represents received data at CU i , h i denotes the gain of the channel between the PU and the CU i , s(t) is the signal transmitted from the primary user, and n(t) is additive white Gaussian noise. Additionally, channels corresponding to different CUs are assumed to be independent, and further, all CUs and PUs share a common spectrum allocation.

Among various methods for spectrum sensing, energy detection has been shown to be quite simple, quick, and able to detect the primary signal - even if the feature of the primary signal is unknown. Here, we consider the energy detection for local spectrum sensing. Figure 1 shows the block diagram of an energy detection scheme. To measure the signal power in a particular frequency region in a time domain, a band-pass filter is applied to the received signal, and power of the signal samples is then measured at CU. The estimation of received signal power is given at CU i by the following equation:

x E i = j = 1 N x j 2 ,
(2)
Figure 1
figure 1

Block diagram of the energy detection scheme.

where x j is the j th sample of the received signal and N = 2T W in which T and W correspond to detection time and signal bandwidth in hertz, respectively.

If the primary signal is absent, x E i follows a central chi-square distribution with N degrees of freedom; otherwise, x E i follows a noncentral chi-square distribution with N degrees of freedom and a noncentrality parameter θ i  = N γ i , i.e.,

x E i χ N 2 , H 0 , χ N 2 θ i H 1 .
(3)

When N is relatively large (e.g., N > 200) [16], x E can be well approximated as a Gaussian random variable under both hypotheses H1 and H0, according to the central limit theorem such that

x E i N N , 2 N , H 0 , N N 1 + γ i , 2 N 1 + 2 γ i H 1 ,
(4)

where γ i is the SNR of the primary signal at the CU.

For the case of local sensing or hard decision fusion, the CUs will make the local sensing decision based on an energy threshold λ i as follows:

D i = 1 , x E i > λ i , - 1 , otherwise ,
(5)

where D i  = 1 and D i  = -1 mean that the hypotheses of H1 and H0 are declared at the i th CU, respectively. The local probability of detection and the local probability of false alarm can be determined based on (4) by:

p f i =Pr D i = 1 H 0 =Q λ i - N 2 N
(6)

and

p d i =Pr D i = 1 H 1 =Q λ i - N 1 + γ i 2 N 1 + 2 γ i ,
(7)

respectively, where Q(.) is the Marcum-Q function, i.e., Q x = 1 2 π x e - t 2 2 dt.

2.2 The optimal fusion rule for global decision

Chair and Varshney provided the optimal data fusion rule in a distributed local hard decision detection system [13]. This optimal rule is in fact the sum of weighted local decisions where the weights are functions of probabilities of detection and false alarm.

The optimal fusion rule is based on the likelihood ratio test as follows:

Pr D 1 , , D n H 1 Pr D 1 , , D n H 0 H 0 H 1 π 0 C 10 - C 00 π 1 C 01 - C 11 ,
(8)

where π0 and π1 are the prior probabilities of the presence and absence of the PU signal, respectively, and C00, C01, C10, and C11 are the decision costs. If we choose C00 = C11 = 0 and C01 = C10 = 1, the likelihood ratio test now follows the minimum probability of error criterion [17], and (8) can be rewritten as:

Pr D 1 , , D n H 1 Pr D 1 , , D n H 0 H 0 H 1 π 0 π 1 ,
(9)

Since the decision set {D i } is independent, the log-likelihood ratio test corresponding to (9) is as follows:

log i = 1 n Pr D i H 1 i = 1 n Pr D i H 0 H 0 H 1 log π 0 π 1 ,
(10)

If S+ and S- denote the set of all i such that {D i  = 1} and {D i  = -1}, respectively, then (10) can be computed by:

S + Pr D i = 1 H 1 Pr D i = 1 H 0 + S - Pr D i = - 1 H 1 Pr D i = - 1 H 0 H 0 H 1 log π 0 π 1 .
(11)

Finally, the Chair-Vashney fusion rule can be rewritten in the form of the weighting formula as follows:

W 0 + i = 1 n W i D i H 0 H 1 0,
(12)

where W 0 =log π 1 π 0 and

W i = log p d i p f i if D i = 1 log 1 - p d i 1 - p f i if D i = - 1 .
(13)

Local false alarm probability p f i and local detection probability p d i are defined in (6) and (7), respectively.

3 System description

In this paper, the CR network, which shares the same spectrum band with a license system, utilizes a cluster-based CSS scheme as shown in Figure 2. The CR network is organized in multiple clusters in each of which the CUs have an identical average SNR of the received primary signal. This identical SNR assumption can be practical when the clusters are divided according to geographical position, i.e., adjacent CUs in a small area are gathered into a cluster. The header in each cluster is not fixed but dynamically selected for each sensing interval based on the quality of the sensing data at each CU. In detail, the node with the most reliable sensing result will take on the cluster header’s roles which include making and reporting the cluster’s decision to the FC. In order to reduce the reporting time and bandwidth, only the sensing data of the cluster header, which is the most reliable sensing data, is utilized to make the cluster decision. This method means that the decision of a cluster is made according to the selective combination method. The FC will combine all cluster decisions to make a final decision and broadcast the final sensing decision to the whole network.

Figure 2
figure 2

System model.

The fusion rule in the FC can be any kind of hard decision fusion rules such as an OR rule, AND rule, ‘K out of N’ rule, or Chair-Varshney rule. Without loss of generality, we propose the utilization of the optimal Chair-Varshney rule at the FC since the SNR value of the received primary signal at the CU is available in this proposed scheme. However, there are three issues with the proposed scheme that need to be considered:

  1. 1.

    How can the scheme efficiently select the cluster header, which is the node with the best quality for sensing data, for each sensing interval without any extra overhead among nodes in the cluster?

  2. 2.

    How can the cluster header optimally make the cluster decision?

  3. 3.

    What is the method for reporting the cluster decision to the FC?

The answers to these questions are given in the following section.

4 The proposed cluster-based selective CSS scheme

4.1 Selective CSS mechanism

In this subsection, we suggest a cluster header selection based on sensing data reliability. For each sensing interval, the CU with the most reliable sensing data in a cluster is selected to be the cluster header. Obviously, the reliability of the sensing data can be evaluated by the log-likelihood ratio (LLR) of the sensing result. The LLR value of the received signal energy x E i is given by:

Λ i =log f X H 1 x E i H 1 f X H 0 x E i H 0 ,
(14)

where f X H j x E i H j ,j=0,1 is the probability density function (PDF) of x E i corresponding to each hypothesis. Since the SNRs of the received primary signals in a cluster are identical, the LLR of the i th user in the c j th cluster Λ i , c j can be considered to be derived from the same distribution f Λ c j (Λ) of LLR Λ. For each cluster, therefore, the LLR value can be normalized such that it has a zero mean as follows:

Y i , c j = Λ i , c j -E Λ c j .
(15)

It is obvious that the reliability of the sensing data will be higher if the absolute value of the normalized LLR is larger. We propose utilization of the absolute value of the normalized LLR Y i , c j as the reliability coefficient for selecting the cluster header as well as the selective cluster data.

In order to implicitly select the most reliable sensing data among CUs in a cluster without additional data collaboration, one contention time should be determined for each CU as follows:

t con =exp - κ Y i , c j ,
(16)

where κ is a predefined constant such that the contention time is sufficient. Obviously, from this equation, the node with the highest absolute value of the normalized LLR will have the smallest contention time. In contention, each CU must monitor the reporting channel and wait for a quiescent condition before considering itself as a cluster header, i.e., the node with the most reliable sensing data, when the contention time expires. The CU who wins the contention will make a local cluster decision and report the cluster decision to the FC based on its own sensing data as follows:

Y c j H 1 : D c j = 1 H 0 : D c j = - 1 τ c j ,
(17)

where Y c j is equal to the normalized LLR with highest absolute value and τ c j is the cluster threshold. Next, we consider the problem of choosing the optimal cluster threshold.

4.2 Cluster threshold determination

In order to make a controllable cluster decision that follows a certain criterion such as the Neyman-Pearson criterion or minimum error probability criterion, one factor to consider is the probability density function of the cluster’s selective sensing data which is utilized to make the cluster decision. In this subsection, we will formulate this requirement.

First, from (15), the normalized LLR distributions of a CU in the c j th cluster are given by:

F Y c j y = F Λ c j y + E Λ c j f Y c j y = f Λ c j y + E Λ c j ,
(18)

where f Λ c j Λ and f Λ c j Λ are the cumulative distribution function (CDF) and PDF of the LLR of the received primary signal power at the CUs in the c j th cluster, respectively. These LLR distributions are given by:

F Λ c j Λ = π 0 F Λ c j H 0 Λ H 0 + π 1 F Λ c j H 1 Λ H 1 f Λ c j Λ = π 0 f Λ c j H 0 Λ H 0 + π 1 f Λ c j H 1 Λ H 1
(19)

where the conditional PDF’s of the LLR under H0 and H1 are determined in [12] as follows:

If Λ- b a

f Λ H 0 Λ = b e - a + - N / 2 2 N 2 4 πN a + f Λ H 1 Λ = b e - a + - N 2 γ + 1 / 2 2 N 2 γ + 1 2 4 πN 2 γ + 1 a +
(20)

and

F Λ H 0 Λ = 1 - 1 2 erfc a + - N / 2 2 N F Λ H 1 Λ = 1 - 1 2 erfc a + - N 2 γ + 1 / 2 2 N 2 γ + 1
(21)

where a = [N2/4 + N log(2γ + 1)/γ](2γ + 1) and b = 2N(2γ + 1)/γ.

Otherwise, f Λ H 0 Λ =0, F Λ H 1 Λ =0, f Λ H 0 Λ =0, and f Λ H 1 Λ =0.

Since the SNRs of the received primary signal at the CUs in a cluster are identical and the selective data for a cluster is the highest absolute value of the normalized LLR, the distribution of the selective cluster data will be equal to the distribution of the n0th absolute order sample, i.e., the sample with the highest absolute value, where n0 is the number of CUs in the cluster. In addition, the PDF of the n0th absolute order sample is given by:

f Y n 0 , c j y = n 0 F Y c j y - F Y c j - y n 0 - 1 f Y c j y .
(22)

The derivation of (22) can be found in the Appendix. Similarly, the conditional PDF of the n0th absolute order sample under the H j hypothesis, j = 0,1, f Y n 0 , c j H j y can be achieved.

For a specific value of threshold τ c j , the probability of false alarm and the probability of detection of the c j th cluster are, respectively, given by:

P F c j = τ c j + f Y n 0 , c j H 0 y dy
(23)

and

P D c j = τ c j + f Y n 0 , c j H 1 y dy.
(24)

Since the probabilities of false alarm and the probability of detection of the c j th cluster in (23) and (24) mainly depend on the received primary signal SNR and the number of nodes in the cluster, the cluster threshold can be determined off-line in the initial phase of the cluster establishment based on the Neyman-Pearson criterion or the minimum error probability criterion. For the Neyman-Pearson criterion, the probability of false alarm P F c j is predefined. Also, the cluster threshold τ c j is computed based on (23). In this paper, we utilize the minimum error probability criterion to numerically determine the optimal cluster threshold through the following equation:

τ c j opt = arg τ c j min π 0 P F c j + π 1 1 - P D c j .
(25)

4.3 Parallel report mechanism

For implementing the proposed selective mechanism in a cluster, all CUs in a cluster have to monitor the control channel to determine the cluster header during the contention time. One question raised here is how to arrange the contention time for multiple clusters in the network. Generally, there are two common solutions for this problem. The first approach is to assume that the contention times of the clusters are carried out sequentially over time. This method requires a strict synchronization among CUs in the network and a long contention time to minimize the collision in contention due to differences in transmission range. Obviously, this method can cause a long reporting time with a high rate of contention collision. The second approach is to assume that the contention times of different clusters are conducted in parallel with different subcontrol channels. Since each cluster only reports a 1-bit hard decision to the FC, the subcontrol channel can be reduced to a pair of frequencies corresponding to two possible values of a cluster decision. This means that a node in a certain cluster only monitors two predetermined frequencies during the contention time, and the node who wins the contention will transmit only one predefined frequency to the FC according to its cluster decision. Normally, a control channel bandwidth is sufficient for allocating a reasonable number of frequency pairs to clusters. For example, it is acceptable to divide 50 pairs of frequencies for 50 clusters in a 200-kHz control channel. Figure 3 shows an example of a sensing frame structure for the proposed parallel report mechanism compared with the conventional fixed allocation direct reporting method.

Figure 3
figure 3

Sensing frame structure.

In this method, the problems of strict synchronization and contention collision, which can occur with the previous method, are completely resolved. Indeed, with this parallel contention and reporting mechanism, the synchronization among CUs can be looser since there is only one contention time that is identical to the reporting time. No collision between two cluster reports will occur since these cluster decisions are transmitted at different frequencies. Even in the case that two CUs in a cluster have the same value of the most reliable sensing data, a collision still will not occur since the two nodes will transmit the same frequency, and at the receiver side, two transmitted frequencies can be considered as two versions of a multipath signal. The remainder problem with this parallel reporting method is that the FC needs to be equipped with parallel communication devices such as an FFT block, which is usually used in an OFDM receiver, or a filter bank block to detect multiple reporting frequencies. However, this requirement is not a big issue.

5 Simulation results

The simulation of the proposed cluster-based selective CSS scheme is conducted under the following assumptions:

  • The LU signal is a DTV signal as in [18].

  • The bandwidth of the PU signal is 6 MHz, and the AWGN channel is considered.

  • The local sensing time is 50 μ s.

  • The probability of the presence and absence of PU signal is 0.5 for both.

  • The network has n0 nodes and can be divided into n c clusters. Each cluster includes n0 nodes.

First, we evaluate the sensing performance of the selective method in the cluster with three different received primary signal SNRs of -14, -12, and -10 dB when the number of nodes in the cluster changes from 1 to 100. As shown in Figure 4, the probability of error will decrease along with the increase in the number of nodes in the cluster. However, the decreasing rate of probability of error is low when the number of nodes in the cluster is large, specially when n0>10. Therefore, the selective method only provides high sensing efficiency when the number of nodes is in the range of 20.

Figure 4
figure 4

Probability of sensing error in a cluster decision of the proposed selective method. This is for different numbers of nodes in the cluster when the received primary signal SNR is -14, -12, and -10 dB.

Second, we assume that the network includes five clusters with different SNR values corresponding to -20, -18, -16, -14, and -12 dB. The error probabilities of the global CV rule-based conventional direct reporting scheme, the cluster and global CV rule-based conventional cluster reporting scheme, and the proposed CSS scheme are then observed according to different values of cluster size. As illustrated in Figure 5, the error probabilities of all CSS schemes decrease along with the increase of the cluster size. The direct conventional CV rule-based CSS scheme provides the best sensing performance. The proposed CSS scheme outperforms the cluster and global CV rule-based conventional cluster CSS scheme when the cluster size is small, i.e., n0 < 8. When the cluster size is large, i.e., n0 > 8, the sensing error probability of the proposed method is slightly higher than that of the conventional cluster scheme, which utilizes a CV rule at both cluster headers and FC. However, it is noteworthy that the cost of this better performance with the conventional cluster and direct schemes compared with the proposed scheme are the extremely large amount of overhead, energy consumption, and reporting time for collecting all decisions from all nodes in the network.

Figure 5
figure 5

Probability of sensing error of the proposed and conventional CSS schemes. Probability of sensing error of the direct conventional CV rule-based scheme, the cluster and global CV rule-based conventional cluster reporting scheme, and the proposed CSS schemes for different cluster sizes when the network includes five clusters with a SNR value corresponding to -20, -18, -16, -14, and -12 dB, respectively.

Finally, to clarify the energy efficiency and collection time savings, we first assume that E0 = k E1 where E0 is the energy for transmitting the report from a cluster header to the FC and E1 is the energy for transmitting the report from a local node to the cluster header. Similarly, we assume that T p  = l T r where T p is the parallel reporting time slot, and T r is the fixed allocation reporting time slot (see Figure 3). We also assume that each cluster utilizes a separate reporting channel to transmit the sensing result from local nodes to the cluster header in the case of a conventional cluster-based CSS scheme, and the guard interval between time slots is ignored. As a result, the reporting energy consumption and the total reporting time of the direct reporting (DIR), the conventional cluster (CON), and the proposed (PROP) CSS schemes can be calculated by:

E DIR = N 0 E 0 = n c n 0 E 0 E CON = n c E 0 + n 0 - 1 n c E 1 E PROP = n c E 0
(26)

and

T DIR = N 0 T r = n c n 0 T r T CON = n c T r + n 0 T r = n c + n 0 T r T PROP = T p = l T r ,
(27)

respectively. The energy consumption efficiency (EE) and the reporting time-saving efficiency (TE) of the conventional cluster and the proposed CSS schemes compared with the direct CSS scheme can be easily obtained by EE  = 1 - E/EDIR and TE  = 1 - T/TDIR, respectively, where the asterisk (*) can be replaced by CON or PROP.

When the number of cluster is constant at n c  = 5 as assumed in Figure 5, we can obtain the energy consumption efficiency and the reporting time saving as shown in Figure 6. Obviously, both cluster schemes enable an increase in the energy efficiency and time savings along with the increase in cluster size. As illustrated in Equation 26 and in the simulation result of Figure 6, it can be concluded that the energy efficiency of the proposed scheme is the upper bound of the conventional cluster scheme for all cases of k. Therefore, the proposed scheme achieves the highest energy efficiency among cluster schemes.

Figure 6
figure 6

Energy consumption efficiency of the proposed and conventional cluster-based CSS schemes. This efficiency is compared with the conventional direct reporting-based CSS scheme for different cluster sizes and different values of k when the network includes n c  = 5 clusters.

Similarly, from Figure 7, we can see that the proposed CSS scheme provides higher reporting time savings than the conventional cluster scheme. In fact, for an acceptable value of l, i.e., l = 4, the proposed scheme produces time savings greater than 80% compared with the conventional direct reporting CSS scheme, while the conventional cluster scheme only remains at 75% of the highest saving percentage.

Figure 7
figure 7

Reporting time saving of the proposed and the conventional cluster-based CSS schemes. This time saving is compared with the conventional direct reporting-based CSS scheme for different cluster sizes and different value of l when the network includes n c  = 5 clusters.

6 Conclusions

In this paper, we have proposed a cluster-based CSS scheme which includes the selective method in the cluster and the optimal fusion rule in the FC. The proposed selective combination method can dramatically reduce the reporting time and energy consumption while achieving a certain high level of sensing performance especially when it is combined with the proposed frequency division-based parallel reporting mechanism.

Appendix

Derivation of Equation 22

Let Y denote a continuous random variable with PDF f Y (y) and CDF f Y (y) and let ( Y 1 , Y 2 ,, Y n 0 ) be a random sample of size n0 drawn from Y. The corresponding ordered sample derived from the parent Y is ( Y ( 1 ) , Y ( 2 ) ,, Y ( n 0 ) ), which is arranged in increasing order of absolute value such that Y 1 < Y 2 << Y k << Y n 0 . In order to determine the PDF of Y(k), we define the event Dk,y = {yY(k)y + Δ y} = {y≤±|Y(k)|≤y + Δ y}. Thus, the probability of event Dk,y can be calculated by

Pr D k , y C k 1 - F Y y + Δy + F - y - Δy n 0 - k × f Y y Δy F Y y - F Y - y k - 1
(28)

if y ≥ 0 or

Pr D k , y C k 1 - F Y - y + F Y y n 0 - k f Y y Δy × F Y - y - Δy - F Y y + Δy k - 1
(29)

if y < 0, where C k = n 0 k - 1 1 n 0 - k .Consequently, the PDF of Y(k) is calculated using

f Y k y = lim Δy 0 Pr D k , y Δy = C k 1 - F Y y - F Y - y n 0 - k × F Y y - F Y - y k - 1 f Y y
(30)

By replacing Y with Y c j and substituting k = n0 into (30), Equation 22 can be obtained.