1 Introduction

In recent years, cognitive radio has attracted intensive research because of pressing demand of efficient frequency spectrum usage. The basic idea is to allow secondary users to use the spectrum when primary users are not transmitting. A key problem in cognitive radio is that the secondary users need to quit the frequency band as quickly as possible if the corresponding primary radio emerges. Essentially, spectrum sensing is used to detect the change of the spectrum activities. For spectrum detection algorithm, there are two typical frameworks, namely block detection and quickest detection. The block detection is based on blocks of observations, and the decision method of the block detection is made at the end of each block. Quickest detection is a sequential change-point detection approach. Meanwhile, there are two major drawbacks of the traditional block detection. One is that the decision is unreliable for a small block size and considerable detection delay occurs for a large block size. The other is that the observations both before and after a change may appear in the same block, thus the decision may be invalid (the block detection is a binary hypothesis test that the primary user exists or not). Some classical detection features, such as energy detection, matched filtering detection and cyclic-feature detection, whose frameworks are block-based, are used to obtain the minimal miss probability subject to given false alarm constraints. Compared with them, quickest detection is used to obtain the minimal detection delay subject to given false alarm constraints, which is more efficient for the dynamic spectrum access.

Quickest detection performs a statistical test to detect the change of distribution in observations in order to attain an agile and robust spectrum sensing [1]. If a primary user stops transmission, then a secondary user should detect this event quickly, in order to be able to start its own transmission. A small detection delay will allow secondary users to take more transmission opportunities. On the other hand, if the primary user starts transmission, the secondary user should detect this event as quickly as possible, in order to vacate the band for the primary user. A small detection delay will allow the design of a spectrum reuse scheme that has minimal impact on the licensed users. Of course, the desire to reduce the detection delay should be balanced with a certain false alarm constraint. Our goal is to establish a statistical framework to analyze detection delay, subject to certain false alarm constraints, and more importantly to design a scheme that can minimize the delay.

The traditional methods for the quickest detection are the well-known cumulative sum (CUSUM) detection and generalized likelihood ratio (GLR) detection. CUSUM algorithm, simple and fast, is sensitive to uncertainty in parameters, as discussed in several papers, e.g. [2]. The GLR algorithm is used for the primary user with unknown parameters, and can get an estimate of the full samples. However, it is unable to create iteration, resulting in long time running and less application in practice. Therefore we adopt the short detection time of CUSUM algorithm and unknown parameters estimation of GLR algorithm to propose a new method, applied to the quickest detection for unknown primary users. Later theoretical analysis and numerical simulations are performed. Besides, we set foot in the wideband spectrum sensing. The traditional quickest detection is performed well in the time domain for the single-channel. In this paper, we have introduced the quickest detection to the multi-channel. However, in the multi-channel, if only perform the quickest detection in the time domain, we cannot distinguish different frequency. Fortunately, Short-time Fourier transform (STFT), Gabor transform, or Wigner-Ville transform is utilized to distinguish different channels and transform the time- detection to the frequency-detection. The method realizes the multi-channel parallel detection. Compared to the single-channel serial spectrum sensing, the spectrum efficiency of multi-channel parallel sensing could be improved. Then we perform comparisons between STFT and the improved methods.

The rest of the paper is organized as follows. In Section II, we introduce the model. In Section III and IV, two improved quickest spectrum sensing approaches are respectively described in details and related mathematic formulas are derived. In Section V, includes extensive simulation results and analyses. In Section VI, we offer some conclusions.

2 System Model

The quickest detection is a dynamic process, namely allowing the secondary user access the band when the primary users disappear. In this process, we can use modeling methods to express clearly, (1) and (2) describes two modeling methods of quickest detection. s(i) is the primary signal and u(i) is additive white Gaussian noise(AWGN) with mean zero and variance σ 2. s(i) is also considered to be white and Gaussian with mean zero and variance P, which is a reasonable assumption [3]. When the primary user appears at an unknown time τ, y(i) becomes:

$$y(i) = \left\{ {\begin{array}{ll} u( i ) & i = 1,2, \ldots ,\tau - 1 \\ s ( i ) + u( i) & i = \tau ,\tau + 1, \ldots \end{array} } \right.$$
(1)

where \(s\left( i \right) \sim N\left( {0,P} \right)\), \(u(i) \sim N(0,\sigma^{2} )\), before τ, the samples obey the distribution of P 0 (namely mean zero and variance σ 2), while after that, the samples obey the distribution of P 1 (namely mean zero and variance P + σ 2) y(i) also becomes:

$$y\left( i \right) = \left\{ {\begin{array}{*{20}l} {u\left( i \right)} & {i = 1,2, \ldots ,\tau - 1} \\ {A\sin \left( {2\pi if_{c} t_{s} + \theta } \right) + u\left( i \right)} & {i = \tau ,\tau + 1, \ldots } \\ \end{array} } \right.$$
(2)

where A is the amplitude of the primary radio signal, which is confined within an interval [A maxA min] (whenA < A min, the impact of the secondary radio signal on the primary radio is negligible), f c is the carrier frequency, \(f_{\text{s}}\) is the sampling frequency, f s  = 1/t s , θ is the phase of the primary user’s signal. It can be modeled as a quickest detection problem for the sequence of distribution change. In the detection, we accumulate the processed sample points and judge each sample point. And once we determine the primary user signal “appear” or “exit” band, the secondary user immediately responses to current band occupancy.

Assume that t is the detection time when the primary user appears. Following [4], we use two performance evaluation indexes, i.e. the worst detection delay \(\overline{{T_{1} }}\) and the mean time to false alarms \(\overline{{T_{0} }}\):

$$\overline{T}_{1} { = }\mathop {\sup }\limits_{\tau \ge 1} ess\sup E_{{p_{1} }} \{ T = t - \tau \left| {t \ge \tau } \right.,y_{1}^{\tau } \}$$
(3)
$$\overline{T}_{0} = E_{{p_{0} }} \{ t\}$$
(4)

where \(T = t{ - }\tau\) is defined as the detection delay. When t ≥ τ, y τ1 is the observation sequence before the primary user appears, and \(E_{{p_{1} }}\) denotes the conditional expectation under the assumption that the primary user appears at τ. Otherwise, {t < τ} is a false alarm event. The quickest detection problem for cognitive radio is formulated as to obtain the minimum worst detection delay subjected to given false alarm constrains. Obviously, a small \(\overline{T}_{1}\) and a large \(\overline{T}_{0}\) are required. In the traditional CUSUM algorithm, \(\overline{{T_{1} }}\) and \(\overline{{T_{0} }}\) can be approximated as:

$$\overline{T}_{1} = \frac{\lambda }{{E_{{p_{1} }} \left[ {l_{i}^{P} } \right]}}$$
(5)
$$\overline{T}_{0} = e^{\lambda }$$
(6)

where λ is a detection threshold.\(E_{{p_{1} }} \left[ {l_{i}^{P} } \right] = D\left( {p_{1} \left\| {p_{0} } \right.} \right)\), where \(D\left( { \cdot || \cdot } \right)\) ψ denotes Kullback–Leibler divergence.

The single-channel system model can be extended to wideband signal model [5, 6]. The received signal in the time domain can be represented as:

$$y\left( i \right) = \left\{ {\begin{array}{*{20}l} {u\left( i \right)} & {i = 1,2, \ldots ,\tau - 1} \\ {s_{k} \left( i \right) + u\left( i \right)} & {i = \tau ,\tau + 1, \ldots } \\ \end{array} } \right.$$
(7)

where k shows the different primary users, and i is the time of different primary users occur, which can be different from each other. It is difficult to detect the change-point of different primary users according to the observation sequence in time domain. Therefore, we come up a method that the cognitive user differentiates the different primary users in frequency domain and obtain the detection time in time domain.

3 The Improved Quickest Spectrum Sensing Approach with Unknown Parameters in a Single Channel

3.1 Algorithm

The CUSUM algorithm is based on the perfect knowledge of the distributions [3]. However, in many situations, the distribution parameters of primary users are usually unknown (e.g. the amplitude and phase) due to channel fading and noise interference. GLR algorithm, proposed by Lorden [7], implements the maximum likelihood ratio estimate. Unfortunately, there is no recursive expression for the GLR computation. So all observations need to be stored and the metrics need to be recomputed in all time slots. This makes the GLR algorithm infeasible for practical system implementation. Therefore we consider combining CUSUM algorithm and the GLR algorithm to improve the traditional quickest detection algorithm. In the improved method, GLR algorithm is applied to narrow the range of parameters. Because of the influence of channel fading and other unknown factors, it is difficult to clear the variance of primary user signal, but we can know P ∊ [P min P max ]. We carry on two times of parallel CUSUM detection of P max and P min in threshold λ 1 and λ 2 respectively to obtain the detection time. Every CUSUM detection method is described as below:Firstly, to obtain the logarithmic likelihood ratio:

$$l_{k} \left( t \right) = \log \left( {\frac{{p_{1k} (y_{t} )}}{{p_{0k} (y_{t} )}}} \right)$$
(8)

where, k is the detection times. Then, to obtain the cumulative sum:

$$s_{k} \left( t \right) = \hbox{max} \left( {s_{k} \left( {t - 1} \right) + l_{k} \left( t \right),0} \right)$$
(9)

the detection time is

$$t_{k} = \inf \left( {t\left| {s_{k} \left( t \right) \ge \lambda } \right.} \right)$$
(10)

the final detection time is

$$T = \inf \left( {t_{k} ,k = 1,2 \ldots ,n} \right)$$
(11)

Then with P max and P min, we can get two times of detection time t 1 and t 2. From [8], we could basically think primary user signals appear in between [t 1t 2], by using the GLR parameter estimation in between [t 1t 2]:

$$P^{ * } = \left\{ {\begin{array}{*{20}l} {P_{{\max }} ,} & {t - \nu \le \frac{{2\hat{y}}}{{P_{{\max }} + \sigma ^{2} }}} \\ {\frac{{2\hat{y}}}{{t - \nu }} - \sigma ^{2} ,} & {\frac{{2\hat{y}}}{{P_{{\max }} + \sigma ^{2} }} \le (t - \nu ) \le {\kern 1pt} {\kern 1pt} \frac{{2\hat{y}}}{{P_{{\min }} + \sigma ^{2} }}} \\ {P_{{\min }} ,} & {(t - \nu ) \ge \frac{{2\hat{y}}}{{P_{{\min }} + \sigma ^{2} }}} \\ \end{array} } \right.$$
(12)

where \(\hat{y} = \sum\limits_{i = v + 1}^{t} {y_{i}^{2} }\), P * is the power that maximizes \(f(P) = \frac{P}{{2(P + \delta^{2} )\delta^{2} }}\widehat{y} + (t - v)\frac{1}{2}\ln (\frac{{\sigma^{2} }}{{P + \sigma^{2} }})\) over the region P min ≤ P ≤ P max. Finally, get the final detection t in the threshold λ 3 with the estimated \(P^{ *}\) through the CUSUM detection.

3.2 Asymptotic Performance Analyses

We will analyze the asymptotic performance of \(\overline{{T_{1} }}\) and \(\overline{{T_{0} }}\) as expressed in (3) and (4), separately. For simplicity, we also assume \(k = \, 3\), namely the detection procedure contains three stages: initial detection, parameter refinement and final detection.

First, we need to study the impact of the mismatched parameter P on the CUSUM detection. Suppose that an imperfect \(\widehat{P}\) is used as the true value in the CUSUM detection. It is easy to verify that, by applying the Brownian motion approximation, \(\overline{T}_{1}\) is approximated by

$$\overline{T}_{1} \approx \frac{\lambda }{{E_{{p_{1} }} \left[ {l_{i}^{{\widehat{P}}} } \right]}}$$
(13)

where

$$\begin{aligned} E_{{p_{1} }} \left[ {l_{i}^{{\widehat{P}}} } \right] & = E_{{p_{1} }} \left[ {\log \left( {\frac{{p_{1}^{{\widehat{P}}} \left( {y_{i} } \right)}}{{p_{0} \left( {y_{i} } \right)}}} \right)} \right] \\ & = E_{{p_{1} }} \left[ {\log \left( {\frac{{p_{1}^{P} \left( {y_{i} } \right)}}{{p_{0} \left( {y_{i} } \right)}}} \right)} \right] - E_{{p_{1} }} \left[ {\log \left( {\frac{{p_{1}^{P} \left( {y_{i} } \right)}}{{p_{1}^{{\widehat{P}}} \left( {y_{i} } \right)}}} \right)} \right] \\ & = D\left( {p_{1}^{P} \left\| {p_{0} } \right.} \right) - D\left( {p_{1}^{P} \left\| {p_{1}^{{\widehat{P}}} } \right.} \right) \\ \end{aligned}$$
(14)

By applying the Information Inequality, which states

$$D\left( {p_{1}^{P} \left\| {p_{1}^{{\widehat{P}}} } \right.} \right) \ge 0$$
(15)

We have

$$E_{{p_{1} }} \left[ {l_{i}^{{\widehat{P}}} } \right] \le E_{{p_{1} }} \left[ {l_{i}^{P} } \right]$$
(16)

From [8] we know, under the not perfect parameter \(\hat{P}\), the first detection time is:

$$i = \arg \min\nolimits_{j} \left( {D\left( {p_{1}^{P} \left\| {p_{1}^{{p_{j} }} } \right.} \right)} \right)$$
(17)

So we can be defined the parameter in a test as:

$$P_{i}^{ * } = \arg \min\nolimits_{{\mathop P\limits^{ \wedge } \in {\rm P}_{i} }} \left( {D\left( {p_{1}^{P} \left\| {p_{1}^{{\mathop P\limits^{ \wedge } }} } \right.} \right)} \right)$$
(18)

where P * i is the collection of the estimate parameter of the primary user in the two parallel CUSUM detection of the i time. From [8], we can design

$$I = D\left( {p_{1}^{P} \left\| {p_{0} } \right.} \right)$$
(19)

Therefore, for the detection of the above process, the average run length of the worst detection delay can be expressed as:

$$\overline{T}_{1} \approx \frac{{\lambda_{1} }}{{I - D\left( {p_{1}^{P} \left\| {p_{1}^{{P_{1}^{ * } }} } \right.} \right)}} + \frac{{\lambda_{2} }}{{I - D\left( {p_{1}^{P} \left\| {p_{1}^{{P_{1}^{ * } }} } \right.} \right)}} + \frac{{\lambda_{3} }}{{I - D\left( {p_{1}^{P} \left\| {p_{1}^{{P_{2}^{ * } }} } \right.} \right)}}$$
(20)

Due to the maximum likelihood ratio criterion, when \(\lambda_{ 1}\) and λ 2 is large,\(D(p_{1}^{p} \parallel p_{1}^{{p_{2}^{*} }} )\) convergence to 0. Besides, as the order number of \(\lambda_{ 3}\) is higher than \(\lambda_{ 1}\) and \(\lambda_{ 2}\), it can be approximated as

$$\overline{T}_{1} \approx \frac{{\lambda_{3} }}{I} = \frac{{\lambda_{3} }}{{D\left( {p_{1}^{P} \left\| {p_{0} } \right.} \right)}}$$
(21)

Through the above analysis we can know, when \(\lambda_{ 1} ,\lambda_{ 2}\) and \(\lambda_{ 3}\) meet the preset conditions, for the same threshold, the average run length of worst detection delay of the improved method is approximate to the CUSUM algorithm. The basis of the average run length of false alarm intervals is false alarm events. In the improved method, the main factors affecting the false alarm is the valuation of P. Similar to the derive of \(\overline{{T_{1} }}\), \(\overline{{T_{0} }}\) mainly depends on the stage detection of the \(\lambda_{ 3}\). Therefore, according to the GLR algorithm, we can conclude the \(\overline{{T_{0} }}\) in the improved method is

$$\bar{T}_{0} \ge {1 \mathord{\left/ {\vphantom {1 a}} \right. \kern-0pt} a}$$
(22)

where a is design parameter. Threshold λ is

$$\lambda = - \ln \left\{ {{a \mathord{\left/ {\vphantom {a b}} \right. \kern-0pt} b}} \right\}$$
(23)
$${\text{where}}\quad b = 3\ln \left\{ {a^{ - 1} (1 + \frac{1}{{D(p_{1,P\hbox{min} } ||p_{0} )}})^{2} } \right\}.$$

4 Two Novel Quickest Spectrum Sensing Approaches for Wideband Cognitive Radio Respectively with Gabor and Wigner-Ville Transform

4.1 Algorithm

Gabor transform is the optimal windowed Fourier transform. Its significance lies in that the Gabor transform brings time–frequency analysis in real sense. The Gabor transformation can achieve frequency localization, which can not only provide all the information on the whole but also provide more vary information at any local time. In short, it can provide the localization information in time and frequency at the same time Gabor transform is described as below:

$$G_{f} (a,b,\omega ) = \int_{ - \infty }^{\infty } {f(t)g_{a}^{ * } } (t - b)e^{ - i\omega t} dt$$
(24)

where f(t) is the signal, and g a (t − b)is a window function of time localization. Among them, the parameters b is used for parallel moving window, in order to cover the whole time domain. If we integral the parameters b, we can conclude:

$$\int_{ - \infty }^{\infty } {G_{f} (a,b,\omega )db = \hat{f}(\omega ),\omega \in R}$$
(25)

where \(\hat{f}(\omega )\) is the Fourier transform of f(t). Namely, when the moving distance of the window from minimum to maximum, Gobar becomes the Fourier trans-form. Through theoretical derivation, it can be drawn that the product of the window width and height is a fixed value under the Gauss window. As it is described below:

$$\left[ {\left. {\begin{array}{*{20}c} {b - \sqrt a ,} & {b + \sqrt a } \\ \end{array} } \right]} \right. \times \left[ {\left. {\begin{array}{*{20}c} {\omega - \frac{1}{a\sqrt a },} & {\omega - \frac{1}{a\sqrt a }} \\ \end{array} } \right]} \right. = 2$$
(26)

Rectangular time–frequency window: the width is \(2\sqrt a\), the height is \({1 \mathord{\left/ {\vphantom {1 {\sqrt a }}} \right. \kern-0pt} {\sqrt a }}\).

Wigner-Ville transform is described as below [9]:

$$W_{x} (t,f) = \int\limits_{ - \infty }^{ + \infty } {x(t + \frac{\tau }{2})x(t - \frac{\tau }{2})e^{ - j2\pi f\tau } d\tau }$$
(27)

where x(t) is the signal, but in practical numerical implementation, we must have the Wigner-Ville dispersion [10]. The discrete expression defined for Wigner-Ville is

$$W_{x} (n,k) = 2\sum\limits_{m = - (N - 1)/2}^{(N - 1)/2} {x(n + m)x^{*} (n - m)} e^{ - j4\pi mk/N}$$
(28)

where \(f = \frac{{f_{s} k}}{2N}\), \(t = \frac{n}{{f_{s} }}\),\(\tau = \frac{m}{{f_{s} }}\). Gabor transform and Wigner-Ville transform are both same as STFT, which are all performed Fourier transform after some mathematical processing. So we can carry on the same modeling and CUSUM detection process for the processed data. With the translation, the one-dimensional time domain signal y(i) will become a two-dimensional matrix Y m,n , in which m is the time sampling-point and n is the frequency sampling-point. But the complex Y m,n not as a sample sequence of spectrum sensing, this is because of the fast detection algorithm of statistics need to be compared with each other, and compare with the preset threshold, so the need for Y m,n to do some conversion. A transformational observation sequence is applied for quickest detection [11, 12]:

$$Y_{m,n} = \text{Re} [Y_{m,n} ] + Im[Y_{m,n} ]$$
(29)

For the sampling-point Y m,n , the log-likelihood ratio at ∀m and ∀n can be defined as:

$$l_{m,n} = \log \left( {\frac{{f_{1n} (\overline{X}_{m,n} )}}{{f_{0n} (\overline{X}_{m,n} )}}} \right)\quad m = 1,2,3 \ldots ,\quad {\kern 1pt} n = 1,2,3 \ldots ,$$
(30)

where f 1n and f 0n represent the probability density of the sampling-value in the environment of H 1 and H 0 after windowing, respectively. then, to obtain the cumulative sum

$$s_{m,n} = \hbox{max} (s_{m - 1,n} + l_{m,n} ,0)$$
(31)

the detection time is

$$t_{n} = \hbox{min} \{ m\left| {s_{m,n} } \right. \ge \gamma \} .$$
(32)

4.2 Asymptotic Performance Analyses

The random variables \(y_{i} \sim N(\mu_{i} ,\sigma_{i}^{2} )\), i = 1, 2, …, n and they are independent of each other. The linear combination of them is: C 1 x 1 + C 2 x 2 + ··· + C n x n (C 1C 2, …C n are constants not all for 0). Still obey the Gauss distribution, namely:

$$C_{1} x_{1} + C_{2} x_{2} + \cdots + C_{n} x_{n} \sim N\left( {\sum\limits_{i = 1}^{n} {C_{i} \mu_{i} } ,\sum\limits_{i = 1}^{n} {C_{i} \sigma_{i}^{2} } } \right)$$
(33)

Definition: a finite sequence set to a length of, is defined as the point of discrete Fourier transform

$$Y(k) = DFT[y\left( n \right)] = \sum\limits_{i = 0}^{N - 1} {y\left( n \right)e^{{ - j\frac{2\pi }{N}kn}} } ,\quad k = 0,1, \ldots ,N - 1$$
(34)

where N > M.

This method uses the Gauss window segmentation signal in time domain, the window length is \(M = 2\sqrt a\), and the length of FFT is alsoM. Based on the above discussion, we can make the following inference. Before Gabor transform, the distribution of primary user signal with length N [13]:

$$s_{k} (i) \sim N(0,P_{k} ),\quad i = 1,2,3, \ldots N$$
(35)

where P k is the variance of primary user signal, k is the number of the primary users. The carrier frequency of each primary user signal is f c,k  = (f s /N) * l and l is random integer. We can think that after the primary user signals are sampled, the side lobes are infinitesimal, i.e. only at a certain frequency, the amplitude is not zero, and the influence between the primary users is infinitely small. So after Gabor transform, if only noise exist Y m,n is described as below:

$$Y_{m,n} \sim N(0,M\sigma^{2} ),{\kern 1pt} \quad m = 1,2,3, \ldots \frac{N}{M},\quad n = 1,2,3, \ldots ,M$$
(36)

if the primary user is in the present frequency band, Y m,n is:

$$Y_{m,n} \sim N(0,MP_{k} + M\sigma^{2} ),\quad m = 1,2,3, \ldots \frac{N}{M},\quad n = 1,2,3, \ldots ,M$$
(37)

According to [11], if only noise exits,

$$\text{Re} \left[ {Y_{m,n} } \right] \sim N\left( {0,\frac{{M\sigma^{2} }}{2}} \right),\quad \text{Im} \left[ {Y_{m,n} } \right] \sim N\left( {0,\frac{{M\sigma^{2} }}{2}} \right)$$
(38)

If the primary user and noise exit at the same time,

$$\text{Re} \left[ {Y_{m,n} } \right] \sim N\left( {0,\frac{{M\sigma^{2} }}{2} + \frac{1}{2}MP_{k} } \right),\quad \text{Im} \left[ {Y_{m,n} } \right] \sim N\left( {0,\frac{{M\sigma^{2} }}{2} + \frac{1}{2}MP_{k} } \right)$$
(39)

According to (38) and (39), we can get the final distribution of Y m,n

$$H_{0} :\overline{Y}_{m,n} \sim{\rm N}(0,M\sigma^{2} )$$
(40)
$$H_{1} :\overline{Y}_{m,n} \sim{\rm N}(0,MP_{k} + M\sigma^{2} )$$
(41)

According to [3], the logarithmic likelihood ratiol m,n is:

$$l_{m,n} = \frac{{P_{k} y_{m,n}^{2} }}{{2\left( {P_{k} + \sigma^{2} } \right)\sigma^{2} }} + \frac{1}{2}\ln \left\{ {\frac{{\sigma^{2} }}{{P_{k} + \sigma^{2} }}} \right\}$$
(42)

n, the average run length of the worst detection delay \(\overline{{T_{1} }}\) is:

$$\overline{T}_{1n} \le \frac{\gamma }{{D(f_{1n} \left\| {f_{0n} } \right.)}}$$
(43)

where \(D(f_{1n} \left\| {f_{0n} } \right.){ = }\frac{1}{2}\ln \{ \frac{{\sigma^{2} }}{{P_{k} + \sigma^{2} }}\} { + }\frac{{P_{k} }}{{2\sigma^{2} }}\). The average run length of false alarm interval \(\overline{{T_{0} }}\) is:

$$\overline{T}_{0n} \ge e^{\lambda }$$
(44)

We can get the final results on different primary user perceived performance index, and \(\overline{{T_{1} }}\) is:

$$\overline{T}_{1k} \le \frac{M\gamma }{{D(f_{1k} \left\| {f_{0} } \right.)}}$$
(45)

\(\overline{{T_{0} }}\) is:

$$\overline{T}_{0} \ge Me^{\lambda }$$
(46)

We can get the threshold λ from (45), and can get \(\overline{{T_{1} }}\) of different primary users by (46).

5 Simulations and Results

In this section, we provide some numerical results. In the simulation of method 1, we set \(\sigma^{ 2} { = 1}\) and \(P \, = \, 3\). In the simulation of method 2, we assume that \(\sigma^{ 2} { = 1}\) and there are five primary users which are BPSK modulated signals with the bandwidth of 6 MHz. All the primary users have the same P for simplicity. Different primary users start transmission at the 101th point, 201th point, 301th point, 401th point, and 501th point, respectively. We set the window length is 20 sampling points, and the movement of the window is non-overlapping. In generating the simulations, the thresholds are set using the bounds for \(\overline{{T_{0} }}\) derived in the paper.

Firstly we show some results about method 1. Figure 1 shows a typical realization of the CUSUM and improved method statistic, when the primary user starts transmission at τ = 100. In the comparison, black is the new method, and blue is the CUSUM. From the Fig. 1, we can see the cumulative sum increases rapidly of the improved method after t = 100, thus a large difference between before the primary users appearing and after. It is very easy to detect the change of the spectrum activities with an appropriate threshold.

Fig. 1
figure 1

A typical realization of the CUSUM and improved method statistic, when τ = 100

Figures 2 and 3 show the distributions of the detection delay in the improved method and CUSUM detection. We can see that detection delay is concentrate from 0 to 12. When the detection delay is higher than 13, the percent is very small in the both CUSUM and the improved method. Though we do not know the channel parameters, the new improved method still achieves the same good performance with the CUSUM detection.

Fig. 2
figure 2

Distribution of the detection delay in the CUSUM detection

Fig. 3
figure 3

Distribution of the detection delay in the new method detection

Figure 4 shows the comparison between method 1 and CUSUM detection subjected to the same \(\overline{{T_{0} }}\) when \(P \, = \, 3\). On the one hand, the growth rate of the detection time and the detection delay in CUSUM detection is faster than method 1. On the other hand, the high detection delay in the new method is substantially lower than CUSUM detection.

Fig. 4
figure 4

Comparison of the detection delay distribution in the new method and CUSUM detection

Then we also show some results about method 2. Figure 5 shows the relationship between the time and frequency of the five primary users. From the figure, we can see the time–frequency distribution after Gobar transform. The colors represent the energy, the deeper the color, the greater the energy. And by converting the time domain into frequency domain, we can clearly distinguish five primary users of different frequencies which appear at different time respectively.

Fig. 5
figure 5

The relationship between the time and frequency of the five primary users

Figure 6 shows the distribution of the detection delay in the wideband spectrum sensing with STFT. Figure 7 shows the distribution of the detection delay in the wideband spectrum sensing with Gobar transform. Figure 8 shows the distribution of the detection delay in the wideband spectrum sensing with Wigner-Ville transform. From the simulation results, we can see that, the delay distribution of the three kinds of transformation detection is relatively concentrated, which can be applied in practical engineer easily. Besides, the probability of detection delay of more than 20 is already very small, and compared with the signal appearing time 300, the result is acceptable.

Fig. 6
figure 6

Distribution of the detection delay in the STFT

Fig. 7
figure 7

Distribution of the detection delay in the Gobar

Fig. 8
figure 8

Distribution of the detection delay in the Wigner-Ville

Figure 9 shows comparison of the detection delay distribution in the wideband spectrum sensing with Wigner-Ville, STFT and Gobar. By comparing the three kinds of transformation, we can see that the energy is relatively large and concentrated after STFT, applied to the primary user signal is close to the noise signal or even smaller than the noise signal. But STFT has a relatively large delay. Gobar and Wigner-Ville transform energy is relatively small, which can be used for the primary user signal is larger than the noise signal. On the other hand, Gobar and Wigner-Ville can easily distinguish between the various signals of different frequency, and detection delay is relatively small. Whether it is STFT, Gobar, or Wigner-Ville, detection delay is within the acceptable range.

Fig. 9
figure 9

Comparison of the detection delay distribution in the Wigner-Ville, STFT and Gobar

6 Conclusions

To solve the problem of spectrum sensing, this paper proposes two kinds of the quickest detection. One is applied to the signal band detection with unknown parameters of primary user. The other performs wideband spectrum sensing. In the detection, we have taken a sequential change detection framework for studying the agility of detection algorithms in cognitive radio. We focus on how to reduce the detection delay under the certain the false alarm interval, and carry out a number of experiments about the distribution of detection delay of various algorithms. Detection delay of the new method can be comparable to the CUSUM with unknown parameters of primary user in a single channel. In the wideband spectrum sensing, we continue to study the Gobar transform and Wigner-Ville transform, and carry on comparisons among them. Then we find their own suitable occasion, and previous greatly shortens the detection time, improve the detection efficiency compared with block detection. Wideband spectrum sensing will continue to focus on the unknown parameters of primary user, thus increasing spectrum efficiency.