Introduction

Automatic modulation classification (AMC) is the process of identifying modulation type of a detected signal without prior information. This technique has both military and civilian applications, and is currently an important research subject in the design of cognitive radios [13]. AMC is a complex task especially in a non co-operative environment as in high frequency (HF) communications, where transmission is affected by atmospheric conditions and other transmission interferences [4].

AMC methods are grouped into two categories: likelihood based (LB) and feature based (FB) methods. LB methods have two steps: calculating the likelihood function of the received signal for all candidate modulations, and then using maximum likelihood ratio test (MLRT) for decision-making. In FB methods, features are first extracted from the received signal and then applied to a classifier in order to recognize the modulation type. Most of the recent literatures use the FB methods due to their low processing complexity and high performance [5]. For more details about AMC methods with a comprehensive literature review, the reader is referred to [6].

Figure 1 shows the classification task in a smart radio. The task of the signal detection block is to identify signal transmission, while the AMC contains a feature extractor followed by a classifier. The classifier can be based on fixed threshold as in decision tree methods, or based on pattern recognition (PR) methods as in artificial neural networks (ANNs) and support vector machines (SVM) [7, 8]. Most of the features used in literature are based on wavelet [9, 10], temporal time domain (TTD) analysis [1113], and higher order cumulants (HOC) [1416]. These features are generally extracted under the assumption that the modulated signals are corrupted by additive white Gaussian noise (AWGN). Although this assumption is valid in many communication environments, recent studies show that HF noise changes between AWG and bi-kappa distributions [17, 18]. The effect of these two noise distributions has been taken into account during the design of the AMC algorithms proposed in [19]. The work shows that the change in noise model affects the classification performance, especially at low signal-to-noise ratio (SNR). Therefore, the robustness of commonly used features against variation in noise models needs to be investigated so that more reliable AMC algorithms can be designed for HF signals.

Figure 1
figure 1

AMC based receiver architecture using feature based methods.

In this paper, we first examine the effect of Gaussian and bi-kappa noise models on wavelet, HOC, and TTD features, when these features are considered for the classification of single carrier modulations commonly used in HF band: 2PSK, 4PSK, 8PSK, 16QAM, 32QAM, and 64QAM [20]. Second, we propose new features based on maximum dissimilarity measures (MDM) in constellation diagram and evaluate their robustness against the change in noise model. Note that the contribution of this article is pertaining to the features extraction stage; hence the results obtained are independent of the classifier being used. However, these results will greatly serve the classifier design stage, as this stage can be based on features that are robust with respect to noise models.

The organization of the article is as follows. 'Signal model' and 'Noise model' sections present signal and channel noise models, respectively. 'Commonly used signal features' section introduces the TTD, HOC, and wavelet based features. 'Proposed features' section presents the proposed features. 'Simulation results' section presents results showing the robustness of the different features against the variation in noise model. 'Conclusion' section presents concluding remarks.

Signal model

The general form of received signal encompassing all modulation schemes under consideration is given by [21]:

(1)

where C(t)is the complex envelope of modulated signal, n(t) is band limited noise, f c is the carrier frequency, and Re{} denotes the real part. The complex envelope is characterized by the constellation points C k , signal power E, and pulse shaping functionp(t). For N symbols with periodicity T, the general form of complex envelope can be expressed as:

(2)

For MPSK modulation, C k ∈ {e-j 2πm/M}, where m = 0, 1, ..., m-1. For MQAM modulations, C k a k + jb k , m = 0, 1, ..., (M)1/2/2, and

Noise model

Noise model assumed in most of the research related to AMC is AWGN. This research focuses on AMC in HF band, where the AWGN assumption no longer remains valid for all transmission times [17, 18]. Instead, the noise varies between AWGN and bi-kappa distributions. The bi-Kappa distribution is characterized by the following probability distribution function:

(3)

where σ and k are the shaping parameter and tuning factor, respectively. Practical values of these parameters are σ = 46, k = 1.1, and σ = 20, k = 1 [17]. Figure 2 shows the probability distribution function, p(x, k), for different values of σ and k. The figure shows that decreasing the shaping parameter produces a shaper peak and that the bi-kappa distribution approaches the AWG distribution when the tuning parameter is increased. In this work, the parameters of bi-kappa distribution are set to σ = 20, k = 1.

Figure 2
figure 2

Probability distribution function of bi-kappa noise for different values of parameters.

A more realistic noise model can be constructed by passing the bi-kappa noise through a band-limiting filter. The bandwidth of this filter is set to 8γs where γs is the symbol rate. This filter is practically used to minimize the transmission bandwidth. Figure 3 shows the constellation diagram of an intercepted 2PSK signal down converted to baseband for different SNR. This figure shows clearly the spiky nature of bi-kappa noise as compared to AWGN, especially at low SNR.

Figure 3
figure 3

Effect of HF noise models on the intercepted signals.

Commonly used signal features

This section gives the general formulas and description of commonly used signal features. Specifically, we consider the TTD, HOS, and wavelet based features.

TTD features

The variations in modulated waveforms can be described by three instantaneous values: frequency, phase, and amplitude [11, 12]. All values related to these variations are defined as the TTD features. Two features will be investigated in our study. The first feature is the standard deviation of the absolute value of the centered non-linear component of the instantaneous phase defined as

(4)

where ϕ NL is the centered non-linear component of the instantaneous phase, tth is the threshold value of the non weak signal, L is the number of samples in ϕ NL .

The second feature is the standard deviation of the absolute value of the normalized-centered instantaneous amplitude; that is,

(5)

where N s is the number of samples, a cn = a/m a -1,a is the absolute value of the analytic form of the received signal, and m a is its sample mean value.

HOC features

The HOC are used to extract hidden information from non-Gaussian signals. In presence of AWGN, all the HOC are zero for orders greater than two. This makes these features attractive to classify modulated signals corrupted by AWGN. Fourth and sixth HOC considered in this study are defined as follows [1416]:

(6)
(7)

where C21 is the average power and M pq is the joint moment. The later can be calculated for any values of p and q using the following equation:

(8)

where x* denotes complex conjugate and E{} is the expectation operation. Table 1 shows the theoretical cumulants for the considered modulation schemes.

Table 1 Theoretical values of HOC for digital modulations

Wavelet features

Wavelet transform preserves the time information while providing the frequency information of an analyzed signal. This makes it a good candidate for AMC. As shown in Figure 4, features extraction using wavelet transform passes through three steps: wavelet decomposition using Haar mother waveform, median filtering, and finally calculation of standard deviation [22]. Robustness of wavelet features against noise model has been tested at level three and four.

Figure 4
figure 4

Steps for wavelet features extraction.

Proposed features

The PSK and QAM modulations are represented by a constellation diagram in which the modulation symbols are depicted in terms of phase and amplitude variations. This diagram is extracted from the analytic form of the IF signal by multiplication with the complex conjugate of the carrier frequency. Many AMC algorithms are designed using features based on constellation diagram. These algorithms use different classification techniques that include maximum likelihood [23], genetic algorithms [24, 25], modified Chi-squared test [26], and subtractive clustering [27]. In this article, we propose a different use of constellation diagram by extracting features based on maximum dissimilarity measures (MDM), firstly to distinguish between different modulation types, such as QAM and PSK signals, and secondly to find the order of a particular modulation. MDM features depend on calculating the dissimilarity between different constellation diagrams after signals normalization. That is, features are extracted from the distance (or dissimilarity measures) between the complex envelope of the received signal and set of reference constellation points for a particular modulation scheme. These reference constellation points are defined by their amplitudes and phases [21]. MDM are computed after normalizing both the received and reference constellation points to their mean values. The dissimilarity function is defined as [28]:

(9)

where d is the Euclidian distance between the complex envelope of intercepted signal × and reference constellation points p. For feature extraction, the signal × is randomly generated at a particular SNR. A dissimilarity vector d, whose entries are the distance between a randomly generated constellation point of x and M reference constellation points p, can be obtained. The element of maximum value of vector d is averaged over several independent runs, and then selected as the desired feature. In practice, the mean and/or standard deviation of d max (x, p) will have values based on the noise level.

Table 2 shows five proposed features related to the MDM, each of which is responsible for discriminating a specific modulation type or modulation order. As shown in Table 1, the first feature d1 is used to discriminate between QAM and PSK signals, while d2 is used to discriminate between 2PSK and other PSK signals of higher orders. For further details see Table 2.

Table 2 Proposed features

Figures 5, 6, 7, 8, and 9 show relevant variations of proposed features as a function of SNR. The results are averaged over 100 independent realizations and displayed for AWGN. Clearly, these figures show that the proposed features have potential applications in AMC, as they can be used in conjunction with decision tree or machine learning techniques for signal classification.

Figure 5
figure 5

d 1 for the discrimination between MPSK and MQAM signals.

Figure 6
figure 6

d 2 for the discrimination between 2PSK and higher PSK signals.

Figure 7
figure 7

d 3 for the discrimination between 4PSK and 8PSK signals.

Figure 8
figure 8

d 4 for the discrimination between 16QAM and higher QAM signals.

Figure 9
figure 9

d 5 for the discrimination between 32QAM and 64QAM.

Effect of noise model on the proposed features will be discussed in the next section.

Simulation results

To evaluate the robustness of presented features, all the modulations schemes under test were generated in presence of band-limited AWGN and bi-kappa noise, where the bandwidth of the band-limiting filter is 8γs; this process is practically used to avoid high bandwidth transmission [13]. SNR is adjusted by multiplying the output noise by the following factor:

(10)

where E, N0, SNR are the signal power, noise power, and desired SNR, respectively. All constellation points are normalized to zero mean and unity variance. The simulation parameters are given in Table 3.

Table 3 Simulation parameters

For evaluation purposes, we measure the absolute value of the percentage deviation of each feature when noise model is changed from AWGN to bi-kappa. This percentage is evaluated using SNR ranging between 0 and 30 dB, and is defined as follows

(11)

where FAWGN and FBi-kappa are the values of feature under consideration computed in the presence of AWGN and bi-kappa noise, respectively, at a particular SNR. Figures 10, 11, 12, 13, 14, and 15 show the results, averaged over 100 independent realizations, for the following set of modulations: 2PSK, 4PSK, 8PSK, 16QAM, 32QAM, and 64QAM.

Figure 10
figure 10

Features deviations computed from 2PSK signal.

Figure 11
figure 11

Features deviations computed from 4PSK signals.

Figure 12
figure 12

Features deviations computed from 8PSK signals.

Figure 13
figure 13

Features deviations computed from 16QAM signals.

Figure 14
figure 14

Features deviations computed from 32QAM signal.

Figure 15
figure 15

Features deviations computed from 64QAM modulation.

The above figures show that at SNR <10 dB, TTD and MDM are more robust than HOC against the change in HF noise model. It is true in general that η decreases as SNR increases. However, for MPSK signals, the instantaneous amplitude feature has lower deviation for SNR <30 dB. This is intuitively not surprising because the difference between FAWGN and Fbi-kappa relative to FAWGN in this SNR range is smaller than that of higher SNR values. Another observation is that the wavelet based features have maintained almost the same values of η for all considered modulations. In addition, the proposed MDM have shown excellent performance in the sense that they have the lowest deviation as compared to other features.

Conclusions

In this article, we have investigated the robustness of four features categories for the classification of digitally modulated signals in the presence of HF noise models; AWGN and bi-kappa noise. Specifically, the TTD, HOC, wavelets, and MDM features are considered, where the last feature is proposed in this work. It has been shown through computer simulations that HOC are sensitive to the change in noise model especially at low SNR (<10 dB), while TTD, wavelets, and MDM show good robustness (η < 25%) in the investigated range of SNR. Note that the proposed MDM features have the lowest values of η, i.e., highest robustness against variation in noise model, as compared to other features. The results of this article have potential values for the design of a classifier, as they identify the features that of higher robustness with respect to HF noise models. Note that the performance of an AMC designed under the assumption of AWGN noise model cannot be ensured when considering HF communications. Classifiers employing features sensitive to variation in noise model should take this variation into consideration. For example, SVM or ANNs based classifiers need to be designed by training with signals corrupted by both HF noise models.