Open-Set Recognition of Shortwave Signal Based on Dual-Input Regression Neural Network

Zhang, Jian; Wu, Di; Hu, Tao; Wang, Shu; Wang, Shiju; Li, Tingli

doi:10.1007/978-981-19-2456-9_88

Part of the book series: Lecture Notes in Electrical Engineering (LNEE)

Included in the following conference series:

INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND APPLICATIONS

Abstract

Open-set recognition is an important problem in the blind processing of shortwave communication signals. This paper presents a novel method for this problem. First, preprocessing yields the signal data matrix and the vector diagram, which serve as the two network inputs. Then, the network is trained and tested with known signals, and the upper and lower quintile algorithm is used to obtain the interval threshold for identifying known signals and the distance threshold that sets the interception length for unknown-signal clusters. Finally, the network performs numerical regression in the open-set range, and the thresholds, combined with a kernel density clustering algorithm, are used to identify different signals. Simulation results show that the proposed method overcomes two defects of traditional algorithms: they cannot distinguish different types of unknown signals, and they are applicable to only a few signal types.

1 Introduction

Owing to its flexibility, survivability and long-distance transmission capability, shortwave communication has long been retained and further developed as a wireless communication method. Automatic recognition of shortwave signals [1] is an important part of blind signal processing and the basis for subsequent signal analysis, monitoring and countermeasures. With the development of modern shortwave communication technology, signal types are diversifying, specifications are becoming more finely differentiated, and new signal types keep emerging. Most traditional automatic recognition techniques work at the closed-set level: when a new, unknown signal enters the system, no correct result can be obtained. Therefore, to meet the convenience, intelligence and timeliness requirements of modern blind signal processing, research on efficient open-set recognition of shortwave signals is of great value.

At present, most traditional signal recognition algorithms, as well as those based on deep learning, consider only the recognition of known signal types. When a new, unknown signal type appears, it is recognized as one of the known signals, which results in a discrimination error. To solve this problem, Ref. [2] proposed a support vector data description (SVDD) algorithm with density scaled classification margin (DSCM), which determines the margin between the hypersphere and the positive samples according to the relative density of two types of positive training samples, and performs open-set recognition in combination with support vector data description. However, the algorithm can distinguish only 2 types of positive sample signals and classifies all unknown signal types into one class. Ref. [3] extends the incremental support vector machine (ISVM) [4], combined with error correcting output codes (ECOC) [5], to multi-class incremental learning and recognition, but this algorithm cannot solve the forgetting problem in incremental learning. In addition, designing the coding matrix requires more prior information, the multi-class ability is restricted by the code length, and the model must be retrained every time a new signal is received, which leads to low efficiency.

Generative adversarial (GA) methods are also used to solve the open-set recognition problem. Ref. [6] combines an improved intra-class splitting (ICS) algorithm with a generative adversarial algorithm to obtain boundary signal samples, then trains with the boundary samples as unknown signal types to realize open-set recognition. However, constructing the boundary samples is complex and the result is unstable, and the method cannot distinguish different types of unknown signals. Ref. [7] uses generative adversarial network theory to build a reconstruction and discrimination network (RDN) model that identifies the modulation type of signals. However, the difference between the reconstructed signal data and real unknown signal data is difficult to control, and when there are more than 2 known signal types, the classification and discrimination mechanism becomes very complex, which results in low operability. In addition, it still cannot distinguish different types of unknown signals.

Among other methods, Ref. [8] uses the extreme-value Weibull distribution to fit the cut-off probability of the distance from a feature to its feature center, combines the categorical cross-entropy with the center loss, and modifies the output of a dual-channel long short-term memory (DCLSTM) network to perform modulation recognition. This algorithm introduces the concepts of feature center and feature distance. In some cases it can distinguish different unknown signal types, but it cannot distinguish signals of different specifications that share the same modulation mode.

From the above analysis, current open-set signal recognition algorithms have the following shortcomings: 1) some algorithms apply to only 2 types of known signals and no longer work when the number of known signal types increases; 2) existing work focuses on modulation recognition, and recognition of different specifications with the same modulation mode is rarely considered; 3) it is difficult to distinguish different types of unknown signals, so all unknown signals are grouped into a single 'unknown class'.

In this paper, we propose a method that transforms the features of different signals into different regression values and uses these values to distinguish the signals. The contributions of the proposed method are as follows. Firstly, we design a dual-input neural network to fuse and map the feature information extracted from the signal data stream and the vector diagram. For better feature extraction, the network structure is based on dense convolution. Secondly, unlike traditional recognition network structures, we use the hyperbolic tangent (Tanh) activation function at the end of the network to perform numerical regression on signal features, establishing a one-to-one nonlinear mapping between signal features and specific values. Thirdly, we test the network in the closed set, using the upper and lower quintile algorithm to obtain the regression discrimination threshold of each known signal and the center distance threshold for unknown signals. Finally, we perform open-set experiments to demonstrate the effectiveness of the proposed method.

2 Distinguishing Features of Shortwave Signal

2.1 Data Stream

Each shortwave standard has its own generation algorithm and transmission specification. These rules give the signal data stream a unique information organization format. Taking MIL-STD-188-110A (110A) [9], MIL-STD-188-141B (141B) [10] and Link11 SLEW [11] as examples, the typical information transmission formats are shown in Fig. 1.

It can be seen that the data transmission structure of each signal is unique, and the bit lengths of the sequences and fields differ. These differences give the received 110A, 141B and Link11 data streams data characteristics unique to their respective signals. Therefore, if a high-performance and robust feature extraction algorithm can be found for signal data, the features extracted from the signal data stream can be used as recognition criteria to distinguish different types of shortwave signals.

2.2 Vector Diagram

The vector diagram shows the symbol trajectory by reconstructing the two channels of received signal data in time order. It can distinguish not only frequency shift keying (FSK) from phase shift keying (PSK), but also signals with different PSK modulation modes, as shown in Fig. 2. The symbols of PSK signals have fixed phases, so the vector diagram takes the form of constellation points and symbol trajectories, whereas the phase of FSK signals is random during symbol transitions, so the vector diagram takes the form of a circle.

In this paper, the signal vector diagram is used as a supplementary source for feature extraction. Using the powerful feature processing ability of the neural network, the specification feature information represented by the data stream and the modulation feature information represented by the vector diagram are fused, then learned and mapped, to further improve signal recognition performance.
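As an illustration of how a vector diagram could be turned into a fixed-size network input, the following minimal sketch rasterizes complex baseband samples into an image. It is not the authors' preprocessing: the function name `vector_diagram`, the peak normalization and the use of a 2-D histogram are assumptions, and the signal is assumed to be oversampled so that consecutive samples trace the symbol trajectory.

```python
import numpy as np

def vector_diagram(iq, size=128):
    """Rasterize complex baseband samples (I + jQ) into a size x size image.

    Hypothetical helper: each pixel counts how many samples fall into the
    corresponding cell of the I/Q plane.
    """
    iq = np.asarray(iq, dtype=np.complex128)
    peak = np.max(np.abs(iq))
    if peak == 0:
        return np.zeros((size, size))
    # Scale so the largest sample magnitude reaches the image border.
    iq = iq / peak
    # 2-D histogram over the I/Q plane, covering [-1, 1] on both axes.
    edges = np.linspace(-1.0, 1.0, size + 1)
    img, _, _ = np.histogram2d(iq.real, iq.imag, bins=[edges, edges])
    # Normalize counts to [0, 1] so the image does not depend on signal length.
    return img / img.max()

# Ideal QPSK symbols occupy four isolated cells; a constant-envelope
# signal with continuously rotating phase fills a ring.
qpsk = np.tile(np.array([1 + 1j, -1 + 1j, -1 - 1j, 1 - 1j]) / np.sqrt(2), 100)
ring = np.exp(1j * np.linspace(0.0, 2.0 * np.pi, 4000, endpoint=False))
print(np.count_nonzero(vector_diagram(qpsk)), vector_diagram(ring).shape)
```

The two toy inputs mirror the qualitative difference described above: discrete constellation points for PSK and a circle for a constant-envelope signal.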

3 Proposed Method

In this section, we first describe the dual-input neural network architecture of our method, then we present the algorithm for obtaining the discrimination threshold. Finally, we demonstrate the procedure of the proposed scheme.

3.1 Dual-Input Regression Neural Network

Regression analysis (RA) is a statistical method for determining the relationship between two or more variables. We construct a dual-input regression neural network to map the extracted signal features to specific values. Using the differences between the numerical regression results, we can distinguish different signals in the open-set range.

The proposed dual-input regression neural network is illustrated in Fig. 3. Feature extraction is performed by 7 feature extraction modules, whose structure is shown in Fig. 4. Adjacent feature extraction modules are connected through transformation modules, each of which contains a 1 × 1 convolution and a 2 × 2 average pooling. After the features are extracted by the above $(66 + 18) \times 2 + 5 = 173$-layer network and a 7 × 7 global average pooling is applied, the resulting feature information is fused by concatenation, and the nonlinear relationship between signal features and specific values is then established by regression. Except at the end of the network, the rectified linear unit (ReLU) is used in each layer. During compilation and optimization of the network, the Adam algorithm is used to find the optimal network parameters.
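To make the transformation module and the fusion step concrete, here is a small NumPy sketch of a 1 × 1 convolution followed by 2 × 2 average pooling, plus global average pooling and concatenation of two branches. It is only a sketch under stated assumptions: the function names, the channel counts, the omission of bias terms and the channel-first layout are all hypothetical, and no trained weights from the paper are involved.

```python
import numpy as np

def conv1x1(x, weight):
    """1 x 1 convolution: a linear mix of channels at every pixel.
    x has shape (C_in, H, W); weight has shape (C_out, C_in). No bias."""
    return np.einsum("oc,chw->ohw", weight, x)

def avg_pool2x2(x):
    """2 x 2 average pooling with stride 2 (H and W must be even)."""
    c, h, w = x.shape
    return x.reshape(c, h // 2, 2, w // 2, 2).mean(axis=(2, 4))

def transition(x, weight):
    """Transformation module: 1 x 1 convolution, then 2 x 2 average pooling."""
    return avg_pool2x2(conv1x1(x, weight))

def global_avg_pool(x):
    """Average each channel over its whole spatial extent."""
    return x.mean(axis=(1, 2))

# Toy input: 2 channels of 4 x 4 values, mixed into 1 output channel.
x = np.arange(2 * 4 * 4, dtype=float).reshape(2, 4, 4)
w = np.array([[1.0, 1.0]])
y = transition(x, w)          # shape (1, 2, 2)

# Fusion by concatenation of the pooled features of two branches.
fused = np.concatenate([global_avg_pool(x), global_avg_pool(y)])
print(y.shape, fused.shape)
```

The pooling halves each spatial dimension, which is why a chain of such modules can reduce the inputs to the 7 × 7 maps that the global average pooling then collapses.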

At the end of the network, the Tanh activation function is used to regress signal feature vectors to preset specific values:

$$ {\text{Tanh}} (x) = \frac{{e^{x} - e^{ - x} }}{{e^{x} + e^{ - x} }},x \in ( - \infty , + \infty ) $$

(1)

For comparison, the Sigmoid activation function, which is widely used in regression, is:

$$ {\text{Sigmoid}}(x) = \frac{1}{{1 + e^{ - x} }},x \in ( - \infty , + \infty ) $$

(2)

The Sigmoid activation function may change the distribution of the original data to some extent, as shown in Fig. 5, while Tanh does not. Moreover, Tanh has a larger gradient around the origin (at most 1, versus 0.25 for Sigmoid), so convergence is faster in regression and a better training result can be achieved.
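The comparison between Eq. (1) and Eq. (2) can be checked numerically. The short sketch below evaluates both derivatives and verifies the identity $\tanh(x) = 2\,\mathrm{Sigmoid}(2x) - 1$, which shows that Tanh is a zero-centered, rescaled Sigmoid. The helper names are illustrative only.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def tanh_grad(x):
    # d/dx tanh(x) = 1 - tanh(x)^2
    return 1.0 - np.tanh(x) ** 2

def sigmoid_grad(x):
    # d/dx sigmoid(x) = sigmoid(x) * (1 - sigmoid(x))
    s = sigmoid(x)
    return s * (1.0 - s)

x = np.linspace(-4.0, 4.0, 9)
# Tanh is a shifted and rescaled Sigmoid, symmetric about zero.
identity_holds = np.allclose(np.tanh(x), 2.0 * sigmoid(2.0 * x) - 1.0)
# Peak gradients, both reached at x = 0.
print(identity_holds, tanh_grad(0.0), sigmoid_grad(0.0))  # True 1.0 0.25
```

Note that the larger gradient holds near the origin; far from zero both functions saturate and the Tanh gradient decays faster.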

3.2 Discrimination Threshold

When several samples of a specific signal are regressed, the resulting values fall into a small range. In this paper, the upper and lower quintile algorithm is used to obtain the interval threshold and the center distance threshold of known signals. The interval threshold is the basis for distinguishing known from unknown signals, and the center distance threshold is the length used when intercepting the numerical clusters of unknown signals. Suppose that after regression of a known signal S, the numerical distribution of several samples is as shown in Fig. 6.

Define $\gamma_{low}$ as the lower quintile of the data set, meaning that only 1/5 of the data have a value less than $\gamma_{low}$. Similarly, define $\gamma_{up}$ as the upper quintile of the data set, meaning that only 1/5 of the data have a value greater than $\gamma_{up}$. According to the upper and lower quintile algorithm, the interval threshold of the regression value for signal S is defined as:

$$ \left\{ \begin{gathered} \delta_{low} = \gamma_{low} - \mu (\gamma_{up} - \gamma_{low} ) \hfill \\ \delta_{up} = \gamma_{up} + \mu (\gamma_{up} - \gamma_{low} ) \hfill \\ \end{gathered} \right. $$

(3)

where $\delta_{low}$ is the lower bound threshold of the regression value for signal S, $\delta_{up}$ is the upper bound threshold, and $\mu$ is the scale factor, which is 1.5 in this paper. In addition, $\delta_{up} - \delta_{low}$ is the distance between the upper and lower thresholds of the regression for signal S. After the regression test of the known signals in the closed set, the center distance threshold D is calculated as:

$$ D = \lambda \frac{1}{2J}\sum\limits_{j = 1}^{J} {(\delta_{up}^{(j)} - \delta_{low}^{(j)} )} $$

(4)

D is used as the length of the subsequent center-distance interception of the numerical clusters of unknown signals. In Eq. (4), J is the number of known signal types, $\delta_{up}^{(j)}$ and $\delta_{low}^{(j)}$ are the upper and lower bound thresholds of the j-th known signal, and $\lambda$ is the grace factor, which is 1.38 in this paper.
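Equations (3) and (4) translate directly into a few lines of NumPy. The sketch below is one possible reading, assuming that the lower and upper quintiles are the 20th and 80th percentiles as computed by `numpy.percentile` with its default linear interpolation; the paper does not state which quantile estimator is used, and the function names are hypothetical.

```python
import numpy as np

def interval_threshold(values, mu=1.5):
    """Eq. (3): return (delta_low, delta_up) for one known signal."""
    # Lower and upper quintiles (assumed: 20th and 80th percentiles).
    gamma_low, gamma_up = np.percentile(values, [20, 80])
    spread = gamma_up - gamma_low
    return gamma_low - mu * spread, gamma_up + mu * spread

def center_distance_threshold(intervals, lam=1.38):
    """Eq. (4): D = lam / (2J) * sum_j (delta_up_j - delta_low_j)."""
    widths = [up - low for low, up in intervals]
    return lam * sum(widths) / (2 * len(widths))

# Toy regression outputs for one known signal, evenly spread over [0, 1].
values = np.linspace(0.0, 1.0, 101)
low, up = interval_threshold(values)
D = center_distance_threshold([(low, up)])
print(low, up, D)
```

With $\lambda = 1.38$, D is 0.69 times the average interval width, so the interception range around a density center is somewhat wider than half a typical known-signal interval.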

3.3 Algorithm Scheme

According to the above discussion, the open-set recognition process is as follows:

1) Preprocess the known shortwave signals and construct the training signal data sets;
2) Train the network with the training data set. When the network's loss value falls below the preset threshold, terminate training and save the network;
3) Since the network cannot achieve zero-error regression, use the trained network to test the known signals. With the upper and lower quintile algorithm, obtain the interval threshold of each known signal and the center distance threshold, which serve as the criteria for distinguishing known from unknown signals and for the subsequent interception of unknown signals;
4) In the open-set range, use the network to recognize the preprocessed signals. If the regression value of a signal falls within the interval threshold of a known signal from step 3), it is judged to be that known signal; if it falls outside the interval thresholds of all known signals, it is judged to be an unknown signal;
5) Use the kernel density clustering algorithm [14] to cluster all regression values identified as unknown signals, obtaining the number of categories, the numerical clusters of regression values and the corresponding density center coordinates. For each numerical cluster, intercept around the density center coordinate with the center distance threshold; the signal samples whose regression values fall within the interception range are identified as that unknown signal, which completes the open-set recognition.
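Steps 4) and 5) can be sketched as follows. This is not the kernel density clustering algorithm of [14]: as a stand-in, the sketch takes the local maxima of a one-dimensional Gaussian kernel density estimate as density centers. The function names, the `bandwidth` parameter and the grid resolution are assumptions introduced for illustration.

```python
import numpy as np

def classify_known(value, intervals):
    """Step 4: index of the known signal whose interval contains value, else -1."""
    for k, (low, up) in enumerate(intervals):
        if low <= value <= up:
            return k
    return -1

def density_centers(values, bandwidth, grid_size=512):
    """Local maxima of a 1-D Gaussian kernel density estimate (stand-in for [14])."""
    values = np.asarray(values, dtype=float)
    grid = np.linspace(values.min() - 3 * bandwidth,
                       values.max() + 3 * bandwidth, grid_size)
    # Unnormalized density: sum of Gaussian kernels centered on the samples.
    z = (grid[:, None] - values[None, :]) / bandwidth
    density = np.exp(-0.5 * z ** 2).sum(axis=1)
    # A grid point is a center if it rises above its left neighbor
    # and is not exceeded by its right neighbor.
    peak = (density[1:-1] > density[:-2]) & (density[1:-1] >= density[2:])
    return grid[1:-1][peak]

def cluster_unknown(values, centers, D):
    """Step 5: label each value with its nearest center if within D, else -1."""
    values = np.asarray(values, dtype=float)
    centers = np.asarray(centers, dtype=float)
    dist = np.abs(values[:, None] - centers[None, :])
    return np.where(dist.min(axis=1) <= D, dist.argmin(axis=1), -1)

# Toy example: two unknown clusters near 0.5 and 1.5, plus one stray value.
unknown = np.array([0.48, 0.50, 0.52, 1.48, 1.50, 1.52])
centers = density_centers(unknown, bandwidth=0.05)
labels = cluster_unknown(np.append(unknown, 1.0), centers, D=0.0581)
print(len(centers), labels)
```

Values that fall outside every interception range keep the label -1 here; how such samples are scored in the experiments is not specified in the text, so this is only one possible convention.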

4 Experimental Results

In this section, the recognition performance of the proposed method is evaluated by simulation. The experimental platform is configured with an Intel(R) Xeon(R) E-2276M processor, an NVIDIA Quadro RTX 5000 GPU and 32 GB of DDR4 memory.

The experiment uses 6 signal types: 110A, MIL-STD-188-110B (110B) [15], MIL-STD-188-141A (141A) [16], 141B, Link11 SLEW and PACTOR [17]. The signal settings are shown in Table 1. During the experiment, 110A, 141B, Link11 SLEW and PACTOR are used for network training as known signals and are set to regress to the values 0, 1, 2 and 3, respectively. 110B and 141A, as unknown signals, are not used for training. After the discrimination thresholds are obtained according to Sect. 3.2, 110B and 141A are used as network input together with the 4 known signals in the open-set test stage.

Table 1. Attributes of experimental signal samples

The vector diagram is generated at a size of 128 × 128 to fit the structure of the network. For the data stream, changes in the statistical distribution of the data affect network performance, causing an inconsistent dynamic range across calculation dimensions and a decline in learning performance. Therefore, the following normalization is adopted:

$$ {\text{Norm}}(data) = \frac{{data - \frac{\max (data) + \min (data)}{2}}}{\max (data) - \min (data)}{ + }0.5 $$

(5)

where $data$ represents the signal data before normalization and ${\text{Norm}}(data)$ is the data after normalization. With normalization, the network processes data at the same scale and achieves better learning and regression performance. In addition, since the neural network operates efficiently on two-dimensional data structures, the normalized data is arranged as a 336 × 336 data matrix.
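Equation (5) is algebraically the same as min-max scaling, $(data - \min)/(\max - \min)$, so the output lies in [0, 1]. The sketch below implements Eq. (5) as written and arranges the result as a square matrix. The function names, the choice to use the first 336 × 336 samples and the row-major fill order are assumptions, since the paper does not describe how the matrix is filled.

```python
import numpy as np

def normalize(data):
    """Eq. (5); equal to (data - min) / (max - min) for non-constant data."""
    data = np.asarray(data, dtype=float)
    hi, lo = data.max(), data.min()
    return (data - (hi + lo) / 2.0) / (hi - lo) + 0.5

def to_matrix(data, side=336):
    """Normalize the first side*side samples and arrange them row by row."""
    data = np.asarray(data, dtype=float)
    n = side * side
    if data.size < n:
        raise ValueError("need at least side * side samples")
    return normalize(data[:n]).reshape(side, side)

print(normalize([2.0, 4.0, 6.0]))  # [0.  0.5 1. ]
matrix = to_matrix(np.arange(336 * 336 + 10))
print(matrix.shape, matrix.min(), matrix.max())
```

A constant input would make the denominator zero, so a practical pipeline would need to handle that case separately.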

4.1 Recognition Performance

Table 2 shows the open-set recognition results of the proposed method at a signal-to-noise ratio (SNR) of 6 dB. After regression, the 4 known signals 110A, 141B, Link11 SLEW and PACTOR do not regress exactly to their preset values but show slight deviations. Therefore, according to the upper and lower quintile algorithm in Sect. 3.2, the upper and lower bound thresholds of the regression for each known signal are obtained to distinguish known from unknown signals. The center distance threshold obtained for center-distance interception of unknown signals is 0.0581. The results indicate that at an SNR of 6 dB, the recognition accuracy of the known signals exceeds 96%, which verifies the feasibility of the proposed method.

Table 2. Open-set recognition results of the proposed method

Once regression is complete, the kernel density clustering algorithm is used to obtain the numerical clusters and density centers of the unknown signals, which are then intercepted using the center distance threshold. The proposed method distinguishes unknown signal 1 (110B) with a recognition accuracy of 90.1% and unknown signal 2 (141A) with a recognition accuracy of 99.20%.

Overall, traditional open-set recognition methods apply to few signal types, have difficulty distinguishing signals of different specifications with the same modulation mode, and have difficulty distinguishing different unknown signals. In contrast, the proposed method can effectively handle the open-set signal data set, in which 4 signals use the 8PSK modulation mode, and can distinguish different types of unknown signals.

4.2 Influence of Numerical Scale on Regression

This section discusses, through comparative experiments, the influence of the training regression scale on network performance. Table 3 shows the training regression values of the 2 experiments on the known signals 110A, 141B, Link11 SLEW and PACTOR. During the training stage, the 4 known signals are regressed to the values 0, 1, 2, 3 in one experiment and 0, 100, 200, 300 in the other.

To make the results easier to observe, signal samples are input into the network in order of signal type during the test stage. The correspondence between signal sample type and signal serial number is shown in Table 4.

Table 3. Training regression value of each experiment

There are 1000 samples of each signal type. The regression results of each experiment are shown in Fig. 7. It can be seen that when a different regression scale is set, the network performs numerical regression according to the preset scale, and the results of both experiments show good discrimination.

Table 4. Corresponding relationship between signal sample type and serial number

This is because, although the numerical scales differ, once the network has been trained at a given scale, it forms a nonlinear mapping that matches this scale. In other words, training at different scales only changes the numerical magnitude of the regression results and does not affect the discrimination performance between signals.

5 Conclusions

By combining the feature information of the shortwave signal data stream and the vector diagram, an open-set signal recognition method is proposed. Using the good feature extraction ability of densely connected convolution and the feature processing and regression performance of the dual-input regression neural network, the open-set signal recognition task is completed well. Experimental results show that, compared with traditional methods, the proposed method can distinguish different types of unknown signals while maintaining open-set recognition accuracy, and can effectively distinguish signals of different specifications with the same modulation mode. In addition, this paper proposes establishing a regression relationship between signal features and specific values, so that the features of different signal types are embodied as different regression values. This idea of transforming feature information for processing provides a new approach for further research in this field.

References

[1] Jondral, F.: Automatic classification of high frequency signals. J. Signal Process. 9, 177–190 (1985)
[2] Zhenxing, L., Shichuan, C., Xiaoniu, Y.: Two-class SVDD algorithm for open-set specific emitter identification. J. Commun. Countermeas. 36, 1–6 (2017)
[3] Ying, Y., Lidong, Z.: Method for efficiently recognize satellite interference signals via incremental support vector machine. In: 15th Annual Conference of Satellite Communications, pp. 163–171. China Academic Journal Electronic Publishing House, Beijing (2019)
[4] Diehl, C.P., Cauwenberghs, G.: SVM incremental learning, adaptation and optimization. In: The International Joint Conference on Neural Networks, pp. 2685–2690. IEEE Press, Piscataway (2003)
[5] Escalera, S., Pujol, O., Radeva, P.: Error-correcting output codes library. J. Mach. Learn. Res. 11, 661–664 (2010)
[6] Yujie, X., Xiaowei, Q., Xiaodong, X., Jianqiang, C.: Open-set interference signal recognition using boundary samples: a hybrid approach. In: 12th International Conference on Wireless Communications and Signal, pp. 269–274. IEEE Press, Piscataway (2020)
[7] Yunfei, H., Zhangmeng, L., Fucheng, G., Ming, Z.: Open-set recognition of signal modulation based on generative adversarial networks. J. Syst. Eng. Electron. 41, 2619–2624 (2019)
[8] Youwei, G., Hongyu, J., Jing, W.: Open set modulation recognition based on dual-channel LSTM model. arXiv preprint arXiv:2002.12037 (2020)
[9] Hector, S., Santiago, Z., Ivan, P., Ivana, R., et al.: Special issue on MC-SS validation of a HF spread spectrum multi-carrier technology through real-link measurements. J. Eur. Trans. Telecommun. 17, 651–657 (2012)
[10] Johnson, E.E.: Simulation results for third-generation HF automatic link establishment. J. Proc. IEEE Milit. Commun. Conf. 2, 984–988 (1999)
[11] Zhu, C.: Non-cooperative demodulation of LINK11_SLEW. J. Telecommun. Eng. 54, 1378–1384 (2014)
[12] Gao, H., Zhuang, L., Laurens, V., Kilian, Q.W.: Densely connected convolutional networks. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2261–2269. IEEE Press, Piscataway (2017)
[13] Xiong, Z., Mankun, X., Hua, P., Xin, Q., Tianyun, L.: Specific protocol signal recognition based on deep residual network. J. Acta Electronica Sinica 47, 1532–1537 (2019)
[14] Fagui, L., Yufei, C.: An energy aware adaptive kernel density estimation approach to unequal clustering in wireless sensor networks. J. IEEE Access 7, 40569–40580 (2019)
[15] Nieto, J.W., Furman, W.N.: Constant-amplitude waveform variations of US MIL-STD-188-110B and STANAG 4539. In: 2016 IET International Conference on Ionospheric Radio Systems and Techniques (IRST), pp. 212–216. IET Press, London (2006)
[16] Baker, M., Beamish, W., Turner, M.: The use of MIL-STD-188-141A in HF data networks. In: IEEE Military Communications Conference, pp. 75–79. IEEE Press, Piscataway (2002)
[17] Mohd, Y.R., Zainal, N., Abd, M.S.: Performance of 8FSK base on PACTOR I protocol over AWGN channels. In: 5th International Conference on Information Technology, Computer, and Electrical Engineering, pp. 1–5. IEEE Press, Piscataway (2018)

Author information

Authors and Affiliations

College of Data Target Engineering, Strategic Support Force Information Engineering University, Science Avenue. 62, Zhengzhou, 450001, China
Jian Zhang, Di Wu, Tao Hu, Shu Wang, Shiju Wang & Tingli Li

Corresponding author

Correspondence to Jian Zhang.

Editor information

Editors and Affiliations

College of Communication Engineering, Jilin University, Jilin, Jilin, China
Zhihong Qian
Department of AI & ML, Vardhaman College of Engineering, Hyderabad, Telangana, India
M.A. Jabbar
College of Technology, Indiana State University, Terre Haute, IN, USA
Xiaolong Li

Rights and permissions

Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

About this paper

Cite this paper

Zhang, J., Wu, D., Hu, T., Wang, S., Wang, S., Li, T. (2022). Open-Set Recognition of Shortwave Signal Based on Dual-Input Regression Neural Network. In: Qian, Z., Jabbar, M., Li, X. (eds) Proceeding of 2021 International Conference on Wireless Communications, Networking and Applications. WCNA 2021. Lecture Notes in Electrical Engineering. Springer, Singapore. https://doi.org/10.1007/978-981-19-2456-9_88

DOI: https://doi.org/10.1007/978-981-19-2456-9_88
Published: 13 July 2022
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-2455-2
Online ISBN: 978-981-19-2456-9
eBook Packages: Engineering, Engineering (R0)
