Abstract
Openset recognition in blind shortwave signal processing is an important issue in modern communication signal processing. This paper presents a novel method for this problem. By preprocessing, the signal data matrix and vector diagram are obtained as network input. Then, the network is trained and tested with the known signal, and the upper and lower quintile algorithm is used to obtain the interval threshold for judging the known signal and the distance threshold for intercepting the length range of the unknown signal. Finally, the network is used for numerical regression in openset range, the threshold combined with kernel density clustering algorithm is used to identify different signals. Simulation results show that the proposed method overcomes the defects of traditional algorithm, which cannot distinguish different types of unknown signals and only applicable for few signal types.
You have full access to this open access chapter, Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introduction
Due to the flexibility, survivability and longdistance transmission, shortwave communication has always been a reserved and development method in the field of wireless communication. Shortwave signal automatic recognition technology [1] is an important content of signal blind processing and an important basis for subsequent signal analysis, monitoring and countermeasure. With the development of modern shortwave communication technology, shortwave communication shows a trend of diversification of types, fine differentiation of specifications and continuous emergence of new signal types. Most of the traditional signal automatic recognition technologies are concentrated in the closedset level. When new unknown signal enter the system, the correct result cannot be obtained. Therefore, in order to meet the need of convenience, intelligence and timeliness of modern blind signal processing, it is of great value to carry out the research on efficient openset recognition technology of shortwave signal.
At present, most traditional signal recognition algorithms as well as algorithms based on deep learning only consider the recognition of known signal types. When a new unknown signal type appears, it will be recognized as one of the known signal, resulting in discrimination error. To solve the above problem, Literature [2] proposed a support vector data description (SVDD) algorithm with density scaled classification margin (DSCM), which determines the interval between hypersphere and positive samples according to the relative density proportion of two types of positive training samples, and carries out openset recognition in combination with support vector description, However, the algorithm can only distinguish 2 types of positive sample signals, and will classify all unknown signal types into one class. Literature [3] extends the algorithm of incremental support vector machine (ISVM) [4] combined with error correcting output codes (ECOC) [5] to multi classification for incremental learning and recognition, but this algorithm cannot solve the forgetting problem in incremental learning. Besides, designing coding matrix requires more priori information, and its multi classification ability is restricted by the coding length, as well as the model needs to be trained every time when a new signal is received, lead to its low efficiency.
The generative adversarial (GA) method is also used to solve the openset recognition problem. Literature [6] combines the improved intra class splitting (ICS) algorithm with the genetic adversarial algorithm to obtain the boundary signal samples, then trains the boundary signal samples as unknown types of signals and realizes the openset recognition. However, the process of constructing boundary samples is complex and the effect is unstable, and it also cannot distinguish different types of unknown signal. Literature [7] uses the generative countermeasure network theory to build a reconstruction and discrimination network (RDN) model to identify the modulation types of signals. However, the difference between the reconstructed signal data and the real unknown signal data is difficult to control, and when the known signal types is more than 2, the classification and discrimination mechanism will be very complex, which results in low operability. In addition, it is still unable to distinguish different types of unknown signals.
Some other methods, such as Literature [8] uses the extreme valueweibull distribution to fit the cutoff probability of the distance from the feature to the feature center, combines the classification cross entropy with the center loss, and modifies the output of the dual channel longshort term memory (DCLSTM) network to conduct the modulation recognition. This algorithm proposes the concepts of feature center and feature distance. In some cases, it can distinguish different unknown types of signals, but it cannot distinguish signals of different specifications with the same modulation mode.
From the above analysis, it can be concluded that the current signal openset recognition algorithms have the following shortcomings: 1) Some algorithms are only applicable to 2 types of known signals, and no longer applicable when the number of known signal type increases; 2) The existed works focus on the signal modulation recognition, the recognition method for different specifications with the same modulation mode is hardly considered; 3) It is difficult to distinguish different types of unknown signals, unknown signals can only be distinguished into one class, called â€˜unknown classâ€™.
In this paper, we propose a method to transform features of different signals into different regression values, and use these values to distinguish different signals. The contributions of proposed method are described as follow: Firstly, we design a dualinput neural network to fuse and map the feature information extracted from signal data stream and vector diagram. For better feature extraction, we design a network structure based on dense convolution theory. Secondly, different from the traditional recognition network structure, we use the hyperbolic tangent (Tanh) activation function to perform numerical regression on signal features at the end of the network, and establish a onetoone nonlinear mapping relationship between signal feature and specific value. Thirdly, we test the network in closedset, using the upper and lower quintile algorithm to obtain the regression discrimination threshold of each known signal and the center distance threshold for unknown signal. Finally, we perform openset experiments to demonstrate the effectiveness of the proposed method.
2 Distinguishing Features of Shortwave Signal
2.1 Data Stream
Specific shortwave standard has unique generation algorithm and transmission specification. These rules and standards make its signal data stream presents unique information organization format. Taking MILSTD188110A (110A) [9], MILSTD188141B(141B) [10] and Link11 SLEW [11] as an example, the typical information transmission format is shown in Fig.Â 1.
We can conclude that the data transmission organization structure of different signals is unique, and the bits of each sequence and field are not the same. These differences make the received 110A, 141B and Link11 data stream present the unique data characteristics of their respective signal. Based on this, if a feature extraction algorithm with high performance and strong robustness can be found for signal data, the feature extracted from signal data stream can be used as recognition criteria to distinguish the type of different shortwave signals.
2.2 Vector Diagram
Vector diagram shows the symbol track by reconstructing two channels of received signal data in time order, not only can distinguish frequency shift keying (FSK) and phase shift keying (PSK), but also can distinguish signals with different PSK modulation modes, as shown in Fig.Â 2. The symbols of PSK signals have a fixed phase, so the vector diagram is in the form of constellation point and symbol trajectory, while the phase of FSK signals is random during symbol conversion, so the vector diagram is in the form of circle.
In this paper, the signal vector diagram is used as the supplementary feature extraction source. By powerful feature processing ability of neural network, the different feature information of signal specification represented by data flow and the modulation feature information represented by vector diagram is fused, and then learned and mapped, to further improve the performance of signal recognition.
3 Proposed Method
In this section, we first describe the dualinput neural network architecture of our method, then we present the algorithm for obtaining the discrimination threshold. Finally, we demonstrate the procedure of the proposed scheme.
3.1 DualInput Regression Neural Network
Regression analysis (RA) is a statistical analysis method to determine the relationship between two or more variables. We construct dualinput regression neural network to map the extracted signal feature to specific value. By using the difference of numerical regression result, we can distinguish different signals in openset range.
The proposed dualinput regression neural network is illustrated in Fig.Â 3. The feature extraction is conducted by 7 feature extraction modules. The structure of feature extraction module is shown in Fig.Â 4. The network connects adjacent feature extraction module through the transformation module, each transformation module contains a 1â€‰Ã—â€‰1 convolution and a 2â€‰Ã—â€‰2 average pool. After extracting the feature via the above \((66{ + }18) \times 2 + 5 = 173\) layers network and conduct a 7â€‰Ã—â€‰7 global average pool, the acquired feature information are fused by concatenation, and then establish the nonlinear relationship between signal feature and specific value by regression processing. Except for the end of the network, the rectified linear unit (ReLu) is used in each layer. During the compilation and optimization of the network, the Adam algorithm is used to work out the optimal solution of the network structure parameters.
At the end of the network, Tanh activation function is used for regression from signal eigenvectors to preset specific values:
Compared with Sigmoid activation function, which is widely used in regression operation:
The Sigmoid activation function may change the distribution of original data to some extent, as shown in Fig.Â 5, while Tanh does not. Moreover, Tanh has a larger gradient, so that the convergence speed is faster in regression operation, which can achieve better training effect.
3.2 Discrimination Threshold
After regression of a specific signal with several signal samples, the result values will fall into a small range. In this paper, the upper and lower quintile algorithm is used to work out the interval threshold and center distance threshold of known signal, in which the interval threshold is used as the basis to distinguish known and unknown signals, the center distance threshold is taken as the length when intercepting the numerical cluster of unknown signals. Suppose that after regression processing of a known signal S, the numerical distribution of several samples is shown in Fig.Â 6.
Define \(\gamma_{low}\) as the lower quintile of the data set, indicating that there is only 1/5 of all data, which value is less than \(\gamma_{low}\). Similarly, define \(\gamma_{up}\) as the upper quintile of the data set, which means that only 1/5 of all data has a value greater than \(\gamma_{up}\). According to the upper and lower quintile algorithm, the interval threshold of regression value for signal S is defined as:
where \(\delta_{low}\) is the lower bound threshold of regression value for signal S, \(\delta_{up}\) is the upper bound threshold, and \(\mu\) is the scale factor, which is 1.5 in this paper. In addition, \(\delta_{up}  \delta_{low}\) is the upper and lower distance threshold of the regression for signal S. After regression test of known signals in the closedset, use:
To calculate the center distance threshold D, which is used as the length of subsequent centerdistance interception of unknown signals numerical clusters. In Eq.Â (4), J is the number of known signal types, \(\delta_{up}^{(n)}\) and \(\delta_{low}^{(n)}\) represent the upper bound threshold and lower bound threshold of the jth known signal, \(\lambda\) is the grace factor, the value we use is 1.38.
3.3 Algorithm Scheme
According to the above discussion, the openset recognition process is as follows:

1)
Preprocess known shortwave signals and construct training signal data sets;

2)
Use the training data set to train the network, when the networkâ€™s loss value falls below the preset threshold, the training is terminated and the network is saved;

3)
Since the network cannot conduct zeroerror regression, the trained network is used to test the known signal. With the upper and lower quintile algorithm, the interval threshold and center distance threshold of each known signal are obtained as the standard to distinguish between known and unknown signals and the subsequent interception of the unknown signal;

4)
In the openset range, use the network to recognize the preprocessed signals. For the regression value of a specific signal, if it falls within the threshold of a known signal interval in step 3), it is judged as such known signal, and if it falls outside the threshold of all known signal intervals, it is judged as unknown signal;

5)
Use the kernel density clustering algorithm [14] to cluster all regression values identified as unknown signals to obtain the number of categories, regression numerical clustering clusters and corresponding density center coordinate. For each numerical clustering cluster, use the density center coordinate combined with the center distance threshold to intercept, the signal samples represented by the regression numerical points falling within the interception range are identified as such unknown signal, so as to complete the openset recognition.
4 Experimental Results
In this section, the recognition performance of proposed method is simulated and tested. The experimental platform is configured with Intel (R) Xeon (R) e2276m processor, NVIDIA Quadro RTX 5000 GPU and 32 GB DDR4 memory.
Signal used in the experiment includes 6 types: 110A, MILSTD188110B (110B) [15], MILSTD188141A(141A) [16], 141B, Link11 SLEW, PACTOR [17]. The signal setting of the experiment is shown in Table 1. During experiment, 110A, 141B, Link11 SLEW and PACTOR are used for network training as known signals, and are set to regress to the value of 0, 1, 2, and 3. 110B and 141A as unknown signals are not used for training. After obtaining the discrimination threshold according to Sect. 3.2, 110B and 141A are used as network input together with the 4 known signals in the openset test stage.
For generating vector diagram, the size is set to 128â€‰Ã—â€‰128 to fit the structure of the network. For data stream, as the networkâ€™s performance will be affected by the change of data statistical distribution, resulting in the inconsistency of calculation dimensional dynamic range and the decline of learning performance. Therefore, the normalization algorithm is adopted as:
which \(data\) represents the signal data before normalization, \({\text{Norm}}(data)\) is the data after normalization processing. With normalization, the network can process data at the same scale, gaining better learning and regression performance. In addition, considering that the neural network can perform efficient operation on twodimensional data structure, so the normalized data is constructed as 336â€‰Ã—â€‰336 data matrix to obtain the high efficiency of data structure.
4.1 Recognition Performance
Table 2 shows the openset recognition result of proposed method, The signaltonoise ratio (SNR) of the experiment is 6dB. It is shown that after regression operation of 4 known signals 110A, 141B, Link11 SLEW and PACTOR, it does not completely regressed to the preset value, but have slight deviation. Therefore, according to the upper and lower quintile algorithm in Sect.Â 3.2, the upper bound and lower bound thresholds of regression for each known signals are obtained to distinguish known and unknown signal. At the same time, the center distance threshold obtained for centerdistance interception of unknown signals is 0.0581. The experiment result indicates that when the SNR is 6dB, the recognition accuracy of known signals reaches more than 96%, which verifies the feasibility of the proposed method.
Once regression processing is completed, use the kernel density clustering algorithm to obtain the numerical clustering clusters and density centers of unknown signal, and then intercepts them by using the center distance threshold. The proposed method can distinguish the unknown signal 1 (110B) with a recognition accuracy of 90.1%, and the unknown signal 2 (141A) with a recognition accuracy of 99.20%.
Overall, compared with the traditional openset recognition method, which has few applicable signal types, difficult to distinguish signals of different specifications with same modulation mode and difficult to distinguish different unknown signals, the proposed method can effectively deal with the openset signal data set, of which 4 signals are 8PSK modulation mode, and can distinguish different types of unknown signals.
4.2 Influence of Numerical Scale on Regression
This section discusses the influence of different training regression scale on network performance through comparative experiments. Table 3 shows the training regression value of 2 experiments on the known signals 110A, 141B, Link11 SLEW and PACTOR. During the training stage, 4 known signals are regressed to the value of 0, 1, 2, 3 and 0, 100, 200, 300.
In order to better observe the result, signal samples are input into the network in the order of signal type during the test stage. The corresponding relationship between signal sample type and signal serial number is shown in Table 4.
The number of each signal type is 1000. The regression result of each experiment is shown in Fig.Â 7. It can be seen that when different scale of regression is set, the network will carry out numerical regression according to the preset scale, and the result of both experiment have good discrimination.
This is because, although the numerical scales are different, once the network completes the training under this scale, a nonlinear mapping relationship matching this scale is formed. In other words, the training of different scale will only lead to the difference in the numerical dimension of regression result, and will not affect the discrimination performance between signals.
5 Conclusions
By combining the feature information of shortwave signal data stream and vector diagram, an openset signal recognition method is proposed. Using the good feature extraction ability of densely connected convolution and the excellent feature processing and regression performance of dualinput regression neural network, the openset signal recognition task is well completed. Experimental results show that compared with the traditional method, the proposed method can distinguish different type of unknown signals while maintaining the openset recognition accuracy, and can effectively distinguish signals of different specifications with same modulation mode. In addition, this paper proposes to establish the regression relationship between signal feature and specific value, and embody the feature of different signal types as different regression values. This idea of transforming feature information for processing provides a new approach for further research in this field.
References
Jondral, F.: Automatic classification of high frequency signals. J. Signal Process. 9, 177â€“190 (1985)
Zhenxing, L., Shichuan, C., Xiaoniu, Y.: Twoclass SVDD algorithm for openset specific emitter identification. J. Commun. Countermeas. 36, 1â€“6 (2017)
Ying, Y., Lidong, Z.: Method for efficiently recognize satellite interference signals via incremental support vector machine. In: 15th Annual Conference of Satellite Communications, pp.163â€“171. China Academic Journal Electronic Publishing House, Beijing (2019)
Diehl, C.P., Cauwenberghs, G.: SVM incremental learning, adaptation and optimization. In: The International Joint Conference on Neural Networks, pp. 2685â€“2690. IEEE Press, Piscataway(2003)
Escalera, S., Pujol, O., Radeva, P.: Errorcorrecting output codes library. J. J. Mach. Learn. Res. 11, 661â€“664 (2010)
Yujie, X., Xiaowei, Q., Xiaodong, X., Jianqiang, C.: Openset interference signal recognition using boundary samples: a hybrid Approach. In: 12th International Conference on Wireless Communications and Signal, pp. 269â€“274. IEEE Press, Piscataway (2020)
Yunfei, H., Zhangmeng, L., Fucheng, G., Ming, Z.: Openset recognition of signal modulation based on generative adversarial networks. J. Syst. Eng. Electron. 41, 2619â€“2624 (2019)
Youwei, G., Hongyu, J., Jing, W.: Open set modulation recognition based on dualchannel LSTM model. J. arXiv Preprint, arXiv: 2002.12037 (2020)
Hector, S., Santiago, Z., Ivan, P., Ivana, R., et al.: Special issue on MCSS validation of a HF spread spectrum multicarrier technology through reallink measurements. J. Eur. Trans. Telecommun. 17, 651â€“657 (2012)
Johnson, E.E.: Simulation results for thirdgeneration HF automatic link establishment. J. Proc. IEEE Milit. Commun. Conf. 2, 984â€“988 (1999)
Zhu, C.: Noncooperative demodulation of LINK11_SLEW. J. Telecommun. Eng. 54, 1378â€“1384 (2014)
Gao, H., Zhuang, L., Laurens, V., Kilian, Q.W.: Densely connected convolutional networks. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2261â€“2269. IEEE Press, Piscataway (2017)
Xiong, Z., Mankun, X., Hua, P., Xin, Q., Tianyun, L.: Specific protocol signal recognition based on deep residual network. J. Acta Electronica Sinica. 47, 1532â€“1537 (2019)
Fagui, L., Yufei, C.: An Energy aware adaptive kernel density estimation approach to unequal clustering in wireless sensor networks. J. IEEE Access. 7, 40569â€“40580 (2019)
Nieto, J.W., Furman, W.N.: Constantamplitude waveform variations of US MILSTD188â€“110B and STANAG 4539. In: 2016 IET International Conference on Ionospheric Radio Systems and Techniques (IRST), pp. 212â€“216. IET Press, London (2006)
Baker, M., Beamish, W., Turner, M.: The use of MILSTD188â€“141A in HF data networks. In: IEEE Military Communications Conference, pp. 75â€“79. IEEE Press, Piscataway (2002)
Mohd, Y.R., Zainal, N., Abd, M.S.: Performance of 8FSK base on PACTOR I protocol over AWGN channels. In: 5th International Conference on Information Technology, Computer, and Electrical Engineering, pp. 1â€“5. IEEE Press, Piscataway (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.
The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
Copyright information
Â© 2022 The Author(s)
About this paper
Cite this paper
Zhang, J., Wu, D., Hu, T., Wang, S., Wang, S., Li, T. (2022). OpenSet Recognition of Shortwave Signal Based on DualInput Regression Neural Network. In: Qian, Z., Jabbar, M., Li, X. (eds) Proceeding of 2021 International Conference on Wireless Communications, Networking and Applications. WCNA 2021. Lecture Notes in Electrical Engineering. Springer, Singapore. https://doi.org/10.1007/9789811924569_88
Download citation
DOI: https://doi.org/10.1007/9789811924569_88
Published:
Publisher Name: Springer, Singapore
Print ISBN: 9789811924552
Online ISBN: 9789811924569
eBook Packages: EngineeringEngineering (R0)