Abstract
In the collected associated data streams, some potential outliers are often fixed with the normal data instances, thus, it is necessary to accurately detect the outliers to improve the reliability of the data streams. In real life, people are more concerned about whether some outliers existed in the small scale data instances that satisfy their constraints, rather than in the huge entire datasets. However, the existing association-based outlier detection methods were proposed to detect the outliers from the entire data streams, thus, the time consumption is very long. To content with the existence of the constraints, this paper proposes an efficient constrained minimal rare pattern-based outlier detection method for data streams, namely AMCMRP-Outlier, to process the succinct and convertible anti-monotonic constraints. In the pattern mining phase, the matrix structure is used to quickly mine the minimal rare patterns that satisfy the constraints, thus providing the pattern basis for the outlier detection. In the outlier detection phase, two deviation indices are defined to measure the deviation degree of each transaction, and then the transactions having large deviation degrees are determined as the outliers. Finally, extensive experiments on one synthetic dataset and two public datasets verify that the AMCMRP-Outlier method can accurately detect the outliers with less time cost.
This work was partly supported by National Natural Science Foundation of China (NSFC grant number: U1836116), and the project of Jiangsu provincial Six Talent Peaks (Grant number XYDXXJS-016).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
References
Branch, J.W., Giannella, C., Szymanski, B., Wolff, R., Kargupta, H.: In-network outlier detection in wireless sensor networks. Knowl. Inf. Syst. 34(1), 23–54 (2013)
Cai, S., Sun, R., Li, J., Deng, C., Li, S.: Abnormal detecting over data stream based on maximal pattern mining technology. In: Sun, Y., Lu, T., Xie, X., Gao, L., Fan, H. (eds.) ChineseCSCW 2018. CCIS, vol. 917, pp. 371–385. Springer, Singapore (2019). https://doi.org/10.1007/978-981-13-3044-5_27
Cai, S.H., Li, S.C., Yuan, G., Hao, S.B., Sun, R.Z.: MiFI-outlier: minimal infrequent itemset-based outlier detection approach on uncertain data stream. Knowl.-Based Syst. 191, 105268 (2020)
Hao, S., Cai, S., Sun, R., Li, S.: FCI-Outlier: an efficient frequent closed itemset-based outlier detecting approach on data stream. In: Sun, Y., Lu, T., Xie, X., Gao, L., Fan, H. (eds.) ChineseCSCW 2018. CCIS, vol. 917, pp. 176–187. Springer, Singapore (2019). https://doi.org/10.1007/978-981-13-3044-5_13
He, Z.Y., Xu, X.F., Huang, J.Z., Deng, S.C.: FP-Outlier: frequent pattern based outlier detection. Comput. Sci. Inf. Syst. 2(1), 103–118 (2005)
Hemalatha, C.S., Vaidehi, V., Lakshmi, R.: Minimal infrequent pattern based approach for mining outliers in data streams. Expert Syst. Appl. 42(4), 1998–2012 (2015)
Huang, J., Zhu, Q., Yang, L., Cheng, D., Wu, Q.: A novel outlier cluster detection algorithm without top-n parameter. Knowl Based Syst 121, 32–40 (2017)
Iraqi, O., Bakkali, H.E.: Application-level unsupervised outlier-based intrusion detection and prevention. Secur. Commun. Networks 5, 1–13 (2019)
Kontaki, M., Gounaris, A., Papadopoulos, A.N., Tsichlas, K., Manolopoulos, Y.: Efficient and flexible algorithms for monitoring distance-based outliers over data streams. Inf. Syst. 55, 37–53 (2016)
Li, X.J., Lv, J.C., Yi, Z.: Outlier detection using structural scores in a high-dimensional space. IEEE Trans. Cybern. 50(5), 2302–2310 (2020)
Radovanović, M., Nanopoulos, A., Ivanović, M.: Reverse nearest neighbors in unsupervised distance-based outlier detection. IEEE Trans. Knowl. Data Eng. 27(5), 1369–1382 (2015)
Sharma, V., Kumar, R., Cheng, W.H., Atiquzzaman, M., Srinivasan, K., Zomaya, A.: NHAD: neuro-fuzzy based horizontal anomaly detection in online social networks. IEEE Trans. Knowl. Data Eng. 30(11), 2171–2184 (2018)
Tang, B., He, H.: A local density-based approach for outlier detection. Neurocomputing 241, 171–180 (2017)
Vanhoeyveld, J., Martens, D., Peeters, B.: Value-added tax fraud detection with scalable anomaly detection techniques. Appl. Soft Comput. 86, 105895 (2020)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Cai, S., Chen, J., Li, X., Liu, B. (2020). Minimal Rare-Pattern-Based Outlier Detection Method for Data Streams by Considering Anti-monotonic Constraints. In: Susilo, W., Deng, R.H., Guo, F., Li, Y., Intan, R. (eds) Information Security. ISC 2020. Lecture Notes in Computer Science(), vol 12472. Springer, Cham. https://doi.org/10.1007/978-3-030-62974-8_16
Download citation
DOI: https://doi.org/10.1007/978-3-030-62974-8_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-62973-1
Online ISBN: 978-3-030-62974-8
eBook Packages: Computer ScienceComputer Science (R0)