Abstract
The security of data stream attracts more attention in daily life, the huge number of data stream makes it impossible to detect its exceptions, and the maximal frequent itemsets (MFIs) can perfectly imply data stream and the number is smaller, therefore, the time cost and memory usage are much more efficient. This paper proposes DMFI to detect the exceptions of data stream, an improved method called MRMFI and a pattern matching method called IM-Sunday and included in DMFI. MRMFI mines the MFIs from data stream and it uses two matrices to store the information, the frequent multiple-itemsets are generated by the extension of frequent 2-itemsets. Then, the exceptions are detected by using IM-Sunday algorithm to match the patterns in MFIs. Some experimental studies are conducted based on proposed method, the results show that the MRFIM method can mine MFIs in less time and DMFI can efficiently detect the exceptions of data stream.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Calders, T., Dexters, N., Gillis, J.J.M., et al.: Mining frequent itemsets in a stream. Inf. Syst. 39, 233–255 (2014)
Nori, F., Deypir, M., Sadreddini, M.H.: A sliding window based algorithm for frequent closed itemset mining over data streams. J. Syst. Softw. 86(3), 615–623 (2013)
Deng, Z.H.: DiffNodesets: an efficient structure for fast mining frequent itemsets. Appl. Soft Comput. 41, 214–223 (2016)
Shin, S.J., Lee, D.S., Lee, W.S.: CP-tree: an adaptive synopsis structure for compressing frequent itemsets over online data streams. Inf. Sci. 278, 559–576 (2014)
Li, H.F., Lee, S.Y., Shan, M.K.: Online mining (recently) maximal frequent itemsets over data streams. In: 15th International Workshop on Research Issues in Data Engineering: Stream Data Mining and Applications (RIDE-SDMA 2005), pp. 11–18. IEEE (2005)
Fan, G.D., Yin, S.H.: A frequent itemsets mining algorithm based on matrix in sliding window over data streams. In: 3rd International Conference Intelligent System Design and Engineering Applications, pp. 66–69 (2013)
Yan, Q.Y., Xia, S.X., Feng, K.W.: Probabilistic distance based abnormal pattern detection in uncertain series data. Knowl.-Based Syst. 36, 182–190 (2012)
Liu, J., Deng, H.F.: Outlier detection on uncertain data based on local information. Knowl.-Based Syst. 51, 60–71 (2013)
Böhm, C., Plant, C., Shao, J., et al.: Clustering by synchronization. In: Proceedings of 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 583–592. ACM (2010)
Kao, L.J., Huang, Y.P.: Association rules based algorithm for identifying outlier transactions in data stream. In: International Conference on Systems, Man, and Cybernetics (SMC), pp. 3209–3214. IEEE (2012)
Angiulli, F., Fassetti, F.: Detecting distance-based outliers in streams of data. In: Proceedings of the 16th Conference on Information and Knowledge Management, pp. 811–820. ACM (2007)
Wang, W., Guyet, T., Quiniou, R., et al.: Autonomic intrusion detection: adaptively detecting anomalies over unlabeled audit data streams in computer networks. Knowl.-Based Syst. 70, 103–117 (2014)
Knuth, D.E., Morris, J.H., Pratt, V.R.: Fast pattern matching in strings. SIAM J. Comput. 6(2), 323–350 (1977)
Cho, S., Na, J.C., Park, K., et al.: A fast algorithm for order-preserving pattern matching. Inf. Process. Lett. 115(2), 397–402 (2015)
Boyer, R.S., Moore, J.S.: A fast string searching algorithm. Commun. ACM 20(10), 762–772 (1977)
Sunday, D.M.: A very fast substring search algorithm. Commun. ACM 33(8), 132–142 (1990)
Chen, J.F., Cai, S.H., Zhu, L.L., et al.: An improved string-searching algorithm and its application in component security testing. Tsinghua Sci. Technol. 21(3), 281–294 (2016)
Acknowledgments
This work was supported by Scientific and technological key projects of Xinjiang Production & Construction Corps (Grant No. 2015AC023).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Cai, S., Sun, R., Cheng, C., Wu, G. (2017). Exception Detection of Data Stream Based on Improved Maximal Frequent Itemsets Mining. In: Xu, M., Qin, Z., Yan, F., Fu, S. (eds) Trusted Computing and Information Security. CTCIS 2017. Communications in Computer and Information Science, vol 704. Springer, Singapore. https://doi.org/10.1007/978-981-10-7080-8_10
Download citation
DOI: https://doi.org/10.1007/978-981-10-7080-8_10
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-7079-2
Online ISBN: 978-981-10-7080-8
eBook Packages: Computer ScienceComputer Science (R0)