International Conference on Intelligent Science and Big Data Engineering

Intelligence Science and Big Data Engineering. Big Data and Machine Learning Techniques pp 487-498 | Cite as

A Novel Complex-Events Analytical System Using Episode Pattern Mining Techniques

  • Jerry C. C. Tseng
  • Jia-Yuan Gu
  • P. F. Wang
  • Ching-Yu Chen
  • Vincent S. Tseng
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9243)

Abstract

Along with the rapid development of IoT (Internet of Things), there comes the ‘Big Data’ era with the fast growth of digital data and the requirements rise for gaining useful knowledge by analyzing the rich data of complex types. How to effectively and efficiently apply data mining techniques to analyze the big data plays a crucial role in real-world use cases. In this paper, we propose a novel complex-events analytical system based on episode pattern mining techniques. The proposed system consists of four major components, including data preprocessing, pattern mining, rules management and prediction modules. For the core mining process, we proposed a new algorithm named EM-CES (Episode Mining over Complex Event Sequences) based on the sliding window approach. We also make the proposed system integrable with other application platform for complex event analysis, such that users can easily and quickly make use of it to gain the valuable information from complex data. Finally, excellent experimental results on a real-life dataset for electric power consumption monitoring validate the efficiency and effectiveness of the proposed system.

Keywords

Data mining Complex event analytics Episode pattern mining Multivariate sequence mining 

References

  1. 1.
    Aboalsamh, H.A., Hafez, A.M., Assassa, G.M.R.: An efficient stream mining technique. WSEAS Trans. Inf. Sci. Appl. 5(7), 1272–1281 (2008)Google Scholar
  2. 2.
    Casas-Garriga, G.: Discovering unbounded episodes in sequential data. In: Lavrač, N., Gamberger, D., Todorovski, L., Blockeel, H. (eds.) PKDD 2003. LNCS (LNAI), vol. 2838, pp. 83–94. Springer, Heidelberg (2003)CrossRefGoogle Scholar
  3. 3.
    Fournier-Viger, P., Nkambou, R., Tseng, V.S.: RuleGrowth: mining sequential rules common to several sequences by pattern-growth. In: ACM Symposium on Applied Computing, pp. 956–961, Taiwan (2011)Google Scholar
  4. 4.
    Lin, J., Keogh, E., Lonardi, S., Chiu, B.: A symbolic representation of time series, with implications for streaming algorithms. In: Proceedings of ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, pp. 2–11 (2003)Google Scholar
  5. 5.
    Lin, Y., Huang, C., Tseng, V.S.: A novel mining methodology for stock investment. J. Inf. Sci. Eng. 30, 571–585 (2014)Google Scholar
  6. 6.
    Lin, Y., Jiang, P., Tseng, V.S.: Efficient mining of frequent target episodes from complex event sequences. In: Proceedings of International Computer Symposium (2014)Google Scholar
  7. 7.
    Ma, X., Pang, H., Tan, K.L.: Finding constrained frequent episodes using minimal occurrences. In: Proceedings of IEEE International Conference on Data Mining, pp. 471–474 (2004)Google Scholar
  8. 8.
    Mallik, R., Kargupta, H.: A sustainable approach for demand prediction in smart grids using a distributed local asynchronous algorithm. In: Proceedings of CIDU Conference of Intelligent Data Understanding, pp. 01–15 (2011)Google Scholar
  9. 9.
    Mannila, H., Toivonen, H., Verkamo, A.I.: Discovering frequent episodes in sequences. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 210–215 (1995)Google Scholar
  10. 10.
    Mannila, H., Toivonen, H., Verkamo, A.I.: Discovering frequent episodes in sequences. Data Min. Knowl. Disc. 1(3), 259–289 (1997)CrossRefGoogle Scholar
  11. 11.
    Mannila, H., Toivonen, H.: Discovering generalized episodes using minimal occurrences. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 146–151 (1996)Google Scholar
  12. 12.
    Pei, J., Han, J., Mortazavi-Asl, B., Pinto, H., Chen, Q., Dayal, U., Hsu, M.: PrefixSpan: mining sequential patterns efficiently by prefix-projected pattern growth. In: Proceedings of International Conference on Data Engineering, pp. 215–226 (2001)Google Scholar
  13. 13.
    Tseng, V.S., Lee, C.-H.: Effective temporal data classification by integrating sequential pattern mining and probabilistic induction. Expert Syst. Appl. (ESWA) 36(5), 9524–9532 (2009)CrossRefGoogle Scholar
  14. 14.
    Wu, C.-W., Lin, Y.-F., Yu, P.S., Tseng, V.S.: Mining high utility episodes in complex event sequences. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 536–544 (2013)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  • Jerry C. C. Tseng
    • 1
  • Jia-Yuan Gu
    • 1
  • P. F. Wang
    • 2
  • Ching-Yu Chen
    • 2
  • Vincent S. Tseng
    • 1
  1. 1.Department of Computer Science and Information EngineeringNational Cheng Kung UniversityTainanTaiwan
  2. 2.Institute for Information IndustryTaipeiTaiwan

Personalised recommendations