An Efficient Method for Mining Clickstream Patterns

Bui, Bang V.; Vo, Bay; Huynh, Huy M.; Nguyen-Hoang, Tu-Anh; Huynh, Bao

doi:10.1007/978-3-319-99368-3_45

Bang V. Bui¹⁷,
Bay Vo¹⁸,
Huy M. Huynh¹⁹,
Tu-Anh Nguyen-Hoang¹⁷ &
…
Bao Huynh²⁰

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11103))

Included in the following conference series:

International Joint Conference on Rough Sets

1049 Accesses
1 Citations

Abstract

Recently, hybrid approaches, which combine an FP-tree-like data structure with an interaction-based approach, are efficient approaches for mining frequent itemsets. However, applying those approaches for sequential pattern mining arose some challenges. In this paper, we introduce a hybrid approach for a specific version of sequential pattern mining, clickstream pattern mining, with our proposed B-List structure and SMUB algorithm. The SMUB algorithm exploited the B-List structure that is generated from the SPPC tree and the B-List intersection are used to discover all sequential patterns in the given sequence database. Via our experiments on various databases, SMUB has been shown to be more efficient than the current state-of-the-art algorithm, CM-Spade, in terms of runtime, and scalability, especially on huge databases with very small thresholds.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Agrawal, R., Imieliński, T., Swami, A.: Mining association rules between sets of items in large databases. ACM Sigmod Rec. 22(2), 207–216 (1993)
Article Google Scholar
Agrawal, R., Srikant, R.: Mining sequential patterns. In: The Eleventh International Conference on Data Engineering, pp. 3–14. IEEE (1995)
Google Scholar
Srikant, R., Agrawal, R.: Mining sequential patterns: generalizations and performance improvements. In: Apers, P., Bouzeghoub, M., Gardarin, G. (eds.) EDBT 1996. LNCS, vol. 1057, pp. 1–17. Springer, Heidelberg (1996). https://doi.org/10.1007/BFb0014140
Chapter Google Scholar
Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. ACM Sigmod Rec. 29(2), 1–2 (2000)
Article Google Scholar
Zaki, M.J.: SPADE: an efficient algorithm for mining frequent sequences. Mach. Learn. 42(1–2), 31–60 (2001)
Article Google Scholar
Han, J., et al.: PrefixSpan: mining sequential patterns efficiently by prefix-projected pattern growth. In: The 17th International Conference on Data Engineering, pp. 215–224 (2001)
Google Scholar
Lin, C.-W., et al.: An incremental FUSP-tree maintenance algorithm. In: Proceedings of 2008 Eighth International Conference on Intelligent Systems Design and Applications, vol. 1, pp. 445–449. IEEE (2008)
Google Scholar
Bithi, A.A., Ferdaus, A.A.: Sequential pattern tree mining. IOSR J. Comput. Eng. 5(5), 79–89 (2013)
Article Google Scholar
Deng, Z.-H., Wang, Z., Jiang, J.: A new algorithm for fast mining frequent itemsets using N-Lists. Sci. China Inf. Sci. 55(9), 2008–2030 (2012)
Article MathSciNet Google Scholar
Deng, Z.-H.: DiffNodesets: an efficient structure for fast mining frequent itemsets. Appl. Soft Comput. 41, 214–223 (2016)
Article Google Scholar
Fournier-Viger, P., Gomariz, A., Campos, M., Thomas, R.: Fast vertical mining of sequential patterns using co-occurrence information. In: Tseng, V.S., Ho, T.B., Zhou, Z.-H., Chen, A.L.P., Kao, H.-Y. (eds.) PAKDD 2014. LNCS (LNAI), vol. 8443, pp. 40–52. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-06608-0_4
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

University of Information Technology, Vietnam National University HCMC, Ho Chi Minh City, Vietnam
Bang V. Bui & Tu-Anh Nguyen-Hoang
Faculty of Information Technology, Ho Chi Minh City University of Technology (HUTECH), Ho Chi Minh City, Vietnam
Bay Vo
Faculty of Electrical Engineering and Computer Science, Technical University of Ostrava (VŠB), Ostrava-Poruba, Czech Republic
Huy M. Huynh
Faculty of Information Technology, Ton Duc Thang University, Ho Chi Minh City, Vietnam
Bao Huynh

Authors

Bang V. Bui
View author publications
You can also search for this author in PubMed Google Scholar
Bay Vo
View author publications
You can also search for this author in PubMed Google Scholar
Huy M. Huynh
View author publications
You can also search for this author in PubMed Google Scholar
Tu-Anh Nguyen-Hoang
View author publications
You can also search for this author in PubMed Google Scholar
Bao Huynh
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Bao Huynh .

Editor information

Editors and Affiliations

University of Warsaw, Warsaw, Poland
Hung Son Nguyen
Faculty of Information Technology, Vietnam National University, Hanoi, Vietnam
Quang-Thuy Ha
School of Information Science, Southwest Jiaotong University, Chengdu, China
Tianrui Li
Institute of Computer Science, University of Silesia, Sosnowiec, Poland
Małgorzata Przybyła-Kasperek

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bui, B.V., Vo, B., Huynh, H.M., Nguyen-Hoang, TA., Huynh, B. (2018). An Efficient Method for Mining Clickstream Patterns. In: Nguyen, H., Ha, QT., Li, T., Przybyła-Kasperek, M. (eds) Rough Sets. IJCRS 2018. Lecture Notes in Computer Science(), vol 11103. Springer, Cham. https://doi.org/10.1007/978-3-319-99368-3_45

Download citation

DOI: https://doi.org/10.1007/978-3-319-99368-3_45
Published: 15 August 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-99367-6
Online ISBN: 978-3-319-99368-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics