Abstract
In this paper, we propose a novel algorithm for mining frequent sequences, called SPaMi-FTS (Sequential Pattern Mining based on Frequent Two-Sequences). SPaMi-FTS introduces a new data structure to store the frequent sequences, which together with a new pruning strategy to reduce the number of candidate sequences and a new heuristic to generate them, allows to increase the efficiency of the frequent sequence mining. The experimental results show that the SPaMi-FTS algorithm has better performance than the main algorithms reported to discover frequent sequences.
Chapter PDF
Similar content being viewed by others
References
Agrawal, R., Srikant, R.: Mining Sequential Patterns. In: Agrawal, R., Srikant, R. (eds.) Proceedings of 1995 International Conference Data Engineering, ICDE 1995, pp. 3–14 (1995)
Yu, X., Li, M., Gyu Lee, D., Deuk Kim, K., Ho Ryu, K.: Application of Closed Gap-Constrained Sequential Pattern Mining in Web Log Data. Advances in Control and Communication 137, 649–656 (2012)
García-Hernández, R.A., Martínez-Trinidad, J.F., Carrasco-Ochoa, J.A.: A fast algorithm to find all the maximal frequent sequences in a text. In: Sanfeliu, A., Martínez Trinidad, J.F., Carrasco Ochoa, J.A. (eds.) CIARP 2004. LNCS, vol. 3287, pp. 478–486. Springer, Heidelberg (2004)
Liao, V.C.-C., Chen, M.S.: An Efficient Sequential Pattern Mining Algorithm for Motifs with Gap Constraints. In: Proceedings of the 2012 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), vol. 1, pp. 1–1 (2012)
Garofalakis, M., Rastogi, R., Shim, K.: SPIRIT: Sequential Pattern Mining with Regular Expression Constraints. In: Proceedings of the 25th International Conference on Very Large Data Bases, pp. 223–234 (1999)
Pei, J., Han, J., Mortazavi-asl, B., Pinto, H., Chen, Q., Dayal, U., Hsu, M.: PrefixSpan: Mining Sequential Patterns Efficiently by Prefix-Projected Pattern Growth, pp. 215–224 (2001)
Srikant, R., Agrawal, R.: Mining Sequential Patterns: Generalizations and Performance Improvements. In: Apers, P.M.G., Bouzeghoub, M., Gardarin, G. (eds.) EDBT 1996. LNCS, vol. 1057, pp. 1–17. Springer, Heidelberg (1996)
Zaki, M.J.: SPADE: An Efficient Algorithm for Mining Frequent Sequences. Machine Learning Journal 42, 31–60 (2001)
Ying Hsieh, C., Lin Yang, D., Wu, J.: An Efficient Sequential Pattern Mining Algorithm Based on the 2-Sequence Matrix. In: IEEE International Conference on Data Mining Workshops, pp. 583–591 (2008)
Gouda, K., Hassaan, M., Zaki, M.J.: PRISM: A Prime-Encoding Approach for Frequent Sequence Mining. Journal of Computer and System Sciences 76, 88–102 (2010)
Yang, Z., Wang, Y., Kitsuregawa, M.: LAPIN: Effective Sequential Pattern Mining Algorithms by Last Position Induction for Dense Databases, pp. 1020–1023 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Febrer-Hernández, J.K., Hernández-Palancar, J., Hernández-León, R., Feregrino-Uribe, C. (2014). SPaMi-FTS: An Efficient Algorithm for Mining Frequent Sequential Patterns. In: Bayro-Corrochano, E., Hancock, E. (eds) Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. CIARP 2014. Lecture Notes in Computer Science, vol 8827. Springer, Cham. https://doi.org/10.1007/978-3-319-12568-8_58
Download citation
DOI: https://doi.org/10.1007/978-3-319-12568-8_58
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12567-1
Online ISBN: 978-3-319-12568-8
eBook Packages: Computer ScienceComputer Science (R0)