Skip to main content

iTM: An Efficient Algorithm for Frequent Pattern Mining in the Incremental Database without Rescanning

  • Conference paper
Next-Generation Applied Intelligence (IEA/AIE 2009)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5579))

Abstract

Frequent pattern mining plays an important role in the data mining community since it is usually a fundamental step in various mining tasks. However, maintenance of frequent patterns is very expensive in the incremental database. In addition, the status of a pattern changes with time. In other words, a frequent pattern is possible to become infrequent, and vice versa. In order to exactly find all frequent patterns, most algorithms have to scan the original database completely whenever an update occurs. In this paper, we propose a new algorithm iTM, stands for incremental Transaction Mapping algorithm for incremental frequent pattern mining without rescanning the whole database. It transfers the transaction dataset to the vertical representation such that the incremental dataset can be integrated to the original database easily. As demonstrated in our experiments, the proposed method is very efficient and suitable for mining frequent patterns in the incremental database.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Agrawal, R., Imielinski, T., Swami, A.: Mining Association Rules Between Sets of Items in Large Databases. In: 1993 ACM SIGMOD Conference, ACM SIGMOD, Washington, DC, pp. 207–216 (1993)

    Google Scholar 

  2. Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Francisco (1993)

    Google Scholar 

  3. Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. Wadsworth International Group (1984)

    Google Scholar 

  4. Lloyd, S.P.: Least Squares Quantization in PCM. IEEE Transactions on Information Theory it-28(2), 129–137 (1982)

    Google Scholar 

  5. Song, M., Rajasekaran, S.: A Transaction Mapping Algorithm for Frequent Itemsets Mining. IEEE Transactions on Knowledge and Data Engineering 18(4), 472–481 (2006)

    Article  Google Scholar 

  6. Cheung, D.W., Han, J., Ng, V.T., Wong, C.Y.: Maintenance of Discovered Association Rules in Large Databases: An Incremental Updating Technique. In: 12th International Conference on Data Engineering, ICDE, New Orleans, pp. 106–114 (1996)

    Google Scholar 

  7. Cheung, D.W., Lee, S.D., Kao, B.: A General Incremental Technique for Maintaining Discovered Association Rules. In: Fifth International Conference on Database Systems for Advanced Application, DASFAA, Melbourne, pp. 185–194 (1997)

    Google Scholar 

  8. Chang, C.C., Li, Y.C., Lee, J.S.: An Efficient Algorithm for Incremental Mining of Association Rules. In: 15th International Workshop on Research Issues in Data Engineering: Stream Data Mining and Applications, RIDE-SDMA 2005, pp. 3–10 (2005)

    Google Scholar 

  9. Zhang, S., Zhang, J., Zhang, C.: EDUA: An Efficient Algorithm for Dynamic Database Mining. Information Sciences 177, 2756–2767 (2007)

    Article  Google Scholar 

  10. Liu, B., Hsu, W., Ma, Y.: Mining Association Rules With Multiple Minimum Supports. In: Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM SIGKDD, San Diego, pp. 337–341 (1999)

    Google Scholar 

  11. Pei, J., Han, J., Mortazavi-Asl, B., Pinto, H., Chen, Q., Dayal, U., Hsu, M.C.: PrefixSpan: Mining Sequential Patterns Efficiently by Prefix-projected Pattern Growth. In: 17th International Conference on Data Engineering, ICDE 2001, pp. 215–224 (2001)

    Google Scholar 

  12. Leung, C.K.S., Khan, Q.I., Hoque, T.: CanTree: A Tree Structure for Efficient Incremental Mining of Frequent Patterns. In: Fifth IEEE International Conference on Data Mining, pp. 274–281. IEEE Computer Society Press, Los Alamitos (2005)

    Chapter  Google Scholar 

  13. Thomas, S., Bodagala, S., Alsabti, K., Ranka, S.: An Efficient Algorithm for the Incremental Updation of Association Rules in Large Databases. In: Third International Conference on Knowledge Discovery and Data Mining, KDD 1997, Newport Beach, pp. 263–266 (1997)

    Google Scholar 

  14. Lee, C.H., Lin, C.R., Chen, M.S.: Sliding-window Filtering: An Efficient Algorithm for Incremental Mining. In: Tenth International Conference on Information and Knowledge Management, CIKM 2001, Atlanta, pp. 263–270 (2001)

    Google Scholar 

  15. Cheung, W., Zaiane, O.R.: Incremental Mining of Frequent Patterns Without Candidate Generation or Support Constraint. In: Seventh International Database Engineering and Applications Symposium, pp. 111–116. IEEE Computer Society Press, Los Alamitos (2003)

    Google Scholar 

  16. Koh, J.-L., Shieh, S.-F.: An efficient approach for maintaining association rules based on adjusting FP-tree structures1. In: Lee, Y., Li, J., Whang, K.-Y., Lee, D. (eds.) DASFAA 2004. LNCS, vol. 2973, pp. 417–424. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  17. Frequent Itemset Mining Dataset Repository, http://fimi.cs.helsinki.fi/data/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Dai, BR., Lin, PY. (2009). iTM: An Efficient Algorithm for Frequent Pattern Mining in the Incremental Database without Rescanning. In: Chien, BC., Hong, TP., Chen, SM., Ali, M. (eds) Next-Generation Applied Intelligence. IEA/AIE 2009. Lecture Notes in Computer Science(), vol 5579. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02568-6_77

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-02568-6_77

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-02567-9

  • Online ISBN: 978-3-642-02568-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics