Advertisement

An Efficient Approach for Discovering Closed Frequent Patterns in High Dimensional Data Sets

  • Bharat Singh
  • Raghvendra Singh
  • Nidhi Kushwaha
  • O. P. Vyas
Conference paper
Part of the Smart Innovation, Systems and Technologies book series (SIST, volume 27)

Abstract

The growth in the new technology in the field of e-commerce and bioinformatics has resulted in production of large data sets with few new uniqueness. Microarray datasets consist of a very large number of features (nearly thousands of features) but very less number of rows because of its application type. ARM can be used to analyze such data and find the characteristics hidden in these data. However, most state-of-the-art ARM methods are not able to tackle a datasets containing large number of attributes effectively. In this paper, we have proposed and implemented a modified Carpenter algorithm with different consideration of data structure, which in result give us the better time complexity in compare to simple implementation of Carpenter.

Keywords

High Dimensional Data Association Rule Mining (ARM) Closed Frequent Pattern Frequent Pattern Microarray Data 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Wright, A., McCoy, A., Henkin, S., Flaherty, M., Sittig, D.: Validation of an Association Rule Mining-Based Method to Infer Associations Between Medications and Problems. Ppl Clin. Inf. 4, 100–109 (2013)CrossRefGoogle Scholar
  2. 2.
    Pan, F., Cong, G., Tung, A.K.H.: Carpenter: Finding closed patterns in long biological datasets. In: Proceedings of ACM-SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 637–642 (2003)Google Scholar
  3. 3.
    Pei, J., Han, J., Mao, R.: CLOSET: An efficient algorithm for mining frequent closed item sets. In: Proceedings of ACM-SIGMOD International Workshop Data Mining and Knowledge Discovery, pp. 11–20 (2000)Google Scholar
  4. 4.
    Pan, F., Cong, G., Xin, X., Tung, A.K.H.: COBBLER: Combining Column and Row Enu-meration for Closed Pattern Discovery. In: International Conference on Scientific and Statistical Database Management, pp. 21–30 (2004)Google Scholar
  5. 5.
    Zaki, M., Hsiao, C.: Charm: An efficient algorithm for closed association rule mining. In: Proceedings of SDM, pp. 457–473 (2002)Google Scholar
  6. 6.
    Wang, J., Han, J., Pei, J.: Closet+: Searching for the best strategies for mining frequent closed item sets. In: Proceedings of 2003 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2003)Google Scholar
  7. 7.
    Chen, E.S., Hripcsak, G., Xu, H., Markatou, M., Friedma, C.: Automated Acquisition of Disease: Drug Knowledge from Biomedical and Clinical Documents: An Initial Study. J. Am. Med. Inform. Assoc., 87–98 (2008)Google Scholar
  8. 8.
    Sim, S., Gopalkrishnan, V., Zimek, A., Cong, G.: A survey on enhanced subspace clustering. Data Mining Knowl. Disc. 26, 332–397 (2013)CrossRefMATHMathSciNetGoogle Scholar
  9. 9.
    Cheeseman, P.: Auto class: A Bayesian classification system. In: 5th International Conference on Machine Learning. Morgan Kaufmann (1988)Google Scholar
  10. 10.
    Associates, D.S.: The new direct marketing. Business One Irwin, Illinois (1990)Google Scholar
  11. 11.
    Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: Proceedings of 1994 International Conference on Very Large Data Bases (VLDB 1994), pp. 487–499 (1994)Google Scholar
  12. 12.
    Bastide, Y., Taouil, R., Pasquier, N., Stumme, G., Lakhal, L.: Mining frequent closed itemsets with counting inference. SIGKDD Explorations 2(2), 71–80 (2000)CrossRefGoogle Scholar
  13. 13.
    Hongyan, L., Han, J.W.: Mining frequent Patterns from Very High Dimensional Data: A Top-down Row Enumeration Approach. In: Proceedings of the Sixth SIAM International Conference on Data Mining, pp. 20–22 (2006)Google Scholar
  14. 14.
    Pasquier, N., Bastide, Y., Taouil, R., Lakhal, L.: Discovering frequent closed itemsets for association rules. In: Proceedings of 7th International Conference on Database Theory, pp. 398–416 (1999)Google Scholar
  15. 15.
    Kriegel, H.P., Kröger, P., Zimek, A.: Clustering high-dimensional data: A survey on sub-space clustering, pattern-based clustering, and correlation clustering. ACM Transactions on Knowledge Discovery from Data 3(1), 1–58 (2009)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Bharat Singh
    • 1
  • Raghvendra Singh
    • 1
  • Nidhi Kushwaha
    • 1
  • O. P. Vyas
    • 1
  1. 1.Indian Institute of Information TechnologyAllahabadIndia

Personalised recommendations