An Efficient Approach for Discovering Closed Frequent Patterns in High Dimensional Data Sets

Singh, Bharat; Singh, Raghvendra; Kushwaha, Nidhi; Vyas, O. P.

doi:10.1007/978-3-319-07353-8_60


Part of the book series: Smart Innovation, Systems and Technologies (SIST, volume 27)


Abstract

Advances in e-commerce and bioinformatics have produced large data sets with new characteristics. Microarray data sets, for example, contain a very large number of features (on the order of thousands) but very few rows, owing to the nature of the application. Association rule mining (ARM) can be used to analyze such data and uncover the characteristics hidden in it. However, most state-of-the-art ARM methods cannot effectively handle data sets with a large number of attributes. In this paper, we propose and implement a modified Carpenter algorithm built on a different choice of data structure, which gives better time complexity than a simple implementation of Carpenter.
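To make the idea of closed frequent patterns in "few rows, many features" data concrete, the following is a minimal sketch of brute-force row enumeration. It is not the Carpenter algorithm or the modified version proposed in the paper, whose data structure is not described in this abstract; it only illustrates the underlying fact that every closed pattern is the set of items shared by some group of rows. The function name `closed_frequent_patterns`, the parameter `min_support`, and the toy data are assumptions made for illustration.

```python
from itertools import combinations


def closed_frequent_patterns(rows, min_support):
    """Return {closed itemset: support} by brute-force row enumeration.

    Illustrative only: enumerates every subset of rows, so it is
    practical only when the number of rows is very small.
    """
    closed = {}
    n = len(rows)
    for size in range(1, n + 1):
        for subset in combinations(range(n), size):
            # Items shared by every row in this subset of rows.
            common = frozenset.intersection(*(rows[i] for i in subset))
            if not common or common in closed:
                continue
            # Support counts all rows containing the pattern,
            # not just the rows in the current subset.
            support = sum(1 for r in rows if common <= r)
            if support >= min_support:
                closed[common] = support
    return closed


# Hypothetical toy data: three rows (samples) over four items (features).
rows = [
    frozenset({"a", "b", "c", "d"}),
    frozenset({"a", "b", "d"}),
    frozenset({"b", "c", "d"}),
]
patterns = closed_frequent_patterns(rows, min_support=2)
for items, support in sorted(patterns.items(), key=lambda kv: sorted(kv[0])):
    print(sorted(items), support)
```

Enumerating rows rather than items is attractive here because the search space grows with the number of rows, which is small for microarray-style data, instead of with the thousands of features.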



Author information

Authors and Affiliations

Indian Institute of Information Technology, Allahabad, India
Bharat Singh, Raghvendra Singh, Nidhi Kushwaha & O. P. Vyas


Corresponding author

Correspondence to Bharat Singh.

Editor information

Editors and Affiliations

Indian Statistical Institute, Machine Intelligence Unit, Kolkata, India
Malay Kumar Kundu
Dept. of Computer Science and Engineering, National Institute of Technology Rourkela, Rourkela, India
Durga Prasad Mohapatra
Dept. of Electronics and Tele-Communication Engineering, Jadavpur University Artificial Intelligence Laboratory, Kolkata, India
Amit Konar
Dept. of Computer Science and Engineering, St. Thomas' College of Engineering & Technology, Kidderpore, West Bengal, India
Aruna Chakraborty


About this paper

Cite this paper

Singh, B., Singh, R., Kushwaha, N., Vyas, O.P. (2014). An Efficient Approach for Discovering Closed Frequent Patterns in High Dimensional Data Sets. In: Kumar Kundu, M., Mohapatra, D., Konar, A., Chakraborty, A. (eds) Advanced Computing, Networking and Informatics- Volume 1. Smart Innovation, Systems and Technologies, vol 27. Springer, Cham. https://doi.org/10.1007/978-3-319-07353-8_60


DOI: https://doi.org/10.1007/978-3-319-07353-8_60
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-07352-1
Online ISBN: 978-3-319-07353-8
eBook Packages: Engineering, Engineering (R0)
