Genetic Algorithm Based Fuzzy Frequent Pattern Mining from Gene Expression Data

Mishra, Debahuti; Mishra, Shruti; Satapathy, Sandeep Kumar; Patnaik, Srikanta

doi:10.1007/978-3-642-25507-6_1

Part of the book series: Studies in Computational Intelligence (SCI, volume 395)

Abstract

Efficient algorithms have been developed for mining frequent patterns in traditional data, where the content of each transaction is known with certainty. Frequent pattern mining is a core technique used in many mining tasks, such as sequential pattern mining and correlative mining. Fuzzy logic provides a mathematical framework suited to data that are hard to quantify precisely yet qualitatively significant. The genetic algorithm (GA) is an optimization algorithm designed to mimic some of the processes observed in natural evolution. It is a stochastic search technique based on the mechanisms of natural selection and natural genetics, and it is general enough to be applied to an extremely wide range of problems. In this paper, we fuzzify our original dataset and apply various frequent pattern mining techniques to it. We then take the result of one of these techniques, frequent pattern (FP) growth, and apply GA to it. The frequent patterns found are used as the initial population, and for the selection criterion we use the mean squared residue score rather than a threshold value. Of the three fuzzy-based frequent pattern mining techniques and the GA-based fuzzy FP growth technique, the latter finds the best individual frequent patterns. Its run time and the number of frequent patterns it generates are also far better than those of the other techniques. To extend our findings, we also compared GA-based fuzzy FP growth with a conventional approach in which the dataset is normalized, FP growth is applied to find the frequent patterns, and GA is applied afterwards. Analysis of the results showed that GA-based fuzzy FP growth still yields the best individual frequent patterns.
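The abstract names the mean squared residue score as the GA selection criterion but gives no formula. The sketch below assumes the commonly used biclustering definition, in which each cell's residue is its value minus its row mean, minus its column mean, plus the overall mean of the submatrix. The encoding of a pattern as a pair (gene rows, condition columns) and the names `mean_squared_residue` and `select_fittest` are hypothetical, chosen for illustration only, and may differ from what the chapter actually does.

```python
from typing import List, Sequence, Tuple

# Hypothetical encoding: a pattern is (gene row indices, condition column indices).
Pattern = Tuple[Sequence[int], Sequence[int]]


def mean_squared_residue(data: Sequence[Sequence[float]],
                         rows: Sequence[int],
                         cols: Sequence[int]) -> float:
    """Mean squared residue of the submatrix of `data` given by rows x cols."""
    if not rows or not cols:
        raise ValueError("rows and cols must be non-empty")
    n_rows, n_cols = len(rows), len(cols)
    # Mean of each selected row and column, restricted to the submatrix.
    row_mean = {i: sum(data[i][j] for j in cols) / n_cols for i in rows}
    col_mean = {j: sum(data[i][j] for i in rows) / n_rows for j in cols}
    overall = sum(row_mean[i] for i in rows) / n_rows
    total = 0.0
    for i in rows:
        for j in cols:
            residue = data[i][j] - row_mean[i] - col_mean[j] + overall
            total += residue * residue
    return total / (n_rows * n_cols)


def select_fittest(data: Sequence[Sequence[float]],
                   population: List[Pattern],
                   keep: int) -> List[Pattern]:
    """Assumed selection step: keep the `keep` patterns with the lowest score."""
    ranked = sorted(population,
                    key=lambda p: mean_squared_residue(data, p[0], p[1]))
    return ranked[:keep]
```

Under this definition a lower score means the selected genes vary more coherently across the selected conditions, so ranking by the score removes the need for a fixed cut-off value, which matches the abstract's remark about not using a threshold.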

Author information

Authors and Affiliations

Institute of Technical Education & Research, Siksha O Anusandhan University, Bhubaneswar, Odisha, India
Debahuti Mishra, Shruti Mishra, Sandeep Kumar Satapathy & Srikanta Patnaik

Corresponding author

Correspondence to Debahuti Mishra.

Editor information

Editors and Affiliations

Computer Science and Engineering, SOA University, Bhubaneswar, 752 024, India
Srikanta Patnaik
School of Electronic Engineering, Kumoh National Institute of Technology, Yahoho-dong, Gyeongbuk 1, Gumi, 730-701, Republic of Korea (South Korea)
Yeon-Mo Yang

About this chapter

Cite this chapter

Mishra, D., Mishra, S., Satapathy, S.K., Patnaik, S. (2012). Genetic Algorithm Based Fuzzy Frequent Pattern Mining from Gene Expression Data. In: Patnaik, S., Yang, YM. (eds) Soft Computing Techniques in Vision Science. Studies in Computational Intelligence, vol 395. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25507-6_1

DOI: https://doi.org/10.1007/978-3-642-25507-6_1
Published: 11 February 2012
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25506-9
Online ISBN: 978-3-642-25507-6
eBook Packages: Engineering, Engineering (R0)
