Skip to main content

Advertisement

Log in

Mining fuzzy association rules from uncertain data

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

Association rule mining is an important data analysis method that can discover associations within data. There are numerous previous studies that focus on finding fuzzy association rules from precise and certain data. Unfortunately, real-world data tends to be uncertain due to human errors, instrument errors, recording errors, and so on. Therefore, a question arising immediately is how we can mine fuzzy association rules from uncertain data. To this end, this paper proposes a representation scheme to represent uncertain data. This representation is based on possibility distributions because the possibility theory establishes a close connection between the concepts of similarity and uncertainty, providing an excellent framework for handling uncertain data. Then, we develop an algorithm to mine fuzzy association rules from uncertain data represented by possibility distributions. Experimental results from the survey data show that the proposed approach can discover interesting and valuable patterns with high certainty.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Agrawal R, Imielinski T, Swami A (1993) Mining association rules between sets of items in large databases. In: Proceedings of ACM SIGMOD, Washington, DC, pp 207–216

  2. Agrawal R, Srikant R (1994) Fast algorithms for mining association rules. In: Proceedings of the 20th international conference on very large data bases. Santiago, Chile, pp 487–499

  3. Alcala-Fdez J, Alcala R, Gacto MJ, Herrera F (2009) Learning the membership function contexts for mining fuzzy association rules by using genetic algorithms. Fuzzy Set Syst 160(7): 905–921

    Article  MATH  Google Scholar 

  4. Arslan A, Kaya M (2001) Determination of fuzzy logic membership functions using genetic algorithms. Fuzzy Set Syst 118(2): 297–306

    Article  MATH  MathSciNet  Google Scholar 

  5. Berry M, Linoff G (1997) Data mining techniques: for marketing, sales, and customer support. Wiley, New York

    Google Scholar 

  6. Berti-Equille L (2007) Data quality awareness: a case study for cost-optimal association rule mining. Knowl Inf Syst 11(2): 191–215

    Article  Google Scholar 

  7. Chen YL, Ho CY (2005) A sampling-based method for mining frequent patterns from databases. Lect Notes Artif Int 3614: 536–545

    Google Scholar 

  8. Chen YL, Huang TCK (2005) Discovering fuzzy time-interval sequential patterns in sequence databases. IEEE Trans Syst Man Cybern B 35(5): 959–972

    Article  Google Scholar 

  9. Chen YL, Huang TCK (2006) A new approach for discovering fuzzy quantitative sequential patterns in sequence databases. Fuzzy Set Syst 157(12): 1641–1661

    Article  MATH  Google Scholar 

  10. Chen YL, Shen CC (2005) Mining generalized knowledge from ordered data through attribute-oriented induction techniques. Eur J Oper Res 166(1): 221–245

    Article  MATH  Google Scholar 

  11. Chen YL, Tang K, Shen RJ, Hu YH (2005) Market basket analysis in a multiple store environment. Decis Support Syst 40(2): 339–354

    Article  Google Scholar 

  12. Chen YL, Weng CH (2008) Mining association rules from imprecise ordinal data. Fuzzy Set Syst 159(4): 60–474

    Article  MathSciNet  Google Scholar 

  13. Cheng J, Ke Y, Ng W (2008) A survey on algorithms for mining frequent itemsets over data streams. Knowl Inf Syst 16(1): 1–27

    Article  MathSciNet  Google Scholar 

  14. Cheung DW, Ng VT, Fu AW, Fu YJ (1996) Efficient mining of association rules in distributed databases. IEEE Trans Knowl Data Eng 8(6): 911–922

    Article  Google Scholar 

  15. Chui C, Kao B, Hung E (2007) Mining frequent itemsets from uncertain data. Lect Notes Comput Sci 4426: 47–58

    Article  Google Scholar 

  16. Chung SM, Luo C (2008) Efficient mining of maximal frequent itemsets from databases on a cluster of workstations. Knowl Inf Syst 16(3): 359–391

    Article  Google Scholar 

  17. Conci A, Castro EMMM (2002) Image mining by content. Expert Syst Appl 23(4): 377–383

    Article  Google Scholar 

  18. De SK, Krishna PR (2004) Clustering web transactions using rough approximation. Fuzzy Set Syst 148(1): 131–138

    Article  MATH  MathSciNet  Google Scholar 

  19. Delgado M, Marin N, Sanchez D, Vila MA (2003) Fuzzy association rules: general model and applications. IEEE Trans Fuzzy Syst 11(2): 214–225

    Article  Google Scholar 

  20. Dempster AP (1967) Upper and lower probabilities induced by a multivalued mapping. Anuals Math Stat 38: 325–339

    Article  MATH  MathSciNet  Google Scholar 

  21. Denguir-Rekik A, Mauris G, Montmain J (2006) Propagation of uncertainty by the possibility theory in Choquet integral-based decision making: application to an E-commerce website choice support. IEEE Trans Instrum Meas 55(3): 721–728

    Article  Google Scholar 

  22. Djouadi Y, Redaoui S, Amroun K (2007) Mining fuzzy association rules from uncertain data. In: IEEE international fuzzy systems conference, London, pp 1-6

  23. Dubois D (2006) Possibility theory and statistical reasoning. Comput Stat Data Anal 51(1): 47–69

    Article  MATH  Google Scholar 

  24. Dubois D, Prade H (1988) Representation and combination of uncertainty with belief functions and possibility measures. Comput Intell 4(3): 244–264

    Article  Google Scholar 

  25. Eick CF, Rouhana A, Bagherjeiran A, Vilalta R (2006) Using clustering to learn distance functions for supervised similarity assessment. Eng Appl Artif Intel 19(4): 395–401

    Article  Google Scholar 

  26. Han J, Fu Y (1995) Discovery of multiple-level association rules from large databases. In: Proceedings of 1995 international conference on very large databases. Zurmh, Switzerland, pp 420–431

  27. Han J, Kamber WM (2001) Data mining: concepts and techniques. Morgan Kaufmann, San Francisco

    Google Scholar 

  28. Holt JD, Chung SM (2002) Mining association rules using inverted hashing and pruning. Inform Process Lett 83(4): 211–220

    Article  MATH  MathSciNet  Google Scholar 

  29. Hong TP, Chen JB (1999) Finding relevant attributes and membership functions. Fuzzy Set Syst 103(3): 389–404

    Article  Google Scholar 

  30. Hong TP, Kuo CS, Wang SL (2004) A fuzzy AprioriTid mining algorithm with reduced computational time. Appl Soft Comput 5(1): 1–10

    Article  Google Scholar 

  31. Hong TP, Lee CY (1996) Induction of fuzzy rules and membership functions from training examples. Fuzzy Set Syst 84(1): 33–47

    Article  MATH  MathSciNet  Google Scholar 

  32. Hong TP, Lin KY, Wang SL (2003) Fuzzy data mining for interesting generalized association rules. Fuzzy Set Syst 138(2): 255–269

    Article  MathSciNet  Google Scholar 

  33. Hsu HM, Wang WP (2001) Possibilistic programming in production planning of assemble-to-order environments. Fuzzy Set Syst 119(1): 59–70

    Article  MathSciNet  Google Scholar 

  34. Hsu PY, Chen YL, Ling CC (2004) Algorithms for mining association rules in bag databases. Inf Sci 166(1): 31–47

    Article  MATH  MathSciNet  Google Scholar 

  35. Hu YC, Chen RS, Tzeng GH (2003) Discovering fuzzy association rules using fuzzy partition methods. Knowl Based Syst 16(3): 137–147

    Article  Google Scholar 

  36. Hu YC, Tzeng GH (2003) Elicitation of classification rules by fuzzy data mining. Eng Appl Artif Intel 16(7-8): 709–716

    Article  Google Scholar 

  37. Hüllermeier E (2003) Possibilistic instance-based learning. Artif Intell 148((1-2): 335–383

    Article  MATH  Google Scholar 

  38. Ke Y, Cheng J, Ng W (2008) An information-theoretic approach to quantitative association rule mining. Knowl Inf Syst 16(2): 245–258

    Article  MathSciNet  Google Scholar 

  39. Lee G, Lee KL, Chen ALP (2001) Efficient graph-Based algorithms for discovering and maintaining association rules in large databases. Knowl Inf Syst 3(3): 338–355

    Article  MATH  MathSciNet  Google Scholar 

  40. Leung CW, Chan SC, Chung F (2006) A collaborative filtering framework based on fuzzy association rules and multiple-level similarity. Knowl Inf Syst 10(3): 357–381

    Article  Google Scholar 

  41. Loiseau Y, Prade H, Boughanem M (2004) Qualitative pattern matching with linguistic terms. AI Commun 17(1): 25–34

    MATH  MathSciNet  Google Scholar 

  42. Ordonez C, Ezquerra N, Santana CA (2006) Constraining and summarizing association rules in medical data. Knowl Inf Syst 9(3): 259–283

    Article  Google Scholar 

  43. Oussalah M, Maaref H, Barret C (2001) New fusion methodology approach and application to mobile robotics: investigation in the framework of possibility theory. Inf Fusion 2(1): 31–48

    Article  Google Scholar 

  44. Pedrycz W (1998) Fuzzy set technology in knowledge discovery. Fuzzy Set Syst 98(3): 279–290

    Article  Google Scholar 

  45. Prade H, Testemale C (1984) Generalizing database relational algebra for the treatment of incomplete or uncertain information and vague queries. Inf Sci 34(22): 115–143

    Article  MATH  MathSciNet  Google Scholar 

  46. Rastogi R, Shim K (2001) Mining optimized support rules for numeric attributes. Inf Syst 26(6): 425–444

    Article  MATH  Google Scholar 

  47. Shafer G. (1976) A Mathematical Theory of Evidence. Princeton University Press, Princeton

    MATH  Google Scholar 

  48. Shu JY, Tsang ECC, Yeung DS (2001) Query fuzzy association rules in relational database. In: IFSA World Congress and 20th NAFIPS international conference. Vancouver, BC, Canada, pp 2989-2993

  49. Shyu ML, Haruechaiyasak C, Chen SC, Premaratne K, Mining association rules with uncertain item relationships, in [http://www.cs.fiu.edu/~chens/PDF/SCI02.pdf]

  50. Wu X, Zhang C, Zhang S (2005) Database classification for multi-database mining. Inform Syst 30(1): 71–88

    Article  Google Scholar 

  51. Yun H, Ha D, Hwang B, Ryu K Ho (2003) Mining association rules on significant rare data using relative support. J Syst Software 67(3): 181–191

    Article  Google Scholar 

  52. Zadeh LA (1965) Fuzzy sets. Inf Control 8: 338–353

    Article  MATH  MathSciNet  Google Scholar 

  53. Zadeh LA (1978) Fuzzy sets as a basis for a theory of possibility. Fuzzy Set Syst 1: 3–28

    Article  MATH  MathSciNet  Google Scholar 

  54. Zhang S, Wu X, Zhang C, Lu J (2008) Computing the minimum-support for mining frequent patterns. Knowl Inf Syst 15(2): 233–257

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Cheng-Hsiung Weng.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Weng, CH., Chen, YL. Mining fuzzy association rules from uncertain data. Knowl Inf Syst 23, 129–152 (2010). https://doi.org/10.1007/s10115-009-0223-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-009-0223-1

Keywords

Navigation