Abstract
We describe the problem of mining set valued rules in large relational tables containing categorical attributes taking a finite number of values. Such rules allow for an interval of possible values to be selected for each attribute in condition instead of a single value for association rules, while conclusion contains a projection of the data restricted by the condition onto a target attribute. An example of such a rule might be “if HOUSEHOLDSIZE = {Two OR Tree} AND OCCUPATION={Professional OR Clerical} THEN PAYMENT_METHOD = {CashCheck (Max=249, Sum=4952) OR DebitCard (Max=175, Sum=3021)} WHERE Confidence=85%, Support=10%.}” We use an original conceptional and formal framework for representing multidimensional distribution induced from data by a number of so-called prime disjunctions upper bounding its surface. Each prime disjunction represents a wide multidimensional interval of impossible combinations of attribute values. This original formalism generalises the conventional boolean approach in two directions: (i) finite-valued attributes (instead of only 0 and 1), and (ii) continuous-valued semantics instead of (true and false). In addition, we describe an efficient algorithm, which carries out the generalised dual transformation from possibilistic disjunctive normal form (DNF) representing data into conjunctive normal form (CNF) representing knowledge and thus generates all the most interesting prime disjunctions. Once obtained they can be used to build different forms of rules or for other purposes (prediction, clustering etc.).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
R. Agrawal, T. Imielinski, and A. Swami, Mining association rules between sets of items in large databases, Proc. of the ACM SIGMOD Conference on Management of Data, Washington, D.C., May 1993, 207–216.
R. Agrawal and R. Srikant, Fast algorithms for mining association rules in large databases, Proceedings of the 20th International Conference on Very Large Data Bases, pp. 487–499, 1994.
R.J. Bayardo Jr., Efficiently mining long patterns from databases, In Proc. of the 1998 SIGMOD Conf. on the Management of Data, 1998.
C. Borgelt and R. Kruse, Efficient maximum projection of database-induced multivariate possibility distributions, Proc. 7th IEEE International Conference on Fuzzy Systems (FUZZ-IEEE’98), Vol. 1, pp. 663–668, Anchorage, Alaska, 1998.
C. Borgelt and R. Kruse, Probabilistic and possibilistic networks and how to learn them from data, in: O. Kaynak, L. Zadeh, B. Turksen, and I. Rudas (eds.), Computational Intelligence: Soft Computing and Fuzzy-Neuro Integration with Applications, NATO ASI Series F, Vol. 162, pp. 403–426, Springer, New York, 1998.
P. Clark and R. Boswell. Rule induction with CN2: some recent improvements. In Y. Kodratoff, editor, Machine Learning — EWSL-91, pages 151–163, Berlin, 1991. Springer-Verlag.
P. Clark and T. Niblett. The CN2 induction algorithm. Machine Learning, 3(4):261–283, 1989.
W. Klösgen, Explora: A multipattern and multistrategy discovery assistant, In: Advances in Knowledge Discovery and Data Mining, U. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy (eds.), MIT Press, Cambridge MA, 249–271, 1996.
W. Klösgen, Applications and research problems of subgroup mining, In: Foundations of Intelligent Systems, Z. Ras and A. Skowron (eds.), Springer, Heidelberg, 1–15, 1999.
W. Klösgen, Subgroup patterns. In: Handbook of Data Mining and Knowledge Discovery, W. Klösgen and J. Zytkow (eds.), Oxford University Press, New York, 1999 (to appear).
R. Kruse and E. Schwecke, Fuzzy reasoning in a multidimensional space of hypotheses, Int. J. of Approximate Reasoning 4, 47–68, 1990.
D. Lin and Z.M. Kedem, Pincer-Search: A new algorithm for discovering the maximum frequent set. In Proc. of the Sixth European Conf. on Extending Database Technology, 1998.
H. Mannila, H. Toivonen, and A.I. Verkamo, Efficient algorithms for discovering association rules, In KDD-94: AAAI Workshop on Knowledge Discovery in Databases, Seattle, Washington, July 1994, 181–192.
A.A. Savinov, Matrix representation of fuzzy knowledge in attribute models, Preprint, Institute of Mathematics and CC, AS Moldavia, Kishinev, Shtiintsa, 1991 (in Russian).
A.A. Savinov, Forming knowledge by examples in fuzzy finite predicates, Proc. conf. “Hybrid Intellectual Systems”, Part 1, Rostov-na-Donu—Terskol, 177–179, 1991 (in Russian).
A.A. Savinov, Fuzzy multi-dimensional analysis, online paper, 1997, http://www.geocities.com/ResearchTriangle/7220/fmda_0.html.
A.A. Savinov, Fuzzy multi-dimensional analysis and resolution operation, Computer Sci. J. of Moldova 6(3), 252–285, 1998.
A.A. Savinov. Application of multi-dimensional fuzzy analysis to decision making. In: Advances in Soft Computing — Engineering Design and Manufacturing, R. Roy, T. Furuhashi and P.K. Chawdhry (eds.), Springer-Verlag, London, 1999.
A.A. Savinov, Mining possibilistic set-valued rules by generating prime disjunctions, Proc. 3rd European Conference on Principles and Practice of Knowledge Discovery in Databases — PKDD’99, Prague, Czech Republic, September 15-18, 1999, 536–541.
A.A. Savinov, An algorithm for induction of possibilistic set-valued rules by finding prime disjunctions, 4th On-line World Conference on Soft Computing in Industrial Applications (WSC4), 21-30 Sept. 1999.
A.A. Savinov, An algorithm for finding logical dependencies among multivalued attrbiutes, Perner, P. (Ed.), Proc. Fachgruppentreffen Maschinelles Lernen (FGML’99), Magdeburg, September 27-29, Institut fuer Bildverarbeitung und angewandte Informatik e.V., Leipzig, 8–14, 1999.
J.R. Slagel, C.-L. Chang, and R.C.T. Lee, A new algorithm for generating prime implicants, IEEE Trans. on Computers, C-19(4):304–310, 1970.
S. Wrobel, An algorithm for multi-relational discovery of subgroups, Proc. 1st European Symposium on Principles of KDD, eds. J. Komorowski, and J. Jytkow, Springer, Heidelberg, 78–87, 1997.
A.D. Zakrevsky, Logical inference in finite predicates, Preprint No.6, Institute of Technical Cybernetics, AS Belorussia, Minsk, 1989 (in Russian).
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer Science+Business Media New York
About this chapter
Cite this chapter
Savinov, A.A. (2000). Mining Interesting Possibilistic Set-Valued Rules. In: Ruan, D., Kerre, E.E. (eds) Fuzzy If-Then Rules in Computational Intelligence. The Springer International Series in Engineering and Computer Science, vol 553. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-4513-2_6
Download citation
DOI: https://doi.org/10.1007/978-1-4615-4513-2_6
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-7035-2
Online ISBN: 978-1-4615-4513-2
eBook Packages: Springer Book Archive