Mining Interesting Possibilistic Set-Valued Rules

Savinov, Alexandr A.

doi:10.1007/978-1-4615-4513-2_6


Part of the book series: The Springer International Series in Engineering and Computer Science (SECS, volume 553)


Abstract

We describe the problem of mining set-valued rules in large relational tables whose categorical attributes take a finite number of values. In such rules, the condition may select an interval of possible values for each attribute, rather than the single value allowed in association rules, while the conclusion contains the projection onto a target attribute of the data restricted by the condition. An example of such a rule is “if HOUSEHOLDSIZE = {Two OR Three} AND OCCUPATION = {Professional OR Clerical} THEN PAYMENT_METHOD = {CashCheck (Max=249, Sum=4952) OR DebitCard (Max=175, Sum=3021)} WHERE Confidence=85%, Support=10%”. We use an original conceptual and formal framework in which the multidimensional distribution induced from data is represented by a number of so-called prime disjunctions that bound its surface from above. Each prime disjunction represents a wide multidimensional interval of impossible combinations of attribute values. This formalism generalises the conventional Boolean approach in two directions: (i) finite-valued attributes instead of only 0 and 1, and (ii) continuous-valued semantics instead of only true and false. In addition, we describe an efficient algorithm that carries out the generalised dual transformation from the possibilistic disjunctive normal form (DNF), which represents data, into the conjunctive normal form (CNF), which represents knowledge, and thus generates all the most interesting prime disjunctions. Once obtained, they can be used to build different forms of rules or for other purposes (prediction, clustering, etc.).
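The shape of a set-valued rule can be illustrated with a minimal Python sketch. It is not the chapter's algorithm: it only filters a table by a condition that allows a set of values per attribute and projects the matching rows onto a target attribute. The function name `project_rule`, the toy rows, and the choice to report support as the fraction of matching rows are assumptions made for illustration; the abstract does not define how Confidence, Max, or Sum are computed, so they are left out.

```python
from collections import Counter


def project_rule(rows, condition, target):
    """Return (support, projection) for a set-valued condition.

    rows      -- list of dicts mapping attribute name to categorical value
    condition -- dict mapping attribute name to the set of allowed values
    target    -- attribute onto which the matching rows are projected

    Assumption: support is the fraction of rows that satisfy the condition.
    """
    matching = [
        row for row in rows
        if all(row[attr] in allowed for attr, allowed in condition.items())
    ]
    support = len(matching) / len(rows) if rows else 0.0
    # The projection counts how often each target value occurs among matches.
    projection = Counter(row[target] for row in matching)
    return support, projection


# Toy table, invented for illustration only.
rows = [
    {"HOUSEHOLDSIZE": "Two", "OCCUPATION": "Professional", "PAYMENT_METHOD": "CashCheck"},
    {"HOUSEHOLDSIZE": "Three", "OCCUPATION": "Clerical", "PAYMENT_METHOD": "DebitCard"},
    {"HOUSEHOLDSIZE": "Two", "OCCUPATION": "Clerical", "PAYMENT_METHOD": "CashCheck"},
    {"HOUSEHOLDSIZE": "One", "OCCUPATION": "Professional", "PAYMENT_METHOD": "CreditCard"},
]
condition = {
    "HOUSEHOLDSIZE": {"Two", "Three"},
    "OCCUPATION": {"Professional", "Clerical"},
}
support, projection = project_rule(rows, condition, "PAYMENT_METHOD")
```

On this toy table three of the four rows satisfy the condition, and the projection lists the payment methods observed among them together with their counts, which mirrors the set-valued conclusion of the example rule.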



Author information

Authors and Affiliations

GMD (German National Research Center for Information Technology), AiS.KD (Autonomous Intelligent Systems Institute, Knowledge Discovery Team), Schloss Birlinghoven, D-53754 Sankt Augustin, Germany
Alexandr A. Savinov


Editor information

Editors and Affiliations

Belgian Nuclear Research Centre, SCK•CEN, Belgium
Da Ruan
University of Ghent, Belgium
Etienne E. Kerre


About this chapter

Cite this chapter

Savinov, A.A. (2000). Mining Interesting Possibilistic Set-Valued Rules. In: Ruan, D., Kerre, E.E. (eds) Fuzzy If-Then Rules in Computational Intelligence. The Springer International Series in Engineering and Computer Science, vol 553. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-4513-2_6


DOI: https://doi.org/10.1007/978-1-4615-4513-2_6
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-7035-2
Online ISBN: 978-1-4615-4513-2
eBook Packages: Springer Book Archive
