Samples for Understanding Data-Semantics in Relations

  • Fabien De Marchi
  • Stéphane Lopes
  • Jean-Marc Petit
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2366)

Abstract

From statistics, sampling technics were proposed and some of them were proved to be very useful in many database applications. Rather surprisingly, it seems these works never consider the preservation of data semantics. Since functional dependencies (FDs) are known to convey most of data semantics, an interesting issue would be to construct samples preserving FDs satisfied in existing relations.

To cope with this issue, we propose in this paper to define Informative Armstrong Relations (IARs); a relation s is an IAR for a relation r if s is a subset of r and if FDs satisfied in s are exactly the same as FDs satisfied in r. Such a relation always exists since r is obviously an IAR for itself; moreover we shall point out that small IARs with interesting bounded sizes exist. Experiments on relations available in the KDD archive were conducted and highlight the interest of IARs to sample existing relations.

Keywords

Functional Dependency Horn Clause Data Semantic Initial Relation Extend Database Technology 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Reference

  1. [AD80]
    William Ward Armstrong and Claude Delobel. Decomposition and functional dependencies in relations. ACM Transactions on Database Systems, 5(4):404–430, 1980.MATHCrossRefGoogle Scholar
  2. [Arm74]
    W. W. Armstrong. Dependency structures of database relationships. In Jack L. Rosenfeld, editor, International Federation for Information Processing, Amsterdam, pages 580–583, 1974.Google Scholar
  3. [Bay99]
    S. D. Bay. The UCI KDD Archive [http://kdd.ics.uci.edu]. Technical report, Irvine, CA: University of California, Department of Information and Computer Science, 1999.Google Scholar
  4. [BDFS84]
    C. Beeri, M. Dowd, R. Fagin, and R. Statman. On the structure of Armstrong relations for functional dependencies. Journal of the ACM, 31(1):30–56, January 1984.Google Scholar
  5. [DLP01]
    F. De Marchi, S. Lopes, and J-M. Petit. Informative armstrong relations: Application to database analysis. In Bases de Données Avancées, Agadir, Maroc, October 2001.Google Scholar
  6. [DLP02]
    F. De Marchi, S. Lopes, and J.-M. Petit. Efficient algorithms for mining inclusion dependenciess. In International Conference on Extending Database Technology, Prague, Czech Republic. To appear, 2002.Google Scholar
  7. [DT95]
    J. Demetrovics and V.D. Thi. Some remarks on generating armstrong and inferring functional dependencies relation. Acta Cybernetica, 12(2):167–180, 1995.MATHMathSciNetGoogle Scholar
  8. [Fag82a]
    R. Fagin. Armstrong databases. In IBM Symposium on Mathematical Foundations of Computer Science, Kanagawa, Japan, 1982.Google Scholar
  9. [Fag82b]
    R. Fagin. Horn clauses and database dependencies. Journal of the ACM, 99(4):952–985, 1982.CrossRefMathSciNetGoogle Scholar
  10. [LL99]
    M. Levene and G. Loizou. A Guided Tour of Relational Databases and Beyond. Springer, 1999.Google Scholar
  11. [LPL00]
    S. Lopes, J.-M. Petit, and L. Lakhal. Efficient discovery of functional dependencies and armstrong relations. In Carlo Zaniolo, Peter C. Lockemann, Marc H. Scholl, and Torsten Grust, editors, International Conference on Extending Database Technology, Konstanz, Germany, volume 1777 of Lecture Notes in Computer Science, pages 350–364. Springer, 2000.Google Scholar
  12. [LPL01]
    S. Lopes, J-M. Petit, and L. Lakhal. A framework for understanding existing databases. In Michel E. Adiba, Christine Collet, and Bipin C. Desai, editors, International Database Engineering and Applications Symposium, Grenoble, France, pages 330–338, July 2001.Google Scholar
  13. [LPT02]
    S. Lopes, J-M. Petit, and F. Toumani. Discovering interesting inclusion dependencies: Application to logical database tuning. Information System, 17(1):1–19, 2002.CrossRefGoogle Scholar
  14. [MR86]
    H. Mannila and K.-J. Räihä. Design by example: An application of arm-strong relations. Journal of Computer and System Sciences, 63(2):126–141, October 1986.Google Scholar
  15. [MR94]
    H. Mannila and K. J. Räihä. Algorithms for inferring functional-dependencies from relations. Data and Knowledge Engineering, 12(1):83–99, 1994.MATHCrossRefGoogle Scholar
  16. [SM79]
    A. M. Silva and M. A. Melkanoff. A method for helping discover the dependencies of a relation. In Hervé Gallaire, Jean-Marie Nicolas, and Jack Minker, editors, Advances in Data Base Theory, pages 115–133, Toulouse, France, 1979.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2002

Authors and Affiliations

  • Fabien De Marchi
    • 1
  • Stéphane Lopes
    • 2
  • Jean-Marc Petit
    • 1
  1. 1.Laboratoire LIMOS, CNRS UMR 2239Université Blaise Pascal - Clermont-Ferrand IIAubière cedexFrance
  2. 2.Laboratoire PRISMCNRS FRE 8636Versailles CedexFrance

Personalised recommendations