Skip to main content

Samples for Understanding Data-Semantics in Relations

Part of the Lecture Notes in Computer Science book series (LNAI,volume 2366)

Abstract

From statistics, sampling technics were proposed and some of them were proved to be very useful in many database applications. Rather surprisingly, it seems these works never consider the preservation of data semantics. Since functional dependencies (FDs) are known to convey most of data semantics, an interesting issue would be to construct samples preserving FDs satisfied in existing relations.

To cope with this issue, we propose in this paper to define Informative Armstrong Relations (IARs); a relation s is an IAR for a relation r if s is a subset of r and if FDs satisfied in s are exactly the same as FDs satisfied in r. Such a relation always exists since r is obviously an IAR for itself; moreover we shall point out that small IARs with interesting bounded sizes exist. Experiments on relations available in the KDD archive were conducted and highlight the interest of IARs to sample existing relations.

Keywords

  • Functional Dependency
  • Horn Clause
  • Data Semantic
  • Initial Relation
  • Extend Database Technology

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

This work is partially supported by the AS CNRS-STIC “Data Mining”

This is a preview of subscription content, access via your institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (Canada)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (Canada)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (Canada)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Reference

  1. William Ward Armstrong and Claude Delobel. Decomposition and functional dependencies in relations. ACM Transactions on Database Systems, 5(4):404–430, 1980.

    CrossRef  MATH  Google Scholar 

  2. W. W. Armstrong. Dependency structures of database relationships. In Jack L. Rosenfeld, editor, International Federation for Information Processing, Amsterdam, pages 580–583, 1974.

    Google Scholar 

  3. S. D. Bay. The UCI KDD Archive [http://kdd.ics.uci.edu]. Technical report, Irvine, CA: University of California, Department of Information and Computer Science, 1999.

    Google Scholar 

  4. C. Beeri, M. Dowd, R. Fagin, and R. Statman. On the structure of Armstrong relations for functional dependencies. Journal of the ACM, 31(1):30–56, January 1984.

    Google Scholar 

  5. F. De Marchi, S. Lopes, and J-M. Petit. Informative armstrong relations: Application to database analysis. In Bases de Données Avancées, Agadir, Maroc, October 2001.

    Google Scholar 

  6. F. De Marchi, S. Lopes, and J.-M. Petit. Efficient algorithms for mining inclusion dependenciess. In International Conference on Extending Database Technology, Prague, Czech Republic. To appear, 2002.

    Google Scholar 

  7. J. Demetrovics and V.D. Thi. Some remarks on generating armstrong and inferring functional dependencies relation. Acta Cybernetica, 12(2):167–180, 1995.

    MATH  MathSciNet  Google Scholar 

  8. R. Fagin. Armstrong databases. In IBM Symposium on Mathematical Foundations of Computer Science, Kanagawa, Japan, 1982.

    Google Scholar 

  9. R. Fagin. Horn clauses and database dependencies. Journal of the ACM, 99(4):952–985, 1982.

    CrossRef  MathSciNet  Google Scholar 

  10. M. Levene and G. Loizou. A Guided Tour of Relational Databases and Beyond. Springer, 1999.

    Google Scholar 

  11. S. Lopes, J.-M. Petit, and L. Lakhal. Efficient discovery of functional dependencies and armstrong relations. In Carlo Zaniolo, Peter C. Lockemann, Marc H. Scholl, and Torsten Grust, editors, International Conference on Extending Database Technology, Konstanz, Germany, volume 1777 of Lecture Notes in Computer Science, pages 350–364. Springer, 2000.

    Google Scholar 

  12. S. Lopes, J-M. Petit, and L. Lakhal. A framework for understanding existing databases. In Michel E. Adiba, Christine Collet, and Bipin C. Desai, editors, International Database Engineering and Applications Symposium, Grenoble, France, pages 330–338, July 2001.

    Google Scholar 

  13. S. Lopes, J-M. Petit, and F. Toumani. Discovering interesting inclusion dependencies: Application to logical database tuning. Information System, 17(1):1–19, 2002.

    CrossRef  Google Scholar 

  14. H. Mannila and K.-J. Räihä. Design by example: An application of arm-strong relations. Journal of Computer and System Sciences, 63(2):126–141, October 1986.

    Google Scholar 

  15. H. Mannila and K. J. Räihä. Algorithms for inferring functional-dependencies from relations. Data and Knowledge Engineering, 12(1):83–99, 1994.

    CrossRef  MATH  Google Scholar 

  16. A. M. Silva and M. A. Melkanoff. A method for helping discover the dependencies of a relation. In Hervé Gallaire, Jean-Marie Nicolas, and Jack Minker, editors, Advances in Data Base Theory, pages 115–133, Toulouse, France, 1979.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and Permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

De Marchi, F., Lopes, S., Petit, JM. (2002). Samples for Understanding Data-Semantics in Relations. In: Hacid, MS., Raś, Z.W., Zighed, D.A., Kodratoff, Y. (eds) Foundations of Intelligent Systems. ISMIS 2002. Lecture Notes in Computer Science(), vol 2366. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48050-1_60

Download citation

  • DOI: https://doi.org/10.1007/3-540-48050-1_60

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-43785-7

  • Online ISBN: 978-3-540-48050-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics