ISMIS 2002: Foundations of Intelligent Systems pp 565-573 | Cite as
Samples for Understanding Data-Semantics in Relations
Abstract
From statistics, sampling technics were proposed and some of them were proved to be very useful in many database applications. Rather surprisingly, it seems these works never consider the preservation of data semantics. Since functional dependencies (FDs) are known to convey most of data semantics, an interesting issue would be to construct samples preserving FDs satisfied in existing relations.
To cope with this issue, we propose in this paper to define Informative Armstrong Relations (IARs); a relation s is an IAR for a relation r if s is a subset of r and if FDs satisfied in s are exactly the same as FDs satisfied in r. Such a relation always exists since r is obviously an IAR for itself; moreover we shall point out that small IARs with interesting bounded sizes exist. Experiments on relations available in the KDD archive were conducted and highlight the interest of IARs to sample existing relations.
Keywords
Functional Dependency Horn Clause Data Semantic Initial Relation Extend Database TechnologyPreview
Unable to display preview. Download preview PDF.
Reference
- [AD80]William Ward Armstrong and Claude Delobel. Decomposition and functional dependencies in relations. ACM Transactions on Database Systems, 5(4):404–430, 1980.MATHCrossRefGoogle Scholar
- [Arm74]W. W. Armstrong. Dependency structures of database relationships. In Jack L. Rosenfeld, editor, International Federation for Information Processing, Amsterdam, pages 580–583, 1974.Google Scholar
- [Bay99]S. D. Bay. The UCI KDD Archive [http://kdd.ics.uci.edu]. Technical report, Irvine, CA: University of California, Department of Information and Computer Science, 1999.Google Scholar
- [BDFS84]C. Beeri, M. Dowd, R. Fagin, and R. Statman. On the structure of Armstrong relations for functional dependencies. Journal of the ACM, 31(1):30–56, January 1984.Google Scholar
- [DLP01]F. De Marchi, S. Lopes, and J-M. Petit. Informative armstrong relations: Application to database analysis. In Bases de Données Avancées, Agadir, Maroc, October 2001.Google Scholar
- [DLP02]F. De Marchi, S. Lopes, and J.-M. Petit. Efficient algorithms for mining inclusion dependenciess. In International Conference on Extending Database Technology, Prague, Czech Republic. To appear, 2002.Google Scholar
- [DT95]J. Demetrovics and V.D. Thi. Some remarks on generating armstrong and inferring functional dependencies relation. Acta Cybernetica, 12(2):167–180, 1995.MATHMathSciNetGoogle Scholar
- [Fag82a]R. Fagin. Armstrong databases. In IBM Symposium on Mathematical Foundations of Computer Science, Kanagawa, Japan, 1982.Google Scholar
- [Fag82b]R. Fagin. Horn clauses and database dependencies. Journal of the ACM, 99(4):952–985, 1982.CrossRefMathSciNetGoogle Scholar
- [LL99]M. Levene and G. Loizou. A Guided Tour of Relational Databases and Beyond. Springer, 1999.Google Scholar
- [LPL00]S. Lopes, J.-M. Petit, and L. Lakhal. Efficient discovery of functional dependencies and armstrong relations. In Carlo Zaniolo, Peter C. Lockemann, Marc H. Scholl, and Torsten Grust, editors, International Conference on Extending Database Technology, Konstanz, Germany, volume 1777 of Lecture Notes in Computer Science, pages 350–364. Springer, 2000.Google Scholar
- [LPL01]S. Lopes, J-M. Petit, and L. Lakhal. A framework for understanding existing databases. In Michel E. Adiba, Christine Collet, and Bipin C. Desai, editors, International Database Engineering and Applications Symposium, Grenoble, France, pages 330–338, July 2001.Google Scholar
- [LPT02]S. Lopes, J-M. Petit, and F. Toumani. Discovering interesting inclusion dependencies: Application to logical database tuning. Information System, 17(1):1–19, 2002.CrossRefGoogle Scholar
- [MR86]H. Mannila and K.-J. Räihä. Design by example: An application of arm-strong relations. Journal of Computer and System Sciences, 63(2):126–141, October 1986.Google Scholar
- [MR94]H. Mannila and K. J. Räihä. Algorithms for inferring functional-dependencies from relations. Data and Knowledge Engineering, 12(1):83–99, 1994.MATHCrossRefGoogle Scholar
- [SM79]A. M. Silva and M. A. Melkanoff. A method for helping discover the dependencies of a relation. In Hervé Gallaire, Jean-Marie Nicolas, and Jack Minker, editors, Advances in Data Base Theory, pages 115–133, Toulouse, France, 1979.Google Scholar