Skip to main content

FastFDs: A Heuristic-Driven, Depth-First Algorithm for Mining Functional Dependencies from Relation Instances Extended Abstract

  • Conference paper
  • First Online:
Data Warehousing and Knowledge Discovery (DaWaK 2001)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2114))

Included in the following conference series:

Abstract

The problem of discovering functional dependencies (FDs) from an existing relation instance has received considerable attention in the database research community. To date, even the most efficient solutions have exponential complexity in the number of attributes of the instance. We develop an algorithm, FastFDs, for solving this problem based on a depth-first, heuristic-driven (DFHD) search for finding minimal covers of hypergraphs. The technique of reducing the FD discovery problem to the problem of finding minimal covers of hypergraphs was applied previously by Lopes et al. in the algorithm Dep-Miner. Dep-Miner employs a levelwise search for minimal covers, whereas FastFDs uses DFHD search. We report several tests on distinct benchmark relation instances involving Dep-Miner, FastFDs, and Tane. Our experimental results indicate that DFHD search is more efficient than Dep-Miner’s levelwise search or Tane’s partitioning approach for many of these benchmark instances.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agrawal, Rakesh; Mannila, Heikki; Srikant, Ramakrishnan; Toivonen, Hannu and Verkamo, A.I. “Fast Discovery of Association Rules.” Advances in KDD, AAA, Press, Menlo Park, CA, pg. 307–328, 1996.

    Google Scholar 

  2. Demetrovics, J; Katona, G; Miklos, D; Seleznjev, O. and Thalheim, B. “The Average Length of Keys and Functional Dependencies in (Random) Databases.” Lecture Notes in Computer Science, vol. 893, 1995.

    Google Scholar 

  3. Eiter, Thomas and Gottlob, Goerg. “Identifying the Minimal Traversals of a Hypergraph and Related Problems.” SIAM Journal of Computing, vol. 24,no. 6, pg. 1278–1304, 1995.

    Article  MATH  MathSciNet  Google Scholar 

  4. Flach, Peter and Savnik, Iztok. “Database Dependency Discovery: a Machine Learning Approach.” AI Comm. vol. 12,no. 3, pg 139–160.

    Google Scholar 

  5. Gunopulos, Dimitrios; Khardon, Roni; Mannila, Heikki; and Toivonen, Hannu. “Data Mining, Hypergraph Traversals, and Machine Learning (extended abstract)”, PODS, 1997, pg 209–216.

    Google Scholar 

  6. Huhtala, Ykä; Kärkkäinen, Juha; Porkka, Pasi and Toivonen, Hannu. “TANE: An Efficient Algorithm for Discovering Functional and Approximate Dependencies.” The Computer Journal, vol. 42,no. 2, 1999.

    Google Scholar 

  7. Kantola, Martti; Mannila, Heikki; Räihä, Kari-Jouko and Siirtola, Harri. “Discovering Functional and Inclusion Dependencies in Relational Databases.” Journal of Intelligent Systems, vol. 7, pg. 591–607, 1992.

    Article  MATH  Google Scholar 

  8. Lopes, Stephane; Petit, Jean-Marc and Lakhal, Lotfi. “Efficient Discovery of Functional Dependencies and Armstrong Relations.” Proceedings of ECDT 2000. Lecture Notes in Computer Science, vol 1777.

    Google Scholar 

  9. Mannila, Heikki and Räihä, Kari-Jouko. “Dependency Inference (Extended Abstract)”, Proceedings of the Very Large Databases Conference (VLDB), Brighton, pg. 155–158, 1987.

    Google Scholar 

  10. Mannila, Heikki and Räihä, Kari-Jouko. “Algorithms for Inferring Functional Dependencies from Relations”, Data & Knowledge Engineering, 12, pg. 83–99, 1994.

    Article  MATH  Google Scholar 

  11. Merz, C.J. and Murphy, P.M. UCI Machine Learning databases (1996). http://www.ics.uci.edu/~mlearn/MLRepository.html. Irvine, CA: University of California, Department of Information and Comp. Sci.

  12. The Tane and Tane/mem source code is available on the web at http://http://www.cs.helsinki.fi/research/fdk/datamining/tane

  13. Wyss, C; Giannella, C; and Robertson E. “FastFDs: A Heuristic-Driven, Depth-First Algorithm for Mining Functional Dependencies from Relation Instances”, Technical Report, Dept. of Comp. Sci, Indiana University, May 2001.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wyss, C., Giannella, C., Robertson, E. (2001). FastFDs: A Heuristic-Driven, Depth-First Algorithm for Mining Functional Dependencies from Relation Instances Extended Abstract. In: Kambayashi, Y., Winiwarter, W., Arikawa, M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2001. Lecture Notes in Computer Science, vol 2114. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44801-2_11

Download citation

  • DOI: https://doi.org/10.1007/3-540-44801-2_11

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-42553-3

  • Online ISBN: 978-3-540-44801-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics