HySpirit — A probabilistic inference engine for hypermedia retrieval in large databases

  • Norbert Fuhr
  • Thomas Rölleke
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1377)

Abstract

HySpirit is a retrieval engine for hypermedia retrieval integrating concepts from information retrieval (IR) and deductive databases. The logical view on IR models retrieval as uncertain inference, for which we use probabilistic reasoning. Since the expressiveness of classical IR models is not sufficient for hypermedia retrieval, HySpirit is based on a probabilistic version of Datalog. In hypermedia retrieval, different nodes may contain contradictory information; thus, we introduce probabilistic four-valued Datalog. In order to support fact queries as well as content-based retrieval, HySpirit is based on an open world assumption, but allows for predicate-specific closed world assumptions. For performing efficient retrieval on large databases, our system provides access to external data. We demonstrate the application of HySpirit by giving examples for retrieval on images, structured documents and large databases.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Beeri C. and Kornatzky Y. A logical query language for hypermedia systems. Information Sciences, 77(1/2):1–37, 1994.CrossRefGoogle Scholar
  2. 2.
    Belnap N. A useful four-valued logic. In Dunn J.M. and Epstein G.., editors, Modern Uses of Multiple-Valued Logic. Reidel, Dordrecht, 1977.Google Scholar
  3. 3.
    Callan J.P., Croft W.B., and Harding S.M. The INQUERY retrieval system. In Proc. 3rd DEXA, pages 78–83, 1992.Google Scholar
  4. 4.
    Chaudhuri S. and Gravano L. Optimizing queries over multimedia repositories. In Proc. SIGMOD., pages 91–102, 1996.Google Scholar
  5. 5.
    Christophides V., Abiteboul S., and Cluet S. From structured documents to novel query facilities. In Proc. SIGMOD, pages 313–324, 1994.Google Scholar
  6. 6.
    Cooper G.F. The computational complexity of probabilistic inference using bayesian belief networks. Artificial Intelligence, 42:393–405, 1990.CrossRefMathSciNetGoogle Scholar
  7. 7.
    Dey D. and Sarkar S. A probabilistic relational model and algebra. ACM TOIS, 21(3):339–369, 1996.CrossRefGoogle Scholar
  8. 8.
    Flickner M., et al. Query by image and video content: The QBIC system. Computer, 28(9):23–32, 1995.CrossRefGoogle Scholar
  9. 9.
    Fuhr N. Probabilistic datalog — a logic for powerful retrieval methods. In Proc. SIGIR, pages 282–290, 1995.Google Scholar
  10. 10.
    Fuhr N. and Rölleke T. A probabilistic relational algebra for the integration of information retrieval and database systems. ACM TODS, 14(1):32–66, 1997.Google Scholar
  11. 11.
    Gelfond M. and Lifschitz V. Logic programs with classical negation. Logic Programming, Proc. 7th Conf., 1990.Google Scholar
  12. 12.
    Hermes Th., Klauck Ch., Krey\J., and Zhang J. Image retrieval for information systems. In Proc. IS&T/SPIE's Symposium on Electronic Imaging: Science & Technologie., 1995.Google Scholar
  13. 13.
    Meghini C. and Straccia U. A relevance terminological logic for information retrieval. In Proc. SIGIR, pages 197–205,1996.Google Scholar
  14. 14.
    Moffat Alistair and Zobel Justin. Self-indexing inverted files for fast text retrieval. ACM TOIS, 14(4):349–379, October 1996.CrossRefGoogle Scholar
  15. 15.
    Ng R. and Subrahmanian V. S. A semantical framework for supporting subjective and conditional probabilities in deductive databases. J. Autom. Reas., 10:191–235, 1993.CrossRefMathSciNetGoogle Scholar
  16. 16.
    Poole D. Probabilistic horn abduction and bayesian networks. Artificial Intelligence, 64:81–129, 1993.CrossRefGoogle Scholar
  17. 17.
    Rakow T.C., Neuhold E.J., and Löhr M. Multimedia database systems — the notion and the issues. In Proc. BTW, pages 1–29, 1995. Springer.Google Scholar
  18. 18.
    Rölleke T. and Fuhr N. Retrieval of complex objects using a four-valued logic. In Frei H.-P, Harmann D., SchÄuble P., and Wilkinson R., editors, Proc. SIGIR, pages 206–214, 1996.Google Scholar
  19. 19.
    Rölleke T. and Fuhr N. Probabilistic reasoning for large scale databases. In Proc. BTW, pages 118–132, 1997. Springer.Google Scholar
  20. 20.
    Ross K. Modular stratification and magic sets for datalog programs with negation. J. ACM, 41(6):1216–1266, November 1994.CrossRefGoogle Scholar
  21. 21.
    Takahashi Y Fuzzy database query languages and their relational completeness theorem. IEEE TKDE, 5(1):122, 1993.Google Scholar
  22. 22.
    van Rijsbergen C. J. A non-classical logic for information retrieval. Computer Journal, 29(6):481–485, 1986.CrossRefGoogle Scholar
  23. 23.
    van Rijsbergen C. J. Towards an information logic. In Proc. SIGIR, pages 77–86, 1989.Google Scholar
  24. 24.
    Wagner G. A database needs two kinds of negation. In Proc. MFDB, pages 357–371, 1991.Google Scholar
  25. 25.
    Wagner G. Ex contradictione nihil sequitur. In Proc. IJCAI, pages 538–543, 1991.Google Scholar
  26. 26.
    Wong S.K.M. and Yao Y.Y. On modeling information retrieval with probabilistic inference. ACM TOIS, 13(1):38–68, 1995.CrossRefGoogle Scholar
  27. 27.
    Zhang H.J., Smoliar S.W., and Tan Y.H. Towards automating content-based video indexing and retrieval. In Proc. Multi-Media Modeling, 1993.Google Scholar

Copyright information

© Springer-Verlag 1998

Authors and Affiliations

  • Norbert Fuhr
    • 1
  • Thomas Rölleke
    • 1
  1. 1.Informatik 6University of DortmundDortmundGermany

Personalised recommendations