Skip to main content

A Document Database Query Language

  • Conference paper
  • First Online:
Advances in Databases (BNCOD 2002)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2405))

Included in the following conference series:

  • 246 Accesses

Abstract

This work presents a natural language based technique to build user interfaces to query document databases through the web. We call such technique Bounded Natural Language (BNL). Interfaces based on BNL are useful to query document databases containing only structured data, containing only text or containing both of them. That is, the underlying formalism of BNL can integrate restrictions over structured and non-structured data (as text).

Interfaces using BNL can be programmed ad hoc for any document database but in this paper we present a system with an ontology based architecture in which the user interface is automatically generated by a software module (User Interface Generator) capable of reading and following the ontology. This ontology is a conceptualization of the database model, which uses a label in natural language for any concept in the ontology. Each label represents the usual name for a concept in the real world.

The ontology includes general concepts useful when the user is interested in documents in any corpus in the database, and specific concepts useful when the user is interested in a specific corpus. That is, databases can store one or more corpus of documents and queries can be issued either over the whole database or over a specific corpus.

The ontology guides the execution of the User Interface Generator and other software modules in such a way that any change in the database does not imply making changes in the program code, because the whole system runs following the ontology. That is, if a modification in the database schema occurs, only the ontology must be changed and the User Interface Generator will produce a new and different user interface adapted to the new database.

This work was partially supported by CICYT (TEL99-0335-C04-02)

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Askjeeves http://www.askjeeves.com. 2001

  2. Alonso, O. and Baeza-Yates, R. “A model and software architecture for search results visualization on the WWW”, Proceedings of the International Symposium on String Processing and Information Retrieval SPIRE 2000, IEEE Computer Society Press, A Coruña Spain, September 27–29, 2000, pp:8–15.

    Chapter  Google Scholar 

  3. Baeza-Yates, R.; Navarro, G. Integrating contents and structure in text retrieval. ACM SIG-MOD Record, 25(1):67–79, Marzo 1996.

    Google Scholar 

  4. Baeza-Yates, R.; Navarro, G.; Vegas, J.; Fuente, P. A model and a visual query language for structured text. En Berthier Ribeiro-Neto (Eds.) Proc. of the 5th Symposium on String Processing and Information Retrieval, pp:7–13, Santa Cruz, Bolivia, Sept 1998. IEEE CS Press.

    Google Scholar 

  5. Baeza-Yates, R.; Ribeiro-Neto, B. Modern Information Retrieval, Addison-Wesley, 1999.

    Google Scholar 

  6. Berry, M. W.; Dumais, S. T.; O’Brien, W. Using Linear Algebra for Intelligent Information Retrieval. SIAM Review 37:573–595, 1995.

    Article  MATH  MathSciNet  Google Scholar 

  7. Cousins, Steve B., Paepcke, Andreas, Winograd, Terry, Eric A. Bier and Ken Pier; The digital library integrated task environment (DLITE); Proceedings of the 2nd ACM international conference on Digital libraries, 1997, Pages 142–151

    Google Scholar 

  8. Gruber, T. Toward Principles for the Design of Ontologies Used for Knowledge Sharing. IJHCS, 43(5/6): 907–928. 1994.

    Google Scholar 

  9. Gruber, T. http://www-ksl.stanford.edu/kst/what-is-an-ontology.html

  10. Guarino, N. (ed.), Formal Ontology in Information Systems. Proceedings of FOIS’98. Amsterdam, IOS Press, pp. 3–15., Trento, Italy, 6–8 June 1998.

    Google Scholar 

  11. Hearst, Marti A. and Chandu Karadi; Cat-a-Cone: An interactive interface for specifying searches and viewing retrieval results using a large category hierarchy; Proceedings of the 20th annual international ACM SIGIR. Conference on Research and development in information retrieval, 1997, Pages 246–255.

    Google Scholar 

  12. Hearst, M. “User interfaces and visualization” in Modern Information Retrieval, Addison-Wesley, London, 1999

    Google Scholar 

  13. Koenemann, Juergen and Belkin, Nicholas (1996). A case for interaction: A study of interactive information retrieval behavior and effectiveness. Proc. CHI’96 Human Factors in Computing Systems, ACM Press, New York, NY, pp. 205–212.

    Google Scholar 

  14. Landauer, T., Egan, D., Remde, J., Lesk, M., Lochbaum, C., and Ketchum, D. Enhancing the usability of text through computer delivery and formative evaluation: The SuperBook project. Hypertext-A Psychological Perspective. Ellis Horwood, 1993.

    Google Scholar 

  15. Mena, E., Illarramendi, A., Kashyap, V., Sheth, A. OBSERVER: An Approach for Query Processing in Global Information Systems based on Interoperation across Pre-existing Ontologies. Journal Distributed And Parallel Databases (DAPD). 1998.

    Google Scholar 

  16. Ogawa, Y.; Morita, T.; Kobayashi, K. A fuzzy document retrieval system using the keyword connection matrix and a learning method. Fuzzy Sets and Systems, 39:163–179, 1991.

    Article  MathSciNet  Google Scholar 

  17. Penabad, M., Durán, M.J., Lalín, C, López, J.R., Paramá, J, Places, A. S. y Brisaboa, N.R. Using Bounded Natural Language to Query Databases on the Web. Proceeding of the Information Systems, Analysis and Synthesis ISAS’99. Orlando (Florida), Julio-Agosto 1999.

    Google Scholar 

  18. Rao, Ramana, Card, Stuart K., Jellinek, Herbert D., Mackinlay, Jock D. and Robertson, George G. The information grid: A framework for information retrieval and retrieval-centered applications. Proceedings of the fifth annual ACM symposium on User interface software and technology, 1992, Pages 23–32

    Google Scholar 

  19. Rijsbergen, C.J. van. Information Retrieval. Butterworths, 1979.

    Google Scholar 

  20. Robertson, G. C.; Sparck Jones, K. Relevance weighting of search terms. Journal of the American Society for Information Sciences, 27(3):129–146, 1976.

    Article  Google Scholar 

  21. Salton, G. Automatic information Organization and Retrieval. McGraw-hill, 1968.

    Google Scholar 

  22. Salton, G. The SMART Retrieval System-Experiments in Automatic Document Processing. Prentice Hall Inc., Englewood Cliffs, NJ, 1971.

    Google Scholar 

  23. Salton, G.; Fox, E. A.; Wu, H. Extended Boolean information retrieval. Communications of the ACM, 26(11):1022–1036, November 1983.

    Google Scholar 

  24. Salton, G., and Buckley, C. 1990. Improving retrieval performance by relevance feedback. Journal of the American Society for Information Science 41:288–297.

    Article  Google Scholar 

  25. Shneiderman. B. “Designing the User Interface: Strategies for Effective Human-Computer Interaction”, 3ed, Massachusetts Addison-Wesley, 1998.

    Google Scholar 

  26. Strzalkowski, Tomek, editor. Natural Language Information Retrieval. Kluwer Academic Publishers, Dordrecht, April 1999

    MATH  Google Scholar 

  27. Vegas, J. “Un sistema de recuperación de información sobre estructura y contenido”, Tesis doctoral. Universidad de Valladolid, Valladolid, Spain, 1999.

    Google Scholar 

  28. Wong, S. K. M.; Ziarko, W.; Wong, P. C. N. Generalized vector space model in information retrieval. Proc. 8th ACM SIGIR Conference on Research and Development in information Retrieval, pp:18–25, New Yok, USA, 1985.

    Google Scholar 

  29. World Wide Web Consortium. Standard X

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Brisaboa, N.R., Penabad, M.R., Places, Á.S., Rodríguez, F.J. (2002). A Document Database Query Language. In: Eaglestone, B., North, S., Poulovassilis, A. (eds) Advances in Databases. BNCOD 2002. Lecture Notes in Computer Science, vol 2405. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45495-0_20

Download citation

  • DOI: https://doi.org/10.1007/3-540-45495-0_20

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-43905-9

  • Online ISBN: 978-3-540-45495-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics