Skip to main content

Interactive Query Formulation in Semistructured Databases

  • Conference paper
  • First Online:
Flexible Query Answering Systems (FQAS 2002)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2522))

Included in the following conference series:

Abstract

The use of large amounts of distributed and heterogeneous information has become extremely cumbersome; this difficulty is mainly related to exploring the data, rather than actually storing or exchanging it. The user who is interested in small bits of information is getting more and more confused when having to dig under a large volume of diverse and more importantly semi-structured data. In this paper,we propose an interactive and adaptive framework that guides the user in the search for data, by disclosing only a part of the underlying information at a time. It first provides the user with a high-level view of the raw data and gradually adapts to his/her needs in order to offer a refined answer. The proposed model offers the possibility to query a semistructured database based on general schema-related constraints imposed by the user or identified by the system, but without specific knowledge of the underlying metadata. This is achieved by receiving initially an amorphic query, which may consist of one or more basic paths, and helping the user to refine it gradually to a specific semistructured query, expressed in a language like XQuery or Lorel.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Cattell et al, R.: The Object Data Standard: ODMG 3.0. Morgan Kaufmann (2000)

    Google Scholar 

  2. Chaudhuri, S., Shim, K.: Optimizing queries with aggregate views. In: Extending Database Technology. (1996) 167–182

    Google Scholar 

  3. Chakravarthy, U., Grant, J., Minker, J.: Foundations of semantic query optimization for deductive databases. In: Foundations of Deductive Databases and Logic Programming. (1988) 243–273

    Google Scholar 

  4. Abiteboul, S.: Querying semi-structured data. In: Intl Conf on Database Theory. (1997) 1–18

    Google Scholar 

  5. Buneman, P.: Semistructured data: a tutorial. In: Symposium on Principles of Database Systems. (1997)

    Google Scholar 

  6. Evangelista-Filha, I., Laender, A., Silva, A.: Querying semistructured data by example: The QSByE interface. In: IntlWorkshop on Information Integration on theWeb (WIIW). (2001) 156–163

    Google Scholar 

  7. Buneman, P., Davidson, S., Fernandez, M., Suciu, D.: Adding structure to unstructured data. In: Intl Conf on Database Theory. (1997) 336–350

    Google Scholar 

  8. Nestorov, S., Abiteboul, S., Motwani, R.: Inferring structure in semistructured data. In: Workshop on Management of Semistructured Data. (1997)

    Google Scholar 

  9. Nestorov, S., Abiteboul, S., Motwani, R.: Extracting schema from semistructured data. In: ACM Intl Conf on Management of Data. (1998) 295–306

    Google Scholar 

  10. Kobayashi, M., Takeda, K.: Information retrieval on the web. ACM Computing Surveys 32 (2000) 144–173

    Article  Google Scholar 

  11. Buneman, P., Fan, W., Simeon, J., Weinstein, S.: Constraints for semistructured data and xml. SIGMOD Record 30 (2001)

    Google Scholar 

  12. Buneman, P., Fan, W., Weinstein, S.: Query optimization for semistructured data using path constraints in a deterministic data model. In:Workshop on Database Programming Languages. (1999) 208–223

    Google Scholar 

  13. Wang, J., Chirn, G.W., Marr, T., Shapiro, B., Shasha, D., Zhang, K.: Combinatorial pattern discovery for scientific data: Some preliminary results. In: ACM SIGMOD Intl Conf on Management of Data. (1994) 115–125

    Google Scholar 

  14. Chandrasekaran, B., Josephson, J., Benjamins, V.: What are ontologies, and why do we need them? IEEE Intelligent Systems 14 (1999) 20–26

    Article  Google Scholar 

  15. Maedche, A., Staab, S.: Ontology learning for the semantic web. IEEE Intelligent Systems 16 (2001) 72–79

    Article  Google Scholar 

  16. Zhong, N.: Ontologies in web intelligence. In: Practical Applications of Intelligent Agents, Springer. (2001)

    Google Scholar 

  17. Vianu, V.: A web odyssey: From codd to XML. In: ACM PODS Symposium on Principles of Database Systems. (2001) 1–15

    Google Scholar 

  18. Garofalakis, M., Rastogi, R., Seshadri, S., Shim, K.: Data mining and the web: Past, present and future. In: ACM CIKM’99 2nd Workshop on Web Information and Data Management (WIDM’99). (1999) 43–47

    Google Scholar 

  19. Chinenyanga, T., Kushmerick, N.: Expressive retrieval from XML documents. In: ACM SIGIR Conf on Research and Development in Information Retrieval. (2001) 163–171

    Google Scholar 

  20. Fan, W., Libkin, L.: On XML integrity constraints in the presence of DTDs. In:ACM PODS Symposium on Principles of Database Systems. (2001) 114–125

    Google Scholar 

  21. Fan, W., Simeon, J.: Integrity constraints for XML. In:ACMPODS Symposium on Principles of Database Systems. (2000) 23–34

    Google Scholar 

  22. Agrawal, R., Srikant, R.: Mining sequential patterns. In: Intl Conf on Data Engineering (ICDE). (1995) 3–14

    Google Scholar 

  23. Srikant, R., Agrawal, R.: Mining sequential patterns: Generalizations and performance improvements. In: 5th Intl Conf on Extending Database Technology (EDBT). (1996) 3–17

    Google Scholar 

  24. Pei, J., Han, J., Mortazavi-Asl, B., Pinto, H., Chen, Q., Dayal, U., Hsu, M.C.: PrefixSpan mining sequential patterns efficiently by prefix projected pattern growth. In: Intl Conf on Data Engineering (ICDE). (2001) 215–224

    Google Scholar 

  25. Srikant, R., Agrawal, R.: Mining generalized association rules. In: Intl Conf on Very Large Databases (VLDB). (1995) 407–419

    Google Scholar 

  26. Garofalakis, M., Rastogi, R., Shim, K.: SPIRIT: Sequential pattern mining with regular expression constraints. In: Intl Conf on Very Large Databases (VLDB). (1999) 223–234

    Google Scholar 

  27. Ng, R., Lakshmanan, L., Han, J., Pang, A.: Exploratory mining and pruning optimizations of constrained association rules. In: ACM SIGMOD Intl Conf on Management of Data. (1998) 13–24

    Google Scholar 

  28. Wang, K., Liu, H.: Discovering typical structures of documents: A road map approach. In: ACM SIGIR Intl Conf on Research and Development in Information Retrieval. (1998) 146–154

    Google Scholar 

  29. Wang, K., Liu, H.: Discovering structural association of semistructured data. Knowledge and Data Engineering 12 (2000) 353–371

    Article  Google Scholar 

  30. Goldman, R., Widom, J.: DataGuides: Enabling query formulation and optimization in semistructured databases. In: Twenty-third Intl Conf on Very Large Data Bases. (1997) 436–445

    Google Scholar 

  31. Goldman, R., Widom, J.: Interactive query and search in semistructured databases. In: First IntlWorkshop on theWeb and Databases (WebDB). (1998) 52–62

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Trigoni, A. (2002). Interactive Query Formulation in Semistructured Databases. In: Carbonell, J.G., Siekmann, J., Andreasen, T., Christiansen, H., Motro, A., Legind Larsen, H. (eds) Flexible Query Answering Systems. FQAS 2002. Lecture Notes in Computer Science(), vol 2522. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36109-X_28

Download citation

  • DOI: https://doi.org/10.1007/3-540-36109-X_28

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-00074-7

  • Online ISBN: 978-3-540-36109-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics