Abstract
The use of large amounts of distributed and heterogeneous information has become extremely cumbersome; this difficulty is mainly related to exploring the data, rather than actually storing or exchanging it. The user who is interested in small bits of information is getting more and more confused when having to dig under a large volume of diverse and more importantly semi-structured data. In this paper,we propose an interactive and adaptive framework that guides the user in the search for data, by disclosing only a part of the underlying information at a time. It first provides the user with a high-level view of the raw data and gradually adapts to his/her needs in order to offer a refined answer. The proposed model offers the possibility to query a semistructured database based on general schema-related constraints imposed by the user or identified by the system, but without specific knowledge of the underlying metadata. This is achieved by receiving initially an amorphic query, which may consist of one or more basic paths, and helping the user to refine it gradually to a specific semistructured query, expressed in a language like XQuery or Lorel.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Cattell et al, R.: The Object Data Standard: ODMG 3.0. Morgan Kaufmann (2000)
Chaudhuri, S., Shim, K.: Optimizing queries with aggregate views. In: Extending Database Technology. (1996) 167–182
Chakravarthy, U., Grant, J., Minker, J.: Foundations of semantic query optimization for deductive databases. In: Foundations of Deductive Databases and Logic Programming. (1988) 243–273
Abiteboul, S.: Querying semi-structured data. In: Intl Conf on Database Theory. (1997) 1–18
Buneman, P.: Semistructured data: a tutorial. In: Symposium on Principles of Database Systems. (1997)
Evangelista-Filha, I., Laender, A., Silva, A.: Querying semistructured data by example: The QSByE interface. In: IntlWorkshop on Information Integration on theWeb (WIIW). (2001) 156–163
Buneman, P., Davidson, S., Fernandez, M., Suciu, D.: Adding structure to unstructured data. In: Intl Conf on Database Theory. (1997) 336–350
Nestorov, S., Abiteboul, S., Motwani, R.: Inferring structure in semistructured data. In: Workshop on Management of Semistructured Data. (1997)
Nestorov, S., Abiteboul, S., Motwani, R.: Extracting schema from semistructured data. In: ACM Intl Conf on Management of Data. (1998) 295–306
Kobayashi, M., Takeda, K.: Information retrieval on the web. ACM Computing Surveys 32 (2000) 144–173
Buneman, P., Fan, W., Simeon, J., Weinstein, S.: Constraints for semistructured data and xml. SIGMOD Record 30 (2001)
Buneman, P., Fan, W., Weinstein, S.: Query optimization for semistructured data using path constraints in a deterministic data model. In:Workshop on Database Programming Languages. (1999) 208–223
Wang, J., Chirn, G.W., Marr, T., Shapiro, B., Shasha, D., Zhang, K.: Combinatorial pattern discovery for scientific data: Some preliminary results. In: ACM SIGMOD Intl Conf on Management of Data. (1994) 115–125
Chandrasekaran, B., Josephson, J., Benjamins, V.: What are ontologies, and why do we need them? IEEE Intelligent Systems 14 (1999) 20–26
Maedche, A., Staab, S.: Ontology learning for the semantic web. IEEE Intelligent Systems 16 (2001) 72–79
Zhong, N.: Ontologies in web intelligence. In: Practical Applications of Intelligent Agents, Springer. (2001)
Vianu, V.: A web odyssey: From codd to XML. In: ACM PODS Symposium on Principles of Database Systems. (2001) 1–15
Garofalakis, M., Rastogi, R., Seshadri, S., Shim, K.: Data mining and the web: Past, present and future. In: ACM CIKM’99 2nd Workshop on Web Information and Data Management (WIDM’99). (1999) 43–47
Chinenyanga, T., Kushmerick, N.: Expressive retrieval from XML documents. In: ACM SIGIR Conf on Research and Development in Information Retrieval. (2001) 163–171
Fan, W., Libkin, L.: On XML integrity constraints in the presence of DTDs. In:ACM PODS Symposium on Principles of Database Systems. (2001) 114–125
Fan, W., Simeon, J.: Integrity constraints for XML. In:ACMPODS Symposium on Principles of Database Systems. (2000) 23–34
Agrawal, R., Srikant, R.: Mining sequential patterns. In: Intl Conf on Data Engineering (ICDE). (1995) 3–14
Srikant, R., Agrawal, R.: Mining sequential patterns: Generalizations and performance improvements. In: 5th Intl Conf on Extending Database Technology (EDBT). (1996) 3–17
Pei, J., Han, J., Mortazavi-Asl, B., Pinto, H., Chen, Q., Dayal, U., Hsu, M.C.: PrefixSpan mining sequential patterns efficiently by prefix projected pattern growth. In: Intl Conf on Data Engineering (ICDE). (2001) 215–224
Srikant, R., Agrawal, R.: Mining generalized association rules. In: Intl Conf on Very Large Databases (VLDB). (1995) 407–419
Garofalakis, M., Rastogi, R., Shim, K.: SPIRIT: Sequential pattern mining with regular expression constraints. In: Intl Conf on Very Large Databases (VLDB). (1999) 223–234
Ng, R., Lakshmanan, L., Han, J., Pang, A.: Exploratory mining and pruning optimizations of constrained association rules. In: ACM SIGMOD Intl Conf on Management of Data. (1998) 13–24
Wang, K., Liu, H.: Discovering typical structures of documents: A road map approach. In: ACM SIGIR Intl Conf on Research and Development in Information Retrieval. (1998) 146–154
Wang, K., Liu, H.: Discovering structural association of semistructured data. Knowledge and Data Engineering 12 (2000) 353–371
Goldman, R., Widom, J.: DataGuides: Enabling query formulation and optimization in semistructured databases. In: Twenty-third Intl Conf on Very Large Data Bases. (1997) 436–445
Goldman, R., Widom, J.: Interactive query and search in semistructured databases. In: First IntlWorkshop on theWeb and Databases (WebDB). (1998) 52–62
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Trigoni, A. (2002). Interactive Query Formulation in Semistructured Databases. In: Carbonell, J.G., Siekmann, J., Andreasen, T., Christiansen, H., Motro, A., Legind Larsen, H. (eds) Flexible Query Answering Systems. FQAS 2002. Lecture Notes in Computer Science(), vol 2522. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36109-X_28
Download citation
DOI: https://doi.org/10.1007/3-540-36109-X_28
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00074-7
Online ISBN: 978-3-540-36109-1
eBook Packages: Springer Book Archive