Query-By-Keywords (QBK): Query Formulation Using Semantics and Feedback

  • Aditya Telang
  • Sharma Chakravarthy
  • Chengkai Li
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5829)


The staples of information retrieval have been querying and search, for structured and unstructured repositories respectively. Query processing over known, structured repositories (e.g., databases) is well understood, and search has become ubiquitous for unstructured repositories (e.g., the Web). Searching structured repositories has also been explored to a limited extent. However, there has been little work on querying unstructured sources. We argue that querying unstructured sources is the next step in performing focused retrieval. This paper proposes a new approach to generating queries from search-like inputs for unstructured repositories. Instead of burdening the user with schema details, we believe that pre-discovered semantic information, in the form of taxonomies, context-based relationships among keywords, and attribute and operator compatibility, can be used to generate query skeletons. Furthermore, progressive feedback from users can be used to improve the accuracy of the generated query skeletons.
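
To make the idea of a query skeleton more concrete, the following is a minimal illustrative sketch, not the authors' implementation. It assumes a tiny hand-written taxonomy and an attribute/operator compatibility table; the names TAXONOMY, COMPATIBLE, Skeleton, build_skeleton, and apply_feedback are all hypothetical and do not come from the paper. It shows how keywords might be mapped to entities, attributes, operators, and values, how incompatible attribute/operator pairs could be set aside, and how one round of user feedback could prune a condition.

```python
from dataclasses import dataclass, field

# Hypothetical semantic information. The abstract calls this "pre-discovered",
# but its concrete form is not given there; these tables are assumptions.
TAXONOMY = {
    "castle": ("entity", "Castle"),
    "museum": ("entity", "Museum"),
    "price": ("attribute", "price"),
    "city": ("attribute", "city"),
    "below": ("operator", "<"),
    "above": ("operator", ">"),
    "in": ("operator", "="),
}

# Assumed attribute/operator compatibility: ordering makes sense for a
# price, but only equality makes sense for a city name.
COMPATIBLE = {
    "price": {"<", ">", "="},
    "city": {"="},
}


@dataclass
class Skeleton:
    entities: list = field(default_factory=list)
    conditions: list = field(default_factory=list)  # (attribute, operator, value)
    rejected: list = field(default_factory=list)    # incompatible triples
    unresolved: list = field(default_factory=list)  # values with no attribute/operator


def build_skeleton(keywords):
    """Turn a flat keyword list into a query skeleton (illustrative only)."""
    skeleton = Skeleton()
    attribute = operator = None
    for word in keywords:
        # Anything not found in the taxonomy is treated as a literal value.
        kind, name = TAXONOMY.get(word.lower(), ("value", word))
        if kind == "entity":
            skeleton.entities.append(name)
        elif kind == "attribute":
            attribute, operator = name, None
        elif kind == "operator":
            operator = name
        else:
            if attribute and operator:
                triple = (attribute, operator, name)
                if operator in COMPATIBLE[attribute]:
                    skeleton.conditions.append(triple)
                else:
                    skeleton.rejected.append(triple)
            else:
                skeleton.unresolved.append(name)
            attribute = operator = None
    return skeleton


def apply_feedback(skeleton, unwanted_condition):
    """Drop a condition the user marks as not intended (one feedback round)."""
    skeleton.conditions = [c for c in skeleton.conditions if c != unwanted_condition]
    return skeleton
```

Under these assumptions, calling build_skeleton on the keywords "castle price below 20 city in Paris" would produce one entity and two conditions, while "museum city above Rome" would place the city condition in the rejected list because ordering operators are not marked compatible with city. A real system would need a far richer taxonomy and context model than this toy lookup.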


Structure Query; Query Formulation; User Intent; Linguistic Meaning; Query Condition
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.





Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Aditya Telang (1)
  • Sharma Chakravarthy (1)
  • Chengkai Li (1)
  1. Department of Computer Science & Engineering, The University of Texas at Arlington, Arlington
