The VLDB Journal

, Volume 22, Issue 1, pp 99–123 | Cite as

Keyword search on form results

  • Aditya Ramesh
  • S. Sudarshan
  • Purva Joshi
  • Manisha Naik Gaonkar
Special Issue Paper

Abstract

In recent years there has been a good deal of research in the area of keyword search on structured and semistructured data. Most of this body of work has a significant limitation in the context of enterprise data, since it ignores the application code that has often been carefully designed to present data in a meaningful fashion to users. In this work, we consider how to perform keyword search on enterprise applications, which provide a number of forms that can take parameters; parameters may be explicit, or implicit such as the identifier of the user. In the context of such applications, the goal of keyword search is, given a set of keywords, to retrieve forms along with corresponding parameter values, such that result of each retrieved form executed on the corresponding retrieved parameter values will contain the specified keywords. Some earlier work in this area was based on creating keyword indices on form results, but there are problems in maintaining such indices in the face of updates. In contrast, we propose techniques based on creating inverted SQL queries from the SQL queries in the forms. Unlike earlier work, our techniques do not require any special purpose indices and instead make use of standard text indices supported by most database systems. We have implemented our techniques and show that keyword search can run at reasonable speeds even on large databases with a significant number of forms.

Keywords

Keyword search Enterprise application forms Query inversion 

Notes

Acknowledgments

We thank Surajit Chaudhuri for discussions leading to the idea of inverting form queries. This work was partially supported by the Indo-German Max Planck Center for Computer Science (IMPECS) project, which is supported by DST India, BMBF Germany, and MPG, Max Planck Society.

References

  1. 1.
    Agrawal, S., Chaudhuri, S., Das, G.: DBXplorer: A system for keyword-based search over relational databases. ICDE, pp. 5–16 (2002)Google Scholar
  2. 2.
    Baid, A., Rae, I., Li, J., Doan, A., Naughton, J.F.: Toward scalable keyword search over relational data. PVLDB 3(1), 140–149 (2010)Google Scholar
  3. 3.
    Bhalotia, G., Hulgeri, A., Nakhe, C., Chakrabarti, S., Sudarshan, S.: Keyword searching and browsing in databases using BANKS. In ICDE, pp. 431–440 (2002)Google Scholar
  4. 4.
    Bowman, I.T., Salem, K.: Semantic prefetching of correlated query sequences. In ICDE, pp. 1284–1288 (2007)Google Scholar
  5. 5.
    Chu, E., Baid, A., Chai, X., Doan, A., Naughton, J.F.: Combining keyword search and forms for ad hoc querying of databases. In SIGMOD Conference, pp. 349–360 (2009)Google Scholar
  6. 6.
    Ding, B., Yu, J.X., Wang, S., Qin, L., Zhang, X., Lin, X.: Finding top-k min-cost connected trees in databases. In ICDE, pp. 836–845 (2007)Google Scholar
  7. 7.
    Duda, C., Graf, D.A., Kossmann, D.: Predicate-based indexing of enterprise Web applications. In CIDR, pp. 102–107 (2007)Google Scholar
  8. 8.
    Elhemali, M., Galindo-Legaria, C.A., Grabs, T., Joshi, M.: Execution strategies for SQL subqueries. In SIGMOD Conference, pp. 993–1004 (2007)Google Scholar
  9. 9.
    Guravannavar, R., Sudarshan, S.: Rewriting procedures for batched bindings. PVLDB 1(1), 1107–1123 (2008)Google Scholar
  10. 10.
    Hristidis, V., Papakonstantinou, Y.: Discover: keyword search in relational databases. In VLDB, pp. 670–681 (2002)Google Scholar
  11. 11.
    Liu, F., Yu, C.T., Meng, W., Chowdhury, A.: Effective keyword search in relational databases. In SIGMOD Conference, pp. 563–574 (2006)Google Scholar
  12. 12.
    Luo, Y., Lin, X., Wang, W., Zhou, X.: Spark: top-k keyword query in relational databases. In SIGMOD Conference, pp. 115–126 (2007)Google Scholar
  13. 13.
    Nandi, A., Jagadish, H.V.: Qunits: queried units in database search. In CIDR (2009)Google Scholar
  14. 14.
    Shao, F., Guo, L., Botev, C., Bhaskar, A., Chettiar, M., Yang, F., Shanmugasundaram, J.: Efficient keyword search over virtual XML views. VLDB J 18(2), 543–570 (2009)CrossRefGoogle Scholar
  15. 15.
    Silberschatz, A., Korth, H.F., Sudarshan, S.: Database System Concepts. McGraw-Hill, New York (2010)Google Scholar

Copyright information

© Springer-Verlag 2012

Authors and Affiliations

  • Aditya Ramesh
    • 1
  • S. Sudarshan
    • 2
  • Purva Joshi
    • 3
  • Manisha Naik Gaonkar
    • 2
  1. 1.Stanford UniversityStanfordUSA
  2. 2.Indian Institute of Technology BombayMumbaiIndia
  3. 3.SybasePuneIndia

Personalised recommendations