A Last-Resort Semantic Cache for Web Queries

Ferrarotti, Flavio; Marin, Mauricio; Mendoza, Marcelo

doi:10.1007/978-3-642-03784-9_31

Flavio Ferrarotti^19,20,
Mauricio Marin¹⁹ &
Marcelo Mendoza¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 5721))

Included in the following conference series:

International Symposium on String Processing and Information Retrieval

1064 Accesses
6 Citations

Abstract

We propose a method to evaluate queries using a last-resort semantic cache in a distributed Web search engine. The cache stores a group of frequent queries and for each of these queries it keeps minimal data, that is, the list of machines that produced their answers. The method for evaluating the queries uses the inverse frequency of the terms in the queries stored in the cache (Idf) to determine when the results recovered from the cache are a good approximation to the exact answer set. Experiments show that the method is effective and efficient.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Amiri, K., Park, S., Tewari, R., Padmanabhan, S.: Scalable template-based query containment checking for web semantic caches. In: ICDE (2003)
Google Scholar
Baeza-Yates, R., Gionis, A., Junqueira, F., Murdock, V., Plachouras, V., Silvestri, F.: Design trade-offs for search engine caching. ACM TWEB 2(4) (2008)
Google Scholar
Chidlovskii, B., Roncancio, C., Schneider, M.: Semantic Cache Mechanism for Heterogeneous Web Querying. Computer Networks 31(11-16), 1347–1360 (1999)
Article Google Scholar
Chidlovskii, B., Borghoff, U.: Semantic Caching of Web Queries. VLDB Journal 9(1), 2–17 (2000)
Article Google Scholar
Dhillon, I., Mallela, S., Modha, D.S.: Information-theoretic co-clustering. In: KDD (2003)
Google Scholar
Fagni, T., Perego, R., Silvestri, F., Orlando, S.: Boosting the performance of Web search engines: Caching and prefetching query results by exploiting historical usage data. ACM TOIS 24(1), 51–78 (2006)
Article Google Scholar
Falchi, F., Lucchese, C., Orlando, S., Perego, R., Rabitti, F.: A Metric Cache for Similarity Search. In: LSDS-IR (2008)
Google Scholar
Godfrey, P., Gryz, J.: Answering Queries by Semantic Caches. In: Bench-Capon, T.J.M., Soda, G., Tjoa, A.M. (eds.) DEXA 1999. LNCS, vol. 1677, pp. 485–498. Springer, Heidelberg (1999)
Chapter Google Scholar
Jónsson, B., Arinbjarnar, M., THórsson, B., Franklin, M., Srivastava, D.: Performance and overhead of semantic cache management. In: TOIT, vol. 6(3) (2006)
Google Scholar
Karypis, G.: CLUTO software for clustering high-dimensional datasets, V. 2.1.1, http://glaros.dtc.umn.edu/gkhome/
Lempel, R., Moran, S.: Predictive caching and prefetching of query results in search engines. In: WWW (2003)
Google Scholar
Long, X., Suel, T.: Three-level caching for efficient query processing in large Web search engines. In: WWW (2005)
Google Scholar
Markatos, E.: On caching search engine query results. Computer Communications 24(7), 137–143 (2000)
Google Scholar
Puppin, D., Silvestri, F., Perego, R., Baeza-Yates, R.: Load-balancing and caching for collection selection architectures. In: INFOSCALE (2007)
Google Scholar
Puppin, D., Silvestri, F., Perego, R., Baeza-Yates, R.: Tuning the Capacity of Search Engines: Load-driven Routing and Incremental Caching to Reduce and Balance the Load. To appear in TOIS (2009)
Google Scholar
Gan, Q., Suel, T.: Improved Techniques for Result Caching in Web Search Engines. In: WWW (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Yahoo! Research Latin America, Santiago of Chile, Chile
Flavio Ferrarotti, Mauricio Marin & Marcelo Mendoza
University of Santiago of Chile, Chile
Flavio Ferrarotti

Authors

Flavio Ferrarotti
View author publications
You can also search for this author in PubMed Google Scholar
Mauricio Marin
View author publications
You can also search for this author in PubMed Google Scholar
Marcelo Mendoza
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Swedish Institute of Computer Science, Kista, Sweden
Jussi Karlgren
Department of Computer Science and Engineering, Helsinki University of Technology, P.O. Box 5400, 02015 HUT, Espoo, Finland
Jorma Tarhio
Department of Computer Sciences, University of Tampere, Tampere, Finland
Heikki Hyyrö

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ferrarotti, F., Marin, M., Mendoza, M. (2009). A Last-Resort Semantic Cache for Web Queries. In: Karlgren, J., Tarhio, J., Hyyrö, H. (eds) String Processing and Information Retrieval. SPIRE 2009. Lecture Notes in Computer Science, vol 5721. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03784-9_31

Download citation

DOI: https://doi.org/10.1007/978-3-642-03784-9_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03783-2
Online ISBN: 978-3-642-03784-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics