Running Time Prediction for Web Search Queries

  • Oscar Rojas
  • Veronica Gil-Costa
  • Mauricio Marin
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9574)


Large scale Web search engines have to process thousands of queries per second and each query has to be solved within a fraction of a second. To achieve this goal, search engines rely on sophisticated services capable of processing large amounts of data. One of these services is the search service (or index service) which is in charge of computing the top-k document results for user queries. Predicting in advance the response time of queries has practical applications in efficient administration of hardware resources assigned to query processing. In this paper, we propose and evaluate a query running time prediction algorithm that is based on a discrete Fourier transform which models the index as a collection of signals to obtain patterns. Results show that our approach performs at least as effectively as well-known prediction algorithms in the literature, while significantly improving computational efficiency.


WAND Inverted files Multi-threading 



This research was partially funded by Basal funds FB0001, Conicyt, Chile; PMI USA 1204 and PICT 2014-1146.


  1. 1.
    Broder, A.Z., Carmel, D., Herscovici, M., Soffer, A., Zien, J.Y.: Efficient query evaluation using a two-level retrieval process. In: CIKM, pp. 426–434 (2003)Google Scholar
  2. 2.
    Macdonald, N.T.C., Ounis, I.: Learning to predict response times for online query scheduling. In: SIGIR, pp. 621–630 (2012)Google Scholar
  3. 3.
    Chakrabarti, K., Chaudhuri, S., Ganti, V.: Interval-based pruning for top-k processing over compressed lists. In: ICDE, pp. 709–720 (2011)Google Scholar
  4. 4.
    Cronen-Townsend, S., Zhou, Y., Croft, W.B.: Predicting query performance. In: SIGIR, pp. 299–306 (2002)Google Scholar
  5. 5.
    Ding, S., Suel, T.: Faster top-k document retrieval using block-max indexes. In: SIGIR, pp. 993–1002 (2011)Google Scholar
  6. 6.
    Kim, S., He, Y., Hwang, S., Elnikety, S., Choi, S.: Delayed-dynamic-selective (DDS) prediction for reducing extreme tail latency in web search. In: WSDM, pp. 7–16 (2015)Google Scholar
  7. 7.
    Park, L., Ramamohanarao, K., Palaniswami, M.: Fourier domain scoring: a novel document ranking method. TKDE 16(5), 529–539 (2004)Google Scholar
  8. 8.
    Rojas, O., Gil-Costa, V., Marin, M.: Efficient parallel block-max wand algorithm. In: Wolf, F., Mohr, B., an Mey, D. (eds.) Euro-Par 2013. LNCS, vol. 8097, pp. 394–405. Springer, Heidelberg (2013)CrossRefGoogle Scholar
  9. 9.
    Tonellotto, N., Macdonald, C., Ounis, I.: Efficient and effective retrieval using selective pruning. In: WSDM, pp. 63–72 (2013)Google Scholar
  10. 10.
    Warren, T.: Clustering of time series data-a survey. JPR 38(11), 1857–1874 (2005)zbMATHGoogle Scholar

Copyright information

© Springer International Publishing Switzerland 2016

Authors and Affiliations

  • Oscar Rojas
    • 1
    • 2
  • Veronica Gil-Costa
    • 1
    • 2
  • Mauricio Marin
    • 1
    • 2
  1. 1.CITIAPS, DIINFUniversity of SantiagoSantiagoChile
  2. 2.Center for Biotechnology and BioengineeringSantiagoChile

Personalised recommendations