Skip to main content

LTRRS: A Learning to Rank Based Algorithm for Resource Selection in Distributed Information Retrieval

  • Conference paper
  • First Online:
Information Retrieval (CCIR 2019)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11772))

Included in the following conference series:

Abstract

Resource selection is a key task in distributed information retrieval. There are many factors that affect the performance of resource selection. Learning to rank methods can effectively combine features and are widely used for document ranking in web search. But few of them are explored for resource selection. In this paper, we propose a resource selection algorithm based on learning to rank called LTRRS. By analyzing the factors affecting the effectiveness of resource selection, we extract multi-scale features including term matching features, topical relevance features and central sample index (CSI) based features. By training LambdaMART learning to rank model, we directly optimize NDCG metric of resource ranking list in LTRRS. Experiments on the Sogou-QCL dataset show that LTRRS algorithm can significantly outperform the baseline methods in NDCG and precision metrics.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Callan, J.: Distributed information retrieval. In: Croft, W.B. (ed.) Advances in Information Retrieval: Recent Research from the Center for Intelligent Information Retrieval, pp. 127–150. Springer, Boston (2000). https://doi.org/10.1007/0-306-47019-5_5

    Chapter  Google Scholar 

  2. Callan, J.P., Lu, Z., Croft, W.B.: Searching distributed collections with inference networks. In: Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 21–28. ACM (1995)

    Google Scholar 

  3. Xu, J., Croft, W.B.: Cluster-based language models for distributed retrieval. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 254–261. Citeseer (1999)

    Google Scholar 

  4. Si, L., Callan, J.: Relevant document distribution estimation method for resource selection. In: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 298–305. ACM, New York (2003)

    Google Scholar 

  5. Shokouhi, M.: Central-rank-based collection selection in uncooperative distributed information retrieval. In: Amati, G., Carpineto, C., Romano, G. (eds.) ECIR 2007. LNCS, vol. 4425, pp. 160–172. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-71496-5_17

    Chapter  Google Scholar 

  6. Kang, I.-H., Kim, G.: Query type classification for web document retrieval. In: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 64–71. ACM, New York (2003)

    Google Scholar 

  7. Arguello, J., Callan, J., Diaz, F.: Classification-based resource selection. In: Proceedings of the 18th ACM Conference on Information and Knowledge Management, pp. 1277–1286. ACM, New York (2009)

    Google Scholar 

  8. Xu, J., Li, X.: Learning to rank collections. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR 2007, Amsterdam, The Netherlands p. 765. ACM Press (2007)

    Google Scholar 

  9. Dai, Z., Kim, Y., Callan, J.: Learning to rank resources. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR 2017, Shinjuku, Tokyo, Japan, pp. 837–840. ACM Press (2017)

    Google Scholar 

  10. Kim, Y., Callan, J., Culpepper, J.S., Moffat, A.: Load-balancing in distributed selective search. In: Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, pp. 905–908. ACM (2016)

    Google Scholar 

  11. Kulkarni, A., Callan, J.: Selective search: Efficient and effective search of large textual collections (TOIS). ACM Trans. Inf. Syst. 33, 17 (2015)

    Article  Google Scholar 

  12. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)

    MATH  Google Scholar 

  13. Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Burges, C.J.C., Bottou, L., Welling, M., Ghahramani, Z., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems 26, pp. 3111–3119. Curran Associates, Inc. (2013)

    Google Scholar 

  14. Wu, Q., Burges, C.J., Svore, K.M., Gao, J.: Adapting boosting for information retrieval measures. Inf. Retr. 13, 254–270 (2010)

    Article  Google Scholar 

  15. Zheng, Y., Fan, Z., Liu, Y., Luo, C., Zhang, M., Ma, S.: Sogou-QCL: a new dataset with click relevance label. In: The 41st International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1117–1120. ACM (2018)

    Google Scholar 

Download references

Acknowledgement

The research of this paper was supported by Guangdong Natural Science Foundation (2015A030308017).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shoubin Dong .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wu, T., Liu, X., Dong, S. (2019). LTRRS: A Learning to Rank Based Algorithm for Resource Selection in Distributed Information Retrieval. In: Zhang, Q., Liao, X., Ren, Z. (eds) Information Retrieval. CCIR 2019. Lecture Notes in Computer Science(), vol 11772. Springer, Cham. https://doi.org/10.1007/978-3-030-31624-2_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-31624-2_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-31623-5

  • Online ISBN: 978-3-030-31624-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics