Score Transformation in Linear Combination for Multi-criteria Relevance Ranking

Gerani, Shima; Zhai, ChengXiang; Crestani, Fabio

doi:10.1007/978-3-642-28997-2_22

Score Transformation in Linear Combination for Multi-criteria Relevance Ranking

Shima Gerani²²,
ChengXiang Zhai²³ &
Fabio Crestani²²

Conference paper

2883 Accesses
15 Citations

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7224))

Abstract

In many Information Retrieval (IR) tasks, documents should be ranked based on a combination of multiple criteria. Therefore, we would need to score a document in each criterion aspect of relevance and then combine the criteria scores to generate a final score for each document. Linear combination of these aspect scores has so far been the dominant approach due to its simplicity and effectiveness. However, such a strategy of combination requires that the scores to be combined are “comparable” to each other, an assumption that generally does not hold due to the different ways of scoring each criterion. Thus it is necessary to transform the raw scores for different criteria appropriately to make them more comparable before combination. In this paper we propose a new principled approach to score transformation in linear combination, in which we would learn a separate non-linear transformation function for each relevance criterion based on the Alternating Conditional Expectation (ACE) algorithm and BoxCox Transformation. Experimental results show that the proposed method is effective and is also robust against non-linear perturbations of the original scores.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Ounis, I., Macdonald, C., Soboroff, I.: Overview of the TREC-2008 blog track. In: Proceedings of TREC 2008 (2008)
Google Scholar
Bendersky, M., Croft, W.B., Diao, Y.: Quality-biased ranking of web documents. In: Proceedings of WSDM 2011, pp. 95–104 (2011)
Google Scholar
Xu, J., Li, H.: Adarank: a boosting algorithm for information retrieval. In: Proceedings of SIGIR 2007, pp. 391–398 (2007)
Google Scholar
Yue, Y., Finley, T., Radlinski, F., Joachims, T.: A support vector method for optimizing average precision. In: Proceedings of SIGIR 2007, pp. 271–278 (2007)
Google Scholar
Liu, T.Y., Xu, J., Qin, T., Xiong, W.Y., Li, H.: Letor: Benchmark dataset for research on learning to rank for information retrieval. In: Proceedings of SIGIR 2007 Workshops (2007)
Google Scholar
Lee, J.H.: Analyses of multiple evidence combination. In: Proceedings of SIGIR 1997, pp. 267–276 (1997)
Google Scholar
Montague, M., Aslam, J.A.: Relevance score normalization for metasearch. In: Proceedings of CIKM 2001, pp. 427–433 (2001)
Google Scholar
Manmatha, R., Rath, T., Feng, F.: Modeling score distributions for combining the outputs of search engines. In: Proceedings of SIGIR 2001, pp. 267–275 (2001)
Google Scholar
Manmatha, R., Sever, H.: A formal approach to score normalization for meta-search. In: Proceedings of HLT 2002, pp. 98–103 (2002)
Google Scholar
Arampatzis, A., Kamps, J.: A signal-to-noise approach to score normalization. In: Proceedings of CIKM 2009, pp. 797–806 (2009)
Google Scholar
Fernández, M., Vallet, D., Castells, P.: Using historical data to enhance rank aggregation. In: Proceedings of SIGIR 2006, pp. 643–644 (2006)
Google Scholar
Sever, H., Tolun, M.R.: Comparison of Normalization Techniques for Metasearch. In: Yakhno, T. (ed.) ADVIS 2002. LNCS, vol. 2457, pp. 133–143. Springer, Heidelberg (2002)
Chapter Google Scholar
Arampatzis, A., Robertson, S.: Modeling score distributions in information retrieval. Inf. Retr. 14, 26–46 (2011)
Article Google Scholar
Craswell, N., Robertson, S., Zaragoza, H., Taylor, M.: Relevance weighting for query independent evidence. In: Proceedings of SIGIR 2005, pp. 416–423 (2005)
Google Scholar
Veaux, R.D.D.: Finding Transformations for Regression Using the ACE Algorithm. Sociological Methods and Research 18(327), 327–359 (1989)
Article Google Scholar
Santos, R.L., He, B., Macdonald, C., Ounis, I.: Integrating Proximity to Subjective Sentences for Blog Opinion Retrieval. In: Boughanem, M., Berrut, C., Mothe, J., Soule-Dupuy, C. (eds.) ECIR 2009. LNCS, vol. 5478, pp. 325–336. Springer, Heidelberg (2009)
Chapter Google Scholar
Zhang, W., Yu, C., Meng, W.: Opinion retrieval from blogs. In: Proceedings of CIKM 2007, pp. 831–840 (2007)
Google Scholar
Lee, Y., Na, S.H., Kim, J., Nam, S.H., Jung, H.Y., Lee, J.H.: KLE at TREC 2008 blog track: Blog post and feed retrieval. In: Proceedings of TREC 2008 (2008)
Google Scholar
Gerani, S., Carman, M.J., Crestani, F.: Proximity-based opinion retrieval. In: Proceedings of SIGIR 2010, pp. 403–410 (2010)
Google Scholar
Breiman, L., Friedman, J.H.: Estimating Optimal Transformations for Multiple Regression and Correlation. American Statistical Association 80(391) (1985)
Google Scholar
Box, G.E.P., Cox, D.R.: An Analysis of Transformations. Journal of the Royal Statistical Society B(26), 211–252 (1964)
Google Scholar
Zhai, C., Lafferty, J.: A study of smoothing methods for language models applied to ad hoc information retrieval. In: Proceedings of SIGIR 2001, pp. 334–342 (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Informatics, University of Lugano, Lugano, Switzerland
Shima Gerani & Fabio Crestani
Department of Computer Science, University of Illinois at Urbana-Champaign, USA
ChengXiang Zhai

Authors

Shima Gerani
View author publications
You can also search for this author in PubMed Google Scholar
ChengXiang Zhai
View author publications
You can also search for this author in PubMed Google Scholar
Fabio Crestani
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Yahoo! Research, Diagonal 177, 08018, Barcelona, Spain
Ricardo Baeza-Yates & B. Barla Cambazoglu &
Centrum Wiskunde & Informatica, Science Park 123, Amsterdam, The Netherlands
Arjen P. de Vries
Websays, Nàpols 294 7-4, 08025, Barcelona, Spain
Hugo Zaragoza
Yahoo! Research, Diagnoal 177, 08018, Barcelona, Spain
Vanessa Murdock
Yahoo! Labs, Tower 3, Matam Park, 31905, Haifa, Israel
Ronny Lempel
ISTI-CNR, via G. Moruzzi, 1, 56124, Pisa, Italy
Fabrizio Silvestri

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gerani, S., Zhai, C., Crestani, F. (2012). Score Transformation in Linear Combination for Multi-criteria Relevance Ranking. In: Baeza-Yates, R., et al. Advances in Information Retrieval. ECIR 2012. Lecture Notes in Computer Science, vol 7224. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28997-2_22

Download citation

DOI: https://doi.org/10.1007/978-3-642-28997-2_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28996-5
Online ISBN: 978-3-642-28997-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics