Abstract
Diversification is a method of improving user satisfaction by increasing the variety of information shown to user. Due to the lack of a precise definition of information variety, many diversification techniques have been proposed. These techniques, however, have been rarely compared and analyzed under the same setting, rendering a ‘right’ choice for a particular application very difficult. Addressing this problem, this paper presents a benchmark that offers a comprehensive empirical study on the performance comparison of diversification. Specifically, we integrate several state-of-the-art diversification algorithms in a comparable manner, and measure distinct characteristics of these algorithms with various settings. We then provide in-depth analysis of the benchmark results, obtained by using both real data and synthetic data. We believe that the findings from the benchmark will serve as a practical guideline for potential applications.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
http://boston.lti.cs.cmu.edu/clueweb12/TRECcrowdsourcing2013/
Agrawal, R., et al.: Diversifying search results. In: WSDM, pp. 5–14 (2009)
Boriah, S., et al.: Similarity measures for categorical data: a comparative evaluation. In: SIAM, pp. 243–254 (2008)
Braschler, M.: CLEF 2001 - overview of results. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.) CLEF 2001. LNCS, vol. 2406, p. 9. Springer, Heidelberg (2002)
Carbonell, J., et al.: The use of MMR, diversity-based reranking for reordering documents and producing summaries. In: SIGIR, pp. 335–336 (1998)
Chandar, P., et al.: Preference based evaluation measures for novelty and diversity. In: SIGIR, pp. 413–422 (2013)
Clarke, C.L., et al.: Overview of the trec 2009 web track. Technical report, DTIC Document (2009)
Clarke, C.L., et al.: Novelty and diversity in information retrieval evaluation. In: SIGIR, pp. 659–666 (2008)
Deng, T., et al.: On the complexity of query result diversification. Proc. VLDB Endowment 6, 577–588 (2013)
Drosou, M., et al.: Search result diversification. ACM SIGMOD Rec. 39, 41–47 (2010)
Drosou, M., et al.: Disc diversity: result diversification based on dissimilarity and coverage. Proc. VLDB Endowment 6, 13–24 (2012)
Gabrilovich, E., et al.: Newsjunkie: providing personalized newsfeeds via analysis of information novelty. In: WWW, pp. 482–490 (2004)
Gollapudi, S., et al.: An axiomatic approach for result diversification. In: WWW, pp. 381–390 (2009)
Hasan, M., et al.: User effort minimization through adaptive diversification. In: KDD, pp. 203–212 (2014)
Jain, A., Sarda, P., Haritsa, J.R.: Providing diversity in k-nearest neighbor query results. In: Dai, H., Srikant, R., Zhang, C. (eds.) PAKDD 2004. LNCS (LNAI), vol. 3056, pp. 404–413. Springer, Heidelberg (2004)
Kando, N., et al.: Overview of IR tasks at the first NTCIR workshop. In: NTCIR, pp. 11–44 (1999)
Küçüktunç, O., et al.: Diversified recommendation on graphs: pitfalls, measures, and algorithms. In: WWW, pp. 715–726 (2013)
Rafiei, D., et al.: Diversifying web search results, pp. 781–790. In: WWW 2010 (2010)
Skoutas, D., et al.: Tag clouds revisited. In: CIKM, pp. 221–230 (2011)
Tong, H., et al.: Diversified ranking on large graphs: an optimization viewpoint. In: KDD, pp. 1028–1036 (2011)
Vaughan, L., et al.: Search engine coverage bias: evidence and possible causes. Inf. Process. Manag. 40, 693–707 (2004)
Verheij, A., et al.: A comparison study for novelty control mechanisms applied to web news stories. In: WI, pp. 431–436 (2012)
Vieira, M.R., et al.: On query result diversification. In: ICDE, pp. 1163–1174 (2011)
Yu, C., et al.: It takes variety to make a world: diversification in recommender systems. In: EDBT, pp. 368–378 (2009)
Zhai, C.X., et al.: Beyond independent relevance: methods and evaluation metrics for subtopic retrieval. In: SIGIR, pp. 10–17 (2003)
Zhang, B., Li, H., Liu, Y., Ji, L., Xi, W., Fan, W., Chen, Z., Ma, W.Y.: Improving web search results using affinity graph. In: SIGIR, pp. 504–511 (2005)
Zhu, X., et al.: Improving diversity in ranking using absorbing random walks. In: NAACL, pp. 97–104 (2007)
Ziegler, C.N., et al.: Improving recommendation lists through topic diversification. In: WWW, pp. 22–32 (2005)
Acknowledgment
The research has received funding from the ScienceWise project.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Thang, D.C., Tam, N.T., Hung, N.Q.V., Aberer, K. (2015). An Evaluation of Diversification Techniques. In: Chen, Q., Hameurlain, A., Toumani, F., Wagner, R., Decker, H. (eds) Database and Expert Systems Applications. Globe DEXA 2015 2015. Lecture Notes in Computer Science(), vol 9262. Springer, Cham. https://doi.org/10.1007/978-3-319-22852-5_19
Download citation
DOI: https://doi.org/10.1007/978-3-319-22852-5_19
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-22851-8
Online ISBN: 978-3-319-22852-5
eBook Packages: Computer ScienceComputer Science (R0)