Applying Machine Learning Diversity Metrics to Data Fusion in Information Retrieval

Leonard, David; Lillis, David; Zhang, Lusheng; Toolan, Fergus; Collier, Rem W.; Dunnion, John

doi:10.1007/978-3-642-20161-5_73

Applying Machine Learning Diversity Metrics to Data Fusion in Information Retrieval

David Leonard²¹,
David Lillis²¹,
Lusheng Zhang²¹,
Fergus Toolan²¹,
Rem W. Collier²¹ &
…
John Dunnion²¹

Conference paper

6725 Accesses
2 Citations

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6611))

Abstract

The Supervised Machine Learning task of classification has parallels with Information Retrieval (IR): in each case, items (documents in the case of IR) are required to be categorised into discrete classes (relevant or non-relevant). Thus a parallel can also be drawn between classifier ensembles, where evidence from multiple classifiers are combined to achieve a superior result, and the IR data fusion task.

This paper presents preliminary experimental results on the applicability of classifier ensemble diversity metrics in data fusion. Initial results indicate a relationship between the quality of the fused result set (as measured by MAP) and the diversity of its inputs.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Lee, J.H.: Analyses of multiple evidence combination. SIGIR Forum 31, 267–276 (1997)
Article Google Scholar
Dietterich, T.: Ensemble methods in machine learning. In: Kittler, J., Roli, F. (eds.) MCS 2000. LNCS, vol. 1857, pp. 1–15. Springer, Heidelberg (2000)
Chapter Google Scholar
Kuncheva, L., Whitaker, C.: Ten measures of diversity in classifier ensembles: limits for two classifiers. In: IEEE Workshop on Intelligent Sensor Processing, Birmingham, UK (2001)
Google Scholar
Shipp, C., Kuncheva, L.: Relationships between combination methods and measures of diversity in combining classifiers. Information Fusion 3(2), 135–148 (2002)
Article Google Scholar
Lillis, D., Toolan, F., Collier, R., Dunnion, J.: Extending Probabilistic Data Fusion Using Sliding Windows. In: Macdonald, C., Ounis, I., Plachouras, V., Ruthven, I., White, R.W. (eds.) ECIR 2008. LNCS, vol. 4956, pp. 358–369. Springer, Heidelberg (2008)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Science and Informatics, University College Dublin, Ireland
David Leonard, David Lillis, Lusheng Zhang, Fergus Toolan, Rem W. Collier & John Dunnion

Authors

David Leonard
View author publications
You can also search for this author in PubMed Google Scholar
David Lillis
View author publications
You can also search for this author in PubMed Google Scholar
Lusheng Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Fergus Toolan
View author publications
You can also search for this author in PubMed Google Scholar
Rem W. Collier
View author publications
You can also search for this author in PubMed Google Scholar
John Dunnion
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Information School, University of Sheffield, Regent Court, 211 Portobello Street, S1 4DP, Sheffield, UK
Paul Clough
CLARITY: Centre for Sensor Web Technologies, School of Computing, Dublin City University, Glasnevin, Dublin 9, Ireland
Colum Foley , Cathal Gurrin & Hyowon Lee , &
Centre for Next Generation Localisation, School of Computing, Dublin City University, Glasnevin, Dublin 9, Ireland
Gareth J. F. Jones
TNO Human Factors, Brassersplein 2, 2612 CT, Delft, The Netherlands
Wessel Kraaij
Yahoo! Research, 177 Diagonal, 08018, Barcelona, Spain
Vanessa Mudoch

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Leonard, D., Lillis, D., Zhang, L., Toolan, F., Collier, R.W., Dunnion, J. (2011). Applying Machine Learning Diversity Metrics to Data Fusion in Information Retrieval. In: Clough, P., et al. Advances in Information Retrieval. ECIR 2011. Lecture Notes in Computer Science, vol 6611. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20161-5_73

Download citation

DOI: https://doi.org/10.1007/978-3-642-20161-5_73
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20160-8
Online ISBN: 978-3-642-20161-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics