Abstract
A leaderboard is a tabular presentation of performance scores of the best competing techniques that address a specific scientific problem. Manually maintained leaderboards take time to emerge, which induces a latency in performance discovery and meaningful comparison. This can delay dissemination of best practices to non-experts and practitioners. Regarding papers as proxies for techniques, we present a new system to automatically discover and maintain leaderboards in the form of partial orders between papers, based on performance reported therein. In principle, a leaderboard depends on the task, data set, other experimental settings, and the choice of performance metrics. Often there are also tradeoffs between different metrics. Thus, leaderboard discovery is not just a matter of accurately extracting performance numbers and comparing them. In fact, the levels of noise and uncertainty around performance comparisons are so large that reliable traditional extraction is infeasible. We mitigate these challenges by using relatively cleaner, structured parts of the papers, e.g., performance tables. We propose a novel performance improvement graph with papers as nodes, where edges encode noisy performance comparison information extracted from tables. Every individual performance edge is extracted from a table with citations to other papers. These extractions resemble (noisy) outcomes of ‘matches’ in an incomplete tournament. We propose several approaches to rank papers from these noisy ‘match’ outcomes. We show that our ranking scheme can reproduce various manually curated leaderboards very well. Using widely-used lists of state-of-the-art papers in 27 areas of Computer Science, we demonstrate that our system produces very reliable rankings. We also show that commercial scholarly search systems cannot be used for leaderboard discovery, because of their emphasis on citations, which favors classic papers over recent performance breakthroughs. 
Our code and data sets will be placed in the public domain.
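The abstract describes ranking papers from noisy pairwise "match" outcomes in a performance improvement graph. As a minimal sketch (not the authors' exact algorithm), one natural baseline is a PageRank-style iteration where each paper distributes its score to the papers that beat it in some comparison table; the papers and edges below are hypothetical.

```python
from collections import defaultdict

def rank_papers(edges, damping=0.85, iters=50):
    """PageRank-style scoring on a performance-improvement graph.

    Each edge (loser, winner) records one table in which `winner`
    reported a better score than `loser`; every paper repeatedly
    passes a damped share of its score to the papers that beat it.
    """
    nodes = {p for e in edges for p in e}
    out = defaultdict(list)                 # loser -> [winners]
    for loser, winner in edges:
        out[loser].append(winner)
    n = len(nodes)
    score = {p: 1.0 / n for p in nodes}
    for _ in range(iters):
        nxt = {p: (1 - damping) / n for p in nodes}
        for p in nodes:
            winners = out[p]
            if winners:                     # pass score on to the beaters
                share = damping * score[p] / len(winners)
                for w in winners:
                    nxt[w] += share
            else:                           # unbeaten paper: spread uniformly
                for q in nodes:
                    nxt[q] += damping * score[p] / n
        score = nxt
    return sorted(score, key=score.get, reverse=True)

# Hypothetical tournament: B beats A, C beats A and B, D beats C.
edges = [("A", "B"), ("A", "C"), ("B", "C"), ("C", "D")]
print(rank_papers(edges))  # → ['D', 'C', 'B', 'A']
```

Transitivity emerges from the iteration: D never appears in a table with A or B, yet ranks first because it beats the paper that beats them. The paper's actual ranking schemes also contend with contradictory and metric-dependent edges, which this sketch ignores.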
Notes
- 3. 'Time' is ambiguous by itself: a long battery time is preferred, but so is a short training time. Our system is meant to take such errors it might make in stride.
Acknowledgment
Partly supported by grants from IBM and Amazon.
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Singh, M., Sarkar, R., Vyas, A., Goyal, P., Mukherjee, A., Chakrabarti, S. (2019). Automated Early Leaderboard Generation from Comparative Tables. In: Azzopardi, L., Stein, B., Fuhr, N., Mayr, P., Hauff, C., Hiemstra, D. (eds) Advances in Information Retrieval. ECIR 2019. Lecture Notes in Computer Science, vol. 11437. Springer, Cham. https://doi.org/10.1007/978-3-030-15712-8_16
DOI: https://doi.org/10.1007/978-3-030-15712-8_16
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-15711-1
Online ISBN: 978-3-030-15712-8
eBook Packages: Computer Science, Computer Science (R0)