Name: Pretrained Transformers for Text Ranking
ISBN: 978-3-031-02181-7

Overview

Authors:

Jimmy Lin ⁰,
Rodrigo Nogueira ¹,
Andrew Yates ²

Jimmy Lin
1. University of Waterloo, Canada
View author publications

You can also search for this author in PubMed Google Scholar
Rodrigo Nogueira
1. University of Waterloo, Canada
View author publications

You can also search for this author in PubMed Google Scholar
Andrew Yates
1. University of Amsterdam and Max Planck Institute for Informatics, USA
View author publications

You can also search for this author in PubMed Google Scholar

Part of the book series: Synthesis Lectures on Human Language Technologies (SLHLT)

1026 Accesses
43 Citations

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 69.99

Price excludes VAT (USA)

Softcover Book USD 89.99

Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (6 chapters)

Front Matter

Pages i-xvii

Download chapter PDF
Introduction
- Jimmy Lin, Rodrigo Nogueira, Andrew Yates
Pages 1-24
Setting the Stage
- Jimmy Lin, Rodrigo Nogueira, Andrew Yates
Pages 25-61
Multi-Stage Architectures for Reranking
- Jimmy Lin, Rodrigo Nogueira, Andrew Yates
Pages 63-161
Refining Query and Document Representations
- Jimmy Lin, Rodrigo Nogueira, Andrew Yates
Pages 163-193
Learned Dense Representations for Ranking
- Jimmy Lin, Rodrigo Nogueira, Andrew Yates
Pages 195-238
Future Directions and Conclusions
- Jimmy Lin, Rodrigo Nogueira, Andrew Yates
Pages 239-253
Back Matter

Pages 255-307

Download chapter PDF

About this book

The goal of text ranking is to generate an ordered list of texts retrieved from a corpus in response to a query. Although the most common formulation of text ranking is search, instances of the task can also be found in many natural language processing (NLP) applications.This book provides an overview of text ranking with neural network architectures known as transformers, of which BERT (Bidirectional Encoder Representations from Transformers) is the best-known example. The combination of transformers and self-supervised pretraining has been responsible for a paradigm shift in NLP, information retrieval (IR), and beyond. This book provides a synthesis of existing work as a single point of entry for practitioners who wish to gain a better understanding of how to apply transformers to text ranking problems and researchers who wish to pursue work in this area. It covers a wide range of modern techniques, grouped into two high-level categories: transformer models that perform reranking inmulti-stage architectures and dense retrieval techniques that perform ranking directly. Two themes pervade the book: techniques for handling long documents, beyond typical sentence-by-sentence processing in NLP, and techniques for addressing the tradeoff between effectiveness (i.e., result quality) and efficiency (e.g., query latency, model and index size). Although transformer architectures and pretraining techniques are recent innovations, many aspects of how they are applied to text ranking are relatively well understood and represent mature techniques. However, there remain many open research questions, and thus in addition to laying out the foundations of pretrained transformers for text ranking, this book also attempts to prognosticate where the field is heading.

Authors and Affiliations

University of Waterloo, Canada

Jimmy Lin, Rodrigo Nogueira
University of Amsterdam and Max Planck Institute for Informatics, USA

Andrew Yates

About the authors

Jimmy Lin holds the David R. Cheriton Chair in the David R. Cheriton School of Computer Science at the University of Waterloo. Prior to 2015, he was a faculty at the University of Maryland, College Park. Lin received his Ph.D. in Electrical Engineering and Computer Science from the Massachusetts Institute of Technology in 2004.Rodrigo Nogueira is a post-doctoral researcher at the University of Waterloo, an adjunct professor at the University of Campinas (UNICAMP), and a senior research scientist at NeuralMind, a startup focused on applying deep learning to document and image analysis. Nogueira received his Ph.D. in Computer Science from the New York University in 2019.Andrew Yates is an assistant professor in the Informatics Institute at the University of Amsterdam. Prior to 2021, he was a post-doctoral researcher and then senior researcher at the Max Planck Institute for Informatics. Yates received his Ph.D. in Computer Science from Georgetown University in 2016.

Bibliographic Information

Book Title: Pretrained Transformers for Text Ranking
Book Subtitle: BERT and Beyond
Authors: Jimmy Lin, Rodrigo Nogueira, Andrew Yates
Series Title: Synthesis Lectures on Human Language Technologies
DOI: https://doi.org/10.1007/978-3-031-02181-7
Publisher: Springer Cham
eBook Packages: Synthesis Collection of Technology (R0), eBColl Synthesis Collection 11
Copyright Information: Springer Nature Switzerland AG 2022
Softcover ISBN: 978-3-031-01053-8Published: 29 October 2021
eBook ISBN: 978-3-031-02181-7Published: 01 June 2022
Series ISSN: 1947-4040
Series E-ISSN: 1947-4059
Edition Number: 1
Number of Pages: XVII, 307
Topics: Artificial Intelligence, Natural Language Processing (NLP), Computational Linguistics

Publish with us

Policies and ethics

Pretrained Transformers for Text Ranking

Overview

Access this book

Other ways to access

Table of contents (6 chapters)

Front Matter

Introduction

Setting the Stage

Multi-Stage Architectures for Reranking

Refining Query and Document Representations

Learned Dense Representations for Ranking

Future Directions and Conclusions

Back Matter

About this book

Authors and Affiliations

University of Waterloo, Canada

University of Amsterdam and Max Planck Institute for Informatics, USA

About the authors

Bibliographic Information

Publish with us

Search

Navigation