Abstract
This chapter presents two types of bilingual corpora: parallel corpora and comparable corpora. Point-wise mutual information, a measure often used to find collocations, is adapted to search for term equivalents, that is, terms in different languages that refer to the same entity. Language modeling is adapted to operate at the character level and serve as a language identification tool. Experiment: searching for term equivalents in a parallel English-French corpus. *This chapter relies largely on Chapters 6 and 7 as prerequisites.
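As a rough illustration of the term-equivalent search mentioned in the abstract, the sketch below scores word pairs from a sentence-aligned corpus with point-wise mutual information, PMI(x, y) = log2(P(x, y) / (P(x) P(y))). It is a minimal sketch under stated assumptions, not the chapter's own implementation: the function names pmi_table and best_equivalents, the toy English-French sentences, and the choice to estimate probabilities as the fraction of aligned sentence pairs containing a word are all hypothetical.

```python
import math
from collections import Counter


def pmi_table(aligned_pairs):
    """Return PMI scores for (source word, target word) pairs.

    Assumption: P(x), P(y) and P(x, y) are estimated as the fraction of
    aligned sentence pairs in which the word (or both words) occur.
    """
    n = len(aligned_pairs)
    src_count, tgt_count, joint_count = Counter(), Counter(), Counter()
    for src, tgt in aligned_pairs:
        # Use sets so that a word counts once per sentence pair.
        src_words = set(src.lower().split())
        tgt_words = set(tgt.lower().split())
        src_count.update(src_words)
        tgt_count.update(tgt_words)
        joint_count.update((s, t) for s in src_words for t in tgt_words)
    return {
        (s, t): math.log2((c / n) / ((src_count[s] / n) * (tgt_count[t] / n)))
        for (s, t), c in joint_count.items()
    }


def best_equivalents(term, pmi, top_k=3):
    """Rank target-language candidates for a source term by PMI."""
    candidates = [(t, score) for (s, t), score in pmi.items() if s == term]
    # Highest PMI first; ties broken alphabetically for determinism.
    return sorted(candidates, key=lambda p: (-p[1], p[0]))[:top_k]


# Hypothetical toy corpus of aligned English-French sentences.
pairs = [
    ("the cat sleeps", "le chat dort"),
    ("the dog sleeps", "le chien dort"),
    ("the cat eats", "le chat mange"),
    ("the dog eats", "le chien mange"),
]
pmi = pmi_table(pairs)
print(best_equivalents("cat", pmi, top_k=1))  # [('chat', 1.0)]
```

On this toy data, "le" co-occurs with every English word and therefore receives a PMI of 0 with "cat", whereas "chat" appears only alongside "cat" and scores highest. On a real corpus, low-frequency words tend to receive inflated PMI values, so a minimum frequency threshold would usually be needed.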
Copyright information
© 2016 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Barrière, C. (2016). Bilingual Corpora. In: Natural Language Understanding in a Semantic Web Context. Springer, Cham. https://doi.org/10.1007/978-3-319-41337-2_7
DOI: https://doi.org/10.1007/978-3-319-41337-2_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-41335-8
Online ISBN: 978-3-319-41337-2
eBook Packages: Computer Science, Computer Science (R0)