Corpus Linguistics

Juola, Patrick

doi:10.1007/978-3-319-32001-4_523-1

Introduction

Corpus linguistics is, broadly speaking, the application of “big data” to the science of linguistics. Unlike traditional linguistic analysis [caricatured by Fillmore (1992) as “armchair linguistics”], which relies on native intuition and introspection, corpus linguistics relies on large samples to analyze the distribution of linguistic items quantitatively. It has therefore tended to focus on what a computer can easily measure and quantify, such as words, phrases, and word-based grammar, rather than on more abstract concepts such as discourse or formal syntax. With the advent of high-powered computers and the increased availability of machine-readable texts, it has become a major force in modern linguistic research.

History

The use of corpora for language analysis long predates computers. Theologians were making Biblical concordances in the eighteenth century, and Samuel Johnson started a tradition followed to this day (e.g., most famously by the Oxford English Dictionary)...

Author information

Authors and Affiliations

Department of Mathematics and Computer Science, McAnulty College and Graduate School of Liberal Arts, Duquesne University, Pittsburgh, PA, USA
Patrick Juola

Corresponding author

Correspondence to Patrick Juola.

Editor information

Editors and Affiliations

School of Policy, Government and International Affairs, George Mason University, Fairfax, Virginia, USA
Laurie A. Schintler
School of Policy, Government and International Affairs, George Mason University, Fairfax, Virginia, USA
Connie L. McNeely

About this entry

Cite this entry

Juola, P. (2018). Corpus Linguistics. In: Schintler, L., McNeely, C. (eds) Encyclopedia of Big Data. Springer, Cham. https://doi.org/10.1007/978-3-319-32001-4_523-1

DOI: https://doi.org/10.1007/978-3-319-32001-4_523-1
Received: 31 August 2017
Accepted: 27 December 2017
Published: 19 April 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-32001-4
Online ISBN: 978-3-319-32001-4
eBook Packages: Springer Reference Business and Management; Reference Module Humanities and Social Sciences; Reference Module Business, Economics and Social Sciences
