© 2018

Taming the Corpus

From Inflection and Lexis to Interpretation

  • Masako Fidler
  • Václav Cvrček

Table of contents

  1. Front Matter
    Pages i-vi
  2. Václav Cvrček, Masako Fidler
    Pages 1-8
  3. Words, Rhymes, and Grammatical Forms

    1. Front Matter
      Pages 9-9
    2. Jiří Milička, Hana Kalábová
      Pages 37-61
    3. Radek Čech, Miroslav Kubát
      Pages 63-77
  4. Not Only “Lost” in Translation

  5. Understanding Discourse

    1. Front Matter
      Pages 145-145
    2. Kieran Williams
      Pages 147-166

About this book


This book bridges the current quantitative and qualitative text analyses, using grammar as a crucial source of investigation. Taking data from Czech, an inflected language, in which the most optimal conditions to respond to this research question are met, the book expands the understanding of language and text in ways that have not been executed before. For predominantly English-based quantitative research, this volume fills a crucial gap by examining the relationship between inflection and other phenomena (including discourse, translation and literature). For the current qualitative research, the volume provides large empirical data to confirm some of its claims, but more importantly, it demonstrates the important role of detailed grammatical concepts that have not been considered before. Besides addressing fundamental questions about text analysis methods, the volume presents a diverse array of Czech data that are unique in their own right and worthy of dissemination to the general audience. 

Taming the Corpus: From Inflection and Lexis to Interpretation is divided into three sections. Section 1 deals with phonotactics, poetic structure, morphological complexity used to differentiate literary style, and native speakers’ sense of grammaticality – issues pertinent to linguistic typology, cognition and language, and literary studies. Section 2 focuses on inter-language relations, especially the theory of translation. Section 3 demonstrates how quantitative analysis of texts can contribute to our understanding of society and connects the volume to legal language, construction of gender and discourse position and implicit ideology.


Corpus Linguistics Quantitative Text Analysis Qualitative Text Analysis Czech National Corpus (CNC) Slavic Language Lexis Corpus Discourse Inflection Grammatical Markers Digital Humanities Quantitative Literary Studies

Editors and affiliations

  • Masako Fidler
    • 1
  • Václav Cvrček
    • 2
  1. 1.Department of Slavic StudiesBrown UniversityProvidenceUSA
  2. 2.Institute of the Czech National CorpusCharles UniversityPrague 1Czech Republic

About the editors

Masako Fidler received her PhD in Slavic Linguistics at UCLA. She is currently a Professor of Slavic Slavic Studies at Brown University. Fidler's research areas are comparative linguistics, discourse analysis, and cognitive linguistics. Her most current research involves corpus-assisted text analysis of Czech using statistical applications developed at the Institute of the Czech National Corpus at Charles University in Prague. This is a collaborative project with Václav Cvrček at the Institute of the Czech National Corpus at Charles University in Prague. 

Václav Cvrček received his PhD in Mathematical and Corpus Linguistics at Charles University. He is currently the Deputy Chair of the Institute of the Czech National Corpus and Associate Professor at Charles University. His research interests include corpus linguistics, quantitative linguistics, and corpus-assisted discourse studies.

Bibliographic information