Skip to main content

Humanities Data in R

Exploring Networks, Geospatial Data, Images, and Text

  • Textbook
  • © 2024
  • Latest edition

Overview

  • Incorporates the set of R packages known as the “tidyverse”, including the popular ggplot2 and dplyr
  • Grounds data visualization and manipulation on theoretical frameworks within data science and computer science
  • Expands text and image applications based on recent advances in machine learning and AI

This is a preview of subscription content, log in via an institution to check access.

Access this book

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

eBook USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book USD 99.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

About this book

This book teaches readers to integrate data analysis techniques into humanities research practices using the R programming language. Methods for general-purpose visualization and analysis are introduced first, followed by domain-specific techniques for working with networks, text, geospatial data, temporal data, and images. The book is designed to be a bridge between quantitative and qualitative methods, individual and collaborative work, and the humanities and social sciences. The second edition of the text is a significant revision, with almost every aspect of the text rewritten in some way. The most notable difference is the incorporation of new R packages such as ggplot2 and dplyr that center broad data-science concepts.

This 2nd edition of Humanities Data with R does not presuppose background programming experience. Early chapters take readers from R set-up to exploratory data analysis, with one chapter dedicated to each stage of the data-science pipeline (data collection, visualization, manipulation, and relational joins). Following this, text analysis, networks, temporal data, geospatial data, and image analysis each have a dedicated chapter. These are grounded in examples to move readers beyond the intimidation of adding new tools to their research. The final section of the book extends the core material with additional computer science techniques for processing large datasets.

Everything is hands-on: image analysis is explained using digitized photographs from the 1930s, and networks are applied to page links on Wikipedia. After working through these examples with the provided data, code and book website, readers are prepared to apply new methods to their own work. The open source R programming language, with its myriad packages and popularity within the sciences and social sciences, is particularly well-suited to working with humanities data. R packages are also highlighted in an appendix.

The methodology will have wide application in classrooms and self-study for the humanities, but also for use in linguistics, anthropology, and political science. Outside the classroom, this intersection of humanities and computing is particularly relevant for research and new modes of dissemination across archives, museums and libraries.

Keywords

Table of contents (12 chapters)

  1. Core

  2. Data Types

  3. Additional Methods

Authors and Affiliations

  • University of Richmond, Richmond, USA

    Taylor Arnold, Lauren Tilton

About the authors

Taylor Arnold is Professor of Data Science & Statistics at the University of Richmond and affiliated faculty in the interdisciplinary programs in linguistics and cognitive science. His research applies and develops corpus-based techniques and software to study how messages are communicated through visual and multimodal forms. Arnold is the co-author of four books: Humanities Data in R: Exploring Networks, Geospatial Data, Images and Texts (Springer, 2015), A Computational Approach to Statistical Learning (CRC Press, 2019), Layered Lives (Stanford University Press, 2022), and Distant Viewing: Analyzing Visual Culture at Scale (MIT Press, 2023).

Lauren Tilton is the E. Claiborne Robins Professor of Liberal Arts and Digital Humanities in the Department of Rhetoric and Communication Studies at the University of Richmond. Her research focuses on 20th and 21st century U.S. visual culture. She is director of Photogrammar, a digital public humanities project mapping New Deal and World War II documentary expression funded by the ACLS and NEH, and author of Humanities Data in R: Exploring Networks, Geospatial Data, Images and Texts (Springer, 2015), Layered Lives (Stanford University Press, 2022), and Distant Viewing: Analyzing Visual Culture at Scale (MIT Press, 2023). Her scholarship has appeared in journals such as American QuarterlyArchive JournalDigital Humanities Quarterly, and Digital Scholarship in the Humanities. She is the co-editor of Computational Humanities (Debates in the Digital Humanities), currently in production with the University of Minnesota Press.

Bibliographic Information

Publish with us