Identifying Challenges for Information Organization in Language Archives: Preliminary Findings
- 203 Downloads
Language archives are repositories of linguistic data about a selected set of languages, typically including recordings, transcripts, translations, and linguistic annotations. Digital accessibility of primary language data, particularly that of endangered languages, has long been recognized as necessary for research reproducibility, production of pedagogical materials, and typological discovery, though their potential currently lies dormant because these resources are rarely accessed by linguists or language communities. Reasons for the under-utilization of language archives include the lack of data standardization and decreased focus on metadata quality. The present work seeks to elucidate the issues facing language archive managers and users through two steps: content analysis of information organization in language archives, and semi-structured interviews with language archive managers and users. Primary challenges identified include lacking institutional support and a range of factors which impact authority control in language archives.
KeywordsLanguage archives Information organization Metadata Digital repositories
- Henke, R., Berez-Kroeker, A.: A brief history of archiving in language documentation, with an annotated bibliography. Lang. Doc. Conserv. 10, 411–457 (2016)Google Scholar
- Al Smadi, D., et al.: Exploratory user research for CoRSAL: report prepared for S. Chelliah, Director of the Computational Resource for South Asian Languages. University of North Texas. Department of Anthropology (2016)Google Scholar
- Wasson, C., Holton, G., Roth, H.: Bringing user-centered design to the field of language archives. Lang. Doc. Conserv. 10, 641–671 (2016)Google Scholar
- National Science Foundation Documenting Endangered Languages (DEL) program solicitation. https://www.nsf.gov/pubs/2018/nsf18580/nsf18580.htm. Accessed 13 Sept 2019