Abstract
The CIDOC-CRM standard indicates that common events, actors, places and timeframes are important in linking together cultural material, and provides a framework for describing them. However, merely describing entities in this way in two datasets does not yet interlink them. To do that, the identities of instances still need to be either reconciled, or be based on a shared vocabulary. The WW1LOD dataset presented in this paper was created to facilitate both of these approaches for collections dealing with the First World War. For this purpose, the dataset includes events, places, agents, times, keywords, and themes related to the war, based on over ten different authoritative data sources from providers such as the Imperial War Museum. The content is harmonized into RDF, and published as a Linked Open Data service. While generally based on CIDOC-CRM, some modeling choices used also deviate from it where our experience dictated such. In the article, these deviations are discussed in the hope that they may serve as examples where CIDOC-CRM itself may warrant further examination. As a demonstration of use, the dataset and online service have been used to create a contextual reader application that is able to link together and pull in information related to WW1 from, e.g., 1914–1918 Online, Wikipedia, WW1 Discovery, Europeana and the Digital Public Library of America.
Similar content being viewed by others
Notes
The collection is available at http://cudl.colorado.edu/luna/servlet/UCBOULDERCB1~58~8 .
This timeline was principally derived from the official British series on the history of the war, the History of the Great War Based on Official Documents, particularly the volume Principal Events, 1914–1918 [8].
Derived in part from Patrick Lefevre’s standard bibliography on this topic published by Belgium’s Musée Royal d’Armée [12].
Available at http://data.aim25.ac.uk/about_t3.php.
Available at http://demo.seco.tkk.fi/ww1/.
References
Belgium Ministère de l’Intérieur et de l’Hygiène.: Annuaire statistique de la Belgique et du Congo Belge, vol 46. Brussels (1922)
Binding, C., May, K., Tudhope, D.: Semantic interoperability in archaeological datasets: data mapping and extraction via the CIDOC CRM. In: Christensen-Dalsgaard, B., Castelli, D., Ammitzbøll Jurik, B., Lippincott, J. (eds.) Research and Advanced Technology for Digital Libraries, Lecture Notes in Computer Science, vol. 5173. Springer, Berlin, pp. 280–290 (2008). doi:10.1007/978-3-540-87599-4_30
Bountouri, L., Gergatsoulis, M.: The semantic mapping of archival metadata to the CIDOC CRM ontology. J. Arch. Org. 9(3–4), 174–207 (2011). doi:10.1080/15332748.2011.650124
Crm, S.I.G.: How to implement crm time in rdf. Tech. rep, Amsterdam (2011)
Cron, H.: Geschichte des Deutschen Heeres im Weltkriege 1914–1918. Biblio-Verlag, Berlin (1937)
Doerr, M.: The CIDOC CRM—an ontological approach to semantic interoperability of metadata. AI Mag. 24(3), 75–92 (2003)
Giles, J.: Internet encyclopaedias go head to head. Nature 438(7070), 900–901 (2005). doi:10.1038/438900a
Principal events, 1914–1918. History of the Great War based on official documents, London (1922)
Horne, J., Kramer, A.: German atrocities 1914: a history of denial. Yale University Press, New Haven (2001)
Hyvönen, E., Lindquist, T., Törnroos, J., Mäkelä, E.: History on the semantic web as linked data—an event gazetteer and timeline for World War I. In: Proceedings of CIDOC 2012—enriching cultural heritage. CIDOC, Helsinki (2012). http://www.cidoc2012.fi/en/cidoc2012/programme
Hyvönen, E., Tuominen, J., Alonen, M., Mäkelä, E.: Linked Data Finland: A 7-star model and platform for publishing and re-using linked datasets. In: [22] , pp. 226–230 (2014). doi:10.1007/978-3-319-11955-7_24
Lefevre, P.: (ed) Belgique et la Première Guerre mondiale, bibliographie. Musée royal de l’Armée, Brussels (1987–2001)
Lehmann, J., Isele, R., Jakob, M., Jentzsch, A., Kontokostas, D., Mendes, P.N., Hellmann, S., Morsey, M., van Kleef, P., Auer, S., Bizer, C.: DBpedia—a large-scale, multilingual knowledge base extracted from Wikipedia. Semant. Web J. 6(2), 167–195 (2015)
Lin, C.H., Hong, J.S., Doerr, M.: Issues in an inference platform for generating deductive knowledge: a case study in cultural heritage digital libraries using the CIDOC CRM. Int. J. Digital Libraries 8(2), 115–132 (2008). doi:10.1007/s00799-008-0034-0
Lindquist, T., Long, H.: How can educational technology facilitate student engagement with online primary sources? User needs assessment. Library Hi Tech 29(2), 224–241 (2011)
Lindquist, T., Dulock, M., Törnroos, J., Hyvönen, E., Mäkelä, E.: Using linked open data to enhance subject access in online primary sources. Catalog. Classif. Q. 51(8), 913–928 (2013). doi:10.1080/01639374.2013.823583
Luyt, B., Tan, D.: Improving Wikipedia’s credibility: References and citations in a sample of history articles. J. Am. Soc. Inf. Sci. Technol. 61(4), 715–722 (2010). doi:10.1002/asi.21304
Mäkelä, E.: Combining a REST lexical analysis web service with SPARQL for mashup semantic annotation from text. In: [22], pp. 424–428 (2014). doi:10.1007/978-3-319-11955-7_60
Mäkelä, E., Hyvönen, E.: SPARQL SAHA, a configurable linked data editor and browser as a service. In: [22], pp. 434–438 (2014). doi:10.1007/978-3-319-11955-7_62
Mäkelä, E., Hyvönen, E., Ruotsalo, T.: How to deal with massively heterogeneous cultural heritage data—lessons learned in CultureSampo. Semantic Web 3(1), 85–109 (2012)
Mäkelä, E., Lindquist, T., Hyvönen, E.: CORE—a contextual reader based on linked data. In: Proceedings of Digital Humanities 2016, long papers (2016)
Presutti, V., Blomqvist, E., Troncy, R., Sack, H., Papadakis, I., Tordai, A. (eds.): The Semantic Web: ESWC 2014 Satellite Events—ESWC 2014 Satellite Events, Anissaras, Crete, Greece, May 25–29, 2014, Revised Selected Papers, Lecture Notes in Computer Science, vol. 8798. Springer, Berlin (2014). doi:10.1007/978-3-319-11955-7
Rector, L.H.: Comparison of Wikipedia and other encyclopedias for accuracy, breadth, and depth in historical articles. Ref. Services Rev. 36(1), 7–22 (2008)
Tessin, G.: Deutsche Verbände und Truppen. Biblio-Verlag, Osnabrück (1974)
Volz, J., Bizer, C., Gaedke, M., Kobilarov, G.: Discovering and maintaining links on the web of data. In: Bernstein, A., Karger, DR., Heath, T., Feigenbaum, L., Maynard, D., Motta, E., Thirunarayan, K. (eds) International Semantic Web Conference, Lecture Notes in Computer Science, vol. 5823. Springer, Berlin, pp. 650–665 (2009)
Waters, N.L.: Why you can’t cite Wikipedia in my class. Commun. ACM 50(9), 15–17 (2007). doi:10.1145/1284621.1284635
Acknowledgments
We would like to thank Michael Ortiz (CU) and Tuomas Palonen (Aalto University) for annotating the resources, Michael Dulock (CU) and Holley Long (CU) for their aid with the digital collection and metadata, and Martha Hanna (CU), Patrick Tally (CU), Alan Kramer (Trinity College Dublin), Sophie de Schaepdrijver (Pennsylvania State University) and Tammy Proctor (Wittenberg University) for their expert opinion on the content.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Mäkelä, E., Törnroos, J., Lindquist, T. et al. WW1LOD: an application of CIDOC-CRM to World War 1 linked data. Int J Digit Libr 18, 333–343 (2017). https://doi.org/10.1007/s00799-016-0186-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00799-016-0186-2