Abstract
This article addresses the problem of integrating heterogeneous archival sources into a single data repository, namely the EHRI portal (http://portal.ehri-project.eu), whose aim is to support Holocaust research by providing online access to information about dispersed sources relating to the Holocaust. The problem is to combine data coming from a network of archives in order to create an interoperable data space that can be used to search for, retrieve and disseminate content in the context of archival-based research. The central aspect of the work described in this paper is an assessment of the role of the Encoded Archival Description (EAD) standard as the basis for achieving these tasks. We have worked out a strategy for defining specific customizations of EAD that can be used at various stages of the process of integrating heterogeneous sources. Our methodology is based on a specification and customization method inspired by the extensive experience of the Text Encoding Initiative (TEI) community. The TEI framework makes it possible to model specific subsets or extensions of the TEI guidelines while maintaining both the technical (XML schemas) and editorial (documentation) content within a single framework. We anticipate that the method we have developed may be of wider interest within similar environments and, we believe, for the future maintenance of the EAD standard.
Notes
https://www.w3.org/TR/its20/its20.odd. Accessed 15 March 2018.
The EAD guidelines and schema encoded with ODD can be found here: https://github.com/ParthenosWP4/standardsLibrary/blob/master/archivalDescription/EAD/odd/EADSpec.xml. Accessed 28 March 2017.
References
Aas K, Sugimoto G, Jagodzinski S, Tamm U, Jeller D and Lux Z (2013) Archives Portal Europe network of excellence. D6.1 First analysis report: applying web 2.0 solutions in archival applications. http://apex-project.eu/images/docs/D61_Web20_In_Archival_Applications.pdf. Accessed 9 Jan 2018
Aas K, Jagodzinski S, Lux Z, Djupdahl M, Sugimoto G, Papp S and Kaljuvee A (2014) Archives Portal Europe network of excellence. D6.6 Second analysis report: applying Web 2.0 solutions in archival applications, 2014. http://apex-project.eu/images/docs/D66_Web20_In_Archival_Applications_final.pdf. Accessed 9 Jan 2018
AFNOR. EAD and EAC-CPF Working Groups (n.d.) Proposals for evolution of EAD. https://www2.archivists.org/sites/all/files/France_Proposals%20for%20evolution%20of%20EAD_0.rtf. Accessed 9 Jan 2018
Archives Portal Europe. https://www.archivesportaleurope.net/. Accessed 14 April 2018
Archives Portal Europe Network of Excellence (APEx) (2015) Encoded archival description (EAD). http://apex-project.eu/index.php/en/outcomes/standards/apeead. Accessed 04 April 2018
Bunn J (2013) Developing descriptive standards: a renewed call to action. Arch Rec 34(2):235–247. https://doi.org/10.1080/23257962.2013.830066
CENDARI (n.d.) Collaborative European Digital Archival Research Infrastructure. http://www.cendari.eu. Accessed 4 April 2018
CLARIN—European Research Infrastructure for Language Resources and Technology. https://www.clarin.eu. Accessed 14 April 2018
DARIAH-EU. https://www.dariah.eu. Accessed 14 April 2018
EHRI (2014) An initial Schematron schema for any EAD to validate for EHRI-preprocess. Version: 0.1. https://cdn.rawgit.com/EHRI/data-validations/master/schematron/rules.html. Accessed 14 April 2018
European Holocaust Research Infrastructure. http://portal.ehri-project.eu. Accessed 14 April 2018
Gartner R (2015) An XML schema for enhancing the semantic interoperability of archival description. Arch Sci 15(3):295–313. https://doi.org/10.1007/s10502-014-9225-1
International Council on Archives (2000) ISAD (G): general international standard archival description: adopted by the Committee on Descriptive Standards, Stockholm, Sweden, 19-22 September 1999, 2nd edn. https://www.ica.org/sites/default/files/CBPS_2000_Guidelines_ISAD%28G%29_Second-edition_EN.pdf. Accessed 04 April 2018
International Council on Archives (2004) ISAAR (CPF): international standard archival authority record for corporate bodies, persons and families. Adopted Canberra, Australia, 27-30 October 2003, 2nd edn. https://www.ica.org/sites/default/files/CBPS_Guidelines_ISAAR_Second-edition_EN.pdf. Accessed 04 April 2018
International Council on Archives (2007) ISDF: international standard for describing functions. Developed by the Committee on Best Practices and Standards, Dresden, Germany, 2–4 May, 2007. https://www.ica.org/sites/default/files/CBPS_2007_Guidelines_ISDF_First-edition_EN.pdf. Accessed 4 April 2018
International Council on Archives (2008) ISDIAH: international standard for describing institutions with archival holdings. Developed by the Committee on Best Practices and Standards, London, UK, 10–11 March, 2008. https://www.ica.org/sites/default/files/CBPS_2008_Guidelines_ISDIAH_First-edition_EN.pdf. Accessed 4 April 2018
International Council on Archives. Experts group on archival description (2016) Records in contexts, a conceptual model for archival description. Consultation draft v0.1. Conseil international des Archives, September 2016. http://www.ica.org/sites/default/files/RiC-CM-0.1.pdf. Accessed 9 Jan 2018
ISO 8601:2004 (2004) Data elements and interchange formats—Information interchange—Representation of dates and times. International Organization for Standardization
ISO/IEC 19757-3: 2016 (2016) Information technology—Document Schema Definition Languages (DSDL)—part 3: rule-based validation—Schematron. International Organization for Standardization
Library of Congress (2002) Encoded Archival Description tag library. EAD technical document no. 2. http://www.loc.gov/ead/tglib/index.html Accessed 12 April 2018
Library of Congress (2008) EAD 2002 RELAX NG Schema (version 200804 release) Society of American Archivists and Library of Congress. http://www.loc.gov/ead/ead.rng. Accessed 12 April 2018
Library of Congress (2013) Development of the Encoded Archival Description DTD. http://www.loc.gov/ead/eaddev.html Accessed 9 Jan 2018
Library of Congress (2017) EAD: official site. http://www.loc.gov/ead/index.html. Accessed 14 April 2018
Lieske C, Rahtz S, Sasaki F (2006) Internationalization and localization of XML: introducing “ITS”, XTech 2006, Amsterdam, The Netherlands, May 2006. https://www.w3.org/People/fsasaki/docs/xtech06-sasakietal.pdf. Accessed 15 April 2018
Medves M, Romary L (2014) EAG(CENDARI): customising EAG for research purposes. Building infrastructures for archives in a digital world, Jun 2013, Dublin, Ireland. https://hal.inria.fr/hal-00959841v2. Accessed 9 Jan 2018
METS: metadata encoding and transmission standard. http://www.loc.gov/standards/mets/. Accessed 4 April 2018
PARTHENOS: pooling activities, resources and tools, for heritage E-research, networking, optimization and synergies. http://www.parthenos-project.eu. Accessed 14 April 2018
Rahtz S, Burnard L (2014) Advanced topics in ODD, In: ODD: one document does it all. Workshop at the Text Encoding Initiative Conference and Members Meeting, 22–24 Oct Evanston, IL. https://hal.inria.fr/hal-01767683. Accessed 15 April 2018
Riondet C, Romary L, Van Nispen A, Rodriguez KJ, Bryant M (2017) Report on standards. [Contract] D.11.4, Inria Paris. https://hal.archives-ouvertes.fr/hal-01503235/. Accessed 2 April 2018
Romary L, Riondet C (2017) Ongoing maintenance and customization of archival standards using ODD (EAC-CPF revision proposal). EAC-CPF revision proposal. https://hal.inria.fr/hal-01677185. Accessed 2 April 2018
Romary L, Banski P, Bowers J, Degl’innocenti E, Ďurčo M, et al (2017) Report on standardization (draft). [Technical report] Deliverable 4.2 Inria. https://hal.inria.fr/hal-01560563. Accessed 2 April 2018
Schematron (2018). http://schematron.com. Accessed 2 Nov 2016
Schematron QuickFix (2017). http://www.schematron-quickfix.com. Accessed 28 March 2017
Shaw EJ (2001) Rethinking balancing flexibility and interoperability. New Rev Inf Netw 7(1):17–31. https://doi.org/10.1080/13614570109516972
Society of American Archivists. Technical Subcommittee on Encoded Archival Context for Corporate Bodies, Persons, and Families (TS-EAC-CPF) (2010) https://www2.archivists.org/governance/handbook/section7/groups/Standards/TS-EAC-CPF. Accessed 4 April 2018
Society of American Archivists. Technical Subcommittee on Encoded Archival Description (TS-EAD) (2010) https://www2.archivists.org/governance/handbook/section7/groups/Standards/TS-EAD. Accessed 4 April 2018
Text Encoding Initiative (2013) Getting started with P5 ODDs. http://www.tei-c.org/Guidelines/Customization/odds.xml. Accessed 12 April 2018
Text Encoding Initiative (2018a) P5: guidelines for electronic text encoding and interchange. <schemaSpec>. Version 3.3.0. Revision f4d8439. http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-schemaSpec.html. Accessed 14 April 2018
Text Encoding Initiative (2018b) P5: guidelines for electronic text encoding and interchange. <classSpec>. Version 3.3.0. Revision f4d8439. http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-classSpec.html. Accessed 14 April 2018
Walsh N (2002) Literate programming in XML. http://nwalsh.com/docs/articles/xml2002/lp/paper.html. Accessed 9 Jan 2018
Acknowledgements
Special thanks to Annelies van Nispen (NIOD) and Hector Martinez Alonso (ALMAnaCH) for their help, and to Lou Burnard (TEI) for his wise comments.
Funding
Funding was provided by Horizon 2020 Framework Programme (Grant No. 654164).
Appendix: EAD ODD constraints expressed in the EHRI guidelines
Constraints expressed in the EHRI guidelines
ISAD(G) field concerned by the constraint | Corresponding EAD elements or paths | Expression of the constraint |
---|---|---|
Reference codes ISAD(G) 3.1.1 | ead:eadid ead:unitid | Copy the reference number given by the collection-holding institution |
Other forms of title | ead:titleproper ead:unittitle | It is an EHRI requirement to provide English translations of non-English language titles |
Dates ISAD(G) 3.1.3 for dates of the description units | ead:date ead:unitdate | Follow the ISO 8601 standard (Data elements and interchange formats - Information interchange - Representation of dates and times). The standardized form is YYYY-MM-DD |
Level of description ISAD(G) 3.1.4 | ead:archdesc/@level ead:c{01-06}/@level | ISAD(G) 3.1.4 has a predefined list of units. As EHRI works with archives and collections that have not been arranged according to traditional rules, the terms used for the levels of description might also deviate. It is therefore chosen that this list should be flexible and expandable |
Archival history | ead:custodhist ead:acqinfo | 3.2.4 “Immediate source of acquisition or transfer” has been included in this element |
Access points | ead:controlaccess/ead:subject ead:controlaccess/ead:placename ead:controlaccess/ead:persname ead:controlaccess/ead:famname ead:controlaccess/ead:corpname ead:controlaccess/ead:geogname | EHRI wishes to support linkage with EHRI authority lists, thesauri or internationally recognized gazetteers (such as GeoNames for place names) |
Languages of materials ISAD(G) 3.4.3 | ead:language/@langcode | Mandatory in EHRI. Its value must be in the ISO 639-1 or ISO 639-2 lists (International Standards for Language Codes) |
Scripts of materials ISAD(G) 3.4.3 | ead:langmaterial/ead:language/@scriptcode | Mandatory in EHRI. Its value must be in the ISO 15924 list (International Standard for Names of Scripts) |
Existence and locations of originals ISAD(G) 3.5.1 | ead:originalsloc | The link to Repository Authority list and the request for extra information is specific to EHRI |
Existence and locations of copies | ead:altformavail | The link to Repository Authority list is specific to EHRI |
Publication note | ead:bibliography | Combination of guidelines from ISAD(G) and ISBD and Guidelines created by EHRI for describing personalities and corporate bodies |
Institution Identifier To identify the agency(ies) responsible for the description | ead:titlestmt/ead:author | Mandatory in EHRI |
Language of description | ead:langusage/ead:language/@langcode | Mandatory in EHRI. Its value must be in the ISO 639-1 or ISO 639-2 lists (International Standards for Language Codes) |
Script of description | ead:langusage/ead:language/@scriptcode | Mandatory in EHRI. Its value must be in the ISO 15924 list (International Standard for Names of Scripts) |
Sources To identify providers of metadata descriptions, other than collection-holding institutions | ead:titlestmt/ead:author | Mandatory in EHRI |
EHRI scope To identify the extent of Holocaust-related material within the total collection | ead:odd[@type = "EHRI-scope"] | Desirable in EHRI |
EHRI copyright | ead:publisher | Mandatory in EHRI |
Rules or conventions ISAD(G) 3.7.2 | ead:descrules | Mandatory in EHRI |
Date(s) of description ISAD(G) 3.7.3 | ead:processinfo/ead:p/ead:date | Mandatory in EHRI. Use of the ISO 8601 standard |
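
To make a few of the constraints in the table concrete, the following is a minimal Python sketch of how the date, language and script rules could be checked mechanically. It is not the EHRI implementation (the reference list points to a Schematron schema for that purpose). The function name `check_ead`, the sample document, the use of the EAD 2002 schema namespace, the choice to read a date from `@normal` before falling back to the element text, and the tiny code lists (which stand in for the full ISO 639 and ISO 15924 lists) are all assumptions made for illustration.

```python
import re
import xml.etree.ElementTree as ET

# Assumption: the document uses the EAD 2002 XML Schema namespace.
NS = {"ead": "urn:isbn:1-931666-22-9"}

# Illustrative subsets only; a real check would load the full ISO lists.
LANG_CODES = {"eng", "ger", "fre", "heb", "pol", "yid", "en", "de", "fr", "he", "pl", "yi"}
SCRIPT_CODES = {"Latn", "Hebr", "Cyrl"}

# The table gives YYYY-MM-DD as the standardized ISO 8601 form.
ISO_DATE = re.compile(r"\d{4}-\d{2}-\d{2}")


def check_ead(xml_text):
    """Return a list of human-readable problems found in an EAD document."""
    root = ET.fromstring(xml_text)
    problems = []

    # Dates: assumption that @normal, when present, carries the standardized date.
    for el in root.iterfind(".//ead:unitdate", NS):
        value = el.get("normal") or (el.text or "").strip()
        if not ISO_DATE.fullmatch(value):
            problems.append(f"unitdate not in YYYY-MM-DD form: {value!r}")

    # Language and script of description: both are mandatory in the table.
    languages = root.findall(".//ead:langusage/ead:language", NS)
    if not languages:
        problems.append("missing langusage/language")
    for el in languages:
        if el.get("langcode") not in LANG_CODES:
            problems.append(f"langcode not recognised: {el.get('langcode')!r}")
        if el.get("scriptcode") not in SCRIPT_CODES:
            problems.append(f"scriptcode not recognised: {el.get('scriptcode')!r}")

    return problems


# Hypothetical sample record with one well-formed and one free-text date.
SAMPLE = """<ead xmlns="urn:isbn:1-931666-22-9">
  <eadheader>
    <profiledesc>
      <langusage><language langcode="eng" scriptcode="Latn">English</language></langusage>
    </profiledesc>
  </eadheader>
  <archdesc level="collection">
    <did>
      <unitdate>1942-07-16</unitdate>
      <unitdate>July 1942</unitdate>
    </did>
  </archdesc>
</ead>"""

for problem in check_ead(SAMPLE):
    print(problem)
```

Run against the sample, the sketch reports only the free-text date. Rules of this kind are more naturally kept alongside the schema and its documentation, which is the role the abstract assigns to the customization method; the Python version is only meant to show what each row of the table asks of a record.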
Cite this article
Romary, L., Riondet, C. EAD ODD: a solution for project-specific EAD schemes. Arch Sci 18, 165–184 (2018). https://doi.org/10.1007/s10502-018-9290-y
DOI: https://doi.org/10.1007/s10502-018-9290-y