Language Resources and Evaluation, Volume 47, Issue 4, pp 1315–1326

Creating & Testing CLARIN Metadata Components

  • Folkert de Vriend
  • Daan Broeder
  • Griet Depoorter
  • Laura van Eerten
  • Dieter Van Uytvanck
Project Note

Abstract

The CLARIN Metadata Infrastructure (CMDI), which is being developed within the Common Language Resources and Technology Infrastructure (CLARIN), is a computer-supported framework that combines a flexible component approach with the explicit declaration of semantics. The goal of the Dutch CLARIN project “Creating & Testing CLARIN Metadata Components” was to create, according to the CMDI specifications, metadata components and profiles for a wide variety of existing resources housed at two data centres. In doing so, the project also tested the principles of the framework. The results benefit other CLARIN projects that are expected to adhere to the CMDI framework and its accompanying tools.

Keywords

Metadata Infrastructure CLARIN 

Acknowledgments

The authors would like to thank Jan Pieter Kunst (Meertens Institute) and Anna Aalstein (INL) for their valuable input during the project. The project reported on in this paper was funded by CLARIN-NL (www.clarin.nl).

Copyright information

© Springer Science+Business Media Dordrecht 2013

Authors and Affiliations

  • Folkert de Vriend (1)
  • Daan Broeder (2)
  • Griet Depoorter (3)
  • Laura van Eerten (3)
  • Dieter Van Uytvanck (2)

  1. Meertens Institute, Amsterdam, The Netherlands
  2. Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands
  3. Institute for Dutch Lexicology, Leiden, The Netherlands
