Integrating Scholarly Publications and Research Data – Preparing for Open Science, a Case Study from High-Energy Physics with Special Emphasis on (Meta)data Models

  • Piotr Praczyk
  • Javier Nogueras-Iso
  • Suenje Dallmeier-Tiessen
  • Mike Whalley
Part of the Communications in Computer and Information Science book series (CCIS, volume 343)

Abstract

There is an emerging need in the research communitiy to have access to the research material beyond a publication. In an ideal scenario, scientists should have access to more than the full text: data, code, documentation and any other research output. We present here a case-study of our approach to facilitate seamless access to more than “just the paper” by integrating two complementary, heavily used, systems: Inspire and HEPData. On the one hand, Inspire, a digital library of High-Energy Physics, allows access to metadata about publications and full-text documents. On the other hand, the HEPData project has concentrated on gathering datasets behind figures and tables. We allow both systems to take advantage of a sum of their data and present a new infrastructure in Inspire making datasets equally important as publications. We also present mechanisms allowing long-term preservation of datasets and their unique identification, being an important step towards the open linked data in Inspire.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Borgman, C.L.: Research data: Who will share what, with whom, when, and why? Working Paper Series of the German Council for Social and Economic Data 161, German Council for Social and Economic Data (RatSWD) (2010)Google Scholar
  2. 2.
    Caffaro, J., Kaplun, S.: Invenio: A Modern Digital Library for Grey Literature. oai:cds.cern.ch:1312678. In: 12th. Int. Conf. on Grey Literature, Prague, Czech Republic, Geneva, p. 7 (December 2010) No. CERN-OPEN-2010-027Google Scholar
  3. 3.
    Dietrich, D.: Metadata management in a data staging repository. Journal of Library Metadata 10(2-3), 79–98 (2010)MathSciNetCrossRefGoogle Scholar
  4. 4.
    Green, T.: We need publishing standards for datasets and data tables. OECD Publishing White Paper 22(4), 325–327 (2009)Google Scholar
  5. 5.
    Greenberg, J.: Theoretical Considerations of Lifecycle Modeling: An Analysis of the Dryad Repository Demonstrating Automatic Metadata Propagation, Inheritance, and Value System Adoption. Cataloging & Classification Quarterly 47(3), 380–402 (2009)CrossRefGoogle Scholar
  6. 6.
    PARSE: First insights into digital preservation of research output in europe: interim insight report (2009), http://www.parse-insight.eu/downloads/PARSE-Insight_D3-5_InterimInsightReport_final.pdf
  7. 7.
    Praczyk, P., Nogueras-Iso, J., Kaplun, S., Simko, T.: A storage model for supporting figures and other artefacts in scientific libraries: the case study of invenio. In: Proc. of 4th Workshop on Very Large Digital Libraries (VLDL 2011), Berlin, Germany (2011)Google Scholar
  8. 8.
    Schindler, U., Diepenbroek, M.: Generic XML-based framework for metadata portals. Comput. Geosci. 34(12), 1947–1955 (2008)CrossRefGoogle Scholar
  9. 9.
    de Sompel, H.V., Nelson, M.L., Lagoze, C., Warner, S.: Resource Harvesting within the OAI-PMH Framework. D-Lib Magazine 10(12) (2004)Google Scholar
  10. 10.
    Vesely, M., Baron, T., Le Meur, J.Y., Simko, T.: Creating Open Digital Library Using XML: Implementation of OAi-PMH Protocol at CERN. oai:cds.cern.ch:590906. In: Int. Conf. on Electronic Publishing, Karlovy Vary, Czech Republic, p. 7 (July 2002) No. CERN-ETT-2002-003Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Piotr Praczyk
    • 1
    • 2
  • Javier Nogueras-Iso
    • 2
  • Suenje Dallmeier-Tiessen
    • 1
  • Mike Whalley
    • 3
  1. 1.CERNGenevaSwitzerland
  2. 2.Computer Science and Systems Engineering Dept.Universidad de ZaragozaSpain
  3. 3.IPPPDurham UniversityUK

Personalised recommendations