Metadata and Semantics Research

Volume 343 of the series Communications in Computer and Information Science pp 146-157

Integrating Scholarly Publications and Research Data – Preparing for Open Science, a Case Study from High-Energy Physics with Special Emphasis on (Meta)data Models

  • Piotr Praczyk (CERN; Computer Science and Systems Engineering Dept., Universidad de Zaragoza)
  • Javier Nogueras-Iso (Computer Science and Systems Engineering Dept., Universidad de Zaragoza)
  • Suenje Dallmeier-Tiessen (CERN)
  • Mike Whalley (IPPP, Durham University)

There is an emerging need in the research community for access to research material beyond the publication itself. In an ideal scenario, scientists should have access to more than the full text: data, code, documentation and any other research output. We present a case study of our approach to providing seamless access to more than “just the paper” by integrating two complementary, heavily used systems: Inspire and HEPData. Inspire, a digital library for High-Energy Physics, provides access to publication metadata and full-text documents. The HEPData project, in turn, has concentrated on gathering the datasets behind figures and tables. We allow both systems to draw on their combined data and present a new infrastructure in Inspire that treats datasets as equal in importance to publications. We also present mechanisms for the long-term preservation of datasets and for their unique identification, an important step towards open linked data in Inspire.