
Linkset Quality Assessment for the Thesaurus Framework LusTRE

  • Riccardo Albertoni
  • Monica De Martino
  • Paola Podestà
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 672)

Abstract

Recently, a large number of controlled vocabularies (e.g., thesauri) covering several domains and shared by different communities have been published and interlinked using the Linked Data paradigm. Data producers have invested considerable effort in making their thesauri compliant with Linked Data requirements, both for the content encoding and for the connections (i.e., linksets) with other thesauri. In our own experience creating LusTRE, a framework of multilingual linked thesauri for the environment developed within the EU-funded project eENVplus, interlinking the thesauri also required significant effort; evaluating the quality of these links in terms of usefulness and information enrichment therefore became a critical issue. In this paper, to support our claim, we discuss the results of the quality evaluation of several linksets created in LusTRE. For this purpose, we consider two quality measures, the average linkset reachability and the average linkset importing, which quantify the information accessible through a linkset.

Keywords

Linkset quality, SKOS, Linked data, Environmental thesauri, Metadata

Notes

Acknowledgements

The work described in this paper was carried out within the EU-funded project eENVplus (CIP-ICT-PSP grant No. 325232). The authors would like to thank all partners, in particular Paolo Plini (IIA-CNR) for the important collaboration, and the team of the European Commission’s Joint Research Centre (Italy) for their valuable contribution.

Copyright information

© Springer International Publishing AG 2016

Authors and Affiliations

  • Riccardo Albertoni (1)
  • Monica De Martino (1)
  • Paola Podestà (1, corresponding author)
  1. Istituto di Matematica Applicata e Tecnologie Informatiche, Consiglio Nazionale delle Ricerche, Genova, Italy
