Skip to main content

Building Knowledge Graphs from Survey Data: A Use Case in the Social Sciences (Extended Version)

  • Conference paper
  • First Online:
The Semantic Web: ESWC 2019 Satellite Events (ESWC 2019)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11762))

Included in the following conference series:

Abstract

Many research endeavors in the social sciences rely on high-quality empirical data. Survey data is often used as a foundation to investigate social behavior. The GESIS Panel is a probability-based mixed-mode panel survey in Germany providing high-quality survey and statistical data about e.g. political opinions, well-being, and other contemporary societal topics. In general, the integration and analysis of relevant data is a time-consuming process for researchers. This is due to the fact that search, discovery, and retrieval of the survey data requires accessing various data sources providing different information in different file formats. In this paper, we present our architecture for building a Knowledge Graph of the GESIS Panel data. We present the relevant heterogeneous data sources and demonstrate how we semantically lift and interlink the data in a shared RDF model. At the core of our architecture is a Knowledge Graph representing all aspects of the surveys. It is generated in a modular fashion and, therefore, our solution can be transferred to the existing infrastructure of other survey data publishers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.gesis.org/en/gesis-panel/gesis-panel-home/.

  2. 2.

    https://dbk.gesis.org/dbksearch/.

  3. 3.

    https://km.aifb.kit.edu/sites/gesispanel-demo/.

  4. 4.

    https://www.ddialliance.org.

  5. 5.

    http://rdf-vocabulary.ddialliance.org/discovery.html.

  6. 6.

    http://dublincore.org.

  7. 7.

    https://www.w3.org/TR/vocab-data-cube/.

  8. 8.

    https://www.w3.org/TR/vocab-data-cube/.

  9. 9.

    https://w3id.org/sora/resource/vocabulary.

  10. 10.

    http://rml.io/index.html.

  11. 11.

    https://jupyter.org/.

  12. 12.

    https://pandas.pydata.org.

  13. 13.

    https://github.com/RDFLib.

  14. 14.

    https://usc-isi-i2.github.io/karma/.

  15. 15.

    https://www.w3.org/2004/02/skos/.

  16. 16.

    http://silkframework.org/.

  17. 17.

    www.sora-projekt.de (note: German only).

  18. 18.

    https://www.diw.de/en/soep.

References

  1. Bosch, T., Cyganiak, R., Gregory, A., Wackerow, J.: DDI-RDF discovery vocabulary: a metadata vocabulary for documenting research and survey data. In: LDOW (2013)

    Google Scholar 

  2. Bosch, T., Wackerow, J., Cyganiak, R., Zapilko, B.: Leveraging the DDI model for linked statistical data in the social, behavioural, and economic sciences, p. 10 (2012)

    Google Scholar 

  3. Bosnjak, M., et al.: Establishing an open probability-based mixed-mode panel of the general population in Germany: the GESIS Ppanel. Social Science Computer Review 36(1), 103–115 (2018)

    Article  Google Scholar 

  4. Chaves-Fraga, D., Priyatna, F., Santana-Pérez, I., Corcho, Ó.: Virtual statistics knowledge graph generation from CSV files. In: Emerging Topics in Semantic Technologies - ISWC 2018 Satellite Events (best papers from 13 of the Workshops Co-located with the ISWC 2018 Conference), pp. 235–244 (2018). https://doi.org/10.3233/978-1-61499-894-5-235

  5. Gherghina, S., Geissel, B.: Citizens’ conceptions of democracy and political participation in Germany. In: Workshops of European Consortium for Political Research, p. 25 (2015)

    Google Scholar 

  6. Gottron, T., Hachenberg, C., Harth, A., Zapilko, B.: Towards a semantic data library for the social sciences, p. 13 (2011)

    Google Scholar 

  7. Mayer, S.J., Schultze, M.: The effects of political involvement and cross-pressures on multiple party identifications in multi-party systems - evidence from Germany. J. Elections Public Opin. Parties 29, 1–17 (2018)

    Google Scholar 

  8. Paulheim, H.: Knowledge graph refinement: a survey of approaches and evaluation methods. Semant. Web 8(3), 489–508 (2017)

    Article  Google Scholar 

  9. Schaible, J., Zapilko, B., Bosch, T., Zenk-Möltgen, W.: Linking study descriptions to the linked open data cloud. IASSIST Q. 38(4), 38 (2015)

    Article  Google Scholar 

  10. Vardigan, M., Heus, P., Thomas, W.: Data documentation initiative: toward a standard for the social sciences. Int. J. Digit. Curation 3(1), 107–113 (2008)

    Article  Google Scholar 

  11. Zapilko, B., Schaible, J., Mayr, P., Mathiak, B.: TheSoz: a SKOS representation of the thesaurus for the social sciences, p. 7 (2012)

    Google Scholar 

  12. Zapilko, B., Schaible, J., Wandhöfer, T., Mutschke, P.: Applying linked data technologies in the social sciences. KI -Künstliche Intelligenz 30(2), 159–162 (2016)

    Article  Google Scholar 

Download references

Acknowledgments

This work was carried out with the support of the German Research Foundation (DFG) within the project “SoRa - Sozial-Raumwissenschaftliche Forschungsdateninfrastruktur” (see footnote 17).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lars Heling .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Heling, L., Bensmann, F., Zapilko, B., Acosta, M., Sure-Vetter, Y. (2019). Building Knowledge Graphs from Survey Data: A Use Case in the Social Sciences (Extended Version). In: Hitzler, P., et al. The Semantic Web: ESWC 2019 Satellite Events. ESWC 2019. Lecture Notes in Computer Science(), vol 11762. Springer, Cham. https://doi.org/10.1007/978-3-030-32327-1_48

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-32327-1_48

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-32326-4

  • Online ISBN: 978-3-030-32327-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics