Skip to main content

Advertisement

SpringerLink
Book cover

Smittestopp − A Case Study on Digital Contact Tracing pp 121–141Cite as

Chapter 7 Data aggregation and anonymization for mathematical modeling and epidemiological studies

Chapter 7 Data aggregation and anonymization for mathematical modeling and epidemiological studies

  • Are Magnus Bruaset13,
  • Glenn Terje Lines14 &
  • Joakim Sundnes14 
  • Chapter
  • Open Access
  • First Online: 18 June 2022
  • 348 Accesses

  • 1 Citations

Part of the Simula SpringerBriefs on Computing book series (SBRIEFSC,volume 11)

Abstract

An important secondary purpose of the Smittestopp development was to provide aggregated data sets describing mobility and social interactions in Norway’s population. The data were to be used to monitor the effect of government regulations and recommendations, provide input to advanced computational models to predict the pandemic’s spread, and provide input to fundamental epidemiology research. In this chapter we describe the challenges and technical solutions of Smittestopp’s data aggregation, as well as preliminary results from the time period when the app was active.We first give a detailed overview of the requirements, specifying the types of data to be collected and the level of spatial and temporal aggregation. We then proceed to describe the concepts for anonymization via :-anonymity and Y-differential privacy (Y-DP ), and the technical solutions for collecting and aggregating data from the database. In particular, we present details of how GPS- and Bluetooth events were mapped to geographical regions and points of interest, and the solutions employed for efficient data retrieval and processing. The preliminary results demonstrate how the recorded GPS- and Bluetooth events match with expected temporal and spatial variations in mobility and social interactions, and indicate the usefulness of the aggregated data as a tool for pandemic monitoring and research. One of the main criticisms of Smittestopp concerns the centralized storage of individuals’ movements, even if such data were used and presented only at an aggregated and anonymized level. In this chapter, we also outline a completely different approach, where the GPS data do not leave the user’s phone but are, instead, pre-processed to a much higher level of privacy before being dispatched to a server-side data aggregation algorithm. This approach, which would make the app significantly less intrusive, is made possible by recent advances in determining close contacts from Bluetooth data, either by a revised Smittestopp algorithm or by means of the Google/Apple Exposure Notification framework.

Download chapter PDF

Author information

Authors and Affiliations

  1. Department of High performance Computing, Simula Research Laboratory, Oslo, Norway

    Are Magnus Bruaset

  2. Department of Computational Physiology, Simula Research Laboratory, Oslo, Norway

    Glenn Terje Lines & Joakim Sundnes

Authors
  1. Are Magnus Bruaset
    View author publications

    You can also search for this author in PubMed Google Scholar

  2. Glenn Terje Lines
    View author publications

    You can also search for this author in PubMed Google Scholar

  3. Joakim Sundnes
    View author publications

    You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Are Magnus Bruaset .

Editor information

Editors and Affiliations

  1. SimulaMet, OsloMet – Oslo Metropolitan University, Oslo, Norway

    Prof. Ahmed Elmokashfi

  2. SimulaMet, OsloMet – Oslo Metropolitan University, Oslo, Norway

    Prof. Olav Lysne

  3. Simula Consulting, Oslo, Norway

    Dr. Valeriya Naumova

Rights and permissions

Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

Reprints and Permissions

Copyright information

© 2022 The Author(s)

About this chapter

Verify currency and authenticity via CrossMark

Cite this chapter

Bruaset, A.M., Lines, G.T., Sundnes, J. (2022). Chapter 7 Data aggregation and anonymization for mathematical modeling and epidemiological studies. In: Elmokashfi, A., Lysne, O., Naumova, V. (eds) Smittestopp − A Case Study on Digital Contact Tracing. Simula SpringerBriefs on Computing, vol 11. Springer, Cham. https://doi.org/10.1007/978-3-031-05466-2_7

Download citation

  • .RIS
  • .ENW
  • .BIB
  • DOI: https://doi.org/10.1007/978-3-031-05466-2_7

  • Published: 18 June 2022

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-05465-5

  • Online ISBN: 978-3-031-05466-2

  • eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Share this chapter

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Over 10 million scientific documents at your fingertips

Switch Edition
  • Academic Edition
  • Corporate Edition
  • Home
  • Impressum
  • Legal information
  • Privacy statement
  • California Privacy Statement
  • How we use cookies
  • Manage cookies/Do not sell my data
  • Accessibility
  • FAQ
  • Contact us
  • Affiliate program

Not affiliated

Springer Nature

© 2023 Springer Nature Switzerland AG. Part of Springer Nature.