Chapter 7 Data aggregation and anonymization for mathematical modeling and epidemiological studies

Bruaset, Are Magnus; Lines, Glenn Terje; Sundnes, Joakim

doi:10.1007/978-3-031-05466-2_7

Part of the book series: Simula SpringerBriefs on Computing (SBRIEFSC, volume 11)

Abstract

An important secondary purpose of the Smittestopp development was to provide aggregated data sets describing mobility and social interactions in Norway’s population. The data were to be used to monitor the effect of government regulations and recommendations, provide input to advanced computational models to predict the pandemic’s spread, and provide input to fundamental epidemiology research. In this chapter we describe the challenges and technical solutions of Smittestopp’s data aggregation, as well as preliminary results from the time period when the app was active. We first give a detailed overview of the requirements, specifying the types of data to be collected and the level of spatial and temporal aggregation. We then proceed to describe the concepts for anonymization via k-anonymity and ε-differential privacy (ε-DP), and the technical solutions for collecting and aggregating data from the database. In particular, we present details of how GPS and Bluetooth events were mapped to geographical regions and points of interest, and the solutions employed for efficient data retrieval and processing. The preliminary results demonstrate how the recorded GPS and Bluetooth events match expected temporal and spatial variations in mobility and social interactions, and indicate the usefulness of the aggregated data as a tool for pandemic monitoring and research. One of the main criticisms of Smittestopp concerns the centralized storage of individuals’ movements, even if such data were used and presented only at an aggregated and anonymized level. In this chapter, we also outline a completely different approach, where the GPS data do not leave the user’s phone but are, instead, pre-processed to a much higher level of privacy before being dispatched to a server-side data aggregation algorithm. This approach, which would make the app significantly less intrusive, is made possible by recent advances in determining close contacts from Bluetooth data, either by a revised Smittestopp algorithm or by means of the Google/Apple Exposure Notification framework.
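
To make the two anonymization concepts named in the abstract concrete, the following is a minimal Python sketch, not the chapter's implementation. It assumes each user is assigned to exactly one region (so a single user changes any count by at most 1), suppresses regions with fewer than k users as a k-anonymity-style threshold, and adds Laplace noise with scale 1/ε to the remaining counts, as in the standard Laplace mechanism. The function names, the input layout, and the region labels are hypothetical. Thresholding on the true count before adding noise is a simplification and does not by itself give a formal ε-DP guarantee.

```python
import random
from collections import Counter


def laplace_noise(scale, rng):
    # The difference of two i.i.d. exponentials with rate 1/scale
    # follows a Laplace distribution with mean 0 and the given scale.
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def aggregate_counts(user_regions, k, epsilon, rng=None):
    """Return noisy per-region user counts.

    user_regions: dict mapping a user id to one region label (assumed layout).
    k:            regions with fewer than k users are suppressed.
    epsilon:      privacy parameter; noise scale is 1/epsilon (sensitivity 1).
    """
    rng = rng or random.Random()
    counts = Counter(user_regions.values())
    released = {}
    for region, n in counts.items():
        if n < k:
            # Too few users in this region: do not release it at all.
            continue
        released[region] = n + laplace_noise(1.0 / epsilon, rng)
    return released
```

A smaller ε gives stronger privacy at the cost of noisier counts, while a larger k suppresses more sparsely populated regions.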

Author information

Authors and Affiliations

Department of High Performance Computing, Simula Research Laboratory, Oslo, Norway
Are Magnus Bruaset
Department of Computational Physiology, Simula Research Laboratory, Oslo, Norway
Glenn Terje Lines & Joakim Sundnes

Corresponding author

Correspondence to Are Magnus Bruaset.

Editor information

Editors and Affiliations

SimulaMet, OsloMet – Oslo Metropolitan University, Oslo, Norway
Ahmed Elmokashfi
SimulaMet, OsloMet – Oslo Metropolitan University, Oslo, Norway
Olav Lysne
Simula Consulting, Oslo, Norway
Valeriya Naumova

Rights and permissions

Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

About this chapter

Cite this chapter

Bruaset, A.M., Lines, G.T., Sundnes, J. (2022). Chapter 7 Data aggregation and anonymization for mathematical modeling and epidemiological studies. In: Elmokashfi, A., Lysne, O., Naumova, V. (eds) Smittestopp − A Case Study on Digital Contact Tracing. Simula SpringerBriefs on Computing, vol 11. Springer, Cham. https://doi.org/10.1007/978-3-031-05466-2_7

DOI: https://doi.org/10.1007/978-3-031-05466-2_7
Published: 18 June 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-05465-5
Online ISBN: 978-3-031-05466-2
eBook Packages: Mathematics and Statistics, Mathematics and Statistics (R0)
