Abstract
An important secondary purpose of the Smittestopp development was to provide aggregated data sets describing mobility and social interactions in Norway’s population. The data were to be used to monitor the effect of government regulations and recommendations, provide input to advanced computational models to predict the pandemic’s spread, and provide input to fundamental epidemiology research. In this chapter we describe the challenges and technical solutions of Smittestopp’s data aggregation, as well as preliminary results from the time period when the app was active.We first give a detailed overview of the requirements, specifying the types of data to be collected and the level of spatial and temporal aggregation. We then proceed to describe the concepts for anonymization via :-anonymity and Y-differential privacy (Y-DP ), and the technical solutions for collecting and aggregating data from the database. In particular, we present details of how GPS- and Bluetooth events were mapped to geographical regions and points of interest, and the solutions employed for efficient data retrieval and processing. The preliminary results demonstrate how the recorded GPS- and Bluetooth events match with expected temporal and spatial variations in mobility and social interactions, and indicate the usefulness of the aggregated data as a tool for pandemic monitoring and research. One of the main criticisms of Smittestopp concerns the centralized storage of individuals’ movements, even if such data were used and presented only at an aggregated and anonymized level. In this chapter, we also outline a completely different approach, where the GPS data do not leave the user’s phone but are, instead, pre-processed to a much higher level of privacy before being dispatched to a server-side data aggregation algorithm. This approach, which would make the app significantly less intrusive, is made possible by recent advances in determining close contacts from Bluetooth data, either by a revised Smittestopp algorithm or by means of the Google/Apple Exposure Notification framework.
Chapter PDF
Similar content being viewed by others
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.
The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
Copyright information
© 2022 The Author(s)
About this chapter
Cite this chapter
Bruaset, A.M., Lines, G.T., Sundnes, J. (2022). Chapter 7 Data aggregation and anonymization for mathematical modeling and epidemiological studies. In: Elmokashfi, A., Lysne, O., Naumova, V. (eds) Smittestopp − A Case Study on Digital Contact Tracing. Simula SpringerBriefs on Computing, vol 11. Springer, Cham. https://doi.org/10.1007/978-3-031-05466-2_7
Download citation
DOI: https://doi.org/10.1007/978-3-031-05466-2_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-05465-5
Online ISBN: 978-3-031-05466-2
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)