1 Introduction

Epidemics are defined as the occurrence of a group of illnesses of similar nature, in a community or region, of more than normal expectancy, and derived from a common or a propagated source. Pandemics refer to epidemics that affect the world globally [1]. In addition to severe illness and death, pandemic diseases cause fear and economic instability [2]. In December 2019 a novel Coronavirus 2019 (COVID-19) outbreak began in Wuhan in the Hubei region of China and very quickly spread worldwide, ultimately becoming recognized by the World Health Organization (WHO) as a pandemic [3]. COVID-19 is highly contagious with a median incubation period of 4 days, with the most common symptoms being fever and cough. Other symptoms include fatigue, headache, diarrhea, hemoptysis, dyspnea, and pneumonia [4, 5].

One of the traditional and widely used methods for the assessment of the extent of an epidemic outbreak is the epidemiologic investigation. This is usually conducted by health authorities by interviewing the suspected or confirmed exposed individuals. Depending on available time and resources, this may include an examination of evidence such as receipts and photographs. It is then followed by decisions and actions based on the investigation findings [6]. It may lead to either local or general containment measures in cases of a communicable disease outbreak and allows the notification of potentially exposed persons and locations [7, 8]. Inquiries by interview, being the main investigatory technique, are based on self-recall, thus making them prone to error [9]. Accuracy of recall depends among other things upon one’s cognitive abilities, questioning environment, and levels of stress which may be significant during an investigation [10, 11].

A transmission dynamics analysis may improve understanding of the epidemiologic factors of the disease and allow monitoring of the effectiveness of measures taken to control the outbreak [12,13,14]. Hellewell and colleagues used the stochastic transmission model to show that in most scenarios, highly effective contact tracing and case isolation is enough to control a new outbreak of COVID-19 within three months. The probability of control decreases with the prolonged delay between symptom onset to isolation, fewer cases ascertained by contact tracing, and increasing transmission before symptoms [15]. Geographic Information Systems (GIS) are not a new concept in both epidemiologic research and inquiries during an epidemic may contribute to understanding transmission dynamics [16]. GIS has been used to allow visual surveillance on trends, pathogens, diseases, and potential outbreaks [17, 18]. When it is used for epidemiologic inquiries it is usually uploaded manually by the investigator [19, 20]. Firestone et al. showed that combining spatial analysis with social network analysis enhances the quality of epidemic investigations [21].

Since the previous pandemic of H1N1 in 2009 which infected over 250,000 persons [22,23,24] smartphone positioning technologies have become more accurate due to several factors including improved precision of integrated GPS antennas, improved cellular support location services in the newer generations of cellular networks [25], and better integration with phone sensor data such as an accelerometer, social networks, and Wi-Fi connectivity. Google Maps is the most popular among the maps applications and the fourth most used app in the United States [26]. Google Maps allows continuous tracking of places that one has traveled including walking, biking, public transportation, car, and airline travel on a timeline – Google Maps Timeline (GMTL) [27]. The GMTL is not activated by default, rather the user is prompted to activate it when the android phone is initiated, or when the Google Maps app is installed on another phone (e.g. iPhone by Apple). GMTL data may be exported as a Keyhole Markup Language (KML) file containing GPS coordinates and GMTL radius of accuracy around the given location (in meters). Keyhole Markup Language (KML) was adopted as an Open Geospatial Consortium implementation standard [28].

The objective of this paper is to compare the accuracy of GMTL and user recall during routine events as a surrogate for examining the role of Google Maps in epidemiologic inquiries during pandemic outbreaks.

2 Methods

This study compared locations, routes, and means of transportation including walking, car, biking, public transportation, and airline travel between GMTL and user recall. The 17 Google Maps users whose timeline was activated were recruited through posts on “Facebook” published during the second week of March 2020. All participants were questioned in the same week by the same interviewer. They were informed that no personal information will be stored or published and that sharing their GMTL is regarded as consent to participate in the study. The study was conducted in Haifa, Israel which has a diverse population in terms of age, religion, and socio-economic status.

Self-reporting is prone to error but to a lesser degree when done face to face with an interviewer [29, 30]. For this reason, a previously validated tool was used- the Timeline Follow-Back (TLFB) method. This was first established for research on substance use where reliable and valid measures were sought for self-reports on use and abuse of alcohol and other substances [31,32,33,34,35], smoking behavior [36], eating disorders [37], sexual behavior [38,39,40,41], and overnight locations of homeless people [42]. The TLFB method is based on Tourangeau’s methods for improving autobiographical memory recall: (1) taking more time to remember, (2) breaking down a class of events into subclasses, (3) recalling in reverse chronology, and (4) using landmarks [43].

Five types of data were defined to be sought through both GMTL and TLFB methods: locations, routes, means of transportation, times, and durations. The interview began with an outline of the routine daily schedule for each participant to be used as memory landmarks. A timeline was then constructed in a reverse fashion containing visited locations, routes, means of transportation, times, and durations in the day that occurred two days before conducting the interview. This was then validated in the chronological order of the day (i.e. from earlier to later events). Each participant’s GMTL was then examined and both TLFB and GMTL data were inserted into an Excel Spreadsheet (Redmond, WA, USA: Microsoft). Each participant was shown the discrepancies between GMTL and TLFB data. Conclusions were divided by the interviewer into two categories based on user recall: (1) participant recalled or (2) no recall until shown. Categories were subdivided by GMTL accuracy: [a] GMTL accurate, [b] GMTL inaccurate, [c] GMTL data missing.

Locations and means of transportation were collectively summed to allow better statistical comparison as the endpoint of the study is location accuracy and not number of accurate timelines.

3 Results

Demographics: All 17 participants are citizens of Israel and their Google Maps interface is in Hebrew. A total of 9 (41.17%) were males, with a mean age of 29 (SD = 5). Public transportation as a sole means of transportation was used by 7 (41.17%) of the participants compared to 6 (35.29%) who used only a car. Both means of transport were used by 3 of the participants (17.64%).

A total of 362 locations were compared. Participants recalled 322 (88.95% SD = 8.55) locations, compared with 40 (11.05%, SD = 2.05) locations that they did not recall. There were 303 (83.70%, SD = 9.5) locations found accurate on GMTL compared with 30 (8.29%, SD = 1.16) inaccurate locations. GMTL had 29 (8.01%, SD = 1.84) missing locations. The total discrepancy between GMTL and TLFB recall was 95 cases (26.24%, SD = 3.26). There were 26 (7.18%, SD = 1.24) locations that the participants recalled but were found inaccurate on the GMTL, 29 locations that participants recalled but were missing from their GMTL, 36 (9.94%, SD = 2.52) locations that the participants did not recall until shown their timeline which were confirmed as accurate, and 4 (1.10%, SD = 0.54) locations that the participants did not recall until shown their timeline and were found inaccurate (Figs. 1 and 2).

Fig. 1
figure 1

Flow chart depicting accuracy of location according to GMTL

Fig. 2
figure 2

Flow chart depicting the discrepancies between GMTL and TLFB

4 Discussion

Rodriguez and colleges examined the potential use of GMTL as evidence for judicial purposes, by quantifying its accuracy compared to a GPS device with superior accuracy, but did not find Google Maps accurate enough for use in forensics. However, that study did not examine the use of the fourth generation network which is widely used in mobile networks in Israel [44]. In addition, this study compared the traditional epidemiologic inquiry which is based on memory recall to the same users GMTL while the study by Rodriguez compared coordinates to a GPS device of better accuracy. Other studies related to epidemics have used static locations which were either uploaded automatically by the use of third-party applications or manually by a researcher. In addition, these were pre-decided locations such as home, work, etc. Unlike this study, they did not use the dynamic daily timeline to track the exact visited places [17, 45, 46].

The GMTL accuracy of 83.98%, means that 84 of 100 visited places, buses, cars, and streets are documented correctly and can be used in epidemiologic inquiries without further processing. The users did not recall 11.05% of the locations, but when reminded by GMTL they found that 90% (n = 36) of those locations were accurate and 10% (n = 4) were inaccurate locations but were found to be useful as an aid to recall the correct location. GMTL also provided accurate times and thus durations of stay and transport in comparison with times and durations recalled by users which were crude estimations or based on their regular schedule not accounting for daily changes.

In the first month of the COVID-19 outbreak in Israel, the exposure potentials of the first 100 patients varied between 100 for a passenger returning home on a small flight from an endemic country and 4400 for a boy infected in a store who then went to watch a sports game in a stadium [47]. During the evolution of a pandemic, the case capacity increases and resources become less and less available to conduct individual epidemiologic investigations. This point in time differs between countries and the availability of resources. In these cases, an exposed or confirmed positive user may be asked to export a KML file from GMTL and upload it to a dedicated platform for automation of the epidemiologic inquiries. It is important to note that the file may technically be stripped of personal identifying data by either the user or the authorities. This may have a positive impact concerning privacy as there will be no identifying details linked to the locations but negative value if the authorities need to contact the user in cases such as when they need to be in quarantine.

In the earliest stages of epidemic evolution and as preparation, an official recommendation of health organizations should be to turn on the GMTL feature of one’s smartphone. Currently, in most countries, adherence to quarantine is not routinely monitored unless there is a specific complaint filed with law enforcement. Privacy is a basic human right and is therefore a subject of public discussion, legislation, and enforcement, although the level of awareness and extent that people trust in peers, companies and authorities varies greatly between countries [48]. In some countries alongside meticulous legislation, health and privacy laws contain exceptions concerning public health in and communicable diseases to protect the majority [49]. Violation of this right should be considered carefully and repeatedly by a panel of experts such as an ethics committee [50]. Public health concerns during an epidemic might tip the balance between the right for privacy and enforcing adherence to quarantine or informing of other exposed persons. A possible obstacle to this might be a reluctance to cooperate. In these cases, possible options are confiscation of the information under the law upon review by an ethics committee or direct acquisition of data from Google depending on the laws and regulations of the country.

The COVID-19 pandemic has motivated the development of dedicated applications and tools for use by health authorities as well as private users. Currently these solutions are mostly based on standalone platforms. In the future, GMTL may be used as complementary to these solutions or even possibly linked to these applications through the user’s smartphone.

4.1 Limitations

There are several limitations to this study. This was a study of participants recruited from a Facebook advertisement who were healthy and not actually exposed or potentially exposed to any infectious material. The number of participants in this study was low due to difficulty in recruiting, and true objective location history cannot be obtained. In addition, this would only apply to countries where the majority of citizens have GTML on their telephones. It may not apply to areas where people do not possess newer generation smartphones or where newer cellular technologies are not implemented. In Israel, thousands of ultra-orthodox citizens do not have this feature on their telephones for religious reasons.

5 Conclusion

Google maps with GMTL technology may be useful in identifying potentially exposed individuals in a pandemic. Further research is required with a larger number of users during a real infectious outbreak to corroborate findings and establish further recommendations.