Abstract
Human mobility plays a key role in the dissemination of infectious diseases around the world. However, the complexity introduced by commuting patterns in the daily life of cities makes such a role unclear, especially at the intracity scale. Here, we propose a multiplex network fed with 9 months of mobility data with more than 107 million public bus validations in order to understand the relation between urban mobility and the spreading of COVID-19 within a large city, namely, Fortaleza in the northeast of Brazil. Our results suggest that the shortest bus rides in Fortaleza, measured in the number of daily rides among all neighborhoods, decreased \(\approx 25\)% more than the longest ones after an epidemic wave. Such a result is the opposite of what has been observed at the intercity scale. We also find that mobility changes among the neighborhoods are synchronous and geographically homogeneous. Furthermore, we find that the most central neighborhoods in mobility are the first targets for infectious disease outbreaks, which is quantified here in terms of the positive linear relation between the disease arrival time and the average of the closeness centrality ranking. These central neighborhoods are also the top neighborhoods in the number of reported cases at the end of an epidemic wave as indicated by the exponential decay behavior of the disease arrival time in relation to the number of accumulated reported cases with decay constant \(\lambda \approx 33\) days. We believe that these results can help in the development of new strategies to impose restriction measures in the cities guiding decision-makers with smart actions in public health policies, as well as supporting future research on urban mobility and epidemiology.
Similar content being viewed by others
Introduction
COVID-19 has caused massive loss of lives, economic destabilization, and changes in social relations worldwide1,2,3. In order to mitigate the spread of SARS-CoV-2, several countries have implemented restriction measures, such as curfews and lockdowns, which induce a rupture in human mobility4,5,6. Such rupture yields an immediate control of the number of reported cases, at least for some time, to the detriment of the local economy and the mental health of the population7,8,9. In most cases, restriction measures are asynchronous across national territories due to the phase difference among epidemic waves in each city10. Nevertheless, they provide interesting case studies to researchers that aim to understand the mechanisms behind the dynamics of COVID-19 dissemination in order to improve mathematical model predictions for future outbreaks11,12.
There is a consensus that human mobility plays a fundamental role in the dissemination of infectious diseases at the intercity scale13,14,15. Brockmann and Helbing16 introduced a network model to study the dissemination of 2009 H1N1 flu and 2003 SARS among different cities worldwide. They found that the propagation of infectious diseases among cities is better described by an effective distance (defined in relation to human mobility) than by the geographic distance since the disease arrival time in the cities only correlates with the effective distance to the Initial Outbreak Location (IOL). Therefore, the effective distance is interpreted as the most probable path through which an infectious disease spreads, i.e., the path along which the probability of population mixing is maximized. On the other hand, this role remains unclear at the intracity scale despite the use of sophisticated techniques, such as contact tracing and meta-population models17,18,19,20,21, in order to find a reliable correlation between urban mobility and the spreading of infectious diseases. One of the main reasons for this lack of clarity is the rise of complexity introduced by commuting patterns in the daily life of cities.
Here, we perform a longitudinal analysis based on concepts of complex networks fed with 9 months of epidemiological and mobility data in order to understand the relation between urban mobility and the spreading of COVID-19 within Fortaleza, a large Brazilian city. Brazil is the biggest country in Latin America, having a population of 211 million people (estimated for 2020)22. Brazil’s first case of COVID-19 was reported on February 26th, 202023. At the end of that year, the accumulated cases and deaths of COVID-19 were 7,680,082 and 195,00824, respectively. Regarding Fortaleza, capital of the state of Ceará, a city with a population of 2.68 million people (estimated for 2020), the first case of COVID-19 was officially reported in Meireles neighborhood on March 16th, 202025. Over that year, 87,636 cases and 4980 deaths of COVID-19 in the city were reported26. Taking that into account, we introduce a model that uses the flow of individuals among neighborhoods in order to estimate the most probable path in which an infectious disease spreads and, consequently, define the most important neighborhoods from the point of view of commuting patterns. Further, we show how a lockdown influences urban mobility and, consequently, the arrival time of an outbreak at the intracity scale. The manuscript is concluded with a discussion regarding the maintenance of the order in the ranking of the most important neighborhoods throughout the investigated epidemic period. Our main contribution is to find evidence that these most important neighborhoods are not only the primary targets for infectious disease outbreaks, but also the top neighborhoods in the number of reported cases at the end of an epidemic wave. This allows us to shed light on a potentially more effective definition of local restriction measures.
Results and discussion
Socio-economical and epidemiological indicators
We show the geographic and social-economical structure of Fortaleza, as well as, some COVID-19 indicators for 2020 in Fig. 1 (see Datasets). Figure 1a shows the spatial distribution of population with the names of neighborhoods that will be mentioned throughout our study. Figure 1b,c show the spatial distribution of accumulated reported cases and deaths of COVID-19, respectively. We also show the spatial distribution of the Human Development Index (HDI) in Fig. 1d. There was a sub-notification of reported cases of COVID-19 in Fortaleza. We can realize such fact by making a comparison among the Fig. 1b,c,d. The neighborhoods with the highest HDIs hold the largest numbers of reported cases since their inhabitants had more access to COVID-19 tests in private practices. However, we emphasize that such neighborhoods are not the top ones in deaths. That being said, we believe that this sub-notification was geographically homogeneous in the city, except for the highest HDI neighborhoods (5 out of 119), since we are able to recover the expected positive linear relation between cases and deaths (see Fig. SI–1). This result suggests the existing bias is not strong enough to change our study.
Figure 2 shows an overview of the COVID-19 spreading across Fortaleza during the studied period. We calculate the mobility changes through the quantity \((Z^{(k)}-Z^{(1)})/Z^{(1)}\), where \(Z^{(k)}=\sum _{ij(i\ne j)} F_{ij}^{(k)}\) is the number of rides among all neighborhoods in a given week k (see Fig. 2a). Similar to other cities, the restriction measures implemented in Fortaleza were divided into two closing periods and one opening period: (i) a social isolation period (from week 4–9, State Decree 33,519), in which some services (e.g., schools and universities) were closed, and the people were not obligated to stay at home; (ii) a lockdown period (from week 10–13, State Decree 33,574), in which the people were obligated to stay at home, and only essential services were functioning; and (iii) an economic reopening period (from week 14 onwards, State Decree 33,608), in which all services started to reopen, and the people were allowed to leave their houses. In this context, we observe that urban mobility abruptly reduced and gradually increased in Fortaleza when the first restriction measures came into force and when the economic reopening progressed, respectively. Figure 2b shows these changes stratified by geographic distance d through a similar ratio \((Z_d^{(k)}-Z_d^{(1)})/Z_d^{(1)}\), where \(Z_d^{(k)}=\sum _{ij(d_{ij} \in d)} F_{ij}^{(k)}\). The quantity \(Z_d^{(k)}\) is measured by taking into account only the neighborhoods in which the geographic distances among their centroids lie in the range of the stratum d. We find that the shortest rides decreased \(\approx 25\)% more than the longest ones during the studied period, which is the opposite of what has been observed at the intercity scale10. Furthermore, we also find evidence that mobility changes and HDI were correlated in Fortaleza (see Fig. SI–2), similar to previous studies27,28. The weekly number of reported cases, \(n_c\), is shown in Fig. 2c. We observe that the peak of reported cases took place right before the lockdown period. This little anticipation is due to the fact that the dates of reference adopted for the x-axis correspond to the days of the onset of symptoms. As shown in Fig. 2d, the number of accumulated reported cases (until the end of the studied period) \(n_{ac}\) decays exponentially with the arrival time \(T_a\) of the disease in the neighborhoods, \(n_{ac} \sim \exp (-T_{a}/\lambda )\), with the decay constant \(\lambda \approx 33\) days. Precisely, \(T_a\) is measured as the number of days from January 1st, 2020 until the day that a threshold of \(n_{ac}^{*}=6\) reported cases is reached (see Supplementary Information). We also perform the calculation of the Nadaraya-Watson estimator to validate our exponential decay hypothesis29,30.
Network metrics and centrality measures
Three layers of the proposed network (see Methods) are shown in Fig. 3a, corresponding to week 3 (before the restriction measures), week 10 (during the lockdown), and week 17 (after the beginning of the economic reopening) during the investigated epidemic wave of COVID-19. The maps show synchronization of mobility changes among the neighborhoods, which is uncommon to find on a larger geographic scale, e.g., among regions or cities, due to different stages of local outbreaks6,10. The size of the vertices is based on their weighted in-degrees. The neighborhoods Centro (downtown), Aldeota, and Meireles are the largest three vertices in the maps. Further, the magnitude of the directed edges, which is defined by \(F_{ij}^{(k)}\), is compatible with the macroscopic evolution of the mobility shown in Fig. 2a. The edges corresponding to the three highest values of \(F_{ij}^{(k)}\) are (Barra do Ceará, Centro), (Vila Velha, Centro), and (Barra do Ceará, Meireles). Figure 3b shows the average shortest path length \(\langle D^{(k)}\rangle = [n(n-1)]^{-1}\sum _{ij(i\ne j)} D_{ij}^{(k)}\) for each week k of the studied period. As compared with the values observed before the restriction measures, we observe an increase of \(\approx 30\)% in \(\langle D^{(k)}\rangle\) during the social isolation and lockdown periods, due to the substantial decrease in the number of rides in Fortaleza. Such a result indicates that, on average, a hypothetical outbreak would take longer to reach all neighborhoods if the regular urban mobility was similar to the one found during the restriction measures. In the same time interval, the density \(\eta ^{(k)} = m/[n(n-1)]\), where m is the number of edges in the layer k, decreased as shown in the inset of Fig. 3b. Since n is constant, such behavior implies missing edges during this period, which is consistent with the rise of the \(\langle D^{(k)}\rangle\). In Fig. 3c, the average of the in-degrees \(\langle K^{(k)}\rangle\) exhibits a behavior similar to \(\eta ^{(k)}\), indicating a decrease in the probability of mixing the populations among neighborhoods during the restriction measures. All metrics tend to eventually return to their original values, i.e., before the restriction period.
Shortest path distributions
The effective distance distributions during weeks 3, 10, and 17 from two particular IOLs, Centro and Castelão, to all other neighborhoods are shown in Fig. 4. In Fig. 4a,b, the distributions are represented in a polar coordinate system, where the radius stands for the shortest path, \(D_{ij}^{(k)}\), and the angle stands for a random number between 0 and \(2\pi\), for better visualization. We observe that \(D_{ij}^{(k)}\) and \(d_{ij}^{(k)}\) do not exhibit linear correlation, i.e., a comparatively lower shortest path \(D_{ij}^{(k)}\) between two neighborhoods does not necessarily imply that they are closer in the geographic space. As shown in Fig. 4c,d, we can also describe these distributions through the Kernel Density Estimation (KDE), defining an estimator \(g(D_{ij}^{(k)})\) with a Gaussian kernel (see Methods). In both cases, we observe that \(g(D_{ij}^{(k)})\) shifts to the right from week 3–10, indicating that there is, on average, an increase in \(D_{ij}^{(k)}\), and shifts back to the left from week 10–17, partially recovering the urban mobility pattern observed in week 3. We find similar behaviors considering all other neighborhoods as IOLs, even in the cases where \(g(D_{ij}^{(k)})\) has a bimodal-like shape, as shown in Fig. 4d. This shows that the restriction measures effectively resulted in the increase of the shortest path between the neighborhoods of Fortaleza.
Spearman’s correlation of the closeness centrality
As the network is a weakly connected graph for each temporal layer k, we calculate the time evolution of the closeness centrality, \(C_i^{(k)}\), for all neighborhoods i (see Methods). In Fig. 5a., we observe that the values of \(C_i^{(k)}\) were synchronously affected by the restriction measures, which resembles the \(Z^{(k)}\) profile in Fig. 2a. For such metric, the most central neighborhood is Centro (downtown), while the least central is Sabiaguaba. Figure 5b shows the time evolution of the Spearman’s rank correlation coefficient31, \(\rho ^{(k)}\), between the sets \(C_i^{(k)}\) and \(C_i^{(1)}\) (see Methods). We find that the \(C_i^{(k)}\) lines rarely cross each other since \(\rho ^{(k)} > 0.92\) in all layers. The inset shows the linear relation between \(C_i^{(1)}\) and \(C_i^{(10)}\) (first week of lockdown) with a slope coefficient \(a \approx 0.75\) (\(r^2 \approx 0.90\)). Here, we use Nadaraya–Watson estimator to validate the linear relation between \(C^{(k)}\) and \(C^{(1)}\). We believe that these results are supporting evidence that the restriction measures were not only synchronous but also geographically homogeneous in Fortaleza. We conjecture that this can be explained in terms of the stability in the commuting patterns, i.e., the restriction measures can increase the shortest path among the neighborhoods, but their proportional commuting patterns remain the same since there are strong socioeconomic ties between people and places within a city32. Furthermore, we also find that neighborhoods with high values of \(C_i\) tend to exhibit unimodal-like shapes in \(g(D_{ij}^{(k)})\) profiles, while the ones with low values tend to exhibit bimodal-like shapes.
Arrival time and closeness centrality
As shown in Fig. 6, the disease arrival time \(T_{a}\) increases linearly with the average rank position of the closeness centrality, \(\langle R_{i}\rangle\), calculated for all layers. This linear relation is characterized by the slope coefficient \(a \approx 0.24\) (\(r^2 \approx 0.41\)). Again, we use Nadaraya–Watson estimator to validate the linear relation between \(T_{a}\) and \(\langle R_{i}\rangle\). Such result reveals that the more neighborhoods are effectively central in the mobility network, the faster the disease arrives at them. Consequently, as already shown in Fig. 2d, these neighborhoods also produce the highest number of reported cases throughout the investigated epidemic period.
Conclusions
We have proposed a data-driven model to understand the relation between urban mobility and the spreading of COVID-19 in the city of Fortaleza, Ceará, Brazil. Our results show that the shortest bus rides within the city decreased \(\approx 25\)% more than the longest ones during the first epidemic wave of reported cases, i.e., from March to December 2020. Such a finding is the opposite of what has been observed at the intercity scale10. Applying the proposed model, we found that mobility changes among the neighborhoods are synchronous since their closeness centrality curves have a similar shape during the studied period. They can also be considered geographically homogeneous because there are almost no crossings among these curves. Both behaviors are also uncommon to see on a larger geographic scale - the former due to different stages of local outbreaks and the latter due to the invariance of the commuting patterns within a city, which can be explained by the fact that there are strong socioeconomic ties between people and places. Finally, we found that the most central neighborhoods in the mobility network were not only the primary focuses of the COVID-19 outbreak but also the top neighborhoods in the number of reported cases at the end of epidemic wave. We emphasize that our mobility dataset is based on public bus transport only. However, as explained in SI, it is a good proxy for the total urban mobility of Fortaleza. Another limitation that can be pointed out in our study is that there was indeed a sub-notification of reported cases of COVID-19 in Fortaleza. However, our results suggest the existing bias is not strong enough to change our conclusions. As a perspective, the development of dynamic models could improve the understanding of the studied phenomenon at the intra-city scale. Despite the leading role of massive vaccination campaigns in controlling outbreaks of respiratory communicable diseases and the large number of studies published in the last three years regarding the use of Non-Pharmaceutical Interventions (NPIs) to mitigate the spread of COVID-19, more research is still needed to improve public policies based on NPIs, mainly for situations that, for some reason, there are no vaccines available, which usually happen in the least developed and developing countries. Historically, most local outbreaks in Fortaleza were controlled through massive vaccination campaigns, e.g., in the cases of H1N1 and measles33,34. In such cases, where both were vaccine-preventable diseases, the main interventions were the characterization of transmission chains to interrupt the spread (especially in the case of measles), and large-scale vaccination, which included an active search for unvaccinated individuals. Furthermore, focused quarantines were only recommended when outbreaks were in institutions such as schools, prisons, and churches. We believe that these results may unveil new strategies to impose restriction measures in the cities guiding decision-makers with smart actions in public health policies, as well as supporting future research on urban mobility and epidemiology.
Methods
Datasets
Population
We use the 2010 census population provided by the Brazilian Institute of Geography and Statistics (IBGE)35 to define the number of inhabitants from the 119 neighborhoods of the city of Fortaleza, Ceará, Brazil.
Reported cases and deaths of COVID-19
We collect the number of reported cases and deaths of COVID-19 from Fortaleza in IntegraSUS26, the official repository of the Ceará State Government, and make them available (see Data Availability). For both, the file structure is similar: each row represents the number of reported cases or deaths of COVID-19 on a specific date for all neighborhoods. We emphasize that such a dataset is retrospectively updated and corrected.
Human development index
The Fortaleza City Hall made available the Human Development Index (HDI) by neighborhood. (see Data Availability)
Urban mobility
The Fortaleza City Hall made available a set of 39 weekly Origin-Destination (OD) matrices \(\{M\}\), based on bus validations during weekdays, among all neighborhoods of Fortaleza (see Data Availability). For each week k, \(M_{ij}^{(k)}\) represents the average of the number of individuals that commute from the neighborhood i to the neighborhood j, i.e., \(M_{ij}^{(k)}\) only considers one-way rides. The period of the data ranges from March to December 2020. We emphasize that matrices M are identical to matrices F, except for the main diagonals, since \(M_{ii}^{(k)}\) stands for the number of intra-neighborhood rides, while \(F_{ii}^{(k)} = N_i - \sum _{j(i\ne j)} F_{ij}^{(k)}\) stands for the number of inhabitants that remain in the neighborhood i. Finally, we also note that the neighborhood Vila Ellery has no entries in M during the studied period. For this reason, we remove it from the entire analysis.
The model
We investigate the effects of the restriction measures on the urban mobility of Fortaleza by representing the data in terms of a multiplex network36 with 39 layers, each corresponding to a week during the period from March to December 2020. Precisely, for each week k, the vertices i stand for the \(n=118\) neighborhoods of Fortaleza, while the weighted directed edges \((i,j,w_{ij}^{(k)})\) stand for the commuting patterns from neighborhoods i to others j. The weight \(w_{ij}^{(k)}\) is given by16:
where \(F_{ij}^{(k)}\) is the average daily number of individuals that ride from i to j at week k, and \(N_i\) is the population of i. We emphasize that \(F_{ij}^{(k)}\) is estimated from the public urban transportation using a dataset with more than 107 million bus validations (see Datasets). Furthermore, the main diagonal elements of the matrices \(F^{(k)}\) are set as \(F_{ii}^{(k)} = N_i - \sum _{j(i\ne j)} F_{ij}^{(k)}\). Figure 7a,b show a schematic representation of the geographic distance \(d_{ij}\) between the centroids of the neighborhoods i and j, as well as the shortest path (or the effective distance) \(D_{ij}^{(k)}\) from i to j at the week k, respectively. The distance \(d_{ij}\) is calculated through the Haversine formula37, while \(D_{ij}^{(k)}\) is calculated through the Dijkstra’s algorithm38. As highlighted in light red in Fig. 7, a neighborhood j which is geographically far away from neighborhood i may, however, be effectively close to it (low \(D_{ij}^{(k)}\)), i.e., may have high commuting levels with i.
Closeness centrality
As all layers of the proposed multiplex network are directed subgraphs, the closeness centrality \(C_i^{(k)}\) of the neighborhood i in the layer k is given by36,
where \(n-1\) is the number of neighborhoods j that reach i, and \(D_{ji}^{(k)}\) is the shortest path from j to i, i.e., \(D_{ji}^{(k)}\) is the inward distance of i.
Spearman’s rank correlation
The Spearman’s rank correlation coefficient31, \(\rho _i^{(k)}\), between \(C_i^{(k)}\) and \(C_i^{(0)}\) is calculated as follows:
where \(R_i^{(k)}\) is the ranking of \(C_i^{(k)}\), \(\text {cov}\) is the covariance and \(\sigma\) is the standard deviation.
Nadaraya–Watson estimator
The Nadaraya–Watson (NW) estimator is defined by the following kernel smoother function29,30:
where x is the evaluation point, N is the number of points of the data distribution \(\{X_i,Y_i\}\), \(K_h(x-X_i) = \exp [(x-X_i)^2/(2h^2)]\) is a Gaussian kernel, and h is the bandwidth. We estimate the bandwidth h by the least squares cross-validation method39,40. We compute the 95% (\(\alpha = 0.05\)) Confidence Intervals (CIs) over 500 random bootstrapping samples with replacement through the so-called \(\alpha /2\) quantile function.
Kernel density estimator
The Kernel Density Estimation (KDE) is a non-parametric method to estimate the probability density function based on kernel smoothing41,42. Given a data distribution \(\{X_i\}\) with N points, the estimator is calculated as follows:
where x and \(K_h(x-X_i)\) are defined in a way similar to the NW estimator. On the other hand, the bandwidth h is defined by Scott’s rule (\(h=N ^{-1/5}\)).
Data availability
The data that support the findings of this study are available in Zenodo with the identifier https://doi.org/10.5281/zenodo.7655082.
References
Ribeiro, H. V., Sunahara, A. S., Sutton, J., Perc, M. & Hanley, Q. S. City size and the spreading of COVID-19 in Brazil. PLoS ONE 15(9), e0239699. https://doi.org/10.1371/journal.pone.0239699 (2020).
Fosco, C. & Zurita, F. Assessing the short-run effects of lockdown policies on economic activity, with an application to the Santiago Metropolitan Region, Chile. PLoS ONE 16(6), e0252938. https://doi.org/10.1371/journal.pone.0252938 (2021).
Guidotti, E. A worldwide epidemiological database for COVID-19 at fine-grained spatial resolution. Sci Datahttps://doi.org/10.1038/s41597-022-01245-1 (2022).
Pepe, E. et al. COVID-19 outbreak response, a dataset to assess mobility changes in Italy following national lockdown. Sci Datahttps://doi.org/10.1038/s41597-020-00575-2 (2020).
Engle, S., Stromme, J. & Zhou, A. Staying at home: Mobility effects of COVID-19. SSRNhttps://doi.org/10.2139/ssrn.3565703 (2020).
Melo, H. P. M. et al. Heterogeneous impact of a lockdown on inter-municipality mobility. Phys. Rev. Res. 3, 013032. https://doi.org/10.1103/PhysRevResearch.3.013032 (2021).
Kraemer, M. U. G. et al. The effect of human mobility and control measures on the COVID-19 epidemic in China. Science 368(6490), 493–497. https://doi.org/10.1126/science.abb4218 (2020).
Coccia, M. The relation between length of lockdown, numbers of infected people and deaths of COVID-19, and economic growth of countries: Lessons learned to cope with future pandemics similar to COVID-19 and to constrain the deterioration of economic system. Sci. Total Environ. 775, 145801. https://doi.org/10.1016/j.scitotenv.2021.145801 (2021).
Eyawo, O., Viens, A. M. & Ugoji, U. C. Lockdowns and low- and middle-income countries: Building a feasible, effective, and ethical COVID-19 response strategy. Glob. Healthhttps://doi.org/10.1186/s12992-021-00662-y (2021).
Schlosser, F. et al. COVID-19 lockdown induces disease-mitigating structural changes in mobility networks. Proc. Natl. Acad. Sci. USA 117(52), 32883–32890. https://doi.org/10.1073/pnas.2012326117 (2020).
Wu, J. T., Leung, K. & Leung, G. M. Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: A modelling study. Lancet 395(10225), 689–697. https://doi.org/10.1016/S0140-6736(20)30260-9 (2020).
Block, P. et al. Social network-based distancing strategies to flatten the COVID-19 curve in a post-lockdown world. Nat. Hum. Behav. 4(6), 588–596. https://doi.org/10.1038/s41562-020-0898-6 (2020).
Fang, H., Wang, L. & Yang, Y. Human mobility restrictions and the spread of the novel coronavirus (2019-nCoV) in China. J. Public Econ. 191, 104272. https://doi.org/10.1016/j.jpubeco.2020.104272 (2020).
Gatalo, O., Tseng, K., Hamilton, A., Lin, G. & Klein, E. Associations between phone mobility data and COVID-19 cases. Lancet Infect. Dis. 21(5), e111. https://doi.org/10.1016/S1473-3099(20)30725-8 (2021).
Badr, H. S. et al. Association between mobility patterns and COVID-19 transmission in the USA: A mathematical modelling study. Lancet Infect. Dis. 20(11), 1247–1254. https://doi.org/10.1016/S1473-3099(20)30553-3 (2020).
Brockmann, D. & Helbing, D. The hidden geometry of complex, network-driven contagion phenomena. Science 342(6164), 1337–1342. https://doi.org/10.1126/science.1245200 (2013).
Ponte, C. et al. Tracing contacts to evaluate the transmission of COVID-19 from highly exposed individuals in public transportation. Sci. Rep. 11(1), 1–11. https://doi.org/10.1038/s41598-021-03998-y (2021).
Bomfim, R. et al. Predicting dengue outbreaks at neighbourhood level using human mobility in urban areas. J. R. Soc. Interface 17(171), 20200691. https://doi.org/10.1098/rsif.2020.0691 (2020).
Costa, G. S., Cota, W. & Ferreira, S. C. Outbreak diversity in epidemic waves propagating through distinct geographical scales. Phys. Rev. Res. 2(4), 043306. https://doi.org/10.1103/PhysRevResearch.2.043306 (2020).
Arenas, A. et al. Modeling the spatiotemporal epidemic spreading of COVID-19 and the impact of mobility and social distancing interventions. Phys. Rev X. 10(4), 041055. https://doi.org/10.1103/PhysRevX.10.041055 (2020).
Serafino, M. et al. Digital contact tracing and network theory to stop the spread of COVID-19 using big-data on human mobility geolocalization. PLoS Comput. Biol. 18(4), e1009865. https://doi.org/10.1371/journal.pcbi.1009865 (2022).
Projeção da população do Brasil e das Unidades da Federação. Instituto Brasileiro de Geografia e Estatistica (IBGE). (2022). https://www.ibge.gov.br/apps/populacao/projecao/.
Coronavírus: Brasil confirma primeiro caso da doença. (2020). https://www.unasus.gov.br/noticia/coronavirus-brasil-confirma-primeiro-caso-da-doenca/.
Cota, W. Monitoring the number of COVID-19 cases and deaths in Brazil at municipal and federative units level. SciELO Preprints.https://doi.org/10.1590/SciELOPreprints.362 (2020).
Boletins Epidemiológicos. Prefeitura de Fortaleza. (2023). https://coronavirus.fortaleza.ce.gov.br/boletim-epidemiologico.html.
Secretaria de Saúde do Estado do Ceará. IntegraSUS. (2022). https://integrasus.saude.ce.gov.br.
Gozzi, N. et al. Estimating the effect of social inequalities on the mitigation of COVID-19 across communities in Santiago de Chile. Nat. Commun. 12, 2429. https://doi.org/10.1038/s41467-021-22601-6 (2021).
Laetitia, G. et al. Socio-economic determinants of mobility responses during the first wave of COVID-19 in Italy: From provinces to neighbourhoods. J. R. Soc. Interface. 18, 20210092. https://doi.org/10.1098/rsif.2021.0092 (2021).
Nadaraya, E. On estimating regression. Theor. Probab. Appl. 9(1), 141–142. https://doi.org/10.1137/1109020 (1964).
Watson, G. S. Smooth regression analysis. Sankhya Ser. A. 26(4), 359–372 (1964).
Kutner, M. H., Nachtsheim, C. J., Neter, J. & Li W. Inferences in regression and correlation analysis. In Applied Linear Statistical Models. 5th ed. McGraw-Hill/Irvin (2005).
Barbosa, H. et al. Uncovering the socioeconomic facets of human mobility. Sci. Rep. 11(1), 1–13. https://doi.org/10.1038/s41598-021-87407-4 (2021).
Pires, N. R. et al. Pandemic influenza A (H1N1) 2009: Epidemiological analysis of cases in a tropical/semi-arid region of Brazil. Rev. Soc. Bras. Med. Trop. 46(2), 141–6. https://doi.org/10.1590/0037-8682-0016-2012 (2013).
Leite, R. D., Barreto, J. L. & Sousa, A. Q. Measles Reemergence in Ceará, Northeast Brazil, 15 years after elimination. Emerg. Infect. Dis. 21(9), 1681–3. https://doi.org/10.3201/eid2109.150391 (2015).
Censo 2010. Instituto Brasileiro de Geografia e Estatistica (IBGE). [cited 2022 Oct 1]. https://censo2010.ibge.gov.br.
Newman, M. E. J. Networks: An Introduction 2nd edn. (Oxford University Press, New York, 2018).
Snyder, J. P. Map Projections—A Working Manual 3rd edn. (United States Government Printing Office, 1994).
Dijkstra, E. W. A note on two problems in connexion with graphs. Numer. Math. 1(1), 269–271. https://doi.org/10.1007/BF01386390 (1959).
Racine, J. & Li, Q. Nonparametric estimation of regression functions with both categorical and continuous data. J. Econom. 119(1), 99–130. https://doi.org/10.1016/S0304-4076(03)00157-X (2004).
Li, Q. & Racine, J. Cross-validated local linear nonparametric regression. Stat. Sin. 14(2), 485–512 (2004).
Scott, D. W. Multivariate Density Estimation: Theory, Practice, and Visualization 2nd edn. (Wiley, 2015).
Gramacki, A. Nonparametric Kernel Density Estimation and its Computational Aspects 1st edn, Vol. 37 (Springer, 2018).
Acknowledgements
We gratefully acknowledge CNPq, CAPES, FUNCAP, the National Institute of Science and Technology for Complex Systems in Brazil and the Edson Queiroz Foundation for financial support.
Author information
Authors and Affiliations
Contributions
J.L.B.A., E.A.O., A.S.L., J.S.A., and V.F. designed research; J.L.B.A., E.A.O., A.S.L., J.S.A., and V.F. performed research; J.L.B.A., E.A.O., A.S.L., J.S.A., and V.F. analyzed data; and J.L.B.A., E.A.O., A.S.L., J.S.A., and V.F. wrote the paper. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Araújo, J.L.B., Oliveira, E.A., Lima Neto, A.S. et al. Unveiling the paths of COVID-19 in a large city based on public transportation data. Sci Rep 13, 5761 (2023). https://doi.org/10.1038/s41598-023-32786-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-023-32786-z
- Springer Nature Limited