1 Introduction

The COVID-19 pandemic has brought unforeseen and unpredictable effects in terms of both health and economics. With unprecedented research efforts that generated several articles tackling clinical presentation, preventive and treatment measures, and possible correlated and permanent effects of the disease, as well as various other articles exploring its economic effects (from micro to macroeconomic elements), researchers worldwide seek to contribute with actionable insight to tackle the challenges presented by the pandemic.

The State of São Paulo, a highly impacted, densely populated area that represents the economic core of Brazil, sought to follow in the footsteps of Asian, European, and other countries in the use of evidence to design policy intervention (OECD 2020; European Commission 2020; World Health Organization 2020; Oxford Martin School—Global Health Platform 2020; Haddad and Bugarin 2020). To better understand the evolution of the disease and its territorial economic impacts due to social distancing measures, the State government established a set of parameters to manage the crisis. This endeavor was tackled alongside a network of researchers and qualified public sector personnel and ultimately led to the institutionalization of two consulting bodies—one composed of health professionals (Contingency Centre) and another of economists (Economic Council)—to advise on necessary policy to mitigate the sanitary and economic effects of the disease.

Once the data and indicators used to monitor the disease were defined and officially announced, a new challenge arose: guaranteeing effective monitoring of economic indicators. The central argument is the need to pave the way for sound cost–benefit analysis between more flexible economic operations and the spread of the disease. The effectiveness of sanitary safeguard measures is bounded by the economic stress on households due to reduced income as a side product of guaranteeing health service attendance. With the São Paulo Plan,Footnote 1 economic activities started slowly taking up operations after a period of closure and suspension. However, there may be an imminent threat of disease spread with increased economic activity, which still needed to be quantified.

There is vast literature relating the demand for freight and passenger transport to economic activity (Boyce and Williams 2015). A usual question in project appraisal is whether one can predict the evolution of domestic demand for freight [and passenger] transport to a reasonably high level of explanation with a few readily available variables (Bennathan et al. 1992). As transport is a derived demand, its activity level, in general, relates to the output levels in an economy. Therefore, one of the critical elements in forecasting traffic demand models is the information on economic trends. Studies often find a close link between GDP and traffic levels, which becomes more evident when the elements making up the GDP forecasts are identified (Cole 2005; Ortúzar and Willumsen 2011).

More recently, a related strand of research has benefitted from the availability of detailed traffic-flow data based on sensor readings to reverse the question and look at ways of using traffic intensity metricsFootnote 2 to construct coincident and leading indicators of regional economic activity (van Ruth 2014; Hooijmaaijers 2017; Nolan 2019; Li et al. 2020). However, weights based on traffic flows measured taking into account only direct links involving the study region neglect the intricate structure of interregional input–output flows that may affect regional activity. Combining traffic intensity data in the whole network with measures of regional economic activity embedded in trade relations everywhere may enhance the accuracy of the indicator by accounting for the systemic nature of a region’s output (Sonis et al. 1996).

There are alternatives to the use of high-frequency data on sensor readings to address this issue. The Transportation Services Index (TSI), for instance, created by the US Department of Transportation (DOT), Bureau of Transportation Statistics (BTS), measures the monthly movement of freight and passengers, combining available data on freight traffic, as well as passenger travel (Lahiri et al. 2002). The monthly measurement of transportation services output at the national level has proved important to follow business cycles in the USA (Lahiri and Yao 2011). Nonetheless, its relatively low frequency, the time lag in its release, and aggregate geographical dimension impose limitations to its use as an indicator for timely monitoring of regional economic activity.

We add to this debate departing from a similar approach to van Ruth (2014), who has shown that it is possible to construct indicators of regional economic activity based on local traffic intensity data. In her monthly indicator of traffic intensity in the Eindhoven region, in the Netherlands, she convincingly showed the link between traffic intensity and economic activity at the regional level. However, the choice of the study region was based on its high specialization in manufacturing activities, justifying the procedure of computing daily averages over all sensors in the sample. Accordingly, such a procedure defined for a broader range of areas presenting different degrees of economic specialization could lead to a less clear correlation.

Therefore, we propose a novel indicator of regional economic activity that tackles some of the issues mentioned above. A good index should be simple, policy-relevant, reliable, and timely (National Research Council 2002). We rely on close-to-real-time big data sources (i.e., radar sensors) and structural economic statistics (i.e., input–output system), which represent useful economic concepts to create a set of indicators that allow early identification of significant economic changes. In doing that, we provide insight into economic activity, at a level of timeliness and granularity not possible for official economic statistics, achieving the desired goals pursued in modern economic activity monitoring projects (Nolan 2019). We then show the usefulness of the proposed indicator for policy evaluation in the context of a regionally heterogeneous plan designed to mitigate the economic effects of the COVID-19 pandemic in São Paulo, Brazil.Footnote 3

This article presents a novel approach to complementary crisis monitoring strategies. A link between high-frequency transportation flows (cargo and passengers) and sanitary results, through economic indicators, may shed light on possible future disease outbreaks and pave the way to a cost–benefit analysis. Section 2 lays out the mathematical model behind the empirical strategy for specifying and implementing our novel regional sectoral economic activity index and presents its results. The index combines input–output analysis and traffic data. As an example of potential applications of the proposed index, Sect. 3 then addresses the link between economic activity and the spread of COVID-19 in the context of the São Paulo Plan. The empirical results highlight the link between transportation flows within and between regions, economic activity disaggregated by tradable and non-tradable sectors, and the spread of the disease, providing the first systematic assessment of the Plan. The analysis reveals that the phased strategy pursued in São Paulo has been effective in reactivating economic activity while maintaining the adequate responsiveness of the healthcare system. Finally, we discuss applying the results in guiding policy design and possible implications for COVID-19 exit strategies pursued elsewhere.

2 Monitoring regional sectoral economic activity

2.1 Methodology

The purpose of this section is to present the general structure of the proposed index of regional economic activity that considers each region’s insertion into an integrated multi-sectoral interregional system. We consider there are n domestic regions, r = 1, …, n, and the rest of the world, row, which exhaust the space of the economy. Economic interactions take place inside and outside each region (intraregional, interregional, and international trade). In our multi-sectoral economy, there are j sectors, s = 1, …, j, provided by n + 1 different sources.

We then assume we can measure, for each sector s in region r, the value-added contents embedded in trade flows associated with each regional origin–destination pair, such that we can complete the information in Table 1.

Table 1 Regional value-added in trade flows, by sector

According to Table 1, a region’s sectoral output is potentially associated with transactions involving economic agents located in the region and elsewhere. Define the sets of origins, O, and destinations, D, both comprising all domestic regions, r, and add the rest of the world, row. Thus, we can compute the total value-added of sector s in region r,\(va^{rs}\), as

$$va^{rs} = \mathop \sum \limits_{o \in O,d \in D} va_{o,d}^{r,s}$$
(1)

We also calculate the region’s r total value-added, \(va^{r}\), as

$$va^{r} = \mathop \sum \limits_{s} va^{rs}$$
(2)

In order to monitor sectoral, regional economic activity, we would need to follow \(va^{rs}\) over time. This information, when available, is usually published with a delay and at a low frequency (annual). To circumvent such informational constraints, we could track changes in trade flows for each regional origin–destination pair and combine them with the information in Table 1 to calculate a trade-weighted index of regional economic activity.

Thus, if we can observe, in each period t, changes in values of flows from each origin o, to each destination d, \({\Delta }F_{t,o,d}\), a regional index of economic activity could be calculated as

$$EAI_{t}^{r,s} = \mathop \sum \limits_{o \in O,d \in D} w_{o,d}^{r,s} {\Delta }F_{t,o,d}$$
(3)

where \(EAI_{t}^{r,s}\) is the economic activity index for sector s, in region r, in time t, and the weights \(w_{o,d}^{r,s}\) are calculated as

$$w_{o,d}^{r,s} = \frac{{va_{o,d}^{r,s} }}{{\mathop \sum \nolimits_{o \in O,d \in D} va_{o,d}^{r,s} }}$$
(4)

such that

$$\mathop \sum \limits_{o \in O,d \in D} w_{o,d}^{r,s} = 1$$
(5)

The implementation of the index depends on two pieces of information: first, we need to define an empirical strategy to estimate values in Table 1 so that we can define the weights, \(w_{o,d}^{r,s}\); second, we have to collect timely information to estimate \({\Delta }F_{t,o,d}\). In what follows, we describe our empirical approach to this problem.

2.2 Measurement of domestic value-added in trade flows

This sub-section reports on the results of an application with an interregional input–output matrix for Brazil,Footnote 4 which allows calculating the total value-added that is embodied in specific trade flows. This set of information generates estimates for Table 1. Thus, the concept of embedded value-added in this paper is defined within the input–output framework, which determines the value-added content via coefficients of sectoral value-added per monetary unit of a given sector.Footnote 5 The input–output system was developed as part of a technical cooperation initiative involving researchers from the Regional and Urban Economics Lab at the University of São Paulo (NEREUS), the Institute of Economic Research Foundation (FIPE), and the State of São Paulo Secretariat of Economic Development (FIPE-NEREUS 2020). A fully specified interregional input–output database was estimated for 2015, considering 67 sectors in 17 regions (Regional Health Departments) in the State of São Paulo, and a residual endogenous domestic region, the rest of Brazil. We estimate, for each flow originated in one of the Brazilian regions, measures of value-added in trade that are further used to calculate our index. The parsimonious approach proposed in Los et al. (2016), based on “hypothetical extraction,” serves as the methodological anchor.

The input–output model can be expressed by

$${\mathbf{x}} = {\mathbf{Ax}} + {\mathbf{f}}$$
(6)
$${\mathbf{x}} = \left( {{\mathbf{I}} - {\mathbf{A}}} \right)^{ - 1} {\mathbf{f}} = {\mathbf{Lf}}$$
(7)

where \({\mathbf{x}}\) and \({\mathbf{f}}\) are the vectors of gross output and final demand; \({\mathbf{A}}\) is a matrix with the input coefficients (\(a_{ij}\)); \({\mathbf{I}}\) is the identity matrix; and \({\mathbf{L}}\) is the Leontief inverse.

Considering a national interregional input–output model with n different regions and the ROW as a column vector in the final demand, (6) and (7) can be represented asFootnote 6

$$\left[ {\begin{array}{*{20}c} {{\mathbf{x}}^{1} } \\ \vdots \\ {{\mathbf{x}}^{{\text{n}}} } \\ \end{array} } \right] = \left[ {\begin{array}{*{20}c} {{\mathbf{A}}^{11} } & \cdots & {{\mathbf{A}}^{{1{\text{n}}}} } \\ \vdots & \ddots & \vdots \\ {{\mathbf{A}}^{{{\text{n}}1}} } & \cdots & {{\mathbf{A}}^{{{\text{nn}}}} } \\ \end{array} } \right]\left[ {\begin{array}{*{20}c} {{\mathbf{x}}^{1} } \\ \vdots \\ {{\mathbf{x}}^{{\text{n}}} } \\ \end{array} } \right] + \left[ {\begin{array}{*{20}c} {{\mathbf{f}}^{11} } & \cdots & {\begin{array}{*{20}c} {{\mathbf{f}}^{{1{\text{n}}}} } & {{\mathbf{f}}^{{1{\mathbf{row}}}} } \\ \end{array} } \\ \vdots & \ddots & {\begin{array}{*{20}c} { \vdots { }} & \vdots \\ \end{array} } \\ {{\mathbf{f}}^{{{\text{n}}1}} } & \cdots & {\begin{array}{*{20}c} {{\mathbf{f}}^{{{\text{nn}}}} } & {{\mathbf{f}}^{{{\mathbf{nrow}}}} } \\ \end{array} } \\ \end{array} } \right]{\mathbf{i}}$$
(8)
$$\left[ {\begin{array}{*{20}c} {{\mathbf{x}}^{1} } \\ \vdots \\ {{\mathbf{x}}^{{\text{n}}} } \\ \end{array} } \right] = \left\{ {\left[ {\begin{array}{*{20}c} {\mathbf{I}} & \cdots & 0 \\ \vdots & \ddots & \vdots \\ 0 & \cdots & {\mathbf{I}} \\ \end{array} } \right] - \left[ {\begin{array}{*{20}c} {{\mathbf{A}}^{11} } & \cdots & {{\mathbf{A}}^{{1{\text{n}}}} } \\ \vdots & \ddots & \vdots \\ {{\mathbf{A}}^{{{\text{n}}1}} } & \cdots & {{\mathbf{A}}^{{{\text{nn}}}} } \\ \end{array} } \right]} \right\}^{ - 1} \left[ {\begin{array}{*{20}c} {{\mathbf{f}}^{11} } & \cdots & {\begin{array}{*{20}c} {{\mathbf{f}}^{{1{\text{n}}}} } & {{\mathbf{f}}^{{1{\mathbf{row}}}} } \\ \end{array} } \\ \vdots & \ddots & {\begin{array}{*{20}c} { \vdots { }} & \vdots \\ \end{array} } \\ {{\mathbf{f}}^{{{\text{n}}1}} } & \cdots & {\begin{array}{*{20}c} {{\mathbf{f}}^{{{\text{nn}}}} } & {{\mathbf{f}}^{{{\mathbf{nrow}}}} } \\ \end{array} } \\ \end{array} } \right]{\mathbf{i}} = \left[ {\begin{array}{*{20}c} {{\mathbf{L}}^{11} } & \cdots & {{\mathbf{L}}^{{1{\text{n}}}} } \\ \vdots & \ddots & \vdots \\ {{\mathbf{L}}^{{{\text{n}}1}} } & \cdots & {{\mathbf{L}}^{{{\text{nn}}}} } \\ \end{array} } \right] \left[ {\begin{array}{*{20}{c}} {{{\mathbf{f}}^{11}}}& \cdots &{{\mathbf{f}}^{1{\text{n}}} }&{{{\mathbf{f}}^{1{\mathbf{row}}}}}\\ \vdots & \ddots & \vdots &{{\text{~}} \vdots }\\ {{{\mathbf{f}}^{{\text{n}}1}}}& \cdots &{{{\mathbf{f}}^{{\text{nn}}}}}&{{{\mathbf{f}}^{{\mathbf{nrow}}}}} \end{array}} \right]{\mathbf{i}}$$
(9)

where \({\mathbf{i}}\) is a column vector with all elements equal unity, which sums all elements in each of the n + 1 rows of the matrix \({\mathbf{f}}\).

Following Los et al. (2016), the value-added in region r (\(va^{r}\)) can be expressed as

$$va^{1} = {\mathbf{v}}_{1} \left( {{\mathbf{I}} - {\mathbf{A}}} \right)^{ - 1} {\mathbf{fi}}$$
(10)

where \({\mathbf{v}}_{1}\) is a row vector with ratios of value-added to gross output in the j sectors in region n as elements (\({\tilde{\mathbf{v}}}_{1}\)) and zeros elsewhere (\({\mathbf{v}}_{1} = \left[ {\begin{array}{*{20}c} {{\tilde{\mathbf{v}}}_{1} } & 0 \\ \end{array} } \right]\)); and \({\mathbf{i}}\) is a column vector in which all elements are unity.

In order to attribute the amount of domestic value-added in sales from region 1 to domestic region n, as proposed by Los et al. (2016), we consider a hypothetical world where region 1 does not sell anything to region n. In this case, the new value-added or hypothetical value-added can be represented by

$$va_{1,n}^{*} = {\mathbf{v}}_{1} \left( {{\mathbf{I}} - {\mathbf{A}}_{1,n}^{\user2{*}} } \right)^{ - 1} {\mathbf{f}}_{1,n}^{\user2{*}} {\mathbf{i}}$$
(11)

where \({\mathbf{A}}_{1,n}^{\user2{*}}\) and \({\mathbf{f}}_{1,n}^{\user2{*}}\) are the hypothetical matrix of input coefficients and final demand, respectively, expressed as

$${\mathbf{A}}_{r,t}^{\user2{*}} = \left[ {\begin{array}{*{20}c} {{\mathbf{A}}^{11} } & \cdots & 0 \\ \vdots & \ddots & \vdots \\ {{\mathbf{A}}^{{{\text{n}}1}} } & \cdots & {{\mathbf{A}}^{{{\text{nn}}}} } \\ \end{array} } \right]$$
(12)
$${\mathbf{f}}_{1,n}^* = \left[ {\begin{array}{*{20}{c}} {{{\mathbf{f}}^{11}}}& \cdots &0&{{{\mathbf{f}}^{1{\mathbf{row}}}}}\\ \vdots & \ddots & \vdots &{{\rm{~}} \vdots }\\ {{{\mathbf{f}}^{{\rm{n}}1}}}& \cdots &{{{\mathbf{f}}^{{\rm{nn}}}}}&{{{\mathbf{f}}^{{\mathbf{nrow}}}}} \end{array}} \right]$$
(13)

In addition, to attribute the amount of domestic value-added in exports from region 1 to the row, we consider a hypothetical world where region 1 does not export to the row. In this case, the hypothetical value-added can be represented as

$$va_{1,row}^{*} = {\mathbf{v}}_{1} \left( {{\mathbf{I}} - {\mathbf{A}}} \right)^{ - 1} {\mathbf{f}}_{1,row}^{\user2{*}} {\mathbf{i}}$$
(14)

where \({\mathbf{A}}\) is the original matrix with the input coefficients as in (10); and \({\mathbf{f}}_{1,row}^{\user2{*}}\) is the hypothetical matrix of final demand, expressed as

$${\mathbf{f}}_{1,row}^* = \left[ {\begin{array}{*{20}{c}} {{{\mathbf{f}}^{11}}}& \cdots &{{{\mathbf{f}}^{1{\rm{n}}}}}&0\\ \vdots & \ddots & \vdots & \vdots \\ {{{\mathbf{f}}^{{\rm{n}}1}}}& \cdots &{{{\mathbf{f}}^{{\rm{nn}}}}}&{{{\mathbf{f}}^{{\mathbf{nrow}}}}} \end{array}} \right]$$
(15)

From (10) and (11), we can define the regional value-added (across all sectors) in sales from region 1 to region n as follows:Footnote 7

$$va_{1,n}^{1, \bullet } = va^{1} - va_{1,n}^{*}$$
(16)

and, from (10) and (14), we can define regional value-added (across all sectors) from region 1 to the ROW as

$$va_{1,row}^{1, \bullet } = va^{1} - va_{1,row}^{*}$$
(17)

Similarly, we can attribute the regional value-added in transactions from region 1 to all regions (1, 2, …, n), and from each region to the n-regions (1, 2, …, n), including itself. We can also attribute the regional value-added from each region to the ROW. In this sense, in an interregional system with n regions and the ROW exogenous, we can compute regional value-added embedded in trade flows for the n domestic regions, as illustrated in Table 2.

Table 2 Domestic value-added in trade flows in an interregional system with exogenous ROW, by sector

2.3 Use of traffic intensity indicators as a measure of change in trade volumes

This sub-section describes how we calculate changes in traffic volumes during the pandemic using high-frequency traffic data from different sources in the State of São Paulo, including automated vehicle identification on toll stations and smart-camera records. These datasets allow us to track specific vehicles and infer their origin and destination in each trip that they make. We then aggregate these trips in order to calculate the total traffic flows of passenger and freight (light commercial vehicles and trucks) vehicles between all pairs of regions (Regional Health Departments) in each day of our analyses. The overall procedure consists of seven steps, which we detail next.

  1. 1.

    First, we extract data from automated vehicle identification from 341 toll stations located on inter-municipal highways in the State of São Paulo. In the year 2018, 600 million registers were collected in these toll stations. Each register includes a vehicle identifier, a toll station identifier, and a timestamp. Therefore, we can identify the exact moment when each vehicle crossed each toll station and calculate the trip duration between any two records. We can define the trip route by connecting consecutive records from the same vehicle, considering four parameters to join the legs: time between toll passages, inversion direction angle, accumulated distance, and rest time for truck drivers.

  2. 2.

    This dataset from toll stations is then combined with similar information from 4870 smart cameras located in urban areas of the state. These cameras recorded 3 billion vehicle registers during the year 2018. By combining both datasets, we can expand the temporospatial coverage of our tracking and be more precise about trip segments made within urban areas.

  3. 3.

    We aggregate and anonymize individual trips to guarantee travelers’ privacy based on the combined dataset of records from toll stations and smart cameras. Next, a gravitational distribution model, considering the fleet of cars and trucks as an attraction factor per zone, is used to estimate origins and destinations of aggregated trips by day and by pair of origin and destination. Given the spatial coverage of tolls and smart cameras, we can aggregate flows using 1,054 traffic zones in the state, including one zone for every 645 municipalities, 24 frontier zones, and 385 additional subdivisions to split larger cities and transportation hubs.

  4. 4.

    The origin and destination flows calculated in the previous step are then allocated to a highway simulation network and calibrated with expansion weights to match the average daily volume of vehicles from several other sources by the Transport Department of São Paulo.

  5. 5.

    The calibrated highway simulation network is a powerful tool to provide any transport engineering analysis, but it is static when considering the average flows of a year. Therefore, traffic variations from 451 monitoring stations (toll plazas, Service Level—LOS, fixed radar speed system) are used to update the highway simulation network and its matrices of passenger and freight vehicles.

  6. 6.

    The updated information is made available to decision-makers through an online Business Intelligence Dashboard System. The information extracted from these dashboards is the aggregated traffic flows by day and by pair of regions both by passenger and freight vehicles. This level of aggregation is the same one used in our integrated framework of interregional trade flows as described in the previous section.

  7. 7.

    For tracking the traffic flow development during the pandemic, each dataset is updated within the monitoring system according to its technological limitations. For example, data from toll stations are updated every six hours, and data from monitoring equipment of LOS were updated every three days.

2.4 Descriptive results

Based on Eq. (3), the EAI was calculated from March 29 to August 1, 2020.Footnote 8 For each day, the index was also calculated for each of the 17 Regional Health Departments (DRS in Portuguese) and two groupings of sectors (tradable and non-tradable) and the State total.Footnote 9 Changes in values of flows from each origin o to each destination d, \({\Delta }F_{t,o,d}\), reflect rolling-seven-day averages to take into account weekly seasonality.

The choice of DRS to define the regional setting is particularly relevant for the COVID-19 crisis monitoring in the State of São Paulo. Each DRS is responsible, among other things, for investment planning, monitoring and advertising of health analyses and indicators, epidemiological and risk analysis, and control of the application of state and federal resources allocated to the health system. The use of a common regionalization also provides a cognitive alignment between health scientists and economists on the analysis of regional processes. Table 3 presents basic socioeconomic indicators for the 17 DRS in the state. It is noteworthy the relevance of the São Paulo Metropolitan Region, associated with DRS I, that hosts 47.4% of the state population and is responsible for 53.8% of its GDP.

Table 3 Number of Municipalities, GDP, and Population: State of São Paulo, by DRS-region (2017.)

The average value for the total EAI was 50.27% on March 29, indicating that, on average, economic activity in São Paulo was 49.73 percentage points below the pre-crisis level.Footnote 10 At the end of the period (August 1), the State economy had already recovered 33.21 percentage points, presenting an EAI equal to 83.48%. Figure 1 plots the average EAI for the State by different sectors.Footnote 11 Economic activities faced different degrees of exposition to confinement measures after the initial shutting down of non-essential businesses in March. Non-tradable sectors (i.e., services) were the most affected, presenting an EAI equal to 42.33% (57.67 p.p. below the pre-crisis level), while tradable sectors seemed economically less vulnerable the pandemic as the EAI reached 80.44% on March 29.

Fig. 1
figure 1

Economic Activity Index: State of São Paulo (March 29—August 1, 2020)

Sectors and regions were affected in different ways, with the main urbanized areas facing more substantial restrictions. Figure 2 shows the average EAI within each of the DRS, presenting different levels of economic vulnerability at the beginning of the pandemic. Initial economic losses were more heavily concentrated in the regions that most contribute to the state’s GRP, which coincide with the most densely populated areas. As we will see in the next section, more densely populated regions are also the main vector for promoting contamination amid the initial outbreak of coronavirus in São Paulo. The recovery rate was neither homogenous across regions. In Fig. 2, DRS are ranked from top to bottom according to the recovery speed, measured as the ratio of change in the EAI in the period and the initial distance from the pre-crisis level. Part of the difference is associated with the regionally differentiated flexibility measures contemplated in the São Paulo Plan, as we will make it more evident in the next section.

Fig. 2
figure 2

Economic Activity Index, by DRS (March 29 and August 1, 2020)

While the analysis of Fig. 2 reveals different levels of regional resilience based on a conditional convergence framework, we can also look at the evolution of the EAI in each region, comparing its initial level to its growth rate in the period. Figure 3 presents an apparent process of absolute convergence in the regional EAI, i.e., regional economies that have been hardest hit by the COVID-19 pandemic outbreak are those that presented the fastest recovery.

Fig. 3
figure 3

Regional Convergence of the Economic Activity Index, by DRS (March 29—August 1, 2020)

3 Economic activity and the spread of COVID-19

The relation between the uneven geography of the pandemic within countries and local economic activity has gained interest since the beginning of the outbreak. Pioneering studies relied on large-scale structural modeling to run ex-ante assessments based on simulations to look at short-term regional economic impacts of control strategies for mitigating the effects of coronavirus (Bonet et al. 2020; Capello and Caragliu 2021; Haddad et al. 2021). As the pandemic progressed and more information became available, other authors started looking at ex-post relationships between the sub-national geography of the COVID-19 pandemic spread and (i) urban density (Hamidi et al. 2020), (ii) the economic base of local economies (Ascani et al. 2021), (iii) local institutions (Rodríguez‐Pose and Burkina 2021), or (iv) regional connectivity (Bourdin et al. 2021).

Some attempts to elucidate the effects of different levels of economic activity on the spread of coronavirus in the context of exit strategies relied more commonly on applying different specifications of the susceptible/exposed/infective/recovered (SEIR) model. Using the SEIR model as their methodological anchor, authors estimated the COVID-19 epidemic parameters based on the data of reported infected cases and deaths in different parts of the world to predict the future epidemics of COVID-19 under different reopening policies (Chang et al. 2020; Liu et al. 2020; Yu et al. 2021). This section takes an alternative route to assess the COVID-19 exit strategy designed to gradually lift interventions introduced to control the outbreak in the State of São Paulo. We use our timely trade-based regional economic activity indicator (EAI) to circumvent the need for high-frequency regional economic information.

São Paulo is the State that has been hit the hardest in Brazil, with over 552,318 confirmed cases and 23,236 deaths through August 1, 2020 (Table 4).Footnote 12 On March 29, the State government issued an executive order closing all non-essential businesses to reduce the transmission of COVID-19. Sequentially, on May 29, a plan for gradually lifting restrictive measures on different economic sectors was crafted with representatives from the business community and public officials (health professionals, state-level secretaries, and mayors). To ensure the reopening would proceed smoothly and safely, decisions were to be driven by public health data. The São Paulo Plan defined a set of critical public health metrics,Footnote 13 which would determine if, when, and where it would be appropriate to proceed through reopening phases. Public health data trends indicating significant increases in viral transmission could even return to prior phases or closing sectors of the economy. A dashboard monitored by the Contingency Centre allows permanent updates of public health indicators. Before and during reopening, these indicators must continue to show progress. As the regional economies reopen, the state administration provides guidance that each sector, industry, and business must follow. Finally, the Plan also includes a testing and tracing strategy that took off only recently.

Table 4 Coronavirus Cases: State of São Paulo, by DRS (accumulated until August 1, 2020).

The São Paulo Plan divided the economic opening of each DRS according to a phase classification ranging from 1 (more restrictive) to 5 (less restrictive). Figure 4 shows the timeline of phase change in each DRS of the State. Initially, all DRS were classified on phase 1. On June 1, the first phase changes were enacted, with 11 DRS being moved to phase 2 and 4 DRS to phase 3. Next, the phase of each DRS was changed up or down according to their health indicators. On August 1, 3 DRS were classified on phase 1, 10 DRS on phase 2, and 4 DRS on phase 3.

Fig. 4
figure 4

Changes of Phases in Each DRS in the São Paulo Plan (May 29—August 1, 2020)

Therefore, the daily monitoring of health indicators in each DRS is critical for the sound implementation of the São Paulo Plan. However, the lack of timely, daily regional economic indicators precluded the broader monitoring of the Plan, as well as its proper evaluation. The use of the EAI provides a unique opportunity to close this gap. We can thus provide the first assessment of the Plan by combining the analysis of traffic, economic, and health indicators.

3.1 Empirical strategy

The objective of our empirical analyses is to provide an evaluation of the São Paulo Plan—in the first two months since its implementation—that considers both the effects of lifting interventions on the economic activity and the possible impact of easing restrictions on the spread of COVID-19. Specifically, we want to answer whether the changes of phases from the São Paulo Plan affected the COVID-19 spread through increased economic activity, measured by the proposed EAI.

Our empirical analysis will be separated into three steps:

  1. 1.

    Identify the effect of phase changes on the EAI.

  2. 2.

    Estimate the relationship between changes in the EAI and the spread of COVID-19 in the State.

  3. 3.

    Compute a counterfactual exercise comparing the EAI and COVID-19 spread observed in the period of our analysis against what would happen if all DRS remained with the most restrictive classification for the economic activity.

The main challenge for estimating the effect of the São Paulo Plan on the epidemic is the endogeneity of variables. Changes in the phase classifications of DRS are not exogenous. Instead, they are decided based on the evolution of the disease in each region and the regions’ health capacity to deal with it. Therefore, a simple estimation associating changes in classification and COVID-19 outcomes would likely lead to biased estimates of effects because regions with better health prospects are precisely the ones where economic restrictions are more likely to be removed. Given that, our empirical strategy tries to overcome this challenge by defining a structural path of causality. We first isolate the effect of phase changes on economic activity, assuming that parallel trends hold for this relation. Next, we assume that there is no reverse causality between COVID-19 outcomes and the EAI of regions.

3.1.1 The effect of phase changes on the EAI

For the first question, that is, the effect of phase changes on the economic activity, we estimate a fixed-effect model described by the following equation:

$$EAI_{t,i} = \alpha_{i} + \tau_{d} + \beta_{2} F2_{t,i} + \beta_{3} F3_{t,i} + \varepsilon_{t,i}$$
(18)

where \(EAI_{t,i}\) is the economic activity index of region \(i\) in date \(t\). The fixed effects of the model are \(\alpha_{i}\) that captures the region-specific means of the index, \(\tau_{d}\) that controls for the evolution of the index regardless of phase changes. Including this latter set of controls is critical: it accounts for the fact that the economic activity was increasing in the different regions even without the official ease of economic restrictions. Finally, \(F2_{t,i}\) and \(F3_{t,i}\) are dummy variables that indicate if region \(i\) is classified on either phase 2 or 3 on date \(t\). Therefore, coefficients \(\beta_{2}\) and \(\beta_{3}\) represent the average marginal effect of regions classified on phases 2 and 3 of the São Paulo Plan.

Table 5 presents the results of this estimation. The dependent variable in columns 1–3 is the different EAI versions (tradables, non-tradables, and total). In columns 4 and 5, the equation is estimated using internal regional mobility as the dependent variable instead of the EAI.Footnote 14 The coefficients can be interpreted as the level effect of each phase compared to the level that would be expected if regions remained on phase 1. For example, the coefficient of 1.110 for phase 2 on total EAI indicates that moving from phase 1 to phase 2 leads to an average increase of 1.110 percentage points on total EAI. Results from all specifications are consistent, showing positive and significant effects of phase 2 and phase 3 in all indicators. While the effects for both phases are positive, they are larger for phase 3, which is an expected result since the economic flexibility is higher in phases with higher classification. Moreover, the effects of moving to less restricted phases are stronger to non-tradable activities.

Table 5 Regression results: effect of phase change on EAI and mobility

3.1.2 The effect of EAI on COVID-19

In the second step of our empirical analysis, we estimate the average effect of changes in the EAI on COVID-19 outcomes. Specifically, we estimate the following two specifications:

$$y_{t,i} = \left( {\mathop \sum \limits_{q} \gamma_{q} S_{q,t - 7,i} } \right) + \phi EAI_{{t - t^{^{\prime}} ,i}} + \epsilon_{t,i}$$
(19)
$$y_{t,i} = \left( {\mathop \sum \limits_{q} \gamma_{q} S_{q,t - 7,i} } \right) + \left( {\mathop \sum \limits_{q} \phi_{q} S_{{q,t - t^{^{\prime}} ,i}} EAI_{{t - t^{^{\prime}} ,i}} } \right) + \epsilon_{t,i}$$
(20)

where \(y_{t,i}\) is a health variable measured in log per capita terms. Our preferred specification focuses our analysis on COVID-19-associated hospitalizations as this variable is less subject to measurement error and less likely to be under-notified if compared to COVID-19 cases or even deaths.Footnote 15 Researchers are looking at the daily hospitalization rates attributed to COVID-19 to help guide the planning and prioritization of healthcare system resources (Garg et al. 2020). The first summation term on the right-hand side of both equations controls for the stage of the disease spread in each region over time. The term \(S_{q,t,i}\) is a vector of dummies representing bins of the accumulated number of deaths in each DRS the week before each observation.Footnote 16 This vector flexibly controls for the nonlinear pattern of the epidemiological evolution of the dependent variable. The main coefficient of interest is \(\phi\) which estimates the marginal effect of lagged changes in the EAI on the dependent variable. The size of the lag term \(t^{^{\prime}}\) depends on the epidemiological characteristics of the dependent variable. Because we are focusing on COVID-19 hospitalizations, we use \(t^{^{\prime}}\) equal to 12, representing the virus average estimated infection-to-death period of 24 days (Flaxman et al. 2020) minus the average period between mechanical ventilation and death that is observed in the State of São Paulo individual-level data of COVID-associated deaths. The difference between the two specifications is associated with coefficient \(\phi\). In Eq. 19, only a single average effect of lagged EAI is estimated. Meanwhile, Eq. 20 allows the EAI effect to varying according to the epidemic stage during the lagged period.

Table 6 presents the result of these estimations. Column (1) indicates that a one percentage point increase in EAI is, on average, associated with a 0.5% increase in the number of hospitalizations per capita after 12 days. Meanwhile, results from column (2) suggest that EAI effects are higher in the earlier stages of the pandemic. When deaths per million are below 100, a one percentage point is associated with a 0.6% increase in hospitalizations 12 days later. That effect decreases to 0.4% when COVID-19-associated deaths per million are between 300 and 400, and it becomes statistically non-significant above that level.

Table 6 Regression results: effect of EAI changes on COVID-19 hospitalization rates

3.1.3 Calculating the effect of the São Paulo Plan on COVID-19 hospitalizations

The final step of our empirical framework combines the results of the previous two estimations to calculate the effect of the São Paulo Plan on COVID-19-associated hospitalizations in the period of our analysis. The calculation is based on a counterfactual analysis that uses the results of the previous two subsections to estimate total hospitalizations under two different scenarios: (A) the observed scenario and phase changes that the São Paulo Plan adopted; (B) an alternative scenario where all DRS remained under the most extreme economic restrictions (phase 1) throughout the whole period. By comparing the estimated number of hospitalizations in both scenarios, we estimate the impacts of the plan.

We start this procedure by calculating a counterfactual EAI for each region in scenario B (\(\widehat{IAE}_{t,i}\)). This calculation can be described by Eq. 21:

$$\widehat{EAI}_{t,i} = EAI_{t,i} - \hat{\beta }_{2} F2_{t,i} - \hat{\beta }_{3} F3_{t,i}$$
(21)

The terms \(\hat{\beta }_{2}\) and \(\hat{\beta }_{3}\) are the results from Model (3) presented in Table 5 and correspond to the estimated effect phase changes on total EAI. Notice that if a region is already observed in phase 1 in scenario A, its EAI will be the same in both scenarios. Figure 5 shows the aggregated EAI for the State of São Paulo for both scenarios. The EAI is the same in both scenarios until the beginning of June, as all DRS were under the most restrictive classifications. With the beginning of changes in classifications in June, the difference between scenarios starts to show up, with slightly lower total EAI for the scenario where all regions are assumed to remain in phase 1. The difference between the scenarios increases over time as more regions are classified into more flexible phases. However, even on the last date of our comparison, the difference between the scenarios is only 1.93 percentage points.

Fig. 5
figure 5

Observed and Counterfactual EAI for the State of São Paulo

Next, we use these simulated and observed EAI to project the number of hospitalizations in each DRS on each date using the results from Table 6. Specifically, we calculate:

$$\begin{gathered} \hat{y}_{i,t} = \left( {\mathop \sum \limits_{q} \hat{\gamma }_{q,t - 7,i} S_{q,t,i} } \right) + \hat{\phi }EAI_{{t - t^{^{\prime}} ,i}} \hfill \\ \widehat{{\hat{y}}}_{i,t} = \left( {\mathop \sum \limits_{q} \hat{\gamma }_{q,t - 7,i} S_{q,t,i} } \right) + \hat{\phi }\widehat{EAI}_{{t - t^{^{\prime}} ,i}} \hfill \\ \end{gathered}$$
(22)

That is, \(\hat{y}_{i,t}\) corresponds to the fitted log of hospitalizations per capita in scenario A, and \(\widehat{{\hat{y}}}_{i,t}\) is the corresponding value from scenario B. We can use \(\widehat{{\hat{y}}}_{i,t}\) to \(\hat{y}_{i,t}\) to project the total number of hospitalizations in each scenario. We run this calculation twofold: first assuming a single average effect of lagged EAI on new hospitalizations as reported on Table 6, Column (1); next, we allow the effect of lagged EAI to vary according to the stage of the pandemic using the coefficients from Column (2). In the first case, the number of hospitalizations in the counterfactual scenario would be 0.52% lower than the projected number using observed EAI. This difference could range between 0.15 and 0.95%, given a 95% confidence interval and the standard errors of all estimates involved in the simulations. The observed number of COVID-19 hospitalizations in the State of São Paulo between May 29 and August 1 was 119,461. Therefore, if we apply our main central effect to this total, we would have approximately 621 fewer new hospitalizations if no flexibility were adopted throughout the period. In the second case, where we allow more flexibility to the effect of EAI on hospitalizations, results are consistent. The simulated total number of hospitalizations would be 0.21% lower if economic restrictions were not released. This effect ranges from 0.04% to 0.59%, given a 95% confidence interval of all estimates. The central result corresponds to 250 fewer hospitalizations between May 29 and August 1.

3.1.4 Calculating the effect of the São Paulo Plan on EAI volatility

One additional objective of the São Paulo plan was to anchor expectations.Footnote 17 It is a communication tool that allows economic agents to temporally allocate their efforts in production and consumption, reducing informational uncertainty due to the COVID-19 pandemic. To assess the effectiveness of the Sao Paulo Plan as a mechanism of anchoring expectations, we have estimated the daily coefficient of variation of the rolling-7-day EAI (Fig. 6 below). This descriptive analysis indicates that the São Paulo Plan may have smoothened the series and worked as an anchor for expectations guiding economic activity.

Fig. 6
figure 6

Coefficient of Variation of Total EAI – (April, 21 – August 1, 2020)

4 Final discussion and possible extensions

COVID-19 crisis monitoring involves preparing to implement and roll back non-pharmaceutical interventions, including setting up expert committees to examine initial control measures and defining gradual relaxing of social restrictions (Petersen et al. 2020). Nonetheless, up against enormous uncertainties, combining timely tracking of epidemiological and socioeconomic indicators is fundamental for informing officials during the implementation of exit strategies.

We proposed a timely trade-based regional economic activity indicator (EAI) combining high-frequency truck and passenger vehicle traffic data with economic input–output flows. The EAI monitors daily sectoral economic activity in the Brazilian State of São Paulo. The use of this novel set of information, combined with COVID-19-associated hospitalizations, provided a first assessment of the São Paulo Plan, the COVID-19 exit strategy designed to gradually lifting interventions introduced to control the outbreak in the State. One of the contributions of this paper is to provide reliable estimates for the level of economic activity to evaluate the breakdown due to the COVID-19 in an environment of limited information from the economic and health side. Furthermore, since we implement short-run analysis (e.g., 60 first days), it is not necessary to consider if workers and people, in general, expected an improvement or worsening of the health conditions prevalent in the future in the regions and, thus, decreased their social exposure (affecting the economic activities).

In its first 60 days, we show that the phased strategy pursued in São Paulo has been effective in gradually reactivating economic activity pari passu to a relatively small number of additional hospitalizations associated with the flexibilization in phases. The timing of the reopening of non-essential activities has shown to be relevant to the impact on hospitalization rates increases, with more substantial effects during the ascending part of the pandemic curve. Thus, trends in the evolution of the pandemic should be part of the critical public health metrics monitored to determine smooth and safe reopening. Moreover, regional heterogeneity has proved to be an essential feature of the COVID-19 exit strategy. State coordination of preparedness efforts moving forward was an essential mechanism that considered the interconnectedness of the regional economies, which may be important at different spatial scales (Ruktanonchai et al. 2020). In Brazil, for instance, lack of national coordination in a strongly integrated interregional system may have contributed to complicating epidemiological assessments and public health efforts to curb the pandemic (Candido et al. 2020; Souza et al. 2020).

This study has significant limitations. The State of São Paulo is strongly connected with its neighboring states. According to the input–output data (FIPE-NEREUS 2020), interstate trade reaches almost fourfold international trade values involving the state economy. As the spread of diseases does not respect administrative borders (Ruktanonchai et al. 2020), the lack of coordinated nationwide measures to control the COVID-19 epidemic may have imposed limitations to the effectiveness of the São Paulo Plan.

Possible extensions of the EAI analysis include, but are not restricted to: (i) validation of the series’ trend in comparison with invoice records to guarantee robustness in portraying economic activity; (ii) control of effect on disease spread by the expansion of testing; (iii) EAI sensitivity to local characteristics (for instance, the composition of regional economic structures, population socioeconomic characteristics); and (iv) EAI and labor market effects.

This study is the first of a series sought to facilitate crisis monitoring under an economic lens. Using literature on regional economic analysis and high-frequency traffic data, the model derives a connection between economic activity and health impacts. Given a straightforward means of anchoring expectations, we shed light on the possibility of inducing a responsible suspension of restrictive measures while keeping virus spread at bay.