1 Introduction

Regions located along the border of economically integrating countries are highly exposed to this macro-institutional change and we address the question if this exposure translates into specific integration effects for the regions’ socio-economic development path. As borders are a natural barrier to economic interaction (Capello et al., 2018a), the dismantling of border impediments through economic integration may, on the one hand, improve their market access by shifting border regions from the country’s periphery to the heart of the newly formed economic block (e.g., Percoco, 2015). While this argument speaks in terms of positive EU integration effects in border vis-à-vis non-border regions, there are also arguments for a relative weaker performance of border regions: The ‘path dependency’ or ‘lock in’ argument, for instance, states that when borders have pertained for a long time, border regions may have suffered from a gradual process of marginalization that deprives them of the absorptive capacities and scale effects needed to benefit from economic integration more than their better endowed, agglomerated non-border counterparts (Floerkemeier et al., 2021; Petrakos & Topaloglou, 2008).

A significant growth premium in border regions would also be absent if trade costs were sufficiently low so that closer geographical proximity to new markets itself is not a decisive factor for reaping the benefits of open borders and economic integration as it is, for instance, predicted in Krugman (1991)-type core-periphery models. And finally, socio-economic development levels in countries on both sides of the integration border may differ in such a way that gains from economic integration in established (old) and new member states, i.e., effects stemming from widening and deepening EU integration, are unevenly distributed across border regions so that overall effects in border regions are difficult to measure. Thus, what seemed straightforward on a first glance, namely, to identify the treatment effects of economic integration for border regions along the integration border may, in fact, be quite complex and subject to structural, temporal, and spatial heterogeneities. In this paper, we take this ‘complexity’ perspective as starting point for an in-depth study of the integration effects associated with four consecutive enlargement waves of the European Union between 1986 and 2007. Essential research questions are:

  1. 1.

    Do we find evidence for common integration effects of EU enlargement for internal border regions in terms of key socio-economic outcomes such as GDP, employment, and population growth?

  2. 2.

    What types of structural, temporal, and spatial heterogeneity determine the direction, magnitude, and duration of integration effects across enlargement waves and sub-groups of border regions considered?

Providing answers to both research questions shall help policy makers to better assess the benefits and costs associated with EU economic integration in the context of regional commonalities but also heterogeneities.

When it comes to the selection of EU internal border regions as treatment group in our empirical investigation, we argue that this focus is well deserved out of significance and relevance considerations. First, as stated by the EU Commission (2017), border regions account for approximately one third of EU population and a similar aggregate production share in the EU.Footnote 1 At the same time, border regions, on average, have a weaker economic performance, lower levels of labor market integration and public service provision compared to non-border regions in the EU Commission (2017), which is why EU regional policy supports the socio-economic development in border regions through different funding programmes (most notably through Interreg project funding and the b-solutions initiative, see EU Commission (2021).

Second, beyond their status as being a specific (disadvantaged) regional group within the wider internal economic geography of the EU, the EU Commission sees border regions as important “living labs of European integration” (EU Commission, 2021). The idea of living labs is that they allow to study integration effects in border regions under the magnifying glass and that findings obtained here provide important general insights on the overall progress of EU integration and cooperation at large. Extending this logic, we argue that EU internal border regions are particularly well-suited to investigate the economic returns to EU economic integration at the regional level as they have been particularly exposed to associated shifts in the EU’s internal economic geography, while the enlargement process itself can be seen as an exogenous source of variation to their development path.

The exogeneity of the enlargement ‘shock’ at the small-scale regional level can be motivated by the fact that political decisions for EU enlargement were made at the national and supra-national level with goals not specifically tailored to the needs and economic conditions of border regions. The same logic applies to the allocation of the bulk of EU regional funding volumes, which focus on the regions’ development status irrespective of their geographical location within a country (Breidenbach et al., 2019), so that EU regional funding alone cannot explain a potential growth premium associated with economic integration in border regions. We accordingly argue that the focus on border regions enables us to study the effects of EU integration in a quasi-experimental manner.

Prior empirical evidence on a potential growth and development premium associated with EU accession and the economic freedoms associated with the European single market has remained inconclusive. While, e.g., Campos et al. (2019) report significant positive income growth effects of EU membership at the country level (with few exceptions), Andersen et al. (2019) generally do not find evidence for an EU membership growth premium. With respect to the focus of this study, there is also a knowledge gap on how the potential gains from economic integration are distributed across the different regions within integrating countries (Niebuhr & Stiller, 2004, Braakmann and Vogel, 2010, and Heider, 2019). It is generally supposed that regional ability to reap welfare gains from EU integration chiefly depend on a region’s relative competitiveness driven by industry composition and settlement structure, its institutional setup, trade intensity as well as size and geographical proximity to the enlargement border (e.g., Brakman et al., 2012; Brülhart et al., 2012, 2018; McCallum, 1995).

While earlier studies have mainly focused on GDP growth as sole outcome variable, a novelty of our analysis is that we conduct a comprehensive empirical analysis of the complex border regional effects associated with the EU enlargement process during the 1980s, 1990s and 2000s. Specifically, we employ a broad set of outcome variables covering region-specific time patterns of per capita GDP, (sectoral) labor productivity, research and development (R&D) and innovation activity, employment, population development, night light emissions. As there is no information on the stock of public (and also private) infrastructure at the level of NUTS3 regions (the observational unit used in this paper), night light data fill an important gap. An increase in night light emissions reflects changes of the public infrastructure (such as streets or public buildings) but also changes of private activities such as housing stocks or firm density/activity.Footnote 2 It can thus be regarded as a general measure for agglomeration trends and has previously been used to map the development of population and firm density across regions (e.g., Mellander et al., 2015). Also, prior empirical analyses have used night light data to measure processes of economic integration, growth, and convergence, especially when other economic data are missing (see, e.g., Henderson et al., 2012; Galimberti, 2020). But even for geographical areas with fairly good data provision, Mellander et al. (2015) as well as Lessmann and Seidel (2017) have shown that night light data still delivers important insights on economic development trends and differences.

Four EU enlargement periods are covered in our analysis: First, the EU accession of Spain and Portugal in 1986 (third enlargement wave); second, the EU membership of Austria, Sweden, and Finland in 1995 (fourth enlargement wave); third, the so-far largest EU enlargement of mostly central and eastern European countries in 2004 (fifth enlargement wave); and fourth, the accession of Romania and Bulgaria to the EU in 2007 (sixth enlargement wave). To identify treatment effects of EU integration in border regions vis-à-vis non-border regions during these enlargement periods, we conduct an analysis for the 1289 NUTS3 regions of the EU-27 (including the UK but not Croatia) over the time period 1981–2014. We apply static and dynamic difference-in-difference (DiD) estimation to identify pooled and group-specific effects. The DiD approach has shown a high degree of flexibility and robustness when previously been applied to spatio-temporal analyses of border regional growth effects such as for the division and reunification of Germany (Redding & Sturm, 2008) and economic transformations after the fall of the iron curtain (Brülhart et al., 2012, 2018) among other applications.

In the estimations we particularly account for the fact that the distribution of integration effects may be fuzzy with regard to spatial and temporal aspects. As such, we explicitly control for the circumstance that EU enlargement cannot be treated as precisely timed event. For example, in 2004, EU accession followed a process covering early agreements between old and new EU member states initiated in the aftermath of the collapse of the Soviet system (Dangerfield, 2006). This potentially results in ‘early anticipation’ effects that weaken the power of static DiD estimations, which rely on a precise classification of a single pre- and post-treatment period.Footnote 3 To account for these methodical challenges, we apply a flexible DiD approach that estimates time-heterogeneous coefficients for the different stages around the timing of EU enlargement. In addition, we account other confounding factors, which may either affect all EU countries equally, such as the deepening of economic integration through the EU single market in 1992, or is confined to individual countries and country groups, such as the introduction of the Euro currency in 1999, by adding a multidimensional ‘fixed effects’ structure to our DiD specifications. We also run several robustness tests to see if the obtained results hold to variations in the data and regression specification.

The remainder of this paper is organized as follows: Sect. 2 outlines the underlying theory related to border regional growth effects of economic integration. This section also summarizes prior empirical findings for the economic effects of EU enlargement and identifies research gaps in the literature. Section 3 describes our empirical study design, which is followed by a description of the data and variables used in Sect. 4. Section 5 reports our empirical results for pooled and heterogeneous treatment effects of EU economic integration together with a series of robustness tests. Finally, Sect. 6 discusses policy implications and concludes the paper.

2 Border regional effects of economic integration: theory, evidence and gaps

2.1 A complexity perspective of economic integration

Models of regional growth, international trade, and economic geography stress the role of trade related to market size, market access and transport cost for regional development (e.g., Krugman & Venables, 1990; Percoco, 2015). It can be conjectured that border regions gain from EU enlargement due to their unique geographic location and the associated improvement of market access. These effects may, however, be partly or fully offset by sustaining border impediments, lacking absorptive capacities and this insufficient scale economies in border regions, which may lock regions in a peripheral position (Capello et al., 2018a; Petrakos & Topaloglou, 2008).

Our conceptual approach, which takes these opposing factors into account starts with a fairly general specification of a regional production function defined as \({\text{Y}}=A({K}^{\alpha }{L}^{\beta }{\mathbf{N}}^{\mathrm{\varphi }})\), where \(Y\) is a measure of regional output (typically GDP or GVA), A is technology, K is capital, L denotes labor input and \(\mathbf{N}\) is a vector of further inputs; α, \(\beta\) and \(\mathrm{\varphi }\) are the respective output elasticities. If we write this regional production function as growth specification in intensive form, we get

$$\Delta y_{it} \, = \,\Delta A_{it} \, + \,\alpha \Delta k_{it}\, + \user2{ }\mathop \sum \limits_{r = 1}^{R} \varphi_{r} \Delta n_{l,it}$$
(1)

where \({\Delta y}_{it}\) is as measure for per worker (or per capita) output growth for region i at time t,\(y=Y/L\), \(k=K/L\) and similar for the remaining inputs (\({n}_{r}={N}_{r}/L\)). In an earlier analysis with national data for the EU-15, Badinger (2005) has focused on two potential channels how economic integration affects \(\Delta y_{it}\) as: i) a technology channel \(\left( {\Delta A_{it} \, = \,\gamma_{A0} \, + \, \gamma_{A1} \Delta INT_{it} } \right)\) and ii) a physical investment channel \(\left( {\Delta k_{it} \, = \,\gamma_{k0} \, + \, \gamma_{k1} \Delta INT_{it} } \right)\) with \(\Delta\) INT being an indicator for changes in the level of integration at time t; \({\gamma }_{A0}\) and \({\gamma }_{k0}\) are exogenous components of technological progress and capital formation, respectively. This logic can be straightforwardly extended to the integration effects of other inputs such as for input r as \(\left( {\Delta n_{r,it} \, = \,{\upgamma }_{r,n0} \, + \, {\upgamma }_{r,n1} \Delta INT_{it} } \right)\) and we can measure the relative performance of border regions for these inputs separately.

Alternatively, the input channels can be aggregated to an overall effect of economic integration on per capita income growth as

$$\Delta y_{it} = \delta_{0} + \delta_{1} \Delta INT_{it}^{{}}$$
(2)

with \({\delta }_{0}=\left({\gamma }_{A0}+\alpha {\gamma }_{k0}+{\sum }_{r=1}^{r}{\varphi \gamma }_{r,n0}\right)\) and \({\delta }_{1}=\left({\gamma }_{A1}+\alpha {\gamma }_{k1}+{\sum }_{r=1}^{R}{\varphi \gamma }_{r,n1}\right)\).

Given our focus on border regions, Eq. (2) can be extended by incorporating a spatial component into the analysis of growth effects from economic integration as

$$\Delta y_{it} \, = \,\delta_{0} \, + \,\left( {\rho_{1} \, + \,\rho_{2} \left( {\frac{1}{{DIST_{i}^{\theta } }}} \right)} \right)\Delta INT_{it}$$
(3)

where \(\left(\frac{1}{{DIST}_{i}^{\theta }}\right)\) measures proximity for each region i to the newly integrated unit (with \({DIST}_{i}\) being some distance measure to the integration border or a specific point of interest across the border). Equation (3) thus splits the growth effects of integration \({\delta }_{1}\) into a general non-spatial component \({\rho }_{1}\) and a growth premium for regions with closer proximity to the border (\({\rho }_{2}\)) with \({\delta }_{1}={\rho }_{1}+{\rho }_{2}\). While distance/proximity can be measured in different dimensions (Boschma, 2005), we refine to geographical distance as a catch-all term for other forms such as cultural, social, and historical proximity. This extension reflects that benefits from economic integration do not affect each region equally but predicts that regions closer to the integrated market receive larger benefits as typically found in gravity-type models of inter-regional trade such as in McCallum (1995).Footnote 4 The parameter θ shown in Eq. (3) expresses the power of distance decay. For instance, for sufficiently high values of θ, we expect to only observe a spatial growth premium for regions directly adjacent to the enlargement border. Ways to empirically proxy the spatial proximity to the enlargement border will be presented below.

Fig. 1
figure 1

Estimation setup for identification of integration effects from EU enlargement. Authors’ figure

The role of distance decay as a factor determining trade cost and eventually output effects from economic integration is also stressed in models of the New Economic Geography (NEG). Krugman and Venables (1990), for instance, show for an NEG model application to the EU single market in 1992 that with reduced transport costs more firms may find it attractive to relocate to the periphery as a way take advantage of factor price differentials between countries. Other NEG models similarly predict that regions with a lower distance and thus transport cost to international markets reap the largest benefits from economic integration (Brülhart et al., 2004; Crozet & Koenig, 2004).

Behrens et al. (2007) and Monfort and Nicolini (2000) show in NEG model settings that a country’s internal economic geography constitutes a significant conditioning factor for the regional economic effects of international economic integration. For instance, Rauch (1991) presents a model in which costal border regions are the main trade hub of a country. In this case, border regions can particularly benefit from trade integration. Overman and Winters (2006), study the impact of UK accession to the larger European market and find evidence for this setup indicating that coastal (border) regions hosting a port with better market access for exports and intermediate inputs experience higher employment compared to other similar regions.

If border regions suffer from locational disadvantages, model predictions may differ, though. Without scale effects emanating from locational advantages, consumers typically have to pay higher prices and firms can only supply goods to the market at higher cost when being located in a border region (Niebuhr & Stiller, 2004). Increased proximity to foreign markets of integrating countries then only allows border regions to grow faster than non-border regions if they possess specific territorial assets (Capello et al., 2018a). If such assets are missing, there is the risk of a ‘tunnel effect’, i.e., a bypassing of border regions after integration, which could further marginalize border regions if trade patterns after EU enlargement are dominated by central core regions (Petrakos & Topaloglou, 2008). In this case, \({\rho }_{2}\) can be expected to be zero or even negative.

2.2 Prior empirical evidence and remaining research gaps

Several empirical contributions have been concerned with the identification of growth effects of economic integration – predominately at the national level (e.g., Andersen et al., 2019; Badinger, 2005; Campos et al., 2019; Henrekson et al., 1997). Bridging the gap between the available national and scarce regional-level evidence, Monastiriotis et al. (2017) analyze the spatial effects of EU integration for Central and Eastern European (CEE) regions. Using an event-study approach, the authors find that the process of EU accession has particularly strengthened agglomeration forces in CEE countries favoring regions with a high market potential, industry concentration and regional specialization in increasing returns sectors.Footnote 5

Brülhart et al. (2012) and Brülhart et al. (2018) analyze the wage and employment effects of trade liberalization caused by the fall of the iron curtain for Austrian border towns. Their empirical results indicate that improved access to Eastern markets has a positive impact on employment and nominal wages in these regions vis-à-vis the rest of the country. The results in Brülhart et al. (2018) additionally suggest that larger cities benefit more strongly from the border shock in terms of wages, whereas smaller cities experience larger employment effects with a peak for towns with a population of around 150,000. Taken together, their evidence suggests that residents of medium-sized towns gain the most from a given opening of cross-border trade.

Brakman et al. (2012) focus on the population effects of EU integration in EU border regions. Analyzing data for 1457 regions and 2410 cities since 1973, the authors find evidence for positive population growth effects in border regions vis-à-vis non-border regions. This effect is significant at the regional and urban level within a 70 km radius from national borders. It holds for both sides of the integration border. Relatedly, Heider (2019) focusses on the population growth effects of German and Polish border town in the course of the EU enlargement in 2004. The author finds evidence for positive population growth effects for German but not for Polish border towns.

While the majority of studies thus reports positive population and economic effects of trade liberalization and economic integration in border regions of the EU (particularly in the EU15), there is also empirical evidence for insignificant or negative effects as, for instance, reported in Braakmann and Vogel (2011) or Marin (2011). Using data for firms located in East Germany close to Germany’s eastern border, Braakmann and Vogel (2011) find no short-run employment effects of the EU enlargement in 2004 except for firms active in wholesale and retail trade, hotels, and restaurants. Negative wage effects are found for skilled workers in consulting, research, and related activities. This points to sector-specific effects in border regions subject to EU enlargement. Studying employment growth from the perspective of firms in Central and Eastern European Countries (CEECs), Serwicka et al. (2022) find a significant increase in foreign investment and employment growth after the 2004 EU enlargement.

While the prior literature has started to shed light on regional effects of EU integration for selected outcomes, mainly income levels and individual enlargement waves, a comprehensive analysis of the complex spatial effects of EU integration over the last decades is still missing. This is what we are aiming for in the following.

3 Estimation setup

We use flexible DiD estimation and apply different specifications to robustly identify potential common (pooled) effects of EU enlargement together with structural heterogeneities related to i) differences across enlargement waves, ii) the timing of expected effects from EU enlargement (pooled and for individual waves) and iii) the spatial extent of effects. An overview of our estimation setup is outlined in Fig. 1. Our main treatment group are direct border regions defined as regions adjected to territorial border lines for one of the four different enlargement waves considered (1986, 1995, 2004 and 2007). To estimate the role of spatial spreading effects of EU enlargement, we then extend the treatment group to include indirect border regions defined as higher-order neighbors of direct border regions adjacent to the enlargement borders.

As shown in Fig. 1, we start estimating a panel specification that measures common average treatment effects across all enlargement waves and direct border regions in a pooled manner. In the light of institutional differences across regions and enlargement waves considered, we additionally run several regressions that test for effect differences between the individual enlargement waves to account for institutional differences associated with the enlargement and to distinguish between border regions in established and new member states and associated effects from widening and deepening integration, respectively. This type of heterogeneity is classified as structural heterogeneity in Fig. 1.

Similarly, the path towards integration and, according, the timing of integration effects before and after EU enlargement, may differ across enlargement waves. We classify this as temporal heterogeneity shown in Fig. 1. We capture temporal heterogeneity in treatment effects by switching from static DiD estimation, which identifies treatment effects on the basis of a strict definition of a pre- and a post-treatment period around the formal accession of new EU member countries, to dynamic DiD regressions. This allows us to account for leads and lags in the transmission process from EU enlargement to regional economic effects together with a staggered treatment start (in our case, the four different enlargement waves covered). The dynamic DiD approach is also referred to as ‘panel event study’ (see, e.g., Schmidheiny & Siegloch, 2019; Callaway & Sant’Anna, 2021; Goodman-Bacon, 2021; Sun & Abraham, 2021). As outlined in Borusyak & Javarel (2020) dynamic DiD estimates may particularly be helpful to overcome a potential estimation bias in the static baseline approach that arises if treatment effects have a significant temporal pattern. As shown in Fig. 1, we further account for spatial heterogeneity in effects and combinations of heterogeneity dimensions.

A common challenge to the different specifications shown in Fig. 1 relates to the potential problem of treatment endogeneity as the event of EU accession cannot be seen as a source of exogenous variation to the national economic performance, especially for new member states. The two-way link between national development and EU accession mainly stems from the fact that a good economic performance partly reflects a successful transition policy and the adoption of certain institutions linked to democratic governance and a functioning market economy, which in turn are a prerequisite for signing accession agreements. Here, we follow the argumentation in Brakman et al. (2012) referring to the fact that EU enlargement did not primarily target the economic development in border regions and, hence, that this macroeconomic enlargement ‘shock’ can still be seen as an exogenous source of variation for border regions.

To comprehensively capture the integration effects of EU enlargement, we use different outcomes such as the growth rate of GDP per capita, labor productivity, R&D and innovation (proxied through the number of patent applications per local population), employment and population growth, as well as night light emissions. These different measures shall capture the potential transmissions channels described in Sect. 2. In terms of model specification, we adapt a standard approach of regional growth models extended by the integration effect shown in Eq. (3). Formally, our baseline static DiD approach, which pools treatment effects across EU enlargement waves and region types, takes the following form

$$\begin{gathered} { }outcome_{it} \, = \,\beta^{\prime}{\mathbf{x}}_{it} \, + \,\delta \underbrace {{\left( {d_{i}^{border} \, \times \,d_{t}^{EU} } \right)}}_{{\left[ {\left( {\frac{1}{{DIST_{i}^{\theta } }}} \right){ }\, \times \,{\Delta }INT_{it} } \right]}}\, + \,\mu_{i} \, + \,\left( {\xi_{c\left( i \right)} \, \times \, \lambda_{t} } \right)\, + \,\varepsilon_{it} , \hfill \\ \hfill \\ \end{gathered}$$
(4)

where \({outcome}_{it}\) denotes the (log-transformed) outcome level in region i at time t, which is specified as a function of (log-transformed) regional covariates \({(\mathbf{x}}_{it})\), region fixed effects \(\left({\mu }_{i}\right)\) and country-year fixed effects \(\left({\xi }_{c(i)} \times {\lambda }_{t}\right)\), where \({\xi }_{c(i)}\) denote country fixed effects, with regions grouped to countries, and \({\lambda }_{t}\) are time fixed effects. This multidimensional ‘fixed effects’ structure shall account for common and country-specific time trends such the creation of the Single market in 1992, the introduction of the Euro currency in 1999, national business cycle movements or national policy interventions (Ahrend et al., 2017). In addition, we lean on the empirical identification approach used in Monastiriotis et al. (2017) and control for observable regional time-varying characteristics \({\mathbf{x}}_{it}\) that are assumed to influence regional economic growth besides the pure enlargement effect. This likely increases the homogeneity of border and non-border regions in the light of structural differences across countries and thus works in favor of the parallel trend assumption of DiD models (Lechner, 2011).Footnote 6

Our main regression parameter of interest is \(\delta\). It measures the relative border regional outcome effect of EU enlargement for the included DiD term \(\left({d}_{i}^{border}\times {d}_{t}^{EU}\right)\), which is constructed as interaction term of a treatment group dummy for border regions \({(d}_{i}^{border})\) and a time dummy that measures the associated timing of EU enlargement (\({d}_{t}^{EU})\). Specifically, the interaction term takes a value of one from 1986 onwards for internal border regions affected by the second EU enlargement wave (and similarly for 1995, 2004 and 2007, respectively) and is zero before that year. If \(\delta\) is found to be statistically significant and positive, this indicates that border regions along the internal territorial border between established and new member states have grown faster than other EU regions in the post-enlargement period. As shown in Eq. (4), the focus on border regions along the integration border as default treatment group (measured as those NUTS3 regions with territorial overlap to the integration border) implicitly uses a very high distance decay parameter θ as baseline specification.

While \(\delta\) in Eq. (4) measures the integration effects of EU enlargement in border regions in a pooled fashion, results may be biased if underlying structural heterogeneity prevails. To capture heterogeneity across enlargement waves and region types, we can decompose the treatment dummies from Eq. (4) as

$${ }outcome_{it} \, = \,\beta^{\prime}{\mathbf{x}}_{it} \, + \,\mathop \sum \limits_{w = 1}^{4} \mathop \sum \limits_{r = 1}^{2} \delta_{w,r} \left( {d_{i}^{~\mathit{ {{\text{border}}\_{\text{type}}}}~ \left(r \right)} \, \times \,d_{t}^{~\mathit{ {{\text{EU}}\_{\text{wave}}}}~ \left( w \right)} } \right)\, + \, \mu_{i} \, + \,\left( {\xi_{c\left( i \right)} \, \times \, \lambda_{t} } \right)\, + \,\varepsilon_{it}$$
(5)

which estimates individual treatment effects \({\delta }_{w,r}\) for each of the w = 1,…,4 EU enlargement waves and r = 1,2 border region types as being located in established or new EU member states.

As outlined in Fig. 1, we also consider two other sources of parameter heterogeneity: The first one relates to the dynamic nature of EU integration effects, and we move from static DiD estimations to the estimation of time-specific treatment effects in line with the literature on dynamic DiD or panel event studies. The specification of a flexible dynamic DiD estimator has the advantage that it accounts for potential lead and lag structures in the distribution of the economic integration effect on regional growth processes over time. The underlying assumption is that economic integration effects captured by the coefficient of the DiD term (\(\delta )\) in Eq. (4) are not uniformly distributed over time. We capture temporal lead and lag effects as

$${ }outcome_{it} \, = \,\beta^{\prime}{\mathbf{x}}_{it} \, + \,\mathop \sum \limits_{s = - N}^{M} \delta_{s} \left( {d_{i}^{border} \, \times \,d_{t - s}^{EU} } \right)\, + \, \mu_{i} \, + \,\left( {\xi_{c\left( i \right)} \, \times \, \lambda_{t} } \right)\, + \,\varepsilon_{it}$$
(6)

where \({d}_{t-s}^{EU}\) is an indicator for event time s (time-since-event), meaning that the event date \({E}_{i}^{w}\), i.e., wave w of EU enlargement, took place s periods before period t measured in absolute calendar years, that is, s = t − \({E}_{i}^{w}.\) For example, s = 2 would mean that outcome in t is measured two years after EU enlargement. A separate term is included for each event time with \(s\in \{-N,\dots ,0,\dots ,M\}\) defining the maximum number of leads (\(-\)N) prior to treatment and lags (M) after treatment considered. The difference in data organization between the static and dynamic DiD estimation is summarized in Fig. 2.

Fig. 2
figure 2

Identification of treatment effects in static and dynamic DiD estimation. Authors’ figure

Importantly, the dynamic DiD specification measure effects relative to treatment start \({E}_{i}^{w}\) for each enlargement wave. The advantage of considering temporal heterogeneity is that all treated border regions receive an equal sample weight for the estimation of treatment effects even if they some regions are treated later than others (in absolute time). In addition, quantifying annual treatment effects around the timing of EU enlargement allows to test for the presence of Ashenfelter’s (1978) dip, i.e., early anticipation effects. Borusyak & Javarel (2020) provide a discussion of consistency problems associated with static approaches when time dynamics in treatment effects is present. Beyond pooled dynamic estimation, we also combine estimates accounting for structural and temporal heterogeneity as shown in Fig. 1.

We also account for the fact that economic integration may not only have an impact on direct border regions but also the broader geographical neighborhood of the enlargement border. Clarke (2017) has recently pointed out that the stable unit treatment value assumption (SUTVA) underlying DiD estimation may be too strong when dealing with regional data to estimate treatment effects. The reason is that territorial borders are porous and may give rise to spatial spillovers. In order to estimate unbiased treatment effects in the presence of spatial spreading effects, Clarke (2017) proposes the use of a weaker condition than SUTVA, which relies on the assumption that there exists at least some subset of units which are not affected by the treatment status of others. As it can be assumed that those economic actors living in regions close to treated (border) regions are able to either partially or fully access treatment, the subset of regions unaffected by the treatment can be determined by their (geographic, economic etc.) distance to treated units.

We can capture this spatial heterogeneity mechanism by including an extended set of treatment group dummies \(\left({d}_{i}^{k}\right)\), where the index k = g,…,G indicates the total number of treatment group dummies included in the empirical specification. Each of the K treatment groups thereby represents a slice of space, for instance, defined by a specific maximum geographical distance g relative to the enlargement border. This process can be seen as testing for incremental changes in economic integration effects over space, where the hypothesis from standard inter-country and inter-regional trade models is that a potential growth effect of economic integration decreases with further distance to the border (e.g., Eaton & Kortum, 2002). While the spatial disaggregation of the treatment group can be used for static estimations as in Eq. (5), we also combine spatial and temporal heterogeneity to identify incremental growth effects of EU integration over space and time as

$$outcome_{it} \, = \,\beta^{\prime}{\mathbf{x}}_{it} \, + \,\mathop \sum \limits_{s = - N}^{M} \mathop \sum \limits_{k = g}^{G} \delta_{s,k} \left( {d_{i}^{k} \, \times \,d_{t - s}^{{}} } \right)\, + \, \mu_{i} \, + \,\left( {\xi_{c\left( i \right)} \, \times \, \lambda_{t} } \right)\, + \,\varepsilon_{it}$$
(7)

Equation (7) shows a general (space-time) flexible DiD estimator that allows us to conduct a grid search for significant coefficients over slices of time and space to provide a comprehensive assessment of the border regional growth effects of EU enlargement.

4 Data and variables

We use 1289 NUTS3 regions (based on the NUTS2010 classification) for the EU-27 (including the UK but without Croatia) as units of observation and set the estimation period to 1981–2014. These data settings allow us to work on a finely granulated spatial level with a sufficiently long observation period to include a common set of leads and lags for all covered EU enlargement waves (except for night light emissions, which are only available from 1992 onwards). The data set is unbalanced since observations for East German regions and Central and Eastern European regions are only recorded from 1991 onwards. However, this does not affect the maximum number of lag- and lead-terms used for the identification of treatment effects in the dynamic DiD specifications since those regions are only subject to later, i.e., the 2004 and 2007, EU enlargement waves.

We apply a comprehensive testing approach for different outcome variables to capture the potentially complex effects of EU integration on internal border regions. The set of outcome variables includes:Footnote 7

$$\begin{aligned} outcome_{{it}} & = \left\{ {{\text{GDP per capita}},{\text{ }}\left( {{\text{sectoral}}} \right){\text{ labor productivity}},{\text{ patents per capita}},} \right. \\ & \left( {{\text{sectoral}}} \right){\text{ employment rates}},{\text{ regional population levels}},{\text{ night}} \\ & \left. {{\text{light emissions}} } \right\}. \\ \end{aligned}$$

In all cases, we test for differences in (log-transformed) levels in these variables between the respective treatment and comparison groups. To give an example, in the case of per capita GDP we test for static and dynamic treatment effects by comparing per capita GDP levels across groups before and after EU enlargement. If we find significant differences in GDP levels, these can be interpreted as temporary short- to mid-run growth effects in the light of our fixed effects specifications and the theoretical arguments outlined in Sect. 2. We also provide sector-specific evidence for the development of productivity and the employment rate. The ERD data allow us to analyze effects separately for Agriculture (NACE Code: A), construction (C), industry (excl. construction (B-E), wholesale, retail, transport, accommodation & food services, information, and communication (G-J), financial & business services (K-N) and non-market services (O-U). Variable definitions and summary statistics are given in Table 1.

Table 1 Definitions and summary statistics for variables used in the empirical analysis for 1289 NUTS3 periods during 1981–2014

Geographical information on the EU’s internal territorial borders is extracted from a shapefile on administrative units in the EU obtained from Eurostat. Direct border regions for the enlargement waves 1986, 1995, 2004 and 2007 are defined as those regions whose administrative boundaries intersect with a corresponding NUTS3 region from a new member state (and vice versa).Footnote 8,Footnote 9 By this definition, the enlargement in 2004 marks the biggest enlargement wave with 4.6% of all observed regions defined as direct border regions (Table 1), which corresponds to 2.9% of the population in the EU-27 in 2004. The 1995 enlargement wave covers 2.6% of the regions and 1.6% of the EU-27 population with the treatment groups of direct border regions being significantly smaller in 1986 (0.8%) and 2007 (0.7%).

Given the temporal distribution of EU enlargement events throughout our sample period 1981–2014, we can estimate dynamic treatment effects for a maximum of five years prior to and seven years after the institutional changes for all four EU accession waves (except for night light emissions, which is only available from 1992 onwards).Footnote 10 While it would be preferable to extend the data to periods beyond 2014 and also include Croatia’s EU accession, there are also reasons to restrict the sample to 2014. Particularly, the EU migration crisis of 2015 and 2016 with substantial migration flows to border regions of several EU countries may bias at least all outcomes on “per capita” levels.

To measure the degree of spatial heterogeneity and neighborhood effects, we define indirect border regions based on their geographical distance from the border. To do so, we calculate for all regions not classified as direct border regions the geographical distance from the region’s centroid to the closest location at the border. Using 50 km threshold distances g with k = {100 km, 150 km, …, 300 km}, we then build additional treatment group dummies for regions within these 50 km distance belts from the border and test for spatially distributed integration effects (with k = 0 km being direct border regions along the integration border).Footnote 11 A graphical overview of direct and indirect border regions for our sample of 1289 NUTS3 regions for all four EU enlargement waves is given in Fig. 3.

Fig. 3
figure 3

Direct and indirect border regions for EU enlargement 1986, 1995, 2004 and 2007. Information on the territorial borders of EU-27 (including UK, without Croatia) NUTS3 regions has been obtained from the GISCO statistical unit dataset available at: https://ec.europa.eu/eurostat/web/gisco/geodata/reference-data/administrative-units-statistical-units/nuts. Maps for border regions by EU enlargement wave and region type (established vs new) are provided in Figure A2 in the appendix

5 Empirical results

5.1 Baseline estimates

Table 2 reports the estimation output for our pooled static DiD specification according to Eq. (4). Accounting for the multidimensional ‘fixed effects’ structure including country-specific time trends as most general specification to account for latent time-varying confounding factors, three significant findings emerge: First, border regions relatively increase their labor productivity relative to non-border regions after treatment (Panel A) and have higher levels of night light emissions (Panel E) as a general measure for agglomeration effects (Mellander et al., 2015). While the specifications shown in columns (I) and (II) thereby use the average development of non-border regions in the sample as benchmark, the inclusion of country-specific time trends tightens the benchmark to non-border regions in the respective country of border regions considered. Effect size points to a roughly 4–5% relative increase in the levels of labor productivity and night light emissions. In terms of labor productivity, this corresponds to an approx. increase of 1500–2000 Euro per worker increase evaluated at the sample average for labor productivity of 40,800 Euro per worker in non-treated regions. Since night light emissions levels are measured on a relative scale between 0 and 63, the above reported percentage increase is difficult to interpret. However, we can illustrate its magnitude with the help of an example. Evaluated against the sample average of night light emissions in non-treated regions of about 20, a 4–5% increase means an additional night light emission level of 0.8–1. The latter corresponds to an accumulation of night light intensities for growing metropolitan regions such as Madrid and Hamburg of about 5–7 years. Panel C also reports an increase in regional patent applications per capita following EU enlargement, which points to the working of the technology channel \(\Delta {A}_{it}\) of economic integration as outlined in Eq. (1). However, the development of the employment rate falls behind the overall EU-trend during the sample period by around 3%-points (for an average employment rate of approx. 43% in our data sample). No significant effects are observed for regional population development.

Table 2 Baseline treatment effects of EU enlargement for direct border regions

5.2 Structural heterogeneity

Differences in treatment effects may be driven by structural heterogeneity across the group of border regions and underlying compositional effects associated with region-sector combinations, which may not be fully captured by our set of regional controls (sectoral employment shares). To gain a deeper understanding of the underlying mechanisms at play, we disaggregate effects by enlargement waves (Panel A of Table 3) and country groups, that is, we distinguish between effects for border regions in old (established) and new member states for each EU enlargement wave (Panel B of Table 3) as outlined in Fig. 1. Especially the enlargement waves in 2004 and 2007 saw larger structural differences between established EU member countries and CEECs in their transition from planned to market economies after the fall of the iron curtain. This meant that per capita income levels, labor productivity and labor market parameters were significantly different in established EU member countries and newly joining CEECs in 2004 and 2007.

Table 3 Treatment effects for different EU enlargement waves and country groups

As Panel A shows, the estimated positive treatment effects for labor productivity and night light emissions are mainly driven by the 1995 and 2004 enlargement waves. Negative effects on the employment rate are similarly found for the 1995 and 2004 enlargement waves in particular. The latter also show a decline in population development and regional innovativeness in border regions. A positive development in terms of regional innovativeness measured through patents per capita is found for the 1986 enlargement wave. Panel B of Table 3 further indicates that estimated treatment effects not only differ across enlargement waves but are also determined by the regions’ development level and, hence, the region’s absorptive capacity at the timing of integration. Positive productivity effects of the 1995 and 2004 enlargement wave are captured by border regions in established EU countries. For these regions the 1995 enlargement also induced general agglomeration effects measured in terms of a positive population development and increases in night light emissions relative to non-border regions.

This effect is less significant for border regions in established EU member countries during the fifth enlargement wave in 2004. Relative population levels are observed to decline in the process of EU integration. On the other hand, border regions in new EU member states grow in terms of innovativeness (1986) and general agglomeration effects (night light emissions) in 2004 together with a strong increase in the employment rate by approximately 17% in 2007. The latter effect is likely driven by persistent wage differences between established EU countries (Greece) and the new member states Bulgaria and Romania who joint in 2007. Only during the 1986 EU enlargement do border regions in new member countries see a relative productivity increase and a improvements in innovativeness.

5.3 Temporal heterogeneity

Static treatment regressions may be biased if estimated effects show significant patterns of early anticipation or gradual phasing-in. Figure 4 therefore plots the results of a flexible DiD approach, which estimates yearly treatment effects relative to the timing of EU enlargement. The pre-enlargement year (t − 1) is used a reference year against which pre- and post-enlargement effects are evaluated. The results largely confirm the static treatment effect estimates in terms of positive and significant effects for labor productivity and night light emissions. In addition, Panel A of Fig. 4 also reports a positive and significant relative GDP per capita development in EU internal border vis-à-vis non-border regions. Maximum effect size for the 7-year lag period considered is an GDP per capita increase of about 2% (compared to 2.6% in the baseline static estimation approach). Annual treatment effects for labor productivity levels are found to range between 2 and 4% during the first seven years after EU enlargement. The temporal distribution of GDP and productivity effects point at a levelling out of additional growth effects after approximately 5–7 years, which supports the view of a medium-term growth bonus associated with EU integration (Baldwin & Wyplosz, 2015; in’t Veld, 2019).

Fig. 4
figure 4

Dynamic treatment effects of EU enlargement on border regions. Diamonds show point estimates for annual treatment effects of EU enlargement in border regions together with 95% confidence intervals (vertical lines; based on robust standard errors clustered at the regional level). The vertical dashed line indicates the pre-enlargement year (t − 1) used a reference year against which pre- and post-enlargement effects are evaluated. Underlying flexible DiD estimates include region FE, year FE, country-year FE and regional controls. For further details see main text. Sample period 1981–2014; 1289 NUTS3 regions

Annual treatment effects prior to EU enlargement are statistically insignificant and do not point to early anticipation effects associated with potential confounding factors around treatment start. Effects turn significant with a time lag of 4–5 years after the enlargement event. This indicates that positive economic effects from economic integration need to unfold until they are fully visible in the regional economy. Likely reasons for this gradual phasing-in process are that associated private and public investment effects typically only show up over time (Breidenbach et al., 2019; Eberle et al., 2019). The difference between the statistically significant static estimation results for the development of patents per capita and the insignificant annual dynamic effects in the first seven years after enlargement underline the role of gradual phasing in effects and the time needed to transform regional innovation systems in treated regions (Isaksen & Trippl, 2016).

But not only did technology transformation and the adaption of production systems takes time, also labor market opening after EU enlargement followed a gradual pattern, particularly for the EU eastern enlargement waves in 2004 and 2007, determined, for instance, by the 2 + 3 + 2 regulation that restricted employment access in (some) incumbent EU member states by citizens of new EU member after an up to seven years transition period.Footnote 12 Also, the Schengen entry of new member countries followed EU accession with a temporal lag of about three years. Compared to the static baseline case, the flexible DiD estimates show negative, albeit marginally statistically insignificant effects of EU enlargement on the employment rate in internal border regions (Panel C, evaluated at 95% confidence intervals). Finally, the dynamic estimates confirm positive and statistically significant increases in night light activity as general agglomeration effect in border regions.

5.4 Spatial heterogeneity

Table 4 reports the results from baseline DiD regressions, which add indirect border regions based on their geographical distance (in 50 km slices) to the enlargement border to the default treatment group as visualized in Fig. 1. Given that very few indirect border regions have a distance of 50 km to the enlargement border (while to not belong to the group of direct border regions), those regions have been merged with the 100 km slice of indirect border regions.

Table 4 Treatment effects of EU enlargement by spatial distance to enlargement border

As the results show in Table 4 show, the inclusion of additional treatment dummies for indirect border regions does not alter the effect size found direct border regions (as reported in Table 2 and Fig. 4). In addition, we observe that indirect border regions experience treatment effects of the same direction but with diminishing size as the distance to the border increases. These spatial spreading effects are especially observed for GDP per capita, labor productivity, patents per capita and night light emissions. From 300 km onwards effects are absent except of the development in patents per capita. This observed decay in effects is in line with previous findings such as Brakman et al. (2012) and Brülhart et al. (2018). The results thus point to the theoretical argument that increased market access in the process of EU integration is a major development factor for EU internal border regions. The negative effect on the employment rate is limited to direct border regions.

5.5 Extensions

As robustness tests, we run regressions combining heterogeneity dimensions. Table 5 summarizes dynamic DiD estimations by enlargement wave and region types. Underlying individual regressions can be found in the appendix. Border regions in established EU member countries benefit mostly in terms of GDP per capita and labor productivity increases but experience a relative decline in employment rates. A common result from Table 5 is that border regions both across enlargement waves and country groups gain through increase in agglomeration economies (night light emissions) and, with the exception of the 2004 enlargement in terms of population. The latter finding is in line with a border population effect of EU integration identified in Brakman et al. (2012).

Table 5 Summary of dynamic treatment effects by country group and enlargement wave

Table 6 summarizes the main effects from dynamic DiD estimates for the spatial extent of integration effects. Effects enter as being positive or negative in the table if at least one yearly post-treatment effect is estimated to be statistically significant (evaluated at 95% confidence intervals). Brackets in Table 6 indicate that significant post-treatment effects are found but also that pre-treatment trends were present. The latter limit the validity of estimated treatment effects. Detailed visualizations of dynamic treatment effects by treatment group and outcome variable are given in the appendix.

Table 6 Summary of dynamic treatment effects by distance to internal enlargement border

The results summarized in Table 6 underline the presence of spatial spreading effects up to a maximum distance of 250 km to the integration border, which are mostly positive. One exception is the relative development of night light emissions point to negative spatial correlation between direct border regions and their immediate hinterlands [100 km]. The latter pattern may point to some relocation effects taking place with economic activity moving out of the hinterlands closer to the enlargement border. Further, different from the development in direct border regions, the dynamic DiD estimates report some evidence for positive employment rate effects in the hinterlands of border regions.

Finally, we check for sector-specific patterns associated with positive (or negative) integration effects in border regions. These effects are summarized in Table 7 for labor productivity (Panel A) and employment rates (Panel B); underlying estimation results can be found in the appendix. Regarding labor productivity, we can see a clear differences in effects between border regions in established and new EU members.Footnote 13 For the enlargement waves 1995 and 2004, for which we already observed an overall increase in labor productivity, we see that the aggregate effect is mainly driven by productivity growth in 1) industrial production (sector B-E; effect up to + 12%) and 2) financial and business services (sector K-N; effect between + 6 to + 9%). Given the large share of these two sectors in the gross value added of most European countries, these two particular sector-specific effects largely determine the effect of the overall economy. At the same time, for these sectors we observe overall negative treatment effects on productivity in border regions of new member state (particularly for the 2007 enlargement wave).

Table 7 Disaggregated treatment effects for labor productivity and employment rate by sectors

While negative productivity developments in regions of new member states may point to business relocations among the most productive firms, a closer look at sector-specific employment rate shows that the effect is also driven by employment increases in border regions of new member states after the enlargement. Again, this effect is particularly significant for the 2007 enlargement wave and for the industrial sector together with wholesale & retail activity, transport, accommodation & food services, information and communication (sector G-J; effect up to + 20%).

Border regions in new EU member states associated with EU enlargement 1995 see a shift in employment towards industrial production and construction and away from service sector employment. Similarly, service sector employment declines in the aftermath of the 2004 EU enlargement in new member states. In line with standard trade theories, this specialization pattern indicates that sectors sensitive to spatial wage differences and changes in transport costs such as industrial production and the construction sector increase employment in border regions of new EU member states. Competition on local labor market may thereby reduce service sector employment. Apart from these significant effects, the results in Table 7 show a less clear sectoral picture for the development of the employment rate in border regions after EU enlargement.

6 Discussion and conclusion

This paper has studied the local economic effects of four EU enlargement waves in 1986, 1995, 2004 and 2007 using a comprehensive empirical evaluation design. Our particular emphasis on border regions as treated units is, on the one hand, motivated by theoretical considerations indicating that border regions are particularly exposed to EU enlargement and can be expected to significantly respond to this exogenous macro-institutional change. On the other hand, border regions are closely monitored by EU policy makers associated with the specific challenges of border regions. Border regions typically experience lower development levels than non-border regions in the light of their remoteness related to limited market access, public service provision etc. (European Commission). Estimating the integration effects of EU enlargement in border regions is thus challenging due to the complexity of forces at work. Here, we have taken up this ‘complexity’ challenge in our empirical identification approach by using multiple outcome variables and by applying flexible difference-in-difference estimation for EU-27 NUTS3 regions over the period 1981–2014. This allowed us to comprehensively and to robustly identify the treatment effects of EU enlargement.

Several distinct effects emerge: Overall, we find evidence for positive productivity and agglomeration effects in border regions subject to EU enlargement. However, effects vary by enlargement wave and country groups considered. While increases in overall socio-economic activity measured in terms of light night emissions are estimated as an overarching positive development trend for EU internal border regions, productivity gains are mostly experienced by border regions in established member states. This is contrasted by increases in employment rates in border regions of new member states particularly after the 2004 and 2007 enlargements. Our results hold across different methodological setups such as static and the dynamic DiD estimation.

In new member states, positive employment effects cover different sectors, most notably agriculture and industrial production. EU enlargement is found to exhibit positive spatial spreading effects to the hinterlands for direct border regions and is estimated to gradually phase in over time. The temporal distribution of treatment effects is likely due to the gradual change of institutions after EU accession (particularly in 2004 and 2007), which temporarily protected labor markets in established EU member countries from wage competition through the labor force of EU accession countries. Similarly, border impediments such as passport-free border crossing associated with the Schengen area were fully implemented some years after EU accession of CEECs.

The complexity of estimated regional effects poses some challenges to the working of regional policy support schemes to boost growth and cohesion in EU border regions as a means to reduce the prevailing structural differences in border regions compared to non-border regions. Beside the specific support of firms in border regions to access larger markets and transnational networks (Schäffler et al., 2017) or the adoption of proper institutions (Pinkovskiy, 2017), our results suggest that ongoing integration and a consequent facilitation of cross-border trade and mobility should be supported to accelerate economic development in border regions. Existing literature (see, e.g., Bosker et al., 2010; Kashiha et al., 2017; Capello et al., 2018a) shows that national borders still have strong impacts on trade and economic prosperity within the European Union and that border regions may be particularly affected by exogenous shocks such as the recent COVID-19 pandemic limiting international economic exchange (Capello et al., 2022). This leaves space for future integration efforts targeting the support to economic cooperation and development in border regions.