1 Introduction

The world runs a current account surplus with itself (Gros, 2017): a logical impossibility that must result from measurement error or from—possibly fraudulent—misreporting. This global current account surplus is mainly driven by the trade balance, and not by primary income statistics, which are known to suffer from measurement error. This fact—that we refer to as the world’s self-surplus—has prompted Krugman (2010) to develop a humorous theory of “interstellar trade” that could be at the root of the global surplus.

Less well known, the European Union has been running massive trade surpluses with itself over years, i.e., exports from EU Member States to other EU Member States exceed the value of imports as recorded by the importer. This self-surplus amounts to 307 bn Euro in 2018, or 1.9% of the Union’s GDP. The EU’s self-surplus is almost as big (86%) as the global-surplus of 357 bn Euro (422 bn USD) in 2018.Footnote 1 It is larger than the frequently criticized current account surplus of GermanyFootnote 2, and surpasses the combined GDP of the eight smallest EU Member States. It is therefore too big to be lightheartedly discarded as an irrelevant if amusing statistical artefact. As we argue, this self-surplus follows non-random patterns and may therefore unveil systematic biases. We suspect that the discrepancy may result from massive fraud in value added tax (VAT) declarations, which would amount to up to 64 billion Euro in a worst case scenario. In principle, such fraud would involve an over-declaration of export figures to save VAT for domestic sales. This is possible due to the destination principle in the European VAT system. Hence, firms face a sizeable and direct incentive to mis-report domestic transactions as exports. On the aggregate level, this may yield a credit-bias in intra-European Balance of Payments (BoP) data and can consistently explain the EU’s trade self-surplus.

Recently, probably due to increasing international economic tensions, current account (CA) statistics have attracted unusual attention both from policy makers as well as researchers. For instance, there is substantial uncertainty about whether the EU has run a bilateral CA surplus or a deficit with the United States over the past decade. In times of trade conflicts, the absence of a definite answer is troublesome; see Braml and Felbermayr (2019) for an illustration and tentative interpretation of transatlantic facts. The underlying problem, however, is broader: International transaction data are of poor quality, due to negligence, strategic government manipulation, and fraud. This obviously sets limits for meaningful evidence-based research and policy making and, probably even worse, allows fraudsters hiding their illegal activities.

To the best of our knowledge, ours is the first paper that offers a systematic analysis and interpretation of the EU’s self-surplus.Footnote 3

We provide novel evidence by (i) describing the magnitudes and dynamics of the discrepancies in the intra-EU current account, by (ii) decomposing the self-surplus according to the sub-accounts in the current account, by (iii) investigating the contributions of single EU Member States, and by (iv) the interpretation of our findings based on forensic accounting methods. For example, we show that goods and services trade contribute almost equally to the observed discrepancy. This allows us to rule out such explanations that are specific to only either of the two sorts of trade, e.g., CIF-FOB differentials in the case of goods trade. Given the persistence of the discrepancy, random statistical noise can unlikely account for the observed patterns. Hence, we raise the cui bono—who benefits?—question and argue, later in this paper, that VAT fraud is a plausible explanation for the significant credit-bias that is prevalent in European current account statistics.

For very good reasons, economists usually deem bilateral current account balances irrelevant from a macroeconomic perspective (Feenstra et al., 1999; Mankiw, 2018). However, they are of great importance for bilateral economic relations, in particular in the context of trade conflicts, currency unions, and external imbalances. For a deeper understanding of these self-surpluses as well as to explore potential mechanisms that drive it, this paper not only examines aggregated figures but focuses in greater detail on country and country pair specific discrepancies.

Discrepancies in so-called mirror data are statistical artefacts prevalent in many international data with dyadic dimension.Footnote 4 In principle, any market transaction involves the documentation of corresponding values by buyer and seller. After correct summation, balances must necessarily mirror each other: All countries’ exports are equal to all countries’ imports; this is a mechanical accounting process and holds true by definition. However, in reality, perfectly corresponding mirror data are the exception rather than the rule. Frankel (1978) discusses potential reasons for the worldwide current account deficit in the 1960s and 1970s.

In the early 2000s, the global debit-bias has turned into a credit-bias, meaning that surpluses exceed deficits. Helbling and Terrones (2009) suggest that time lags in international transportation might lead to lagged recording of imports relative to exports; in a world of rapidly growing trade, global surpluses would be a necessary consequence. According to this explanation, lengthy shipment processes and high growth rates distort trade figures. This bias should even revert, when global trade shrinks.

Academic literature has emphasized the link between missing trade and export restrictions: Fisman and Wei (2009) measure the smuggling of art as bilateral discrepancy in trade figures and Vézina (2015) shows that missing exports frequently result from resource export restrictions. Both studies illustrate that discrepancies at the country level correlate with corruption indexes.Footnote 5 Since export restrictions lead to an under-declaration of the true export values, the opposite mechanism could explain the EU’s self-surplus: import restrictions would materialize in excess trade surpluses. However, no relevant import restrictions vindicating missing trade of more than 300 bn Euro annually are known in the very case of intra-EU trade, one of the world’s most liberalized cross-border economic zone.

Considering the geographical proximity of EU Member States and the resulting possibility for land transportation, the relative weak economic growth, as well as the fact that services trade is equally affected, leads us to assume that other reasons must lie at the heart of the observed discrepancies. Ferrantino et al. (2012) investigate discrepancies in goods trade between the USA and China and link them to VAT fraud and tariff evasion. As tariff evasion can be ruled out for transactions within the EU customs union, our interpretation focuses on VAT fraud. In a nutshell, we argue that firms over-report the true value of their exports to receive a VAT rebate for goods that are sold domestically. The resulting export statistics are overblown and fail to match corresponding import figures.

The remainder of this paper is structured as follows: first, Sect. 2 discusses relevant conceptual issues and, second, reports the main finding: an empirical stock-taking of the EU and Euro Area self-surpluses. Third, it shows which Member State pairs exhibit the most severe discrepancies and identifies the countries that are most likely “responsible” for the discrepancy of a given pair. Section 3 provides a broad discussion of our descriptive findings, thereby turning to VAT fraud as an explanation for the observed discrepancies. Moreover, it estimates the potential revenue shortfalls and outlines a concept for an electronic clearing procedure that would tackle VAT fraud by the over-declaration of exports. Section 4 summarizes our main findings and concludes.

2 Documenting the EU self-surplus puzzle

2.1 Current account data

Providing a full picture of the entire current account distortions would require information on all its items: goods trade, services trade, primary income, and secondary income.Footnote 6 In a world without reporting issues—where mirror data match perfectly—the following two accounting identities must hold:

  1. 1

    All sub-accounts of the intra-EU current account sum to zero.

  2. 2

    The aggregate intra-EU current account sums to zero.

Identity 1 is self-evident. If it holds, Identity 2 holds as well. Even if Identity 1 is violated, Identity 2 can still hold. This could be the consequence of so-called demarcation problems.Footnote 7 Such demarcation problems emerge, for instance, with “servitization”: this means that an increasing share of services content embodied in manufacturing exports caused by, amongst others, related software, design, financing, or maintenance tasks makes a clear-cut differentiation between goods and service trade difficult.Footnote 8 Also, some transactions may misleadingly appear in primary income accounts rather than in services trade accounts. This can be the case when countries do not provide services associated with intangible assets directly but through tax havens. Typically, such demarcation issues cancel out after aggregation.Footnote 9

For the purpose of the present analysis, we rely on balance of payments (BoP) data provided by Eurostat only.Footnote 10This rules out methodological differences in data compilation or differences in the interpretation of the Balance of Payments Manual 6 (International, 2019) by reporting countries.Footnote 11 Please note that for Malta data are either not recorded or not published. All values are denoted in Euro. We apply the term “discrepancy” to refer to corresponding import-export statistics which fail to match each other. Aggregate discrepancies are also referred to as self-surpluses or self-deficits, respectively. Unless not specified otherwise, imports and exports include the sum of goods and services trade. In contrast to primary income accounts, which are not accessible, we also observe sizeable discrepancies in secondary income accounts (which are typically considered to be the least important current account item). In 2018, they amount to 98 bn Euro. Secondary income consists of government transactions and private donations (aid, remittances) and is therefore quite different from cross-border trade (that is conducted predominantly firm-to-firm). Hence, it does not come as a surprise that its statistical recording suffers from severe shortcomings. For our objective—the identification and quantification of VAT fraud—discrepancies in the secondary income account are if at all of only little relevance. Hence we do not further investigate them.

2.2 The evolution of the EU’s trade self-surplus

First of all, we present the overall trade balance for the EU and the Euro Area with themselves. For this purpose, we sum up trade balances of all EU (Euro Area) Member States vis-à-vis all EU (Euro Area) Member States.

Figure 1 shows that both the EU and the Euro Area run substantial trade self-surpluses; evidently, Identity 1 has been violated for the past 12 years. In 2018, the total self-surpluses (goods and services) amount to 307 and 126 bn Euro, respectively. These self-surpluses equal 1.9 and 1.1% of the respective nominal GDP. The mere size of the intra-EU trade surplus is stunning: In absolute numbers, it is more than the combined GDP of the EU’s 8 smallest economies.Footnote 12 Dynamics of the shown discrepancies reveal that these self-surpluses are persistent over time: over 12 years of observation, the EU and the Euro Area ran surpluses that cumulatively amount to 3 and 1.6 trillion Euro. While for the Euro Area, the total discrepancy fluctuates between 0.5 and 1% of GDP for almost one decade, the EU-wide self-surplus has increased and now reaches levels close to 2%. From 2013 onward, both the EU and the Euro Area exhibit a strong growth in the services trade self-surplus. When it comes to the EU, 46% of the total discrepancy is due to services trade. In the case of the Euro Area, this share amounts to 31% in 2018. Any major difference between the EU and the Euro Area can plausibly be attributed to the UK to a large extent.Footnote 13 This issue has statistically been resolved by Brexit but may still be prevalent economically. Overall, these discrepancies can hardly result from random measurement errors; otherwise, one would expect the time series to be stationary with mean zero.

Fig. 1
figure 1

EU and Euro area trade self-surpluses, bn EUR and % of GDP. Source: Data from Eurostat 2019; own calculations. Note: Left scale in bn Euro refers to the bars, right scale in % of nominal GDP refers to the dashed line. Figures show balances of the goods and services trade BoP accounts.

The reported discrepancies have also changed relative to the gross trade volume: Figure A-1 shows the dynamic of the self-surpluses relative to the aggregated BoP credit positions. For the EU, the self-surplus in goods has decreased from 9.5 to 5.5% in 2011 and moves sideways since. For the Euro Area, the level has decreased from 7.5% in 2015 to 4.9% in 2018. For both the EU and the Euro Area, self-surpluses in service trade show an upwards trend over time with an increase from 4.4 to 11.3% and from 2.6 to 6.1%, respectively.

As initially proposed, discrepancies in opposite directions because of demarcation issues would cancel. If that was the case, one balance would have to show a self-surplus and the other a self-deficit. This is not the case: for both the EU and the Euro Area, the total discrepancy of the net exports is almost perfectly equal to the total of discrepancies of the goods balance and the services balance.Footnote 14

As mentioned before, not just the EU but the entire world runs a substantial trade self-surplus. This discrepancy amounts to 422 bn USD, which equals 0.5% of global output (or, equivalently, about 1.7% of world exports) in 2018, and is only slightly higher than the EU’s total trade self-surplus of 363 bn USD (307 bn EUR). Figure 2 tracks the evolution of the global trade self-surplus, the EU’s trade self-surplus as well as the global current account discrepancy, which consists by and large of global trade surpluses. The diagram shows that the global trade self-surplus was negative before 2004 and has increased since then, mostly in lockstep with the EU’s own self-surplus. Thus, the global surplus does not appear to result from interstellar trade (Krugman 2010) but rather, misreporting within the EU seems to play an important role.Footnote 15

Strikingly, the global trade deficit started growing in 1993 and turned into a surplus in the late 1990s. During the preceding decades, it always has been a global deficit. The year 1993 is also the starting date of the EU Single Market, which has facilitated intra-European trade substantially (Felbermayr et al., 2018). In 2004, the EU Eastern enlargement has created the world’s second largest internal market. Unfortunately, comprehensive data series that would allow calculating the EU self-surplus back to 1993 are not available. Between 1999 and 2003, the EU self-surplus has remained very stable. Later, coinciding with the EU Eastern enlargement, it has quadrupled. Arguably, establishing the Single Market has simplified VAT procedures in intra-EU trade. Other ancillary achievements such as the Schengen Agreement or the creation of the Euro Area have fostered intra-EU trade on top of that; unintentionally, it may have facilitated the chance for VAT fraud, too.Footnote 16

Fig. 2
figure 2

The EU self-surplus in the global context, bn USD. Source: IMF 2019. World Bank 2019. Eurostat 2019; own calculations. Note: Comprehensive EU data before 1999 are not available. The global current account and trade self-surpluses refer to the sum over all current account and trade balances, respectively. Global figures might include missing values.

2.3 Dissecting the discrepancies

The aim of the following sections is to shift the focus of the analysis to the country and country pair level. More specifically, we are interested in identifying country pairs that cause particularly large discrepancies. One can plausibly argue that the quality of institutions, the nature of national tax systems, and even geography should play a role in explaining discrepancies caused by fraudulent behavior, in particular by VAT evasion. Therefore, this section serves as the empirical foundation of the interpretation outlined in Sect. 3.

We draw on Eurostat data on bilateral trade flows (goods and services) for the year 2018.Footnote 17 Considering all 28 EU Member States, we have a maximum of 756 (28\(\times \)27) observations per BoP item.Footnote 18 The Eurostat BoP data yield intra-EU goods trade exports (credit) totaling 2874 bn Euro. This accounts for 82% of intra-EU exports recorded by Eurostat trade statistics (Comext).Footnote 19 It becomes apparent immediately that the coverage of EU BoP data is in need of improvement. The sensitivity analysis in Sect. 2.6 provides a more detailed comparison of BoP and foreign trade statistics data.

Our preferred measure for bilateral discrepancies takes credit and debit positions into account. The vectors \(X_{ij}\) and \(M_{ij}\) are reported by country i, the vectors \(X_{ji}\) and \(M_{ji}\) by country j.Footnote 20 In the case of perfectly matching mirror data, \(X_{ij}\) should equal \(M_{ji}\) and \(M_{ij}\) should equal \(X_{ji}.\) Let \(\mathcal {E}\) denote the set of all EU Member States. Ideally, we should observe

$$\begin{aligned} \sum \nolimits _{j\in \mathcal {E}} \sum \nolimits _{i\in \mathcal {E}} X_{ij}= \sum \nolimits _{j\in \mathcal {E}}\sum \nolimits _{i\in \mathcal {E}}M_{ji}. \end{aligned}$$
(1)

However, bilateral flows do not match perfectly in the data so that \(X_{ij}\ne M_{ji}\) and \(M_{ij}\ne X_{ji}\). Consequently, the equality above fails to hold. We may define the discrepancy \(\Delta _{\mathcal {E}}\) such that

$$\begin{aligned} \Delta _{\mathcal {E}}\equiv \sum \nolimits _{j\in \mathcal {E}}\sum \nolimits _{i\in \mathcal {E}}\left( X_{ij}-M_{ji}\right) =\sum \nolimits _{j\in \mathcal {E}}\sum \nolimits _{i\in \mathcal {E}}\Delta _{ij}, \end{aligned}$$
(2)

where we define the absolute bilateral reporting discrepancy as \(\Delta _{ij}\equiv X_{ij}-M_{ji}\). Empirically, we observe \(\Delta _{\mathcal {E}}>0\). This can be due to systematic over-reporting of \(X_{ij}\) or under-reporting of \(M_{ji}\).Footnote 21 As the “true” size of the trade flows between i and j is unknown, we cannot distinguish between the two sources of misreporting. Both are possible. If we knew the true value, which assumptions could we make about the structure of the measurement error?

Assuming errors are random and multiplicative, the number of elements in \(\mathcal {E}\) growing very large, would lead to \(E\left( \Delta _{\mathcal {E}}\right) \rightarrow 0\). As we will show empirically, this is not true. So, errors in fact are non-random. In the following, the discrepancy in a pair is expressed in relative terms as

$$\begin{aligned} \delta _{ij}=\frac{2\Delta _{ij}}{X_{ij}+M_{ji}}. \end{aligned}$$
(3)

For the sake of convenience, this equation is pre-multiplied with \(100 \%\). Table 1 shows summary statistics for the observed discrepancies in bilateral BoP data for the year 2018. Figure A-2 illustrates the underlying distribution graphically. Positive means and medians confirm what we know from aggregate data: the presence of a credit/export bias. For goods trade, 50% of the flows are outside a discrepancy range between −10 and 18%. For services trade, the same is true for a range between − 18 and 46%.

Table 1 Summary statistics: bilateral discrepancies, 2018, in %.

2.4 Country analysis

This section scrutinizes which countries appear to have the largest reporting biases. The identification of Member States who play a major role in causing the observed discrepancies is of great interest. Due to the dyadic dimension of the problem, a definite identification is difficult to achieve. As an approximation to this problem, we calculate average country level discrepancies:

$$\begin{aligned} \delta _{i} = \frac{1}{n} \sum \nolimits _{j\in \mathcal {E}} \delta _{ij} \quad \text {and}\quad \delta _{j} = \frac{1}{n} \sum \nolimits _{j\in \mathcal {E}} \delta _{ij}, \end{aligned}$$
(4)

where \(\delta _{i}\) denotes the mean discrepancy for country i being the exporter, \(\delta _{j}\) denotes the mean discrepancy for country j being the importer, and n measures the number of trade partners (i.e., the cardinality of the set \(\mathcal {E}\)). Table 2 shows mean discrepancies per country based on Eq. 4 for goods and services separately.

Table 2 Mean discrepancies per country, 2018, in %.

Member States with the largest export biases (in percent) are Luxembourg (41), Bulgaria (29), and Greece (28). Germany, Europe’s largest exporter, reports on average 10% higher goods exports than its respective importers.Footnote 22 The Netherlands (0.3), Czechia (− 3), and France (− 3.3) are those Member States with the lowest discrepancies in their goods exports. Sweden (− 65), Cyprus (− 62), and Ireland (− 20) display a severe tendency toward a substantial under-reporting of their exports.

Discrepancies in services accounts are on average larger than in goods accounts: the average absolute discrepancy is 26% (18% in goods). Over-reporting goods exports goes along with over-reporting services exports. The two figures are significantly positively correlated (\(corr = 0.62\)). The most accurate reporting is performed, again, by France (2.3) and the Netherlands (− 0.8), the largest inaccuracies are observed for Cyprus (− 80) and Luxembourg (63).

Table 2 indicates that economically smaller EU Member States are more likely to exhibit larger discrepancies. Indeed, absolute values of discrepancies decrease in mean trade volume: a doubling of trade volume lowers absolute discrepancies by 4.9 (goods) and 3.7 (services) percentage points. This does not come as a surprise: scale economies might also apply to statistical recording, and statistical offices might allocate resources to prioritize tackling larger trade flows. Scatter plots in Figs. A-3 and A-4 show the relationship between trade volume and discrepancy.

2.5 Country pair analysis

This subsection explores which country pairs are particularly prone to bilateral discrepancies. Additionally, we investigate whether neighboring countries display notably higher discrepancies—another finding that would support our hypothesis that VAT fraud plays a role in solving the self-surplus puzzle. We begin by defining the mean country pair discrepancy as

$$\begin{aligned} \zeta _{ij} \equiv \frac{1}{2} \bigg ( | \delta _{ij} | + | \delta _{ji} | \bigg ), \end{aligned}$$
(5)

which is a symmetric measure in the sense that \(\zeta _{ij}=\zeta _{ji}\). Hence, we base our analysis on 378 unique country pairs.Footnote 23 We average absolute values of flow discrepancies since positive and negative discrepancies could otherwise net out. Country pair discrepancies of goods and services trade are significantly correlated, the correlation coefficient is 0.28. Table 3 shows summary statistics of the distribution of country pair discrepancies.

Table 3 Summary statistics: country pair discrepancies, 2018

Table A-1 and Table A-2 list the country pairs with the highest average discrepancy for goods trade and services trade. Among the 25 country pairs showing the highest pair discrepancies in goods trade, Sweden and Cyprus are listed 12 and 9 times, respectively. In the services account, Cyprus is part of 11 country pairs. The country pair UK–Luxembourg is particularly striking: according to British data, the service trade volume amounts to 8.2 bn Euro; the same figure, as reported by Luxembourg, stands more than three times as large at 27.3 bn Euro. The corresponding discrepancy totals 108% of the mean trade volume. Also, the UK’s service trade with Denmark worth 11.5 bn Euro (on average) shows a discrepancy of 91%. The trade volume of these two country pair relationships is higher than the combined trade of the other 23 country pairs with exceptionally low reporting quality.

2.6 Comparison with official trade statistics

Next, we perform a sensitivity analysis based on an alternative data base. To this end, we use 2018 trade data from the Comext database, “Eurostat’s reference database for detailed statistics on international trade in goods.”Footnote 24 A comparison with services trade figures from another data source would be ideal. However, Eurostat only provides services trade data based on its Balance of Payments data, which obviously makes a comparison obsolete.

According to Comext, the EU (Euro Area) runs a self-surplus amounting to 64 (19) bn Euro in 2018. These numbers are significantly lower than those resulting from the reported Balance of Payments data. In contrast to Balance of Payments data, the foreign trade statistics (FTS) applies a different valuation method for imports and exports: Imports reflect transaction values at the border of the importing economy including cost, insurance, freight (cif); exports are recorded according to transaction values at the border of the exporting economy free on board (fob). Thus, a bias toward higher import than export values is systemically inherent to the FTS data. In contrast to the FTS, BoP statistics only comprise fob recorded data. Logically, one would expect zero bilateral discrepancies in BoP data and systematic import surpluses in FTS data due to cif-fob differentials. Both are evidently not the case, and a sizable export-bias is prevalent even in European FTS. This provides additional evidence for a systematic pattern of over-reported exports within the EU.Footnote 25

Aside from these differences in absolute numbers, Comext data strongly support our previous findings in qualitative terms. These data display a very similar distribution of discrepancies, and they allow replicating the pattern of average country discrepancies that we find in Sect. 2.4 as well as the same negative relationship between trade volume and discrepancies.Footnote 26 Again, Cyprus, Ireland and the UK show strong under-reporting biases. Malta, whose BoP data are not available, seems to have the most inaccurate data. For Luxembourg—at odds with the previously detected strong export-bias—a severe import-bias becomes visible. Germany shows the lowest discrepancies according to the FTS data. The country pair analysis yields similar results as shown above:Footnote 27 among the 25 country pairs with the most inaccurate data, Cyprus appears six times; Malta even 11 times.

All our findings remain absolutely robust when choosing 2017 as a reference year.Footnote 28 This is true both for BoP as well as Comext data. Hence, we are confident that the discrepancies shown above result from a systematic pattern that we discuss next. We have provided evidence that identifies country level and country pair level patterns in intra-EU Balance of Payments discrepancies for goods and services trade. In many cases, these discrepancies are disproportionately large and seem to follow recurring patterns. Based on the evidence gathered so far, the next section attempts to unravel the EU’s self-surplus puzzle.

3 Making sense of the self-surplus puzzle

The findings of the descriptive part highlight that current account data, even within the EU and the Euro Area suffer from very substantial inconsistencies. Recently, the British magazine The Economist postulated that “Rich countries’ trade statistics tend to be more reliable than those of emerging economies, where data collection is less developed”.Footnote 29 In light of our analysis, one can question this assertion.Footnote 30 Inconsistent data make solid evidence-based economic policy advice very difficult. What is more, it could reflect a much deeper problem: fraud. Since data limitations do not allow a casual identification, our interpretation is tentative. We apply forensic accounting methods and therefore, nota bene, we can neither claim completeness nor ultimate truths. We try to collect evidence for and against our claim.

In the following, we first discuss alternative interpretations of discrepancies in goods trade. Second, we move on to services accounts. Supporting our hypothesis—that VAT fraud causes the discrepancies—we follow a forensic accounting approach: We exclude other potential explanations for the observed phenomenon and argue why firms have a strong incentive to over-report true exports. Finally, we estimate the expected fiscal loss due to VAT fraud and outline a brief concept to improve data recording in cross-border trade.

3.1 Explaining the self-surplus in goods trade

We have shown that the EU self-surplus in goods trade exhibits a systematic, non-random over-reporting bias. Sections 2.4 and 2.5 present evidence for a high degree of variation in the quality of statistical recording across EU Member States and country pairs.

A well-known distortion for EU trade statistics is the so-called Rotterdam Effect. Overseas imports entering the EU in Rotterdam and transiting to other Member States are often recorded as Dutch exports and likewise as an intra-EU import by the counterparty. This leads to inflated trade statistics in bilateral positions with the Netherlands. Discrepancies, however, arise only when these overseas imports are recorded differently by the Netherlands and the country of final destination.Footnote 31 Interestingly, we do not find evidence that the Rotterdam Effect causes discrepancies in intra-EU trade statistics: Dutch accounts on goods trade, both credit and debit, are the most accurate among all EU Member States (0.3 and −3.1%, respectively), which means that they closely match those of their trading partners.

An argument made by Frankel (1978) is that current account discrepancies arise when firms try to circumvent capital controls, for instance, for profit shifting purposes. For intra-EU discrepancies, we can largely rule out this channel, as capital controls within the EU are extremely rare events.Footnote 32 One may suspect that transfer pricing has a distorting effect on intra-EU current account statistics. Even if manipulated transfer prices are used to shift corporate profits, under-priced/ over-priced transactions would not materialize in bilateral discrepancies since accurate BoP figures require correct double-entry bookkeeping, independent of artificially inflated or deflated gross values. Thus, a national trade balance would be distorted by such measures, but discrepancies in bilateral trade statistics are not a consequence thereof.

We believe that value-added tax fraud provides a more convincing explanation for BoP discrepancies. Participants of the Single Market exempt exports from value-added tax. Effectively, VAT is borne by domestic sales independent of their origin. When products enter into a foreign EU Member State, VAT is levied in the destination country. That is due to the fact that consumption instead of production is the objective of taxation. Subsequently, the European VAT system is considered an efficient tax, but it is seemingly prone to fraud: if firms declare products as exports which are in fact not exported (or re-imported), they can sell them domestically without remitting VAT to the respective government. The only and most directly concerned beneficiaries of over-stated exports are firms.Footnote 33

Therefore, the European VAT system is evidently prone to tax fraud. To curb such practices, the EU Commission initiated a VAT Action Plan in 2016.Footnote 34Assessing the damage for European tax payers, the EU Commission has estimated that a single fraud type—the so-called “missing trader”—causes annual VAT revenue shortfalls of 45–53 bn Euro (Fearing et al., 2015).Footnote 35 Fedeli and Forte (2009) describe technical details of VAT fraud systems.Footnote 36 This cross-border fraud type does not necessarily lead to discrepancies in bilateral trade statistics but emphasizes an important issue: the presence of large scale cross-border VAT fraud—presumably because authorities are less capable of identifying cross-border frauds compared to domestic VAT fraud.

The most straightforward way to circumvent VAT for firms is the mere over-declaration of exports and would, in principle, not involve any other party, neither domestically nor in a cross-border transaction.Footnote 37 In practice, to hide such fraud before tax authorities, several firms who trade on high frequency back and forth are involved. Impeding traceability of transactions, firms located in other EU Member States are usually involved, too. Whether many firms in different countries are involved or not, all such fraud types have one common feature: at some point, a fictitious export is declared and leads both to an unlawful VAT exemption and to a statistical discrepancy in BoP and FTS data. To support our hypothesis, we employ the following OLS regression model:

$$\begin{aligned} \zeta _{ij} = \beta _{0} + \beta _{1}log(V_{ij}) + \varvec{X_{ij}} \varvec{\mu } + \varvec{\nu _{i}} + \varvec{\chi _{j}} + \epsilon _{ij}, \end{aligned}$$
(6)

where \(V_{ij}\) denotes the mean trade volume for country pair ij. The vector \(\mathbf{X} \) includes several country pair specific control variables borrowed from the gravity trade literature: geographic distance, the presence of a common border, common language, shared history as well as differentials in VAT standard rates.Footnote 38 The vectors \(\varvec{\nu _{i}}\) and \(\varvec{\chi _{j}}\) denote country fixed effects taking the value one if countries i and j are part of a given country pair.Footnote 39

Table 4 Regression analysis: country pair discrepancies; 2018 cross-section

In light of these empirical results, three findings support our hypothesis: first, our country pair regression analysis suggests that discrepancies in goods trade are 11.6–13% points higher between neighboring countries (cf. Table 4).Footnote 40 This is an isolated border-effect irrespective of the trade volume (which typically is higher for countries who share a common border). If firms over-declare exports for VAT fraud purposes, they might well likely indicate to authorities the same destinations as of their “true” exports. For many SMEs—given their limited number of export destination markets—these are the neighboring countries. Second, differentials in VAT standard rates lead to higher discrepancies: a one percentage point increase in VAT rate differentials goes along with a 3% points increase in discrepancies.Footnote 41 Third, we find a significantly positive relationship between country-level BoP discrepancies and domestic VAT compliance gaps estimated by Morrow et al. (2019).Footnote 42 The results are depicted in Fig. A-5.Footnote 43

3.2 Explaining the self-surplus in services trade

The above-described export over-declaration incentive exists for services not differently than for goods trade. However, good reasons point in the direction that the nature of services trade differs in some respect and that statistical recording is differently conducted. On the one hand, our correlation exercise in Fig. A-5 does not reveal a relationship between VAT compliance gap and services discrepancies. On the other hand, we miss a proxy to account for the quality of statistical recording. Obviously, the less developed statistical recording is, the higher is the chance for successfully embedding illegal or fictitious transactions under the disguise of services trade. For historical reasons, statistical regimes in the EU differ by Member State. And for various reasons, data quality for goods trade are better than for services trade. First, due to the formerly lower economic importance of cross-border service trade, efforts have primarily focused on establishing international standardization for the recording of goods trade. Second, because services are traditionally not subject to import tariffs, governments draw revenue only from the imports of goods. Hence, they always had fiscal interests in high quality statistical recording of international goods trade.

Similarly to discrepancies in services trade between the EU and the USA (Braml & Felbermayr, 2019), the UK contributes quite substantially to discrepancies in intra-EU services trade: the total EU self-surplus in services amounts to 141 bn Euro. Within the Euro Area, this figure stands at 39 bn Euro only. The British Office for National Statistics (ONS) generates services trade statistics by conducting survey-based partial censuses and national projections, which evidently lead to high discrepancies (Chesson et al., 2018).

However, high average discrepancies for countries with certain relevance for financial services indicate that this sector is particularly prone to statistical mis-recording. Table 2 (Column 2 and 4) illustrates that this matters for Cyprus, Ireland, Luxembourg, the Netherlands (only Debits), and the UK. It stands out that these countries display discrepancies for credit and debit accounts that point in the same direction (Cyprus, Ireland, the UK, and the Netherlands exhibit under-reported figures, Luxembourg over-reported figures). Service exports and primary income payments are, to a growing extent, substitutable and can distort BoP sub-accounts. Hence, this issue might be connected to corporate profit shifting. A joint evaluation of services and primary income accounts would be necessary for a final assessment. Due to data limitations, such an undertaking is not possible yet but could be the subject of future research when primary accounts perhaps become available.Footnote 44

Figure 1 shows that the EU’s mysterious self-surplus in services has increased five-fold since 2010. This sharp surge can hardly be explained by general recording problems, which would be time-invariant. We therefore suspect another trend to drive this increase: dis-intermediation and e-commerce. Services exports used to be mainly business-to-business transactions in the past. Nowadays, consumers can directly purchase from foreign companies and no intermediator—formerly often a domestic importer—is needed. This is particularly true for cloud, streaming, and software services. When trade mainly consists of high-value business-to-business transactions, no systematic credit-bias occurs; but when trade increasingly entails more business-to-consumer transactions, a credit-bias becomes prevalent: The exporter records the cross-border transaction, while the importer—the final consumer—does not. When low-value transactions are performed on a high scale, statistics systematically under-report true import figures. This is a direct effect of so-called de-minimis thresholds.Footnote 45 Thus, the combination of e-commerce and dis-intermediation is a growing challenge for statisticians, especially in the correct recording of services transactions.

At the same time, e-commerce has also become a platform for criminals who have set up digital VAT carousel schemes. In principle, the mechanism for VAT fraud in e-commerce functions in an analogous manner as in traditional goods trade: exports are VAT-exempt. Declaring services as exports which are in fact not exported therefore materializes in non-remitted value added tax. Borselli et al. (2015) describe a case disclosed in Italy, where two large telecommunication providers became unknowingly part of a cross-border VAT fraud scheme that cost Italian taxpayers 365 mn Euro. This provides evidence that cross-border VAT fraud is not only limited to goods trade. Following up on this, VAT fraud might be even more profitable in services trade due to lower transactions costs in case of the involvement of any physical shipment.

In the EU, statistical recording is hampered by a fundamental lack of harmonization. This is particularly true for the collection of cross-border services trade data. The existence of 28 different regimes in the EU not only causes statistical discrepancies; the poor data situation also makes it easier for fraudulent parties to hide their illegal activities. To avoid unlawful practices with respect to services trade, some EU Member States, e.g., Germany have partially changed their systems of VAT collection. In contrast to the general principle, in which the provider of a service is obliged to remit VAT, a reverse charging has been implemented.Footnote 46 Basically, it levies the duty to remit VAT to the services recipient, e.g., the final consumer. In order to fight tax fraud, Borselli et al. (2015) also recommend reforms toward a system of reverse charging. The authors are unaware of solid empirical evaluations of such VAT reforms which ideally will be covered by future research based on micro data.

With respect to ongoing debates about the implementation of a revenue-based digital services tax (DST), an accurate revenue attribution to destination markets is a prerequisite. Thus, improving data quality on cross-border supply of services should be in the twofold interest of countries, i.e., to fight VAT fraud as well as for an accurate tax base for a potential future DST.

3.3 Potential VAT revenue shortfalls

In the following, we quantify VAT revenue shortfalls for EU Member States. Thereby, we assume that VAT fraud is the only reason for the observed credit-bias in intra-EU BoP accounts. Given this relatively strict assumption, our estimates should be interpreted as an upper bound of the actual fiscal loss. Note, however, that there may be VAT fraud that is not detectable in international trade statistics. Hence, our estimates may also underestimate cross-border VAT fraud.

Let \(X_{ij}\) be the sum of services and goods exports of country i to country j,  and let \(M_{ji}\) be the imports of country j from i, where i and j are both members of the EU. Let the average VAT rate in country j be \(\bar{t}_{j}.\) If the entire data discrepancy were due to VAT fraud, the fiscal loss to the government in country j would amount to

$$\begin{aligned} T_{j}= \bar{t}_{j} \sum \nolimits _{i}\left( X_{ij}-M_{ji}\right) \end{aligned}$$
(7)

for all pairs ij where \(X_{ij}>M_{ji}\). For the EU as a whole, the aggregate loss yields

$$\begin{aligned} T=\sum \nolimits _{j}\sum \nolimits _{i}\bar{t}_{j}\left( X_{ij}-M_{ji}\right) . \end{aligned}$$
(8)

Our data do not allow calculating \(T_j\) from Eq. 7, because \(\sum \nolimits _{i}\left( X_{ij}-M_{ji}\right) > 0\) is only satisfied for 18 EU Member States (as well as the Union as a whole). However, we know aggregate balances vis-à-vis the EU as a whole for all Member States (see Sect. 2.3). Let \(\bar{t}\) be the GDP weighted average EU VAT rateFootnote 47, the EU-wide VAT loss can be approximated by

$$\begin{aligned} T=\bar{t} \, \Big ( \sum \nolimits _{j}\sum \nolimits _{i}\left( X_{ij}-M_{ji}\right) \Big ) . \end{aligned}$$
(9)

Since \(\sum \nolimits _{j}\sum \nolimits _{i}\left( X_{ij}-M_{ji}\right) =307\) bn Euro in 2018, with \(\bar{t}=0.21,\) we arrive at about 64.5 bn Euro of taxes forfeited. As the UK’s contribution to the discrepancies is most likely resulting from measurement error, not the full amount of 307 bn Euro is due to VAT fraud. But even within the Euro Area, the discrepancy of 126 bn Euro implies an amount of fraud equal to 26.5 bn Euro; more than 70 Euro per capita. Put differently, if we assume that VAT fraud affects trade in goods only, the fiscal loss amounts to 34.9 bn Euro. From 2006 to 2018, the cumulative self-surplus for goods of the entire EU amounts to 2,047 bn Euro. Assuming an average VAT rate of only 18% for the entire periodFootnote 48, EU budgets could have fallen short of 370 bn Euro over the past 13 years.

3.4 Implementing an electronic clearing procedure

Trade data appear massively distorted by inaccurate measurement and fraudulent misreporting. We believe in technical solutions to tackle both. Automated data exchange has effectively fought tax fraud in other areas, such as capital income, and could well be applied for a EU-wide VAT cooperation. An outline of such a solution could look as follows: the implementation of an electronic clearing procedure that documents all cross-border transactions for goods and services. Every transaction should require a two-factor authentication: first, the exporter reports export value, quantity, and counterparty to the system. Second, the importer confirms transaction details. The data collected would be automatically transmitted to statistical offices and tax authorities. The reverse VAT charging, that usually applies to intra-community supply, should apply here as well. As long as the importer does not confirm the transaction, VAT liability is with the exporter. At the moment of confirmation, tax liability passes over to the importer. In this system, to avoid having to remit VAT, the exporter would urge the importer to confirm the transaction. In any case, at least one party would have to remit VAT. Thus, potential fraud (mis-declaration or a wrong confirmation by any of the two parties) can no longer lead to non-taxation. Thereby, legal VAT evasion would at least be limited to the scale that VAT rates differ across EU Member States.

This system is also applicable for business-to-consumers services transactions: every EU citizen could have an electronic VAT ID to pay taxes for, e.g., imported streaming services or other e-commerce services. Payments could be processed automatically by an electronic VAT App. An additional advantage of this procedure is that de-minimis thresholds would become obsolete.

4 Concluding remarks

This paper reports and analyzes large inconsistencies in intra-European balance of payments data. The mere size of the mysterious EU trade surplus with itself—307 bn Euro, or 1.9% of the Union’s GDP—is truly remarkable. This EU trade self-surplus is persistent over time. Recently, the correct recording of services trade has become an additional challenge for statisticians due to the expansion of cross-border supply of services, e.g., by e-commerce traders and streaming or cloud services providers.

The EU self-surplus is equivalent to 86% of the global trade self-surplus of 422 bn USD in 2018. After the EU Eastern Enlargement in 2004, reported discrepancies have quadrupled. The trade self-surplus of the Euro Area amounts to only 41% of the EU self-surplus. Somewhat ironically, after the UK’s departure from the EU, average data quality in the Union improves.

This paper introduces a simple measure for discrepancies on the country and country pair level. Our analysis finds large heterogeneity in data accuracy across countries indicating substantial differences in governmental practices of statistical recording. According to our estimations, Cyprus, Ireland, Luxembourg, and Sweden are the EU Member States with the most inaccurate statistical regime. The Netherlands provide the most accurate data for goods trade. Accounting for economic size, British figures seem to distort intra-EU current account data most strongly—an issue that is, at least formally, resolved by Brexit. The self-surpluses for goods and services trade have increased over time and give reason to suspect that statistical regimes in the EU are systematically incapable of tracking true import and export figures.

Applying forensic accounting methods, we find suggestive evidence that VAT fraud drives discrepancies, in particular for neighboring countries and countries with differentials in applied VAT rates. Attributing the observed discrepancies to VAT fraud, we can quantify EU-wide VAT revenue shortfalls from 27 to 35 bn Euro per year in a realistic scenario. At worst, revenue shortfalls would amount to 64 bn Euro. Finally, we link the growing importance of e-commerce and the process of dis-intermediation to rapidly growing discrepancies in services trade. For the sake of fighting tax fraud, but also to enable policy relevant research based on reliable data, we call on the institutions in charge to substantially improve quality and reliability of intra-EU BoP data. Therefore, we have outlined an electronic clearing procedure that has great potential to inhibit tax fraud and misreporting. With respect to services trade, the correct accounting of cross-border services supply would also be a prerequisite for a future digital services tax. Also, the non-disclosure or non-collection of certain BoP items (e.g., primary income) need to be tackled urgently.

This paper is also meant to encourage future research in the field of VAT/tax fraud, profit shifting, missing trade, and the ex-post evaluation of tax reforms, such as the introduction of reverse charging procedures. The use of micro data could complement the macro approach taken in this study.