1 Introduction

Input-output analysis is widely used by authors working in the fields of regional science or regional economics. It is also becoming increasingly popular in environmental and ecological economics (Los 2011). Naturally, some ecological economists are also interested in conducting environmental impact studies for individual regions. Therefore, it is likely that regional input-output models will frequently be used for environmental impact studies in the future.

When studying a particular region, analysts often have to construct a regional input-output table (RIOT), since many statistical offices provide only national input-output tables (NIOTs). Fortunately, there are established methods for regionalising a NIOT and adapting it to regional characteristics (nonsurvey methods). Numerous methods have been suggested; the most recent ones are the FLQ (Flegg and Webber 1997, 2000; Flegg et al. 1995) and CHARM (Kronenberg 2009). A large and growing literature discusses the strengths and weaknesses of these methods (Bonfiglio and Chelli 2008; Morrison and Smith 1974; Richardson 1985; Schaffer and Chu 1969; Tohmo 2004).Footnote 1 However, the focus of the present paper is a different one.

This paper aims at drawing attention to a crucial issue whose importance has not yet been fully realised in the literature on nonsurvey regionalisation methods: there are different variants of the symmetric input-output table (SIOT), and the choice of nonsurvey method should depend on the type of SIOT that is to be regionalised. The most important difference between the SIOT variants lies in the treatment of imported products. The United Nations handbook on input-output analysis identified four different variants, labelled alphabetically from “A” to “D” (United Nations 1973). This convention is also adopted in the present paper, and an additional variant “E” (for “Eurostat”) is introduced to describe the tables based on the European System of Accounts (ESA 95).

A crucial finding of this paper is that location quotient (LQ) methods are suitable for variant B tables, whereas commodity balance (CB) methods are suitable for variant A and E tables. The existing literature has not paid much attention to this issue because regional economists mostly use variant B tables. Ecological economists, by contrast, are more likely to use variant A or E tables.

The paper proceeds as follows. Section 2 introduces definitions of variables and conventions on mathematical notation. Section 3 explains the different variants of the SIOT, largely following the exposition in the UN manual (United Nations 1973). Section 4 describes the interpretation of the coefficients derived from different tables. Section 5 identifies the implications that follow for those who want to construct a RIOT using nonsurvey methods. Finally, Sect. 6 provides some concluding remarks and suggests avenues for future research.

2 Definitions and conventions

Figure 1 shows the basic data which are needed to construct input-output tables of the sort that will be discussed below.

Fig. 1
figure 1

Basic data. Source: author’s illustration

The following conventions will be used: the subscript i stands for products or commodities, whereas the subscript j stands for industries or homogeneous branches. As all the input-output tables provided by Eurostat are based on the concept of a “homogeneous branch”, an artificial construct which by assumption produces only one type of commodity, the present paper also uses this concept. In reality, many firms produce more than one type of good, so it is impossible to assign the inputs and outputs of one firm unambiguously to one homogeneous branch. Therefore, the statistical offices “split” the firm and allocate its inputs and outputs over a number of homogeneous branches.Footnote 2 The details of this procedure are outside the scope of the present paper. In the following, the term “industry” will be used as a synonym for “homogeneous branch”.

In Fig. 1 (and henceforth), a superscript d or m is used to indicate the origin of products (domestically produced or imported). Matrices are denoted by capital letters, vectors by lower case letters. Both are printed in bold type. The individual elements of a matrix are printed in italics. Thus, for example, Z i,j is element i,j of matrix Z. It reports the total amount of product i used by industry j. The amount originating from domestic production is \(Z_{i,j}^{d}\), whereas the amount imported is \(Z_{i,j}^{m}\). Naturally, \(Z_{i,j}^{d} + Z_{i,j}^{m} = Z_{i,j}\).

In addition to Z d and Z m, the basic data table contains the following elements. The vector v reports value added (i.e. primary inputs) by industry, and the vector x reports output (i.e. production) by industry. The vectors y d and y m contain domestic final use of products, respectively. Domestic final use is defined as the sum of private consumption expenditure, public consumption expenditure, and gross capital formation. The vector e d reports exports of domestically produced commodities, whereas e m reports exports of imported commodities (i.e. re-exports). Finally, the vectors u d and u m describe the total use of domestically produced and imported commodities. Total use is defined as the sum of intermediate use, final domestic use, and exports. Mathematically:

$$u_{i} = \sum_{j = 1}^{n} Z_{i,j} + y_{i} + e_{i} $$
(1)

where n is the number of products. Naturally, this relationship holds for both domestically produced and imported products:

(2a)
(2b)

Figure 1 shows which data are needed for simple applications of input-output analysis. The big advantage of input-output tables is that they arrange these data in a straightforward manner that is consistent with standard book-keeping procedures. Table 1 shows how this can be done in the form of a comprehensive input-output table containing all of the relevant information.

Table 1 The comprehensive input-output table

The first column of Table 1 refers to the first industry (henceforth ‘industry 1’). In the first three rows (with row 1 referring to product 1, row 3 to product n, and row 2 to “all products in between 1 and n”), the elements of Z d concerning industry 1 are reported. Row 4 contains a subtotal, denoted by \(z_{1}^{d}\). This is the sum of all domestically produced products that were used as intermediate inputs by industry 1. Below that, the relevant elements of Z m are reported. Rows 5 to 7 show the use of imported products as intermediate inputs by industry 1, and row 8 contains the sum of these, \(z_{1}^{m}\). In row 9, we find the sum of \(z_{1}^{d}\) and \(z_{1}^{m}\). z 1 is the value of all intermediate inputs used by industry 1. Using these intermediate inputs, industry 1 generates a certain amount of value added, reported in row 10 and denoted by v 1. Taxes on products, which drive a wedge between basic prices and purchasers’ prices and divert a share of value added to the government, are ignored here for the sake of simplicity. Depreciation (i.e. consumption of fixed capital) and taxes on production are also ignored. Under these assumptions, value added is simply the sum of compensation of employees (wages plus social security contributions, the reward for labour services) and net operating surplus (the remuneration of capital).

The following relationship holds by definition:

$$x_{j} = \sum_{i}^{n} Z_{i,j}^{d} + \sum_{i}^{n} Z_{i,j}^{m} + v_{j} $$
(3)

This expression states that the value of output produced by industry j is equal to the value of intermediate products used by that industry and the value added by that industry. This definition is in accordance with the classical theory of value, where firms buy inputs (commodities) of a certain value and generate additional value in the course of the production process. The added value is then distributed to the primary inputs labour, capital and land (although capital and land are unfortunately not displayed separately in the input-output tables).

Moving along the first row of Table 1, we can see how and where domestically produced products of type 1 are used. The first three columns show the amounts of product 1 that are used by industries 1 to n as intermediate inputs, and column 4 shows the sum of these. Column 5 shows the domestic final use (final consumption expenditure by households, NPISH,Footnote 3 and government as well as gross capital formation including stock formation) of product 1. In column 6 we observe the amount of product 1 that is exported to other countries. Finally, column 7 reports total use (i.e. the sum of intermediate use and final use). Rows 2 and 3 show the same for all other domestically produced commodities, and row 4 shows the sum of rows 1 to 3. Rows 5 through 8 show the same thing but for imported products. Thus, the comprehensive input-output table allows us to trace the use of domestically produced products (rows 1 through 4) separately from the use of imported products (rows 5 through 8). Row 9 is the sum of intermediate use (columns 1 to 4) and final use (columns 5 and 6). Row 10 reports value added by each industry, and row 11 reports the total output of each industry.

Eurostat does not supply comprehensive input-output tables of the type shown in Table 1. However, it does provide all the data that are required to produce such a table.Footnote 4 For almost every EU member country, there is an input-output table containing only domestically produced products, which contains data for \(\mathbf{Z}_{i,j}^{d}\), \(\mathbf{y}_{i}^{d}\), \(\mathbf{e}_{i}^{d}\), \(\mathbf{u}_{i}^{d}\), v j , and x j . Furthermore, there is an ‘import matrix’, which contains data for \(\mathbf{Z}_{i,j}^{m}\), \(\mathbf{y}_{i}^{m}\), \(\mathbf{e}_{i}^{m}\), and \(\mathbf{u}_{i}^{m}\). Thus, with the available data we can produce a comprehensive input-output table. However, this is usually not done. Most input-output modellers prefer working with a symmetric input-output table (SIOT). What complicates matters is the fact that there are different ways of constructing a SIOT. These are discussed in the following section.

3 Variants of the symmetric input-output table

Table 2 reports what we will call the SIOT Variant A.Footnote 5 At the core of the SIOT Variant A is an interindustry transactions matrix Z, which reports the entire intermediate consumption of products (domestically produced and imported). Mathematically, Z=Z d+Z m. Taking column sums of this matrix yields total intermediate consumption by industry, denoted by z. Taking row sums yields total intermediate consumption by product, denoted by r. By definition, summing z over j must yield the same result as summing r over i, so z=r (but z=r will usually not be true; it is possible but extremely unlikely).

Table 2 Symmetric input-output table, variant ‘A’

In each homogeneous branch, intermediate consumption plus value added is equal to output:

$$ \mathbf{x} = \mathbf{z} + \mathbf{v}$$
(4)

The bottom row of the SIOT variant A reports output by industry x j . Total output is the sum of x j over all \(j: x = \sum_{j}^{n} x_{j}\).

Each row can be understood as a representation of the commodity balance. If a country uses more of product i than it produces, it must be a net importer of that product, and vice versa. In other words, net exports of product i must be equal to domestic output minus domestic use of that product. Mathematically:

$$e_{i} - m_{i} = x_{i} - r_{i} - y_{i} $$
(5)

Rearranging terms yields:

$$r_{i} + y_{i} + e_{i} - m_{i} = x_{i} $$
(6)

Going through row i of the SIOT variant A means going through (6). This is why imports are entered with a negative sign in the table.

The symmetry of SIOT variant A is captured by the fact that each column represents the summation procedure resulting in the value of total output for each industry:

$$x_{j} = \sum_{i}^{n} Z_{i,j} + v_{j} $$
(7)

Thus, the final entry in row i(x i ) is equal to final value in row j(x j ) whenever i=j.

Table 3 shows the symmetric input-output table, variant B.Footnote 6 This variant is based on a different way of recording imports. In variant A, imports are allocated by product and the vector m consists of the m i . In variant B, by contrast, imports are allocated by use, i.e. homogeneous branches and final users. \(z_{j}^{m}\) denotes the column sums of matrix Z m. Thus, \(z_{j}^{m}\) is the value of imported products that were used as intermediate inputs by industry j. Accordingly, z m denotes the total value of imports used as intermediate inputs. y m denotes the value of products that were consumed by final users in the country, and e m denotes the value of imported products used for exports (re-exports). m denotes the total value of imports.

Table 3 Symmetric input-output table, variant ‘B’

It is important to realise that z m is very different from m. The latter is a column vector of length n (the number of different products), and element m i is interpreted as “imported products of type i”. The former is a row vector of length n, and element \(z_{j}^{m}\) is interpreted as “products of all types imported for use by industry j”. Moreover, the sum of all elements is not equal—vector m contains all imported products, whereas vector z m contains only those products imported for intermediate use, as imported products for final use are recorded elsewhere.

Table 4 shows how Eurostat currently compiles its symmetric input-output tables, including domestic products and imports according to the ESA 95 guidelines. This variant is called variant ‘E’ for ‘Eurostat’. It is very similar to the type A table discussed above (Table 2); the only difference lies in the treatment of imports. At the core of Variant E is the interindustry transactions matrix Z, as in variant A.

Table 4 Symmetric input-output table, variant ‘E’

For the type E table, an additional vector labelled “total supply” (denoted by s) is introduced. It is defined as the sum of domestic production and imports of similar commodities:

$$ \mathbf{s} = \mathbf{x} + \mathbf{m}^{E}$$
(8)

The row vector m E refers to imports by commodity. m E stands for ‘vector of imports constructed the Eurostat way’. \(m_{j}^{E}\) is the value of imports of commodity j, not the value of products imported by industry j. This is a crucial difference, as we will see below. Mathematically, m E is the transpose of m.

Total final use is defined as the sum of domestic final use and exports:

$$ \mathbf{f} = \mathbf{d} + \mathbf{e}$$
(9)

Total use is equal to the sum of intermediate use (by product) and final use:

$$ \mathbf{u} = \mathbf{r} + \mathbf{f}$$
(10)

Finally, it is true by definition that

$$ \mathbf{s} = \mathbf{u}$$
(11)

Thus, the IOT is symmetric in the sense that s j =u i when i=j.

To sum up, there is not a big difference between variants A and E. Actually, variant A can easily be converted into variant E by simply transposing the import vector and adjusting the column/row totals accordingly (and vice versa). There is, however, a great difference between variants E and B, because the treatment of imported products is very different. The implications of this will be discussed in the following section.

4 Interpretation of coefficients

When input-output tables are used in empirical research, it is common practise to compute input-output coefficients to describe the structure of the economy, to identify supply chains involving various industries, and to provide the basis for model-based impact studies.Footnote 7 Therefore, it is important to understand correctly the interpretation of the input-output coefficients that can be computed from the different tables.

In variant A, input-output coefficients are defined as:

$$a_{i,j}^{A} = \frac{Z_{i,j}}{x_{j}} $$
(12)

These coefficients describe how many units of input i were used/needed to produce one unit of output j. Therefore, they can be interpreted as technological coefficients.

In variant B, by contrast, input-output coefficients are defined as:

$$a_{i,j}^{B} = \frac{Z_{i,j}^{d}}{x_{j}} $$
(13)

These coefficients do not tell us how many units of input i were used to produce one unit of output j, because they refer only to those inputs that were produced domestically. Imported inputs are ignored. To make this point clearer, let us define a trading coefficient t i,j as:

$$ t_{i,j} = \frac{Z_{i,j}^{d}}{Z_{i,j}}$$
(14)

t i,j is the share of input i used by industry j that originates from domestic production. Conversely, (1−t i,j ) can be interpreted as the import share. Using (12), (13) and (14), we can write the relationship between \(a_{i,j}^{A}\) and \(a_{i,j}^{B}\) as:

$$ a_{i,j}^{B} = t_{i,j}a_{i,j}^{A}$$
(15)

Thus, \(a_{i,j}^{B}\) will generally be smaller than \(a_{i,j}^{A}\). The \(a_{i,j}^{B}\) coefficients differ from the true technological coefficients (\(a_{i,j}^{A}\)) owing to international trade. Therefore, they cannot be interpreted as technological coefficients. They are a mixture of technology and trade.

Variant E is interesting because it allows the calculation of two different kinds of coefficient. If we divide the elements of Z by output, we are performing the same calculation as in (12). Mathematically:

$$ a_{i,j}^{E} = \frac{Z_{i,j}}{x_{j}} = a_{i,j}^{A}$$
(16)

Thus, variant E also makes possible the computation of technological coefficients. However, if we divide Z i,j by s j , we get a different kind of coefficient (an input-supply coefficient rather than an input-output coefficient), which we will denote by b i,j :

$$ b_{i,j}^{E} = \frac{Z_{i,j}}{s_{j}} =\frac{Z_{i,j}}{x_{j}}\frac{x_{j}}{s_{j}} =a_{i,j}^{E}\frac{x_{j}}{s_{j}}$$
(17)

Thus, there is a close relationship between \(b_{i,j}^{E}\) and the technological coefficient \(a_{i,j}^{E}\). The factor of proportionality is (x j /s j ). This is the share of total supply of product j that is provided by domestic output x j . If the country does not import product j, we have x j =s j . In this case, the two coefficients coincide. Whenever imports of product j occur, \(b_{i,j}^{E}\) will be smaller than \(a_{i,j}^{E}\). The difference between the two can be interpreted as an indicator of self-sufficiency or import dependence. It is clear, however, that \(b_{i,j}^{E}\) cannot be interpreted as a technological coefficient. The only technological coefficient is \(a_{i,j}^{E}\), which is equal to \(a_{i,j}^{A}\).

5 Implications for regional input-output modellers

The very technical discussion of the previous sections has important implications for regional input-output modellers. The reason is that regional input-output models are often constructed on the basis of regionalisation methods that adjust the national input-output table to regional conditions by applying mechanistic rules. These methods are laid out in the following.Footnote 8

A variety of methods is based on a popular concept of regional science, the location quotient (LQ), which is generally interpreted as an indicator of an industry’s relative over- or underrepresentation within a region (compared with the national average). The LQ is computed by using data that happen to be available. A direct measure of an industry’s relative importance, such as the share of its output in total regional output, would be preferable. However, if such data are not available (at a satisfactory level of disaggregation), researchers often resort to employment data. The LQ of industry j is then computed according to the following formula:

$$ \mathrm{LQ}_{j} = \frac{L_{j}^{R}/\bar{L}^{R}}{L_{j}^{N}/\bar{L}^{N}}$$
(18)

The LQ, as computed by (18), is thus equal to the share of industry j in regional employment divided by the share of industry j at the national level. If LQ j >1 (i.e. the employment share of industry j at the regional level is larger than at the national level), industry j is said to be overrepresented. Conversely, if LQ j <1, industry j is said to be underrepresented. This procedure of describing the economic structure of a region, in comparison with other regions or the national average, has been standard practise in regional science for a long time.

The LQ has found use in the input-output literature as a tool for constructing regional input-output tables when detailed data are not available for the region to be studied. The idea is to regionalise the national input-output table by applying the LQ as correction factor of the A matrix. In the seminal paper by Schaffer and Chu, this is formulated as follows: “A location quotient of less than one means that the region imports some of its needs of output i. A location quotient greater than one means that the region exports some of output i” (Schaffer and Chu 1969, p. 85). Following this line of reasoning, Schaffer and Chu then explain what to do when the LQ is above or below unity: “If LQ i ≥1, we set a i,j =A i,j , where a i,j is the regional production coefficient (defined as x i,j /x j ) and A i,j is the national production coefficient (X i,j /X j )” (Schaffer and Chu 1969, p. 85).

In other words, this procedure boils down to assuming that, if LQ i ≥1 , the “regional production coefficient”, a i,j , is equal to its national counterpart. In the other case, Schaffer and Chu propose the following procedure: “If LQ i <1, local production is assumed to be inadequate to supply local needs—no exports can be made and imports are necessary. The regional production coefficient in row i may now be computed as a i,j =LQ i A i,j ” (Schaffer and Chu 1969, p. 86). Thus, in this case the “regional production coefficient” will be smaller than its national counterpart.

This approach is called the “simple location quotient” (SLQ) method. Since it has a number of shortcomings, various alternatives have been proposed. For a survey, see Miller and Blair (2009, pp. 349–359). However, in this paper, the focus is not on the particular shortcomings of the SLQ method; it is on the structure of the input-output table and the proper interpretation of the interindustry transactions matrix.

It is important to realise that Schaffer and Chu illustrated their discussion by means of an input-output table corresponding to our variant B. In their table, m 1 must be interpreted as “intermediate products imported for use by industry 1”. It must not be interpreted as “imported products of type 1” (that would be the correct interpretation for Variant E tables). Thus, the reasoning behind the SLQ method (and, as remains to be shown, that of all other LQ methods) is based on an input-output table of the SIOT Variant B layout. The purpose of the present paper is to show that this has important implications for those who work with other variants of the SIOT.

In the context of a SIOT Variant B table, the reasoning behind the LQ methods is perfectly valid. As an empirical example, let us consider the case of electricity generation from coal in the city-state of Hamburg (one of the sixteen federal states that make up the Federal Republic of Germany). If somebody wanted to construct a Type B RIOT for Hamburg by means of the SLQ method, he would compute LQCOAL using employment data for Hamburg and find that LQCOAL=0 since there are no coal mines in the region. The standard SLQ method calls for setting all entries in the corresponding row of the regional A matrix equal to zero. This is perfectly reasonable because A B, after all, refers only to regionally produced commodities.

However, in the context of a SIOT Variant A or E, this is not the case. A “naive” application of the SLQ method would mean that all entries in the row corresponding to coal would contain only zeroes. The problem here is that the matrix A E, in contrast to A B, refers not only to regionally produced commodities; it refers to the intermediate use of products, including imported products. Setting the first row of A E equal to zero would mean that none of the regional industries used any coal whatsoever. That is not a sensible result, because there is in fact a large coal-fired power plant in Hamburg.Footnote 9 This naive approach might lead to the erroneous conclusion that changes in final demand have no impact on coal consumption (and this in turn might lead to an underestimation of the environmental damage caused by the change in final demand).

Furthermore, a distinction must be made between the matrix of technical input-output coefficients (A A or A E) and the matrix of input-supply coefficients (B E). The column labelled “coal” in matrix B E should contain only zeroes, because the coal industry, being absent in Hamburg, does not use any inputs, so all inputs are equal zero, but supply is larger than zero due to imports. The SLQ procedure does not ensure that the “coal” column of the B E matrix contains only zeroes. Thus, we must conclude that the SLQ method is not well suited to regionalising a SIOT of Variant A or E.

What about other LQ methods? Let us consider the CILQ method, which is based on the following formula (again taken from Schaffer and Chu):

$$ \mathrm{CILQ}_{i,j} = \frac{x_{i}/X_{i}}{x_{j}/X_{j}}$$
(19)

where x i (x j ) is regional output of industry i(j) and X i (X j ) is national output of industry i(j). In words: CILQ i,j is equal to the region’s share in total output of industry i divided by the region’s share in total output of industry j. Compared with the SLQ, the CILQ has the advantage that it does not only consider the relative size of industry i but also sets this in relation with industry j. However, this does not make it immune to the problem identified above. Once again, if industry i is coal mining, and coal mining does not exist in the region to be studied, all cells in row i will be equal to zero. This is perfectly reasonable for SIOT Variant B, but it does not make sense for SIOT Variant A or E. Thus, the CILQ is subject to the same limitation as the SLQ—it is not applicable to tables of Variant A or E.

What, then, can be done if a regional SIOT of Variant A or E is required? Here it is argued that, for these SIOT variants, the preferable regionalisation method should be based on the commodity balance (CB) approach (also known as supply-demand pool approach). This approach is based on the following equation, which is true by definition:

$$ x_{i} + m_{i}^{E} = r_{i} + f_{i}$$
(20)

Equation (10) states that the sum of regional production and imports (the supply pool) must be equal to the sum of intermediate use and final use (the demand pool). Final use, in turn, can be decomposed into regional final use (regional consumption and capital formation) and exports, which yields:

$$ \mathbf{x} + \mathbf{m} = \mathbf{r} + \mathbf{d} + \mathbf{e}$$
(21)

Note that the vector m in (21) is the column vector of imports by product as displayed in the SIOT Variant A (Table 2). Its transpose is the row vector m E, as displayed in the SIOT Variant E (Table 4). It is very different from the vector of imported intermediate products z m, as displayed in the SIOT Variant B (Table 3).

It is assumed that regional output x i and regional final use d i can somehow be estimated or measured. The next task is to estimate r i . In order to do this, the “equal technology assumption” is invoked. This means that each industry in the region is assumed to operate with the same technological coefficients as at the national level. The technological coefficients can be calculated from the national input-output table according to (12) or (16).Footnote 10 Then, the same equations can be used to compute the Z matrix for the RIOT. Taking the row sum of Z yields the vector of intermediate use r. Thus, the remaining task is to compute estimates for e i and m i . To do this, the trade balance b i has to be computed. It is defined as:

$$ \mathbf{b} = \mathbf{e} - \mathbf{m}$$
(22)

Solving (22) for e or m and substituting it into (21) yields:

$$ \mathbf{b} = \mathbf{x} - \mathbf{r} - \mathbf{d}$$
(23)

Since data or estimates of x, r, and d are available, b can be computed from (23). However, it is not possible to compute actual exports and imports; (23) allows us only to compute the commodity balance. If we want to compute actual exports and imports, additional assumptions are required. For SIOT Variant A, this step is not very important because the columns labelled “exports” and “imports” may simply be subsumed by a column labelled “net exports”. This is a way of evading the problem rather than solving it. If the goal is to construct a SIOT Variant E, the problem cannot be circumvented.

A very simple solution that has often been applied relies on the assumption that, for each product, the region is either import-dependent or not. That is, whenever the trade balance is positive, imports are assumed to be zero, and if the trade balance is negative, exports are assumed to be zero. This assumption rules out the possibility of cross-hauling (the simultaneous exporting and importing of similar products) and has been heavily criticised (Boomsma and Oosterhaven 1992; Richardson 1985). A more advanced treatment is possible with the Cross-Hauling Adjusted Regionalisation Method (CHARM), which estimates the amount of cross-hauling based on the heterogeneity of products (Kronenberg 2009, 2010). Either way, some estimate of e and m will be produced, and a SIOT Variant E can be constructed.

The CB method solves the problem outlined above, as can be seen with respect to the coal mining example. By multiplying the technological coefficients derived from the national IOT with the regional production vector, the matrix Z will be correctly estimated (subject to the drawbacks of the “equal technology assumption”, of course). That is, column 1 of the matrix Z will contain only zeroes, because industry 1 (coal mining) does not exist and hence does not use any intermediate inputs. The entries in row 1, by contrast, will not be forced to equal zero, so the coal use of other industries is correctly respected. Given that regional production of coal is zero, the application of (22) will yield a negative trade balance for coal, and this will be reflected in the first element of the import vector m. In a Variant A table, the required imports of coal will be recorded in row 1, column “imports”, and in the Variant E table they will be recorded in row “imports”, column 1. All of this is absolutely correct.

On the other hand, neither the CB method nor the CHARM extension should be applied to SIOT Variant B tables. Aside from producing false results, there is also a logical contradiction. Setting up a commodity balance is possible only if the vector m is known. In a SIOT Variant B there is no such vector, therefore it is not possible to apply a CB method to such a table without using additional information.

6 Concluding remarks

The present paper shows that LQ methods can be applied to national input-output tables of the SIOT Variant B type, but they should not be applied to tables of Variant A or E. Conversely, CB methods, including CHARM, can be applied to Variant A or E tables, but not to type B tables. Applying LQ or CB methods to tables for which they are not suitable may result in misleading and implausible results.

The discussion of the previous sections is based purely on definitions and logical considerations. However, the following implications of the differences between the SIOT variants for regional input-output models have not yet been fully acknowledged in the literature on regional input-output modelling. The reason for this might be that much of that literature was written by authors based in the United States, where SIOT Variant B appears to be much more common than the other variants. Being used to working with this particular version of the SIOT, these authors did not devote much attention to the possible complications of working with other SIOT variants. This explains why neither Schaffer and Chu (1969) nor Miller and Blair (2009) discuss this problem.

Furthermore, in regional economics, it often makes sense to use the SIOT Variant B, as researchers are mostly interested in the effects of a certain final demand impulse on output, value added, and employment within the region of study. These questions coincide with the concerns of regional policy makers. When they decide, for example, on a particular investment project, they want to know how many jobs will be generated in the region for which they are responsible. However, input-output models are increasingly being used in the fields of environmental and ecological economics (Los 2011). Researchers from those fields are mostly interested in energy use, material consumption, greenhouse gas emissions and so on. For such research topics, SIOT Variant A or E is more useful. If we are concerned about climate change, we want to know the impact of final demand on, for example, electricity consumption, because electricity production is closely associated with greenhouse gas emissions. It does not matter much whether the electricity is produced in region A or region B (unless the electricity generation mix differs significantly). Therefore, we want to know how much electricity is actually used, and not whether it comes from domestic production or import. This is why ecological economists tend to prefer the allocation of imports according to Variant E. In the past, most of these studies have been undertaken for national economies. More recently, however, there seems to be a growing interest in sustainable development and environmental policy at the regional level. As more and more researchers conduct environmental impact studies for individual regions, they will need to construct regional input-output models. Therefore, they need to be aware of the complications that stem from the different allocation of imports.