1 Introduction

Internal migration and commuting are important from several perspectives. Low internal mobility may hamper economic growth at the national level, as human resources are not optimally allocated. Norway has pursued a policy aimed at reducing out-migration from the districts, which is believed to have contributed to limiting the mobility. Another factor is a possible mismatch in the balance between in- and out-migration. In many parts of the country, there is negative population growth in both the short and the long run due to high out-migration and a low number of births (Leknes and Løkken 2020).Footnote 1 This makes it difficult to maintain public services and raises concerns over the sustainability of certain communities in the long run. Several municipalities run campaigns to motivate young persons to move back after finishing their education, but the effects of such efforts are often considered to be limited (NOU 2020: 15).

Commuting and internal migration are closely intertwined. Opportunities for commuting allow one to settle in less central areas without this being at the expense of opportunities to obtain satisfactory work. Reasons for choosing to settle in less central areas may, for example, be high housing prices in central areas or a preference for living in less densely populated areas. In general, more dynamic locations are expected to have higher in-commuting and in-migration rates. In addition, larger cities tend to yield higher rewards to human capital than rural areas (Ahlin et al. 2014). For out-migration/commuting the relationship is the reverse.

Internal migration and commuting constitute an old and well-established research area that has been approached from many angles. Sjaastad (1962) considers migration from the perspective of resource allocation. Over time, some industries expand while others experience downturns, and this gives rise to increased wage differentials across industries/occupations. Due to differences in business composition in different parts of the country, this change generates incentives for internal migration. By collecting information on the costs and returns of migration, it is in principle possible to assess whether the reallocations are sufficient to yield efficient allocation of labor.

We follow another common line of research which points to the relationship between economic incentives and geographic mobility; see for instance Greenwood (1975, 1985) and Mitze and Reinkowski (2011). However, the results from previous studies of economic driving forces have been somewhat inconsistent. For instance, Cebula (2005) considered gross state-level in-migration in the USA and found that per capita income had an effect, but unemployment rate differentials did not. Mitze and Dall Schmidt (2015), using Danish data, found mixed evidence of traditional incentive variables. Meanwhile, Piras (2012), using a cointegration framework on Italian data, found results with the expected signs for regional per capita GDP and unemployment rates, although the results varied somewhat with the different empirical specifications applied.

From a theoretical point of view, one should take into account that decisions to migrate and commute are taken simultaneously in order to maximize some target function, i.e., a utility function. The groundbreaking contribution by Monte et al. (2018), who consider the USA, provides a guideline to how this can be done within a general equilibrium context. To simplify model specifications and estimations, the empirical analysis in this article does not explicitly consider the close connection between migration and commuting.

Our analysis employs data for 89 Norwegian economic regions. The observation period is 2001–2014 and the individuals are between 20 and 64 years of age. Our main attention is on two incentive variables: the unemployment rate and the real hourly wage. In Norway, a relatively high fraction of workers are members of trade unions and economic policy is aimed at reducing geographic wage differences. Compared to other countries, the wage distribution has traditionally been relatively narrow, but over time the wage gap has widened.

This article makes two contributions. First, we allow gross migration and commuting flows to vary across four different educational groups. Thus, we allow responses to changes in relative real hourly wages and relative unemployment rates to differ across educational groups from the outset.Footnote 2 Many studies on internal migration illuminate the relationship between human capital and internal migration, even though human capital is not always the main issue addressed. However, many of them operate with a less detailed classification than that employed here. Examples include Devillanova (2004), Piras (2012), Clemente et al. (2013), Korpi and Clark (2017) and Epplesheimer and Möller (2019).

Second, we employ seemingly unrelated regression (SUR) models for panel data. The main reason for estimating a set of equations simultaneously is related to estimation efficiency. Faced with estimates that have the expected sign but low significance, valid parametric constraints applied across the equations produce more efficient estimates.

Our model specification differs from those of, for instance, Liu (2018) and Poghosyan (2018), in that we model not bilateral flows, but aggregate flows. That is, we consider the total inflows and outflows from the economic regions. An advantage of doing this is that we can use far more disaggregated data. In a setting with 89 observational units, a substantial number of observation pairs are characterized by zeros, for either one or both the observational units. This constitutes an issue when the models are specified in terms of log-transformed variables. Consistent with our choice, for each economic region we operate with ‘outside’ variables which are weighted averages of the values of the variables outside the economic region at hand. We use population shares for those aged 20–64 years as weights.

The group comprised of those with a low education appears to be most responsive. An increase in an economic region’s relative wage rate leads to higher in-migration and lower out-migration for individuals with low education. Furthermore, an increase in an economic region’s relative unemployment rate leads to lower in-migration whereas out-migration is left unaffected for individuals with this type of education. Besides, an increase in the relative unemployment rate leads to a significant reduction in in-commuting for this group.

It can be argued that our measure of ‘outside’ variables does not reflect the potential outcome of moving from one region to another if hourly wages are positively correlated with (labor market) populations size. To check this assumption, we carry out a robustness analysis in which we only use a part of the data focusing on internal migration and commuting from small to large economic regions.

The rest of the article is organized as follows: In Sect. 2, we specify our econometric models and make various assumptions. Section 3 provides information on the data. In Sect. 4, we present and discuss our empirical results both for the specification of the main models and for those involved in the robustness analysis. We conclude in Sect. 5. Some technical documentation may be found in the appendices.

2 Model specification

A typical equation for internal migration may be written as

$$\begin{gathered} \log (MIG_{it}^{{{\textit{de}}}} ) = \mu_{i}^{{M{\textit{de}}}} + \lambda_{t}^{{M{\textit{de}}}} + \beta^{{M{\textit{de}}}} \log ({\textit{POP}}_{{{\textit{it}}}}^{e} ) + \gamma^{{M{\textit{de}}}} \times [\log ({\textit{UR}}_{i,t - 1}^{e} ) - \log ({\textit{UROTH}}_{{_{i,t - 1} }}^{e} )] + \hfill \\ \theta^{{M{\textit{de}}}} \times [\log ({\textit{RHWAGE}}_{i,t - 1}^{e} ) - \log ({\textit{RHWAGEOTH}}_{{_{i,t - 1} }}^{e} )] + \hfill \\ \eta^{{M{\textit{de}}}} \times [\log ({\textit{EMPSHARE}}_{i,t - 1}^{e} ) - \log ({\textit{EMPSHAREOTH}}_{i,t - 1}^{e} )] + \varepsilon_{{{\textit{it}}}}^{{M{\textit{de}}}} . \hfill \\ \end{gathered}$$
(1)

The left-hand variable, \(\text{log} ({\it{MIG}}_{{{{it}}}}^{{{{de}}}} )\), is the log of an internal migration flow for region i in period t for an educational group. The superscript d = {IN, OUT} indicates inflow or outflow and the superscript e = {LOW, VOC, SUS, HIGH} indicates educational category.Footnote 3 The superscript M, occurring on the right-hand side of Eq. (1), stands for migration (in contrast to commuting). The observed variables on the right-hand side of (1) are the log of the population with education type e at the beginning of year t,\(\log ({\textit{POP}}_{{{\textit{it}}}}^{e} ),\) the relative difference in the unemployment rate between region i and other regions among those belonging to education category e in period t-1, \(\log ({\textit{UR}}_{t - 1}^{e} ) - \log ({\textit{UROTH}}_{{_{i,t - 1} }}^{e} ),\) the relative difference in the average real hourly wageFootnote 4 in region i and other regions among those belonging to education category e in period t−1, \(\log ({\textit{RHWAGE}}_{i,t - 1}^{e} ) - \log ({\textit{RHWAGEOTH}}_{{_{i,t - 1} }}^{e} )\) and the relative difference in the employment share in region i and in other regions among those belonging to education category e in period t−1, \(\log ({\textit{EMPSHARE}}_{i,t - 1}^{e} ) - \log ({\textit{EMPSHAREOTH}}_{i,t - 1}^{e} ).\) Altogether, (1) contains 8 equations, combining internal inflows and outflows of migrants with four different types of education. The eight equations in (1) are estimated as a set of regression equations. The contemporaneous error terms are assumed to be correlated, but free from autocorrelation and heteroskedasticity. The parameters in this model are estimated by iterative SUR estimation until convergence, which under normality assumptions yields maximum likelihood estimates.

When analyzing commuting patterns, we apply the same explanatory variables as for internal migration rates. We now have the following equations:

$$\begin{aligned} \log ({\textit{COM}}_{{{\textit{it}}}}^{{{\textit{de}}}} ) = & \,\mu_{i}^{{C{\textit{de}}}} + \lambda_{t}^{{C{\textit{de}}}} + \beta^{{C{\textit{de}}}} \log ({\textit{POP}}_{{{\textit{it}}}}^{e} ) + \gamma^{{C{\text{de}}}} \times [\log ({\textit{UR}}_{i,t - 1}^{e} ) - \log ({\textit{UROTH}}_{{_{i,t - 1} }}^{e} )] \\ & + \theta^{{C{\textit{de}}}} \times [\log ({\textit{RHWAGE}}_{i,t - 1}^{e} ) - \log ({\textit{RHWAGEOTH}}_{{_{i,t - 1} }}^{e} )] \\ & + \eta^{{C{\textit{de}}}} \times [\log ({\textit{EMPSHARE}}_{i,t - 1}^{e} ) - \log ({\textit{EMPSHAREOTH}}_{i,t - 1}^{e} )] + \varepsilon_{{{\textit{it}}}}^{{C{\it{de}}}} . \\ \end{aligned}$$
(2)

The superscript C stands for commuting. The left-hand variable is the log of a commuting flow related to region i in period t for a specific educational group.

3 Data

See Table 11 in Appendix A for an overview of definition of variables that one encounters in the main part of the article.

We combine data from different sources for the years 2001–2014. The stock and flow population data are taken from the Population Statistics of Statistics Norway. The population stocks relate to where individuals reside at the beginning of the year and cover persons in the age interval 20–64 years. From the employment statistics, we have information on the labor market status of each person and where workers have their main job. A third data source is Statistics Norway’s education statistics (NUDB). These statistics provide data on the highest education achieved by the individuals in terms of duration and type. Appendix Table 13 provides an exact definition of the educational groupings we employ according to the Norwegian Standard Classification of Education; see Barrabés and Østli (2016). Data have been aggregated from municipal to regional level; see Appendix Table 12 for an overview of the regions.Footnote 6 We utilize this information to calculate time series for educational-specific unemployment rates for the economic regions. Utilizing data for those individuals who are employees, we can calculate time series of mean hourly wage rates for the different educational categories in the economic region. The mean hourly wages used are those of individuals who are full-time workers. To calculate real hourly wages, we deflate by the consumer price index. Combining information from the population statistics, employment statistics and educational statistics, we also calculate educational-specific time series for employment shares at the level of the economic regions. This share is defined as the ratio between the number of employed individuals with a specific education to the total number of individuals with this type of education in the age interval 20–64 years.

In our empirical analysis, we consider relative real hourly wages, relative unemployment rates and relative employment shares. By relative real hourly wage, we mean real hourly wage compared to a mean real wage level in economic regions other than the one being considered. We define the mean real hourly wage outside economic region i for a specific educational group as

$${\textit{RHWAGEOTH}}_{{{\textit{it}}}}^{e} = \frac{{\sum\nolimits_{{j \in R_{i} }}^{{}} {{\textit{POP}}_{{{\textit{jt}}}}^{e} \times {\textit{RHWAGE}}_{{{\textit{jt}}}}^{e} } }}{{\sum\nolimits_{{j \in R_{i} }} {{\textit{POP}}_{{{\textit{jt}}}}^{e} } }},\,\,i \in R;\,e = \{ {\textit{LOW}},{\kern 1pt} \,{\textit{VOC}},\,{\textit{SUS}},\;{\textit{HIGH}}\} ,$$
(3)

where R is the set containing all the (89) observational units and \(R_{i}\) is a subset of R consisting of all the regions except region i (88).

This means that the ‘outside’ real hourly wage of a specific educational group is a weighted (time-varying) mean of the real hourly wage outside the economic region, where weights are based on the size of the (target) population with the selected type of education. We use similar formulae for the unemployment rates and the employment shares. They are given by

$${\textit{UROTH}}_{{{\textit{it}}}}^{e} = \frac{{\sum\nolimits_{{j \in R_{i} }}^{{}} {{\textit{POP}}_{{{\textit{jt}}}}^{e} \times {\textit{UR}}_{{{\textit{jt}}}}^{e} } }}{{\sum\nolimits_{{j \in R_{i} }} {{\textit{POP}}_{{{\textit{jt}}}}^{e} } }},i \in R;\,\,e = \{ {\textit{LOW}},{\textit{VOC}},{\textit{SUS}},{\textit{HIGH}}\}$$
(4)

and

$${\textit{EMPSHAREOTH}}_{{{\textit{it}}}}^{e} = \frac{{\sum\nolimits_{{j \in R_{i} }}^{{}} {{\textit{POP}}_{{{\textit{jt}}}}^{e} \times {\textit{EMPSHAREOTH}}_{{{\textit{jt}}}}^{e} } }}{{\sum\nolimits_{{j \in R_{i} }} {{\textit{POP}}_{{{\textit{jt}}}}^{e} } }},i \in R;\,\,e = \{ {\textit{LOW}},{\textit{VOC}},{\textit{SUS}},{\textit{HIGH}}\} .$$
(5)

In Appendix D, we report measures related to the variables real hourly wage and unemployment rate. First, we calculate annual coefficients of variation to see whether the cross-sectional dispersion has changed over time. These are reported in Tables 14 and 15. The main impression is that spread changes moderately over time. With respect to the real hourly wage variable for the group with high education, there was a slight increase in dispersion in the first half of the observation period, but then it flattened out. There is a weak downward trend in the dispersion of the unemployment rates. This is seen for all educational groups, but the pattern is most pronounced in the three groups vocational, study-oriented upper secondary and high education. We sort the cross-sectional data for each of the years and give each economic region a ranking number. We can then calculate the empirical correlation between the rank variables for different years. This yields information on the stability of an economic region’s place in the distribution. Tables 16 and 17 portray estimates for those with low education, Tables 18 and 19 for those with vocational education, Tables 20 and 21 for those with study-oriented upper secondary education, whereas Tables 22 and 23 show estimates for those with high education. When Table 16 is compared with Table 17, Table 18 with Table 19, Table 20 with Table 21 and Table 22 with Table 23, we see that the ranking correlations are higher for the real hourly wage variable than for the unemployment rates for all educational groups. The correlations tend to decrease with the distance in years. Thus, there is some convergence between the economic regions over time.

4 Empirical results

When it comes to the unknown parameters in Eqs. (1) and (2), we expect the population parameters to enter with a positive sign. A larger observational unit population should generate higher inflows as well as higher outflows. We expect the relative real wage to enter positively for in-migration and in-commuting and negatively for out-migration and out-commuting. The relative unemployment rate is expected to enter with a negative sign for in-migration and in-commuting. For reasons of symmetry, we expect the relative unemployment rate to enter with a positive sign in connection with out-migration and out-commuting. Finally, we expect the employment share to enter with a positive sign for in-migration and in-commuting and with a negative sign for out-migration and out-commuting. A high employment share in an economic region may signal that the labor market functions well and the expected signs related to the employment share variable are in accordance with such a view. Recall that all observed variables in the models except the regional and annual dummies are log-transformed.

5 Results for internal migration

For (internal) migration, we consider a set of regressions consisting of the eight response variables

$$\log (MIG_{it}^{IN,\,\,LOW} ),\log ({\textit{MIG}}_{{{\textit{it}}}}^{{{\textit{IN}},\,\,{it{VOC}}}} ),\log ({\textit{MIG}}_{{{\textit{it}}}}^{{{\textit{IN}},\,\,{\textit{SUS}}}} ),\log ({\textit{MIG}}_{{{\textit{it}}}}^{{{\textit{IN}},\,\,{\textit{HIGH}}}} ),\log ({\textit{MIG}}_{{{\textit{it}}}}^{{{\textit{OUT}},\,\,{\textit{LOW}}}} ),$$

\(\log ({\textit{MIG}}_{{{\textit{it}}}}^{{{\textit{OUT}},{\textit{VOC}}}} ),\log ({\textit{MIG}}_{{{\textit{it}}}}^{{{\textit{OUT}},{\textit{SUS}}}} )\) and \(\log ({\textit{MIG}}_{{{\textit{it}}}}^{{{\textit{OUT}},{\textit{HIGH}}}} ).\)

The equation for each of these variables is given by (1). The assumptions with respect to the error terms have been stated above in a qualitative way but are given a formal treatment here. Let

$$\begin{aligned} \varepsilon_{{{\text{it}}}}^{M} & = \{ \varepsilon_{{{\textit{it}}}}^{{M,\,\,{\textit{IN}},\,\,{\textit{LOW}}}} ,\,\varepsilon_{{{\textit{it}}}}^{{M,\,\,{\textit{IN}},\,\,{\textit{VOC}}}} ,\varepsilon_{{{\textit{it}}}}^{{M,\,\,{\textit{IN}},\,\,{\textit{SUS}}}} ,\varepsilon_{{{\textit{it}}}}^{{M,\,\,{\textit{IN}},\,\,{\textit{HIGH}}}} , \hfill \\ &\qquad \qquad \varepsilon_{{{\textit{it}}}}^{{M,\,\,{\textit{OUT}},\,\,{\textit{LOW}}}} ,\,\varepsilon_{{{\textit{it}}}}^{{M,\,\,{\textit{OUT}},\,\,{\textit{VOC}}}} ,\varepsilon_{{{\textit{it}}}}^{{M,\,\,{\textit{OUT}},\,\,{\textit{SUS}}}} ,\varepsilon_{{{\textit{it}}}}^{{M,\,\,{\textit{OUT}},\,\,{\textit{HIGH}}}} \}^{/} . \hfill \\ \end{aligned}$$
(6)

We assume \(\varepsilon_{it}^{M} \sim {\text{NIID}}(\underline{0} ,\Omega^{M} ),\) where \(\underline{0}\) is an 8 × 1 vector with only zeros and \(\Omega^{M}\) is a symmetric and positive definite but otherwise unrestricted covariance matrix containing 36 s-order moments of the errors.

In view of the preliminary estimation results, we constrained some parameters to zero. The estimation results for a restricted case are reported in Table 1. Table 2 reports the standard error of regression (SER) for each of the estimated equations and Table 3 reports the estimation results in qualitative form. Table 1 is ordered such that the results for the four inflow equations are reported first followed by the results for the four outflow equations. We comment on the estimation results for the inflow equations first. According to the results in Table 1, there is a weak relative unemployment effect for the groups with low and vocational education, but the effects are not statistically significant at 5 percent significance level. For the two remaining groups, the unemployment effect has been constrained to zero because of the low significance of the estimates. When it comes to relative real wage, there is a significant and positive effect on those with study-oriented upper secondary education. There is also a weak positive, but insignificant (at the 5 percent level), effect for the group with low education. For the two remaining groups, the effect has been set to zero because of low significance in preliminary estimation rounds. For all the educational groups, we have set the effect of the employment share to zero because of low significance in preliminary estimation rounds.

Table 1 Internal migration. Iterative SUR-estimates
Table 2 Standard errors of regression from a system of eight equations. Internal migration
Table 3 Qualitative summary of results. Internal migrationa

Let us next turn to the outflow equations. Here, a positive effect of the relative unemployment rate is found for those with study-oriented upper secondary education. The estimate is almost significant at the 5 percent test level. For the three other groups, the effect is set to zero. The coefficients of the relative real wage variables are constrained to be the same for all groups except for those with study-oriented upper secondary education, where it is constrained to be zero. The common parameter estimate is negative with a significance probability of about 7.5 percent. The relative employment share enters significantly and negatively only for those with low education. The population size variable enters significantly in all eight equations with a value centering around to unity. As seen from Table 2, the standard errors of regression vary from 0.096 to 0.173.Footnote 7

6 Results for commuting

As for internal migration, we estimated a SUR specification for commuting workers. Again, there are eight equations. The first four equations are for inflow of commuting workers with different educational backgrounds, while the last four are for outflow of commuting workers with different educational backgrounds. The empirical results are reported in Table 4. Table 5 reports the standard error of regression for each equation and Table 6 provides qualitative estimation results. From the results in Table 4, we see that the relative real wage only plays a role for inflow commuting for those with low education. The relative unemployment rate variable enters significantly with the expected negative sign for those with low and vocational education. The estimate is largest (in absolute value) for those with low education with a point estimate of -0.07 and t-value of about 2.6 in absolute value. For those with study-oriented upper secondary education, we also report a negative estimate, but here the t-value is only around 1 in absolute value. For those with high education, the effect is set to zero. The relative employment share variable enters with a positive estimate for those with low and vocational education. They are both significant at the 5 percent level. For those with study-oriented upper secondary and high education, the effects are set to zero.

Table 4 Iterative SUR-estimates in a set of regression models for commuting
Table 5 Standard errors of regression from a system of eight equations. Commuting
Table 6 Summary of results, qualitative. Commutinga

Lastly, we turn to the results for outflow of commuting workers. For those with low and vocational education, we do not find any effect for any of the three incentive variables. For those with study-oriented upper secondary and high education, we obtain negative estimates of the effects related to the relative wage variable. The point estimate is rather equal for the two groups, but the significance is most pronounced for those with high education. For this group, the estimate is significant at the 5 percent test level, whereas the significance probability is slightly above 5 percent for those with study-oriented upper secondary education. The relative unemployment rate only plays a role for those with study-oriented upper secondary education. However, the positive estimate only has a t-value around 1.4. The relative employment share enters with a negative estimate for those with study-oriented upper and high education. The estimated effect is significant at the 5 percent significance level for those with high education whereas the significance probability for those with study-oriented secondary upper education is about 10 percent.

7 A robustness analysis

So far, our measure of ‘outside’ variables has been based on data for all the 88 outside regions. If there is a positive correlation between population size and hourly wages at the regional level, and/or a negative correlation between population size and the unemployment rate, the potential benefit from moving from one region to another depends on the population size in the two regions. In this case, our measure of ‘outside’ variables might not fully capture this benefit. To check if our estimation results are robust to this hypothesis, we now split the 89 economic regions into two main groups for economic regions according to population size, main regional group 1 and 2. The most populous economic regions are in group 1, whereas the less populous economic regions are in group 2. Assuming the division is the same for all years, the number of economic regions in main groups 1 and 2 is 32 and 57, respectively.Footnote 8 We estimate four sets of regressions each containing 4 equations, corresponding to the four education categories. Two of the sets are related to internal migration and two are related to commuting. With respect to internal migration we look at (i) in-migration to group 1 from group 2 and (ii) out-migration from the small economic regions, i.e., those in main group 2, to main group 1. Thus, the former analysis is based on 32 observational units and the latter is based on 57 observational units. We conduct similar analyses for commuting.

The alternative specification calls for new specifications of what we have termed the ‘outside’ variables. For instance, for observational units in main regional group 2, the outside real hourly wage is now given as a weighted mean of real hourly wages in main regional group 1. The (time-varying) weight for an economic region in main regional group 1, is its population divided by the total population aged 20–64 years in main group 1. The same weights are utilized for the two other incentive variables, i.e., the unemployment rate and the employment share. The formulae are provided explicitly in Appendix E. The same is the case for the specification of the four sets of empirical equations to be estimated.

The detailed estimation results are provided in Appendix E. Before we compare the estimation results for the robustness analysis and the results for the main model, we notice that according to the results of the robustness analysis real wages does not seem to matter very much for internal (in- and out-) migration and commuting. One exception might be for persons with low educational achievement, a group with many immigrants.

The reason why wage income might matter less for internal migration in Norway than what is found in many other developed countries is due to more narrow wage differences. This is partly because of a relatively strong position of trade unions in the system of wage formation and in addition the relatively high share of employment in public sectors. That low educated people show relatively high migration response on wage incomes is partly due to high migration probabilities from peripheral to central regions among immigrants with refugee background (see e.g., Kornstad et al. 2017). The relatively low response on migration of wage incomes among high educated people must be seen in context with the fact that many high educated are already registered settled in the main regions when they finish their higher graduation. Many students involved in higher education register their in-migration to central regions before they fulfill their education, thus entering these regions with low or mostly intermediate education level.

To compare the estimation results from the robustness analysis and the estimation results in the main model we include four tables, which extract information from other tables in the article. In Tables 7 and 8, we compare, respectively, estimates of parameters related to internal in- and out-migration, whereas we in Tables 9 and 10 provide a corresponding comparison in conjunction with commuting. One should recall that whereas the estimates in the main specification are based on a system of eight regression equations for either internal migration or commuting, the estimates in the alternative specification are based on four equations since one either focus on in-migration from small to large economic regions or out-migration from small to large economic regions.

Table 7 Effects of incentive variables on in-migration according to the main and the alternative specification
Table 8 Effects of incentive variables on out-migration according to the main and the alternative model specification
Table 9 Effects of incentive variables on in-commuting according to the main and the alternative specification
Table 10 Effects of incentive variables on out-commuting according to the main and the alternative model specification

In Table 7, we compare estimates related to parameters in in-migration equations. Using the alternative specification, we are not able to obtain a positive estimate of the relative wage on internal in-migration. In fact, for the groups with low and high education the estimates are negative, although not significantly so at the 5 percent significance level. In the main specification, the relative wage enters positively for those with low education, and the estimate is almost significant at the 5 percent significance level. For those with study-oriented upper secondary education, there is a significantly positive estimate of the relative wage variable in the main specification, whereas the effect is set to zero a priori in the new specification because of low significance in a preliminary estimation round. In contrast, the estimated effects of the relative unemployment variable are much stronger and more significant in the alternative than in the main specification. Whereas the effect of the relative unemployment variable is set to zero for both those with study-oriented upper secondary and high education in the main specification, there is a negative significant estimate consistent with our a priori assumptions in the alternative specification. When it comes to the relative employment share variable all the effects are set to zero both in the main and the alternative specification, because of low significance of this variable in a preliminary estimation round.

With respect to Table 8, we notice that in the alternative specification we have somewhat stronger effects of the relative wage variable on out-migration for those with low education. For the other three education categories, the effects are small and insignificant even though they are of the correct sign. They resemble to a large extent what was found for the main specification. The estimated effect of the relative unemployment rate for the alternative model is now significant for those with low education, whereas the effect was set to 0 in the main specification because the preliminary estimate had low significance. For the three other education categories the estimated effect is positive, but with a t-value just above 1. Thus, for those with study-oriented upper secondary education the estimated effect of the relative unemployment variable now turns out to be weaker than for the main specification, where it is almost significant at the 5 percent significance level. For the two remaining education categories, the effect of the relative unemployment rate is set to 0 in the main specification. When it comes to the relative employment share, we do not find a significant effect for any of the education categories using the alternative specification, whereas a significant negative estimate for those with low education is found using the main specification.

In this context, it might be important with a brief description of the Norwegian regional economic development during the period of investigation. Before the financial crisis in 2008, domestic movements went mainly toward the regions with the most vacant jobs, but this has changed after the financial crisis. International migration has in many ways compensated for the fact that internal population movements have followed more traditional paths, with net out-migration from several regions where employment has increased due to very high petroleum investments, especially in the western and south-western parts of Norway, but partly also in northern Norway (see e.g., Stambøl, 2017).

Next, we look at the results for in-commuting in Table 9. We find a positive and significant effect of the relative wage variable for those with low education using the alternative specification. The effect is also positive for the main model specification, but the estimate in the alternative model specification is three times as large as in the main model specification. For the three remaining groups all the effects are set to zero in the main model. The same is true for those with vocational education when considering the alternative model specification. Unfortunately, significant negative estimates are obtained for those with study-oriented upper secondary and high education. This is contrary to what was expected.

The relative unemployment rate seems, generally, to enter stronger using the alternative model specification: For those with low and vocational education the estimates are larger in absolute value for the alternative model specification compared to the main model specification. In the main model specification, the effect of the relative unemployment rate for those with high education is set to zero. A negative estimate is obtained for the alternative model specification. Though, it is not significant at the 5 percent significance level. For those with study-oriented upper secondary education, the effect is set to zero for the alternative model specification since the relative unemployment rate entered with little significance in a preliminary estimation round. For the main model specification, we obtain a negative estimate, but the t-value is only close to −1.

The estimates of the effect of the relative employment share variable also differ between the two model specifications. For those with low and vocational education, we find no effects using the alternative specification, whereas positive and significant estimates are obtained using the main specification. In the main model specification, the effect for those with study-oriented upper education is set to zero. Using the alternative model specification, the estimate is positive and almost significant at the 5 percent significance level. For those with high education, the effect is set to zero for both model specifications.

Turning now to out-commuting, we are, as seen from Table 10, unable to find any significant effects for the relative wage variable using the alternative specification. In contrast, the variable entered with a significant negative effect for those with high education and almost a significant effect for those with study-oriented upper secondary education using the main model specification.

The estimated effects of the relative unemployment rate also differ somewhat. Using the alternative model, we have a positive and significant effect for those with low education whereas the effect is zero according to the main model. For those with vocational education, we have wrong sign for the alternative model, but the estimate is not significant at the 5 percent significance level. For the two remaining education groups, we find no effect of the relative unemployment rate using the alternative model specification. For the main model, we do not find any significant effects. For those with study-oriented upper secondary education, we have a positive estimate, but with a t-value below 1.4.

Using the alternative specification, we find significant negative effects of relative employment shares on out-commuting for all the educational groups except for those with low education, where we find a significant positive estimate. In the main specification, we also have negative estimates for those with study-oriented upper secondary and high education, but the estimates are smaller in absolute values and less significant compared to those obtain for the alternative model specification. For those with low and vocational education, the effect of the relative employment share was set to zero a priori in the main model.

A summarize of our findings is then that classic economic internal migration motives as studied in for instance Sjaastad (1962) seem to matter modestly in Norway. In contrast, labor market conditions seem to matter more for migration and commuting than real wages. However, real wages and unemployment are insufficient for fully explaining internal migration and commuting. Other additional factors are needed. Recent research favors amenities, e.g., culture, landscape, public service, public infrastructure, climate, crime and social/family motives as main reasons for internal migration (see e.g., Chen and Rosenthal 2008; Niedomysl 2011; Niedomysl and Clark 2014; Korpi and Clark 2015).

8 Conclusions

Employing data for economic regions in Norway, we used panel data SUR models to analyze the extent to which internal migration and commuting activity are influenced by incentive variables. Our model is disaggregated in the educational dimension in that we consider four different educational groups. Sixteen flow variables played the part of response variables. For both internal migration and commuting, we looked at both inflow and outflow. For a given educational group, the explanatory variables are common to both response variables. In many cases, we do not find significant effects of the explanatory variables. Generally, the group with the lowest education appears to be the most sensitive with respect to the incentive variables. When we consider migration inflow, we find a significant positive effect of the relative wage variable for the group with study-oriented upper secondary education, and a positive almost significant estimate for those with low education. In the case of migration outflow the groups with low, vocational and high education are somewhat influenced by relative wages. For in-commuting, we only find a relative wage variable effect on the group with low education. When out-commuting is considered, significant and nearly significant effects are found for two of the groups, i.e., for those with study-oriented upper secondary and high education. In all cases where we found at least some effect, the signs are in accordance with a priori expectations.

For the groups with low and vocational education, an increase in a region’s relative unemployment rate leads to somewhat lower in-migration. In contrast, we find no effect for out-migration. For the groups with study-oriented secondary education and high education, we find no effect on the relative unemployment rate on in-migration, but with respect to the former group we find a positive and almost significant effect on out-migration.

We find that an increase in the relative unemployment rate leads to significantly lower in-commuting for the groups with low and vocational education. For both groups, we fail to find any effect on out-commuting. For the group with high education we find no effect of the relative unemployment rate on neither in- nor out-commuting. Lastly, for those with study-oriented education we find a positive insignificant effect of the relative unemployment rate on out-commuting, but no effect on in-commuting.

Also, the relative employment share variable seems to enter with the expected sign in the case where the companion estimate is of some significance. With respect to in-migration, we find no effect of this variable for any of the education types, but we find a significant negative effect for those with low education in connection with out-migration. With respect to in-commuting, we find significant positive effects for those with low and vocational education. However, out-commuting is unaffected. For the two other education types one has the opposite situation, that is in-commuting is unaffected, but out-commuting is affected negatively, and significantly so for those with high education.

Our robustness analysis, which focused on internal migration and commuting from ‘small’ to ‘large’ economic regions produced a somewhat variegated picture. Whereas the relative unemployment variables generally entered more forcefully in specifications underlying the robustness analysis than in the main cases, it turned out to be harder to obtain large and significant estimates with the expected sign for the relative wage variables.

By considering the results from both the main model and the robustness analysis, we conclude that the incentive variables do not fully capture all motives for internal migration and commuting. Ideally, our modeling framework would benefit from the inclusion of variables related to amenities. Unfortunately, most of such variables are not included in registry data sets available to researchers.

In addition to the incentive variables and the population size of the region, we included regional fixed effects as well as annual fixed effects. Thus, our econometric framework is simpler than the approach based on pair-wise observations used by other researchers in this area. This more sophisticated approach, in the tradition of gravity models, is characterized by including population variables and fixed effects for both the ‘sending’ and the ‘receiving’ economic region. One challenge associated with this approach, when there are many observational units, is the predominance of zeros. Dropping observation pairs with zeros may have an undesirable effect on the inference. Researchers have recently started to employ a model that uses maximum likelihood estimation based on the Poisson distribution which also accounts for the zero observations (see Santos Silva and Tenreyro 2006). However, in this approach the response variables cannot be log-transformed, as we have done in our analysis.Footnote 9 It is an interesting topic for further analysis, nevertheless.

We did not take differences in age and gender into account when modeling internal migration and commuting flows.Footnote 10 Such variables seem more relevant in a microeconometric setting. Since the time span is somewhat limited, the age and gender distribution will remain relatively stable through the period.

We heave disregarded the impact of variation in housing prices across the observational units.Footnote 11 The regional housing price data do not fit well with our regional classification since they focus on areas where the turnover is of some size. For contributions emphasizing the explanatory power of housing prices for internal mobility, see Cannari et al. (2000) and Hämäläinen and Böckerman (2004). Another issue related to housing is that home ownership is widespread in Norway. The rental market is quite limited and is primarily restricted to the largest cities. Palomares and van Ham (2020) noted that home ownership was an important factor for limiting internal mobility in Spain, and it is possible that high levels of home ownership in Norway may also work to lower migration propensities in Norway.

Another issue that we have not considered in this article is the distinction between international immigrants and individuals born in Norway when it comes to internal migration in Norway. Individuals born abroad (immigrants) are believed to follow a pattern that differs from that of individuals born in Norway. This issue has also been raised by other researchers with respect to other countries, see for instance Schündeln (2014). Another issue that has been addressed is whether there is any response by natives to internal migration of international immigrants, see for instance Kritz and Gurak (2001) and Ali et al. (2012).

In summary, we find overall weak responses to the incentive variables employed in this study. There is, however, some interesting variation across educational levels. The lowest educated appears to be most responsive. Because of increased educational attainment in the population, there will be still fewer individuals in this group, which is dominated by immigrants. According to our model and empirical results this will impair the government’s possibility to influence internal migration by measures impacting the two incentive variables. Said in another way, the stimuli will need to be stronger to maintain the same effect as when there are more individuals in this group.