1 Introduction

For years, the Socio-Economic Panel (SOEP) has been the standard data set for analyzing income and wageFootnote 1 inequality in Germany (e.g., Steiner and Wagner 1998; Biewen 2000; Gernandt and Pfeiffer 2007; Biewen and Juhasz 2012; Sommerfeld 2013). However, research based on administrative data has recently gained importance, especially in labor economics (e.g., Card et al. 2013; Fitzenberger and de Lazzer 2022). Large sample sizes and the (expected) accuracy of the information included are considered advantages of this type of data. However, these data are designed for administrative purposes and may not contain all the information needed for research. At the same time, most large household surveys have begun to increase their samples, in part to include specific subpopulations.Footnote 2 Compared to administrative data, survey data contain much more information and cover more topics, but at the cost of much smaller sample sizes and perhaps less precision in certain quantitative variables.Footnote 3

Despite these developments, there is a paucity of work comparing survey and administrative data and examining whether these data sets produce comparable results.Footnote 4 In this article, we fill this gap for Germany by comparing trends and levels of wage inequality based on the SOEP (e.g., Goebel et al. 2019) with results based on the Sample of Integrated Labour Market Biographies (SIAB; e.g., Frodermann et al. 2021).

Both data sets are widely used in empirical labor research and have been used extensively in past research on wage inequality. Based on the SOEP, Biewen and Juhasz (2012), e.g., analyze determinants of the rise in income inequality in the early 2000s and show that most of the increase is due to rising labor income inequality. Burauel et al. (2020) analyze the impact of the introduction of the minimum wage on wage inequality and find that this reform leads to a reduction in inequality.Footnote 5 In addition, the SOEP forms the basis of many social reporting statistics in Germany. Several German research institutes publish inequality statistics based on the SOEP at regular intervals (e.g., Stockhausen and Calderón 2020; Grabka 2021), which are incorporated into policy and governmental reports (e.g., Bundesregierung 2016, 2021; OECD 2018).

Most studies on wage inequality in Germany based on administrative data have been conducted with data sets from the Research Data Center (FDZ) of the Federal Employment Agency (BA) at the Institute for Employment Research (IAB). Dustmann et al. (2009), e.g., use the IAB Employment Samples (IABS) to examine the West German wage structure. They show that wage inequality in West Germany has increased between 1975 and 2004. Klein et al. (2013) use the IAB’s Linked Employer-Employee Data (LIAB) to analyze the impact of export activity on wage inequality within and across skill groups. Card et al. (2013) use the Integrated Employment Biographies (IEB) and find that increasing heterogeneity at the establishment level and increasing assertiveness in assigning workersFootnote 6 to establishments mainly explain increasing wage inequality. Dustmann et al. (2014) use the SIAB and show that real wage growth is negative for the lower end of the wage distribution. Fitzenberger and de Lazzer (2022) also use SIAB data to examine whether changes in the selection into full-time employment among German men are a cause of the rise in wage inequality since the mid-1990s.Footnote 7

Recent results based on the SOEP show that (hourly) wage inequality—measured as the ratio between the 90th and 10th percentiles—increased substantially in the early 2000s, remained relatively stable between 2006 and 2014, then declined until 2016, and continued to move horizontal until 2019, bringing the German labor market back to the inequality level of the early 2000s (Grabka 2021). Using the SIAB data through 2014, Fitzenberger and Seidlitz (2020) confirm the same upward trend in the early 2000s for full-time worker wage inequality, measured as the ratio between the 80th and 20th percentiles. However, in their sample, inequality continued to increase for men and decrease for women through 2014. The disparate results based on different data sets underscore the need for a thorough comparative analysis.Footnote 8

The SOEP data have the advantage that information on individuals’ wages is embedded in the full set of information available in a large household survey. This allows researchers not only to calculate individual earnings or individual wages, but also to include the household level or to distinguish between measures of income before and after transfers and taxes. Administrative data are usually produced by government agencies in the course of implementing certain rules, regulations, and laws. Statistics are a byproduct of these activities. On the one hand, administrative data can be considered very accurate in terms of the information required for the administrative procedure from which the data originate, e.g., as there are legal sanctions in case of misreporting. On the other hand, some additional information collected as part of the administrative procedure, such as educational status or occupation, may be considered less reliable.

There is no doubt that the set of potential control variables in the SIAB is much smaller compared to the SOEP. However, because survey costs are substantial, the coverage of surveys is usually limited (about 30,000 individuals in the last SOEP wave), while the administrative data cover almost the entire population of individuals participating in the labor market. The IEB covers, inter alia, all workers subject to social insurance contributions in Germany since 1975. The SIAB is a two-percent random sample of the IEB. This restriction is due to data protection purposes. Nevertheless, the SIAB covers labor market related information of around 1.78 million persons.

Administrative data have the advantage that the characteristics essential to the underlying administrative process are usually of high quality and therefore have a low number of missings and a low measurement error. However, administrative data can be affected by, e.g., processing errors (e.g., duplicate reports, data entry error) (Kapteyn and Ypma 2007; Groen 2012; Lindner and Andreasch 2014) and/or coverage errors (e.g., administrative data lack information on the shadow economy). Whereas, survey data can be subject to various measurement errors (e.g., Bound et al. 2001). Basically, a distinction can be made between sampling errors and non-sampling errors. Sampling errors occur when only a non-representative subset of the population is actually surveyed. Non-sampling errors include coverage errors, framing errors, response/non-response errors, measurement errors, and processing errors (e.g., de Leeuw et al. 2008). In summary, both survey data and administrative data are subject to different types of measurement errors and therefore neither data source should be considered the only true one.Footnote 9 For Germany, this has been shown, e.g., by Oberski et al. (2017). They “found for official administrative data obtained from the German Federal Employment Agency that the reliability of both survey and administrative data was far from perfect.” (Oberski et al. 2017, p. 1486). Any comparison of survey and administrative data additionally faces differences in the definitions of the unit of analysis, divergent reference periods, or even censoring.

As noted above, both the SOEP and the SIAB have been used to analyze wage inequality in the past, using different analytical methods that take advantage of the unique characteristics of the two data sets. To our knowledge, however, no attempt has ever been made to derive comparable inequality estimates from these data sources. In this article, we fill this gap by bringing in estimates of wage inequality trends for Germany based on (1) samples that exploit the strengths of each data set, (2) samples that are as comparable in composition as possible, and comparing our findings from these approaches.

2 Data

In the following, we briefly introduce the two data sets we use—SOEP and SIAB. We then define different sub-samples of the two data sets that we use for the analyses. More detailed information on the data sets, data preparation, and sub-sampling can be found in Appendix B.

2.1 German Socio-Economic Panel (SOEP)

The SOEP is a representative household survey. It has been conducted annually since 1984 and in the last wave covered more than 20,000 households with more than 30,000 individuals (excluding children). The SOEP covers a wide range of topics, including detailed information on earnings and wages at both the individual and household level (Goebel et al. 2019). Due to its nature as a household survey, the SOEP also covers civil servants and self-employed persons as well as marginally employed persons in addition to employees. Wages are surveyed for the previous month, including any overtime pay, and for the previous year, broken down into one-time payments and severance payments. Wages from secondary employment are queried separately. It should be noted that the way in which information for secondary employment is collected has changed fundamentally in 2017. Since then, dependent employment can be clearly distinguished from an honorary and self-employment secondary employment. Previously, this distinction was not possible. In case of item non-response, the main imputation method is the so-called row-and-column imputation developed by Little and Su (1989). When longitudinal information for the imputation process is missing, OLS regressions are applied (see Frick and Grabka 2005).Footnote 10

We use the SOEP-Core v37 for our analyses. For more details and a brief description of the data preparation steps performed see Appendix B1 and Schröder et al. (2020).

2.2 Sample of integrated labour market biographies (SIAB)

The SIAB is a two percent random sample drawn from the IEB of the IAB. The IEB is an administrative data set with information from various data sources. It includes, among others, all workers subject to social insurance contributions and all marginally employed individuals in Germany.Footnote 11

The employment information in the IEB comes from the integrated notification procedure for health, pension and unemployment insurance (Bender et al. 1996). As a part of this procedure, employers are required to submit notifications on all their employees subject to social security insurance to the relevant social insurance institutions at least once a year. Civil servants and self-employed individuals are not subject to social security insurance and are therefore not included in the data set.

Workers can be identified by an artificial individual ID and tracked over years. The data is organized by employment spells. The maximum length of a spell is one calendar year. For each employment spell, the beginning and end of employment on a daily basis and the average gross daily wage are known, among other things.

The wage data in the SIAB is very reliable. The information is used, e.g., to calculate retirement pensions and unemployment insurance benefits. However, the wage data is only relevant up to the social security contribution assessment ceiling. For this reason, the wage information in the process data is top-coded, so that we only observe wages up to the contribution assessment ceiling. Therefore, following Stüber et al. (2023), we impute top-coded wages using a 2-step imputation procedure similar to Dustmann et al. (2009) and Card et al. (2013).Footnote 12

We use the SIAB 7519 for our analyses. For more details and a brief description of the data preparation steps performed see Appendix B2 and Frodermann et al. (2021).

2.3 Defining sub-samples for the analyses

To highlight the strengths and weaknesses of the SOEP and SIAB data and to compare the two data sources, we define different sub-samples of the data sets in Sect. 2.3.1, which we then use for the analyses. The rationale behind the sub-samples hereby is the following: the samples described in the next section play to the individual strengths of the two underlying data sets. Researchers studying wage inequality based solely on the SOEP or the SIAB will most likely end up using one of these sub-samples. These thus represent the standard use cases of the two data sets. In contrast, the samples described in Sect. 2.3.2 follow the intention of making the two data sets as comparable as possible.

The following basic restrictions apply to all samples we draw for our analyses in this article:

  • The analyses are performed for the last two decades covered by our data, i.e., 2000 to 2019.

  • We consider only workers between the ages of 18 and 65.

  • Wages from self-employment or wages paid as part of an internship or, e.g., a voluntary social year are not taken into account.Footnote 13

2.3.1 Exploiting the strengths of the SOEP or SIAB

For each data set, we create two sub-samples to take advantage of the respective strengths of the data set. The basic restrictions listed in Sect. 2.3 apply here as well, of course, but are not repeated in the sketches for the individual data sets.

2.3.1.1 SOEP-Pure 1 (monthly wage)

Workers considered: All workers subject to social insurance contributions and civil servants.

Sampling: All respondents who reported either positive wages in the last month or positive wages in a second job when wages from the main job were either zero or missing.

Wage for the calculation of wage inequality: Nominal gross monthly wage of person’s main job or, if not available, second job in the month prior to the interview.

2.3.1.2 SOEP-Pure 2 (annual wage)

Workers considered: All workers subject to social insurance contributions and civil servants.

Sampling: All respondents who reported positive individual wages for the last calendar year. Since the latest survey year available in the SOEP v37 is 2020, the latest available observation for wages in the last calendar year is for 2019.

Wage for the calculation of wage inequality: Nominal gross annual wages (sum of all labor wages excluding income from self-employment) of an individual in year preceding the survey year, including any one-time payments.

2.3.1.3 SIAB-Pure 1 (average daily wage)

Workers considered: All workers subject to social insurance contributions.

Sampling: Consideration of all employment spells.

Wage for the calculation of wage inequality: Average nominal gross daily wage of all jobs held by a person during the year [weighted by the duration of the employment spell; following Dustmann et al. (2009)].

2.3.1.4 SIAB-Pure 2 (daily wage as of June 30)

Workers considered: All workers subject to social insurance contributions.

Sampling: Consideration of all employment spells as of June 30 of each year.

Wage for the calculation of wage inequality: Sum of all nominal gross daily wages of an individual as of June 30 of each year.Footnote 14

2.3.2 Generating comparable sub-samples

The samples in the last section correspond to what researchers might use if they were analyzing only one of the two data sources. In contrast, in this section, we create sub-samples that are as comparable as possible between the two data sources.

First, we add another basic restriction to all comparable sub-samples: to ensure that the individuals considered are comparable, we exclude civil servants from the SOEP—or, more precisely, wages from civil servant employment relationships—in the comparable samples. The other basic restrictions listed in Sect. 2.3 also apply here, but are not repeated.

2.3.3 SOEP-SIAB-comparable 1: Gross monthly wage

The SOEP asks for wage information for the month preceding the survey month. Interviews are conducted in almost all months, but the vast majority of surveys take place from February to May.Footnote 15 Therefore, we replicate this survey structure in the SIAB. In each year, we randomly assign an interview month to each person in the SIAB. We ensure that the random assignment of the month results in the same distribution as for individuals aged 18 to 65 in this year’s SOEP. If a person in the SIAB is not employed in the month preceding the assigned interview month, this individual is not included in the analysis.

Since the SIAB does not contain the monthly wage, we calculate the gross monthly wage to the day as the gross daily wage (henceforth daily wage) multiplied by the number of days in the employment spell in the respective month. The daily wage in the SIAB also includes bonus payments, etc. (e.g., Christmas bonus) received by the employee in the duration of the spell. This information is not included in the monthly wage in the SOEP. However, the information on bonus payments is collected retrospectively for the last survey year. To mimic the wage measure in the SIAB, we use this information as a proxy and add 1/12 of the one-time payments collected retrospectively in the SOEP to the monthly wage.

If workers are employed by more than one employer, only the information for the main employment, i.e., the job with the highest monthly wage, is used. This leads us to the following comparable samples:

2.3.3.1 SOEP-comparable 1

Workers considered: All workers subject to social insurance contributions.

Sampling: All respondents who reported either positive wages in the last month or positive wages in a second job when wages from main job were either zero or missing.

Wage for the calculation of wage inequality: Nominal gross monthly wage of person’s main job or, if not available, second job in the month prior to the interview; adjusted proportionally for bonuses, etc.

2.3.3.2 SIAB-comparable 1

Workers considered: All workers subject to social insurance contributions.

Sampling: Annual random assignment of a survey month; distribution of survey months is predetermined by the actual distribution of survey months in SOEP.

Wage for the calculation of wage inequality: Nominal gross monthly wage (calculated to the day) of person’s main job in the month prior to the assigned interview.

2.3.4 SOEP-SIAB-comparable 2: Gross annual wage

Since the SIAB does not include annual wages, we calculate gross annual wages by adding up all wages (employment spell duration multiplied by the gross daily wage) for each person in each year.Footnote 16

2.3.4.1 SOEP-comparable 2

Workers considered: All workers subject to social insurance contributions.

Sampling: All respondents who reported positive individual wages for the last calendar year.

Wage for the calculation of wage inequality: Nominal gross annual wage of an individual in year preceding the survey year, including any one-time payments.

2.3.4.2 SIAB-comparable 2

Workers considered: All workers subject to social insurance contributions.

Sampling: Consideration of all employments.

Wage for the calculation of wage inequality: Sum of the nominal gross payrolls of all employment relationships of a person during the year.

3 The development of wage inequality in Germany

3.1 Inequality measures

To analyze income inequality and illustrate the difference between survey and administrative data, we focus on wage percentiles and wage percentile ratios. Percentile ratios are widely used, e.g., by the OECD, and are an intuitive way of representing income inequality.Footnote 17

Percentile ratios indicate the ratio of the wages of two individuals who are in different positions in a given distribution. For example, the P90/P10 ratio compares the wage at the 90th percentile with that at the 10th percentile. If the P90/P10 ratio has increased over a period, this indicates that inequality between the top and the bottom tails of the wage distribution has increased. The disadvantage of the P90/P10 ratio is that wage trends above the 90th percentile and below the 10th percentile are not considered. Somewhere above the 90th percentile, however, is the wage threshold above which employers no longer report detailed wage information, so that inaccuracies can occur even when wage information is estimated. Thus, using the P90/P10 ratio circumvents the uncertainty above the wage threshold.Footnote 18 For the purposes of this article, we consider the following three percentile ratios: P90/P10, P90/P50, and P50/P10.

3.2 Using the strengths of survey and administrative data

First, we measure wage inequality in both data sets using the two most common wage measures for the respective data sets: monthly wage (SOEP-Pure 1) and annual wage (SOEP-Pure 2) for the SOEP and average daily wage (SIAB-Pure 1) and daily wage on June 30 (SIAB-Pure 2) for the SIAB. These are the native measures for the respective data set, i.e., monthly wages as well as annual wages are collected directly in the SOEP questionnaire and wage information are provided as average daily wages in the SIAB. In the following section, we then measure wage inequality using comparable sub-samples of the two data sets.

The results based on monthly and annual wages in the SOEP are shown in Fig. 1. Results based on average daily wage and daily wage at June 30 in the SIAB are shown in Fig. 2.

Fig. 1
figure 1

Development of wage percentiles ratios in the SOEP 2000–2019. Source: own calculations, SOEP-Core.v37.EU

Fig. 2
figure 2

Development of wage percentiles ratios in the SIAB 2000–2019. Source: own calculations, SIAB 7519

Looking at the trajectories of the percentile ratios in the two data sets, it is immediately apparent that the P90/P10 value is lower in the SOEP than in the SIAB. The trend over the years also shows clear differences.

In the SOEP, the monthly P90/P10 percentile ratio is relatively stable until 2016 and then declines (see Fig. 1A).Footnote 19 In contrast, the annual P90/P10 percentile ratio shows a weak inverted U-shape: it rises until 2011 and then falls again starting around 2013 (see Fig. 1B).

In the SIAB, the pattern is much more rigid. However, a very weak inverted U-shape can be seen for the P90/P10 percentile ratio of average daily wages (see Fig. 2A). Comparing the P50/P10 percentile ratios, a similar pattern emerges as for the P90/P10 ratios. In contrast, the P90/P50 percentile ratios are quite similar in all four samples: a nearly constant pattern with a ratio of about two.

The fact that the P90/P50 percentile ratios are quite similar in all four samples, but there are major differences in both the level and trend of the P90/P10 and P50/P10 percentile ratios, suggests that there are differences in the data sets at the lower end of the wage distribution. The lower P90/P10 and somewhat lower P50/P10 percentile ratios in the SOEP indicate that the P10 value in the SOEP must be higher than that in the SIAB.

Of course, this difference may be because the population covered by the two data sets is not fully comparable. For example, the SOEP includes wages of civil servants, which are not included in the SIAB data. In addition, the SOEP surveys do not take place evenly throughout the year. Therefore, wages in the SOEP could be influenced by seasonal effects.

To examine whether the differences described are due to these factors, we consider the comparable sub-samples in the following section. However, another strategy for uncovering potential factors to explain the observed differences between SOEP and SIAB may be to use other data sources with wage information. Therefore, in further analyses, we use the Federal Statistical Office’s Structure of Earnings Survey (VSE, Verdienststrukturerhebung) in Appendix C to gain further insights.

3.3 Using comparable sub-samples of survey and administrative data

In the following, we consider the four comparable sub-samples presented in Sect. 2.3.2. Figures 3 and 4 show the comparable samples using monthly and annual wages, respectively.

Fig. 3
figure 3

Monthly wages in 2000–2019; sample: comparable sample; no lower limit. Source: own calculations, SOEP-Core.v37.EU & SIAB 7519. 95% confidence interval indicated by shaded areas (for percentiles only); 500 bootstrap replications

Fig. 4
figure 4

Annual wages in 2000–2019; sample: comparable sample; no lower limit. Source: own calculations, SOEP-Core.v37.EU & SIAB 7519. 95% confidence interval indicated by shaded areas (for percentiles only); 500 bootstrap replications

The percentile trajectories of monthly wages (Fig. 3A) look relatively similar in the two data sets. This is especially true for the 50th percentile. However, the P10 of the SOEP is somewhat higher than that of the SIAB, especially from 2017 onward. On the other hand, the P90 of the SIAB is consistently higher than that of the SOEP.

The percentile trajectories of annual wages (Fig. 4A) in the two data sets also look quite similar. However, here the P10 of the SOEP is significantly higher than the P10 of the SIAB, and one also sees clear differences in the two P50 percentiles.

The differences—in both monthly and annual data—translate into striking differences when percentile ratios are considered (see Figs. 3B and 4B). Here we can see that the clear differences in percentile ratios—with the exception of P50/P10—still exist.

These results suggest that wages at the lower (and upper) ends of the wage distribution are captured differently. It could be that low wages are underrepresented in the SOEP. This could be due to sampling, but it could also be due to respondents not reporting very low wages or simply forgetting about them. It could also be because information on second jobs has been collected more precisely in the SOEP since 2017. Another explanation could be that respondents with several marginal part-time jobs are very likely to add them together and thus report higher wages as captured in the SIAB. This last aspect might be particularly relevant, given that the number of workers with more than one job more than doubled between 1999 and 2019.Footnote 20

In the period under consideration, low-income wages are included in the administrative data (SIAB) because since 1999 jobs with wages below the marginal earnings threshold are subject to a lump-sum social security contribution payable by the employer. For this reason, marginal wages should be recorded relatively reliably in the SIAB. Here, too, however, it cannot be ruled out that, e.g., spells with low wages may occur due to subsequent wage declarations that do not represent actual employment. Although these are administrative data, we observe some employment spells with unrealistically low daily wages (e.g., \(\le\) 1 euro)Footnote 21. This could lead to an overestimation of marginal part-time employment in the SIAB.

To further analyze this, we decided to add another restriction to our comparable sub-samples below by limiting the analyses to wages above the marginal earnings threshold.Footnote 22

3.4 Using the marginal earnings threshold as a lower wage floor

Using the marginal earnings thresholdFootnote 23 as a wage floor, the results are no longer representative for the entire German labor force, but only for workers with wages above the marginal earnings threshold. Nevertheless, the percentile ratios can be used to examine the development of wage inequality in this group and to highlight differences in the two data sets.

The results of this exercise are striking. Looking only at wages above the marginal earnings threshold, the trend is now almost identical for all three percentiles considered (see Figs. 5A and 6A). Only in 2016 does the P90 in the SOEP decrease slightly, but the development in the following years is again in line with that of the SIAB.

The trajectories of the percentile ratios are now also quite similar (see Figs. 5B and 6B). However, we still find a level difference in the P90/P10 ratio for annual wages (see Fig. 6B).Footnote 24

Fig. 5
figure 5

Monthly wages in 2000–2019; sample: comparable sample; only wages above marginal part-time employment. Source: own calculations, SOEP-Core.v37.EU & SIAB 7519. 95% confidence interval indicated by shaded areas (for percentiles only); 500 bootstrap replications

Fig. 6
figure 6

Annual wages in 2000–2019; sample: comparable sample; only wages above marginal part-time employment ceiling. Source: own calculations, SOEP-Core.v37.EU & SIAB 7519. 95% confidence interval indicated by shaded areas (for percentiles only); 500 bootstrap replications

4 Conclusion

Two very different data sets are often used to describe wage inequality in Germany. On the one hand, the administrative data of the SIAB and, on the other hand, the information obtained from a population survey, the SOEP. Both data sources have their specific advantages and disadvantages. Using the full information from each data source yields comparable trends in wage inequality from 2000 to about 2015. Since then, the results diverge at first glance. After harmonizing the analysis population, comparable trends and relatively similar levels emerge again for both data sources. However, the comparison of the two data sources also shows that there are systematic deviations between them in the area of very low wages. Various reasons may be responsible for this: On the survey side, it might be the case that respondents add up wages from several activities, so that the SOEP shows higher wage levels in the lower part of the distribution. In addition, it cannot be ruled out that money flows from the employer to the employee over and above the regular wage, e.g., to pay for short-term overtime in a mini-job, but not to exceed the official mini-job threshold. Accordingly, there are significantly more very small wage reports on the SIAB side. One of the reasons for this might be short-term registrations and cancellations of employment relationships, which are not considered relevant for respondents and therefore tend not to be reported in survey data. In addition, there is presumably a problem on the part of the SIAB, known from minimum wage research: if the minimum wage legislation is not complied with or the mini-job limit is exceeded, electronic payroll systems issue a warning message and wages or working hours might just be adjusted in the system in accordance to the legal situation (e.g., Bachmann et al. 2020, p. 26). This is likely to occur more frequently in the lower wage range and in short-term employment. Overall, however, the findings presented here show that both data sources are well suited to adequately describe wage inequality in Germany above the mini-job threshold. However, the observed differences at the bottom of the wage distribution require further analysis. The upcoming release of SOEP data linked to SIAB data by social security number (SOEP-ADIAB), planned for 2023, will provide a database helping to further investigate into these differences.