1 Introduction

Pay differences between men and women are one of the most intensively investigated phenomena in economics. Although the gender pay gap has been declining over the last decades in developed countries, women still earn considerably less than men (Blau and Kahn 2017). Gender differences do not only exist in realized wages, but they already emerge in expectations of young people about their future salaries (e.g., Filippin and Ichino 2005; Kiessling et al. 2019; Reuben et al. 2017). Hence, one reason why women do not earn as much as men might be lower wage expectations, as individuals tend to accept jobs that match their beliefs (Reuben et al. 2017).

In this paper, we focus on prospective university students’ expectations about their own starting salary as well as that of an average graduate in the same field and with the same degree. We provide a comprehensive analysis of the structure of the gender gap along the entire distribution of expected salaries. In a first step, we estimate conditional and unconditional quantile regressions making use of a rich survey data set collected at Saarland University, Germany, that is roughly representative of the German student population. The conditional quantile regressions allow us to examine the gender gap at different percentiles of the distribution of expected salaries among students with the same field of study, intended degree, attitudes about income and other characteristics. The unconditional quantile regressions, in contrast, allow us to address the question how gender affects ceteris paribus the salary expectations of students at different percentiles of the unconditional distribution. In a second step, we decompose the gender gap at unconditional quantiles based on the method proposed by Firpo et al. (2009, 2018) in order to evaluate the contributions of different observed characteristics. Finally, we analyze to what extent gender differences in biased beliefs with regard to their own performance and the general labor market prospects contribute to the gender gap in the salary expected for oneself. To this end, we consider two different measures of biased beliefs. Firstly, we include indicators for over- and underplacement that relate students’ own salary expectations to their expectations for average others while taking into account their relative performance (grade point average above/below median) in secondary school. Secondly, we analyze how the gender gap is affected by misperceptions of the average starting salary. For this, we compare students’ expected average salary with the actual average starting salary in their field of study.

Our results indicate a raw gender gap at all percentiles of the distribution of expected salaries. At the median, we find raw gender gaps of 17 and 12 percent for the expected own and expected average salary, respectively. Accounting for observed characteristics reduces the gender gaps substantially, however, particularly the gap in the expected own salary remains sizable and highly significant. We find larger negative effects of being female at the bottom than at the top of the distribution, indicating a sticky floor structure of the gender gap in wage expectations. Furthermore, our decomposition results show that differential sorting of men and women into fields of study is strongly related to the gender gap in expected own and expected average salaries. However, after additionally accounting for biased beliefs, the contribution of field of study to the explained gap in the expected own salary shrinks and becomes insignificant at most percentiles. Gender differences in our measures of over- and underplacement together contribute between 15 and 51 percent to the total gender gap. Even larger fractions of the total gender gap can be attributed to misperceptions of the average salary. Consequently, the unexplained part of the gender gap becomes insignificant after taking gender differences in biased beliefs into account.

Our paper contributes to three research areas: gender differences in pay, wage expectations and biased beliefs. First, our study speaks to the literature analyzing the gender gap in actual pay (see, e.g., the meta-analysis by Weichselbaumer and Winter-Ebmer 2005 for an overview). Many studies in this line of research document a U-shaped pattern of the gender pay gap along the conditional and unconditional distributions of earnings, meaning that the gender pay gap is particularly pronounced in the lower (sticky floor effect) and upper tail (glass ceiling effect) of wage distributions (Antonczyk et al. 2010; Christofides et al. 2013). Analyzing the gender wage gap in the entire working population, Collischon (2019) documents a glass ceiling effect in Germany. In contrast, Francesconi and Parey (2018), who exclusively focus on university graduates in Germany, a group closely related to our sample, find larger gender gaps in the lower tail of the wage distribution. Focusing on prospective students, we are able to document that the gender pay gap is already present at an earlier point in the careers of young people: when high school graduates form their expectations about their future salary.

Second, our study contributes to the literature on gender differences in earnings expectations that have been much less under study so far. However, the use of subjective expectations data in economics in general as well as in the context of education and labor economics has increased over the last years (see, e.g., Arcidiacono et al. 2020; Baker et al. 2018; Jacob and Wilder 2011; Reuben et al. 2017; Zafar 2013). Specifically, Kiessling et al. (2019)—who analyze wage expectations of advanced students in Germany—find significant gender gaps in the expected starting salary and expected lifetime earnings. Furthermore, their results suggest that a substantial part of the gap can be attributed to gender differences in sorting into fields of study as well as personal style in pay negotiations. Unlike Kiessling et al. (2019), we focus on the gender gap in expected starting salaries of prospective students, young individuals who are yet to enter university. In contrast to Kiessling et al. (2019), we study the gender gap also along the unconditional wage expectations distribution and compare the gaps in expected own salary and expected average salary. Moreover, our evidence highlights the role of gender differences in biased beliefs for explaining the gender gap in the expected own salary, rendering the contribution of differential sorting into field of study insignificant.

Third, our paper is related to studies analyzing gender differences in beliefs about individual and general economic outcomes. Students’ beliefs about average starting salaries in their chosen field of study are related to the concept of optimism. In the economic literature, optimism is often defined as positive expectations about future events that are beyond the control of an individual (Jacobsen et al. 2014; Bjuggren and Elert 2019). Previous studies have shown that there exist gender differences in optimism in various economic domains. Bjuggren and Elert (2019), for instance, document that men are more optimistic than women about the future state of the economy. In a similar vein, we compare the expected average starting salary between genders and show that male students are more optimistic about future wage levels in their field than their female counterparts.

A further related study is the work of Reuben et al. (2017), who analyze the impact of overconfidence as well as competitiveness and risk taking on earnings expectations of undergraduate students at New York University. Combining experimental and survey data, they show that a substantial part of the gender gap in wage expectations can be attributed to differences in overconfidence and competitiveness. Similarly, we study the role of biased beliefs that are presumably related to overconfidence. Specifically, we evaluate how students perceive their own earnings potential relative to other students and compare this to the relation between their own grade point average in secondary school and the average GPA of students in the same study field. Such a comparison is most closely related to the concept of over- and underplacement. Overplacement is classified as one form of overconfidence in the psychology literature and is defined as an exaggerated belief that one is doing better than others (Moore and Healy 2008). Thus, we denote our measures as over- and underplacement, respectively. Previous research documents that males are more likely than females to overplace themselves in some settings (e.g., Niederle and Vesterlund 2007; Ring et al. 2016).

Moreover, we also analyze gender differences with respect to misperceptions of average salaries. A number of studies have shown that university students are to some extent misinformed about average market salaries (Betts 1996; Jensen 2010; Wiswall and Zafar 2015). Our results suggest that biased beliefs play a major role in explaining gender differences in wage expectations. In fact, we do not find a statistically significant unexplained part of the gender gap after accounting for biased beliefs.

The remainder of the paper is organized as follows. Section 2 describes the data and shows descriptive statistics. Section 3 presents and discusses our estimation results of the conditional and unconditional quantile regressions, as well as of the decomposition of the gender gaps. Section 4 describes our measurements of over- and underplacement as well as of misperceptions of average salaries and discusses their impact on the gender gap in expected own salaries. Section 5 concludes.

2 Data and descriptive evidence

2.1 Data set

We use data from a survey of prospective students at Saarland University, Germany, who applied for enrollment in the academic years 2011 and 2012 (henceforth Student Survey). Prospective students submitting a complete application received an e-mail with the URL of the survey and were asked to fill in the questionnaire truthfully. In 2011, applicants for a program in Business Studies, Humanities, Law Studies, as well as Mathematics and Computer Science received the questionnaire; in 2012, also applicants for a program in Education and Medicine were surveyed. The sample consists of 2,061 students in total, which is a relatively large sample size compared with other studies examining students’ wage expectations (e.g., Reuben et al. 2017; Webbink and Hartog 2004). The data were first used by Klößner and Pfeifer (2019), who also validated the quality of the data.Footnote 1 As Klößner and Pfeifer (2019) document, Saarland University and its students are well representative of an average university and the corresponding average student body in Germany. Figures like student/teacher ratio, gender ratio, student age distribution, distribution of graduates across fields, distribution of gender across fields, number of exams passed, grades, duration of studies, etc., are all close to the German average. Saarland University also appears in the middle of international rankings across items such as teaching/learning, research, knowledge transfer, international orientation and regional engagement.

The prospective students first had to state the field of study and the degree (bachelor, master or state examination) they applied for. They were also asked whether they intended to do a further degree afterward (master, second state examination or doctoral degree) and with which degree they intended to earn their first salary. In the next part of the survey, the prospective students were asked to answer several questions regarding monthly gross salaries. First, they had to provide an estimate of the average starting salary of other persons graduating in the same field of study and with the same degree. Second, they had to state their own expected starting salary, again referring to their chosen field and intended degree. Figure 4 in “Appendix” shows an extract of the questionnaire containing these two core questions.

Table 1 Descriptive statistics

Furthermore, the students had to provide information about personal characteristics and family background. Specifically, we collected information on their gender; age; grade point average in secondary school; the type of secondary school they had graduated from; the federal state in which they had graduated from secondary school; whether their mother and father had a college degree and if so, their major discipline; whether they intended to live at their parents’ home while studying; and whether they expected to receive the public financial aid ‘BAfoeG’Footnote 2 and, if so, how much.

In the last part of the survey, students had to state the branch of business they intended to work in and their work experience in this branch. They also had to answer two questions regarding the value they placed on their future income: the importance of expected income for their choice of college major and the importance of receiving an above average salary.

2.2 Descriptive statistics

Table 1 depicts means and standard deviations of wage expectations and several other relevant variables for male and female students, respectively.Footnote 3 The last column of Table 1 also shows t-values for the test of equality of means between males and females.

Male students expect on average higher starting salaries both for themselves and for an average other person in the same field and with the same degree compared with female students. More specifically, males expect on average to earn 3579€ a month at labor market entry, while the corresponding number for females is 3015€. With regard to average earnings of others in the same field of study and with the same degree, males state on average 3571€, which is significantly higher than the average female statement of 3178€. There is also a gender difference in the size of the expected own salary relative to expected average salary. Whereas the expected own salary is on average higher than the expected average salary for male students, the opposite is the case for female students.

Moreover, men and women differ substantially with regard to the choice of field of study, as is also found by previous studies (e.g., Zafar 2013; Osikominu et al. 2020). Men in the sample are significantly more likely to apply for a program in Business Studies or in Mathematics and Computer Science. Contrarily, significantly larger shares of women apply for a program in Education, Humanities or Medicine.

There are also differences between male and female students in terms of the value they place on their income after graduation. Male students value an above average salary more than female students. Similarly, male students are more likely to state that future income prospects were important or very important for their choice of college major. This suggests that men are more strongly driven by pecuniary incentives than women who seem to place a higher value on non-pecuniary aspects of college majors and employment opportunities. Similar evidence has also been documented, among others, in Wiswall and Zafar (2018), Osikominu et al. (2020).

Fig. 1
figure 1

Source: PersonalMarkt Services GmbH and Student Survey, own calculations

Expected and actual starting salaries. Notes: Circles show mean values of expected own starting salaries. Squares show mean values of expected average starting salaries of students in the same field of study. Triangles show actual average starting salaries

Figure 1 displays mean values of expected own starting salaries and expected average starting salaries based on the Student Survey as well as actual average starting salaries differentiated by field of study.Footnote 4 Wage expectations differ considerably across majors. While Mathematics and Computer Science students expect the highest starting salaries for themselves and for average others, Education and Humanities students expect the least. The comparison of expected starting salaries with actual starting salaries reveals that prospective students make fairly small estimation errors on average.Footnote 5 Figure 6 in “Appendix” shows that across all fields more men than women expect their own salary in excess of the actual average salary.

3 Analyses of the gender gap in wage expectations

3.1 Conditional and unconditional quantile regressions

Since gender gaps in actual salaries vary along the wage distribution (see, e.g., Collischon 2019; Francesconi and Parey 2018), we want to analyze the gender gap in students’ expectations at different quantiles of the wage expectations distribution. Hence, we regress students’ expected own salary as well as their expected salary for average others on a female dummy and observed characteristics using conditional quantile regressions (CQR). Specifically, we model the logarithm of the respective expected salary (own or average) of individual i, \(y_i\), as:

$$\begin{aligned} y_i={\mathbf {x}}_i\pmb {\beta }_\tau +u_{\tau i} \qquad \text {with} \quad Q_\tau (y_i\,|\,{\mathbf {x}}_i)={\mathbf {x}}_i\pmb {\beta }_\tau \, , \end{aligned}$$
(1)

where \({\mathbf {x}}_i\) denotes the vector of explanatory variables, including a constant, and \(\pmb {\beta }_\tau \) the corresponding coefficient vector. As explanatory variables we use a female dummy as well as variables indicating the field of study, intended degree, degree applied for, family background, desired branch of business, work experience in desired branch, importance of income, grade point average in secondary school, type secondary school, region where high school was attended, age and survey year. The function \(Q_\tau (y_i\,|\,{\mathbf {x}}_i)\) denotes the \(\tau \)-th conditional quantile of the expected salary conditional on the explanatory variables, and \(u_{\tau i}\) is an error term with \(Q_\tau (u_{\tau i}\,|\,{\mathbf {x}}_i)=0\) (see Koenker and Hallock 2001; Koenker 2005, for details). CQR allow us to evaluate the gender gap along the distribution of expected salaries among students with the same characteristics.

Table 2 Gender gap in unconditional and conditional quantile regressions

As the conditional quantile function is nonlinear the gender gap at the \(\tau \)-th conditional quantile may differ from the gender gap at the \(\tau \)-th unconditional quantile. Therefore, we also estimate unconditional quantile regressions (UQR), as proposed by Firpo et al. (2009), that allow us to evaluate the gender gap along the unconditional wage expectations distribution. The basic idea of UQR is to model the recentered influence function (RIF) of the \(\tau \)-th unconditional quantile, \(Q_\tau \), of the logarithm of expected salary, \(y_i\), which can be expressed as

$$\begin{aligned} \text {RIF}(y_i;Q_\tau )=Q_\tau +\dfrac{\tau -\mathbb {1}\{y_i\le Q_\tau \}}{f_{y}(Q_\tau )} \, , \end{aligned}$$
(2)

where \(f_y(Q_\tau )\) denotes the marginal density of \(y_i\), and \(\mathbb {1}\{\cdot \}\) is the indicator function (Firpo et al. 2009). Assuming that the conditional expectation function of the RIF of \(Q_\tau \) is linear, we obtain a linear regression model

$$\begin{aligned} \text {RIF}(y_i;Q_\tau ) = {\mathbf {x}}_i\pmb {\gamma }_\tau + v_{\tau i} \, , \end{aligned}$$
(3)

where \(\pmb {\gamma }_\tau \) denotes the coefficient vector and \(v_{\tau i}\) the error term with \({{\,\mathrm{{\mathbb {E}}}\,}}[v_{\tau i}\, |\, {\mathbf {x}}]=0\). Standard ordinary least square (OLS) regressions of Eq. (3) yield the UQR coefficient estimates that can be interpreted as partial effects on unconditional quantiles of \(y_i\).

Our estimated gender gaps in wage expectations using conditional and unconditional quantile regressions are reported in Table 2. While Panel (a) displays the gender gaps in the expected own salary, Panel (b) shows the gender gaps in the expected salary for an average graduate in the same field of study and with the same degree. First and foremost, we find statistically significant raw gender gaps (RGG) in students’ expected salaries for both, themselves and an average graduate, at all inspected quantiles of the distribution, as can be seen in the first row of Panels (a) and (b), respectively. Except for the first decile, the raw gender gap in students’ expected own salary, reported in Panel (a), is always larger than the respective gap in their expected salary for an average graduate, reported in Panel (b). While the raw gender gap at the median in case of the expected own salary is 17 percent, the corresponding gap in case of the expected average salary is with 12 percent more than a quarter smaller. Interestingly, the raw gender gaps we find in students’ expected own salaries are similar in size to the gender gap in actual salaries of recent university graduates in Germany (Francesconi and Parey 2018).

In terms of the structure of the raw gap along the distribution, we also see differences between expected own and expected average salaries. Whereas the raw gender gap in expected average salaries is declining along the distribution, the raw gap in expected own salaries varies rather unsystematically. Thus, our results for the expected own salary do not completely match the findings of Kiessling et al. (2019), who report lower raw gender gaps at the top of the wage expectations distribution. Similarly, the raw gender gap in actual salaries of university graduates is declining with increasing wage levels. The raw gender wage gap in the whole working population in Germany, in contrast, is increasing along the distribution (Collischon 2019).

The second and third rows of both panels show gender gaps in wage expectations that are adjusted for gender differences in important observed characteristics (e.g., field of study, intended degree, GPA in secondary school, income importance). Consequently, the gender gaps estimated from the quantile regressions reflect gender differences in wage expectations that persist after taking these differences in observed characteristics into account. The two rows differ, however, in terms of the respective distribution being analyzed. The gender gaps displayed in the second row of both panels are estimated by unconditional quantile regressions (UQR) and, thus, can be interpreted as the adjusted gender differences in the unconditional distribution of wage expectations. The gender gaps displayed in the third row of both panels, in contrast, are estimated by conditional quantile regressions (CQR). Hence, these estimates reflect the gender gaps in the conditional wage expectations distributions.

Compared with the raw gender gaps, the adjusted gender gaps at the conditional and unconditional quantiles are, except at the first decile of the expected own salary, always smaller, suggesting that a non-negligible part of the raw gender gaps can be attributed to differences in observed characteristics between men and women. Nevertheless, the gender gaps in the expected own salary remain sizable and statistically significant at all studied quantiles after accounting for differences in observed characteristics. Analogously to the raw gender gaps, the adjusted gender gaps in the expected own salary are larger than the corresponding adjusted gaps in the expected average salary. In fact, the adjusted gender gaps at the bottom and at the top of the conditional and unconditional distribution of the expected average salary are at most weakly statistically significant. However, there are significant adjusted gender gaps in the middle of the expected average salary distribution. Thus, while we find that young women and men differ more in how they perceive their personal earnings prospects, we also see that male students are more optimistic about what a typical graduate from the same field of study can expect to earn. This finding is consistent with previous studies showing that women tend to be more pessimistic about future economic events (e.g., Jacobsen et al. 2014; Bjuggren and Elert 2019).

Overall, we do not find large differences between gender gaps in the conditional and unconditional distributions. Among students with the same characteristics, the gender gap is most pronounced among those who see themselves at the bottom of the conditional distribution of expected own salary, and the gap is the least pronounced at the top. Similarly, our unconditional quantile regressions also suggest a larger negative cet. par. effect of being female at the bottom of the unconditional distribution of expected own salary. Thus, the pattern of the adjusted gaps at conditional and unconditional quantiles mimics a sticky floor effect.

Eventually, our adjusted gender gaps are similar in size and structure to the adjusted gaps found by Kiessling et al. (2019), who estimate the gaps by conditional quantile regressions. However, the adjusted gender gaps in students’ own wage expectation do not completely mirror the adjusted gender gaps at conditional quantiles of actual salaries of recent university graduates found by Francesconi and Parey (2018). In contrast to the adjusted gender gaps in students’ wage expectations, the adjusted gap in actual salaries of university graduates is a bit smaller and constant across conditional quantiles. This divergence in adjusted gaps between expected and realized earnings might be due to differences in the respective set of covariates used across the different studies.

3.2 Decomposition of gender gaps at unconditional quantiles

To decompose the gender gap at unconditional quantiles of expected salaries, we apply the decomposition method proposed by Firpo et al. (2009, 2018) that is based on the RIF. One advantage of this approach in comparison with other decomposition methods for quantiles (e.g., Machado and Mata 2005; Melly 2005) is that it allows a detailed decomposition of the composition effect. Hence, we can also quantify the contribution of gender differences in, for example, the choice of field of study to the gender gap along the unconditional distribution of expected salaries.

We model the group-specific RIFs of men (\(g=0\)) and women (\(g=1\)) adopting again a linear specification for the conditional expectation of the RIFs:Footnote 6

$$\begin{aligned} \text {RIF}(y_{gi};Q_{\tau g}) = {\mathbf {x}}_i\pmb {\gamma }_{\tau ,g} + v_{\tau g i} \qquad \text {for} \quad g=0,1 \, . \end{aligned}$$
(4)

The vector of explanatory variables, \({\mathbf {x}}_i\), does not include a female dummy now. After estimating Eq. (4) by OLS for both groups, the overall gender difference at the \(\tau \)-th quantile, \({\hat{\Delta }}_O^\tau \), can be decomposed as,

$$\begin{aligned} {\hat{\Delta }}_O^\tau = \underbrace{{\bar{\mathbf {x}}}_1(\pmb {{\hat{\gamma }}}_{\tau ,1}-\pmb {{\hat{\gamma }}}_{\tau ,0})}_{{\hat{\Delta }}_U^\tau }+\underbrace{({\bar{\mathbf {x}}}_1-{\bar{\mathbf {x}}}_0)\pmb {{\hat{\gamma }}}_{\tau ,0}}_{{\hat{\Delta }}_E^\tau } \, , \end{aligned}$$
(5)

where \(\pmb {{\hat{\gamma }}}_g\) denotes the respective OLS coefficient vector and \({\bar{\mathbf {x}}}_g\) the respective vector of sample means. Hence, the raw gender difference at the \(\tau \)-th quantile is decomposed into the unexplained part, \({\hat{\Delta }}_U^\tau \), and the explained part, \({\hat{\Delta }}_E^\tau \).

If we want to interpret the explained and unexplained part of the decomposition truly as composition and wage structure effect, we need to assume that, at every quantile \(\tau \), the error term \(v_{\tau g}\) is mean independent of gender, G, i.e.,

$$\begin{aligned} {{\,\mathrm{{\mathbb {E}}}\,}}[v_{\tau g}\,|\,{\mathbf {x}}, G]={{\,\mathrm{{\mathbb {E}}}\,}}[v_{\tau g}\,|\,{\mathbf {x}}]=0 \, , \quad \forall \, \tau \in [0,1] \, , \; g=0,1 \, . \end{aligned}$$
(6)

This requires conditioning on all variables that are correlated with both the gender-specific outcome variables and gender. In the following analyses, we include a broad range of explanatory variables that we have selected based on economic reasoning and prior evidence on the determinants of expected starting salaries among prospective university students. Even though we do not have access to detailed controls for all underlying aspects of economic preferences and beliefs associated with both pay expectations and gender, to satisfy Assumption (6) it would suffice to condition on variables that can proxy the underlying facets well enough to achieve conditional independence of group-specific outcomes and group membership. As an example, while we cannot directly control for competitiveness, we know that students who choose a program in Education and indicate that pay is not important to them are similarly competitive and typically less competitive than students majoring in Business who state that pay is important to them.Footnote 7 Moreover, in Sect. 4, we consider a richer set of control variables that includes in addition measures of students’ beliefs about their own earnings potential relative to others and the general earnings prospects in their field.

Table 3 RIF decomposition of the gender gap in expected own salary

Table 3 contains the decomposition results at various unconditional quantiles of the expected own starting salary. While Panel (a) shows the aggregate decomposition results, detailed contributions of the most relevant covariates to the explained and unexplained part are given in Panels (b) and (c), respectively. According to Panel (a) and in line with the results in the previous section, a raw gender gap of 14 to 19 percentage points is present along the entire distribution of the expected own salary.Footnote 8 The part of the total gap that can be explained by gender differences in the covariates considered varies between six and ten percentage points in absolute terms and is always statistically significant. In relative terms, the explained part ranges between 36 percent at the median and 75 percent at the first decile. The unexplained part ranges between three and twelve percentage points and is statistically significant except at the tenth percentile.

Panel (b) of Table 3 shows the detailed contributions of selected variables to the explained part of the gender gap in the expected own starting salary. Among the covariates considered, gender differences in field of study explain the largest part of the gender gap in the expected own salary: Its contribution fluctuates between four and six percentage points across the quantiles considered and is statistically significant at all but the tenth percentile. In relative terms, differential sorting into field of study explains between 24 (25th quantile and ninth decile) and 41 (first decile) percent of the total gender gap. Gender differences in views on income importance explain up to two percentage points (seven percent) of the total gender gap. However, the effect is statistically significant only at the 75th and the 90th percentile. The contributions of other covariates (not reported in Table 3) are quantitatively less important and not statistically significant.

Table 4 RIF decomposition of the gender gap in expected average salary

Table 4 shows the corresponding decomposition results at unconditional quantiles of the expected average starting salary. According to Panel (a), the total gender gap in the expected average salary is with nine to 14 percentage points again smaller than the one in the expected own salary at every quantile considered. Between five and seven percentage points can be explained by gender differences in the included covariates. In relative terms, this corresponds to 36 to 74 percent of the total gap, depending on the quantile considered. The unexplained part lies between two and nine percentage points. While the explained gap is significant at four out of the five quantiles considered, the unexplained gap is significant only at the 25th and 50th percentile. The results of the detailed decomposition of the explained part, shown in Panel (b), reveal that gender differences in field of study contribute the most to explaining the gender gap in expected average salary. In relative terms, they explain between 25 and 54 percent of the total gender gap in the expected average starting salary, which is quantitatively similar to their contribution to the total gap in the expected own salary. Gender differences in views on income importance and other covariates (not reported in Table 4) do not contribute significantly to explaining the total gender gap.

All in all, the decomposition results match the patterns documented in the quantile regressions in Table 2: A substantial part of the raw gender gap in expected salaries can be attributed to differences in observed characteristics between men and women. Moreover, gender differences in field of study play the by far most important role for explaining the gender gap in expected salaries, which parallels findings in the literature on the gender gap in expected and actual pay (e.g., Collischon 2019; Kiessling et al. 2019; Fernandes et al. 2020).

4 The role of biased beliefs

While field of study or occupation have commonly been documented to be important drivers of the gender pay gap, they represent rather black-box controls for underlying aspects of preferences and beliefs that led to a particular choice of field of study or occupation. In the following, we want to open this black box by exploring to what extent gender differences in prospective students’ salary expectations reflect gender differences in biased beliefs about their own relative earnings potential and the general earnings prospects in their field.

4.1 Measuring over- and underplacement

Our data allow us to compare the expectations of students about their own starting salary, conditional on field of study and intended degree, with their expectation about the average starting salary of other students in the same field of study and with the same intended degree. Specifically, in the survey, we first elicit a respondent’s expectation about the average starting salary of graduates from the same field before we elicit the starting salary the respondent expects for herself (see Fig. 4 in “Appendix”). In so doing, we explicitly allow respondents to anchor the assessment of their own starting salary to their perception of the monetary success in the labor market of an average graduate with the same field of study and intended degree.

The exaggerated belief that one is doing better or worse than others is known as over- or underplacement in the psychology literature, and this terminology was adopted in economic research as well (Larrick et al. 2007; Moore and Healy 2008; Astebro et al. 2014; Duttle 2016). Individual-level differences between the two expected salaries, however, might not necessarily reflect mere over- or underplacement. Instead, they might also entail differences in abilities. In fact, somebody who expects to earn more than the average other graduate in the same program may do so because she obtained above average grades in secondary school. If all students were able to realistically asses their own abilities and based their wage expectations on them, individual-level differences between the two expected salaries would just reflect ability differences. To address this concern and to capture exaggerated misplacement, we take a prospective student’s grade point average in secondary school into account when constructing our measurement of over- and underplacement. The GPA in secondary school constitutes a salient summary measure of a student’s abilities because its value and the corresponding approximate rank are well known by the students themselves, and it is also an important criterion for admission to the different university programs in Germany. To be precise, we define a prospective student as overplacing themselves who expects to earn more than the average graduate in the same program but whose GPA in secondary school is below the median in their chosen program. Analogously, we define a student as underplacing themselves who expects to earn less than average, but has a GPA in secondary school above the median in their chosen program.

Table 5 Over- and underplacement by gender

Table 5 displays the means of our over- and underplacement indicators in the full sample, as well as by gender. Female students are with 18 percent as opposed to 12 percent significantly more likely to underplace themselves with respect to their starting salary. Moreover, the share of male students overplacing themselves is with almost 14 percent more than twice as large than the corresponding share of female students with 6 percent. Similar patterns emerge within all fields of study, as can be seen in Fig. 2. Within all fields, males are more likely to overplace themselves, while females are more likely to underplace themselves.

Fig. 2
figure 2

Source: Student Survey, own calculations

Over- and underplacement by field of study and gender. Notes: a Share of students who expect to earn more than average and have a final grade in secondary school that is worse than the median grade in the chosen field of study. b Share of students who expect to earn less than average and have a final grade in secondary school that is better than the median grade in the chosen field of study

A potential concern with our measurement of over- and underplacement is that, besides ability, there could be other reasons why a student expects to earn more or less than other graduates from the same program earn on average. Prospective students may value financial aspects associated with their study choice differently or desire to work in different industries with differing pay systems. However, in our decomposition of the gender gap, we condition on a broad range of observed characteristics (e.g., income importance, desired branch of business). Hence, this potential issue is not problematic for our identification of the effect of over- and underplacement on the gender gap in expected salaries.

Even though we account for many observed characteristics, there might still be concerns that gender differences in unobserved determinants of wage expectations affect our measures of over- and underplacement. Given that females tend to work fewer hours during their lifetime, one might expect that they intend to work fewer hours after graduation and, consequently, expect lower starting salaries. The previous literature, however, suggests otherwise. Neither is there a substantial gender difference in working hours of recent university graduates (Francesconi and Parey 2018), nor are female students less likely to expect to be in full-time employment directly after graduation (Fernandes et al. 2020). Moreover, previous research shows that non-cognitive skills and risk preferences do not contribute to the gender gap in expected starting salaries (Kiessling et al. 2019; Reuben et al. 2017). Therefore, we do not think that such factors have a major impact on the gender differences in our indicators for over- and underplacement with respect to starting salaries.

4.2 Measuring misperception of labor market prospects

Besides biased beliefs about the own salary in relation to the average salary, prospective students might also have misperceptions about the average starting salary in their chosen field of study. In Sect. 2.2, we have shown that students’ wage expectations on average are fairly close to the actual salaries in most study fields. However, at the individual level, there might still be substantial differences between the expected average salary and the actual average salary. Hence, we compare students’ expectations of the average starting salary with the actual starting salary in their field of study and analyze whether the incidence of biased beliefs about the general labor market prospects in the chosen field of study differs between male and female students. We construct measures of misperception of actual salaries that indicate whether a student over- or underestimates the average actual starting salary by more than ten percent, respectively.

Table 6 Over- and underestimation of average salary by gender

Table 6 displays the means of the indicators for over- and underestimation of the average salary in the full sample, as well as by gender. Firstly, we see that a large number of prospective students either over- or underestimates the average salary by more than ten percent. Male students are with 41 percent as opposed to 30 percent of women significantly more likely to overestimate the average starting salary in their field of study. The share of female students underestimating the average starting salary in their field is with 53 percent twelve percentage points larger than the corresponding share of male students with 41 percent. Moreover, Fig. 3 displays the share of students who over- and underestimate the average salary in each field of study. In all fields apart from Humanities male students are more likely to overestimate the average salary than female students. Females, in contrast, are more likely to underestimate the average salary within all fields.

Fig. 3
figure 3

Source: Student Survey, own calculations

Over- and underestimation by field of study and gender. Notes: a share of students who overestimate the average salary by more than ten percent. b share of students who underestimate the average salary by more than ten percent

4.3 Effects on the gender gap in expected own salary

We have shown in Sect. 3 that the gender gap in the expected own salary is larger than the gap in the expected salary for average others. Further, the decomposition reveals that the unexplained part is particularly pronounced for the expected own salary. Thus, we will now study the role of biased beliefs in explaining the larger gap in the expected own salary. To be precise, we decompose the gender gap in the expected own salary along the unconditional distribution, additionally including the indicators for over- and underplacement as well as for over- and underestimation of the average salary as explanatory variables. Table 7 reports corresponding results. Since our measures of over- and underplacement are based on GPA in secondary school, we can only perform the analysis for prospective students who reported their GPA. Therefore, we lose 479 observations.Footnote 9

Table 7 RIF decomposition of the gender gap in expected own salary

In line with the complete sample, the gender gap in the expected own starting salary is present at all studied quantiles and varies rather unsystematically: 12 percent at the first decile, 18 percent at the median and 15 percent at the ninth decile. As can be seen in Panel (b) of Table 7, the contribution of overplacement is positive and significantly different from zero at every observed point along the wage distribution, with higher values in the tails of the distribution. Whereas gender differences in overplacement are responsible for 31 percent of the gap at the first decile and for 16 percent of the gap at the ninth decile, they are only responsible for 10 percent of the gap at the median. In contrast to overplacement, the impact of underplacement is significant only at the center of the distribution. At the median and the 75th quantile, the contribution of underplacement is a bit more pronounced than the contribution of overplacement. Thus, while gender differences in overplacement are more important at the bottom and top of the distribution, differences in underplacement play a more decisive role at the center. Overall, our findings are in line with Reuben et al. (2017) who document that gender differences in overconfidence are partly responsible for higher expected salaries of male students.

Furthermore, our results show that misperceptions of the average salary significantly affect the gender gap in the expected own salary. The contribution of gender differences in the propensity to overestimate the average salary is sizable and highly significant at the median and the upper tail of the distribution. Up to 49 percent of the gender gap can be attributed to overestimation of the average salary. Analogously, the contribution of underestimation of the average salary is sizable and highly significant at the lower tail of the distribution up to the median. At the first decile, the contribution of underestimation of the average salary even exceeds the total gender gap. This suggests that, while the stronger tendency of males to overestimate the average salary explains a substantial part of the gender gap in the upper half of the distribution, the stronger tendency of females to underestimate the average salary is responsible for a substantial fraction of the gap in the lower half. After including our indicators of biased beliefs, the contribution of field of study to the explained part is only statistically significant at the 75th quantile, while the contribution of income importance to the explained part turns insignificant along the entire distribution. This indicates that these variables are correlated with biased beliefs and have partly absorbed their influence on the total gender gap in our previous decomposition analysis in Sect. 3.2. In Figs. 2 and 3 , it can be seen that the incidence of over- and underplacement as well as over- and underestimation of the average salary varies substantially between fields. In line with this argument, other studies that do not account for biased beliefs find significant contributions of differential sorting of male and female students into field of study to the gender gap in wage expectations (Kiessling et al. 2019; Fernandes et al. 2020). Moreover, the unexplained part of the gender gap we find is no longer statistically different from zero at all quantiles considered. This result is in contrast to previous studies examining students’ expected salaries, reporting larger and significant unexplained parts of the gender gap (Reuben et al. 2017; Kiessling et al. 2019; Fernandes et al. 2020). Thus, gender differences in biased beliefs seem to be important drivers of the gender gap in wage expectations and should be taken into account.

5 Conclusion

Based on large and informative survey data on prospective students at Saarland University, Germany, we analyze the gender gap along the conditional and unconditional distribution of students’ expected salaries both for themselves and for an average graduate in the same field and with the same degree. In addition, we construct indicators of biased beliefs and study their impact on the gender gap.

Our results indicate that there are substantial differences in the distribution of expected salaries between women and men. At the median, the raw gender gap amounts to 17 percent in case of the expected own salary and 12 percent in case of the expect average salary. The size of the raw gender gap we find for the expected own salary is comparable to the raw gender gap in the actual salary of university graduates (Francesconi and Parey 2018). We also find a raw gender gap in the expected average salary, suggesting that female students are less optimistic about general earnings prospects in their field of study than male students.

Furthermore, there remain sizable and significant gender gaps along the conditional and unconditional distribution of the expected own and average salaries after accounting for observed characteristics. Comparing the gender gaps at conditional quantiles with the corresponding gaps at unconditional quantiles, we do not find major differences. In both cases, our results suggest a larger negative effect of being female at the bottom than at the top of the distribution. Hence, the structure of the adjusted gender gap in wage expectations mimics a sticky floor effect. Moreover, our decomposition results show that an economically and statistically significant part of the gender gap in the expected own salary can be attributed to biased beliefs. Gender differences in over- and underplacement and misperceptions about average salaries are both important drivers of the total gap. On the one hand, gender differences in biased beliefs account for a substantial part of the explained gender gap. On the other hand, the unexplained part of the gender gap turns statistically insignificant after conditioning also on biased beliefs.

Our findings raise the question how inter-individual differences in beliefs about the own earnings potential relative to others as well as in beliefs about average salaries emerge, and whether and how they can be influenced by parents and educators. If such beliefs are malleable, policy interventions aimed at informing students about the labor market prospects associated with different fields of study and at making girls and young women in particular more confident about their future labor market prospects might help to reduce gender differences in pay.