1 Introduction

One of the most celebrated results in personnel and organizational economics is on the optimal incentive intensity of employment contracts (Holmstrom and Milgrom 1987). A key tenet of this study is that in markets, where employees’ efforts are unobservable and outputs are subject to high volatility due to idiosyncratic market factors, the use of performance measures that are tied to outputs should be less common than in markets exhibiting less volatility and more stable performance. This is because, as agents are risk averse, they will be less willing to accept contracts that tie their earnings to stochastic factors beyond their control. Consequently, the greater the risk associated with the environment, the lower the incentive intensity or performance pay of the employment contract. In the literature, this is often termed as the Incentive Intensity Principle (Milgrom and Roberts 1992).

Despite the clear intuition of the model, the empirical evidence on the relationship between riskiness in the environment and the incentive intensity of payment schemes has been rather mixed, casting doubts on the validity of the model (Prendergast 1999, 2000, 2002) reports the findings from 26 empirical studies, out of which only four find evidence for a negative relationship, as predicted in the theory. This dearth of empirical support has led to the proliferation of new theoretical models that attempt to explain why a negative relationship may not be observed in the field. For instance, Prendergast (2002) argues that the delegation of principal’s authority to an agent can explain the evidence for a positive relationship between incentive intensity and performance measure. Similarly, Budde and Kräkel (2011) show that combining risk aversion to limited liability could account for such a positive relationship. Likewise, Wright (2004) demonstrates that when one accounts for heterogeneous managers differing in their degrees of risk aversion, both negative and positive relationships are plausible.

In this paper, we present a controlled laboratory experiment that tests the relationship between incentive intensity and risk while isolating any alternative explanations. Testing this relationship in the lab has two significant advantages: first, it provides enhanced control, which allows implementing precise values of the parameters of the model (Charness and Kuhn 2010; Camerer and Weber 2013); second, it allows ruling out alternative explanations (such as the ones discussed earlier) of why the relationship observed using field data is weak or non-existent (Charness and Kuhn 2010; Corgnet and Hernán-González 2018).

To the best of our knowledge, this is only the second experimental study that tests the relationship between incentive intensity and risk. Very recently, Corgnet and Hernán-González (2018), henceforth C-HG, report an experiment that independently tests the trade-off between risk and incentives. Our experimental design shares various similarities and some crucial divergences with that of C-HG. Similar to them, we conduct between-subject treatments, in which output is either deterministic (baseline treatment) or stochastic (risk treatments). In one of our risk treatments, as in C-HG, we insure the principal towards the stochastic element to ensure his risk neutrality. However, as a robustness test, we additionally conduct another risk treatment, where the principal is not insured towards risk, allowing for both the principal and the agent to be risk averse. Furthermore, like C-HG, we have collected data on risk aversion for both principals and agents using the Holt and Laury (2002) task. However, whereas C-HG elicits the beliefs of principals on agents’ risk aversion, we rely on social projection (Robbins and Krueger 2005). The key difference between our designs and that of C-HGs, however, is the way the agents exert effort in a principal–agent setting. In both the studies, the principal offers a linear contract to the agent that consists of a fixed wage and a share on agent’s production. However, whereas C-HG employs a real-effort summation task to simulate the agent’s effort choice, we follow the tradition in the gift exchange literature (Fehr et al. 1998; Anderhub et al. 2002; Fehr and Gächter 2002; Fehr and Schmidt 2004; Fehr et al. 2007) in which the agent states/chooses an effort level subject to a (convex) cost of effort function.

As with every design choice, there are certain advantages and disadvantages of using a stated-effort vs. a real-effort task.Footnote 1 A real-effort task is a more intuitive approach closer to the field. However, an underpinning assumption of the Incentive Intensity Principle (henceforth IIP) is that principals (and agents) are aware of the precise cost of the effort function that the agent is subjected to. Using a real-effort task introduces heterogeneity in the cost of effort and requires the principal to rely on his beliefs regarding the agent’s ability to complete the task. In essence, introducing a real-effort task transforms the moral hazard problem into a combined problem of moral hazard and adverse selection. As shown in C-HG, this may not affect the direction of the predictions of the IIP; but it complicates the analysis of the contract of the principal. In addition, in the presence of asymmetric information regarding the agent’s ability, the principal’s optimal contract does not implement the efficient level of effort even in the absence of noise. In addition, also noted by C-HG, it could be argued that the baseline treatment is not strictly deterministic, since the agents may find the solution to the task by luck, and/or the agents may not know their ability of the task with certainty. Using a stated-effort rather than a real-effort task eliminates these informational issues and brings the experiment closer to theory, though it does not come without limitations. As shown in Charness et al. (2004), stated effort in gift exchange games is sensitive to the way that payoffs are presented to the subjects. In particular, they show that providing a payoff table reduces average wages by 19% and discretionary effort by 69%. Even though we do not provide agents with an explicit payoff table, we provided calculators that allowed estimating the potential profits for given effort levels and stochastic random factor outcomes. Nevertheless, since we are interested in treatment effects, we expect such issues not to affect our results.

According to the linear agency model, the optimal effort choice of the agent depends on the marginal cost of effort and is unrelated to the noise in the performance measure. Sloof and van Praag (2008) test this experimentally and compare their results with expectancy theory, a theory developed by psychologists that predicts a negative relationship between effort and noise in the performance measure. In contrast to the current study that focuses on the optimal choice of incentive intensity, Sloof and van Praag (2008) focus on the optimal effort choice in a real-effort (number adding) task with noise. Due to their divergent aim and to reduce complexity, they abstract away from the role of a principal and the subjects had to allocate effort between two different tasks. Their findings are in line with the linear agency model, as their results suggest that effort levels are invariant to the distribution of noise terms.

Overall, our results are broadly in support of the theoretical predictions of the IIP. We find a negative relationship between incentive intensity and risk, in line with theoretical predictions and the findings of C-HG. The principals, on average, offer lower piece rates and higher fixed wages in the risk treatments than in the no risk treatment. However, in contrast to both C-HG and Sloof and van Praag (2008, 2010) who observe an increase in effort with higher risk; we find, again in line with the predictions of Hart and Holmström (1986), that agents respond with lower effort levels when the performance measure is noisier. The most plausible explanation for this difference in effort responses could be the fact that we used a stated-effort approach rather than a real-effort task. Finally, in line with C-HG and Sloof and van Praag (2008, 2010), we find that the effort choices of the employees are not affected by the volatility in the performance measure in contrast to the expectancy theory of motivation (Vroom 1964).

The remainder of this paper is organized as follows: Sect. 2 presents the theoretical framework under the parametric restrictions in the experiment, and Sect. 3 describes the specifics of the experimental design. Section 4 presents the results and Sect. 5 concludes. The instructions as well as proofs of the theoretical predictions are provided in the “Appendix”.

2 The principal–agent model

In this section, we construct a principal–agent model following Holmstrom and Milgrom (1987) and describe the theoretical solutions. First, we consider the case, where there is no stochastic variance on output, and therefore, effort is observable (but not contractible). Second, we consider the case, where output is the sum of effort and a stochastic random factor. Here, the principal cannot distinguish which part of the output is due to the agent’s effort and which part is due to the stochastic random factor.

With no risk, the revenue (\( R \)) depends on the agent’s effort level \( e \), such that \( R\left( e \right) = 50e \). The agent starts with an endowment of effort; s/he bears an effort cost of \( C\left( e \right) = e^{2} \) with \( e \in \left\{ {5,6, \ldots ,10} \right\} \). The principal also holds an endowment and s/he jointly decides on a fixed wage \( F \in \left\{ {50, 51, \ldots ,200} \right\} \) and a piece rate \( \beta \in \left\{ {5, 6, \ldots ,40} \right\} \) that specifies how much Experimental Currency Units (ECUs) that the agent will receive for each unit of effort.Footnote 2 Hence, the agent’s payoff is given by

$$ P^{\text{A}} = F + \beta e - C(e) + {\text{endowment}}. $$

Conversely, the principal’s payoff is

$$ P^{\text{P}} = R\left( e \right) - F - \beta e + {\text{endowment}}. $$

Assuming that both principal and the agent are payoff maximizers, the game theoretic solution, derived in “Appendix A” for the specific parameters used in the experiment, predicts the principal to offer a piece rate of 20 and a fixed wage of 50 and the agent to exert an effort level of 10. The consequent payoff, after taking into account their endowments, for the principal is \( P^{\text{P}} = 450 \) and for the agent is \( P^{\text{A}} = 350 \).

With risk in the environment, the piece rate in output is subject to a stochastic variance (\( V \)), which is normally distributed with a mean of zero. When we make the standard assumption that the principal is risk neutral and the agent is risk averse, then the principal faces a trade-off between incentivizing the agent, and providing him with insurance for the variance in payoffs that is created due to the stochastic random factor. Holmstrom and Milgrom (1987) have shown that the optimal incentive intensity (i.e., piece rate) is given by

$$ \beta^{*} = R^{\prime}\left( e \right)/\left[ {1 + rVC^{\prime\prime}\left( e \right)} \right], $$

where \( r \) is the coefficient of absolute risk aversion (ARA) of the agent. Given that \( V \) in our experiment was set at 2.5 and the rest of the factors remained unchanged, the optimal \( \beta^{*} \) is

$$ \beta^{*} = 50/\left( {1 + 5r} \right). $$

We elicited the coefficient of ARA for all subjects using the Holt and Laury (2002) questionnaire. Note that the model assumes that the principal is aware of the exact value of the coefficient of ARA for each agent. Hence, to generate a benchmark, we calculated an average \( \bar{r} \) from all subjects in our experiment. The average coefficient of ARA from all subject participants was \( \bar{r} \) = 0.549. After inserting \( \bar{r} \) in Eq. 15 in “Appendix A”, it yields the optimal \( \beta^{*} \) = 13.35. With optimal incentive intensity determined, it is easy to show that the optimal effort level for the employee is 7, while the optimal fixed wage remains unchanged at 50 ECU. However, as the participants in our subject pool could select only one decimal, the optimal choice for an employer is \( \beta^{*} \) = 13.4. Consequently, the expected profits (including endowments) for the principal and the agent, respectively, are: \( P^{\text{P}} = 516 \) and \( P^{\text{A}} = 384 \).

3 Experimental design

The experiment was conducted at the University of East Anglia in Z-Tree (Fischbacher 2007) with 360 student participants of a variety of backgrounds. The experiment employed a fictional currency, ECU, which was converted to Great Britain Pounds at the end of the experiment at the rate of £0.02 per ECU. Each session lasted approximately 80 min and the subjects earned on average £9.60, including a show-up fee of £2.00.

Each session consisted of ten rounds, and the first three rounds were practice rounds. At the end of the ten rounds, the subjects had to complete the Holt and Laury (2002) risk elicitation questionnaire. After the completion of the questionnaire, the subjects had to complete two non-incentivized psychology questionnaires (Blais and Weber 2006) that measures risk taking and risk perception. As the first three rounds acted as practice rounds, in the end of the experiment, one of the remaining seven rounds was chosen randomly and was paid privately and anonymously to the subjects in cash, along with any additional earnings from the Holt and Laury (2002) task. A random matching procedure was implemented at the start of each round to control for reputation effects. A positive frame of employer/employee was adopted instead of an abstract frame, as context can be useful to enhance understanding (Cooper and Kagel 2003, 2009) in an organizational setting. In addition, both the employer/employee frame (e.g., Fehr et al. 1998; Karakostas et al. 2017) and the buyer/seller frame (e.g., Fehr and Gächter 2002; Fehr et al. 2007) have been previously used in the context of the gift exchange finding no qualitative differences between the frames.

The experiment had three treatments: no risk (NR), risk (R), and both in risk (BR). We ran 30 sessions in total (10 sessions per treatment) with 12 subjects in each session. The subjects were split evenly as employers or employees and maintained the same role throughout the experiment. The instructions were common for both employers and employees. After the subjects had read the instructions, they had to answer control questions to ensure that they properly understood the instructions. If a subject provided a wrong answer in any question, a detailed explanation appeared in his/her computer screen. All three treatments were identical in every aspect apart from how risk affected the profit functions of the employers and employees.

In the NR treatment, the employer had to offer an employment contract to the employee requesting him or her to exert a level of effort. In the employment contract, the employer specified the size of the fixed wage, piece rate and a suggested effort level. The fixed wage could range between 50 and 200, the piece rate between 5 and 40, and the suggested effort level between 5 and 10.Footnote 3 Then, the employee had to decide whether to accept or reject the contract offer. If the contract was rejected, the round finished and both subjects earned only their endowments (200 ECU). If the employee accepted the contract, then s/he had to decide an effort level between 5 and 10. Recall that exerting effort was costly and the cost of effort was given by the function \( C\left( e \right) = e^{2} \). The total revenue for the employer was given by \( {\text{TR}} = 50e \). After the agent decided an effort level, experimental payoffs were given by

$$ \pi_{\text{Employer}} = {\text{Endowment}} + 50e - \left[ {{\text{Fixed}}\,{\text{wage}} + \left( {{\text{Piece}}\,{\text{Rate}}} \right)e} \right], $$
$$ \pi_{\text{Employee}} = {\text{Endowment}} + {\text{Fixed}}\,{\text{wage}} + \left( {{\text{Piece}}\,{\text{Rate}}} \right)e - C\left( e \right). $$

In the R treatment, the profit function of the employee was altered to incorporate the risk associated with the incentive measure (the piece rate). According to the theory, the random factor is assumed to generate noise in the performance measure not allowing the principal to directly observe the effort choice of the agent.Footnote 4 Given that the principal is assumed to be risk neutral and the random factor is assumed to be normally distributed with a mean of zero, s/he will not be affected by the associated risk in the total revenue or on its impact on the piece rate. The main concern for the principal is whether the risk dilutes the incentives generated by the piece rate for the risk averse agent. Although the principal is assumed to be risk neutral, as s/he is able to diversify the associated risk; given that s/he is assigned only one agent, this assumption may not be justifiable. To overcome this problem and to ensure the principal can indeed act as risk neutral, the risk component was removed from his or her profit function. Hence, the experimenter acts as an insurer for the principal allowing the principal to act as if s/he was risk neutral. This allows us to rule out any effects from the principal being risk averse, which could deviate from a key assumption of the theoretical model. Therefore, the profit function of the employer was held unchanged, whereas the piece rate that was paid to the agent was formulated by the sum of the effort and the random factor (\( x \)):

$$ \pi_{\text{Employer}} = {\text{Endowment}} + 50e - \left[ {{\text{Fixed}}\,{\text{wage}} + \left( {{\text{Piece}}\,{\text{Rate}}} \right)e} \right], $$
$$ \pi_{\text{Employee}} = {\text{Endowment}} + {\text{Fixed}}\,{\text{wage}} + \left( {{\text{Piece}}\,{\text{Rate}}} \right)\left( {e + x} \right) - C\left( e \right). $$

Finally, although not directly related to our research question, we ran the BR treatment as a robustness check to test whether imposing risk neutrality to the principal in the R treatment has an effect in his/her behavior. Hence, in this treatment, the profit function of the agent remained the same as in the R treatment, but the principal was also subject to risk:

$$ \pi_{\text{Employer}} = {\text{Endowment}} + 50\left( {e + x} \right) - \left[ {{\text{Fixed}}\,{\text{wage}} + \left( {{\text{Piece}}\,{\text{Rate}}} \right)\left( {e + x} \right)} \right], $$
$$ \pi_{\text{Employee}} = {\text{Endowment}} + {\text{Fixed}}\,{\text{wage}} + \left( {{\text{Piece}}\,{\text{Rate}}} \right)\left( {e + x} \right) - C\left( e \right). $$

If the imposition of risk neutrality in the R treatment indeed has no effect on principal behavior, then one would expect (statistically) similar outcomes in the R and the BR treatment. However, if there is a difference in the outcomes, then the risk neutrality assumption for the principal will turn out to be a crucial one.

Table 1 summarizes the theoretical predictions for the fixed wage, piece rate, and effort for the R and NR treatments. The model suggests that the optimal piece rate \( \beta \) will be 20 in the NR and 13.35 in the R treatment (“Appendix B”), since the employer partially insures the employee. Hence, following the theoretical predictions, we can coin the following hypotheses.

Table 1 Theoretical predictions

Hypothesis 1

The piece rate is lower in the R treatment than in the NR treatment. In particular, the piece rate offered by the principals in the NR and R treatments are equal to 13.4 ECUFootnote 5 and 20 ECU, respectively.Footnote 6

The agent’s effort level is expected to depend only on the piece rate offered by the principal (and not on the fixed wage or the noise in the performance measure), as it formulates the incentive constraint of the agent.

Hypothesis 2

Effort level depends solely to the piece rate the principal offers to the agent. The fixed wage level and the noise in the performance measure have no influence on the effort level chosen by the agent.

Note that theoretically the principal is assumed to know the coefficient of ARA of the agent to determine the optimal piece rate in the R treatment. Although we had examined the possibility of obtaining the coefficient of risk aversion of the agents in advance and provide it to the principals, we believe that this information would be very difficult to be interpreted. Therefore, we relied on the concept of social projection (Orbell and Dawes 1991). According to the social projection theory, each player will project his own characteristics to others and use them as a cue on how they are more likely to behave [see Krueger (2007) for a review]. If we assume that the principal will use social projection to infer how risk averse the agent they are matched with is, then we can formulate the following hypothesis.

Hypothesis 3

The more risk averse the principal is, the more risk averse s/he expects the agent to be; and as a result, the smaller the piece rate that s/he offers to the agent.

4 Results

In this section, we first present the descriptive statistics and the results of the session-level two-tailed non-parametric tests on piece rates, fixed wages, and effort levels across the treatments. Then, we continue with regression analyses with respect to the piece rates and effort.

4.1 Descriptive statistics and non-parametric tests

Table 2 summarizes the mean values of the aforementioned variables. When there is noise in the environment (R), employers offer on average a higher fixed wage and a lower piece rate than when there is no noise (NR). In addition, the effort level is smaller in the R treatment than in the NR treatment (and almost the same as in the BR treatment).Footnote 7

Table 2 Average fixed wages, piece rates, and efforts across treatments

We conducted pairwise Mann–Whitney tests for fixed wages, piece rates, and effort levels across the treatments. Table 3 provides a summary of the p values of these tests. The variable is depicted on the horizontal axis and the treatment comparison on the vertical axis. As can be seen, only the difference between piece rates in the R and the NR treatments is statistically significant (Mann–Whitney p = 0.034). These are summarized in Result 1.

Table 3 M–W tests for fixed wage, piece rate, and effort across treatments

Result 1 In line with Hypothesis 1, the piece rate is significantly lower in the R treatment than in the NR treatment.

Interestingly, the average piece rate that was offered by the principals in the R treatment was significantly larger than the piece rate predicted by the model (Mann–Whitney p = 0.023).

Result 2 In contrast to Hypothesis 1, on average, the piece rate offered in the R treatment was statistically significantly larger than the predicted piece rate of 13.4.

Note, further, that the piece rates in the BR treatment is in between the piece rate in the R and the NR treatments; and it is not statistically different from either of the two. This can be interpreted, as injecting risk to the principal does not significantly change behavior (BR vs. R).

Table 4 provides a summary of the Spearman correlation tests for fixed wage, effort, and piece rate—for each treatment. By conducting Spearman correlation tests between the three variables above, we found (as expected) a very strong positive correlation between effort and piece rate. In addition, we observe a negative correlation between fixed wage and piece rate, implying that the employers use the two tools as substitutes to each other. Finally, we observe a weak negative correlation between the fixed wage and effort, in line with C-HG and Sloof and van Praag (2010). However, as we discuss in the following section, when we control for the impact of piece rates on effort in regression analyses, we observe a weak but positive relationship between effort and fixed wages, ceteris paribus.

Table 4 Spearman correlation coefficients for fixed wage, piece rate, and effort

Interestingly, we see that even though the correlations between piece rates and effort are quite stable across treatments, this is not true for the correlations for effort and fixed wages, and fixed wages and piece rates. Instead, we observe that employers treat fixed wages and piece rates as much weaker substitutes in NR than in R and BR (− 0.291 relative to − 0.473 and − 0.352). This indicates that when the employees where not subject to risk, a piece rate seemed a relatively more attractive alternative to fixed wages.

4.2 Regression analyses

In this section, we report the results of panel regressions with random effects with data from all the three treatments at the subject level and error clustering at session level first for the piece rates (Table 5) and then for effort (Table 6).Footnote 8 The regressions employ dummy variables for the experimental treatments, with the NR treatment as baseline. As three different measures of risk, (Holt and Laury (2002), risk taking, and risk perception) were collected, in each regression, only one of them is used at a time. All three different measures of risk have been centered.Footnote 9 In addition, relying on the literature on social projection, we use the \( r \) coefficient of the employer instead of the employee, assuming that the employer would expect the employee to be as risk averse (or loving) as s/he is. Furthermore, we used interaction variables between each of the risk elicitation measures and the R treatment to capture any potential interaction effects between risk attitudes and the R treatment. Finally, we used dummy variables for nationality, gender, and economics students. We retained one observation per round for each subject, i.e., a total of 1223 observations.Footnote 10

Table 5 Panel regressions on piece rate (β) with random effects at subject level and error clustering at session level
Table 6 Panel regressions on effort (e) with random effects at subject level and error clustering at session level

The results from the regressions on piece rate in Table 5 are in parallel with the findings of the non-parametric tests. In particular, in all the regressions that are presented in Table 5, the coefficient for the R treatment dummy is negative and statistically significant at the 5% level: reinstating Result 1 that the higher the noise in the environment the smaller the piece rate offered by the employers.

The coefficient for the control treatment (BR) dummy is not statistically significant: as found in the non-parametric tests. Moreover, post-regression tests suggest no statistically significant difference between the BR and R treatments: suggesting that our implementation of Principal’s risk neutrality worked as we hypothesized, and the result is robust.

Result 3 There is no statistically significant difference between the average piece rate offered in the R treatment and the BR treatment.

In regressions 24 in Table 5, we find that risk aversion has no impact on the size of the piece rate. However, after we introduced interaction variables between the risk treatment and each of the risk elicitation measures, we observe a statistically significant impact of the risk aversion coefficients both for the Holt and Laury (2002) task and the risk taking questionnaires. This provides our next result.

Result 4 In line with Hypothesis 3, assuming social projection, the more risk averse the principal believes the agent is, the smaller the piece rate s/he offers.

In addition, we find, as observed in Table 5, that the principals’ view fixed wage and piece rate as substitutes. As a result, the coefficient for fixed wage is negative and statistically significant. The suggested effort is strongly correlated with the piece rate—indicating both an expectation of the principal and possibly his/her understanding of the convex cost function of the agents. The coefficients for the demographic controls are not significant except the British dummy, which is positive and weakly significant. This may possibly reflect the cultural/social differences (relatively costly labor, minimum wage restrictions, etc.) of the British students relative to the international students.

Turning our attention to the regressions on the effort levels in Table 6, we observe that none of the treatment dummies have significant effect in the effort levels. This is in line both with the theory and the findings of C-HG and Sloof and van Praag (2008).

Result 5 In line with Hypothesis 2, the effort choices of the agents are not affected by the introduction of variance in the performance measure.

We also find statistically significant positive coefficients for both the piece rate and the fixed wage. The first finding is in line with the model’s prediction and payoff maximizing behavior. However, significant positive coefficients for fixed wage contradict with the standard theory, as well as the findings of C-HG and Sloof and van Praag (2008). Given that we have implemented a stated effort, this result could be explained through social preferences and/or reciprocal behavior [see, for, e.g., Fehr et al. (1998)] of the agents. These are summarized formally in the following results.

Result 6 In line with Hypothesis 2, the agents responded with higher effort, and the higher the piece rate offered by the principals.

Result 7 In contrast to Hypothesis 2, the agents responded with higher effort the higher the fixed wage offered by the principals.

None of the other explanatory variables and controls are significant. Implying that a suggested effort (given fixed wage and piece rate) does not affect the effort choice. Furthermore, demographic factors, e.g., nationality, gender do not affect the effort choice.

5 Discussion

An important theory in personnel economics, the Incentive Intensity Principle, states that as volatility in performance increases, performance driven contracts should be employed less. Empirical evaluation of this theory until date has provided mixed results. However, since field data may include confounding factors and lack controls, the results might not be free from noise. Laboratory experiments can eliminate such issues. In this study, we investigate the relationship between risk and incentives through a principal–agent experiment.

This is only the second experimental study in this area after C-HG, who implement a real-effort task and find broad support for the theory. They explain some deviations in their findings with the canonical model with loss aversion. Our findings with respect to the relationship between risk and incentive intensity are in line with C-HG and provide further support for the predictions of the IIP. This also confirms the argument that the empirical studies, which found a positive or no relationship, may have been due to the variety of other confounding factors. Consequently, our findings provide support to theorists who introduced alternative dimensions or additional variables to explain the observed positive relationship in the previous studies (e.g., Prendergast 2002; Wright 2004; Budde and Kräkel 2011).

Supporting the model’s prediction, we find no relationship between the variance in the performance and the effort choice of the agent. Furthermore, we find a positive relationship between effort and the size of the piece rate offered (ceteris paribus), which is in line with the predictions of the model, and more generally with the assumption of payoff maximizing behavior found in the literature on incentive contracts (Anderhub et al. 2002; Corgnet and Hernán-González 2018; Karakostas et al. 2017; Sloof and van Praag 2010).

In addition, we observe that whereas the level of effort in NR is significantly lower than the value predicted by the theory, it is still higher than in treatment R. This result differs from Sloof and van Praag (2010) and C-HG who employ real-effort tasks, suggesting that a higher level of effort in a risky environment can be explained if agents are loss averse. Agents may exert a higher effort to increase the probability of being above their reference earnings level. However, this may not be possible when the (stated) effort, as in our setting, has a monetary cost. In this case, increasing the level of effort may not help to obtain higher earnings. Hence, our result can shed light on how different tasks provide diverse results through different mechanisms. As this suggests that using stated rather than real effort may lead to deviations in effort choices, the impact of the two methods in behavior is worthy of further scrutiny.

Moreover, the agents respond positively to higher fixed wages by exerting more effort—contradicting both the standard theory, and the experimental findings. This result, however, is consistent with the previous studies on labor contracts in which agents hold social and reciprocal preferences. Indeed, such a positive relationship between the fixed wage and effort is observed in the existing experimental studies on gift exchange (Fehr et al., 1998; Fehr and Schmidt 1999; Fehr and Gächter 2002; Fehr and Schmidt, 2004, 2007).

Hence, our experiment supports the comparative statics but not the point estimates predicted by the theory. However, this is not uncommon in ‘theory-testing’ experiments, where the micro-foundation of behavior is reflected through treatment effects, but possible noises (e.g., mistakes, preferences) distributed symmetrically across treatments make deviations from the point estimates (Anderhub et al. 2002; Fehr and Schmidt 2004; Hoppe and Schmitz 2015).

Adding to the (thin) existing literature, we introduced a treatment in which both the principal and the agent faced risk. We find that the results from this treatment are not different from the treatment in which only the agent faces risk, and the principal is insured. This result will allow future studies to abstract away from the issues relating to the principal being risk averse and to use simplified design while focusing on their specific research questions.

However, we also observe that the majority of the offers in the no risk treatment were with a suboptimal piece rate, which is in stark difference with the existing studies. A potential explanation for such lower offers may be due to our implementation of a minimum wage, which may have acted as a reference point for the employers driving downwards the offers of the principals. However, since minimum wage is a real-life phenomenon, this is an important observation as well as an interesting avenue for future research.

In conclusion, we test the predictions of the IIP in a laboratory setting in which we employ a stated-effort task, check the robustness of insuring the principal or not, and introduce the concept of social projection. We find support for the negative relationship between risk and incentives and our results are broadly in line with the existing experimental literature. However, we also find some significant and interesting differences with the existing studies that reiterate the importance of laboratory experiments in testing and extending theory. Our study can be extended in various ways. An interesting extension, for example, would be to allow the subjects to form reputation in a repeated setting, therefore, more accurately reflecting the real world.