1 Introduction

This paper considers both luck and effort, and their incorporation into principles of social justice. In particular, we investigate the support for the idea that Society should compensate an individual for low income due to bad luck, but not for low income due to lack of effort. This idea is inspired by the Principle of Accountability.Footnote 1

The following simple model provides a framework for articulating these ideas. It assumes that an individual’s exogenously and randomly determined luck is either bad or good, and that the chosen effort level is either none or some. For all individuals, the raw, pre-redistribution income is a function of effort and luck. A numerical example of a possible situation before Society has imposed redistribution is shown in Table 1.

Table 1 Pre-distribution incomes in four different states of the world

People with bad luck have lower incomes than those with good luck (given any effort level), and those who make no effort have lower incomes than those who make some effort (given any luck). The question we are investigating in this paper is whether Society should compensate people for bad luck and/or for lack of effort. Society can do this through the distribution of some social dividend.

Different liberal approaches to social redistribution,Footnote 2 such as the Principle of Accountability, or liberal egalitarianism (Cappelen & Tungodden, 2006) or luck egalitarianism (Cohen, 1989), or, more loosely speaking, meritocracy, appear to provide a solution to the choice of the distribution rules that Society should apply: respecting the effort decisions of individuals, yet treating equally individuals who are subject to different levels of luck. This implies compensating individuals for bad luck, but not compensating them for choosing not to provide effort. Unfortunately, this Principle of Accountability is not generally implementable; as demonstrated in a different context by Cappelen and Tungodden (2006), building on Bossert and Fleurbaey (1996), there are several situations in which this simple meritocratic principle cannot be implemented, making compromises necessary.

This paper reports on an experiment eliciting preferences for redistribution principles related to the Principle of Accountability, and the compromises that subjects prefer to make when the Principle cannot be fully implemented.

The focus of this paper is on the elicitation of preferences on social compensation principles; our aim being to assess the extent of the support for the Principle of Accountability. Some of the experimental literature indirectly infers subjects’ preferences for redistribution from subjects’ choices. While that is a valid experimental methodology, we think that directly eliciting preferences over principles yields more easily generalisable results, and does not require the use of econometric techniques to identify the preferences from the noisy choices. Moreover, since the attractive Principle of Accountability is not in general applicable, in a choice-based design the experimenter has to choose between avoiding conflicting situations, and hence being unable to observe which compromises subjects are willing to make in these situations, or expose the subjects to those situations, and hence adding substantial amounts of noise to the choice data. We hence develop an experiment to elicit directly preferences over principles, and directly observe which compromises subjects are prepared to make in the face of impossibilities.

Given its intuitive appeal, we expected many of our subjects to agree with the Principle of Accountability. Our design allows us to produce exploratory results on the compromises that subjects are willing to make when faced with the inapplicability of the Principle to specific situations.

The paper is organised as follows. In Sect. 2, we explore the implication of compensation rules, focusing in particular on the non-generalisability of the Principle of Accountability solution. Related literature is discussed in Sect. 3. The experiment that we conducted is described in Sect. 4. Section 5 presents our results, and Sect. 6 concludes.

2 Compensating for income differences

We consider the following framework. Assume there is a social fund, of value X, to be divided between the individuals in Society. Each individual’s dividend (D) from this fund is a supplement to (if positive) or a diminution of (if negative) their income (Y), giving their final total payment (P = Y + D). Therefore, these dividends provide a means of compensating for income differences such as those in Table 1. The following examples all assume a Society of four individuals, as in our experiment. Corresponding examples may easily be constructed for Societies of any size.

Dividends can be used to achieve different social redistribution goals.

One such goal is to compensate fully for all income differences, be they due to luck or effort, thus achieving (final) payments equal for all. Two examples of how this principle plays out in different configuration of incomes, luck and effort choices in the Society are demonstrated in Table 2. Table 2A shows how full compensation can be reached with a social fund of 40 (X = 40). The result is Equal Payments of 15 (the total income divided by 4). Table 2B illustrates, by comparison with Table 2A, that the dividends and payments entailed by equal payments for all depend on, inter alia, the number of individuals in each cell of the matrix.

Table 2 Examples of full compensation for all income differences (Equal Payments)

The diametrically opposite principle is no compensation at all for any income differences; this can be characterised as equal dividends for all, as can be seen from the example in Table 3A.

Table 3 Examples of no compensation (A) and compensation using the Principle of Accountability (B and C)

The Principle of Accountability, which suggests compensating only for those income differences due to luck can then be characterised as a hybrid principle (a ‘compromise’ to quote Cappelen & Tungodden, 2006), comprising (PA) Equal Payments for Equal Effort, and (PB) Equal Dividends for Equal Luck. Table 3B illustrates. At each level of luck, the income differential due to effort, here 6, is preserved in the payment differential. All ex-post income differences due to luck have been compensated, while ex-post inequalities due to effort have not. Table 3C again illustrates that the dividends and payments implied by (PA) and (PB) depend on the distribution of individuals across the four income positions.

Given its intuitive appeal, we expect most of our subjects to agree with the idea of Principle of Accountability. In this case, we expect them to prefer Equal Payments for Equal Effort and Equal Dividends for Equal Luck, if there is no conflict between these two principles.

Unfortunately, the Principle of Accountability is not always implementable in practice, as Fleurbaey (2008) and many others have pointed out. In Tables 1, 2, 3, the income differential due to effort is assumed to be the same, here 6, at each level of luck. Without this assumption, however, (PA) and (PB) are generally incompatible. This is illustrated in Table 4, where the income at good luck and low effort is 8, instead of 4 as previously assumed.

Table 4 Examples of inapplicability of the Principle of Accountability

In Table 4A, dividends are equal at each level of luck. Given this, payments can be equal at only one level of effort. Thus equal payments for those choosing low effort, as shown here, entail unequal payments for those choosing high effort.

In Table 4B, payments are equal at each level of effort. Given this, dividends can be equal at only one level of luck. Equal dividends for those with bad luck, as shown here, entail unequal dividends for those with good luck, and thus a payment differential greater (in this case) than the income differential.

Generally, given such income values, (PA) and (PB) can be satisfied for any three of the four rows and columns, but not for all four.Footnote 3 An advocate of (PA) and (PB) needs to decide which of these should have priority in the event that they are incompatible, as in Tables 4A, B. Note, however, that this assumes that all four income positions are occupied. As Table 4C illustrates, with (at least) one position unoccupied it is, after all, possible to satisfy (PA) and (PB) for all individuals in this Society.

A formal proof of the lack of generality of the Principle of Accountability is given in Appendix 2.Footnote 4

Our design allows us to produce exploratory results on the choices of subjects when there is a conflict between the preferred principles (PA) and (PB).

3 Related literature

Our experiment was designed to assess the support for the Principle of Accountability. As much of the literature makes clear, we need to specify who is expressing support. Is it a member of the Society, or an interested outside observer? Many experiments have two treatments to capture these two points of view: an Internal Dictator treatment and an External Dictator treatment. In the former, we need also to specify what the Internal Dictator knows about hisFootnote 5 position in Society. The specification could vary from trying to reproduce Rawls’ Veil of Ignorance (Rawls, 1971), where the Internal Dictator does not know his future position in Society,Footnote 6 to a situation where the Internal Dictator does know his position.Footnote 7 In this latter case, the experiment essentially determines the ‘other-regarding preferences’ of the subjects.

Our experiment considers a case in which the income of an individual is a function of a choice (endogenous) variable and an exogenous variable, but we start our discussion here with the simpler one-dimensional case. Many of the previous experimental contributions to the literature have considered just that–dividend rules in one dimension – most commonly with exogenous risky income. Experiments using this latter approach, sometimes with both an Internal and External Dictator treatment, include Bukszar and Knetsch (1997), Cabrales et al.,(2010), Frignani and Ponti (2012), Herne and Suojanen (2004), Johannesson and Gerdtham (1995), Schildberg-Horisch (2010) and Sutter and Weck-Hanneman (2003). Some of these studies use a Dictator game from the game-theoretic literature, but all of them get one of the subjects, either pre-specified, or chosen randomly after all have expressed their preferences, to play the Dictator.

A recent strand of the literature has taken the debate into two dimensions: the two dimensions that we are considering–one exogenous and the other endogenous to the individual. Prominent in this literature are Cappelen et al.(2007, 2010 and 2013) and Konow (2000). Once again, they use a one-shot Dictator game, and the two dimensions are the amount of work that the subjects do, and the rate of pay they get for the work (and, in Cappelen et al.(2010), their productivity). The rate of pay is exogenous while the amount of work is endogenous. We adopt a similar approach (though we omit Cappelen et al.(2010)’s productivity). However, our study differs from theirs in that we ask our subjects to state general principles of distribution relevant to the Principle of Accountancy, while they ask either for different principles or for preferred distributions in particular contexts and try to infer the underlying general principles from these particular cases.Footnote 8 In addition, our study is different from theirs in the way we try to create the Veil of Ignorance with an Internal Dictator.

There are also important experimental papers by Mollerstrom and associates (Mollerstrom et al., 2015 and Cappelen et al., 2019), which have some features in common with ours. Mollerstrom et al.(2015) focus on two kinds of luck: option luck and brute luck. Brute luck is outside the control of the individual while option luck is partially controllable through an insurance decision that, in terms of individual responsibility, has a similar role to effort in our experiment. For each subject-pair, income inequality can result from varying combinations of brute and option luck for the two subjects individually. An external dictator has to decide ex ante, for each of those different possibilities, whether or not incomes are to be equalised between that subject-pair. Cappelen et al. (2019) extends the setup to one in which the external dictator is unable to tell with certainty the role of brute luck versus option luck; interestingly, subjects were significantly more egalitarian in this version. In their requirement of ex ante decision-making, covering exhaustively all possible unequal situations, this approach comes closer to ours than do many experiments in this field. Even so, in this setting preferences over distributional principles can be only inferred statistically, rather than directly elicited. Also, as already discussed, a primary reason for our interest in explicit preferences over principles lies in the possibility of such principles being incompatible.

A recent experiment by Becker (2013) focuses specifically on the luck/effort trade off, though her prime interest is in the effect of the type of luck on people’s principles (she considers wage luck, endowment luck and payment luck – essentially framing differences). She concludes that “Our results suggest that normative beliefs about which share should be taken are driven mostly by the factor that subjects’ can be held responsible for, namely effort, whereas actual behavior sometimes deviates from this norm dependent on which type of luck is introduced.” The frame is important, and part of the Principle of Accountability is supported.

Bjerk (2016) also ran an experiment with two dimensions – effectively luck and effort, and considered the effect of the type of Dictator on stated preferences. He was interested in the re-distribution rules that subjects favoured. Each player had to propose a “redistribution rule” which was specified by p, a number between zero and one, “where p represents what fraction of each participant’s task earnings will be taken from each participant and split equally across all participants”. Clearly if p is zero, then no re-distribution occurs; this equivalent in our context, to Equal Dividends for all. If p is one, this is equivalent to our Equal Payments for all. However, there is no value of p for which we get (PA) Equal Dividends for Equal Effort and (PB) Equal Payment for Equal Luck. Hence, his principles do not correspond to ours. Deffains et al. (2016), Espinosa et al. (2020) and Gerber et al. (2014) also investigate the Bjerk (2016) re-distribution principles. Once again, we are investigating different principles of social justice. One advantage of their re-distribution schemes is that they are always implementable, unlike ours, where compromise might be necessary.

A seminal paper is that of Konow (2000). He “differentiates discretionary from exogenous variables. A discretionary variable is one that affects production and that the individual can influence, which in this experiment is the subject’s input or letters produced. An exogenous variable is one that the person cannot reasonably influence but that may have an impact on output.” These are equivalent to our effort and luck respectively.

4 The experimental design

One feature crucially distinguishes our experiment from some of the previous related experiments: we did not ask subjects to take decisions for particular ex ante income distributions; instead, we asked them to select a compensation principle that could be applied to any ex ante income distribution. We realise that this essentially forces subjects to have a consistent compensation principle; the contrary procedure, of trying to infer an underlying principle from decisions for particular distributions, might well lead to imprecise or undefined inferred underlying principles, especially for the cases in which the principles cannot be applied.

This methodology of eliciting principles has been used elsewhere, most notably in game theory experiments, where principles are called strategies, and where the statement of a strategy is simple. In other contexts, where decisions are dependent on many variables, principles/strategies are usually not elicited because of the complexity of stating them. In our experiment, principles can be simply stated: either Equal Payments or Equal Dividends. The difficulty for subjects in our experiment was not in understanding what they are being asked to do, but rather in appreciating the consequences of their chosen principle.

Our experiment had two dimensions, luck and effort, each of them having just two values: luck, good or bad; effort, some or none. Thus, there were four possibilities: (good luck, some effort), (bad luck, some effort), (good luck, no effort) and (bad luck, no effort). As before, we represent these possibilities with the cells of a two-by-two matrix in which the rows are bad luck and good luck and the columns are no effort and some effort, and the cell entries are the ex-ante income levels. These cell entries were ex ante random (between given and known lower and upper bounds). There was a social dividend of a fixed amount that could be distributed by a rule that would be determined by the stated principles of one of the subjects over these four cells.

We had two different treatments: (1) an Internal Dictator treatment in which the rule would be that of a randomly-selected member of Society after preferences were elicited; (2) an External Dictator treatment in which groups of five subjects expressed their preferences, after which one of them was randomly chosen to be outside the Society consisting of the other four, and his principles imposed on the other four, while the chosen dictator would get a flat payment. The External Dictator treatment captures the idea of an external person who is not part of Society but has preferences about its organisation. The Internal Dictator treatment was designed to reproduce as far as possible Rawls’ Veil of Ignorance, in which subjects do not know ex ante their position in Society: as far as luck is concerned, it is clear that subjects did not know ex ante whether they would have good luck or bad; as far as effort is concerned, we tried to make their future decision (as to whether to make an effort or not) as uncertain as possible, by making the initial incomes uncertain.Footnote 9

Subjects were asked to state their preferred compensation rules/principles. We deliberately restricted the set of rules from which they could choose. To be specific, we asked them to state, separately, for each row and for each column of the matrix, whether they preferred Equal Dividends (no compensation) or Equal Payments (full compensation) in that row or column.

Unfortunately, as we have already noted, some combinations of principles are incoherent, and cannot be jointly implemented. To accommodate this, if a subject wished to state mutually-exclusive principles,Footnote 10 then he was also asked to state which one of these conditions was droppable,Footnote 11 if not all conditions could be simultaneously implemented.

Subjects were told that, after all had stated their preferred compensation rules, each of them would be randomly assigned bad or good luck (if appropriate), and that each of them would be asked (if appropriate) whether they wished to ‘make an effort’ or not. ‘Making an effort’ was implemented by them staying in the laboratory for an extra 20 min and completing a simple questionnaire; this we call Part 2. ‘Leave’ and ‘Stay’ below refer to their decision, and correspond to the ‘no effort’ and ‘some effort’ of the theory.

We realised that stating their preferred rules for the full matrix might be a difficult task. Therefore, we simplified the problem for the subjects by building up to the full matrix in stages, with four simple scenarios, in which subjects were faced with just a row or just a column of the matrix:

  • Scenario 1a: All members Leave after Part 1; members individually have Bad or Good luck;

  • Scenario 1b: All members Stay for Part 2; members individually have Bad or Good luck;

  • Scenario 2a: All members have Bad luck; members individually choose to Leave or Stay;

  • Scenario 2b: All members have Good luck; members individually choose to Leave or Stay;

  • Scenario 3: Full matrix.

We asked them in each scenario whether they preferred Equal Dividends (no compensation) or Equal Payments (full compensation). In all the four simple scenarios, the decisions, and their implications, were clear and simple, since just one dimension was varied at each time, the other being fixed. The scenarios were presented to subjects in the fixed order 1a, 1b, 2a, 2b and 3. While this might have created order effects, we chose the fixed order as we needed the full matrix to come at the end, and we saw no particular reason for order effects to have an effect on the elicitation. We deliberately included the simple scenarios to assess support for the two elements of the Principle of Accountability individually.

Subjects knew that, at the end of this part of the experiment, one of the above five scenarios would be selected at randomFootnote 12 and played out; subjects would be paid according to the dictator’s choices, their individual luck and effort level, and the ex-ante payments that would be determined randomly. The playout differed according to scenario. In Scenario 1b, for example, all members would (with common knowledge) be required to Stay for Part 2 and the playout would simply assign each member either Bad or Good luck, followed by the application of the compensation rules, that is, either Equal Payments or Equal Dividends, that had been chosen by the Dictator for Scenario 1b. In Scenario 2a, by contrast, all members would (with common knowledge) be assigned Bad luck, and each would then decide whether or not to Stay for Part 2, again followed by the application of the compensation rules that had been chosen for Scenario 2a. The playout in Scenario 3 was as already described: independently and privately, each member was assigned either Bad luck or Good, and then decided whether to Stay for Part 2.

In terms of practical detail, the experimental sessions were organised as follows: preceding the decision stage (Stage 1) was a (silent) PowerPoint show explaining the task, auto-timed and running simultaneously on each subject’s computer screen, and lasting 15 min. Each subject also had a hardcopy complete handout of that show, to which they could refer at any time during Part 1.Footnote 13 They could ask questions at any stage; few did.

Stage 1—the essence of the experiment–took 20 min. Subjects considered the five scenarios in sequence, starting with Scenario 1a and ending with Scenario 3. In each case, our experimental softwareFootnote 14 allowed each subject, before making a choice, to explore, through simulation, the effects of different compensation rules, against different hypothetical positions of the different members of society and intermediate income values, all controllable by the subject. Appendix 1 contains screenshots, for Scenarios 1a and 3. For each elementary scenario, subjects could explore the effects of their choices for two minutes, after which they had 30 s to submit their choices. Exploration could continue up to the moment of submission. For the more complex full matrix scenario (Scenario 3), subjects had eight minutes of simulation followed by a two-minute period for submission (during which exploration could continue). The countdown clock can be seen in the screenshots, above the large ‘Submit your choice’ button that was greyed out until the submission period. The exploration phase was meant to allow subjects to understand fully the implications of their decisions for a variety of different situations.

For the full scenario, if at any time in the simulation the subject had a mixed compensation rule (a combination of Equal Payments somewhere and Equal Dividends elsewhere) and a saturated profile (that is, all four cells of the matrix were filled), then the program prompted him to select one of the four elements (in the graphic, one of the two rows or one of the two columns) as droppable, before he could continue. This meant that in case of inapplicability of the chosen rules, the rules would not be applied for that particular row or column. It then revealed the hypothetical implications of this for the dividends and payments, as for example in the scenario 3 screenshot in Appendix 1, where the droppable element is identified by the check box. A similar prompt appeared if he tried to submit as his choice a mixed compensation rule without having selected a droppable element.

In the event of non-submission within the allotted time, (known) default rules were in place. Within each Society, the same default rule (either Equal Dividends or Equal Payments) applied to all subjects in their choices for all four elements, and in all five scenarios.Footnote 15 To control for default-bias, we gave half of our societies a default of Equal Dividends, and the other half Equal Payments. This produced a 2 by 2 treatment-structure as set out in Table 5, which also records the number of subjects in each treatment.

Table 5 The 2×2 treatment structure, with subject numbers

Once the elicitation stage was completed, one member of the Society was randomly chosen as a dictator, and then all members were anonymously informed of who was the dictator, which scenario was going to be payoff-relevant, the ex-ante income values, and the rules chosen by the Dictator. This was followed by the actual playout of the drawn scenario. The final playout used a software interface similar to that in Stage 1. First, the software generated random incomes in each of the four cells. Then, in Scenarios 1a and 1b, each subject individually clicked a button to reveal his luck, while in Scenarios 2a and 2b each subject individually registered his effort decision by clicking one of two buttons, labelled Leave and Stay respectively. In Scenario 3, each of these events occurred in turn. In all cases, the onscreen display then revealed the positions of each of the four members of that Society, and the consequent incomes, dividends and payments.

Therefore, the overall sequence of events in Part 1 was as follows:

  • Stage 1 Each member chooses compensation rules for each of the five possible scenarios;

  • Stage 2 One member is randomly selected as Dictator (and in the External Dictator treatments takes no further part);

  • Stage 3 Each member is informed:

    • which (numberedFootnote 16) member is selected as Dictator for his Society;

    • which of the five scenarios applies to his Society;

    • the income values relevant to that scenario;

    • the compensation rule, as chosen by the Dictator;

  • Stage 4 The scenario is then played out, determining for each subject whether he has Bad or Good luck, whether or not he Stays for Part 2, and also his final payment.

Part 1 was then complete. Those subjects staying for Part 2 remained in their places, and completed the simple task. The others left, collecting payment on the way out.

There was a total of 240 subjects; subjects were predominately students at the University of York; they spent on average 25 min in the laboratory if choosing not to stay for part 2, and 45 min if they stayed; since about half subjects chose or were forced to stay, on average subjects spent about 35 min in the laboratory, and earned on average £17.42.

The essence of the design was producing an experimental setup in which subjects could express their preferences for (PA) Equal Payments for Equal Effort and (PB) Equal Dividends for Equal Luck in various contexts: subjects were asked to state their preferred principle in different contexts. The ‘full context’ is where both Luck and Effort play role – this we call Scenario 3. As this is quite a complicated context, we built up to it via a series of simplified contexts, in each of which there was either Luck or Effort, but not both (these are Scenarios 1a, 1b, 2a and 2b).Footnote 17 To shed light on whether the expressed principles differ depending upon whether they are those of a disinterested spectator or an interested member of Society, we had two treatments – which we call the External and Internal Dictator treatments. Whether choices in the two treatments might differ depends upon how the subjects viewed the two treatments, and whether they regarded them to be essentially the same, as far as expressing their preferences was concerned.

We conclude this section with a brief illustration of the experimental software.

For scenario 1a the interface was as follows:

figure a

The only decision that the subject was asked to take was whether he/she thought that one should apply ‘Equality of Dividend’ or ‘Equality of Payment’. As will be seen, in the figure above the former is ticked, implying that the effect of bad luck (a loss of income of 5 ECU) is not compensated and so the payments differ by the same amount as the income. Note, as should be clear from the figure that 2 of the 4 members have bad luck and 2 have good. The subjects were encouraged to vary the Good/Leave income by using the drop down box, and hence explore the implications. In this case, changing the Good/Leave income would change everyone’s payment but not the gap between the good and bad luck subjects. The interface also allowed subjects to experiment with changing the number of subjects in each box. In this case, keeping the Good/Leave income fixed at 5, but having 1 subject in bad leave and 3 in good/leave would increase the income and payment of the one subject moved to good/leave, but, with ‘Equality of Dividend’ ticked, not change the gaps between income and payment. However, if ‘Equality of Dividend’ was ticked, then all would receive a payment of 55/4 instead of 50/4.

5 Results

For each of the four treatments we had 60 subjects in the laboratory. However, in the (External Dictator, Equal Dividends default) treatment there was a computer problem for one subject, which led to the loss of that subject’s data but not the subject’s participation. We did not collect demographics.

Table 5 shows that we had four treatments in a 2 by 2 structure: Internal/External Dictator and Equal Payments/Equal Dividends Default Rule. The data allows us to rule out any effect of the Default Rule effect—see Appendix 4 for details. Hence, in the remainder of the paper, we pool the data of the two default rules, and consider only the two treatments: Internal and External Dictator.

5.1 The four elementary scenarios

Table 6 records for each elementary scenario the number of subjects choosing Equal Payments for All.

Table 6 Percentage of subjects choosing Equal Payments in the elementary scenarios

Equal Payments is the Principle of Accountability (PoA) rule for Scenarios 1a and 1b, while Equal Dividends is for 2a and 2b. Therefore, in each case a large majority of subjects chose the Principle of Accountability rule. For Scenario 1a (all members leave) the proportion choosing Equal Payments is significantly (χ2 p-value = 0.021) greater with an Internal than with an External Dictator. Otherwise, the differences between the two treatments are not significant.

Table 7 shows the results when aggregating choices over the four simple scenarios and in the full matrix compound scenario. In a perfect world, both columns should show the same preferences; in the case of simple scenarios, this preference has been elicited piecewise, one element at a time; in the compound scenario, it has been elicited jointly. A crucial difference between the two is that in the simple scenarios subjects could not be and were not exposed to the impossibility of some of their choices; in the compound scenario, they were. We discuss here only the simple scenarios.

Table 7 Subjects’ responses over all scenarios (rounded percentages)

The Principle of Accountability rule implies Equal Payments in Scenarios 1a and 1b, and Equal Dividends in Scenarios 2a and 2b. We could define a Somewhat Principle of Accountability rule (‘Somewhat PoA’) as differing from this in exactly one element. A still broader class of what might be called orthodox responses comprises these together with responses with either Equal Dividends for each level of luck, or Equal Payments for each level of effort. As can be seen from Table 7, these five categories constitute some 63% to 70% of all responses in the simple scenarios ‒ a significant majority.

Figure 1 shows consistency of choice between scenarios 1a and 1b, and/or between 2a and 2b. The majority of subjects (67%) showed consistency, with a relatively small fraction changing their reported preferences.

Fig. 1
figure 1

Consistency within each simple scenario

5.2 The compound scenario

Table 7 reports the results for the compound scenario. Recall that with the exception of Equal Dividends or Equal Payments everywhere, each regime in the compound scenario is contingent on at least one income position being unoccupied, the subject specifying which one of the four elements is droppable otherwise.

As with the elementary scenarios, orthodox responses (that is, non- ‘other’) account for a significant majority of subjects in each treatment. However, PoA responses are inevitably lower, not significantly so (χ2 p-values 0.0076 and 0.0057 for the Internal and External treatments respectively), in the compound scenario than in the elementary scenarios, and are less attractive than each of Equal Dividends or Equal Payments everywhere. It looks as if the increased complexity of the full-matrix scenario made simpler rules (equal dividends everywhere, that is no compensations, or equal payments everywhere, that is full compensation) more attractive than the more difficult (and not universally applicable) PoA rule of compensating for luck but not for effort.

Figure 2 shows the consistency between the elementary and compound scenarios. One striking feature is that no subject switched from Equal Payments in the simple scenarios to Equal Dividends in the compound, and no subject switched from Equal Dividends in the simple scenarios to Equal Payments in the compound. This might reflect the wide perceived gap between left-wing (full compensation) vs right-wing (no compensation) principles. However, there is a considerable amount of switching elsewhere. Table 8 shows the percentage of subjects who are consistent with their choice of principle in the simple and compound scenarios. For example, 46.7% of all subjects in the Internal Dictator treatment preferred Equal Payments in both the simple and the compound scenarios.

Fig. 2
figure 2

Consistency across simple and complex scenarios

Table 8 Within-subject consistency across simple and compound scenarios, by principle (rounded percentages)

Summarising, we find support (albeit limited) for the Principle of Accountability. Only about 20% of subjects support its pristine form – fully compensating for exogenous luck while not compensating for endogenous effort choices. This percentage decreases in the compound scenario, where subjects have to face the full complexity of the consequences of their choices. A further 20% support principles that can be described as ‘somewhat PoA’, i.e. that keep the PoA core idea but do not apply it in specific situations (e.g., if all subjects have bad luck, or if all subjects exert no effort). About 15% of subjects show right-wing libertarian views, choosing support for no compensation for either luck no effort, and a symmetric 15% show left-wing views, choosing full compensation for both. About a third of the subjects do not express any coherent principle.

5.3 Compromises in the face of the inapplicability of the chosen principles

A secondary purpose of the experiment was to see how those subjects attracted to the Principle of Accountability would respond to its infeasibility in a saturated profile, and in particular, which rules they would select as droppable in the event that they chose the (contingent) Principle of Accountability in the compound scenarios. The number of subjects we can identify as being attracted to PoA, on the basis of choosing PoA or Somewhat PoA in the compound scenario is 37 in the Internal Dictator treatment, and 32 in the External Dictator Treatment. Table 9 sets out the data on droppable rules among these subjects.

Table 9 Distribution of the “droppable” constraint chosen by subjects having chosen PoA or Somewhat PoA in the compound scenario (rounded percentage)

There is a clear and significant difference between the Internal and External treatments for the droppable element. Interestingly, in the External Dictator treatment the responses were not significantly different from a uniform distribution (χ2 p-value = 0.267), while in the Internal Dictator treatment, they were significantly different from a uniform distribution (χ2 p-value = 0.010) with a strong preference for dropping Equal Payments for Leave and Equal Dividends for Bad Luck. The External Dictator results seem consistent with our principal hypotheses,Footnote 18 while the Internal Dictator results suggest that many of the subjects were planning to Stay or were hoping for Good Luck (in other words, misinterpreting the implementation of the Internal Dictator treatment).

5.4 Extent of exploration of the consequences

As noted above, our experimental software allowed subjects to explore in detail the consequences of their choices. They could vary, for each choice of rules for the different rows and columns, the position of the players and the income occurred in different positions. In particular, they had four ways of interacting with the software: they could (a) change the allocation rules; (b) change the position of the players, (c) change the income attached to different choices and (d) use a button that would randomly change the position and the incomes. We tracked each action for both the four simple and the compound scenario. Table 10 details the data. (Table 10)

Table 10 Mean (standard deviation) of each exploratory action, and overall, by scenario

Subjects explored the consequences of their choices extensively. On average, subjects had between 35 and 45 interactions for each of the 2-min simple scenarios, and 85 interactions for the 8-min compound scenario. There was large heterogeneity, as testified by the large standard deviations, and especially so for the random action that shuffled all positions and incomes. This makes sense as the random exploration allows them to understand the implications only if repeated several times. The random button was the most heterogeneous in its use, with some subjects relying mostly on it and others not using it at all.

The extent of the overall exploration shows no correlation with the chosen principle, neither in the simple (kruskal–wallis, p-value = 0.610) nor in the compound scenario (kruskal–wallis, p-value = 0.671); the same Is true when disaggregating the exploration by type, both for the simple (kruskal–wallis, all p-values >  = 0.142) and compound scenario (kruskal–wallis, all p-values >  = 0.396). There is further no significant difference in exploration across dictator treatments (t test, p-value = 0.149). No particular axiom comes ‘intuitively’ and there is no special relation between the need to explore and the choice of more orthodox or appealing social redistribution axioms.

6 Discussion

In this experiment, we directly elicited individual preferences over compensation principles. Even if we think that this direct elicitation mechanism can help reveal social redistribution preferences better than the more popular method of inferring them from given choices, the design has limitations.

First, it can be a daunting task for the subjects. Our priority in designing the structure, the program/interface and the instructions was to make the experiment as simple and intelligible as possible. However, the task might have been challenging for subjects. They had to first understand the nature of the problem and then identify a best response to it. We feel that most of our subjects were able to clear the first of these hurdles. However, even given full comprehension of the problem, comparing regimes is a complex task, especially in the compound scenario, as our discussion above shows. Furthermore, optimal choices could vary substantially across subjects depending, for example on their self-interested motivation, attitude to risk, effort/payment preferences, and beliefs about the corresponding preferences of other members and about the unknown income values. For all of these reasons, the disparate results reported in Sect. 5 concerning the droppable principle were perhaps to be expected.

Subjects’ overall responses are clearly not arbitrary, however, and cannot be said to be driven by miscomprehension of the task; we know from our background data recording every keystroke and mouse click, that most subjects, in all treatments, made full use of the simulation program to explore the implications of different regimes. We found that there was no correlation between the extent of the exploration and the submitted set of rules.

A majority of subjects submitted what we call orthodox choices, and most subjects are consistent when moving from the simple to the full scenario. Nonetheless, the design must have been difficult to navigate for some subjects since about a third of them did not submit orthodox principles. This large amount of noise can indeed be a consequence of our design. On the other hand, such a high level of noise would be a problem for any design. Ours has the advantage that noisy subjects signal themselves out through their choices, while traditional choice-based experimental approaches have no way of dropping the noisy subjects and this could bias their estimations.

It is difficult to compare our results with those in other papers, since they do not seek ‘orthodox responses’. There is, however, quite clearly much noise in subjects’ responses in other experiments, as shown in their results. For example, large variation can be seen in Becker’s Fig. 1; Konow’s Fig. 2 likewise shows considerable variation.

Second, we might have been forced to oversimplify the scenarios. We suspect that one difficulty was that the binary alternatives presented to the subjects (either Equal Payments or Equal Dividends – dictated to us by the Principle of Accountability) were too constraining, and that some subjects might have preferred partial compensation in certain contexts. In this respect, we should note that the evidence from other experiments (most notably those of Cappelen et al. (2007, 2010 and 2013)), indicates that subjects trade-off the various possibilities in a more continuous way. However, these other studies were not attempts to test the Principle of Accountability, nor the other principles discussed in this paper, but were more concerned with estimating utility functions – and usually in contexts in which either luck or effort were manipulated, and not both. Our work provides an important complement to theirs, both methodologically and theoretically. We provide a novel, incentivised design to directly elicit general principles, rather than inferring them from a limited set of choices; moreover, the uncertainty over payoffs allows us to have Internal Dictators that were unaware of their future position in Society – a more appropriate implementation of the Rawlsian Veil of Ignorance from a theoretical point of viewFootnote 19; and we directly test a situation in which both luck and effort vary, and put the subjects face to face with the impossibility of following the Principle of Accountability in a world with people in different positions.

As regards our principal interest, the Principle of Accountability receives modal support in the elementary scenarios. This is in line with other studies that have estimated social redistributive preferences, as for instance Almås et al. (2020). The distribution of preferred compensation strategies is also similar to theirs. With about 15% of the sample being strict Egalitarian (Equal Payments everywhere) and a further 15% Libertarian (Equal Dividends everywhere), our results sit in between the results that Almås et al. (2020) report for Norway and the USA. The fact that similar results stem from widely different methods (following common practice, their results are indirectly estimated from choices of external Dictators only) seems to be an indication of the robustness of the findings.

It is clear that the majority of our subjects agree that Society should compensate citizens for bad luck, and that it should not compensate subjects for lack of effort.

When these two principles conflict, subjects do not display a clear preference over second-best solutions, as choices do not reveal clear patterns for which constraint to be dropped in case of conflict. The practical impossibility of following the Principle of Accountability might be too difficult a setting to comprehend and act upon, or might be a scenario that our subjects – like most other people – never contemplated and are ill equipped to answer. While this last remark might seem discouraging, there is hope. New applications of our direct and incentivised elicitation method, coupled with simulation exercises, could be run on only those following the Principle of Accountability, stressing their preference’s lack of coherence in certain situations and asking which condition(s) should be dropped in these cases. Given the widespread acceptance of the Principle of Accountability in Western democracies, and the never-ending tensions that pervade them, we believe the results of such an experiment would be important.

The implications for constitutional economics are relatively clear: a significant majority of our subjects indicated approval of compensating citizens for bad luck, and a somewhat smaller majority approved not compensating citizens for lack of effort. These are in line with most Western practices and agree with survey evidence. The situation becomes less clear when luck and effort are combined; here there seems no consensus for a clear set of principles. It would seem that the preferences depend upon the context – and there was no context in our experiment. This suggests that we should run a similar experiment with the effort decision put into some kind of context.