In Sect. 3.1, we first describe the vignette-based study design. In Sect. 3.2, we then describes the procedure of the study. After that, in Sect. 3.3, we formulate working hypotheses about the impact of different accountability framings and scenarios on participants’ distribution choices.
The vignettes
In the vignettes, we asked participants to imagine two persons, denoted “Person A” and “Person B”, who do not know each other (for the instructions and the exact wording of the vignette, see Appendix A; we used a set of common German surnames to identify the protagonists). Participants were told that these two persons heat their homes exclusively with firewood and that both have enough logs in stock to survive the upcoming winter. Nonetheless, they need additional firewood so that they will not feel cold. With this in mind, the community allows them to chop wood in the community forest for a certain period of time. Both A and B have little money and, therefore, they have no other means of getting firewood or any other heating material.
In accordance with our definition of need as the amount of some good that a member of society requires in order to avoid harm, the vignettes described the consequences of unfulfilled need (“If they get less than they need, it will get unreasonably cold in their huts. The less firewood they get, the colder their huts will be.”). The participants’ task was to distribute an exogenously given number of logs among A and B. In order to justify unequal distributions of logs, we introduced heterogeneity between A and B with respect to their need for logs, or their productivity in terms of the number of logs contributed, and their accountability for their situation.
Participants’ distribution choices therefore reveal how far need is acknowledged, and whether and how the acknowledgment of need is affected by the two persons’ heterogeneity. Note that we intentionally described the consequences of unfulfilled need as mildly harmful (“[...] it will become unreasonably cold in their huts.”) and not as life threatening in order to leave participants more scope for judgment. We also allowed for over-fulfillment of need (“The persons can use more firewood than they need to heat their huts up to pleasant temperatures or store it for subsequent winters.”) in order to study how participants handle mixed situations (where one person has unfulfilled need and the other has fulfilled need) and situations of “oversupply” (where both persons have different fulfilled needs).
In the following, we call the framing of the vignette with respect to need and productivity scenario and the framing with respect to accountability treatment. The study consisted of two treatments (accountability framings). In the High Accountability Treatment, participants were told that Person A had continued to smoke heavily against the advice of her doctor, which caused a metabolic disease. This is the reason why she needs a higher room temperature (being accountable for greater need in the Need Scenario) or has chopped less wood (being accountable for lower productivity in the Productivity Scenario). In the Low Accountability Treatment, Person A suffers from a congenital metabolic disease, which is why she needs a higher room temperature (hence not accountable for greater need in the Need Scenario) or has chopped less wood (hence not accountable for lower productivity in the Productivity Scenario).
Participants were randomly assigned to one of the two treatments in the beginning of the study. That is, the treatment effect of the source of heterogeneity—high or low accountability—on participants’ distribution decisions was measured at the between-participants level. In both treatments, all participants were presented both scenarios in randomized order. Hence, the impact of the type of heterogeneity—need or productivity differences—on participants’ distribution decisions was measured at the within-participants level.
Table 1 Parametrization of the vignette by scenario and case Pretests showed that we could place up to a dozen different cases in our online survey that should not last longer than 30 min altogether (including a post-survey questionnaire). Hence, we decided to present each participant with ten different cases, five per scenario (a sixth case per scenario, which we dropped from the analysis, was used only as a consistency check). The cases varied the amount and distribution of needs and contributed logs. Participants were presented the cases of each scenario in a randomized order. They were shown the number of logs contributed by Person A and Person B (productivity), the number of logs needed by A and B (need), the total number of logs needed and the total number of logs contributed. Participants were then asked to distribute the logs between the two persons according to what they thought to be “most just”. Participants distributed the logs freely among the hypothetical persons, without being provided with predefined or default options. However, participants always had to allocate all available resources to A and B.
When asking participants for the “most just” distribution, we assume (i) that they care about justice and (ii) that some possible distributions are judged as more just than others. In the literature, there are basically two approaches assuring that the elicitation of participants’ fairness preferences is unbiased by selfish motives. One approach holds that, in order to discover “which pattern is the most just” (Frohlich et al. 1987b, p. 2), participants must be able to rationally evaluate the fairness of different distributions “under conditions of very imperfect information”, that is, in a position that creates impartiality of involved social planners by a (more or less transparent) veil of ignorance. In this paper, we follow—like many of the empirical and experimental studies mentioned in the literature review—the quasi-spectator method. The quasi-spectator “is an observer who has no salient stakes in the matter at hand and possesses some, if not all, information relevant to his internalized moral values” (Konow 2009, p. 106). Hence, our vignette study is concerned with the fairness views of third parties.
Table 1 shows the parametrization of the vignette by scenario and case. Let \(p_i^s\) and \(n_i^s\) denote \(i\in \{A,B\}\)’s productivity and need in scenario \(s\in \{(N)eed,(P)roductivity\}\). As shown in the table, the Need Scenario provided both persons with \(p_A^N=p_B^N=1000\) logs (equal constant productivity), but different needs. For example, in case 2, A’s (B’s) need of \(n^N_A=1400\) (\(n^N_B=800\)) logs was greater (smaller) than her productivity of 1000 logs, and their joint need of \(n^N=2200\) logs was greater than their total productivity of \(p^N=2000\) logs. Likewise, the Productivity Scenario attached an equal constant need of \(n^P_A=n^P_B=1000\) logs to both persons, but varied their productivity. For example, in case 2, A’s (B’s) productivity of \(p^P_A=800\) (\(p^P_B=1400\)) logs was insufficient (more than sufficient) to meet her need of 1000 logs, and their joint productivity of \(p^P=2200\) logs was sufficient to meet their total need of \(n^P=2000\). Note that in both scenarios, Person A was always in the disadvantaged position, that is, she was more needy or less productive than Person B.
The parametrization of the vignettes enables studying the effect of the supply situation (D = deficit, S = surplus, X = exact need satisfaction) and the source of heterogeneity on the distribution of logs among A and B. Figure 1 illustrates the design of the cases. Case 1, displayed in the top left diagram, introduces a situation where both A and B have a deficit of logs (DD) in the Need Scenario and a surplus of logs (SS) in the Productivity Scenario.
In the Need Scenario, total productivity is displayed as a straight “supply” line with intercept \(p^N\). Point \(P^N\) on this line marks the actual distribution of productivity. Total need is displayed as a straight “demand” line with intercept \(n^N\). Point \(N^N\) on this line marks the distribution of need. Since \(N^N\) is located in the grey shaded area at the top right of \(P^N\), both A and B have a deficit. And since \(N^N\) is located to the right of the \(45^\circ\) line of equality (where \(x_A=x_B\)), A is worse off than B because she has a bigger deficit.
Analogously, in the Productivity Scenario, total productivity is displayed as a straight line with intercept \(p^P\), and point \(P^P\) on this line marks the actual distribution of productivity. Total need is displayed as a straight line with intercept \(n^P\), and point \(N^P\) on this line marks the distribution of need. Since \(P^P\) is located in the grey shaded area at the top right of \(N^P\), both A and B have a surplus. And since \(P^P\) is located to the left of the \(45^\circ\) line of equality, A is worse off than B because she has a lower surplus. Hence, case 1 in the Productivity Scenario is a mirror image of case 1 in the Need Scenario where the source of heterogeneity and A’s disadvantage is either need or productivity.
The other cases are constructed similarly. In case 2 in the top right diagram of Fig. 1, A has a deficit and B a surplus in the Need Scenario (point \(N^N\) is in the area at the bottom right of \(N^P\)), and A has a surplus and B a deficit in the Productivity Scenario (point \(P^P\) is in the area at the top left of \(P^N\)). Case 3 in the bottom left diagram shows a situation where A’s need is either exactly satisfied or she has surplus, and where B has a surplus or her need is exactly satisfied, depending on the scenario. In cases 4 and 5, \(N^N\) and \(P^P\) are located in the grey shaded area at the bottom left of \(P^N\) and \(N^P\), respectively, and, hence, both A and B have a surplus in the Need Scenario or a surplus in the Productivity Scenario. Case 5 “doubles” case 4, has a slightly different parametrization, and was included to study participants’ sensitivity towards need in more “extreme” situations. The results regarding this question will be reported in a different paper.
Procedures
Since participants received only a flat payment, we promoted internal validity by asking three control questions after the distribution task in order to make sure that the participants read the instructions carefully and actually understood the task. For the wording of the control questions, see Appendix B. Only those who passed at least two out of three checks were included in the final analysis.
In order to control for participants’ heterogeneity with regard to their socio-demographics and justice attitudes, we asked them, in a post-survey questionnaire, for their age, gender, and equivalent household net income and to state their support for three different distribution principles—need, equity, and equality (compare, for example, Skitka and Tetlock 1992 as well as Mitchell et al. 1993) as well as their evaluation of Person A’s accountability for her situation on a 7-point Likert scale. We also assessed participants’ perceived locus of control in a similar way (see Fong 2001; Phares and Lamiell 1975) and collected information on participants’ health status (dummy variables for being a smoker and suffering from a cardiovascular or metabolic disease). Ubel et al. (2001) found that smokers tend to punish unhealthy behavior regarding the need for transplant organs less often than participants who never smoked (also see Diederich and Schreier 2010). Participants were also asked for their political orientations using a 7-point Likert scale ranging from 1 (most left-wing) to 7 (most right-wing), see Appendix B for wordings.
The survey was programmed in oTree (Chen et al. 2016) and conducted online in September 2019. For reasons of external validity, participants were recruited via the private market research institute respondi. We used respondi’s Online Access Panel which offers a quota sample of the German adult population. Our sample is a random sample of the Online Access Panel, stratified by the three characteristics gender, age, and equivalent household net income, with a sample-size of \(N=200\) (for a breakdown of the sample by gender, age, and income, see Table 9 in Appendix E). The sampling rates of these characteristics in our sample are representative of the adult German population. On the importance of a sample being representative if empirical research is considered as relevant for normative theory, see Schwettmann (2020). Due to financial constraints, though, it was not possible to draw a larger sample that would potentially have captured more characteristics of the German population. The 200 participants who passed the control questions were paid a flat fee of 4.90 euro, equal to 9.80 euro per hour. Note that a further 203 persons started the survey but failed to pass the control questions. These persons were not asked any further questions and did not receive any payment. It was announced by respondi in the beginning of the study that failure to answer the control questions correctly would lead to exclusion from the study without being compensated. The control questions generated similar failure rates (Q1: 29%, Q2: 31%, Q3: 38%). Participants who failed to pass the control questions did not differ from included participants by their stratification characteristics (gender: \(\chi _1^2=0.517\), \(p=0.526\); age: \(\chi _4^2=5.083\), \(p=0.279\); income: \(\chi _4^2=3.447\), \(p=0.486\)). The enormous amount of dropouts shows that general population samples without performance-related incentives can potentially generate a lot of noise in the data. Hence, relatively strict exclusion criteria should be applied.
Before conducting the main study, we tested the efficacy of the vignette with respect to the accountability framing. All details of the pretest are reported in Appendix C. It clearly confirmed that participants attribute higher accountability to persons who disregard their doctor’s warning.
Working hypotheses
As explained above, participants were asked to distribute the logs between A and B according to what they thought to be most just. Let \(x_i^s\) denote the number of logs distributed to person \(i\in \{A,B\}\) in scenario \(s\in \{N,P\}\). We assume that participants maximize the justice J of the distribution \((x_A^s,x_B^s)\) subject to \(x_B^s=p_A^s+p_B^s-x_A^s\) where the parametrization of the vignette by case and scenario with respect to need \((n_A^s,n_B^s)\), productivity \((p_A^s,p_B^s)\), and accountability treatment \(T=\{low,high\}\) is given. Hence, the optimum number of logs distributed to person A is given by
$$\begin{aligned} x_A^{s*}=\arg \max J(x_A,p_A+p_B-x_A|n_A,n_B,p_A,p_B,T). \end{aligned}$$
(1)
Since we study the justice views of laypersons in terms of participants’ actual distribution choices \(x_A^{s*}\), J is not further specified or “axiomatized”. For a comprehensive account of justice evaluation functions, see Jasso (1978) and Jasso et al. (2016).
We consider two outcome measures of the experiment which both are based on \(x_A^{s*}\). Since the total number of logs available differs from case to case and between the scenarios, we normalize \(x_A^{s*}\) by Person A’s and Person B’s total productivity, \(p^s\), to obtain the first outcome measure, namely, the share of logs distributed to Person A:
$$\begin{aligned} \text{ logshare}_A^s=\frac{x_A^{s*}}{p^s}. \end{aligned}$$
(2)
For example, point \(P^N\) in case 1 in Fig. 1 shows a situation where the logs are distributed exactly according to productivity. Thus, we have \(\text{ logshare}_A^N=0.5\) and \(\tan ^{-1}(x^N_B/x^N_A)=\tan ^{-1}(1000/1000)=45^\circ =\alpha\). Analogously, point \(\nu\) where the line through the origin \(\overline{0N^N}\) and the “supply” line \(p^N\) intersect, shows a situation where the logs are distributed proportional to need. Thus, we have \(\text{ logshare}_A^N=0.6\) and \(\tan ^{-1}(800/1200)=\alpha '=34^\circ\).
The second outcome measure is the normalized deviation from the equal split in favor of Person A:
$$\begin{aligned} \text{ deviation}_A^s=\left\{ \begin{array}{ll} \frac{\text{ logshare}_A^s-0.5}{n_A^s/n^s-0.5} &{} \quad \text{(Need } \text{ Scenario) } \\ \frac{0.5-\text{ logshare}_A^s}{0.5-p_A^s/p^s} &{} \quad \text{(Productvitiy } \text{ Scenario) }. \end{array} \right. \end{aligned}$$
(3)
The allocation that distributes resources equally—the equal split—has been demonstrated to serve as a natural reference point or social norm for bargaining processes (compare, for example, Yaari and Bar-Hillel 1984; Dawid and Dermietzel 2006, as well as Andreoni and Bernheim 2009). In bargaining experiments, participants frequently choose equal-split allocations even though they are Pareto dominated (Herreiner and Puppe 2010). However, in social choice experiments, only a minority of participants still opts for the equal split when objective reasons like need differences speak in favor of an unequal distribution (Gaertner and Schokkaert 2012). Hence, participants’ sense of justice is reflected in the deviation from the equal split.
We have \(\text{ deviation}_A^s=0\) (equal split) when participants think that heterogeneity among A and B is irrelevant. In Fig. 1, equal-split distributions are located on the \(45^\circ\) line which goes through points \(N^P\) (equal productivity) and \(P^N\) (equal need). We have \(\text{ deviation}_A^s>0\) when participants think that A should be compensated for her disadvantage due to greater need (clockwise rotation) or smaller productivity (counterclockwise rotation). Observations that do not meet the condition \(0\le \text{ deviation}_A^s\le 1\), that is, choices that make A’s situation even worse than it is or make A better off than B are considered as noise in the analysis.
We expect that, on average, participants distribute more than A’s productivity share and less than A’s need share to her, because they think it is just to partially compensate A for her disadvantage. Consequently, \(\text{ logshare}_A^s\) in the Productivity Scenario is smaller than in the Need Scenario because A’s need share in the Productivity Scenario is 50% (the upper bound for \(\text{ logshare}_A^P\)), and her productivity share in the Need Scenario is also 50% (the lower bound for \(\text{ logshare}_A^N\)). Hence, the deviation from the equal split is between zero and one. Put into graphs, we expect participants to pick the just distributions from the line \(\overline{P^N\nu }\) in the Need Scenario and from the line \(\overline{P^P\rho }\) in the Productivity Scenario in Fig. 1.
Hypothesis 1
(Partial Compensation) Participants compensate Person A partially for her disadvantage: \(p_A^s/p^s<\text{ logshare}_A^s<n_A^s/n^s\). Partial compensation implies that \(\text{ logshare}_A^P<0.5<\text{ logshare}_A^N\) and \(0<\text{ deviation}_A^s<1\).
We expect that participants’ willingness to compensate Person A for her disadvantage is diminished by high accountability. Hence, participants distribute fewer logs to A and they favor smaller (greater) deviations from the equal split in the Need Scenario (Productivity Scenario).
Hypothesis 2
(Accountability) The just distribution of logs depends on A’s accountability such that
-
(a)
\(\text{ logshare}_A^s(low)>\text{ logshare}_A^s(high)\),
-
(b)
\(\text{ deviation}_A^N(low)>\text{ deviation}_A^N(high)\) and \(\text{ deviation}_A^P(low)<\text{ deviation}_A^P(high)\).
If J is homogeneous in the number of logs, A’s just share of logs is determined by her need share and her productivity share, and not by total need and total productivity. Let \(\alpha ^s=\tan ^{-1}(x_B^{s*}/x_A^{s*})\) denote the angle of the line through the origin that contains all just distributions. Since Need Scenario and Productivity Scenario are mirror images of each other, we expect \(45^\circ -\alpha ^N=\alpha ^P-45^\circ\) which is equivalent to \(\text{ logshare}_A^N=1-\text{ logshare}_A^P\). This implies that the just deviation from the equal split is independent of the scenario as indicated by the black arrows in Fig. 1.
Hypothesis 3
(Scenario) Disadvantages that are due to greater need and smaller productivity are treated symmetrically, that is, \(\text{ logshare}_A^N=1-\text{ logshare}_A^P\) and \(\text{ deviation}_A^N=\text{ deviation}_A^P\).
However, one could argue that the cognitive perception of the scenarios differs because the equal split is seen as a reference point (Trueblood 2015). Loss averse participants might find compensating A less justified when her disadvantage is due to lower contribution (negative domain, reduction of a loss for A) than when her disadvantage is due to greater need (positive domain, gain for A). Hence, there could be a gain-loss domain effect (Tversky and Kahneman 1991, also see Weiß et al. 2017) that would imply \(\text{ deviation}_A^P>\text{ deviation}_A^N\). Moreover, there could also be an interaction effect between scenario framing and accountability treatment.
As can be seen in Table 1 and Fig. 1, the five cases differ by their supply situation. Figure 1 shows that \(P^N\) and \(N^P\) remain constant, while \(N^N\) and \(P^P\) are obtained from systematically changing both relative inequality in productivity or need (visualized by the angles of the dashed lines through \(N^N\) and \(P^P\) to the line of equality) and total need or total productivity (visualized by the “demand” and “supply” lines). Hence, the study design does not allow the separation of both possible effects on the just distribution of logs. Keeping relative inequality constant would have led to very small absolute need and productivity differences in cases where \(N^N\) and \(P^P\) are located in the grey shaded area at the bottom left of \(P^N\) and \(N^P\). Hence, we decided to additionally increase relative inequality among A and B.
In order to deal with this problem, we first assume that the absolute supply situation in terms of A having a deficit or surplus (the location of the “demand” and “supply” lines) does not affect participants’ justice considerations. Hence, in the Need Scenario, we expect that the share of logs that is distributed to A increases in relative inequality when moving from case 1 to 5 because point \(\nu\) moves outward on the line of just distributions \(\overline{P^N\nu }\) and \(\alpha '\) decreases. The opposite applies to the Productivity Scenario where \(P^P\) moves outward on the line of just distributions \(\overline{P^P\rho }\) and \(90-\alpha '\) increases. Hence, in the Need Scenario (Productivity Scenario), we hypothesize that participants distribute relatively more (less) logs to A when a case exhibits greater inequality among A and B. Since the deviation from the equal split is normalized by the difference between A’s need share and her productivity share (the length of \(\overline{P^N\nu }\) and \(\overline{P^P\rho }\)), we expect it to be independent of the supply situation.
Hypothesis 4
(Supply situation) Let \(y,z\in \{1,2,3,4,5\}\) denote the number of the case where \(y<z\) and cases with lower numbers exhibit less inequality (with respect to need or productivity) among A and B, then we have \(\text{ logshare}_A^N(z)-\text{ logshare}_A^P(z)>\text{ logshare}_A^N(y)-\text{ logshare}_A^P(y)\) and \(\text{ deviation}_A^s(y)=\text{ deviation}_A^s(z)\).
If the absolute supply situation matters for the distribution of logs, it is likely to have a dampening effect in the Need Scenario (A receives relatively less, \(\alpha '\) decreases less) because increasing inequality is combined with an improving absolute supply situation (Person A starts with a deficit and ends up with a surplus). It is also likely to have a dampening effect in the Productivity Scenario (A receives relatively more, \(90-\alpha '\) increases less) because increasing inequality is combined with a worsening absolute supply situation (Person A starts with a surplus and ends up with a deficit). Hence, the two scenario differences in H4 might get smaller, but the effect of increasing inequality should remain.