1 Introduction

People lead groups by example if they set a positive standard or behavior through their own actions such that other group members can emulate this behavior. A large and interdisciplinary literature considers such exemplary leadership a facilitator of cooperative and successful outcomes in different social contexts, e.g., the military, education, health care or politics (Bulls 2007; Wolbrecht and Campbell 2007; Della et al. 2008; Versland and Erickson 2017), at the international level [e.g. in the context of environmental protection (Burke 2017)], for regional integration in post-conflict societies (e.g. Pesquera 2011) and, of course, in businesses and organizations (e.g. Yaffe and Kark 2011). However, since such standard setters differ from their fellow group members in many characteristics such as personality factors (Bruttel and Fischbacher 2013), it is unclear to which extent any beneficial outcome derives from the leader herself or her action.

Furthermore, the dynamic interaction between leaders and followers in groups complicates the causal evaluation of any leadership process. Even with large sample sizes, statistical relationships are often just correlations that do not allow for causal inference because of endogeneity concerns. Hence, leadership research has focused in recent years on experiments as a complimentary research method. A well-designed experimental study eliminates many statistical concerns in field studies (Antonakis et al. 2010) and circumvents related problems of selectivity. Ideally, such experiments take place in natural environments, but such circumstances often reflect idiosyncratic characteristics of the respective context. A clean identification strategy requires carefully controlled and replicable experiments with a design that tests the hypotheses of a specific leadership theory (Falk and Heckman 2009).

We exploit the benefits of this scientific approach in this paper and provide a meta-analysis of experimental studies that use exemplary leadership in a social dilemma [as described in Güth et al. (2007)]. More specifically, a randomly selected leader visibly commits herself to a specific contribution to a public good before the fellow group members do so. This approach provides perhaps the most parsimonious perspective on leadership because it ignores aspects like leaders’ and followers’ personal, demographic and socioeconomic characteristics (Grossman et al. 2015). It does not even provide leaders with a superior access to information as some other economic analyses do (e.g. Hermalin 1998; Komai et al. 2007; Komai and Stegeman 2010).

This game captures the key components of leadership that have been addressed in the management literature over the last years. Social learning theory (e.g. Bandura and Walters 1977) suggests that individuals in an organization often try to emulate the behaviors of leaders (Men et al. 2020). However, exemplary leadership does not just demonstrate the cooperative behavior expected from followers, it is also an indicator of the behavioral integrity of the leader, which fosters cooperation and workplace performance of followers (Leroy et al. 2012; Eisenkopf 2020; Simons et al. 2022). Simons et al. (2015) find in their meta-analysis that the behavioral integrity of a leader correlates strongly with trust and organizational citizenship behavior. In turn, Jong et al. (2016) observe in their study (which combines results from 112 studies) that trust between leaders and followers correlates significantly with team performance. It is particular important without strong and adequate institutional mechanisms such as monitoring and incentive schemes to induce the desired behavior. Therefore, it is not surprising that leading by example provides a key component of concepts like ethical or authentic leadership which require a demonstration of normatively appropriate conduct through personal actions of the leader (Brown et al. 2005; Walumbwa et al. 2008). Bedi et al. (2016) as well as Zhang et al. (2019) provide meta-analyses on the relationship between ethical leadership and follower outcomes. The results of both studies show that ethical leadership is positively related to the perceptions of leader interactional fairness and follower ethical behavior.

This discussion shows that meta-analyses are a vital tool of leadership research. However, such an aggregation of evidence still provides just correlational evidence and does not eliminate the methodological limitations of field studies that do not exploit random variations in their identification strategies (Antonakis et al. 2010). In their meta-analysis on charismatic leadership, Banks et al. (2017) identify several shortcomings in testing theoretical and methodological moderating variables and call for more experiments because well-designed randomization process eliminates such concerns [see also the discussion in Lee et al. (2020)].

In contrast, economists rarely use meta-analyses but the increasing popularity of experimental methods with standardized games has provided some studies on public goods games (Croson and Marks 2000; Zelmer 2003), trust games (Johnson and Mislin 2011), ultimatum games (Larney et al. 2019), dictator games (Engel 2011) and the experimental paradigm of Fischbacher and Föllmi-Heusi (2013) on truth-telling (Abeler et al. 2019). More often, economists review the relevant journals in dedicated outlets like the Journal of Economic Literature or the Journal of Economic Surveys.

Regarding the topic of our paper, Eichenseer (2021) provides a thorough and thoughtful review of the relevant experimental literature and interesting variations of the leadership game. That paper also includes an aggregate quantitative synthesis of the results in the various papers. In contrast, our meta-analysis allows for a more detailed analysis of the interaction between leaders and followers within the groups because we aggregate the actual experimental data from the different paper. Hence, we can identify reciprocal behavior within groups precisely and in a standardized way across different datasets. This feature allows us to propose and test a parsimonious theoretical model which combines standard economic reasoning with the simple reciprocity model proposed by Fehr and Schmidt (1999).Footnote 1 More precisely, we study four questions about the behavioral impact of exemplary leadership. For each question we propose a hypothesis from this illustrative model. Our large, yet focused, detailed and standardized accumulated data allow for a clean and replicable test of these predictions.

In most studies, the decision sequence of the leadership game increases aggregate contributions and welfare relative to groups without a leader (Moxnes and van der Heijden 2003; Güth et al. 2007; Dannenberg 2015; McCannon 2018; Eichenseer 2021) but this positive effect cannot always be confirmed (Haigner and Wakolbinger 2010; Sahin et al. 2015; Gächter and Renner 2018; Gürerk et al. 2018). Therefore, our first research question focuses on the aggregate impact of such leadership on contributions across several studies.


Question 1: does leading-by-example increase contributions?

Aggregate contributions are the most popular indicator of leadership effectiveness, but they do not constitute the only one. Our second research question focuses on the impact of such leadership on coordination. Fischbacher et al. (2001) show that many people are conditional cooperators, who prefer to match others’ contributions. Barron and Nurminen (2020) demonstrate that even simple nudges lead to a remarkable increase in cooperation rates, because nudges provide a focal point, potentially helping conditional cooperators to coordinate. Thus, when followers are conditional cooperators, leaders’ high efforts may provide a focal point and induce the followers to exert high efforts. Gächter et al. (2012), Frackenpohl et al. (2016) and Cartwright and Patel (2010) elicited followers contributions using the strategy method (Selten 1967). Their results show a significant correlation between the contributions of followers and leaders. Hence, a cautious leader also has an impact. Their low contributions can decrease the expenditure of followers. Note that a limited degree of reciprocity among followers implies costs and risks for the leader. Some studies report that leaders end up worse than followers do (Cappelen et al. 2016; Gächter and Renner 2018; Eisenkopf 2020) and they may receive even lower payoffs than in a group without leadership. Here, the random allocation into experimental roles is particularly helpful for evaluation because voluntary leadership is a highly selective process that attracts only persons with specific characteristics (Arbak and Villeval 2013; Alan et al. 2019).


Question 2: does leading-by-example induces a stronger alignment of group members’ contributions?

Third, we focus on the stability of any leadership effect over time. If leaders are also conditional cooperators they might be unwilling to uphold high levels of contribution in case of lower contributions by the followers. Gächter and Renner (2018) as well as Teyssier (2012) show that leaders behave almost perfectly like conditional cooperators and match their contribution with the amount they believe the followers will contribute. Consequently, the positive effect of the leading-by-example would have to disappear if followers permanently undercut the contribution of the leader.


Question 3: do higher contributions with leading-by-example persist over time?

Last, not least, we have an interest in the role of group size on the impact of leadership. Several public good experiments show a positive correlation between group size and contributions in related experiments without a leader (Goeree et al. 2002; Isaac and Walker 1988; Isaac et al. 1994). Even if this effect is not robust for all comparisons (Nosenzo et al. 2015; Carpenter 2007), a meta study with 27 experiments confirm the positive correlation (Zelmer 2003). However, only few studies deliberately investigate the role of group size in the context of leadership. Some studies suggest that the effect of leading-by-example is also present in larger groups (Figuieres et al. 2012), but the coordination effect of leaders seems to diminish with group size (Komai and Grossman 2009). Therefore, our last research question focuses on the impact of group size on leading-by-example.


Question 4: does the impact of leading-by-example change with the size of the group?

It is important to point out at this stage that our evidence regarding this last question is rather limited. First, we have to rely on cross-study comparisons because few studies vary group sizes systematically. Second, most studies use either groups of 3 or 4 participants which limits the variation in the data (but provides rather large evidence at this particular marginal change). Third, experimentalists typically impose lower marginal per-capita return rates in experiments with larger groups which generates an obvious confound. Note also that the predictions regarding group size are to some extent ambiguous, as the subsequent section points out.

We obtained data from 14 studies with 369 groups as independent observations. Our results show that leading-by-example significantly increases contributions in comparison with leaderless settings. Followers reply in kind to the leader, but only to a certain extent. Consequently, leaders contribute significantly more than followers. Therefore, we can confirm the ‘leader’s curse’ (Gächter and Renner 2018): they earn less than their fellow group members. Nevertheless, leaders do not reduce their contributions more than those of their followers. The positive effect of leading-by-example is maintained over a longer period of time. Moreover, we find that contributions increase in group sizes. However, leaders in larger groups elicit less coherent responses from their followers.

The remainder of the paper is organized as follows. Section 2 explains the underlying game structure, our leadership model as well as the resulting theoretical predictions. Section 3 explains the methodology. Section 4 presents our main results, while Sect. 5 concludes.

2 The underlying game and theoretical predictions

Our study focuses on behavior in the following voluntary contribution game (VCM) that has been by introduced by Isaac and Walker (1988) and adapted to leadership by Güth et al. (2007). Complementary to our literature review we illustrate our predictions with a simple model that incorporates inequality aversion into a standard microeconomic analysis.

Let \(I = (1, \ldots , N\}\) denote a group of \(N \ge 3\) individuals who interact for \(t = 1, \ldots ,T\) periods. In each period \(t\), individual \(i{ } \in { }I\) gets an endowment \(e > 0\), which can be either privately consumed or contributed to a group activity. For our theoretical analysis, we set \(T = 1\), standardize the endowment \(e = 1\) and consider a binary decision regarding the contribution: \(c_{i} \in \left\{ {0,1} \right\}\). This simplification allows us to focus on the conditions that induce group members to make nonnegative contributions and how leading-by-example alters these conditions. The monetary payoff of individual \(i\) takes the following form:

$$\pi_{i} = 1 - c_{i} + q\mathop \sum \limits_{j = 1}^{N} c_{j}$$

We have an interest in all studies in which \(1 > q > \frac{1}{N}\) holds. Because of \(1 > q\) the dominant strategy for each rational and selfish player is to contribute nothing. However, because of \(q > \frac{1}{N}\), full contributions would generate the highest aggregate payoff of all group members.

We consider two variants of this game: the standard simultaneous VCM and the VCM with leadership. In the simultaneous VCM, all N group members make their contribution decisions privately and simultaneously. The VCM with leadership has two decision stages. First, the leader, L\(,\) chooses his contribution \(c_{L}\), which is observed by the followers. Then, the followers \(F \left( { \ne L} \right)\) decide simultaneously about their own \(c_{F}\). Applying backward induction and assuming commonly known monetary payoff maximization, the theoretical prediction for the VCM with leadership do not differ from those for the standard-VCM: Because of \(q < 1,\) the followers’ dominant strategy in stage 2 is to contribute zero. A rational leader will anticipate this and free-ride as well in stage 1.

However, we want to inquire whether reciprocal preferences induce more cooperation in the group and how leading-by-example fosters this cooperation. Theories of reciprocity typically consider two motives for rewarding high contributions in a social dilemma and punishing low contributions: Outcome and intentions (Falk and Fischbacher 2006). However, in the VCM game we study, it is impossible to tell which motive triggers reciprocity for a high contribution.Footnote 2 Both motives suggest that the contribution of player increases in (her belief about) the contributions of the other group members. This observation allows for two simplifications. First, Occam’s razor suggests that a consideration of outcomes as reciprocity motive is sufficient. Second, we can restrict the analysis to a binary choice (contribution or no contribution).

Regarding outcomes we assume that players suffer psychological losses from both advantageous and disadvantageous inequality (Fehr and Schmidt 1999).Footnote 3 The parameter \(\beta_{i}\) (with \(1 > \beta_{i} \ge 0\)) measures the utility loss from advantageous inequality, while \(\alpha_{i}\) with \(\alpha_{i} \ge \beta_{i}\) indicates the loss from disadvantageous inequality. For simplicity, we assume that people have common knowledge about a homogeneous \(\alpha_{i} = \alpha > \left( {1 - q} \right)\) in the population. Moreover, regarding advantageous inequality aversion we consider only two types of persons (\(\beta_{i} \in \left\{ {\underline {\beta } ,\overline{\beta }} \right\}\), with \(\underline {\beta } = 0\) and \(1 - q < \overline{\beta } \le \alpha\)). While the share of \(\underline {\beta }\) in the entire population is common knowledge, individual realizations of \(\beta_{i}\) constitute private knowledge. Groups consist of randomly drawn samples from that population. Let x denote the expected share of people with \(\beta_{i} = \underline {\beta }\) within a group (and \(1 - x\) for \(\beta_{i} = \overline{\beta })\). In case of simultaneous contributions, the utility function of a group member can be denoted as follows:

$$U_{i} = 1 - c_{i} + q_{i} + q_{j} \mathop \sum \limits_{j = 1}^{N - 1} c_{j} - \frac{\alpha }{N - 1}\left( {\mathop \sum \limits_{j = 1 \ne i}^{N - 1} max\left\{ {c_{i} - c_{j} ,0} \right\}} \right) - \frac{{\beta_{i} }}{N - 1}\left( {\mathop \sum \limits_{j = 1 \ne i}^{N - 1} max\left\{ {c_{j} - c_{i} ,0} \right\}} \right)$$

FS have identified in proposition 4 (with some differences in notation) two conditions for potential asymmetric equilibria in which some group members increase their utility with a contribution.Footnote 4 First, for any such member \(\beta_{i} > \left( {1 - q} \right)\) must hold. Hence, only people with \(\beta_{i} = \overline{\beta }\) (conditional cooperators) will contribute. Second, the benefits of a contribution must outweigh the costs of free-riding:

$$q + q\left( {1 - x} \right)\left( {N - 1} \right) - \alpha x > 1 + q\left( {1 - x} \right)\left( {N - 1} \right) - \overline{\beta }{ }\left( {1 - x} \right)$$

This inequality implies the following condition:

$$\hat{x}_{Sim} = \frac{{q - 1 + \overline{\beta }}}{{\alpha + \overline{\beta }}} > x$$

The term \(\hat{x}_{Sim}\) denotes the maximum share of expected free riders at which contributions become profitable for conditional cooperators, given that other conditional cooperators cooperate. We now study the case of the sequential VCM. Applying backward induction, we first look at the utility function of a follower \(F_{i}\) who has observed the decision of leader \(L\):

$$\begin{gathered} U_{{F_{i} }} = 1 - \mathop {c_{F} }\nolimits_{i} + q\mathop {c_{F} }\nolimits_{i} + qc_{L} + q\mathop \sum \limits_{j = 1 \ne i}^{N - 2} c_{j} \hfill \\ \;\;\;\;\;\;\;\;\; - \frac{\alpha }{N - 1}\left( {\max \left\{ {\mathop {c_{F} }\nolimits_{i} - c_{L} ,0} \right\} + \mathop \sum \limits_{j = 1 \ne i}^{N - 2} {\text{max}}\left\{ {(c_{Fi} - c_{Fj} ,0} \right\}} \right) \hfill \\ \;\;\;\;\;\;\;\;\; - \frac{{\beta_{Fi} }}{N - 1}\left( {\max \left\{ {c_{L} - \mathop {c_{F} }\nolimits_{i} ,0} \right\} + \mathop \sum \limits_{j = 1 \ne i}^{N - 2} {\text{max}}\left\{ {c_{Fj} - \mathop {c_{F} }\nolimits_{i} ,0} \right\}} \right) \hfill \\ \end{gathered}$$

In case of \(c_{L} = 0\), the leader has already revealed her free riding. Again, the benefits of a follower’s contribution must outweigh the costs of free-riding for any follower with \(\beta_{i} = \overline{\beta }\):

$$q + q\left( {1 - x} \right)\left( {N - 2} \right) - \alpha \left( {\frac{{1 + x\left( {N - 2} \right)}}{N - 1}} \right) > 1 + q\left( {1 - x} \right)\left( {N - 2} \right) - \overline{\beta }\frac{{\left( {1 - x} \right)\left( {N - 2} \right)}}{N - 1}$$

which leads to the following maximum share of free riders in the group for nonnegative contributions:

$$\left( {\hat{x}_{Seq} {|}c_{L} = 0} \right) = \frac{{\left( {q - 1 - \frac{\alpha }{N - 1} + \frac{{\overline{\beta }\left( {N - 2} \right)}}{N - 1}} \right)}}{{\left( {\alpha + \overline{\beta }} \right)}}\left( {\frac{N - 1}{{N - 2}}} \right) < \frac{{q + \overline{\beta } - 1}}{{\alpha + \overline{\beta }}} = \hat{x}_{Sim}$$

If the leader has contributed her endowment instead (\(c_{L} = 1\)), the number of potential free riders within the group has decreased by one person. Hence, a conditionally contributive follower tolerates a higher share of free riders among the other followers than in the case of simultaneous contributions:

$$\left( {\hat{x}_{Seq} {|}c_{L} = 1} \right) = \left( {\frac{{q + \overline{\beta } - 1}}{{\alpha + \overline{\beta }}}} \right)\left( {\frac{N - 1}{{N - 2}}} \right) > \frac{{q + \overline{\beta } - 1}}{{\alpha + \overline{\beta }}} = \hat{x}_{Sim}$$

Now we study the benefit of a contribution for the leader. She will contribute if

$$\left( {U_{L} {|}c_{L} = 1} \right) > \left( {U_{L} {|}c_{L} = 0} \right)$$

holds which depends on the response of the followers towards the choice of the leader. As shown above, this response depends on the realization of x. We have three cases to distinguish:

  • \(x < \left( {\hat{x}_{Seq} {|}c_{L} = 0} \right)\). In this case, there are enough conditional cooperators in the population. These people will always contribute irrespective of the choice of the leader. Any leader with \(\beta_{L} = \overline{\beta }\) will contribute.

  • \(\left( {\hat{x}_{Seq} {|}c_{L} = 1} \right) < x.{ }\) The followers will never cooperate irrespective of the choice of the leader. No leader will contribute.

  • \(\left( {\hat{x}_{Seq} {|}c_{L} = 0} \right) \le x < \left( {\hat{x}_{Seq} {|}c_{L} = 1} \right)\). In this case, conditional cooperators follow the choice of the leader. This case implies \(\left( {U_{L} {|}c_{L} = 1} \right) = q + q\left( {N - 1} \right)\left( {1 - x} \right) - x\alpha\) while \(\left( {U_{L} {|}c_{L} = 0} \right) = 1\) holds. Hence, we obtain

    $$x < \frac{qN - 1}{{\alpha + q\left( {N - 1} \right)}} = \mathop {\hat{x}_{Seq} }\nolimits_{L}$$

This inequality has an important implication because it provides an explanation why leaders accept lower average payoffs than followers (the leader’s curse). The contribution decision in this critical case does not depend on \(\beta_{L}\), the ‘compassionate’ part of a leader’s inequality aversion. It rests on the expectation about how many followers the leader can induce to reciprocate, such that the expected net returns from the aggregated investments compensate the expected loss from disadvantageous inequality. This insight implies that leaders should not stop contributions if it pays off even if some group members are free-riders. Comparing \(\mathop {\hat{x}_{Seq} }\nolimits_{L}\) with \(\hat{x}_{Sim} = \left( {\frac{{q + \overline{\beta } - 1}}{{\alpha + \overline{\beta }}}} \right)\), the threshold in case of simultaneous contributions, it becomes obvious that a person is more likely to contribute as a leader if \(\overline{\beta }\) does not exceed \(q\) or otherwise, if the group size is sufficiently large \(\left( {N > \frac{{\overline{\beta }}}{q} + 1} \right)\).

Hence, our analysis has identified two sources of increased contributions via leadership. Conditional cooperators are more likely to contribute if the leader has contributed herself while assuming the leadership role itself tends to induce contributions in particular among otherwise uncooperative group members. Overall, our theoretical analysis suggests the following expectations regarding the research questions:

Hypothesis 1: leading-by-example increases contributions.

Hypothesis 2: leading-by-example induces a stronger alignment of group members’ contributions.

Hypothesis 3: higher contributions with leading-by-example persist over time.

Group size has an ambiguous effect in this context. If x is in the appropriate parameter range, an increase in group size makes a leader’s contribution more rewarding. At the same time, an increasing group size decreases that parameter range, as the leader’s decision becomes less relevant for the fellow group members. However, this result relied on the assumption about common knowledge regarding the share of people with \({\beta }_{i}=\underset{\_}{\beta }\). This assumption is rather bold in the context of anonymous interaction between participants who do not know each other. Hence, followers will have a prior about the share. They can use a leader’s decision to update this prior. Therefore, a contribution by the leader can increase the expectations of the followers regarding the share of participants with \(\overline{\beta }\). This revision of expectations does not depend on the group size.

Hypothesis 4: for a given level of marginal per capita return (the parameter q), larger groups experience higher investments of leaders than smaller groups.

3 Method

3.1 Search of studies and study criteria

We searched for relevant studies in early 2020. More specifically, we first looked for experimental studies that investigate leadership with a voluntary contribution mechanism. Leadership in this context means that one of the group members acts first and the others observe the behavior before they act themselves. We applied a three-step search procedure. First, we searched relevant databases for published studies, including e.g., Google Scholar, EconLit or IDEAS, using terms like Leadership, Leading-by-example, Voluntary contribution mechanism, Sequential contribution, Public Goods. This first search yielded 251 potential results. We removed all theoretical papers and those that did not use a voluntary contribution mechanism. Second, we reviewed the references in the remaining papers to identify additional studies. These steps resulted in potential 33 studies. Last, not least, we sent a request to the e-mail list of the Economic Science Association (ESA), asking for additional publications as well as working papers and other unpublished research. The request identified 15 additional studies (including our own relevant working papers). Thus, our search procedure yielded 48 studies.

Subsequently, we further narrowed down our criteria. More specifically we looked for studies that met the following criteria: first, to ensure comparable procedures, we restricted the studies to experimental studies that used the voluntary contribution mechanism as in Güth et al. (2007). Second, each relevant study had to have at least one treatment with a randomly allocated leader in order to avoid self-selection among leaders. Third, all participants remained in their groups throughout the entire experiment. Applying these criteria resulted in a total of 20 potential studies for which we requested the data. We obtained the results in 15 cases. However, for one study we only got data on the aggregated level, so we were unable to recover the individual decisions. In total, we received full data from 14 studies, which we include in our meta-study. Table 1 informs about the included studies. We hope that future meta-analyses will be able to replicate our analyses with more samples from more diverse populations.

Table 1 Included studies

3.2 Coding and data preparation

For each study, we first transcribed the individual observations into a general form. This data form contained all variables necessary to answer our research questions. Our prime dependent variable was the individual contribution towards to public good. However, since all studies has different maximum stakes, we calculated the individual contribution as a percentage of the respective number of tokens (endowment) received in a round. This procedure facilitated the comparison of the results of the different studies. Moreover, we included the respective round, the role in the experiment (leader or follower), the subject number as well as the respective group. Note, that we later assigned a unique identification number to each study, group and subject, to ensure identification. Besides these subject dependent variables, we added study specific characteristics that have been hypothesized in the literature to affect contributions towards a public goods. Such characteristics include the marginal per capita return (mpcr), group size, number of periods, endowment and the exchange rate of the tokens/points into monetary amounts (Ledyard 1995; Zelmer 2003). To ensure comparability, we have converted the exchange rate into euros and adjusted it for inflation (as of summer 2021). Last but not least, we added treatment variables that indicate whether a group played in a sequential or simultaneous contribution structure. After applying the described procedure to all studies, all data sets were transferred into one complete data set.

4 Results

Before we focus on our research questions in detail, we have a brief look at the descriptive statistics. Table 2 presents the average relative contributions for the simultaneous and leading-by-example structures, as well as the number of included studies, groups and subjects. Appendix Table 10 report additional descriptive statistics for each included study separately. For the sake of clarity, we subdivide the results part in different subsection. Each subsection provides results for at least one of our hypotheses presented in Sect. 2. Note that we report results from additional robustness checks in the appendix. More specifically, we looked whether individual studies had an outsized effect on the aggregate outcomes in any of the regression models. We replicated the estimations and eliminated each individual study in a specific subsample for each robustness check. In a few cases, elimination leads to minor changes. The robustness checks are shown in the Appendix Table 10. We refer to the checks in the corresponding sections.

Table 2 Descriptive statistics

4.1 Does leading-by-example increase aggregate cooperation?

To address our first research question, we include only those studies on our subsequent analysis that allow a comparison between sequential and simultaneous contributions (i.e., with and without leader). This leaves us with 11 studies,Footnote 5 including 369 groups (179 for simultaneous, 190 for leading-by-example) as independent observations. Figure 1 demonstrates the cumulative distribution of means from all included groups separated by the contribution (in percentages of the endowment). The distribution for groups with a simultaneous contribution structure is always above the distribution for groups with a leader. A two-sample Kolmogorov–Smirnov test confirms that the distributions are statistically different (p < 0.01).

Fig. 1
figure 1

Cumulative distribution of the mean contribution per group (in percentages of the endowment)

To address our first research question in greater detail, Table 3 shows the overall impact of leading-by-example on contributions. The dependent variable is a subject’s contribution to the public good as a percent of the endowment, with standard errors clustered at the group level. The simultaneous decision structure serves as the benchmark. The variable leading-by-example denotes the dummy variable for the groups with leaders. Model I studies the impact of leading-by-example. The variable is highly significant and indicates a positive impact of a leader. The effect of leading-by-example remains positive and highly significant even if we control for characteristics of the experimental public good environment (model II) or include fixed effects for the studies (model III).

Result 1: leading-by-example enhances cooperation in comparison with simultaneous decisions.

Table 3 Leading-by-example in comparison with simultaneous decisions

4.2 To which extent do group members follow the leader?

Next, we focus on the impact of leading-by-example on followers. We use the data from groups with leaders of the 11 studies from the previous subsection, but we also include groups from three other studies (Frackenpohl et al. 2016; Centorrino and Concina 2013; Drouvelis and Nosenzo 2013) that did not allow comparison with simultaneous contribution structures but investigated leading-by-example in other contexts. In Appendix Sect. A.2.1 we provide robustness checks without the three additional studies. Excluding these three studies does not alter the results significantly. Table 4 presents OLS-regressions with the individual contribution as a percent of the endowment as the dependent variable. We include the characteristics of the experimental public good environment as independent variables. In addition, the dummy variable Leader not fixed indicates whether the leader remains in her role during the experiment (= 0) or whether the leader role is changed between periods (= 1). Models I–III address contribution differences between leaders and followers. These models include a dummy variable which indicates whether a subject is in the role of a leader (= 1) or a follower (= 0). All three models show a positive and highly significant coefficient for the Leader variable. Thus, leaders contribute significantly more to the public good than followers. Moreover, contributions in groups with a fixed leader are higher than in groups with different leaders.

Table 4 The impact of leading-by-example

To identify the impact of the example set by the leader, model IV–VI in Table 4 include only observations of the followers. Again, the dependent variable is the individual contribution as percentages of the endowment. The variable Rel. Leader contribution indicates the contribution of the leader in a given period. In all estimation models, this variable is positive and highly significant. However, both models indicate that followers, on average, employ an imperfect matching strategy.

The previous findings confirm that leading-by-example influences follower behavior. Our second hypothesis predicts that this impact leads to a stronger alignment of the group members’ contributions. Thus, the heterogeneity of individual contributions should be smaller in groups with a leader. Table 5 shows the alignment of group members contributions in detail. The estimations reported in Table 5 use the data from the 369 groups of the 11 studies that allow for a comparison between leading-by-example and simultaneous contributions. We use the standard deviation within a group in each period as measure of intra-group heterogeneity. The variable leading-by-example denotes a dummy variable which indicates the groups with leaders. All three models show that the standard deviation within groups is significantly lower in the leading-by-example treatments in comparison with simultaneous contributions. Thus, leading-by-example leads to a stronger alignment of contributions within the group. This observation confirms our second hypothesis.Footnote 6

Results 2: leading-by-example leads to a stronger alignment of the contributions of the group members.

Table 5 Standard deviations of individual contributions within groups

4.3 What is the long-term impact of leading-by-example?

To answer our third research question, Table 6 presents results from an OLS-regression, which focus on long-term effects of leading-by-example. The first three models estimate the long-term effect for leading-by-example in comparison with simultaneous contribution structures. To estimate the long-term effect, we only include the groups of the 11 studies used in Sect. 4.1. The dependent variable is the individual contribution as percentages of the endowment. Leading-by-example denote the dummy for groups with a leader. Model II controls for characteristics of the experimental public good environment, while model III includes fixed effects for studies. Our estimations show that contributions generally decrease over time. To test the long-term differences between groups with and without leaders, we implement an interaction term between leading-by-example and the period in our estimations. The interaction term enters positively and significantly in all three models. Thus, while during an experiment the contributions generally decrease, the effect is less pronounced in groups with leaders than in groups with simultaneous contributions.Footnote 7

Table 6 Long-term impact of leadership

Model IV–VI now focus on the long-term effect in groups with leaders. For our estimations we rely on the groups of the 14 studies used in Sect. 4.2. Again, Sect. A.2.1 provides robustness checks that exclude the three additional studies. The variable Leader denotes whether a participant held the role of leader (= 1) or a follower (= 0). Model V controls for characteristics of the experimental public good environment, while model VI additionally control for the studies. Again, our models show that contributions decrease during an experiment. Leaders generally contribute more than followers. However, we do not find that leader and followers do react differently to the progress of the experiment which is in line with our third hypothesis.

Result 3: Leading-by-example also has a positive effect on contributions in the long-term view. Leaders do not decrease their contributions more than followers.

4.4 Do contributions increase with group size?

Last not least, we focus on group sizes effects on contributions. Table 7 presents results from an OLS-regression based on the groups of the 11 studies that compare leading-by-example and simultaneous contribution structures (see Sect. 4.1). The dependent variable is the individual contribution to the public goods as percentages of the endowment, with standard errors clustered at the group level. Model I shows that contributions increase with larger groups, but that contributions are higher in the leading-by-example treatments. Model II examines the differential effects of group size on the two treatments. It suggests a positive effect of group size, but indicates that this effect is smaller in groups with a leader. These findings remain highly significant even when we control for the public goods environment (model III) and include study fixed effects (model IV).

Table 7 The impact of group size

To examine the effects of group size on leading-by-example in more detail, we focus below on the groups of the 14 studies that focused on leading-by-example (see Sect. 4.2). Again, Sect. A.2.1 provides robustness checks that exclude the three additional studies. Table 8 presents results from an OLS-regression. Model I–III focus on the leaders, while the remaining models consider the followers. The first two models confirm our hypothesis 4. Leaders increase their own contributions in larger groups. The effect remains significant even if we control for other characteristics of the experimental public good environment (model II) and when we include the fixed effects for studies (model III). Turning to the followers, model IV shows that followers also increase their relative contributions with increasing group sizes. However, the group size variables turn insignificant once we control for the leader contribution, whereas the leader contribution variable enters positive and highly significant (model V). Model VI and VII identifies the impact of group size on the leader–follower relationship. Model VI shows a group size effect for the followers. However, the effect vanishes if we control for the leader’s contribution (model V). Model VI and VII investigate the coordination impact of the leader. Both models show that the coordination impact of leaders on followers becomes weaker for increasing group sizes. However, model I indicates that leaders in smaller groups are more timid. Hence, even though the leader has less coordination power in the larger groups she generates more contributions.Footnote 8

Table 8 The impact of group size in leading-by-example treatments

The result confirms our theoretical prediction from hypothesis 4. Rather reluctant leadership in small groups explain this gap even though leaders in large teams elicit less coordinated responses from their fellow group members.

Result 4: Contributions increase with in group size, but the coordination impact of leaders decreases.

5 Discussion and conclusion

Experimental studies become more and more important in leadership studies, mainly for three reasons. First, they eliminate endogeneity concerns in the identification of causal relationships. Second, they can be tailored to test a specific theory. Third, they allow for replications by other researchers. We exploited these benefits and merged data from 14 studies in a meta-analysis to answer four questions about the impact of exemplary leadership in the light of a theory that combined standard economic reasoning with a simple model of reciprocity. Leading-by-example is often considered an effective instrument to achieve successful cooperative outcomes in many social environments. However, it is not so clear whether, and to which extent, such an outcome causally relates to the specific exemplary action or to broader characteristics of the leader, e.g. her personality. The random assignment of leadership role in experiments addresses this crucial concern head on and provides complementary evidence to what we observe in the field.

First, we hypothesized that leading-by-example increases contributions in a social dilemma. The results support this hypothesis. The establishment of a first-moving leader generates significantly higher contributions in comparison to groups without a leader. Our second focus was on the alignment of decisions between leader and followers. Our model predicted that conditional cooperators follow the leader’s decision, while the rest refuse to make any contribution. As a result, leaders with high contributions will end up worse than followers (the ‘leader’s curse’). We observe that leadership generates a greater alignment of group members’ contributions even though some followers contribute much less than their leader. Hence, on average, leaders contribute more than followers. We then inquired whether higher contributions with leading-by-example persist over time. Our model predicted that even selfish leaders should not stop contributing with sufficiently few likely free riders in their group because their own economic losses from a breakdown in cooperation are too large. The results support these insights. Despite the relatively small gains of leaders, they do not reduce their contributions more than followers over time which establishes a positive long-term effect of leading-by-example. These findings highlight the importance of cooperative leadership for successful groups. Last but not least, our fourth question deals with the impact of group size. While our simple model predicts an ambiguous effect, further considerations of Bayesian Updating suggest that the effect of leading-by-example is stronger in larger groups. Our results show that contributions increase with group size, independent of the contributions structure. This result is consistent with previous literature examining the effects of group size (Zelmer 2003; Goeree et al. 2002; Isaac et al. 1994). At the same time, however, our results show that the effect of a leader decreases as group size increases. Further analysis shows that this is particularly related to the fact that leaders in larger groups elicit fewer coherent responses from their followers. This result suggests that the benefits of leading-by-example do not extent beyond a certain group size. However, as we pointed out in the introduction and the results section, our evidence has some systematic limitations with respect to identifying the group size effect. This problem calls for more studies on the impact of leadership in larger groups.

Overall, our results provide complementary evidence to leadership research in the field. Not only do we provide causal evidence for the beneficial impact of leading by example. The results also indicate factors that can jeopardize these benefits, in particular the ambivalence of group size and the leader’s curse. It would be interesting to observe in the field whether such a curse also occurs in organizations or whether increased monitoring reduces this effect at the expense of reduced voluntary cooperation (Falk and Fischbacher 2006; Eisenkopf and Walter 2021).

Last, not least, we hope that future meta-analyses can rely on a larger and more diverse sample. Such studies could also test, and potentially falsify, specific extensions of our rather simple leadership model. Indeed, we think that a more general theoretical approach could integrate even more of the multifold empirical evidence. Moreover, we did not investigate any leadership instruments such as communication, monitoring or punishment. Furthermore, most leaders emerge endogenously within a group or that they come as outsiders into the group. Nevertheless, we consider our results as encouraging because they derive from a systematic, replicable and theory-guided research agenda that may complement and inspire future research in the lab and the ‘real life’.