1 Introduction

Due to slow progress in reducing carbon emissions, solar radiation management (SRM), as one form of geoengineering, has been proposed as an option to address global warming.Footnote 1 SRM aims at increasing the Earth’s reflectivity to attenuate the effect of incoming solar radiation, thereby cooling down the planet. Among different SRM technologies, stratospheric sulfur aerosol injection is the primary technology currently discussed (Caldeira et al. 2013).Footnote 2 The effectiveness of aerosol injection in reducing temperatures has been observed after volcanic eruptions. For instance, the injection of about 20 mega tonnes of sulfur dioxide into the tropical stratosphere due to the eruption of Mount Pinatubo in 1991 provoked a global cooling effect of about a half degree Celsius in the following years (Crutzen 2006). Following the same logic, SRM offers an apparently fast and pragmatic solution to moderate the effects of human-induced global warming over the coming few decades. The costs of deployment are low—especially if compared to mitigation costs—and the potential benefits are estimated to be high (Aldy and Zeckhauser 2020). However, it also comes with a risk, as collateral damages may be very high (Aldy et al. 2021; Barrett 2014; Barrett et al. 2014; Bodansky 2013; Reynolds 2019; Robock et al. 2009; Stephens et al. 2021). SRM will have impacts on precipitation patterns and might increase the exposure to droughts in many regions (Haywood et al. 2013; Irvine et al. 2019; Kravitz et al. 2014; Ricke et al. 2010). The risk of the depletion of stratospheric ozone will increase (Moreno-Cruz and Keith 2013) and the deposition of sulfur particles could lead to acid rain and soil acidification with detrimental consequences for agriculture and health (Crutzen 2006; Kravitz et al. 2009; Visioni et al. 2020). Finally, SRM cannot easily be reversed as its interruption could cause a termination shock with sudden temperature increase (Irvine et al. 2012; Parker and Irvine 2018). Given these features, SRM poses several governance challenges. On the one hand, countries which are most exposed to climate damages may “free-drive” by individually deploying SRM, with a high risk for the rest of the world (Weitzman 2015). Thus, the issue may not be the underprovision of the public good “mitigation” (i.e., emissions reduction), but the overprovision of SRM. On the other hand, without SRM, the notorious failure of substantial mitigation efforts in the past may lead to an unprecedented warming of the planet that seriously threatens life and the environment.

Governance issues of geoengineering (in the form of SRM) have been analyzed for instance by Weitzman (2015), Rickels et al. (2020) and Ricke et al. (2013) who assume that only members to an agreement can set the level of the global thermostat through SRM, which incentivizes countries to join the agreement. However, the assumption that non-members can be prevented from the use of SRM is defining away the very problem of free-driving, which has been identified by these authors as the major governance issue in the first place. Moreover, Parker et al. (2018) and Heyen et al. (2019) consider the possibility for counter-geoengineering, i.e., measures capable of negating the climate effects of SRM.

All of these papers focus exclusively on SRM, but do not consider the interaction with other climate policies, like mitigation. For instance, as SRM represents a relatively inexpensive alternative to mitigation, one would expect that SRM reduces mitigation efforts by countries. However, as shown by Cherry et al. (2023), Moreno-Cruz (2015) and Urpelainen (2012), due to the possibility of high collateral damages, countries could increase their mitigation efforts in order to reduce or avoid the deployment of SRM technologies. How the interaction between mitigation and SRM could influence the formation of climate agreements is not studied in these papers.

Millard-Ball (2012) proposes a coalition formation model which studies not only the strategic interaction between SRM and mitigation, but also captures the free-rider incentive which hampers the stability of large climate agreements on mitigation. Two main results may be highlighted. First, if collateral damages from SRM are perceived to be sufficiently high, countries will have an incentive to increase total mitigation up to a level at which the use of SRM is no longer attractive. Second, sufficiently high collateral damages can stabilize the grand coalition if signatories behave as Stackelberg leaders. A country leaving the grand coalition is faced with the option to increase its mitigation efforts in order to avoid the deployment of SRM because Stackelberg leaders reduce their mitigation efforts. This is an unattractive option for the free-rider, and works as a credible threat provided collateral damages from SRM are sufficiently high.

Despite Millard-Ball’s model provides an excellent starting point and offers a new perspective, some aspects remain underdeveloped and some parts of his analysis are incomplete. Finus and Furini (2023) use a similar model and expand the analysis in different directions, including the consideration of different possible coalition formation scenarios and the analysis of stability of all possible coalition sizes, not only of the grand coalition. They find that large coalitions, including the grand coalition, can be stabilized if signatories use the threat to deploy geoengineering once a country leaves the agreement. They show that this strategy works if collateral damages from SRM are sufficiently high, similar to Millard-Ball (2012), but also sufficiently small in order for the threat to be credible. In other words, collateral damages must lie in some range in order for a climate agreement without SRM deployment to be stable. However, Finus and Furini (2023) assume Nash–Cournot behavior by countries and wrongly argue that Stackelberg leadership would not make a difference in the Millard-Ball’s model, as countries have dominant geoengineering strategies.

In this paper, we provide an analysis which considers and compares both assumptions, the Nash–Cournot and the Stackelberg assumption. We focus on various punishment options, their credibility and their effectiveness in stabilizing the grand coalition which cooperates on mitigation and avoids SRM. In order to test the robustness of our conclusions, we consider not only the cartel formation game which Millard-Ball (2012) and Finus and Furini (2023) have used in their analysis, but also consider a repeated game, a second framework, which has been frequently employed to study stability of international environmental agreements.Footnote 3 We show that all our qualitative results hold under both frameworks.

We assume, as Millard-Ball (2012), symmetric countries and focus on the grand coalition for simplicity. We highlight that the different punishment options available under Nash–Cournot and Stackelberg behavior are not necessarily exclusive, can complement each other and are useful for different levels of collateral damages.

In the following, we set out the cartel formation game and derive equilibrium mitigation and SRM strategies in Sect. 2 and analyze stability in Sect. 3. Section 4 considers the alternative framework of a repeated game. Section 5 concludes.

2 The mitigation-solar geoengineering cartel formation game

2.1 The model

There are \(n \ge 3\) symmetric countries, \(i = 1,\;2,\;...,\;n\), with the set of all countries denoted by \(N\), which play the cartel formation game which consists of three stages. In the first stage, countries decide whether to join a climate agreement and become signatories (S) or to stay outside as non-signatories (NS). In the second stage signatories choose their mitigation levels by maximizing the aggregate payoff of coalition members whereas non-signatories choose their mitigation levels by maximizing their individual payoff. In the third stage, all countries individually choose whether to deploy SRM.

We consider two versions for the second stage of the game. In the Nash–Cournot (NC-) scenario, signatories and non-signatories simultaneously set their mitigation levels; in the Stackelberg (ST-) scenario, signatories choose mitigation before non-signatories, taking into account the best-response of non-signatories.

The game is solved by backward induction. Based on the payoff function suggested by Millard-Ball (2012), the following decisions are taken.

In the last stage, all countries face a discrete choice whether to deploy SRM technologies, with no deployment \(z_{i} = 0\) and deployment \(z_{i} = 1\). The net benefit of SRM to country \(i\) is given by \(\left( {g - Q} \right) \cdot z_{i}\). The net benefit includes the benefit from geoengineering in the form of reduced climate damages, minus the cost of production (which may be neglected anyway as the cost of SRM are generally viewed to be very lowFootnote 4) minus the collateral damage to country \(i\). The marginal net benefit of SRM decreases in total mitigation \(Q\) with \(g\) being a strictly positive parameter. If \(Q \ge g\), SRM does not pay, and if the reverse is true, i.e., \(Q < g,\) SRM pays. SRM produces collateral damages \(d\) to all \(n - 1\) other countries, except to the country which uses SRM. It is clear that nothing would change if we acknowledge that collateral damages are uncertain by their nature and talk about expected collateral damages, as long all countries have the same expectations.Footnote 5

It is worthwhile to note that in this model, for the decision of a country whether to deploy SRM, expected collateral damages do not matter. This is because all countries are assumed take this decision non-cooperatively.Footnote 6 What matters is the level of total mitigation, as this determines whether SRM pays. Moreover, the decision is the same for all symmetric countries. Following Millard-Ball (2012), we also assume that if all countries deploy SRM (because \(Q < g\)), this is done by a randomly selected country with the probability \({\raise0.7ex\hbox{$1$} \!\mathord{\left/ {\vphantom {1 n}}\right.\kern-0pt} \!\lower0.7ex\hbox{$n$}}\).Footnote 7 Hence, in case \(z_{i} = 1\) \(\forall \;i \in N\), the expected net benefit and collateral damages of SRM is given by \(\frac{1}{n}\left( {g - Q} \right) - \frac{{\left( {n - 1} \right)}}{n}d\) (and if \(z_{i} = 0\) \(\forall \;i \in N\), it is \(0\)).

In the second stage, signatories and non-signatories choose their individual mitigation levels \(q_{i} \ge 0\) along a continuous interval. The individual payoff function of country \(i\) is given by

$$\pi_{i} \left( {z_{i} = 0\;\;\forall \;i \in N} \right) = bQ - \frac{c}{2}q_{i}^{2}$$
(1a)
$$\pi_{i} \left( {z_{i} = 1\;\;\forall \;i \in N} \right) = bQ - \frac{c}{2}q_{i}^{2} + \frac{1}{n}\left( {g - Q} \right) - \frac{{\left( {n - 1} \right)}}{n}d$$
(1b)

where \(bQ\) are linear benefits from total mitigation \(Q = \sum\nolimits_{i = 1}^{n} {q_{i} }\) and \(\frac{c}{2}q_{i}^{2}\) are quadratic costs from individual mitigation \(q_{i}\), with \(b\) and \(c\) being strictly positive parameters. Whereas non-signatories behave selfishly, maximizing their own payoff (1) with respect to their own mitigation levels, signatories behave cooperatively, maximizing the sum of payoffs of all signatory countries, \(\sum\nolimits_{i \in S} {\pi_{i} }\), with respect to all mitigation levels among their group. For a given coalition of size \(1 \le k \le n\), equilibrium mitigation levels of non-signatories and signatories can be derived, depending on whether SRM technologies will be deployed and whether signatories behave as Stackelberg leaders.Footnote 8 Accordingly, the respective payoffs for non-signatories and signatories can be computed. Details are provided in Sect. 2.2.

In the first stage of the game, \(k\) out of \(n\) countries, \(1 \le k \le n\), are signatories while \(n - k\) countries remain outside the agreement as non-signatories. In the cartel formation game, a coalition of size \(k\) is called stable if it is internally and externally stable:

$$\pi_{S} \left( k \right) \ge \pi_{NS} \left( {k - 1} \right)$$
(2)
$$\pi_{NS} \left( k \right) \ge \pi_{S} \left( {k + 1} \right).$$
(3)

Internal stability (2) implies that no signatory wants to leave the coalition and external stability (3) implies that no non-signatory wants to join the coalition.

2.2 Mitigation and solar radiation management equilibria

In the last stage, country \(i\) will abstain from SRM \(\left( {z_{i} = 0} \right)\) if \(g - Q \le 0\) and will use SRM \(\left( {z_{i} = 1} \right)\) if \(g - Q > 0\). Due to symmetry, either all countries will find it attractive to use SRM or none does.

In the absence of SRM, i.e., \(z_{i}^{*} = 0\), the individual payoff function is (1a), with equilibrium mitigation levels in the pure “Mitigation-equilibrium” (M-equilibrium) being given by \(q_{S}^{M*} \left( k \right) = \frac{kb}{c}\), \(q_{NS}^{M*} = \frac{b}{c}\) and \(Q^{M*} \left( k \right) = \left( {k^{2} + n - k} \right)\left( \frac{b}{c} \right)\) with \({\raise0.7ex\hbox{${\partial Q^{M*} \left( k \right)}$} \!\mathord{\left/ {\vphantom {{\partial Q^{M*} \left( k \right)} {\partial k}}}\right.\kern-0pt} \!\lower0.7ex\hbox{${\partial k}$}} > 0\). Given linear benefits from mitigation, countries have dominant strategies in the M-equilibrium. Hence, there is no difference between the Stackelberg assumption and Nash–Cournot assumption.

In the “Geoengineering-equilibrium” (G-equilibrium), SRM technologies are deployed, i.e., \(z_{i}^{*} = 1\).Footnote 9 Consequently, based on payoff function (1b), equilibrium individual and total mitigation levels are given by \(q_{S}^{{G^{*} }} \left( k \right) = k\left( {\frac{b}{c} - \frac{1}{nc}} \right)\), \(q_{NS}^{{G^{*} }} = \frac{b}{c} - \frac{1}{nc}\) and \(Q^{{G^{*} }} \left( k \right) = \left( {k^{2} + n - k} \right)\left( {\frac{b}{c} - \frac{1}{nc}} \right)\). Given linear benefits from mitigation, countries have dominant strategies in the G-equilibrium. Hence, the Stackelberg assumption results in the same equilibrium mitigation levels as the Nash–Cournot assumption. In order to guarantee non-negative mitigation levels, we impose condition \(C_{1} : = b \ge \frac{1}{n}\). Inserting equilibrium mitigation levels in payoff function (1b), we derive payoffs of signatories and non-signatories \(\pi_{S}^{{G^{*} }} \left( k \right)\) and \(\pi_{NS}^{{G^{*} }} \left( k \right),\) respectively.

Millard-Ball (2012) assumes that the global mitigation level in the M-equilibrium always falls short of the level \(g\) such that SRM always pays, i.e., even in the grand coalition, i.e., \(Q^{{M^{*} }} \left( n \right) < g.\) This gives condition \(C_{2} : = g > \frac{{bn^{2} }}{c}\). This leads to the consideration of an “Avoidance-equilibrium” (A-equilibrium), in which countries increase their mitigation efforts in order to avoid the deployment of SRM.

In the A-equilibrium, \(Q^{{A^{*} }} \left( k \right) \ge g\) must hold by assumption, such that \(z_{i}^{*} = 0.\) In this equilibrium, it now makes a difference whether the NC- or the ST-scenario is assumed. This has been overlooked by Finus and Furini (2023).

In the NC-scenario, we assume that non-signatories do not contribute to the extra effort to make SRM unattractive. That is, non-signatories stick to their dominant strategy, \(q_{NS}^{{A^{NC*} }} = q_{NS}^{{G^{*} }}\). Hence, all effort must be exerted by signatories. Given that signatories, according to their first order conditions in an interior equilibrium, would normally set the sum of marginal benefits equal to their individual marginal costs of mitigation, signatories have no incentive to provide more mitigation than is required to render SRM an unattractive choice. Hence, \(Q^{{A^{NC*} }} \left( k \right) = g\). Consequently, \(q_{S}^{{A^{NC*} }} \left( k \right) = \frac{{g - \left( {n - k} \right)q_{NS}^{{A^{NC*} }} }}{k} =\)\(\frac{{cgn - n^{2} b + bkn + n - k}}{ckn}\) and \(q_{NS}^{{A^{NC*} }} = q_{NS}^{{G^{*} }} = \frac{b}{c} - \frac{1}{nc}\). Inserting equilibrium mitigation levels in payoff function (1a), gives payoffs \(\pi_{S}^{{A^{{NC^{*} }} }} \left( k \right)\) and \(\pi_{NS}^{{A^{NC*} }} \left( k \right)\) in the A-equilibrium in the NC-scenario.

In the ST-scenario, signatories can use their leadership role in order to shift part of the mitigation burden to non-signatories. They will choose the lowest possible contribution level \(q_{S}^{{A^{ST*} }}\) such that non-signatories will still find it attractive to provide the additional mitigation required to reach the total mitigation level \(Q^{{A^{*} }} = g\) in order to avoid the deployment of geoengineering.

Consider the coalition of size \(n - 1\), which is relevant when testing for the stability of the grand coalition. The individual non-signatory, the free-rider, who has left the grand coalition, needs to provide \(q_{NS}^{{A^{ST*} }} = g - \left( {n - 1} \right)q_{S}^{{A^{{ST^{*} }} }}\), such that the level \(Q^{{A^{ST*} }} \left( {n - 1} \right) = g\) in the A-equilibrium is achieved and the SRM technology is not deployed. This Stackelberg strategy of signatories only works if the non-signatory prefers contributing \(q_{NS}^{{A^{ST*} }}\) rather than only \(q_{NS}^{{G^{*} }}\) as a response to \(q_{S}^{{A^{{ST^{*} }} }}\) where in the former case the SRM-technology is not deployed and in the latter case it is deployed in the last stage. Hence, in order for the A-equilibrium to arise in the Stackelberg scenario, the following inequality must hold:

$$\begin{gathered} \pi_{NS}^{{A^{{ST^{*} }} }} \left( {n - 1} \right) = bg - \frac{c}{2}\left( {g - \left( {n - 1} \right)q_{S}^{{A^{ST*} }} } \right)^{2} \hfill \\ \,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\, \ge b\left( {\left( {n - 1} \right)q_{S}^{{A^{ST*} }} + q_{NS}^{{G^{*} }} } \right) - \frac{c}{2}q_{NS}^{{G^{*} 2}} + \frac{1}{n}\left( {g - \left( {\left( {n - 1} \right)q_{S}^{{A^{ST*} }} + q_{NS}^{{G^{*} }} } \right)} \right) - \frac{{\left( {n - 1} \right)}}{n}d = \tilde{\pi }_{NS}^{G*} \left( {n - 1} \right). \hfill \\ \end{gathered}$$
(4)

Note that the G-equilibrium on the right-hand side is more attractive to non-signatories than the “original” G-equilibrium as \(q_{S}^{{A^{ST*} }} > q_{S}^{{G^{*} }} \left( {n - 1} \right)\) holds, i.e., \(\tilde{\pi }_{NS}^{G*} \left( {n - 1} \right) > \pi_{NS}^{G*} \left( {n - 1} \right)\). We can show that the inequality above is more likely to hold the larger \(q_{S}^{{A^{{ST^{*} }} }}\). Given that signatories want to contribute as little as possible to maximize their payoff, we find the smallest contribution level \(\underline {q}_{S}^{{A^{{ST^{*} }} }} = \frac{{gnc - bn + 1 - \sqrt {2dc{\kern 1pt} n\left( {n - 1} \right)} }}{{\left( {n - 1} \right)cn}}\) such that Eq. (4) holds with strict equality. Note that \(\underline {q}_{S}^{{A^{{ST^{*} }} }}\) decreases in \(d\). Hence, if signatories want to establish the Stackelberg A-equilibrium at \(n - 1,\) they can choose a lower mitigation level and consequently impose a greater burden on non-signatories the larger collateral damages from SRM are. The larger collateral damages are, the more attractive it is for non-signatories to stick to the A-equilibrium.

The details of all calculations in this section are provided in Appendix A.1.

3 Stability of the grand coalition in the cartel formation game

3.1 Nash–Cournot scenario

In the Nash–Cournot scenario, for any generic coalition \(k\), the decision of whether to play the A- or the G-equilibrium is taken by signatories. Non-signatories choose the same mitigation level in both equilibria, but signatories go the extra mile in the A-equilibrium and provide additional mitigation in order to reach the total mitigation level \(Q^{{A^{*} }} = g\) such that the deployment of SRM technologies does not pay. Hence, the A-equilibrium is preferred over the G-equilibrium if \(\pi_{S}^{{A^{{NC^{*} }} }} \left( k \right) - \pi_{S}^{{G^{*} }} \left( k \right) \ge 0\), and if \(\pi_{S}^{{A^{{NC^{*} }} }} \left( k \right) - \pi_{S}^{{G^{*} }} \left( k \right) < 0\), the reverse is true, signatories play their dominant strategy and the G-equilibrium is implemented.

Lemma 1 (Choice of equilibrium strategies in the NC-scenario)

For any coalition size \(k\), \(1 \le k \le n\), the A-equilibrium is played if \(d \ge \overline{d} \left( k \right)\), while the G-equilibrium is played if \(d < \overline{d} \left( k \right)\). \(\overline{d} \left( k \right)\) decreases in the coalition size \(k\), i.e., \(\frac{{\partial \overline{d}\left( k \right)}}{\partial k} < 0\).

Proof

See Appendix A.2.

Clearly, if \(d < \overline{d} \left( k \right)\), it would be perfectly rational (individually and globally) to use SRM in a coalition of size \(k\) in the first place, as expected collateral damages falls short of expected benefits from SRM. Thus, an agreement which does not use SRM (A-equilibrium) is only individually and globally rational if \(d \ge \overline{d} \left( k \right)\). The A-equilibrium becomes more attractive with the number of signatories, as each of them needs to contribute less to achieve the benchmark mitigation level \(Q^{A*} = g\) in order to make it unattractive to use SRM.

In the grand coalition, \(\overline{d} \left( n \right) \le d\) is needed such that it is rational to play the A- and not the G-equilibrium. If one country leaves the A-agreement, i.e., the coalition is of size \(n - 1\), either the A- or the G-equilibrium can be played. If the A-equilibrium is played, then it can easily be shown that the grand coalition is not stable. Hence, in the NC-scenario, only the G-equilibrium can serve as a deterrent punishment to stabilize the grand coalition. For this to be credible, we need \(d < \overline{d} \left( {n - 1} \right)\) such that the G- and not the A-equilibrium is played by all countries if one country leaves the agreement. Together, we require \(\overline{d} \left( n \right) \le d < \overline{d} \left( {n - 1} \right)\).

The grand coalition is internally stable, and, hence, stable (as the grand coalition is externally stable by definition), if \(\pi_{S}^{{A^{*} }} \left( n \right) \ge \pi_{NS}^{{G^{*} }} \left( {n - 1} \right)\) holds, which gives the condition \(\hat{d}\left( n \right) \le d\).

Altogether, we can state the following.

Proposition 1 (Stability of the grand coalition in the Nash–Cournot scenario)

In the cartel formation game and Nash–Cournot scenario, it can be individually and globally rational to implement the A-equilibrium in the grand coalition. If this is the case, then the A-equilibrium is stable if and only if after a deviation the G-equilibrium is played for which for the collateral damage parameter \(d\) the following must hold:

\(\overline{d} \left( n \right) < \hat{d}\left( n \right) \le d < \overline{d} \left( {n - 1} \right)\).

That is, collateral damages must lie between a lower \(\hat{d}\left( n \right)\) and an upper bound \(\overline{d} \left( {n - 1} \right)\).

Proof

See Appendix A.3.

Hence, in the NC-scenario, only the G-equilibrium can be used as punishment. The punishment is effective (stability holds) if the collateral damage parameter \(d\) is sufficiently large. The punishment is credible if and only if signatories prefer the G-equilibrium over the A-equilibrium once a country has left their agreement for which the collateral damage parameter cannot be too large. Conceptually, it is important to establish under which conditions the A-equilibrium is attractive in the first place, i.e., \(\overline{d} \left( n \right) \le d\). This has been ignored by Millard-Ball (2012), even though it turns out that this condition is not binding as the internal stability condition \(\hat{d}\left( n \right) < d\) is more demanding and we have \(\overline{d} \left( n \right) < \hat{d}\left( n \right)\).

3.2 Stackelberg scenario

When analyzing the stability of the grand coalition playing the A-equilibrium, the ST-scenario differs from the NC-scenario only in terms of possible punishment after a country has left the agreement. Stackelberg leadership increases the options with which signatories can punish the deviating country.

In the grand coalition, all countries are signatories and hence Stackelberg leadership does not play a role. The A-equilibrium is implemented in the first place if \(\overline{d} \left( n \right) \le d\), as observed above already (see Lemma 1). However, at coalition of size \(n - 1\), Stackelberg leadership may play a role. Obviously, Stackelberg leaders can always replicate their Nash–Cournot strategy. Consequently, one punishment option is to play the G-equilibrium and we recall from Sect. 2.1 that in the G-equilibrium mitigation strategies are not affected by Stackelberg leadership as countries have dominant strategies. Hence, the G-equilibrium is an effective punishment that stabilize the grand coalition for collateral damage levels for which \(\overline{d} \left( n \right) < \hat{d}\left( n \right) \le d < \overline{d} \left( {n - 1} \right)\) is true, as found for the NC-scenario. This possibility has been overlooked by Millard-Ball (2012). He focuses exclusively on the second punishment option with an A-equilibrium, which is not an option in the NC-scenario.

The A-equilibrium can constitute a punishment if signatories contribute less than what they do in the grand coalition, i.e., if \(q_{S}^{{A^{{ST^{*} }} }} \le \frac{g}{n}\). If this is the case, it follows that the individual non-signatory (i.e., the free-rider) will have to contribute \(q_{NS}^{{ST^{*} }} \ge \frac{g}{n}\) and the free-rider will be worse off than in the grand coalition. Hence, the A-equilibrium constitutes an effective punishment that stabilizes the grand coalition. From Sect. 2.2 we know that the free-rider accepts the A-equilibrium punishment only if \(q_{S}^{{A^{{ST^{*} }} }}\) is equal or above \(\underline {q}_{S}^{{A^{{ST^{*} }} }}\) where \(\underline {q}_{S}^{{A^{{ST^{*} }} }}\) decreases in \(d\). We also know that the free-rider is more likely to accept the A-equilibrium punishment the larger \(q_{S}^{{A^{{ST^{*} }} }}\). Hence, we can compute a lower bound for the collateral damage parameter, \(\overline{d}_{\min }^{ST} \left( {n - 1} \right)\), such that if \(d \ge \overline{d}_{\min }^{ST} \left( {n - 1} \right)\), the A-equilibrium constitutes an effective punishment that stabilizes the grand coalition. Because \(q_{NS}^{{A^{{ST^{*} }} }} \left( {q_{S}^{{A^{{ST^{*} }} }} \left( {n - 1} \right)} \right) \ge \frac{g}{n}\) and \(q_{S}^{{A^{{ST^{*} }} }} \left( {n - 1} \right) \le \frac{g}{n}\), we have internal stability: \(\pi_{S}^{{A^{*} }} \left( n \right) \ge \pi_{NS}^{{A^{{ST^{*} }} }} \left( {n - 1} \right)\); at the same time the punishment is credible because signatories are better off in the Stackelberg A-equilibrium at \(n - 1\) than in the A-agreement in the grand coalition, i.e., \(\pi_{S}^{{A^{*} }} \left( n \right) \le \pi_{S}^{{A^{{ST^{*} }} }} \left( {n - 1} \right)\). (Moreover, one can show that \(\pi_{S}^{{A^{{ST^{*} }} }} \left( {n - 1} \right) > \pi_{S}^{{G^{*} }} \left( {n - 1} \right)\), signatories prefer the A-equilibrium to the G-equilibrium at \(n - 1\).)

Proposition 2 (Stability of the grand coalition in the Stackelberg scenario)

In the cartel formation game and Stackelberg scenario, it can be individually and globally rational to implement the A-equilibrium in the grand coalition. If this is the case, then the A-equilibrium can be stable if after a deviation either the A- or the G-equilibrium is played:

(i) The A-equilibrium constitutes an effective and credible punishment provided that for the collateral damage parameter \(d\) the following holds:

$$\overline{d} \left( n \right) < \hat{d}\left( n \right) < \overline{d}_{\min }^{ST} \left( {n - 1} \right) \le d.$$

(ii) If instead \(d < \overline{d}_{\min }^{ST} \left( {n - 1} \right)\), the G-equilibrium can be used as an effective and credible punishment, provided that for the collateral damage parameter \(d\) the following holds:

$$\overline{d} \left( n \right) < \hat{d}\left( n \right) \le d < \overline{d} \left( {n - 1} \right).$$

Proof

See Appendix A.4.

Thus, with Stackelberg leadership signatories have a stronger position to enforce a stable climate agreement than under the NC-scenario which shows up in a larger parameter space for which the grand coalition can establish a stable A-equilibrium. For larger expected collateral damages, the A-equilibrium punishment works and for lower expected collateral damages the G-equilibrium punishment works. In both cases, punishment is deterrent and credible.

4 Stability of the grand coalition in the repeated game

In a repeated game, a free-rider gains a temporary free-rider gain before the deviation is discovered and punishment follows. Following many others, we consider the simple trigger punishment strategy to drive home our result.Footnote 10 Once signatories discover free-riding, they will stop cooperating and play a punishment in all subsequent periods. In an infinitely repeated game, this trigger strategy is subgame-perfect if the punishment constitutes a Nash-equilibrium (or Stackelberg-equilibrium) of the static game. Consistent with our previous assumption, we assume that the grand coalition implements the A-equilibrium as all countries prefer the A-equilibrium over the G-equilibrium, \(\pi_{S}^{{A^{*} }} \left( n \right) \ge \pi_{S}^{{G^{*} }} \left( n \right)\) because \(\overline{d} \left( n \right) \le d\), where equilibrium mitigation levels and payoffs are those derived in the cartel formation game (Sect. 2.2). Furthermore, we denote free-rider payoff by \(\pi_{NS}^{F}\) and denote the punishment payoff by \(\pi_{NS}^{{P^{*} }}\). The A-agreement is stable, provided

$$\frac{{\pi_{S}^{{A^{*} }} \left( n \right)}}{1 - \delta } \ge \pi_{NS}^{F} + \frac{{\delta \pi_{NS}^{{P^{*} }} }}{1 - \delta }\;\; \Rightarrow \;\;\;\delta \ge \delta_{\min }^{{}} : = \frac{{\pi_{NS}^{F} - \pi_{S}^{{A^{*} }} \left( n \right)}}{{\pi_{NS}^{F} - \pi_{NS}^{{P^{*} }} }}$$
(5)

holds. Countries comply if the net present value of complying is larger than the net present value of taking a free-ride and subsequently being punished. That is, if the discount factor by which countries discount time \(\delta\) exceeds a value \(\delta_{\min }^{{}}\), the threat of punishment is deterrent and the agreement is stable. Since \(\delta = \frac{1}{1 + r}\) with \(r\) the discount or time preference rate, \(0 \le \delta \le 1\). Obviously, the harsher the punishment, the lower \(\pi_{NS}^{{P^{*} }}\) and the lower the threshold \(\delta_{\min }^{{}}\).

In a first instance, we need to determine the free-rider payoff. Free-riding implies that the defector chooses the dominant mitigation level \(q_{NS}^{{G^{*} }}\) in the G-equilibrium and individually deploys SRM, receives the benefits but and does not suffer from collateral damages. All other \(n - 1\) compliant countries continue choosing the avoidance mitigation level \(q_{S}^{{A^{*} }} \left( n \right) = \frac{g}{n}\). Hence, the free-rider payoff is given by:

$$\pi_{NS}^{F} = b\left( {g - q_{S}^{{A^{*} }} \left( n \right) + q_{NS}^{{G^{*} }} } \right) - \frac{c}{2}\left( {q_{NS}^{{G^{*} }} } \right)^{2} + \left( {g - \left( {g - q_{S}^{{A^{*} }} \left( n \right) + q_{NS}^{{G^{*} }} } \right)} \right).$$
(6)

This exhaust the common assumption of the NC- and ST-scenario, as different punishment options arise in these two scenarios.

4.1 Nash–Cournot scenario

The only option for signatories to punish the free-rider in the NC-scenario is to stop cooperating and play the non-cooperative G-equilibrium with \(q_{NS}^{{G^{*} }} \left( 1 \right) = \frac{b}{c} - \frac{1}{nc}\) and \(z_{i}^{*} = 1\) as derived in subSect. 2.2. All countries receive the payoff \(\pi_{NS}^{{G^{*} }} \left( 1 \right)\). Essentially, punishment implies that the grand coalition breaks apart and the G-equilibrium is played among single players. For the G-equilibrium to be preferred over the A-equilibrium such that the G-punishment is a credible punishment, we need \(d < \overline{d} \left( 1 \right)\) from Lemma 1. Together with the condition that the A- is preferred over the G-equilibrium in the grand coalition, \(\overline{d} \left( n \right) \le d\), we require \(\overline{d} \left( n \right) \le d < \overline{d} \left( 1 \right).\) Again, as in the cartel formation game, the A-equilibrium cannot be used as punishment. All countries comply if

$$\frac{{\pi_{S}^{{A^{*} }} \left( n \right)}}{1 - \delta } \ge \pi_{NS}^{F} + \frac{{\delta \pi_{NS}^{{G^{*} }} \left( 1 \right)}}{1 - \delta }\;\; \Rightarrow \;\;\;\delta \ge \delta_{\min }^{G} : = \frac{{\pi_{NS}^{F} - \pi_{S}^{{A^{*} }} \left( n \right)}}{{\pi_{NS}^{F} - \pi_{NS}^{{G^{*} }} \left( 1 \right)}}$$
(7)

Given that \(\pi_{NS}^{{G^{*} }} \left( 1 \right)\) decreases in the collateral damage \(d\) and the other payoffs in (7) are not affected by collateral damages, we can conclude that the minimum required discount factor \(\delta_{\min }^{G}\) decreases with \(d\).

Proposition 3 (Stability of the grand coalition in the repeated game and NC-scenario)

In the repeated game and Nash–Cournot scenario, the grand coalition implementing the A-equilibrium and punishing with a G-equilibrium can be stable (subgame perfect Nash equilibrium) if for the collateral damage parameter of SRM \(d\) the following holds:

$$\overline{d} \left( n \right) < d < \overline{d} \left( 1 \right).$$

That is, expected collateral damages must lie between a lower \(\overline{d} \left( n \right)\) and an upper bound \(\overline{d} \left( 1 \right).\)

The agreement is stable if for the discount factor \(\;\delta \ge \delta_{\min }^{G}\) holds, with \(0 < \delta_{\min }^{G} < 1\), \(\delta_{\min }^{G}\) decreasing in the collateral damage level \(\;d\) and \(\delta_{\min }^{G}\) as defined in Eq. (7).

Proof

See Appendix A.5.

Thus, the qualitative conclusion in Proposition 3 for the repeated game is the same as derived for the cartel formation game in Proposition 1. The collateral damage must be sufficiently large such that punishment is deterrent, but cannot be too large, as otherwise the threat to punish with the deployment of SRM technologies is not credible. Within the feasibility range, \(\overline{d} \left( n \right) < d < \overline{d} \left( 1 \right)\), the higher the collateral damage, the lower the minimum discount factor required for stability. Again, it should be noted that if SRM were not to pay even if there is no agreement at all, i.e., \(\overline{d} \left( 1 \right) \le d,\) the grand coalition implementing the A-equilibrium would not be stable, as no deterrent punishment would be available.

4.2 Stackelberg scenario

Compared to the NC-scenario, and similarly to what we found for the cartel formation game, in the ST-scenario signatories have an additional punishment option. Given the G-equilibrium is not affected by Stackelberg leadership, this can also be used as a punishment under the same condition as in the NC-scenario, i.e., if \(\overline{d} \left( n \right) < d < \overline{d} \left( 1 \right)\). Additionally, the A-equilibrium can also be used as punishment if the \(n - 1\) remaining signatories act as Stackelberg leaders and implement a A-equilibrium in which they contribute \(q_{S}^{{A^{ST} }} \le \frac{g}{n}\). As shown previously in Proposition 2, this is possible if \(\overline{d} \left( n \right) < \overline{d}_{\min }^{ST} \left( {n - 1} \right) \le d\). This punishment will be credible as \(\pi_{S}^{{A^{{ST^{*} }} }} \left( {n - 1} \right) \ge \pi_{S}^{{A^{*} }} \left( n \right) \ge \pi_{S}^{{G^{*} }} \left( n \right) > \pi_{NS}^{{G^{*} }} \left( 1 \right)\) holds for \(\overline{d} \left( n \right) < \overline{d}_{\min }^{ST} \left( {n - 1} \right) \le d\). Given that the A-equilibrium is played in the grand coalition and the A-equilibrium is an effective punishment, for which \(\overline{d} \left( n \right) < \overline{d}_{\min }^{ST} \left( {n - 1} \right) \le d\) is required such that the free-rider accepts this punishment, all countries will comply in an infinitely repeated game, provided

$$\delta \ge \delta_{\min }^{A} : = \frac{{\pi_{NS}^{F} - \pi_{S}^{{A^{*} }} \left( n \right)}}{{\pi_{NS}^{F} - \pi_{NS}^{{A^{{ST^{*} }} }} \left( {n - 1} \right)}}$$
(8)

holds. As we know from Sect. 2.2, the punishment payoff \(\pi_{NS}^{{A^{{ST^{*} }} }} \left( {n - 1} \right)\) depends on signatories’ mitigation level \(q_{S}^{{A^{{ST^{*} }} }}\). The harshest possible punishment coincides with the lowest possible mitigation level \(\underline {q}_{S}^{{A^{{ST^{*} }} }}\) which decreases in the collateral damage parameter \(d\). Given that the other payoffs in (8) are not affected by parameter \(d\), we can conclude that the minimum discount factor \(\delta_{\min }^{A}\) decreases in \(d\). Additionally, we note that, given \(\overline{d} \left( n \right) < \overline{d}_{\min }^{ST} \left( {n - 1} \right) \le d\) and \(\pi_{NS}^{{A^{{ST^{*} }} }} \left( {n - 1} \right) > \pi_{NS}^{{G^{*} }} \left( {n - 1} \right) > \pi_{NS}^{{G^{*} }} \left( 1 \right)\) hold, it follows that the minimum discount factor requirement is larger with the A- than the G-punishment, i.e., \(\delta_{\min }^{A} > \delta_{\min }^{G}\). That is, the A-equilibrium is a weaker punishment than the G-equilibrium. Hence, when both punishment options establish cooperation, signatories will choose the G-equilibrium punishment.

Proposition 4 (Stability of the grand coalition in the repeated game and ST-scenario)

In the repeated game and Stackelberg scenario, the grand coalition implementing the A-equilibrium can be stable (subgame perfect Nash equilibrium) under two punishment options:

(i) The G-equilibrium is used as punishment if for the collateral damage parameter \(d\) \(\overline{d} \left( n \right) < d < \overline{d} \left( 1 \right)\) holds.

That is, expected collateral damages must lie between a lower and an upper bound. The agreement is stable if for the discount factor \(\;\delta \ge \delta_{\min }^{G}\) holds, with \(0 < \delta_{\min }^{G} < 1\), \(\delta_{\min }^{G}\) decreasing in the collateral damage parameter \(\;d\) and \(\delta_{\min }^{G}\) as defined in Eq. (7).

(ii) The A-equilibrium is used as punishment if for the collateral damage parameter \(d\) \(\overline{d} (n) < \overline{d}_{\min }^{ST} < \overline{d} \left( 1 \right) \le d\) holds.

That is, expected collateral damages must be sufficiently large.

The agreement is stable if for the discount factor \(\;\delta \ge \delta_{\min }^{A}\) holds, with \(0 < \delta_{\min }^{A} \le 1\), \(\delta_{\min }^{A}\) decreasing in the collateral damage level \(\;d\), \(\delta_{\min }^{A}\) as defined in Eq. (8) and \(\delta_{\min }^{A} > \delta_{\min }^{G}\).

Proof

See Appendix A.6.

In line with what we find in the cartel formation game, Stackelberg leadership increases the punishment options available to signatories in the repeated game too. When the G-equilibrium is a credible punishment, it will be used by signatories as it is the harsher punishment than the A-equilibrium punishment. If the G-equilibrium does not work as a punishment, then the A-equilibrium can be an effective punishment, even though it is a weaker punishment, resulting in a larger minimum discount factor requirement.

4.3 Renegotiation-proof punishment

An interesting twist of our previous results emerges if we consider renegotiation-proofness which is a refinement of subgame-perfection. The version which we consider has been proposed by Farrell and Maskin (1989) and has been applied for instance by Asheim et al. (2006), Asheim and Holtsmark (2009), Barrett (1994, 2002), Finus and Rundshagen (1998a, b) in the context of international environmental agreements. A punishment is only credible if the players conducting the punishment are not worse off. Consequently, as \(\pi_{S}^{{A^{*} }} \left( n \right) \ge \pi_{S}^{{G^{*} }} \left( n \right) > \pi_{NS}^{{G^{*} }} \left( 1 \right)\), where the first inequality follows from \(\overline{d} \left( n \right) \le d\) and the second inequality follows from the fact that full cooperation delivers higher payoffs than the Nash-equilibrium, the harsher punishment with the G-equilibrium is not renegotiation-proof. However, the weaker punishment with the A-equilibrium is renegotiation-proof as \(\pi_{S}^{{A^{*} }} \left( n \right) \le \pi_{NS}^{{A^{{ST^{*} }} }} \left( {n - 1} \right)\) if \(\overline{d} \left( n \right) < \overline{d}_{\min }^{ST} \left( {n - 1} \right) \le d\). This punishment option is only available under Stackelberg leadership. Hence, without Stackelberg leadership, the grand coalition implementing the A-equilibrium cannot be supported as a renegotiation-proof equilibrium and with Stackelberg leadership, only the A-equilibrium punishment is renegotiation proof. In Appendix A.6, we provide further details.

5 Conclusion

We analyzed the governance structure of a self-enforcing climate agreement aiming at reducing greenhouse gases in the light of geoengineering in the form of solar radiation management (SRM) with the possibility of high collateral damages. In particular, we focused on the design and credibility of threats that can be used to deter free-riding under two scenarios, the Nash–Cournot and Stackelberg scenario. We investigated stability in two settings, the cartel formation game and the repeated game. Hence, we qualify, extend and compare the results of Millard-Ball (2012) and Finus and Furini (2023).

We showed that a climate agreement on mitigation can be designed that avoids the deployment of SRM by increasing mitigation levels such that SRM is not attractive. Under the Nash–Cournot scenario, the only effective threat available to signatories is the deployment of SRM. This threat works if expected collateral damages are high enough such that SRM is avoided in the grand coalition and leaving the coalition is not profitable, but not too high so that the threat of SRM deployment once a country leaves the agreement is credible. Thus, the threat to deploy SRM can be a powerful tool to stabilize a climate agreement; it can lead to large stable climate agreements with a high global mitigation effort, but it is not as simple as Millard-Ball (2012) finds for the Stackelberg scenario. If collateral damages are expected to be very high, then this stabilisation strategy does not work. Thus, if SRM was never considered a rational strategy, climate agreements which exclusively focus on mitigation will not achieve much, a feature confirmed by past climate agreements. However, this is different if signatories can choose mitigation levels before non-signatories. In the Stackelberg scenario, an additional threat is available. Signatories can now decrease their mitigation efforts and leave to non-signatories the mitigation burden in order to avoid the use of SRM. This threat is credible and effective if collateral damages are high enough, and no upper bound is required, in line with Millard-Ball (2012). However, even if collateral damages are below this threshold, signatories still have the possibility to stabilize the grand coalition using the deployment of SRM as a threat, as found in the NC-scenario. Hence, Stackelberg leadership increases the threat options available for signatories and expands the collateral damage parameter range under which stability can be achieved. Now, successful agreements providing high mitigation levels that render the use of SRM unattractive can be stable even when SRM is never considered a rational strategy due to very high expected collateral damages. Of course, the reverse is also true for our model: if expected collateral damages are expected to be very low, none of the threats works.

For future research, many issues come to mind. However, we believe the most important extensions would deal with more general payoff functions than those considered by Millard-Ball (2012) and Finus and Furini (2023), the analysis whether a moratorium on research on SRM technologies would be enforceable and effective and how adaptation in addition to mitigation affects the governance structure in climate change in the light of SRM.