The probabilistic pool punishment proportional to the difference of payoff outperforms previous pool and peer punishment

Ohdaira, Tetsushi

doi:10.1038/s41598-022-10582-5

Article
Open access
Published: 22 April 2022

Volume 12, article number 6604 (2022)

Scientific Reports

Tetsushi Ohdaira ORCID: orcid.org/0000-0003-2686-9575¹

Abstract

The public goods game is a multiplayer version of the prisoner’s dilemma game. In the public goods game, punishment on defectors is necessary to encourage cooperation. There are two types of punishment: peer punishment and pool punishment. Pool punishment is disadvantageous in comparison with peer punishment because it incurs fixed costs, especially if second-order free riders (those who invest in public goods but do not punish defectors) are not punished. In order to eliminate this flaw of pool punishment, this study proposes the probabilistic pool punishment proportional to the difference of payoff. In the proposed pool punishment, each punisher pays the cost to the punishment pool with a probability proportional to the difference between his/her payoff and the average payoff of his/her opponents. In pool punishment of previous studies, cooperators who do not punish defectors become dominant instead of pool punishers with fixed costs. In the proposed pool punishment, however, more punishers and fewer cooperators coexist, and this state is more robust against the invasion of defectors due to mutation than those of previous pool and peer punishment. The average payoff is also comparable to that of peer punishment in previous studies.

Introduction

In the context of social dilemmas, it is widely known that a system of punishment is necessary for the evolution of cooperation. Punishment encourages cooperation; however, those who punish others lose fitness because the cost of punishment reduces their payoff. For this reason, whether punishment is really necessary for the evolution of cooperation, and especially whether a system of punishment will evolve, is highly controversial. For example, some studies¹⁻⁷ take a negative view of punishment, while others⁸⁻¹⁸ take a positive view.

Among the previous studies that take a negative view, Dreber et al.² state that, in the prisoner’s dilemma game, the introduction of peer punishment reduces the average payoff of the population, as does the introduction of pool punishment. Peer punishment is one of the two types of punishment: peer punishers directly punish uncooperative opponents by paying some cost. Pool punishment is the other: when pool punishers pay the investment, they also pay the cost to the punishment pool in advance. In the public goods game, which is a multiplayer version of the prisoner’s dilemma game, Rand and Nowak⁵ include the full set of punishment strategies and find that punishment no longer increases cooperation, and that natural selection favours substantial levels of antisocial punishment for a wide range of parameters. They also show that the model predictions are consistent with the results of behavioural experiments, and that punishment is mostly a self-interested tool for protecting oneself against potential competitors. Nowak⁶ states that punishment is not a mechanism for the evolution of cooperation, but only complements other mechanisms such as indirect reciprocity, group selection, and network reciprocity. Wu et al.⁷ describe that punishment does not necessarily facilitate cooperation in one-to-one interactions such as the prisoner’s dilemma game.

Among the studies that take a positive view, Traulsen et al.¹² consider the public goods game with cooperators, defectors, cooperative punishers, and players who abstain from public goods interactions. They show that, when mutation rates are small, cooperation (and punishment) is possible only if interactions are voluntary. Sigmund et al.¹⁴ compare the prevailing model of peer punishment with pool punishment. Pool punishment facilitates the sanction of second-order free-riders (who cooperate but do not sanction), because those free-riders can be distinguished even if everyone contributes to the common good. Garcia and Traulsen¹⁶ state that Rand and Nowak⁵ discuss very limited cases where players refrain from collective action, and show that cooperators who punish only defectors can thrive when the abstainers are isolated, even if players have the choice of antisocial behaviour. Perc and Szolnoki¹⁷ propose the adaptive punishment that allows players to change the degree to which they punish their opponents in proportion to the degree of success of cooperation. The adaptive punishment promotes the reciprocity based on spatial connections between players (below simply referred to as connections), and as a result, enhances cooperation. Nakamaru and Dieckmann¹⁸ investigate the evolution of social reaction norms and mainly show that the mechanism by which the evolution of enhanced cooperation and that of stricter punishment reaction norms reinforce each other works best when punishment is severe.

Punishment is discussed in more detail below. Regarding peer punishment in the public goods game, Helbing et al.¹⁹ show that the consideration of punishment enables us to understand the formation and development of cooperators who punish defectors. Szolnoki et al.²⁰ study the impact of pool punishment on the spatial public goods game with cooperators, defectors, and pool punishers as three competitive strategies. Helbing et al.¹⁹,²¹,²² specifically compare the efficiency of pool punishment with that of peer punishment in maintaining social advantage. Chen et al.²³ show that the introduction of punishment has a positive effect in the public goods game, especially for large group-sized cooperation, but is not optimal for medium group-sized cooperation. Sasaki et al.²⁴ study deposits that will be refunded as long as committers adhere to the donation game and punish free-riders and non-committers.

Other studies regarding punishment are as follows. Gardner and West²⁵ state that individuals show greater cooperation when interacting with those who are highly likely to punish others. Egas and Riedl²⁶ show that punishment is strongly dominated by its cost-to-impact ratio. Traulsen et al.²⁷ show that the majority of subjects would choose pool punishment if second-order free-riders were also punished. Schoenmakers et al.²⁸ show that central agencies such as police can be crucial to the evolution of punishment. Perc²⁹ shows that pool punishment in structured populations is sustainable, but only in the case where second-order free riders are also sanctioned to the extent that they cannot prevail. Chen and Perc³⁰ show that the optimal distribution of resources within the framework of institutional punishment depends on whether absolute or degree-normalized payoffs are used. Perc et al.³¹ systematically review the main results obtained in the area of statistical physics of human cooperation and state that the problem of the cost of punishment can be solved by probabilistically sharing responsibility for sanctioning defectors. In the spatial prisoner’s dilemma game, Ohdaira³²,³³ shows that, when the probability of punishment changes according to the difference of payoff between players, cooperation evolves not only in different types of spatial structures but also in the case where both strategy and spatial structure evolve.

The alternative discussions regarding punishment are as follows. Szolnoki and Perc³⁴ consider traditional cooperators and defectors, as well as cooperators punishing defectors and defectors punishing cooperators. They show that antisocial punishment does not prevent cooperation if the synergistic effects are high enough to sustain cooperation based on the network reciprocity and is viable only if the synergistic effects are low, punishment is necessary for cooperation, and the cost-to-fine ratio is low. Chen and Szolnoki³⁵ reveal that cooperators should pay special attention to the growing capacity of renewable resources depending sensitively on the fraction of cooperators and the total consumption of all players in addition to a delicately adjusted punishment. Lee et al.³⁶ introduce a policelike or mercenary punisher who watches the population and punishes defectors and show that the maximal average outcome can be reached at an intermediate cost value of punishment.

This study focuses on peer punishment and pool punishment in the public goods game. As described before, peer punishment means that a player pays the cost and directly imposes the punishment on a defector, and pool punishment means that a player pays the cost to the punishment pool in advance. Each type has advantages and disadvantages. Peer punishment enables a player to punish a defector directly, but its cost of punishment is high. In contrast, pool punishment requires a lower cost of punishment than peer punishment, but a punisher must pay the cost to the punishment pool in advance, so fixed costs are incurred, especially if the punishment on second-order free riders (those who invest in public goods but do not punish defectors) is not considered.

In order to eliminate this flaw of pool punishment, this study proposes the probabilistic pool punishment proportional to the difference of payoff. In the proposed pool punishment, a pool punisher compares his/her payoff with the average payoff of the opponent public goods game participants and pays the cost to the punishment pool only if he/she is at a disadvantage in terms of payoff. This study considers the dynamics of four types of players, i.e., punishers (contributing to public goods and punishing defectors), defectors, cooperators (only contributing to public goods), and non-participants, in the public goods game on the regular and the random one-dimensional lattice with the average degree ⟨k⟩ = 4. Then, we compare the proposed pool punishment with peer and pool punishment of previous studies to investigate whether the flaw of pool punishment mentioned above can be eliminated.

Model

The public goods game of this study is based on the framework by Traulsen et al.¹² where investment in public goods is distributed to all participants, including the player who invested. When the number of participants in the game is n, the number of cooperators is N_c, and the investment of cooperators in public goods is c, cooperators will have the payoff of (rcN_c/n) − c, while defectors will gain the payoff of rcN_c/n. The value r is a factor multiplying all summed-up contributions, and the best response of the participants is defection (not investing in public goods). Therefore, in the public goods game, punishment on defectors is necessary to encourage cooperation. As described before, there are two types of punishment: peer punishment and pool punishment¹⁴. Peer punishers pay the investment c in public goods and then impose the punishment b on each defector in the group by paying the cost g per defector. Therefore, when there are N_y defectors and N_w peer punishers in the group, each defector will be punished with a sum of bN_w, and each peer punisher will bear the cost gN_y. On the other hand, when pool punishers pay the investment c, they also pay the cost G to the punishment pool in advance. Defectors will be fined the punishment BN_v, which is proportional to the number of pool punishers, N_v.
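The payoff rules above can be sketched in code as follows. This is only an illustrative sketch, not the author's implementation: the function name `group_payoffs`, the strategy labels, and the numeric values in the usage line are assumptions, and N_c is read here as counting every player who invests c (cooperators and both kinds of punishers). Pool punishers always pay G in this sketch, which corresponds to pool punishment with fixed costs, and no lower bound is applied to payoffs.

```python
def group_payoffs(strategies, r, c, sigma, b=0.0, g=0.0, B=0.0, G=0.0):
    """Return one payoff per player for a single public goods group.

    strategies holds hypothetical labels: "C" cooperator, "D" defector,
    "L" loner, "W" peer punisher, "V" pool punisher.
    """
    participants = [s for s in strategies if s != "L"]
    n = len(participants)
    # With fewer than two participants no game takes place: everyone gets sigma.
    if n <= 1:
        return [sigma] * len(strategies)

    # Assumption: every investing player (C, W, V) counts towards N_c.
    n_c = sum(s in ("C", "W", "V") for s in participants)
    n_y = participants.count("D")   # defectors
    n_w = participants.count("W")   # peer punishers
    n_v = participants.count("V")   # pool punishers
    share = r * c * n_c / n         # each participant's share of the public good

    payoffs = []
    for s in strategies:
        if s == "L":
            p = sigma
        elif s == "D":
            p = share - b * n_w - B * n_v   # fined by peer and pool punishers
        elif s == "C":
            p = share - c
        elif s == "W":
            p = share - c - g * n_y         # pays g for each defector punished
        else:  # "V"
            p = share - c - G               # pays the pool cost in advance
        payoffs.append(p)
    return payoffs


# Hypothetical values, chosen only to exercise the function.
example = group_payoffs(["C", "C", "D", "W"], r=3, c=1, sigma=1, b=1, g=0.3)
```

In this example the share is 3 · 1 · 3 / 4 = 2.25, so the cooperators and the punished defector both end at 1.25, while the peer punisher ends below them because of the extra cost g.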

Here, we consider the dynamics of four types of players, i.e., punishers (contributing to public goods and punishing defectors), defectors, cooperators (only contributing to public goods), and non-participants (below referred to as loners), in the public goods game on the regular and the random one-dimensional lattice with the average degree ⟨k⟩ = 4. Figure 1a,b shows sample spatial structures of the regular and the random one-dimensional lattice. A vertex represents a player, and the opponents of each player in public goods game interactions are defined by edges. Note that this figure has only 20 players so that each spatial structure can be grasped easily. The details of how to construct each spatial structure are described in the previous study of the author³². To ensure that the group of pool punishers or peer punishers outperforms the group of loners in terms of payoff, the payoff of loners (σ) should be smaller than (r − 1)c − G in the case of pool punishment, or (r − 1)c in the case of peer punishment.
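To make the two connection structures concrete, the sketch below builds adjacency sets for 20 players with average degree 4. It is an assumption-laden illustration: the regular case uses a standard ring in which each player links to the two nearest players on each side, and the random case simply places the same number of edges uniformly at random. The actual construction used in this study is the one described in ref. 32 and may differ; the function names are hypothetical.

```python
import random


def regular_ring(n_players, k=4):
    """Ring lattice: each player is linked to k/2 nearest players per side."""
    half = k // 2
    adj = {i: set() for i in range(n_players)}
    for i in range(n_players):
        for step in range(1, half + 1):
            j = (i + step) % n_players
            adj[i].add(j)
            adj[j].add(i)
    return adj


def random_same_mean_degree(n_players, k=4, seed=None):
    """Assumed stand-in for the random lattice: n*k/2 edges placed at random."""
    rng = random.Random(seed)
    target_edges = n_players * k // 2
    adj = {i: set() for i in range(n_players)}
    edges = 0
    while edges < target_edges:
        i, j = rng.sample(range(n_players), 2)   # two distinct players
        if j not in adj[i]:                      # skip edges that already exist
            adj[i].add(j)
            adj[j].add(i)
            edges += 1
    return adj


ring = regular_ring(20)
rand = random_same_mean_degree(20, seed=1)
```

In the ring every player has exactly four opponents, whereas in the random variant only the average degree is fixed, so individual players can have more or fewer opponents.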

As described before, pool punishment is disadvantageous in comparison with peer punishment because it incurs fixed costs, especially if second-order free riders (those who invest in public goods but do not punish defectors) are not punished. In order to eliminate this flaw of pool punishment, this study proposes the probabilistic pool punishment proportional to the difference of payoff. In the proposed pool punishment, when the payoff of player i is \(P_{i}\) and the average payoff of the players with connections to player i is \(\overline{P_{i}}\), player i pays the cost G to the punishment pool with the probability \(q_{i} = (\overline{P_{i}} - P_{i}) / P_{i}\) (\(P_{i} > 0\)). If \(\overline{P_{i}}\) is equal to or larger than \(2P_{i}\), then \(q_{i}\) equals 1, and if \(\overline{P_{i}}\) is smaller than \(P_{i}\), then \(q_{i}\) equals 0. Therefore, as shown in Fig. 2, if a player has a smaller payoff than the average payoff of opponent players, he/she contributes to the punishment pool with high probability. On the other hand, if his/her payoff is nearly equal to the average payoff of opponent players, he/she hardly contributes to the punishment pool. The previous study³⁷ reports that the avoidance of overpunishing (too much punishment on defectors with high payoff) is essential for stable cooperation. In this study, to avoid overpunishing, the payoff of each player is set to 0 whenever it would become negative.
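The contribution probability defined above can be written as a small function. This is a minimal sketch with a hypothetical function name; the branch for a payoff of 0 is an assumption, because the text defines \(q_{i}\) only for \(P_{i} > 0\).

```python
def contribution_probability(own_payoff, neighbour_payoffs):
    """Probability q_i that player i pays the cost G to the punishment pool."""
    mean_nb = sum(neighbour_payoffs) / len(neighbour_payoffs)
    if own_payoff <= 0:
        # Assumption (not stated in the text): with a zero payoff, contribute
        # for certain if the neighbours do better on average, otherwise never.
        return 1.0 if mean_nb > 0 else 0.0
    q = (mean_nb - own_payoff) / own_payoff
    # q is capped at 1 when the mean is at least twice the own payoff
    # and floored at 0 when the mean is below the own payoff.
    return min(1.0, max(0.0, q))


# A player with payoff 2 whose opponents average 3 contributes with q = 0.5.
q_example = contribution_probability(2.0, [3.0, 3.0])
```

A player would then pay G in a given generation when a uniform random draw falls below the returned value.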

The behaviour of each player is updated according to the following rules. After punishing the opponents and being punished by them, each player i compares the new payoff \(P_{i}^{\prime}\) with the payoffs \(P_{j}^{\prime}\) of the players j in \(O_{i}\), where \(O_{i}\) represents the set of all players connected to player i. If \(\max(P_{j}^{\prime})\) is larger than \(P_{i}^{\prime}\), player i imitates the behaviour of the player with the highest payoff \(\max(P_{j}^{\prime})\). If there are multiple players with \(\max(P_{j}^{\prime})\), player i randomly imitates the behaviour of one of them. If \(\max(P_{j}^{\prime})\) is equal to \(P_{i}^{\prime}\), player i randomly adopts the behaviour of one of the players with the payoff \(\max(P_{j}^{\prime})\), including himself/herself. If \(\max(P_{j}^{\prime})\) is smaller than \(P_{i}^{\prime}\), player i does not change his/her behaviour. We consider a series of the public goods game, the process of imposing punishment, and the imitation of behaviour as one generation. One simulation is executed up to 600 generations in order to reach a sufficiently steady state, and the results are averaged over 20 simulation runs. Table 1 shows the specific parameter settings required for simulations. The values of each parameter conform to Traulsen et al.¹². Whenever only a single cooperator or defector joins the game, he/she acts as a loner. That is, if only one group member chooses to participate, then all group members receive the loner’s payoff σ. The value σ = 1 satisfies both conditions σ < (r − 1)c − G in the case of pool punishment and σ < (r − 1)c in the case of peer punishment.
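The imitation rule can be sketched as below. The function name, the data layout (lists indexed by player and a dict of neighbour lists), and the behaviour labels are assumptions made for illustration; whether all players are updated simultaneously from the same payoffs is also not specified here, so the function only returns the behaviour that one player would adopt.

```python
import random


def imitate(i, behaviours, payoffs, neighbours, rng=random):
    """Return the behaviour player i adopts after comparing payoffs."""
    best = max(payoffs[j] for j in neighbours[i])
    if best < payoffs[i]:
        # No connected player does better: keep the current behaviour.
        return behaviours[i]
    # Connected players holding the highest payoff.
    candidates = [j for j in neighbours[i] if payoffs[j] == best]
    if best == payoffs[i]:
        # On a tie with the own payoff, player i is one of the candidates.
        candidates.append(i)
    return behaviours[rng.choice(candidates)]


# Hypothetical three-player example in which everyone is connected.
behaviours = ["C", "D", "V"]
payoffs = [1.0, 2.0, 0.5]
neighbours = {0: [1, 2], 1: [0, 2], 2: [0, 1]}
next_behaviours = [imitate(i, behaviours, payoffs, neighbours) for i in range(3)]
```

Here players 0 and 2 copy player 1, who has the highest payoff, and player 1 keeps his/her own behaviour.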

Table 1 Parameter settings required for simulations.

Results

Below, we compare the proposed pool punishment with previous pool and peer punishment. Firstly, Fig. 3a–c shows the results for the regular one-dimensional lattice. Error bars indicate the standard deviation (the error bars in the following figure also show the standard deviation). As those results show, in pool punishment of previous studies, cooperators who do not punish defectors occupy almost the entire population instead of pool punishers who incur fixed costs. In this case, the average payoff does not reach 3 because a few defectors remain. However, in the proposed pool punishment, more punishers and fewer cooperators coexist. In peer punishment of previous studies, punishers and cooperators coexist in almost the same number. Those results show that the proposed pool punishment is more robust to the invasion of defectors due to mutation than peer punishment of previous studies. Besides, in terms of the average payoff, the proposed pool punishment is almost the same as previous peer punishment.

Secondly, Fig. 4a–c shows the results for the random one-dimensional lattice. As those results show, in pool punishment of previous studies, cooperators still dominate, although more punishers remain in the population than in the case of the regular one-dimensional lattice. On the other hand, in the proposed pool punishment, because the payoff of each player is not divided by his/her number of edges (degree), the difference of payoff between players, which depends on the degree of each player, is larger than in the case of the regular lattice. Therefore, compared to the regular case, the superiority of punishers over cooperators is reduced, and the number of punishers decreases. Nevertheless, punishers and cooperators coexist in almost the same number, and the robustness against the invasion of defectors due to mutation is maintained because the number of punishers is the largest among the three types of punishment. As with the results for the regular lattice, the average payoff of the proposed pool punishment is almost the same as that of previous peer punishment. In previous peer punishment, because the cost of punishment is high, a punisher with many opponent defectors cannot punish all of them, and cooperators then have an advantage over such a punisher.

The above results show that the proposed pool punishment solves the fixed cost problem of pool punishment of previous studies, and can build a state that is robust against the invasion of defectors due to mutation. In addition, as the results on the random lattice show, unlike in peer punishment of previous studies, punishment remains possible even when a punisher has many opponent defectors, because the cost of punishment is low.

Discussion

The results of this study do not cover the scale-free one-dimensional lattice, so the author describes here the consequences of the proposed pool punishment and of previous pool and peer punishment in that case. We compare the cases where the player with the most connections with opponents (the highest degree player) becomes a punisher or a cooperator after the elimination of defectors by punishers. When the highest degree player becomes a punisher, he/she has an advantage over cooperators because he/she can obtain enough payoff to offset the cost of punishment. On the other hand, when he/she becomes a cooperator, he/she has an advantage over punishers because punishers cannot gain enough payoff to offset the cost of punishment. For this reason, in the scale-free one-dimensional lattice, the number of simulations where punishers finally have an advantage is almost the same as the number where cooperators finally dominate, for the proposed pool punishment as well as for previous pool and peer punishment. This study does not consider punishment on cooperators, the so-called second-order punishment. Therefore, if the number of connections with opponents is the same and the number of punishers or cooperators among the opponents is also the same, the highest degree cooperator always has an advantage over punishers in terms of payoff. If second-order punishment is considered, the magnitude relation of payoff between cooperators and punishers naturally changes, and the result can then be expected to change.

The proposed pool punishment differs from other probabilistic punishment as follows. Chen et al.³⁸ consider probabilistic punishment as the simplest way of distributing the responsibility to sanction defectors. The probability of punishment is fixed among players and, unlike in this study, does not change depending on the difference of payoff. The following studies also discuss probabilistic punishment: class-specific probabilities of punishment based on a fixed number of classes³⁹, and the implicated punishment that has a working probability p (0 < p < 1) and includes the peer punishment on defectors with a probability q (0 < q < 1)⁴⁰. However, those probabilities are also fixed and do not change. Szolnoki and Perc⁴¹ consider the conditional punishment that, unlike in this study, does not depend on the difference of payoff but is proportional to the number of other conditional and unconditional punishers within the group. The proposed pool punishment is similar to Fehr and Schmidt’s inequity aversion⁴², in which players resist inequitable outcomes, i.e., they are willing to give up their payoff to realize more equitable outcomes. However, in their study, a player can punish all other players, whereas in this study a player can punish only the players having connections with him/her.

Another method similar to the probability of punishment of this study is the emotional profile⁴³,⁴⁴. Szolnoki et al.⁴³ introduce sympathy and envy as the two emotional profiles that determine the strategy of each player, and define them as the probability that each player cooperates with players having lower and higher payoff, respectively. The evolutionary process leads to a spontaneous fixation to a single emotional profile; however, this emotional profile depends not only on the payoff but also on the heterogeneity of connections between players. Szolnoki et al.⁴⁴ also consider the imitation of emotional profiles of neighbour players instead of pure strategy. The emotional profile of each player is determined by only two pivotal (not continuous) factors, namely how each player behaves towards less and more successful neighbour players. On the other hand, the probability of punishment of this study is continuous and based on the difference of payoff.

The following studies, although essentially different, utilize a method similar to the probabilistic pool punishment of this study. Iwasa and Lee⁴⁵ introduce the graduated punishment, in which the degree of punishment gradually changes based on the damage caused by selfish behaviour. They show that the graduated punishment is the most effective rule in the evolution of cooperation when the action of a player is incorrectly reported with a small probability and the sensitivity of players to the difference in the utility or payoff is not homogeneous. Helbing et al.²¹ investigate the evolution of cooperation in the spatial public goods game and especially show that increasing the fine of punishment induces a rise in the level of cooperation, whereas larger punishment fines do not have any positive effects. Jiang et al.⁴⁶ also describe that severe punishment is not necessarily more effective and that, if cooperation is likely, mild punishment leads to higher average payoffs.

This study proposes the probabilistic pool punishment proportional to the difference of payoff in order to eliminate the flaw of pool punishment in which pool punishers incur fixed costs, especially if second-order free riders (those who invest in public goods but do not punish defectors) are not punished. In the proposed pool punishment, each player pays the cost to the punishment pool with a probability depending on the difference between his/her payoff and the average payoff of the players with connections to him/her. In pool punishment of previous studies, cooperators who do not punish defectors become dominant instead of pool punishers who incur fixed costs. In the proposed pool punishment, however, more punishers and fewer cooperators coexist, and this state is more robust against the invasion of defectors due to mutation than those of previous pool and peer punishment. The average payoff is also comparable to that of peer punishment in previous studies.

In the future, the author will investigate whether the proposed pool punishment still prevents the invasion of defectors due to mutation and maintains a high average payoff in the cases where second-order free riders are punished¹⁴, or where all types of players can punish other players⁵. The author also intends to devise the probabilistic pool reward and to introduce the combination of reward and punishment as in the following previous studies⁴⁷,⁴⁸,⁴⁹. Szolnoki and Perc⁴⁷ discuss whether the combined application of reward and punishment is evolutionarily advantageous, and find rich dynamical behaviour that shows intricate phase diagrams where continuous and discontinuous phase transitions successively occur. Chen et al.⁴⁸ also propose the institutional sanctioning policy that switches the incentive from rewarding to punishing when the frequency of cooperators exceeds a threshold. They find that this policy establishes and recovers full cooperation at lower cost and under a wider range of conditions than either rewards or penalties alone. Góis et al.⁴⁹ show similar results that rewards (positive incentives) are essential to initiate cooperation and sanctions (negative incentives) are instrumental to maintain cooperation. As each parameter value of this study conforms to Traulsen et al.¹², the factor multiplying all summed-up contributions (r) equals 3, which is relatively large and somewhat induces cooperation. Future work also includes investigating whether the proposed pool punishment shows good results in the case of a low r value (e.g. r = 2), where cooperation does not easily evolve.

References

Sigmund, K., Hauert, C. & Nowak, M. A. Reward and punishment. Proc. Natl. Acad. Sci. U.S.A. 98, 10757–10762. https://doi.org/10.1073/pnas.161155698 (2001).
Article CAS PubMed PubMed Central ADS Google Scholar
Dreber, A., Rand, D. G., Fudenberg, D. & Nowak, M. A. Winners don’t punish. Nature 452, 348–351. https://doi.org/10.1038/nature06723 (2008).
Article CAS PubMed PubMed Central ADS Google Scholar
Herrmann, B., Thöni, C. & Gächter, S. Antisocial punishment across societies. Science 319, 1362–1367. https://doi.org/10.1126/science.1153808 (2008).
Article CAS PubMed ADS Google Scholar
Ohtsuki, H., Iwasa, Y. & Nowak, M. A. Indirect reciprocity provides only a narrow margin of efficiency for costly punishment. Nature 457, 79–82. https://doi.org/10.1038/nature07601 (2009).
Article CAS PubMed PubMed Central ADS Google Scholar
Rand, D. G. & Nowak, M. A. The evolution of antisocial punishment in optional public goods games. Nat. Commun. 2, 434. https://doi.org/10.1038/ncomms1442 (2011).
Article CAS PubMed ADS Google Scholar
Nowak, M. A. Five rules for the evolution of cooperation. Science 314, 1560–1563. https://doi.org/10.1126/science.1133755 (2006).
Article PubMed PubMed Central ADS Google Scholar
Wu, J.-J. et al. Costly punishment does not always increase cooperation. Proc. Natl. Acad. Sci. U.S.A. 106, 17448–17451. https://doi.org/10.1073/pnas.0905918106 (2009).
Article PubMed PubMed Central ADS Google Scholar
Fehr, E. & Gächter, S. Altruistic punishment in humans. Nature 415, 137–140. https://doi.org/10.1038/415137a (2002).
Article CAS PubMed ADS Google Scholar
Fehr, E. & Fischbacher, U. The nature of human altruism. Nature 425, 785–791. https://doi.org/10.1038/nature02043 (2003).
Article CAS PubMed ADS Google Scholar
Fowler, J. H. Altruistic punishment and the origin of cooperation. Proc. Natl. Acad. Sci. U.S.A. 102, 7047–7049. https://doi.org/10.1073/pnas.0500938102 (2005).
Article CAS PubMed PubMed Central ADS Google Scholar
O’Gorman, R., Henrich, J. & Van Vugt, M. Constraining free riding in public goods games: Designated solitary punishers can sustain human cooperation. Proc. R. Soc. B 276, 323–329. https://doi.org/10.1098/rspb.2008.1082 (2009).
Article PubMed Google Scholar
Traulsen, A., Hauert, C., De Silva, H., Nowak, M. A. & Sigmund, K. Exploration dynamics in evolutionary games. Proc. Natl. Acad. Sci. U.S.A. 106, 709–712. https://doi.org/10.1073/pnas.0808450106 (2009).
Article PubMed PubMed Central MATH ADS Google Scholar
Rankin, D. J., Santos, M. D. & Wedekind, C. The evolutionary significance of costly punishment is still to be demonstrated. Proc. Natl. Acad. Sci. U.S.A. 106, E135. https://doi.org/10.1073/pnas.0911990107 (2009).
Article CAS PubMed PubMed Central Google Scholar
Sigmund, K., De Silva, H., Traulsen, A. & Hauert, C. Social learning promotes institutions for governing the commons. Nature 466, 861–863. https://doi.org/10.1038/nature09203 (2010).
Article CAS PubMed ADS Google Scholar
Szolnoki, A., Szabó, G. & Czakó, L. Competition of individual and institutional punishments in spatial public goods games. Phys. Rev. E 84, 046106. https://doi.org/10.1103/PhysRevE.84.046106 (2011).
Article CAS ADS Google Scholar
Garcia, J. & Traulsen, A. Leaving the loners alone: Evolution of cooperation in the presence of antisocial punishment. J. Theor. Biol. 307, 168–173. https://doi.org/10.1016/j.jtbi.2012.05.011 (2012).
Article MathSciNet PubMed MATH ADS Google Scholar
Perc, M. & Szolnoki, A. Self-organization of punishment in structured populations. New J. Phys. 14, 043013. https://doi.org/10.1088/1367-2630/14/4/043013 (2012).
Article ADS Google Scholar
Nakamaru, M. & Dieckmann, U. Runaway selection for cooperation and strict-and-severe punishment. J. Theor. Biol. 257, 1–8. https://doi.org/10.1016/j.jtbi.2008.09.004 (2009).
Article MathSciNet PubMed MATH ADS Google Scholar
Helbing, D., Szolnoki, A., Perc, M. & Szabó, G. Evolutionary establishment of moral and double moral standards through spatial interactions. PLoS Comput. Biol. 6, e1000758. https://doi.org/10.1371/journal.pcbi.1000758 (2010).
Article MathSciNet CAS PubMed PubMed Central ADS Google Scholar
Szolnoki, A., Szabó, G. & Perc, M. Phase diagrams for the spatial public goods game with pool punishment. Phys. Rev. E 83, 036101. https://doi.org/10.1103/PhysRevE.83.036101 (2011).
Article CAS ADS Google Scholar
Helbing, D., Szolnoki, A., Perc, M. & Szabó, G. Punish, but not too hard: How costly punishment spreads in the spatial public goods game. New J. Phys. 12, 083005. https://doi.org/10.1088/1367-2630/12/8/083005 (2010).
Article ADS Google Scholar
Helbing, D., Szolnoki, A., Perc, M. & Szabó, G. Defector-accelerated cooperativeness and punishment in public goods games with mutations. Phys. Rev. E 81, 057104. https://doi.org/10.1103/PhysRevE.81.057104 (2010).
Article CAS ADS Google Scholar
Chen, X., Sasaki, T. & Perc, M. Evolution of public cooperation in a monitored society with implicated punishment and within-group enforcement. Sci. Rep. 5, 17050. https://doi.org/10.1038/srep17050 (2015).
Sasaki, T., Okada, I., Uchida, S. & Chen, X. Commitment to cooperation and peer punishment: Its evolution. Games 6, 574–587. https://doi.org/10.3390/g6040574 (2015).
Gardner, A. & West, S. A. Cooperation and punishment, especially in humans. Am. Nat. 164, 753–764. https://doi.org/10.1086/425623 (2004).
Egas, M. & Riedl, A. The economics of altruistic punishment and the maintenance of cooperation. Proc. R. Soc. B 275, 871–878. https://doi.org/10.1098/rspb.2007.1558 (2008).
Traulsen, A., Röhl, T. & Milinski, M. An economic experiment reveals that humans prefer pool punishment to maintain the commons. Proc. R. Soc. B 279, 3716–3721. https://doi.org/10.1098/rspb.2012.0937 (2012).
Schoenmakers, S., Hilbe, C., Blasius, B. & Traulsen, A. Sanctions as honest signals—the evolution of pool punishment by public sanctioning institutions. J. Theor. Biol. 356, 36–46. https://doi.org/10.1016/j.jtbi.2014.04.019 (2014).
Perc, M. Sustainable institutionalized punishment requires elimination of second-order free-riders. Sci. Rep. 2, 344. https://doi.org/10.1038/srep00344 (2012).
Chen, X. & Perc, M. Optimal distribution of incentives for public cooperation in heterogeneous interaction environments. Front. Behav. Neurosci. 8, 248. https://doi.org/10.3389/fnbeh.2014.00248 (2014).
Perc, M. et al. Statistical physics of human cooperation. Phys. Rep. 687, 1–51. https://doi.org/10.1016/j.physrep.2017.05.004 (2017).
Ohdaira, T. Evolution of cooperation by the introduction of the probabilistic peer-punishment based on the difference of payoff. Sci. Rep. 6, 25413. https://doi.org/10.1038/srep25413 (2016).
Ohdaira, T. A remarkable effect of the combination of probabilistic peer-punishment and coevolutionary mechanism on the evolution of cooperation. Sci. Rep. 7, 12448. https://doi.org/10.1038/s41598-017-12742-4 (2017).
Szolnoki, A. & Perc, M. Second-order free-riding on antisocial punishment restores the effectiveness of prosocial punishment. Phys. Rev. X 7, 041027. https://doi.org/10.1103/PhysRevX.7.041027 (2017).
Chen, X. & Szolnoki, A. Punishment and inspection for governing the commons in a feedback-evolving game. PLoS Comput. Biol. 14, e1006347. https://doi.org/10.1371/journal.pcbi.1006347 (2018).
Lee, H.-W., Cleveland, C. & Szolnoki, A. Mercenary punishment in structured populations. Appl. Math. Comput. 417, 126797. https://doi.org/10.1016/j.amc.2021.126797 (2022).
Dercole, F., De Carli, M., Della Rossa, F. & Papadopoulos, A. V. Overpunishing is not necessary to fix cooperation in voluntary public goods games. J. Theor. Biol. 326, 70–81. https://doi.org/10.1016/j.jtbi.2012.11.034 (2013).
Chen, X., Szolnoki, A. & Perc, M. Probabilistic sharing solves the problem of costly punishment. New J. Phys. 16, 083016. https://doi.org/10.1088/1367-2630/16/8/083016 (2014).
Chen, X., Szolnoki, A. & Perc, M. Competition and cooperation among different punishing strategies in the spatial public goods game. Phys. Rev. E 92, 012819. https://doi.org/10.1103/PhysRevE.92.012819 (2015).
Perc, M. & Szolnoki, A. A double-edged sword: Benefits and pitfalls of heterogeneous punishment in evolutionary inspection games. Sci. Rep. 5, 11027. https://doi.org/10.1038/srep11027 (2015).
Szolnoki, A. & Perc, M. Effectiveness of conditional punishment for the evolution of public cooperation. J. Theor. Biol. 325, 34–41. https://doi.org/10.1016/j.jtbi.2013.02.008 (2013).
Fehr, E. & Schmidt, K. M. A theory of fairness, competition, and cooperation. Quart. J. Econ. 114, 817–868 (1999).
Szolnoki, A., Xie, N.-G., Ye, Y. & Perc, M. Evolution of emotions on networks leads to the evolution of cooperation in social dilemmas. Phys. Rev. E 87, 042805. https://doi.org/10.1103/PhysRevE.87.042805 (2013).
Szolnoki, A., Xie, N.-G., Wang, C. & Perc, M. Imitating emotions instead of strategies in spatial games elevates social welfare. Europhys. Lett. 96, 38002. https://doi.org/10.1209/0295-5075/96/38002 (2011).
Iwasa, Y. & Lee, J.-H. Graduated punishment is efficient in resource management if people are heterogeneous. J. Theor. Biol. 333, 117–125. https://doi.org/10.1016/j.jtbi.2013.05.007 (2013).
Jiang, L.-L., Perc, M. & Szolnoki, A. If cooperation is likely punish mildly: Insights from economic experiments based on the snowdrift game. PLoS ONE 8, e64677. https://doi.org/10.1371/journal.pone.0064677 (2013).
Szolnoki, A. & Perc, M. Correlation of positive and negative reciprocity fails to confer an evolutionary advantage: Phase transitions to elementary strategies. Phys. Rev. X 3, 041021. https://doi.org/10.1103/PhysRevX.3.041021 (2013).
Chen, X., Sasaki, T., Brännström, Å. & Dieckmann, U. First carrot, then stick: How the adaptive hybridization of incentives promotes cooperation. J. R. Soc. Interface 12, 20140935. https://doi.org/10.1098/rsif.2014.0935 (2014).
Góis, A. R., Santos, F. P., Pacheco, J. M. & Santos, F. C. Reward and punishment in climate change dilemmas. Sci. Rep. 9, 16193. https://doi.org/10.1038/s41598-019-52524-8 (2019).

Acknowledgements

This work was supported by JSPS KAKENHI Grant Numbers JP17K18074 and JP20K11959. The author sincerely thanks the two anonymous referees for their positive comments and kind suggestions.

Author information

Authors and Affiliations

Institute of Information and Media, Aoyama Gakuin University, 5-10-1 Fuchinobe, Chuo-ku, Sagamihara-city, Kanagawa, 252-5258, Japan
Tetsushi Ohdaira

Contributions

T.O. designed and performed the research, analysed the data and wrote the paper.

Corresponding author

Correspondence to Tetsushi Ohdaira.

Ethics declarations

Competing interests

The author declares no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Ohdaira, T. The probabilistic pool punishment proportional to the difference of payoff outperforms previous pool and peer punishment. Sci Rep 12, 6604 (2022). https://doi.org/10.1038/s41598-022-10582-5

Received: 17 February 2022
Accepted: 08 April 2022
Published: 22 April 2022
DOI: https://doi.org/10.1038/s41598-022-10582-5
Springer Nature Limited
