Non-Cooperative Bargaining with Unsophisticated Agents

Trejo, Kristal K.; Juarez, Ruben; Clempner, Julio B.; Poznyak, Alexander S.

doi:10.1007/s10614-020-10003-7

Published: 02 July 2020

Computational Economics, volume 61, pages 937–974 (2023)

Kristal K. Trejo¹,
Ruben Juarez²,
Julio B. Clempner³ &
…
Alexander S. Poznyak¹

Abstract

A traditional non-cooperative bargaining situation involves two or more forward-looking players making offers and counteroffers alternately until an agreement is reached, with a penalty according to the time taken by players in the decision-making process. We introduce a game that aids myopic players to reach the equilibrium as if they were forward-looking agents. The key elements of the game are that players are penalized both for their deviation from the previous best-reply strategy and their time taken for the decision-making at each step of the game. It is shown that our game has an equilibrium not only for the traditional processes and utilities used in traditional non-cooperative bargaining literature, but for an expanded and very comprehensive set of stochastic processes (such as Markov processes) and utility functions. Our work not only complements traditional non-cooperative bargaining literature for myopic agents, but also enlarges the class of processes and functions where Rubinstein’s non-cooperative bargaining solutions might be defined and applied.

Notes

Rubinstein (1982) also studies the case of a fixed linear cost $c^{\iota } t \varDelta$, instead of exponential, associated at every step. Our work focuses on exponential discounting rather than linear since it produces richer solutions.
A utility pair $(\psi ^1,\psi ^2) \in \varPhi ^e$ if and only if $(\psi ^1,\psi ^2) \in \varPhi$ and there does not exist another utility pair $(\varphi ^1,\varphi ^2) \in \varPhi$ such that $\varphi ^1 \ge \psi ^1$, $\varphi ^2 \ge \psi ^2$.
While we currently assume that all the agents receive the same penalty $D^*(x_1, x_2, \dots x_t)$, our work can be extended to asymmetric penalties, for instance, when only the proposing agent is penalized. We note that penalizing all the agents symmetrically guarantees a faster convergence than an asymmetric penalty.
Notably, Kultti and Vartiainen (2010) link the cooperative and non-cooperative aspects of bargaining by showing that a multi-player Rubinstein-like non-cooperative bargaining game converges to a Nash bargaining solution (Nash 1950) when the time between offers tends to zero. However, the approach of Kultti and Vartiainen (2010) is narrower than the one considered in this paper. Indeed, our equilibrium works for general sets of transfers and can be generalized to the Markov processes studied in Sect. 4.
We thank one of the outstanding referees for suggesting this open question.

References

Abreu, D., & Manea, M. (2012). Bargaining and efficiency in networks. Journal of Economic Theory, 147(1), 43–70.
Abreu, D., & Manea, M. (2012). Markov equilibria in a model of bargaining in networks. Games and Economic Behavior, 75, 1–16.
Admati, A., & Perry, M. (1987). Strategic delay in bargaining. The Review of Economic Studies, 54(3),
Akin, Z. (2007). Time inconsistency and learning in bargaining games. International Journal of Game Theory, 36, 275–299.
Antipin, A. S. (2005). An extraproximal method for solving equilibrium programming problems and games. Computational Mathematics and Mathematical Physics, 45(11), 1893–1914.
Binmore, K., Osborne, M., & Rubinstein, A. (1992). Handbook of game theory (pp. 179–225). Amsterdam: Elsevier Science Publishers.
Binmore, K., Piccione, M., & Samuelson, L. (1998). Evolutionary stability in alternating-offers bargaining games. Journal of Economic Theory, 80, 257–291.
Binmore, K., Shaked, A., & Sutton, J. (1985). Testing noncooperative bargaining theory: a preliminary study. American Economic Review, 75(5), 1178–1180.
Brown, D., & Lewis, L. (1981). Myopic economic agents. Econometrica, 49(2), 359–368.
Carraro, C., Marchiori, C., & Sgobbi, A. (2007). Negotiating on water: insights from non-cooperative bargaining theory. Environment and Development Economics, 12, 329–349.
Clempner, J. B. (2016). Necessary and sufficient karush-kuhn-tucker conditions for multiobjective markov chains optimality. Automatica, 71, 135–142.
Clempner, J. B. (2018). Strategic manipulation approach for solving negotiated transfer pricing problem. Journal of Optimization Theory and Applications, 178(1), 304–316.
Clempner, J. B., & Poznyak, A. S. (2017). Negotiating the transfer pricing using the Nash bargaining solution. International Journal of Applied Mathematics and Computer Science, 27(4), 853–864.
Clempner, J. B., & Poznyak, A. S. (2018). Computing the transfer pricing for a multidivisional firm based on cooperative games. Economic Computation and Economic Cybernetics Studies and Research, 52(1), 107–126.
Clempner, Julio B. (2018). On lyapunov game theory equilibrium: Static and dynamic approaches. International Game Theory Review, 20(02), 1750033.
Demuynck, Thomas, Jean-Jacques Herings, P., Saulle, Riccardo D., & Seel, Christian. (2019). The myopic stable set for social environments. Econometrica, 87(1), 111–138.
Fudenberg, D., & Tirole, J. (1983). Sequential bargaining with incomplete information. The Review of Economic Studies, 50(2), 221–247.
Ghosh, P., Roy, N., Das, S., & Basu, K. (2005). A pricing strategy for job allocation in mobile grids using a non-cooperative bargaining theory framework. Journal of Parallel and Distributed Computing, 65, 1366–1383.
Guo, X., & Hernández-Lerma, O. (2009). Continuous-time Markov decision processes: Theory and applications. Berlin: Springer.
Haller, H. (1986). Non-cooperative bargaining of $n \ge 3$ players. Economics Letters, 22, 11–13.
Han, Lining, & Juarez, Ruben. (2018). Free intermediation in resource transmission. Games and Economic Behavior, 111, 75–84.
Hougaard, Jens Leth, & Tvede, Mich. (2012). Truth-telling and nash equilibria in minimum cost spanning tree models. European Journal of Operational Research, 222(3), 566–570. https://doi.org/10.1016/j.ejor.2012.05.023. http://www.sciencedirect.com/science/article/pii/S0377221712003724
Hougaard, Jens Leth, & Tvede, Mich. (2015). Minimum cost connection networks: Truth-telling and implementation. Journal of Economic Theory, 157, 76–99. https://doi.org/10.1016/j.jet.2014.12.009. http://www.sciencedirect.com/science/article/pii/S0022053114001860
Howard, N. (1971). Paradoxes of rationality: Theory of metagames and political behaviour. Cambridge: MIT Press.
Jandoc, Karl, & Juarez, Ruben. (2017). Self-enforcing coalitions with power accumulation. International Journal of Game Theory, 46(2), 327–355.
Jandoc, Karl, & Juarez, Ruben. (2019). An experimental study of self-enforcing coalitions. Games, 10(3), 31.
Juarez, Ruben, & Kumar, Rajnish. (2013). Implementing efficient graphs in connection networks. Economic Theory, 54(2), 359–403.
Juarez, Ruben, Ko, Chiu Yu., & Xue, Jingyi. (2018). Sharing sequential values in a network. Journal of Economic Theory, 177, 734–779.
Juarez, Ruben, Nitta, Kohei, & Vargas, Miguel. (2019). Profit-sharing and efficient time allocation. Economic Theory,.
Jun, B. (1989). Non-cooperative bargaining and union formation. Review of Economic Studies, 56, 59–76.
Kultti, K., & Vartiainen, H. (2010). Multilateral non-cooperative bargaining in a general utility space. International Journal of Game Theory, 39, 677–689.
Madani, K., & Hipel, K. (2011). Non-cooperative stability definitions for strategic analysis of generic water resources conflicts. Water Resources Management, 25(8), 1949–1977.
Manea, M. (2011). Bargaining in stationary networks. American Economic Review, 101, 2042–2080.
Marden, J. (2012). State based potential games. Automatica, 48, 3075–3088.
Montero, M. (2002). Non-cooperative bargaining in apex games and the kernel. Games and Economic Behavior, 41, 309–321.
Moulin, Herve. (2002). Axiomatic cost and surplus sharing. Handbook of Social Choice and Welfare, 1, 289–357.
Muthoo, A. (2002). Bargaining theory with applications. Cambridge: Cambridge University Press.
Nash, J. F. (1950). The bargaining problem. Econometrica, 18(2), 155–162.
Okada, A. (2016). A non-cooperative bargaining theory with incomplete information: Verifiable types. Journal of Economic Theory, 163, 318–341.
Ostrom, E. (1990). Governing the commons: The evolution of institutions for collective action. Cambridge: Cambridge University Press.
Ostrom, E. (1998). A behavioral approach to the rational choice theory of collective action. The American Political Science Review, 92(1), 1–22.
Ostrom, E., Gardner, R., & Walker, J. (1994). Rules, games, and common-pool resources. Ann Arbor: The University of Michigan Press.
Perry, M., & Reny, P. (1993). A non-cooperative bargaining model with strategically timed offers. Journal of Economic Theory, 59, 50–77.
Poznyak, A. S., Najim, K., & Gomez-Ramirez, E. (2000). Self-learning control of finite markov chains. New York: Marcel Dekker.
Ray, Debraj, & Vohra, Rajiv. (2015). The farsighted stable set. Econometrica, 83(3), 977–1011.
Rubinstein, A. (1982). Perfect equilibrium in a bargaining model. Econometrica, 50(1), 97–109.
Rubinstein, A., & Wolinsky, A. (1985). Equilibrium in a market with sequential bargaining. Econometrica, 53(5), 1133–1150.
Selbirak, T. (1994). Some concepts of non-myopic equilibria in games with finite strategy sets and their properties. Annals of Operations Research, 51(2), 73–82.
Sutton, J. (1986). Non-cooperative bargaining theory: An introduction. Review of Economic Studies, 53(5), 709–724.
Thomson, William. (1994). Cooperative models of bargaining. Handbook of Game Theory with Economic Applications, 2, 1237–1284.
Thomson, William. (2003). Axiomatic and game-theoretic analysis of bankruptcy and taxation problems: a survey. Mathematical Social Sciences, 45(3), 249–297.
Trejo, K. K., Clempner, J. B., & Poznyak, A. S. (2015). Computing the stackelberg/nash equilibria using the extraproximal method: Convergence analysis and implementation details for markov chains games. International Journal of Applied Mathematics and Computer Science, 25(2), 337–351.
Trejo, K. K., Clempner, J. B., & Poznyak, A. S. (2016). An optimal strong equilibrium solution for cooperative multi-leader-follower Stackelberg Markov chains games. Kybernetika, 52(2), 258–279.
Trejo, K. K., Clempner, J. B., & Poznyak, A. S. (2017). Computing the strong ${L}_p$-Nash equilibrium for Markov chains games: convergence and uniqueness. Applied Mathematical Modelling, 41, 399–418.
Trejo, K. K., Clempner, J. B., & Poznyak, A. S. (2019). Proximal constrained optimization approach with time penalization. Engineering Optimization, 51(7), 1207–1228.
Winoto, P., McCalla, G., & Vassileva, J. (2005). Non-monotonic-offers bargaining protocol. Autonomous Agents and Multi-Agent Systems, 11(1), 45–67.

Author information

Authors and Affiliations

Department of Automatic Control, Center for Research and Advanced Studies, Av. IPN 2508, Col. San Pedro Zacatenco, 07360, Mexico City, Mexico
Kristal K. Trejo & Alexander S. Poznyak
Department of Economics, University of Hawaii, 2424 Maile Way, Saunders Hall 542, Honolulu, HI, 96822, USA
Ruben Juarez
Escuela Superior de Física y Matemáticas, Instituto Politécnico Nacional, Building 9, Av. Instituto Politécnico Nacional, San Pedro Zacatenco, Gustavo A. Madero, 07738, Mexico City, Mexico
Julio B. Clempner

Corresponding author

Correspondence to Ruben Juarez.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extensions

We now present two extensions of the bargaining game described above: one in which agents have different discount factors, and another in which agents may coordinate their demands. Convergence in both cases follows directly from the general analysis presented above.

1.1 Bargaining Under Different Discounting

In this approach we present a solution in which, at each step of the negotiation process, players calculate the Nash equilibrium considering the utility functions of all players, with the particularity that each player internally reaches this equilibrium point at a different time. Following the description of the model presented previously, we redefine the advantage of proposing a new offer, which depends on the utility functions, as

$$\begin{aligned} f(x_{t},x_{t+1}):=\sum \limits _{\iota =1}^{{\mathfrak {n}}}\left[ \psi ^{\iota }(x_{t+1})-\psi ^{\iota }(x_{t})\right] \ge 0 \end{aligned}$$

Here $f$ measures the joint gain for all players from rejecting the offer $x_t$ and making a new offer $x_{t+1}$, $T(x_{t+1})>0$ is the time spent to benefit from this advantage, and $\alpha ^{\iota } (x_{t})$ is the weight that players put on their advantages to reject the offer $x_t$. Thus, the advantages to reject the offer $x_{t}$ and to propose a new offer $x_{t+1}$ are given by $A(x_{t},x_{t+1})=\alpha (x_{t})T(x_{t+1})f(x_{t},x_{t+1})$.
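As a minimal sketch of how the joint gain $f$ and the advantage $A$ could be evaluated, consider the toy example below. The two quadratic utilities, the constant weight `alpha` and the constant time function `T` are illustrative assumptions, not quantities taken from the paper.

```python
def gain(utilities, x_t, x_next):
    """f(x_t, x_{t+1}): sum over players of psi(x_{t+1}) - psi(x_t)."""
    return sum(psi(x_next) - psi(x_t) for psi in utilities)

def advantage(utilities, x_t, x_next, alpha, T):
    """A(x_t, x_{t+1}) = alpha(x_t) * T(x_{t+1}) * f(x_t, x_{t+1})."""
    return alpha(x_t) * T(x_next) * gain(utilities, x_t, x_next)

# Hypothetical two-player setting with scalar offers in [0, 1].
utilities = [lambda x: -(x - 0.4) ** 2, lambda x: -(x - 0.6) ** 2]
alpha = lambda x: 1.0   # constant weight (assumption)
T = lambda x: 0.5       # constant time function (assumption)

f_val = gain(utilities, 0.0, 0.5)
A_val = advantage(utilities, 0.0, 0.5, alpha, T)
```

In this example both players gain from moving the offer from 0 to 0.5, so each individual difference is non-negative and the advantage is positive.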

Remark 4

The function $f(x_{t},x_{t+1})$ satisfies the Nash condition

$$\begin{aligned} \begin{array}{c} \psi ^{\iota }(x_{t+1}) - \psi ^{\iota }(x_{t}) \ge 0 \end{array} \end{aligned}$$

for any $x\in X$ and all players.

Definition 7

A strategy $x^{*}\in X$ is said to be a Nash equilibrium if

$$\begin{aligned} x^{*}{\text { } \in }\text {Arg} \max _{x\in X}\text { }\left\{ f(x_{t},x_{t+1}) \right\} \end{aligned}$$

Then, at each step of the bargaining game we have in proximal format that the players must select their strategies according to

$$\begin{aligned} x^{* }= \arg \underset{x\in X}{\max }\left\{ - \delta _{t}T(x)\left\| \left( x-x^{* }\right) \right\| ^{2}+ \alpha _{t}T(x) f(x,x^*) \right\} \end{aligned}$$

(19)

where

$$\begin{aligned} f(x,x^*):=\sum \limits _{\iota =1}^{{\mathfrak {n}}}\left[ \psi ^{\iota }(x)-\psi ^{\iota }(x^*)\right] \end{aligned}$$

At each step of the bargaining process, players simultaneously calculate the Nash equilibrium, taking into account that each player reaches the equilibrium at a different time.
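The proximal step in Eq. (19) can be read as an iteration in which the previous offer plays the role of $x^*$ on the right-hand side. The following sketch illustrates this reading on a one-dimensional grid. The quadratic utilities, the constant parameters `delta` and `alpha`, the constant time function `T` and the grid search are all assumptions made for illustration; the paper's own numerical method is not reproduced here.

```python
import numpy as np

def proximal_bargaining(utilities, x0, delta=1.0, alpha=1.0,
                        T=lambda x: 1.0, grid=None, tol=1e-9, max_iter=500):
    """Iterate x_{k+1} = argmax_x { -delta T(x) (x - x_k)^2 + alpha T(x) f(x, x_k) }."""
    grid = np.linspace(0.0, 1.0, 1001) if grid is None else grid
    total = lambda x: sum(psi(x) for psi in utilities)  # sum of all utilities
    t = np.array([T(g) for g in grid])                  # time function on the grid
    x = x0
    for _ in range(max_iter):
        # f(x, x_k) is the summed utility change relative to the previous offer.
        objective = -delta * t * (grid - x) ** 2 + alpha * t * (total(grid) - total(x))
        x_new = grid[np.argmax(objective)]
        if abs(x_new - x) < tol:   # offers have stopped moving: agreement
            return x_new
        x = x_new
    return x

# Hypothetical utilities with individual optima at 0.4 and 0.6.
utilities = [lambda x: -(x - 0.4) ** 2, lambda x: -(x - 0.6) ** 2]
x_star = proximal_bargaining(utilities, x0=0.0)
```

With these toy utilities the offers move monotonically from 0 toward 0.5, the maximizer of the summed utility, and the quadratic penalty limits how far each new offer may deviate from the previous one.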

1.1.1 Markov Chains

Let us define the Nash equilibrium as a strategy $x^{* }=\left( x^{1* },\ldots ,x^{{\mathfrak {n}}* }\right)$ such that

$$\begin{aligned} {\psi }\left( x^{1* },\ldots ,x^{{\mathfrak {n}}* }\right) \ge {\psi }\left( x^{1* },\ldots ,x^{\iota },\ldots ,x^{{\mathfrak {n}}* }\right) \end{aligned}$$

for any $x^{\iota }\in X$.

Consider that players try to reach the Nash equilibrium of the bargaining problem, that is, to find a joint strategy $x^{* }=\left( x^{1* },\ldots ,x^{{\mathfrak {n}}* }\right) \in X$ satisfying for any admissible $x^{\iota }\in X^{\iota }$ and any $\iota =\overline{1,{\mathfrak {n}}}$

$$\begin{aligned} f( x,{\hat{x}}(x)) := \sum \limits _{\iota =1}^{{\mathfrak {n}}} \left[ \psi ^{\iota }\left( x^{\iota },x^{{\hat{\iota }}}\right) - \psi ^{\iota }\left( {\bar{x}}^{\iota },x^{{\hat{\iota }}}\right) \right] \end{aligned}$$

(20)

where ${\hat{x}}=(x^{{\hat{1}}\top },\ldots ,x^{\mathfrak {{\hat{n}}}\top })^{\top }\in {\hat{X}}\subseteq {\mathbb {R}}^{{\mathfrak {n}}({\mathfrak {n}}-1)}$, ${\bar{x}}^{\iota }$ is the utopia point defined as Eq. (11) and $\psi ^{\iota }\left( x^{\iota },x^{{\hat{\iota }}}\right)$ is the concave cost-function of player $\iota$ which plays the strategy $x^{\iota }\in X^{\iota }$ and the rest of players the strategy $x^{{\hat{\iota }}}\in X^{{\hat{\iota }}}$ defined as Eq. (16) considering the time function.

Remark 5

The function $f( x,{\hat{x}}(x))$ satisfies the Nash condition

$$\begin{aligned} \begin{array}{c} \psi ^{\iota }\left( x^{\iota },x^{{\hat{\iota }}}\right) - \psi ^{\iota }\left( {\bar{x}}^{\iota },x^{{\hat{\iota }}}\right) \le 0 \end{array} \end{aligned}$$

(21)

for any $x^{\iota }\in X^{\iota }$ and all $\iota =\overline{1,{\mathfrak {n}}}$
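The sign condition in Remark 5 can be illustrated on a small two-player game. The sketch below assumes that the utopia point ${\bar{x}}^{\iota }$ is player $\iota$'s best reply to the others' strategies (Eq. (11) is not reproduced in this section), and it uses hypothetical quadratic utilities on a finite grid.

```python
import numpy as np

GRID = np.linspace(0.0, 1.0, 11)  # finite strategy set shared by both players

# Hypothetical concave utilities: each best reply is linear in the rival's strategy.
def psi1(x1, x2):
    return -(x1 - (0.1 + 0.5 * x2)) ** 2

def psi2(x2, x1):
    return -(x2 - (0.4 + 0.5 * x1)) ** 2

def regret_sum(x1, x2):
    """f(x, x_hat): each player's utility minus its best achievable utility."""
    r1 = psi1(x1, x2) - psi1(GRID, x2).max()
    r2 = psi2(x2, x1) - psi2(GRID, x1).max()
    return r1 + r2

f_nash = regret_sum(0.4, 0.6)   # Nash point of this toy game
f_other = regret_sum(0.0, 0.0)  # a non-equilibrium profile
```

Each bracketed term is non-positive by construction, so $f$ is non-positive everywhere and (up to floating-point rounding) vanishes only where no player can improve unilaterally, which is why maximizing $f$ recovers the equilibrium.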

Definition 8

A strategy $x^{*}\in X$ is said to be a Nash equilibrium if

$$\begin{aligned} { x}^{*}{\text { } \in } \text { Arg} \max _{x\in X_{\text {adm}}}\text { }\left\{ f( x,{\hat{x}}(x)) \right\} \end{aligned}$$

Remark 6

If $f( x,{\hat{x}}(x))$ is strictly concave then

$$\begin{aligned} { x}^{*}{\text { } = \text { }}\arg \max _{x\in X_{\text {adm}}}\text { }\left\{ f( x,{\hat{x}}(x)) \right\} \end{aligned}$$

We redefine the utility function, which depends on the average utility function of all players, as follows

$$\begin{aligned} \begin{array}{c} F(x,{\hat{x}}(x)):= f( x,{\hat{x}}(x)) - \frac{1}{2} \sum \limits _{\iota =1}^{{\mathfrak {n}}}\sum \limits _{j=1}^{N}\mu _{(j)}^{\iota }h_{(j)}^{\iota }(x^{\iota })- \\ \\ \frac{1}{2} \sum \limits _{\iota =1}^{{\mathfrak {n}}}\sum \limits _{i=1}^{N}\sum \limits _{j=1}^{N} \sum \limits _{k=1}^{M}\xi _{(j)}^{\iota }q_{(j|i,k)}^{\iota }{x}_{(i,k)}^{\iota } - \frac{1}{2} \sum \limits _{\iota =1}^{{\mathfrak {n}}}\sum \limits _{i=1}^{N}\sum \limits _{k=1}^{M} \eta ^{\iota }\left( {x}_{(i,k)}^{\iota }-1\right) \end{array} \end{aligned}$$

then, we may conclude that

$$\begin{aligned} x^*=\arg \underset{x\in X,{\hat{x}}\in {\hat{X}}}{\max }\quad \underset{\mu \ge 0,\xi \ge 0,\eta \ge 0}{\min }\quad F(x,{\hat{x}}(x),\mu ,\xi ,\eta ) \end{aligned}$$

(22)

Finally, at each step of the bargaining process, players calculate the Nash equilibrium (although they reach the equilibrium at different times) according to the solution of the non-cooperative bargaining problem in proximal format, defined as follows

$$\begin{aligned} \begin{array}{c} \mu ^{* }=\arg \underset{\mu \ge 0}{\min }\left\{ - \delta \Vert \mu -\mu ^{* }\Vert ^{2}+\alpha F\left( x^*,{\hat{x}}^*(x),\mu ,\xi ^*,\eta ^*\right) \right\} \\ \xi ^{* }=\arg \underset{\xi \ge 0}{\min }\left\{ - \delta \Vert \xi -\xi ^{* }\Vert ^{2}+\alpha F\left( x^*,{\hat{x}}^*(x),\mu ^*,\xi ,\eta ^*\right) \right\} \\ \eta ^{* }=\arg \underset{\eta \ge 0}{\min }\left\{ - \delta \Vert \eta -\eta ^{* }\Vert ^{2}+\alpha F\left( x^*,{\hat{x}}^*(x),\mu ^*,\xi ^*,\eta \right) \right\} \\ x^{* }=\arg \underset{x\in X}{\max }\left\{ - \delta \left\| \left( x-x^{* }\right) \right\| _{\varLambda }^{2}+\alpha F\left( x,{\hat{x}}^*(x),\mu ^*,\xi ^*,\eta ^*\right) \right\} \\ {\hat{x}}^{* }=\arg \underset{{\hat{x}}\in {\hat{X}}}{\max }\left\{ - \delta \left\| \left( {\hat{x}}-{\hat{x}}^{* }\right) \right\| _{\varLambda }^{2}+\alpha F\left( x^*,{\hat{x}}(x),\mu ^*,\xi ^*,\eta ^*\right) \right\} \end{array} \end{aligned}$$

(23)
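The block structure of relation (23), in which the primal variables take a proximal maximization step while the multipliers take a proximal minimization step, can be illustrated on a toy constrained problem. The sketch below is an assumption-laden simplification: it uses a single scalar variable, a single inequality constraint $h(x)=x-0.5\le 0$, the objective $-(x-1)^2$, and the common convention of adding the quadratic penalty in the minimization step. It is not the authors' implementation.

```python
def proximal_saddle(delta=1.0, alpha=0.2, iters=1000):
    """Proximal max-min iteration for L(x, mu) = -(x - 1)^2 - mu * (x - 0.5), mu >= 0."""
    x, mu = 0.0, 0.0
    for _ in range(iters):
        # x-step: closed-form argmax of -delta (z - x)^2 + alpha * L(z, mu).
        x_new = (2 * delta * x + alpha * (2.0 - mu)) / (2 * delta + 2 * alpha)
        # mu-step: closed-form argmin over mu >= 0 of delta (m - mu)^2 + alpha * L(x, m).
        mu_new = max(0.0, mu + alpha * (x - 0.5) / (2 * delta))
        x, mu = x_new, mu_new
    return x, mu

x_opt, mu_opt = proximal_saddle()
```

For this toy problem the iteration settles at the constrained maximizer $x=0.5$ with multiplier $\mu =1$, the point at which the gradient of the objective is balanced by the active constraint.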

1.1.2 Transfer Pricing Simulation

Following the section above, in this model each player calculates its strategies according to the Nash equilibrium formulation: players calculate the Nash equilibrium simultaneously but reach the equilibrium at different times, following relation (23) until they reach an agreement (the strategies converge). Figures 11, 12 and 13 show the behavior of the offers (strategies) during the bargaining process.

Finally, the agreement reached is as follows:

$$\begin{aligned} \begin{array}{ccccc} c^{1 }= \begin{bmatrix} 0.2127 &{} 0.0074 \\ 0.1429 &{} 0.0050 \\ 0.6106 &{} 0.0214 \end{bmatrix} &{} \quad &{} c^{2 }= \begin{bmatrix} 0.0050 &{} 0.2366 \\ 0.0087 &{} 0.4117 \\ 0.0070 &{} 0.3310 \end{bmatrix} &{} \quad &{} c^{3 }= \begin{bmatrix} 0.2237 &{} 0.0071 \\ 0.5877 &{} 0.0186 \\ 0.1579 &{} 0.0050 \end{bmatrix} \end{array} \end{aligned}$$

Following (6) the mixed strategies obtained for players are as follows

$$\begin{aligned} \begin{array}{ccccc} d^{1 }= \begin{bmatrix} 0.9662 &{} 0.0338 \\ 0.9662 &{} 0.0338 \\ 0.9662 &{} 0.0338 \end{bmatrix} &{} \quad &{} d^{2 }= \begin{bmatrix} 0.0207 &{} 0.9793 \\ 0.0207 &{} 0.9793 \\ 0.0207 &{} 0.9793 \end{bmatrix} &{} \quad &{} d^{3 }= \begin{bmatrix} 0.9693 &{} 0.0307 \\ 0.9693 &{} 0.0307 \\ 0.9693 &{} 0.0307 \end{bmatrix} \end{array} \end{aligned}$$

With the strategies calculated at each step of the negotiation process, the utilities of each player showed a decreasing behavior, as shown in Fig. 14; i.e., at each step of the bargaining process, the utility of each player decreases until they reach an agreement. At the end of the bargaining process, the resulting utilities are $\psi ^{1}(c^1,c^2,c^3)=986.8936$, $\psi ^{2}(c^1,c^2,c^3)=651.4633$ and $\psi ^{3}(c^1,c^2,c^3)=949.6980$ for the three players, respectively.
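The relation between the agreement matrices $c^{\iota }$ and the mixed strategies $d^{\iota }$ can be checked numerically. The sketch below assumes that Eq. (6), which is not reproduced in this section, is the usual row normalization $d(k|i)=c(i,k)/\sum _{k}c(i,k)$; the helper name `mixed_strategy` is illustrative.

```python
import numpy as np

def mixed_strategy(c):
    """Row-normalize a joint state-action matrix c into a policy d(k|i)."""
    c = np.asarray(c, dtype=float)
    return c / c.sum(axis=1, keepdims=True)

# Agreement matrices as reported above (rows: states, columns: actions).
c1 = np.array([[0.2127, 0.0074], [0.1429, 0.0050], [0.6106, 0.0214]])
c2 = np.array([[0.0050, 0.2366], [0.0087, 0.4117], [0.0070, 0.3310]])
c3 = np.array([[0.2237, 0.0071], [0.5877, 0.0186], [0.1579, 0.0050]])

d1, d2, d3 = (mixed_strategy(c) for c in (c1, c2, c3))
totals = [c.sum() for c in (c1, c2, c3)]  # each joint distribution should sum to one
```

Under this assumption the row-normalized matrices agree with the reported $d^{\iota }$ to about three decimal places, which is the precision allowed by the four-digit entries of $c^{\iota }$.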

1.2 Bargaining with Collusive Behavior

In this approach we analyze a bargaining situation where players form groups and each group alternately makes an offer to the others until they reach an equilibrium point (agreement). We describe a bargaining model with two teams of players as follows. Let us consider a bargaining game with ${\mathfrak {n}}+{\mathfrak {m}}$ players. Let ${\mathcal {N}}=\{1,\ldots ,{\mathfrak {n}}\}$ denote the set of players called team A, and let us define the behavior of all players $\iota =\overline{1,{\mathfrak {n}}}$ as $x_t=(x_t^1,\ldots ,x_t^{\mathfrak {n}}) \in X$, where $X$ is a convex and compact set. In the same way, the remaining players ${\mathcal {M}}=\{1,\ldots ,{\mathfrak {m}}\}$ form team B, and the strategy profile of all players $m=\overline{1,{\mathfrak {m}}}$ is defined by $y_t=(y_t^1,\ldots ,y_t^{\mathfrak {m}}) \in Y$, where $Y$ is a convex and compact set. Then, $X \times Y$ is the set of full strategy profiles. In this model the function $\psi (x,y)$ represents the utility function of team A, which determines the decision to accept or reject the offer; similarly, team B makes its decision according to its utility function $\varphi (x,y)$.

Following the description of the model presented above, we redefine the advantage of proposing a new offer considering the utility function for team A as follows

$$\begin{aligned} f(x_{t},y_{t},x_{t+1},y_{t+1}):=\sum \limits _{\iota =1}^{{\mathfrak {n}}}\left[ \psi ^{\iota }(x_{t+1},y_{t})-\psi ^{\iota }(x_{t},y_{t})\right] \ge 0 \end{aligned}$$

and, similarly the utility function for team B is as follows

$$\begin{aligned} g(x_{t},y_{t},x_{t+1},y_{t+1}):=\sum \limits _{m=1}^{{\mathfrak {m}}}\left[ \varphi ^{m}(x_{t},y_{t+1})-\varphi ^{m}(x_{t},y_{t})\right] \ge 0 \end{aligned}$$

Thus, the advantages for team A to reject the offer $x_{t}$ and to propose a new offer $x_{t+1}$ are given by $A(x_{t},y_{t},x_{t+1},y_{t+1})=\alpha (x_{t})T(x_{t+1})f(x_{t},y_{t},x_{t+1},y_{t+1})$; in the same way, the advantages for team B to reject the offer $y_{t}$ and to propose a new offer $y_{t+1}$ are given by $A(x_{t},y_{t},x_{t+1},y_{t+1})=\alpha (y_{t})T(y_{t+1})g(x_{t},y_{t},x_{t+1},y_{t+1})$.

Remark 7

The function $f(x_{t},y_{t},x_{t+1},y_{t+1})$ satisfies the Nash condition

$$\begin{aligned} \begin{array}{c} \psi ^{\iota }(x_{t+1},y_{t})-\psi ^{\iota }(x_{t},y_{t}) \ge 0 \end{array} \end{aligned}$$

for any $x\in X$, $y\in Y$ and $\iota =\overline{1,{\mathfrak {n}}}$ players.

Remark 8

The function $g(x_{t},y_{t},x_{t+1},y_{t+1})$ satisfies the Nash condition

$$\begin{aligned} \begin{array}{c} \varphi ^{m}(x_{t},y_{t+1})-\varphi ^{m}(x_{t},y_{t}) \ge 0 \end{array} \end{aligned}$$

for any $x\in X$, $y\in Y$ and $m=\overline{1,{\mathfrak {m}}}$ players.

The dynamics of the bargaining game are as follows: at each step of the negotiation process, team A chooses a strategy $x \in X$ considering the utility function $f(x_{t},y_{t},x_{t+1},y_{t+1})$; then team B must decide whether to accept the offer or to reject it by calculating a new offer (strategies) $y \in Y$ considering the utility function of the group $g(x_{t},y_{t},x_{t+1},y_{t+1})$. Following the description of model 1, the teams solve the problem in proximal format as follows:

$$\begin{aligned} \begin{array}{c} x^{* }= \arg \underset{x\in X}{\max }\left\{ - \delta _{t}T(x)\left\| \left( x-x^{* }\right) \right\| ^{2}+ \alpha _{t}T(x) f(x,y,x^*,y^*) \right\} \\ \\ y^{* }= \arg \underset{y\in Y}{\max }\left\{ - \delta _{t}T(y)\left\| \left( y-y^{* }\right) \right\| ^{2}+ \alpha _{t}T(y) g(x,y,x^*,y^*) \right\} \end{array} \end{aligned}$$

(24)

where

$$\begin{aligned} \begin{array}{c} f(x,y,x^*,y^*):=\sum \limits _{\iota =1}^{{\mathfrak {n}}}\left[ \psi ^{\iota }(x,y^*)-\psi ^{\iota }(x^*,y^*)\right] \\ \\ g(x,y,x^*,y^*):=\sum \limits _{m=1}^{{\mathfrak {m}}}\left[ \varphi ^m(x^*,y)-\varphi ^m(x^*,y^*)\right] \end{array} \end{aligned}$$

At each step, teams make a new offer according to Eq. (24). Both teams solve the bargaining problem together, but they reach the equilibrium at different times. The bargaining game continues until the offers (strategies) of all players converge.
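The alternating dynamics of Eq. (24) can be sketched for two teams that each control a single scalar strategy. Everything specific in the example is an assumption made for illustration: the quadratic team utilities `psi` and `phi`, the constant parameters with $T\equiv 1$, the sequential order in which team A moves before team B, and the closed-form solution of each proximal step.

```python
def best_reply_a(y):
    return 0.2 + 0.5 * y   # maximizer of psi(., y) (hypothetical)

def best_reply_b(x):
    return 0.1 + 0.5 * x   # maximizer of phi(x, .) (hypothetical)

def psi(x, y):
    return -(x - best_reply_a(y)) ** 2   # team A utility

def phi(x, y):
    return -(y - best_reply_b(x)) ** 2   # team B utility

def alternate(x=0.0, y=0.0, delta=1.0, alpha=1.0, steps=200):
    """Teams alternately take a proximal step; returns final offers and the smallest gain."""
    min_gain = float("inf")
    for _ in range(steps):
        # Team A: argmax_z { -delta (z - x)^2 + alpha [psi(z, y) - psi(x, y)] }.
        x_new = (delta * x + alpha * best_reply_a(y)) / (delta + alpha)
        min_gain = min(min_gain, psi(x_new, y) - psi(x, y))
        x = x_new
        # Team B: argmax_w { -delta (w - y)^2 + alpha [phi(x, w) - phi(x, y)] }.
        y_new = (delta * y + alpha * best_reply_b(x)) / (delta + alpha)
        min_gain = min(min_gain, phi(x, y_new) - phi(x, y))
        y = y_new
    return x, y, min_gain

x_eq, y_eq, min_gain = alternate()
```

In this toy game the offers converge to the unique Nash point $(1/3, 4/15)$, and every step yields a non-negative gain for the proposing team, in line with the conditions stated in Remarks 7 and 8.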

1.2.1 Markov Chains

For this model, in the same way that we define the strategies $x \in X$, let us consider a set of strategies denoted by $y^{m}\in Y^{m}$ $\left( m=\overline{1,{\mathfrak {m}}}\right)$ where $Y:=\bigotimes \limits _{m=1}^{{\mathfrak {m}}}Y^{m}$ is a convex and compact set,

$$\begin{aligned} y^{m}:=\text {col }(c^{m}),\quad Y^{m}:=C_{\text {adm}}^{m} \end{aligned}$$

where col is the column operator.

Denote by $y=(y^{1},\ldots ,y^{{\mathfrak {m}}})^{\top }\in Y$, the joint strategy of the players and $y^{{\hat{m}}}$ is a strategy of the rest of the players adjoint to $y^{m}$, namely,

$$\begin{aligned} y^{{\hat{m}}}:=\left( y^{1},\ldots ,y^{m-1},y^{m+1},\ldots ,y^{{\mathfrak {m}}}\right) ^{\top }\in Y^{{\hat{m}}}:=\bigotimes \limits _{h=1,\text { }h\ne m}^{{\mathfrak {m}}}Y^{h} \end{aligned}$$

such that $y=(y^{m},y^{{\hat{m}}})$, $m=\overline{1,{\mathfrak {m}}}$.
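The decomposition $y=(y^{m},y^{{\hat{m}}})$ simply separates one player's strategy from those of the remaining team members. A minimal sketch, using a hypothetical helper `split_profile` and 0-based indices, is:

```python
def split_profile(y, m):
    """Return (y^m, y^{hat m}) for a joint profile y and a 0-based player index m."""
    own = y[m]
    rest = tuple(y[:m]) + tuple(y[m + 1:])   # strategies of all other players, in order
    return own, rest

# Hypothetical team of three players, each with a two-component strategy.
profile = ((0.1, 0.9), (0.4, 0.6), (0.7, 0.3))
own, rest = split_profile(profile, 1)
```

Here `own` is the strategy of the second player and `rest` collects the strategies of the first and third players.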

Consider that players of team A try to reach the Nash equilibrium of the bargaining problem, that is, to find a joint strategy $x^{* }=\left( x^{1* },\ldots ,x^{{\mathfrak {n}}* }\right) \in X$ satisfying for any admissible $x^{\iota }\in X^{\iota }$ and any $\iota =\overline{1,{\mathfrak {n}}}$

$$\begin{aligned} f( x,{\hat{x}}(x)|y) := \sum \limits _{\iota =1}^{{\mathfrak {n}}} \left[ \psi ^{\iota }\left( x^{\iota },x^{{\hat{\iota }}}|y\right) - \psi ^{\iota }\left( {\bar{x}}^{\iota },x^{{\hat{\iota }}}|y\right) \right] \end{aligned}$$

(25)

where ${\hat{x}}=(x^{{\hat{1}}\top },\ldots ,x^{\mathfrak {{\hat{n}}}\top })^{\top }\in {\hat{X}}\subseteq {\mathbb {R}}^{{\mathfrak {n}}({\mathfrak {n}}-1)}$, ${\bar{x}}^{\iota }$ is the utopia point defined as Eq. (11) and $\psi ^{\iota }\left( x^{\iota },x^{{\hat{\iota }}}|y\right)$ is the concave cost-function of player $\iota$ which plays the strategy $x^{\iota }\in X^{\iota }$ and the rest of players the strategy $x^{{\hat{\iota }}}\in X^{{\hat{\iota }}}$ fixing the strategies $y \in Y$ of team B, and it is defined as Eq. (16) considering the time function.

Similarly, consider that players of team B also try to reach the Nash equilibrium of the bargaining problem, that is, to find a joint strategy $y^{* }=\left( y^{1* },\ldots ,y^{{\mathfrak {m}}* }\right) \in Y$ satisfying for any admissible $y^{m}\in Y^{m}$ and any $m=\overline{1,{\mathfrak {m}}}$

$$\begin{aligned} g( y,{\hat{y}}(y)|x) := \sum \limits _{m=1}^{{\mathfrak {m}}} \left[ \psi ^{m}\left( y^{m},y^{{\hat{m}}}|x\right) - \psi ^{m}\left( {\bar{y}}^{m},y^{{\hat{m}}}|x\right) \right] \end{aligned}$$

(26)

where ${\hat{y}}=(y^{{\hat{1}}\top },\ldots ,y^{\mathfrak {{\hat{m}}}\top })^{\top }\in {\hat{Y}}\subseteq {\mathbb {R}}^{{\mathfrak {m}}({\mathfrak {m}}-1)}$, ${\bar{y}}^{m}$ is the utopia point defined as Eq. (11) and $\psi ^{m}\left( y^{m},y^{{\hat{m}}}|x\right)$ is the concave cost-function of player m which plays the strategy $y^{m}\in Y^{m}$ and the rest of players the strategy $y^{{\hat{m}}}\in Y^{{\hat{m}}}$ fixing the strategies $x \in X$ of team A, and it is defined as Eq. (16) considering the time function.

Then, we have that a strategy $x^* \in X$ of team A together with the collection $y^* \in Y$ of team B are defined as the equilibrium of a strictly concave bargaining problem if

$$\begin{aligned} ({x}^{*},{y}^{*}){\text { } = \text { }}\arg \max _{x\in X_{\text {adm}},y\in Y_{\text {adm}}}\text { }\left\{ f(x,{\hat{x}}(x)|y) \le 0 , g(y,{\hat{y}}(y)|x) \le 0 \right\} \end{aligned}$$

We redefine the utility function, which depends on the average utility function of all players, as follows

$$\begin{aligned} \begin{array}{c} F(x,{\hat{x}}(x),y,{\hat{y}}(y)):= f(x,{\hat{x}}(x)|y) + g(y,{\hat{y}}(y)|x) - \frac{1}{2} \sum \limits _{\iota =1}^{{\mathfrak {n}}}\sum \limits _{j=1}^{N}\mu _{(j)}^{\iota }h_{(j)}^{\iota }(x^{\iota })- \\ \\ \frac{1}{2} \sum \limits _{m=1}^{{\mathfrak {m}}}\sum \limits _{j=1}^{N}\mu _{(j)}^{m}h_{(j)}^{m}(y^{m})- \frac{1}{2} \sum \limits _{\iota =1}^{{\mathfrak {n}}}\sum \limits _{i=1}^{N}\sum \limits _{j=1}^{N} \sum \limits _{k=1}^{M}\xi _{(j)}^{\iota }q_{(j|i,k)}^{\iota }{x}_{(i,k)}^{\iota } - \frac{1}{2} \sum \limits _{m=1}^{{\mathfrak {m}}}\sum \limits _{i=1}^{N}\sum \limits _{j=1}^{N} \sum \limits _{k=1}^{M}\xi _{(j)}^{m}q_{(j|i,k)}^{m}{y}_{(i,k)}^{m} - \\ \\ \frac{1}{2} \sum \limits _{\iota =1}^{{\mathfrak {n}}}\sum \limits _{i=1}^{N}\sum \limits _{k=1}^{M} \eta ^{\iota }\left( {x}_{(i,k)}^{\iota }-1\right) - \frac{1}{2} \sum \limits _{m=1}^{{\mathfrak {m}}}\sum \limits _{i=1}^{N}\sum \limits _{k=1}^{M} \eta ^{m}\left( {y}_{(i,k)}^{m}-1\right) \end{array} \end{aligned}$$

then, we may conclude that

$$\begin{aligned} (x^*,y^*)=\arg \underset{x\in X,{\hat{x}}\in {\hat{X}},y\in Y,{\hat{y}}\in {\hat{Y}}}{\max }\quad \underset{\mu \ge 0,\xi \ge 0,\eta \ge 0}{\min }\quad F(x,{\hat{x}}(x),y,{\hat{y}}(y),\mu ,\xi ,\eta ) \end{aligned}$$

(27)

Finally we have that at each step of the bargaining process, players calculate their equilibrium according to the solution of the non-cooperative bargaining problem in proximal format defined as follows

$$\begin{aligned} \begin{array}{c} \mu ^{* }=\arg \underset{\mu \ge 0}{\min }\left\{ - \delta \Vert \mu -\mu ^{* }\Vert ^{2}+\alpha F\left( x^*,{\hat{x}}^*(x),y^*,{\hat{y}}^*(y),\mu ,\xi ^*,\eta ^*\right) \right\} \\ \xi ^{* }=\arg \underset{\xi \ge 0}{\min }\left\{ - \delta \Vert \xi -\xi ^{* }\Vert ^{2}+\alpha F\left( x^*,{\hat{x}}^*(x),y^*,{\hat{y}}^*(y),\mu ^*,\xi ,\eta ^*\right) \right\} \\ \eta ^{* }=\arg \underset{\eta \ge 0}{\min }\left\{ - \delta \Vert \eta -\eta ^{* }\Vert ^{2}+\alpha F\left( x^*,{\hat{x}}^*(x),y^*,{\hat{y}}^*(y),\mu ^*,\xi ^*,\eta \right) \right\} \\ x^{* }=\arg \underset{x\in X}{\max }\left\{ - \delta \left\| \left( x-x^{* }\right) \right\| _{\varLambda }^{2}+\alpha F\left( x,{\hat{x}}^*(x),y^*,{\hat{y}}^*(y),\mu ^*,\xi ^*,\eta ^*\right) \right\} \\ {\hat{x}}^{* }=\arg \underset{{\hat{x}}\in {\hat{X}}}{\max }\left\{ - \delta \left\| \left( {\hat{x}}-{\hat{x}}^{* }\right) \right\| _{\varLambda }^{2}+\alpha F\left( x^*,{\hat{x}}(x),y^*,{\hat{y}}^*(y),\mu ^*,\xi ^*,\eta ^*\right) \right\} \\ y^{* }=\arg \underset{y\in Y}{\max }\left\{ - \delta \left\| \left( y-y^{* }\right) \right\| _{\varLambda }^{2}+\alpha F\left( x^*,{\hat{x}}^*(x),y,{\hat{y}}^*(y),\mu ^*,\xi ^*,\eta ^*\right) \right\} \\ {\hat{y}}^{* }=\arg \underset{{\hat{y}}\in {\hat{Y}}}{\max }\left\{ - \delta \left\| \left( {\hat{y}}-{\hat{y}}^{* }\right) \right\| _{\varLambda }^{2}+\alpha F\left( x^*,{\hat{x}}^*(x),y^*,{\hat{y}}(y),\mu ^*,\xi ^*,\eta ^*\right) \right\} \end{array} \end{aligned}$$

(28)
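To make the fixed-point structure of (28) concrete, the following minimal sketch iterates a proximal scheme on a toy saddle-point problem. It is not the bargaining game itself. The toy Lagrangian $F(x,\mu )=-(x-2)^2-\mu (x-1)$, the values $\alpha =0.5$ and $\delta =1$, and the function name `proximal_saddle` are all assumptions made for illustration. The multiplier step also uses a positive quadratic penalty so that its minimization is strongly convex, which is a choice of this sketch.

```python
def proximal_saddle(alpha=0.5, delta=1.0, steps=500):
    """Proximal iteration for  max_x min_{mu >= 0} F(x, mu),
    with the toy Lagrangian F(x, mu) = -(x - 2)**2 - mu * (x - 1),
    i.e. maximize -(x - 2)**2 subject to x <= 1 (hypothetical example).
    """
    x, mu = 0.0, 0.0
    for _ in range(steps):
        # x-step: argmax_x { -delta*(x - x_n)**2 + alpha*F(x, mu_n) },
        # solved in closed form from the first-order condition.
        x_next = (2 * delta * x + alpha * (4 - mu)) / (2 * delta + 2 * alpha)
        # mu-step: argmin_{mu >= 0} { delta*(mu - mu_n)**2 + alpha*F(x_n, mu) },
        # closed form followed by projection onto mu >= 0.
        mu_next = max(0.0, mu + alpha * (x - 1) / (2 * delta))
        x, mu = x_next, mu_next
    return x, mu


x_star, mu_star = proximal_saddle()
print(x_star, mu_star)
```

At a fixed point of this iteration the pair $(x,\mu )$ reproduces itself under both proximal subproblems, which is the property that (28) states for the full set of variables. For this toy problem the fixed point is the constrained optimum $x=1$ with multiplier $\mu =2$.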

1.2.2 Transfer Pricing Simulation

In this example, team 1 consists only of player 1, while team 2 consists of players 2 and 3. Although the players calculate their strategies together following relation (28), we consider that they reach the equilibrium at different times. Figures 15, 16 and 17 show the behavior of the offers (strategies) during the bargaining process.

Finally, the agreement reached is as follows:

$$\begin{aligned} \begin{array}{ccccc} c^{1 }= \begin{bmatrix} 0.2127 &{} 0.0074 \\ 0.1429 &{} 0.0050 \\ 0.6106 &{} 0.0214 \end{bmatrix} &{} \quad &{} c^{2 }= \begin{bmatrix} 0.0050 &{} 0.2366 \\ 0.0087 &{} 0.4117 \\ 0.0070 &{} 0.3310 \end{bmatrix} &{} \quad &{} c^{3 }= \begin{bmatrix} 0.2237 &{} 0.0071 \\ 0.5877 &{} 0.0186 \\ 0.1579 &{} 0.0050 \end{bmatrix} \end{array} \end{aligned}$$

Following (6), the mixed strategies obtained for the players are as follows:

$$\begin{aligned} \begin{array}{ccccc} d^{1 }= \begin{bmatrix} 0.9662 &{} 0.0338 \\ 0.9662 &{} 0.0338 \\ 0.9662 &{} 0.0338 \end{bmatrix} &{} \quad &{} d^{2 }= \begin{bmatrix} 0.0207 &{} 0.9793 \\ 0.0207 &{} 0.9793 \\ 0.0207 &{} 0.9793 \end{bmatrix} &{} \quad &{} d^{3 }= \begin{bmatrix} 0.9693 &{} 0.0307 \\ 0.9693 &{} 0.0307 \\ 0.9693 &{} 0.0307 \end{bmatrix} \end{array} \end{aligned}$$
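The step from the joint strategies $c^{\iota }$ to the mixed strategies $d^{\iota }$ can be checked numerically. Relation (6) is not reproduced in this section, so the sketch below assumes it is the row normalization $d_{(k|i)}=c_{(i,k)}/\sum _{k}c_{(i,k)}$. The names `C`, `D` and `mixed_strategy` are hypothetical.

```python
# Joint strategies c^l from the reported agreement
# (rows: states i, columns: actions k).
C = {
    1: [[0.2127, 0.0074], [0.1429, 0.0050], [0.6106, 0.0214]],
    2: [[0.0050, 0.2366], [0.0087, 0.4117], [0.0070, 0.3310]],
    3: [[0.2237, 0.0071], [0.5877, 0.0186], [0.1579, 0.0050]],
}


def mixed_strategy(c):
    """Row-normalize c: d[i][k] = c[i][k] / sum_k c[i][k].

    This is the assumed form of relation (6).
    """
    return [[entry / sum(row) for entry in row] for row in c]


D = {player: mixed_strategy(c) for player, c in C.items()}

for player, d in D.items():
    total = sum(sum(row) for row in C[player])
    print(f"player {player}: sum of c = {total:.4f}")
    for row in d:
        print("   ", [round(p, 4) for p in row])
```

Under this assumption, the normalized rows agree with the reported $d^{\iota }$ to about three decimal places. The small differences are consistent with the four-decimal rounding of the printed $c^{\iota }$. The entries of each $c^{\iota }$ also sum to one.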

With the strategies calculated at each step of the negotiation process, the utility of each player decreases at every step of the bargaining process until an agreement is reached, as shown in Fig. 18. At the end of the bargaining process, the resulting utilities are $\psi ^{1}(c^1,c^2,c^3)=986.8936$, $\psi ^{2}(c^1,c^2,c^3)=651.4631$ and $\psi ^{3}(c^1,c^2,c^3)=949.6978$ for players 1, 2 and 3, respectively.

Figure 19 shows the behavior of the utilities under each of the applied models (model 1 is the general bargaining model, model 2 corresponds to bargaining under different discounting, and model 3 to bargaining with collusive behavior). The utilities begin at the same point, the strong Nash equilibrium, and then decrease until the strategies converge. From the results obtained, we observe that model 1 favors the utilities of players 2 and 3, while models 2 and 3 are better for player 1. We also observe that, even though models 2 and 3 reach the same agreement (equilibrium point), the strategies and, as a consequence, the utilities behave differently during the bargaining process.

Cite this article

Trejo, K.K., Juarez, R., Clempner, J.B. et al. Non-Cooperative Bargaining with Unsophisticated Agents. Comput Econ 61, 937–974 (2023). https://doi.org/10.1007/s10614-020-10003-7

Accepted: 21 May 2020
Published: 02 July 2020
Issue Date: March 2023
DOI: https://doi.org/10.1007/s10614-020-10003-7