1 Introduction

Nash equilibrium seeking in noncooperative games has attracted much attention due to its broad applications in multi-robot systems, smart grids, and sensor networks [1–3]. In such problems, each decision-maker/player has an individual payoff function depending upon all players’ decisions and aims at reaching an equilibrium from which no player has an incentive to deviate. The information that one player has about the others and the information sharing structure among the players play a crucial role in resolving these problems. In a classical full-information setting, each player has access to information including its own objective function and the decisions taken by the other players in the game [4–6]. As the decisions of all other agents may not be directly available due to privacy concerns or communication cost, distributed designs relying only on each player’s local information are of particular interest, and sustained efforts have been made to generalize the classical algorithms to this case via networked information sharing.

In the multi-agent coordination literature, the information structure (or the information sharing topology) among agents is often described by graphs [7]. Following this terminology, the Nash equilibrium seeking problem in the classical full-information setting involves a complete graph where any two players can directly communicate with each other [4, 5, 8–10]. A similar scenario is the case when this full-decision information is obtained via broadcasts from a global coordinator [11]. By contrast, distributed rules via local communication and computation do not require this impractical assumption on the information structure.

To overcome the difficulty brought by the lack of full information, a typical approach is to leverage a consensus-based mechanism to share information via network diffusion [12–15]. To be specific, each player maintains a local estimate vector of all players’ decisions and updates this vector by an auxiliary consensus process with its neighbors. After that, the player can implement a best-response or gradient-play rule with its estimate of the joint decision. For example, the authors of [16] developed an asynchronous gossip-based algorithm for finding a Nash equilibrium, in which the two awake players set their estimates to their average and then take a gradient step. Similar results have been delivered for general connected graphs by extending classical gradient-play dynamics [17, 18]. Along this line, considerable progress has been made with different kinds of discrete-time or continuous-time Nash equilibrium seeking algorithms, with or without coupled decision constraints, even for nontrivial dynamic players [19–26]. However, all these results, except a few for special aggregative games, heavily rely on the assumption that the underlying communication graph is undirected, which definitely narrows down the applications of these Nash equilibrium seeking algorithms.

Based on the aforementioned observations, this paper is devoted to the solvability of the Nash equilibrium seeking problem for general noncooperative games over directed graphs. Moreover, we aim to obtain an exponential convergence rate. Note that the symmetry of the information sharing structure plays a crucial role in both the analysis and synthesis of existing Nash equilibrium seeking algorithms. Over directed graphs, however, the information structure loses this symmetry, which certainly makes the considered problem more challenging.

To solve this problem, we start from the recent work [17], in which the authors presented an augmented gradient-play dynamics and showed that it converges to consensus on the Nash equilibrium exponentially fast over undirected and connected graphs. We first develop a modified version of the gradient-play algorithm for weight-balanced digraphs by adding a proportional gain, and then extend it to arbitrary strongly connected digraphs by further embedding a distributed estimator of the left eigenvector associated with the zero eigenvalue of the graph Laplacian. Under assumptions on the cost functions similar to those in [17], we show that the two developed algorithms indeed recover the exponential convergence rate in both cases. Moreover, by adding such a freely chosen proportional gain parameter, we provide an alternative way, other than the singular perturbation analysis in [17], to remove the extra graph coupling condition. To the best of our knowledge, this is the first exponentially convergent continuous-time result for the Nash equilibrium seeking problem over general directed graphs.

The remainder of this paper is organized as follows: Some preliminaries are presented in Sect. 2. The problem formulation is given in Sect. 3. Then, the main designs are detailed in Sect. 4. Following that, an example is given to illustrate the effectiveness of our algorithms in Sect. 5. Finally, concluding remarks are given in Sect. 6.

2 Preliminaries

In this section, we present some preliminaries of convex analysis [27] and graph theory [7] for the following analysis.

2.1 Convex analysis

Let \(\mathbb{R}^{n}\) be the n-dimensional Euclidean space and \(\mathbb{R}^{n\times m}\) be the set of all \(n\times m\) matrices. \({\mathbf{1}}_{n}\) (or \({\mathbf{0}}_{n}\)) represents an n-dimensional all-one (or all-zero) column vector and \({\boldsymbol{1}}_{n\times m}\) (or \({\boldsymbol{0}}_{n\times m}\)) all-one (or all-zero) matrix. We may omit the subscript when it is self-evident. \(\operatorname{diag}(b_{1}, {\dots }, b_{n})\) represents an \(n\times n\) diagonal matrix with diagonal elements \(b_{i}\) with \(i=1, {\dots }, n\). \(\operatorname{col}(a_{1}, {\dots }, a_{n}) = [a_{1}^{\top }, {\dots }, a_{n}^{\top }]^{\top }\) for column vectors \(a_{i}\) with \(i=1, {\dots }, n\). For a vector x and a matrix A, \(\|x\|\) denotes the Euclidean norm and \(\|A\|\) the spectral norm.

A function \(f\colon \mathbb{R}^{m} \rightarrow \mathbb{R}\) is said to be convex if, for any \(0\leq a \leq 1\) and \(\zeta _{1},\zeta _{2} \in \mathbb{R}^{m}\), \(f(a\zeta _{1}+(1-a)\zeta _{2})\leq af(\zeta _{1})+(1-a)f(\zeta _{2})\). It is said to be strictly convex if this inequality is strict whenever \(\zeta _{1} \neq \zeta _{2}\). A vector-valued function \(\Phi \colon \mathbb{R}^{m} \rightarrow \mathbb{R}^{m}\) is said to be ω-strongly monotone, if for any \(\zeta _{1}, \zeta _{2} \in \mathbb{R}^{m}\), \((\zeta _{1}-\zeta _{2})^{\top }[\Phi (\zeta _{1})-\Phi (\zeta _{2})] \geq \omega \|\zeta _{1}-\zeta _{2}\|^{2}\). Function \(\Phi \colon \mathbb{R}^{m} \rightarrow \mathbb{R}^{m}\) is said to be ϑ-Lipschitz, if for any \(\zeta _{1}, \zeta _{2} \in \mathbb{R}^{m}\), \(\|\Phi (\zeta _{1})-\Phi (\zeta _{2})\|\leq \vartheta \|\zeta _{1}- \zeta _{2}\|\). In particular, the gradient of an ω-strongly convex function is ω-strongly monotone.

2.2 Graph theory

A weighted directed graph (digraph) is described by \(\mathcal{G}=(\mathcal{N}, \mathcal{E}, \mathcal{A})\) with the node set \(\mathcal{N}=\{1, {\dots }, N\}\) and the edge set \(\mathcal{E}\). \((i, j)\in \mathcal{E}\) denotes an edge from node i to node j. The weighted adjacency matrix \(\mathcal{A}=[a_{ij}]\in \mathbb{R}^{N\times N}\) is defined by \(a_{ii}=0\) and \(a_{ij}\geq 0\). Here \(a_{ij}>0\) iff there is an edge \((j, i)\) in the digraph. The neighbor set of node i is defined as \(\mathcal{N}_{i}=\{j\mid (j, i)\in \mathcal{E} \}\). A directed path is an alternating sequence \(i_{1}e_{1}i_{2}e_{2}{\dots }e_{k-1}i_{k}\) of nodes \(i_{l}\) for \(l=1, \dots , k\) and edges \(e_{m}=(i_{m},i_{m+1}) \in \mathcal{E}\) for \(m=1, \dots , k-1\). If there is a directed path between any two nodes, then the digraph is said to be strongly connected. The in-degree and out-degree of node i are defined by \(d^{\text{in}}_{i}=\sum_{j=1}^{N} a_{ij}\) and \(d^{\text{out}}_{i}=\sum_{j=1}^{N} a_{ji}\). A digraph is weight-balanced if \(d^{\text{in}}_{i}=d^{\text{out}}_{i}\) holds for any \(i=1, \dots , N\). The Laplacian matrix of \(\mathcal{G}\) is defined as \(L\triangleq D^{\text{in}}-\mathcal{A}\) with \(D^{\text{in}}=\operatorname{diag}(d^{\text{in}}_{1}, \dots , d^{\text{in}}_{N})\). Note that \(L{\boldsymbol{1}}_{N}={\boldsymbol{0}}_{N}\) for any digraph. When it is weight-balanced, we have \({\boldsymbol{1}}_{N}^{\top }L={\boldsymbol{0}}_{N}^{\top }\) and the matrix \(\operatorname{Sym}(L)\triangleq \frac{L+L^{\top }}{2}\) is positive semidefinite.

Consider a group of vectors \(\{{\mathbf{1}}, {a_{2}}, \dots , a_{N}\}\) with \(a_{i}\) the ith standard basis vector of \(\mathbb{R}^{N}\), i.e., all entries of \(a_{i}\) are zero except the i-th, which is one. These vectors are verified to be linearly independent. We apply the Gram-Schmidt process to them and obtain a group of orthonormal vectors \(\{\hat{a}_{1}, \dots , \hat{a}_{N}\}\). Let \(M_{1}=\hat{a}_{1}\in \mathbb{R}^{N}\) and \(M_{2}=[\hat{a}_{2} \dots \hat{a}_{N}]\in \mathbb{R}^{N\times (N-1)}\). It can be verified that \(M_{1}=\frac{1}{\sqrt{N}}{\boldsymbol{1}}_{N}\), \(M_{1}^{\top }M_{1}=1\), \(M_{2}^{\top }M_{2}=I_{N-1}\), \(M_{2}^{\top }M_{1}={\boldsymbol{0}}_{N-1}\), and \(M_{1} M_{1}^{\top }+ M_{2} M_{2}^{\top }=I_{N}\). Then, for a weight-balanced and strongly connected digraph, we can order the eigenvalues of \(\operatorname{Sym}(L)\) as \(0=\lambda _{1}<\lambda _{2}\leq \cdots \leq \lambda _{N}\) and further have \(\lambda _{2} I_{N-1}\leq M_{2}^{\top }\operatorname{Sym}(L)M_{2}\leq \lambda _{N} I_{N-1}\).
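The construction of \(M_{1}\), \(M_{2}\) and the eigenvalue bound above can be checked numerically. The following Python sketch is illustrative only; it uses a directed cycle with unit weights as a hypothetical weight-balanced, strongly connected digraph.

```python
import numpy as np

# Gram-Schmidt on {1, e_2, ..., e_N} to build M1 and M2, for N = 4.
N = 4
vecs = [np.ones(N)] + [np.eye(N)[:, i] for i in range(1, N)]
Q = []
for v in vecs:
    for q in Q:                      # subtract projections on earlier vectors
        v = v - (q @ v) * q
    Q.append(v / np.linalg.norm(v))  # normalize
M1, M2 = Q[0], np.column_stack(Q[1:])

assert np.allclose(M1, np.ones(N) / np.sqrt(N))
assert np.allclose(M2.T @ M2, np.eye(N - 1))
assert np.allclose(M2.T @ M1, 0)
assert np.allclose(np.outer(M1, M1) + M2 @ M2.T, np.eye(N))

# Directed cycle (weight-balanced, strongly connected): a_{i,i+1} = 1.
A = np.roll(np.eye(N), 1, axis=1)
L = np.diag(A.sum(axis=1)) - A
S = (L + L.T) / 2
lam = np.sort(np.linalg.eigvalsh(S))          # 0 = lam[0] < lam[1] <= ...
lam_res = np.linalg.eigvalsh(M2.T @ S @ M2)   # spectrum restricted to 1-perp
assert lam_res.min() >= lam[1] - 1e-12 and lam_res.max() <= lam[-1] + 1e-12
```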

3 Problem formulation

In this paper, we consider a multi-agent system consisting of N agents labeled as \(\mathcal{N}=\{1, \dots , N\}\). They play an N-player noncooperative game defined as follows: Agent i is endowed with a continuously differentiable cost function \({J}_{i}(z_{i}, {\boldsymbol{z}}_{-i})\), where \(z_{i}\in \mathbb{R}\) denotes the decision (or action) of agent i and \({\boldsymbol{z}}_{-i}\in \mathbb{R}^{N-1}\) denotes the decision profile of this multi-agent system except for agent i. In this game, each player seeks to minimize its own cost function \(J_{i}\) by selecting a proper decision \(z_{i}\). Here we adopt a unidimensional decision variable for ease of presentation; multidimensional extensions can be made without any technical obstacles.

The equilibrium point of this noncooperative game can be defined as in [5].

Definition 1

Consider the game \(\text{G}=\{\mathcal{N}, {J}_{i}, \mathbb{R}\}\). A decision profile \(z^{*}=\operatorname{col}(z_{1}^{*}, \dots , z_{N}^{*})\) is said to be a Nash equilibrium (NE) of the game G if \(J_{i}(z_{i}^{*}, {\boldsymbol{z}}_{-i}^{*})\leq J_{i}(z_{i}, {\boldsymbol{z}}_{-i}^{*})\) for any \(i\in \mathcal{N}\) and \(z_{i}\in \mathbb{R}\).

At a Nash equilibrium, no player can decrease its cost by unilaterally changing its own decision, and thus all agents tend to remain at this state. Denote \(F(z)\triangleq \operatorname{col}(\nabla _{1} J_{1}(z_{1}, \boldsymbol{z}_{-1}), \dots , \nabla _{N} J_{N}(z_{N}, \boldsymbol{z}_{-N}))\in \mathbb{R}^{N}\) with \(\nabla _{i} J_{i}(z_{i}, {\boldsymbol{z}}_{-i})\triangleq \frac{\partial }{\partial z_{i}}J_{i}(z_{i}, {\boldsymbol{z}}_{-i})\in \mathbb{R}\). Here F is called the pseudogradient associated with \(J_{1}, \dots , J_{N}\).

To ensure the well-posedness of our problem, the following assumptions are made throughout the paper:

Assumption 1

For each \(i\in \mathcal{N}\), the function \(J_{i}(z_{i}, {\boldsymbol{z}}_{-i})\) is twice continuously differentiable, strictly convex and radially unbounded in \(z_{i}\in \mathbb{R}\) for any fixed \({\boldsymbol{z}}_{-i}\in \mathbb{R}^{N-1}\).

Assumption 2

The pseudogradient F is \(\underline{l}\)-strongly monotone and \(\bar{l}\)-Lipschitz for two constants \(\underline{l}, \bar{l}>0\).

These assumptions have been used in [17] and [21]. Under these assumptions, our game G admits a unique Nash equilibrium \(z^{*}\) which can be characterized by the equation \(F(z^{*})={\boldsymbol{0}}\) according to Propositions 1.4.2 and 2.2.7 in [28].

In a full-information scenario when agents can have access to all the other agents’ decisions, a typical gradient-play rule

$$\begin{aligned} \dot{z}_{i}=-\frac{\partial J_{i}}{\partial z_{i}}(z_{i}, { \boldsymbol{z}}_{-i}), \quad i\in \mathcal{N} \end{aligned}$$

can be used to compute this Nash equilibrium \(z^{*}\). In this paper, we are more interested in distributed designs and assume that each agent only knows the decisions of a subset of all agents during the phase of computation.
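The full-information gradient-play rule above can be sketched numerically. The following Python fragment uses a hypothetical three-player quadratic game (not from this paper) whose pseudogradient is strongly monotone and whose unique Nash equilibrium is the origin.

```python
import numpy as np

# Full-information gradient play z_i' = -dJ_i/dz_i, discretized by forward
# Euler, on the hypothetical game J_i(z) = (z_i - sum_{j != i} z_j / 4)^2.
N, dt, T = 3, 0.01, 5000

def pseudogradient(z):
    # F(z) = col(dJ_i/dz_i) = 2 * (z_i - (sum(z) - z_i) / 4)
    return 2 * (z - (z.sum() - z) / 4)

z = np.array([3.0, -2.0, 5.0])
for _ in range(T):
    z = z - dt * pseudogradient(z)   # each player descends its own gradient

# the unique Nash equilibrium of this game is z* = 0
assert np.allclose(z, 0.0, atol=1e-6)
```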

For this purpose, a weighted digraph \(\mathcal{G}=(\mathcal{N}, \mathcal{E}, \mathcal{A})\) is used to describe the information sharing relationships among the agents with node set \(\mathcal{N}\) and weight matrix \(\mathcal{A}\in \mathbb{R}^{N\times N}\). If agent i can get the information of agent j, then there is a directed edge from agent j to agent i in the graph with weight \(a_{ij}>0\). Note that agent i may not have full information of \({\boldsymbol{z}}_{-i}\) except in the case of a complete communication graph. Thus, we have a noncooperative game with only partial information, which makes the classical gradient-play rule unimplementable.

To tackle this issue, a consensus-based rule has been developed in [17] and each agent is required to estimate all other agents’ decisions and implement an augmented gradient-play dynamics:

$$\begin{aligned} \dot{{\mathbf{z}}}^{i}&=-\sum _{j=1}^{N} a_{ij} \bigl({{ \mathbf{z}}}^{i}-{{ \mathbf{z}}}^{j} \bigr)- R_{i} \nabla _{i} J_{i} \bigl({ \mathbf{z}}^{i} \bigr), \end{aligned}$$
(1)

where \(R_{i}=\operatorname{col}({\boldsymbol{0}}_{i-1}, 1, {\boldsymbol{0}}_{N-i})\) and \({{\mathbf{z}}}^{i}=\operatorname{col}(z_{1}^{i}, \dots , z_{N}^{i})\). Here \({\mathbf{z}}^{i}\in \mathbb{R}^{N}\) represents agent i’s estimate of all agents’ decisions with \(z_{i}^{i}=z_{i}\) and \({\mathbf{z}}_{-i}^{i}=\operatorname{col}(z_{1}^{i}, \dots ,z^{i}_{i-1}, z^{i}_{i+1}, \dots , z^{i}_{N})\). Function \(\nabla _{i} J_{i}({\mathbf{z}}^{i})= \frac{\partial J_{i}}{\partial z_{i}^{i}}(z_{i}^{i}, {\mathbf{z}}_{-i}^{i})\) is the partial gradient of agent i’s cost function evaluated at the local estimate \({\mathbf{z}}^{i}\).

For convenience, we define an extended pseudogradient as \({\boldsymbol{F}}({\mathbf{z}})=\operatorname{col}( \nabla _{1} J_{1}({\mathbf{z}}^{1}), \dots , \nabla _{N} J_{N}({\mathbf{z}}^{N}))\in \mathbb{R}^{N}\) for this game G. The following assumption on this extended pseudogradient F was made in [17]:

Assumption 3

The extended pseudogradient F is \(l_{F}\)-Lipschitz with \(l_{F}>0\).

Let \(l=\max \{\bar{l}, l_{F}\}\). According to Theorem 2 in [17], along the trajectory of system (1), \({\boldsymbol{z}}^{i}(t)\) will exponentially converge to \(z^{*}\) as t goes to +∞ if graph \(\mathcal{G}\) is undirected and satisfies a strong coupling condition of the form: \(\lambda _{2}> \frac{l^{2}}{\underline{l}}+l\). Note that the coupling condition might be violated in applications for a given game and undirected graph (since the scalars \(\lambda _{2}\) and \(\frac{l^{2}}{\underline{l}}+l\) are both fixed). Although the authors in [17] further relaxed this connectivity condition by some singular perturbation technique, the derived results are still limited to undirected graphs.

In this paper, we assume that the information sharing graph is directed and satisfies the following condition:

Assumption 4

Digraph \(\mathcal{G}\) is strongly connected.

The main goal of this paper is to exploit the basic idea of algorithm (1) and develop effective distributed variants to solve this problem for digraphs under Assumption 4, which includes undirected connected graphs as a special case. Since the information flow might be asymmetric in this case, the resultant equilibrium seeking problem is more challenging than in the undirected case.

4 Main result

In this section, we first solve our Nash equilibrium seeking problem for the weight-balanced digraphs and then extend the derived results to general strongly connected ones with unbalanced weights.

4.1 Weight-balanced graph

To begin with, we make the following extra assumption:

Assumption 5

Digraph \(\mathcal{G}\) is weight-balanced.

Motivated by algorithm (1), we propose a modified version of gradient-play rules for game G as follows:

$$\begin{aligned} \dot{{\mathbf{z}}}^{i}&=-\alpha \sum _{j=1}^{N} a_{ij} \bigl({{ \mathbf{z}}}^{i}-{{ \mathbf{z}}}^{j} \bigr)- R_{i} \nabla _{i} J_{i} \bigl({ \mathbf{z}}^{i} \bigr), \end{aligned}$$
(2)

where \(R_{i}\), \({{\mathbf{z}}}^{i}\) are defined as above and \(\alpha >0\) is a constant to be specified later. Putting it into a compact form, we have

$$\begin{aligned} \dot{{\mathbf{z}}}=-\alpha {\mathbf{L}} {\mathbf{z}}-R { \boldsymbol{F}}({\mathbf{z}}), \end{aligned}$$
(3)

where \({\mathbf{z}}=\operatorname{col}({\mathbf{z}}^{1}, \dots , {\mathbf{z}}^{N})\), \(R=\operatorname{diag}(R_{1}, \dots , R_{N})\) and \({\mathbf{L}}=L\otimes I_{N}\) with the extended pseudogradient \({\boldsymbol{F}}({\mathbf{z}})\).
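As an illustration of (3), the following Python sketch simulates the dynamics by forward Euler on a directed cycle (weight-balanced, with \(\lambda _{2}=1\) for \(\operatorname{Sym}(L)\)) and a hypothetical decoupled quadratic game \(J_{i}(z)=(z_{i}-b_{i})^{2}\), for which \(\underline{l}=l=2\), so any \(\alpha >4\) meets the gain condition of Theorem 1.

```python
import numpy as np

# Algorithm (3): each agent keeps a full estimate z^i in R^N; consensus on
# the estimates plus a gradient step on the agent's own coordinate.
# Hypothetical game J_i(z) = (z_i - b_i)^2 with Nash equilibrium z*_i = b_i.
N, alpha, dt, T = 4, 5.0, 0.005, 20000
b = np.array([1.0, -2.0, 3.0, 0.5])

A = np.roll(np.eye(N), 1, axis=1)             # directed cycle, unit weights
L = np.diag(A.sum(axis=1)) - A

Z = np.zeros((N, N))                          # row i is agent i's estimate z^i
for _ in range(T):
    grad = 2 * (np.diag(Z) - b)               # nabla_i J_i at the estimate z^i
    dZ = -alpha * (L @ Z)                     # consensus (proportional) term
    dZ[np.arange(N), np.arange(N)] -= grad    # R_i injects the own gradient
    Z = Z + dt * dZ

# every local estimate converges to z* = b
assert np.allclose(Z, np.tile(b, (N, 1)), atol=1e-4)
```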

Different from algorithm (1) and its singularly perturbed extension presented in [17], we add an extra parameter α to increase the gain of the proportional term Lz. With this gain being large enough, the effectiveness of algorithm (3) is shown as follows:

Theorem 1

Suppose Assumptions 1–5 hold. Let \(\alpha >\frac{1}{\lambda _{2}}(\frac{l^{2}}{\underline{l}}+l)\). Then, for any \(i\in \mathcal{N}\), along the trajectory of system (3), \({\boldsymbol{z}}^{i}(t)\) exponentially converges to \(z^{*}\) as t goes to +∞.

Proof

We first show that at the equilibrium of system (3), \(z_{i}\) indeed reaches the Nash equilibrium of game G. In fact, letting the right-hand side of (2) be zero, we have \(\alpha {\mathbf{L}}{\mathbf{z}}^{*}+ R {\boldsymbol{F}}({\mathbf{z}}^{*})={\boldsymbol{0}}\). Premultiplying both sides by \({\boldsymbol{1}}^{\top }_{N} \otimes I_{N}\) gives

$$\begin{aligned} {\boldsymbol{0}}=\alpha \bigl({\boldsymbol{1}}^{\top }_{N} \otimes I_{N} \bigr) (L\otimes I_{N}){\mathbf{z}}^{*}+ \bigl({\boldsymbol{1}}^{\top }_{N} \otimes I_{N} \bigr) R {\boldsymbol{F}} \bigl({\mathbf{z}}^{*} \bigr). \end{aligned}$$

Using \({\boldsymbol{1}}^{\top }_{N}L=0\) gives \({\boldsymbol{0}}=({\boldsymbol{1}}^{\top }_{N} \otimes I_{N}) R {\boldsymbol{F}}({\mathbf{z}}^{*})\). By the notation of R and F, we have \({\boldsymbol{F}}({\mathbf{z}}^{*})=\boldsymbol{0}\). This further implies that \({\mathbf{L}}{\mathbf{z}}^{*}=\boldsymbol{0}\). Recalling the property of L under Assumption 4, one can determine some \(\theta \in \mathbb{R}^{N}\) such that \({\mathbf{z}}^{*}={\boldsymbol{1}}\otimes \theta \). This means \({\boldsymbol{F}}({\boldsymbol{1}}\otimes \theta )=\boldsymbol{0}\) and thus \(\nabla _{i} J_{i}(\theta _{i}, \theta _{-i})=0\), or equivalently, \(F(\theta )=\boldsymbol{0}\). That is, θ is the unique Nash equilibrium \(z^{*}\) of G and \({\mathbf{z}}^{*}={\boldsymbol{1}}\otimes z^{*} \).

Next, we show the exponential stability of system (3) at its equilibrium \({\mathbf{z}}^{*}={\boldsymbol{1}}\otimes z^{*}\). For this purpose, we denote \(\tilde{\mathbf{z}}={\mathbf{z}}-{\mathbf{z}}^{*}\) and perform the coordinate transformation \(\bar{\mathbf{z}}_{1}=(M_{1}^{\top }\otimes I_{N})\tilde{\mathbf{z}}\) and \(\bar{\mathbf{z}}_{2}=(M_{2}^{\top }\otimes I_{N})\tilde{\mathbf{z}}\). It follows that

$$\begin{aligned} &\dot{\bar{\mathbf{z}}}_{1}=- \bigl(M_{1}^{\top } \otimes I_{N} \bigr)R \Delta, \\ &\dot{\bar{\mathbf{z}}}_{2}=-\alpha \bigl[ \bigl(M_{2}^{\top }L M_{2} \bigr)\otimes I_{N} \bigr] \bar{ \mathbf{z}}_{2}- \bigl(M_{2}^{\top }\otimes I_{N} \bigr) R\Delta, \end{aligned}$$

where \(\Delta \triangleq {\boldsymbol{F}}({\mathbf{z}})- {\boldsymbol{F}}({\mathbf{z}}^{*})\).

Let \(V(\bar{\mathbf{z}}_{1}, \bar{\mathbf{z}}_{2})=\frac{1}{2}(\|\bar{\mathbf{z}}_{1} \|^{2}+\|\bar{\mathbf{z}}_{2}\|^{2})\). Then its time derivative along the trajectory of system (3) satisfies that

$$\begin{aligned} \dot{V}={}&{-}\bar{\mathbf{z}}_{1}^{\top } \bigl(M_{1}^{\top }\otimes I_{N} \bigr)R \Delta - \bar{\mathbf{z}}_{2}^{\top } \bigl(M_{2}^{\top } \otimes I_{N} \bigr) R\Delta \\ &{}- \alpha \bar{\mathbf{z}}_{2}^{\top } \bigl\{ \bigl[M_{2}^{\top }L M_{2} \bigr]\otimes I_{N} \bigr\} \bar{\mathbf{z}}_{2} \\ ={}&{-}\tilde{\mathbf{z}}^{\top }R \Delta - \alpha \bar{ \mathbf{z}}_{2}^{\top } \bigl[ \bigl(M_{2}^{\top } \operatorname{Sym}(L) M_{2} \bigr)\otimes I_{N} \bigr] \bar{ \mathbf{z}}_{2} \\ \leq{}& {-}\alpha \lambda _{2} \Vert \bar{\mathbf{z}}_{2} \Vert ^{2}-\tilde{\mathbf{z}}^{\top }R \Delta. \end{aligned}$$
(4)

Since \(\tilde{\mathbf{z}}=(M_{1} \otimes I_{N}) \bar{\mathbf{z}}_{1}+(M_{2} \otimes I_{N}) \bar{\mathbf{z}}_{2}\triangleq \tilde{\mathbf{z}}_{1}+ \tilde{\mathbf{z}}_{2}\), we split \(\tilde{\mathbf{z}}\) into two parts to estimate the above cross term and obtain that

$$\begin{aligned} -\tilde{\mathbf{z}}^{\top }R \Delta ={}&{-}(\tilde{\mathbf{z}}_{1}+ \tilde{\mathbf{z}}_{2} )^{\top }R \bigl[{\boldsymbol{F}} \bigl( \tilde{ \mathbf{z}}_{1}+\tilde{\mathbf{z}}_{2}+{ \mathbf{z}}^{*} \bigr)- { \boldsymbol{F}} \bigl({\mathbf{z}}^{*} \bigr) \bigr] \\ ={}&{-} \tilde{\mathbf{z}}_{1}^{\top }R \bigl[{\boldsymbol{F}} \bigl( \tilde{\mathbf{z}}_{1}+ \tilde{\mathbf{z}}_{2}+{ \mathbf{z}}^{*} \bigr)-{\boldsymbol{F}} \bigl(\tilde{\mathbf{z}}_{1}+{ \mathbf{z}}^{*} \bigr) \bigr] \\ &{}-\tilde{\mathbf{z}}_{2}^{\top }R \bigl[{\boldsymbol{F}} \bigl( \tilde{\mathbf{z}}_{1}+ \tilde{\mathbf{z}}_{2}+{ \mathbf{z}}^{*} \bigr)-{\boldsymbol{F}} \bigl(\tilde{\mathbf{z}}_{1}+{ \mathbf{z}}^{*} \bigr) \bigr] \\ &{}-\tilde{\mathbf{z}}_{1}^{\top }R \bigl[{\boldsymbol{F}} \bigl( \tilde{\mathbf{z}}_{1}+{\mathbf{z}}^{*} \bigr)-{ \boldsymbol{F}} \bigl({\mathbf{z}}^{*} \bigr) \bigr] \\ &{}-\tilde{\mathbf{z}}_{2}^{\top }R \bigl[{\boldsymbol{F}} \bigl( \tilde{\mathbf{z}}_{1}+{\mathbf{z}}^{*} \bigr)-{ \boldsymbol{F}} \bigl({\mathbf{z}}^{*} \bigr) \bigr]. \end{aligned}$$

As we have \({\boldsymbol{F}}({\boldsymbol{1}}_{N}\otimes y)=F(y)\) for any \(y\in \mathbb{R}^{N}\), it follows by the strong monotonicity of F that

$$\begin{aligned} &\tilde{\mathbf{z}}_{1}^{\top }R \bigl[{\boldsymbol{F}} \bigl( \tilde{\mathbf{z}}_{1}+{\mathbf{z}}^{*} \bigr)-{ \boldsymbol{F}} \bigl({\mathbf{z}}^{*} \bigr) \bigr] \\ &\quad =\frac{\bar{\mathbf{z}}_{1}^{\top }}{\sqrt{N}} \biggl[{\boldsymbol{F}} \biggl({\mathbf{1}} \otimes \biggl(\frac{\bar{\mathbf{z}}_{1}}{\sqrt{N}}+z^{*} \biggr) \biggr)-{\boldsymbol{F}} \bigl({ \mathbf{1}} \otimes z^{*} \bigr) \biggr] \\ &\quad =\frac{\bar{\mathbf{z}}_{1}^{\top }}{\sqrt{N}} \biggl[F \biggl(z^{*}+ \frac{{\bar{\mathbf{z}}}_{1}}{\sqrt{N}} \biggr)-{F} \bigl(z^{*} \bigr) \biggr] \\ &\quad \geq \frac{\underline{l}}{N} \Vert {\bar{\mathbf{z}}}_{1} \Vert ^{2}, \end{aligned}$$

where we use the identity \(({\mathbf{1}}^{\top }\otimes I_{N}) R=I_{N}\) and \(\tilde{\mathbf{z}}_{1}^{\top }R =\frac{\bar{\mathbf{z}}_{1}^{\top }}{\sqrt{N}}\). Note that \(\|R\|=\|M_{2}\|=1\) by definition. This implies that \(\|R^{\top }\tilde{\mathbf{z}}_{2} \|\leq \|\tilde{\mathbf{z}}_{2}\| =\| \bar{\mathbf{z}}_{2}\|\). Then, under Assumptions 2 and 3, we have

$$\begin{aligned} -\tilde{\mathbf{z}}^{\top }R \Delta &\leq \frac{2 l}{\sqrt{N}} \Vert \bar{\mathbf{z}}_{1} \Vert \Vert \bar{\mathbf{z}}_{2} \Vert +l \Vert \bar{\mathbf{z}}_{2} \Vert ^{2}- \frac{\underline{l}}{N} \Vert \bar{ \mathbf{z}}_{1} \Vert ^{2}. \end{aligned}$$
(5)

Bringing inequalities (4) and (5) together gives

$$\begin{aligned} \dot{V}&\leq -\frac{\underline{l}}{N} \Vert \bar{ \mathbf{z}}_{1} \Vert ^{2} -( \alpha \lambda _{2}- l) \Vert \bar{\mathbf{z}}_{2} \Vert ^{2}+\frac{2 l}{\sqrt{N}} \Vert \bar{\mathbf{z}}_{1} \Vert \Vert \bar{\mathbf{z}}_{2} \Vert \\ &=- \begin{bmatrix} \Vert \bar{\mathbf{z}}_{1} \Vert & \Vert \bar{\mathbf{z}}_{2} \Vert \end{bmatrix} A_{\alpha } \begin{bmatrix} \Vert \bar{\mathbf{z}}_{1} \Vert \\ \Vert \bar{\mathbf{z}}_{2} \Vert \end{bmatrix} \end{aligned}$$
(6)

with \(A_{\alpha }= \begin{bmatrix} \frac{\underline{l}}{N} & -\frac{l}{\sqrt{N}} \\ -\frac{l}{\sqrt{N}} & \alpha \lambda _{2}-l \end{bmatrix}\). When \(\alpha >\frac{1}{\lambda _{2}}(\frac{l^{2}}{\underline{l}}+l)\), matrix \(A_{\alpha }\) is positive definite. Thus, there exists a constant \(\nu >0\) such that

$$\begin{aligned} \dot{V}&\leq -\nu V. \end{aligned}$$

Recalling Theorem 4.10 in [29], one can conclude the exponential convergence of \({\mathbf{z}}(t)\) to \({\mathbf{z}}^{*}\), which implies that \({\mathbf{z}}^{i}(t)\) converges to \(z^{*}\) as t goes to +∞. The proof is thus complete. □

Remark 1

Algorithm (3) is a modified version of the gradient-play dynamics (1) with an adjustable proportional control gain α. The criterion to choose α clearly presents a natural trade-off between the control efforts and graph algebraic connectivity. By choosing a large enough α, this theorem ensures the exponential convergence of all local estimates to the Nash equilibrium \(z^{*}\) over weight-balanced digraphs and also provides an alternative way to remove the restrictive graph coupling condition presented in [17].

4.2 Weight-unbalanced graph

In this subsection, we aim to extend the preceding design to general strongly connected digraphs. In the following, we first modify (3) to ensure that its equilibrium coincides with the Nash equilibrium of game G, and then implement it in a distributed manner by adding a graph imbalance compensator.

At first, we assume that a left eigenvector of the Laplacian L associated with the zero eigenvalue is known and denoted by \(\xi =\operatorname{col}(\xi _{1}, \dots , \xi _{N})\), i.e., \(\xi ^{\top }L={\boldsymbol{0}}\). Without loss of generality, we assume \(\xi ^{\top }{\mathbf{1}}=1\). Then ξ is componentwise positive by Theorem 4.16 in Chap. 6 of [30]. We use this vector ξ to correct the graph imbalance in system (2) as follows:

$$\begin{aligned} \dot{{\mathbf{z}}}^{i}&=- \alpha \xi _{i} \sum_{j=1}^{N} a_{ij} \bigl({{ \mathbf{z}}}^{i}-{{\mathbf{z}}}^{j} \bigr)- R_{i} \nabla _{i} J_{i} \bigl({ \mathbf{z}}^{i} \bigr). \end{aligned}$$
(7)

Similar ideas can be found in [14] and [31]. We put this system into a compact form

$$\begin{aligned} \dot{{\mathbf{z}}}=-\alpha {\mathbf{L}}_{\Lambda }{ \mathbf{z}}-R {\boldsymbol{F}}({\mathbf{z}}), \end{aligned}$$
(8)

where \(\Lambda =\operatorname{diag}({\xi _{1}}, \dots , {\xi _{N}})\) and \({\mathbf{L}}_{\Lambda }=(\Lambda L) \otimes I_{N} \). It can be easily verified that ΛL is the Laplacian of a new digraph \(\mathcal{G}'\), which has the same topology as digraph \(\mathcal{G}\) but with scaled weights, i.e., \(a'_{ij}={\xi _{i}}{a_{ij}}\) for any \(i, j\in \mathcal{N}\). Since this new digraph \(\mathcal{G}'\) is weight-balanced, we denote by \(\lambda '_{2}\) the minimal positive eigenvalue of \(\operatorname{Sym}(\Lambda L)\).
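The weight-balance of \(\mathcal{G}'\) amounts to all row and column sums of ΛL being zero, which is easy to check numerically. The digraph below is a small hypothetical unbalanced example chosen for illustration.

```python
import numpy as np

# Row-scaling L by the left eigenvector xi yields a weight-balanced Laplacian.
# Hypothetical strongly connected, weight-unbalanced 4-node digraph:
A = np.array([[0, 1, 0, 0],
              [0, 0, 2, 0],
              [0, 0, 0, 1],
              [1, 0, 0, 0]], dtype=float)
L = np.diag(A.sum(axis=1)) - A
xi = np.array([2, 1, 2, 2]) / 7.0       # left eigenvector of L, xi^T 1 = 1

assert np.allclose(xi @ L, 0)           # xi^T L = 0
LL = np.diag(xi) @ L                    # Laplacian of the scaled digraph G'
assert np.allclose(LL.sum(axis=1), 0)   # row sums zero (still a Laplacian)
assert np.allclose(LL.sum(axis=0), 0)   # column sums zero: weight-balanced
```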

Here is an immediate consequence of Theorem 1.

Lemma 1

Suppose Assumptions 1–4 hold and let \(\alpha >\frac{1}{\lambda _{2}'}(\frac{l^{2}}{\underline{l}}+l)\). Then, for any \(i\in \mathcal{N}\), along the trajectory of system (8), \({\mathbf{z}}^{i}(t)\) exponentially converges to \(z^{*}\) as t goes to +∞.

Note that the aforementioned vector ξ is usually unknown to us for general digraphs. To implement our algorithm, we embed a distributed estimation rule of ξ into system (7) as follows:

$$\begin{aligned} \begin{aligned} \dot{\boldsymbol{\xi}}^{i}&=- \sum_{j=1}^{N} a_{ij} \bigl({{ \boldsymbol{\xi}}}^{i}-{{ \boldsymbol{\xi}}}^{j} \bigr), \end{aligned} \end{aligned}$$
(9)

where \({\boldsymbol{\xi}}^{i}=\operatorname{col}(\xi ^{i}_{1}, \dots , \xi ^{i}_{N})\) with initial conditions \(\xi _{i}^{i}(0)=1\) and \(\xi ^{i}_{j}(0)=0\) for any \(j\neq i \in \mathcal{N}\).

Here the diffusion dynamics of \({\boldsymbol{\xi}}^{i}\) is proposed to estimate the eigenvector ξ by \(\operatorname{col}(\xi _{1}^{1}, \dots , \xi _{N}^{N})\). The following lemma shows the effectiveness of (9).

Lemma 2

Suppose Assumption 4 holds. Then, along the trajectory of system (9), \(\xi ^{i}_{i}(t)>0\) for any \(t\geq 0\) and exponentially converges to \(\xi _{i}\) as t goes to +∞.

Proof

Note that the matrix −L is essentially nonnegative in the sense that \(\kappa I-L\) is nonnegative for any sufficiently large constant \(\kappa >0\). Under Assumption 4, matrix −L is also irreducible. By Theorem 3.12 in Chap. 6 of [30], the matrix exponential \(\exp(-Lt)\) is componentwise positive for any \(t\geq 0\). The evolution of \({\boldsymbol{\xi}}_{i}=\operatorname{col}(\xi ^{1}_{i}, \dots , \xi ^{N}_{i})\) is governed by \(\dot{\boldsymbol{\xi}}_{i}=-L{\boldsymbol{\xi}}_{i}\) with initial condition \({\boldsymbol{\xi}}_{i}(0)=\operatorname{col}({\boldsymbol{0}}_{i-1}, 1, {\boldsymbol{0}}_{N-i})\). Thus, \({\boldsymbol{\xi}}_{i}(t)=\exp(-Lt){\boldsymbol{\xi}}_{i}(0)>0\) for any \(t\geq 0\). By further using Theorems 1 and 3 in [12], \({\xi }^{i}_{i}(t)\) exponentially converges to the value \(\xi _{i}^{*}=\frac{\xi _{i}}{\sum_{j=1}^{N} \xi _{j}}\) for any \(i\in \mathcal{N}\) as t goes to +∞. Since \(\xi =\operatorname{col}(\xi _{1}, \dots , \xi _{N})\) is a left eigenvector of L associated with eigenvalue 0, one can easily verify that \({\xi ^{*}}^{\top }L=0\). Under Assumption 4, 0 is a simple eigenvalue of L. Then, there must be a constant \(c\neq 0\) such that \(\xi =c\xi ^{*}\). Note that \(\xi ^{\top }{\mathbf{1}}={\xi ^{*}}^{\top }{\mathbf{1}}=1\). One can conclude that \(c=1\), which completes the proof. □
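A numerical sketch of estimator (9) on a small hypothetical strongly connected, weight-unbalanced digraph: stacking the states \({\boldsymbol{\xi}}^{i}\) as the rows of a matrix, (9) reads \(\dot{X}=-LX\) with \(X(0)=I\), and the diagonal of X converges to ξ.

```python
import numpy as np

# Left-eigenvector estimator (9): agent i runs consensus starting from e_i;
# the ith component of its own state converges to xi_i (with xi^T 1 = 1).
N, dt, T = 4, 0.01, 10000
A = np.array([[0, 1, 0, 0],
              [0, 0, 2, 0],
              [0, 0, 0, 1],
              [1, 0, 0, 0]], dtype=float)   # strongly connected, unbalanced
L = np.diag(A.sum(axis=1)) - A

Xi = np.eye(N)                    # row i is agent i's estimate xi^i, Xi(0)=I
for _ in range(T):
    Xi = Xi - dt * (L @ Xi)       # xi^i_dot = -sum_j a_ij (xi^i - xi^j)

# reference: normalized left eigenvector of L for the zero eigenvalue
w, V = np.linalg.eig(L.T)
xi = np.real(V[:, np.argmin(np.abs(w))])
xi = xi / xi.sum()

assert np.allclose(np.diag(Xi), xi, atol=1e-6)
```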

The whole algorithm to seek the Nash equilibrium is presented as follows:

$$\begin{aligned} \begin{aligned} &\dot{{\mathbf{z}}}^{i}=- \alpha \xi ^{i}_{i} \sum_{j=1}^{N} a_{ij} \bigl({{ \mathbf{z}}}^{i}-{{\mathbf{z}}}^{j} \bigr)- R_{i} \nabla _{i} J_{i} \bigl({ \mathbf{z}}^{i} \bigr), \\ &\dot{\boldsymbol{\xi}}^{i}=-\sum_{j=1}^{N} a_{ij} \bigl({{\boldsymbol{\xi}}}^{i}-{{ \boldsymbol{\xi}}}^{j} \bigr) \end{aligned} \end{aligned}$$
(10)

with \(\xi _{i}^{i}(0)=1\) and \(\xi ^{i}_{j}(0)=0\) for any \(j\neq i \in \mathcal{N}\).

Bringing Lemmas 1 and 2 together, we provide the second main theorem of this paper.

Theorem 2

Suppose Assumptions 1–4 hold and let \(\alpha >\frac{1}{\lambda _{2}'}(\frac{l^{2}}{\underline{l}}+l)\). Then, for any \(i\in \mathcal{N}\), along the trajectory of system (10), \({\mathbf{z}}^{i}(t)\) exponentially converges to \(z^{*}\) as t goes to +∞.

Proof

First, we put the algorithm into a compact form:

$$\begin{aligned} \begin{aligned} &\dot{{\mathbf{z}}}=-\alpha { \mathbf{L}}_{\Lambda '}{\mathbf{z}}-R {\boldsymbol{F}}({\mathbf{z}}), \\ &\dot{\boldsymbol{\xi}}=-\mathbf {L} {{\boldsymbol{\xi}}}, \end{aligned} \end{aligned}$$
(11)

where \({\mathbf{L}}_{\Lambda '}=\Lambda ' L \otimes I_{N}\) and \(\Lambda '=\operatorname{diag}({\xi _{1}^{1}}, \dots , {\xi _{N}^{N}})\). From this, one can further find that the composite system consists of two subsystems in a cascaded form as follows:

$$\begin{aligned} \begin{aligned} &\dot{{\mathbf{z}}}=-\alpha {\mathbf{L}}_{\Lambda }{ \mathbf{z}}-R {\boldsymbol{F}}({\mathbf{z}})- \alpha \bigl[(\Delta _{\Lambda }L) \otimes I_{N} \bigr]{\mathbf{z}}, \\ &\dot{\boldsymbol{\xi}}=-\mathbf {L} {{\boldsymbol{\xi}}}, \end{aligned} \end{aligned}$$

where \({\mathbf{L}}_{\Lambda }\) is defined as in (8) and \(\Delta _{\Lambda }= \Lambda '-\Lambda \). Note that the term \(\alpha [(\Delta _{\Lambda }L)\otimes I_{N}]{\mathbf{z}}\) can be upper bounded by \(\gamma _{p}\exp(-\beta _{p} t)\|\mathbf {z}\|\) for some positive constants \(\gamma _{p}\) and \(\beta _{p}\) according to Lemma 2. We thus view \(\alpha [(\Delta _{\Lambda }L)\otimes I_{N}]{\mathbf{z}}\) as a vanishing perturbation of the z-subsystem, whose unperturbed dynamics is globally exponentially stable at the equilibrium \({\mathbf{z}}^{*}={\mathbf{1}}_{N}\otimes z^{*}\) by Lemma 1. Recalling Corollary 9.1 in [29], the whole algorithm (11) is globally exponentially stable at its equilibrium. This implies that along the trajectory of system (11), \({\boldsymbol{z}}^{i}(t)\) exponentially converges to \(z^{*}\) as t goes to +∞. The proof is thus complete. □

Remark 2

In contrast to algorithm (2) with proportional gains in Theorem 1, the new rule (10) further includes a distributed left-eigenvector estimator to compensate for the imbalance of the graph Laplacian. Compared with the equilibrium seeking results in [15, 17, 18] for undirected graphs, the proportional control and the graph imbalance compensator together enable us to solve this problem for strongly connected digraphs, which include undirected graphs as a special case.

5 Simulation

In this section, we present an example to verify the effectiveness of our designs.

Consider an eight-player noncooperative game. Each player has a payoff function of the form \(J_{i}(x_{i}, x_{-i})=c_{i} x_{i} -x_{i}f(x)\) with \(x=\operatorname{col}(x_{1}, \dots , x_{8})\) and \(f(x)=D-\sum_{i=1}^{8} x_{i}\) for a constant \(D>0\). Suppose the communication topology among the agents is depicted by the digraph in Fig. 1 with all weights equal to one. The Nash equilibrium of this game can be analytically determined as \(z^{*}=\operatorname{col}(z^{*}_{1}, \dots , z^{*}_{8})\) with \(z^{*}_{i}=46-4i\).
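The claimed equilibrium can be double-checked by solving \(F(z^{*})={\boldsymbol{0}}\) directly: since \(\nabla _{i} J_{i}=c_{i}-f(x)+x_{i}\), the pseudogradient is affine and the Nash equilibrium is the solution of a linear system. A short Python check with the simulation parameters \(c_{i}=4i\), \(D=270\):

```python
import numpy as np

# F(z) = c - (D - 1^T z) 1 + z = (I + 1 1^T) z + c - D 1, so F(z*) = 0 gives
# a linear system (I + 1 1^T) z* = D 1 - c.
N, D = 8, 270
c = 4 * np.arange(1, N + 1)              # c_i = 4i
M = np.eye(N) + np.ones((N, N))
z_star = np.linalg.solve(M, D - c)

assert np.allclose(z_star, 46 - 4 * np.arange(1, N + 1))   # z*_i = 46 - 4i
```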

Figure 1
figure 1

Digraph \(\mathcal{G}\) in our example

Since the communication graph is directed and weight-unbalanced, the gradient-play algorithm developed in [17] might fail to solve the problem. At the same time, Assumptions 1–4 can be easily confirmed. Then we can resort to Theorem 2 and use algorithm (10) to seek the Nash equilibrium in this eight-player noncooperative game.

For simulations, let \(c_{i}=4i\) and \(D=270\). We sequentially choose \(\alpha =2\) and \(\alpha =10\) for algorithm (10). Since the right-hand side of our algorithm is Lipschitz, we conduct the simulation via the forward Euler method with a small step size [32]. The simulation results are shown in Figs. 2–4. From Fig. 2, one can find that the estimate \(\xi (t)\) converges quickly to the left eigenvector of the graph Laplacian \(\xi =\operatorname{col}(4, 4, 3, 2, 2, 1, 1, 1)/18\). At the same time, \(\operatorname{col}(z_{1}(t), \dots , z_{8}(t))\) approaches the Nash equilibrium \(z^{*}\) of this game for different proportional parameters. Moreover, a larger proportional gain α is observed to yield a faster convergence rate. We also show the profile of \(\eta _{i}(t)\triangleq t^{2}(z_{i}(t)-z_{i}^{*})\) in Fig. 5 to confirm the exponential convergence rate when \(\alpha =10\). These results verify the effectiveness of our designs in resolving the Nash equilibrium seeking problem over general strongly connected digraphs.

Figure 2
figure 2

Profile of \(\xi_{i}^{i} (t)\) in our example

Figure 3
figure 3

Profile of \(z_{i}(t)\) in our example with \(\alpha =2\)

Figure 4
figure 4

Profile of \(z_{i}(t)\) in our example with \(\alpha =10\)

Figure 5
figure 5

Profile of \(\eta _{i}(t)\) in our example with \(\alpha =10\)

6 Conclusion

The Nash equilibrium seeking problem over directed graphs has been addressed with consensus-based distributed rules. By selecting proper proportional gains and embedding a distributed graph imbalance compensator, the expected Nash equilibrium is shown to be reached exponentially fast over general strongly connected digraphs. In the future, we may use adaptive high-gain techniques as in [21, 33] to extend the results to fully distributed versions. Another interesting direction is to incorporate high-order agent dynamics and nonsmooth cost functions.