1 Introduction

Large-scale systems have become interesting in control engineering. Many systems become large in scope and complex in scale since decades ago and coping with these systems requires time and efforts since many problems are appeared like existence of disturbances, nonlinear parameters and complex dynamic. Many algorithms have proposed for large-scale systems since years ago [1, 2]. The decentralized controller is an interesting approach that scholars aimed to solve these problems [3]. In this approach, the considered large-scale system is decomposed into several subsystems and a controller is applied to each subsystem separately, and the interconnections between subsystems are considered. So the overall system is stabilized by adopting this approach. In [4], a decentralized fault-tolerant tracking control is applied to a large-scale system with sensor and actuator faults that its problem is decomposed into several problems. The decentralized adaptive fuzzy controller is proposed for a class of uncertain nonlinear large-scale systems considering random sensor delays and random sensor nonlinearities in [5]. By using a group of sensor blocks two new distributed data-driven optimal fault detection approaches are applied to a large-scale system in [6].

As it was mentioned above, one of the important issues in large-scale system is its complex dynamic. To design a suitable controller and solve problems, it is significant to have the exact model of the system. But it is almost impossible when the system has many nonlinear parameters and severe dynamic. In this case, scientists use some methods like fuzzy logic or neural networks to identify the model of the system [7, 8]. For instance, in [9], the neural network is used to estimate the inverse dynamic of the da Vinci surgical robot to enables estimation of the external environment forces. In this paper, since the considered system is large-scale, the fuzzy model of the systems is used instead of the dynamic model. Nowadays, fuzzy systems with IF–THEN rules have become popular and useful approach for systems with large, complex, and nonlinear relations and several researches have been done [10,11,12,13. Introduces a type of fuzzy inference system well-known Takagi–Sugeno, fuzzy model, and model complex nonlinear system to arbitrary degrees of accuracy. On the other hand, another vital issue that is considered in this paper is time-varying delay that is unavoidable problem. The time delay phenomena can be seen in many real and industrial systems [14], and in many cases it causes instability and might lead outputs to unexpected goal. Since the time delay causes instability, attentions have been attracted for many years. Totally, Lyapunov theory for systems with time-varying delay can be proposed into two approaches, the Lyapunov–Krasovskii functional (LKF) and Lyapunov-Razumikhin function (LRF) [15]. For discrete time systems, the Krasovskii method makes use of an augmentation of the state vector with all delayed states, which yields the application of classical Lyapunov methods to an augmented system without delay. In the Krasovskii method, an augmentation of the state vector with all delayed states are used, which yields the applications of classical Lyapunov methods to an augmented system without delay. So the computational burden in this method will be heavy, especially for large-scale systems [16]. On the contrast, the Razumikhin method includes a Lyapunov function for the original non-augmented system despite of the fact that it might be a bit conservative. Thereby, Razumikhin method has a better potential to avoid the complexity of the Krasovskii method. Many studies have been done by adopting Razumikhin method [17,18,19. In [16], the author proposed an LMI-based model predictive control for fuzzy T-S system with time-varying delay and used Razumikhin method for the delay problem and the LMI-based method is proposed for not interconnected system. In [20], a model predictive control strategy for nonlinear time-delay systems with unknown time varying delay is considered instead of the traditional constant delay. Nonlinear discrete time-delay systems are represented by T–S fuzzy systems comprised of piecewise linear delay difference equations in [21]. The stability of delay coupled systems with hybrid switching diffusions based on Razumikhin method is studied in [22] and it combines Lyapunov method with graph theory for typical systems. To stabilize these systems that are involved time-varying delay, many control systems have been applied like adaptive, fuzzy, robust, or so. One of the most interesting controller has been model predictive control.

Model predictive control (MPC) has attended popularity as a reliable control approach. By properly using the system model to predict the output response, MPC methods allow to choose the optimal control action that minimizes a desired cost function. In many applications, MPC has been recognized as a viable alternative to many other classic schemes based. For such applications, the advantages introduced by MPC are the higher dynamic performance, the possibility of performing a multivariable controller design, the inclusions of constraints on input and output variables, and the possibility of including nonlinearities in both the model and the constraints. Although the need for an accurate model may represent a drawback in some applications, it is remarked how reliable model of systems are usually available for control design [23, 24]. The gist of the model predictive control is an optimal control sequence, which is computed by minimizing a finite horizon cost function at each sampling time. Studying on nonlinear robust model predictive control for Takagi–Sugeno fuzzy systems can be seen in [25]. As mentioned in [26, 27], invariant set theory can provide sufficient conditions to make sure the recursive feasibility and the closed-loop system stability. In [28] both online and off-line robust fuzzy model predictive control with structured uncertainties and persistent disturbances is studied. In [23] robust fuzzy model predictive control with nonlinear local models is introduced, in which, for both nonlinear and linear states of system, separated controllers are designed. In [29], an LMI-based MPC is proposed for a large-scale system with disturbances and uncertainties. Obviously, it can be seen that hierarchical-based optimization MPC for large-scale systems with time-varying delay, persistent disturbances, and with respect to Lyapunov-Razumikhin function has not been studied yet and several problems remain unsolved. In this study, the considered controller is model predictive control. Since the considered system is a large-scale, the hierarchical scheme is applied to solve the optimization problem based on decomposition and coordination concept [30]. On the other hand, no research has been done regarding the MPC for fuzzy large-scale systems with time-varying delay adopting the Lyapunov-Razumikhin function. Therefore, the MPC is applied to a large-scale system that its dynamic is modeled by T-S fuzzy, and the Lyapunov-Razumikhin function is considered for the time-varying delay problem. Besides, the hierarchical scheme is assumed for the optimization problem of the system.

So the contribution of this paper can be summarized as:

  • Proposing the fuzzy model predictive control for discrete nonlinear large-scale system.

  • Considering the time-varying delay and persistent disturbances simultaneously.

  • Applying hierarchical optimization to the considered large-scale system.

  • Studying Lyapunov-Razumikhin function.

  • Analyzing the input-to-state stability.

  • Using the fuzzy Takagi–Sugeno model of the large-scale system.

  • Inspiring the \({H}_{\infty }\) performance to encounter the effectiveness of disturbances.

The remainder of this paper is: in Section 2, some preliminaries information about fuzzy model predictive control of Takagi–Sugeno large-scale systems with time-varying delay and hierarchical optimization scheme are introduced. In Section 3, robust positive invariance (RPI), the computation of terminal constraint set for nonlinear model-based fuzzy systems is provided. In Section 4, two numerical examples are illustrated. Finally, the conclusion is given in Section 5.

Notations: \(R, {R}_{+}, Z,\) and \({Z}_{+}\) depict sets of real numbers, positive real numbers, integers, and positive integers, respectively. \({Z}_{[m,n]}\) is symbol of set of integers in the interval \(\left[m,n\right]\) for convenience. \(\Vert x\Vert\) depicts the norm of vector \({x}_{i}\in {R}^{c}.\) A real-valued scalar function \(\partial :{R}_{+}\to {R}_{+}\) is an \(\mathcal{H}\)-function \(\left(\partial \in \mathcal{H}\right)\), if it is continuous, rigidly increasing and \(\partial \left(0\right)=0.\) So we can say, \(\partial \in {\mathcal{H}}_{\infty }\) if \(\partial \in \mathcal{H}\) and \(\underset{s\to \infty }{\mathrm{lim}}\partial \left(s\right)=\infty .\) A function \(\beta :\) \(R_{ + } \times R_{ + } \to R_{ + }\) is an \(\mathcal{H}L\)-function \(\left(\beta \in \mathcal{H}L\right),\) if for each fixed \(s>0, \beta (.,s)\) is a \(\mathcal{H}\)-function, and for each fixed \(r>0\), \(\beta (r,.)\) is rigidly decreasing and \(\beta \left(r,s\right)\to0\) as \(s\to\infty\).

2 Preliminaries

2.1 System Description

The following discrete-time nonlinear large-scale system is assumed:

$$x\left(k+1\right)\in f\left({x}_{i}\left(k\right),{x}_{id}\left(k\right),{u}_{i}\left(k\right),{d}_{i} \, \left(k\right)\right) \, k\in {Z}_{+}$$
(1)

where \({x}_{i}\left(k\right),{x}_{id}\left(k\right)\in {\mathbb{X}}, {u}_{i}\left(k\right)\in {\mathbb{U}}\) and \({d}_{i}\left(k\right)\in {\mathbb{D}},\) with \({\mathbb{X}}\subseteq {\mathbb{R}}^{n}, {\mathbb{U}}\subseteq {\mathbb{R}}^{m},\) and \({\mathbb{D}}\subseteq {\mathbb{R}}^{c}\), are system current and delayed states, inputs, and disturbances, respectively. \({x}_{id}\left(k\right)\) is defined as follows:

$${x}_{id}\left(k\right)={x}_{id}\left(k+d(k)\right), d\left(k\right)\in {\mathbb{Z}}_{\left[-h,-1\right]}.$$
(2)

where \(d(k)\) denotes the number of delays, h denotes the upper bound of delay, and the minimal delay is set as 1. Now, for time-delay systems some definitions related to the robust positive invariance and input-to-state stability are proposed [31]:

Definition 1

(Robust Positively Invariant (RPI) set) For the Lyapunov-Razumikhin function conditions, consider system \(\left(1\right),\) a set \({\Omega }_{is}\) is an RPI set for the closed-loop system corresponding to the control law, and for the \(d\left(k\right)\in {\mathbb{Z}}_{\left[-h,-1\right]},\) if \(\forall {x}_{i},{x}_{id}\in {\Omega }_{is} ,\) and \(\forall {d}_{i}\left(k\right)\in {\mathbb{D}},\) the control effort assures that \({x}_{i}^{+}\in {\Omega }_{is}.\)

Definition 2

(Input-to-state-Stability (ISS)) A discrete-time nonlinear system \(x\left(k+1\right)=f({x}_{i}\left(k\right),{x}_{id}\left(k\right),{d}_{i}\left(k\right))\) where \(d\) symbolizes the disturbance vector, is input-to-state stable (ISS) if there exists an \(\mathcal{H}L\)-function \(\beta\) and a \(\mathcal{H}\)-function \(\gamma\) such that for each input \(d\),\(\Vert x\left(k\right)\Vert \le \beta \left(\Vert {{x}_{i}}_{\left[-h,0\right]}\Vert ,k\right)+\gamma \left(\parallel {{d}_{i}}_{\left[0,k-1\right]}\parallel \right),\) where \({{x}_{i}}_{\left[-h,0\right]}\in {\mathbb{X}}^{h+1}\) is the initial (delayed) state vector, \({{d}_{i}}_{\left[0,k-1\right]}\in {\mathbb{W}}^{k}\) is the disturbance sequence, \(k\in {Z}_{+}\).

Definition 3

(ISS Lyapunov-Razumikhin Function [16]) A continuous positive definite function \(V\left(x\left(k\right)\right)\) is called an ISS-Lyapunov-Razumikhin function for system \(x\left(k+1\right)=f\left({x}_{i}\left(k\right),{x}_{id}\left(k\right),{d}_{i}\left(k\right)\right),\) if there exist \({\mathcal{H}}_{{\infty }^{-}}\mathrm{function}\) \({\partial }_{1},{ \partial }_{2},\) and \(\mathcal{H}\)-function \(\rho\) such that:

$${\partial }_{1}\left(\parallel x\left(k\right)\parallel \right)\le V\left(x\left(k\right)\right)\le {\partial }_{2}\left(\parallel x\left(k\right)\parallel \right)$$
(3)
$$V\left(x\left(k+1\right)\right)\le max\left\{\overline{V }\left(x\left(k\right)\right),\rho \left(\parallel d\left(k\right)\parallel \right)\right\}$$
(4)

where \(\overline{V }\left(x\left(k\right)\right)=max\left\{V\left({x}_{i}\left(k\right)\right),V\left({x}_{id}\left(k\right)\right)\right\}\)

Lemma 1

[32] If system \(x\left(k+1\right)=f({x}_{i}\left(k\right),{x}_{id}\left(k\right),{d}_{i}\left(k\right))\) admits an ISS-Lyapunov- Razumikhin function, then it is ISS.

2.2 Time Delay Takagi–Sugeno System Description

The fuzzy Takagi–Sugeno large-scale system composed of \(N\) subsystems with time-varying delay is:

$${S}_{i}^{l}:\left\{\begin{array}{c} \mathbf{IF} \, {z}_{i1} \, is \, {F}_{i1}^{l} \, and \, \dots \, and \, {z}_{ig} \, is \, {F}_{ig}^{l} \\ \\ \mathbf{THEN} \, {x}_{i}\left(k+1\right)={A}_{i}^{l}{x}_{i}\left(k\right)+{B}_{i}^{l}{u}_{i}\left(k\right)+{A}_{id}^{l}{x}_{id}\left(k\right)+{w}_{i}^{l}{d}_{i}\left(k\right)+\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{f}_{ij}{x}_{j}(k)\end{array}\right.$$
(5)

in which \({A}_{i}^{l},{B}_{i}^{l},\) and \({w}_{i}^{l}\) \(\left(i=\mathrm{1,2},\dots ,N;l=\mathrm{1,2},\dots ,{r}_{i}\right)\) are the system matrices and disturbances of rule-\(l\) in subsystem \({S}_{i }.\) Here, \({x}_{i}\left(k\right)\in {R}^{c},\) \({u}_{i}\left(k\right)\in {R}^{n},\) and \({d}_{i}\left(k\right)\in {R}^{m}\) are state vector, input vector, and disturbance vector.\({r}_{i}, {f}_{ij}, {z}_{i}\left(k\right)=[{z}_{i1},{z}_{i2},\dots ,{z}_{ig}]\), and \({F}_{iq}^{l}\left(q=\mathrm{1,2},\dots ,g\right),\) respectively, illustrate the number of the fuzzy rules in subsystem \({S}_{i } ,\) the interconnection between subsystem \({S}_{i}\) and \({S}_{J},\) some measurable premise variables, and the linguistic fuzzy sets of the rule\(l\). The fuzzy large-scale system \((5)\) with time-varying delay can be shown as:

$${x}_{i}^{+}={A}_{i\mu }{x}_{i}+{B}_{i\mu }{u}_{i}+{A}_{id\mu }{x}_{id}+{w}_{i\mu }{d}_{i}+\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{f}_{ij}{x}_{j} , \, i=\mathrm{1,2},\dots ,N$$
(6)
$$\begin{array}{c}{A}_{i\mu }=\sum_{l=1}^{{r}_{i}}{\mu }_{i}^{l}\left({z}_{iq}\right) \, {A}_{i}^{l}; \, {A}_{id\mu }=\sum_{l=1}^{{r}_{i}}{\mu }_{i}^{l} \, \left({z}_{iq}\right){A}_{id}^{l}; \, {B}_{i\mu }=\sum_{l=1}^{{r}_{i}}{\mu }_{i}^{l}\left({z}_{iq}\right){B}_{i}^{l}\\ {w}_{i\mu }=\sum_{l=1}^{{r}_{i}}{\mu }_{i}^{l}\left({z}_{iq}\right){w}_{i}^{l}; \, {f}_{ij\mu }=\sum_{l=1}^{{r}_{i}}{\mu }_{i}^{l}({z}_{iq}){f}_{ij}\end{array}$$
(7)

and \({\mu }_{i}^{l}\left({z}_{iq}\right)\) represents the normalized membership function.

Control law is shown as:

$${C}_{i}^{l}:\left\{\begin{array}{c} \mathbf{IF} \, {z}_{il} \, is \, {F}_{i1}^{l} \, and \, \dots \, and \, {z}_{ig} \, is \, {F}_{ig}^{l} \\ \mathbf{THEN} \, {u}_{i}\left(k\right)={k}_{i}^{l}{x}_{i}\left(k\right)\end{array}\right.$$
(8)

With the same weight notation \({\mu }_{i}^{l}\left({z}_{iq}\right)\) and respect to \(\left(8\right)\), the final output of the controller will be:

$${u}_{i}\left(k\right)=\sum_{l=1}^{{r}_{i}}{\mu }_{i}^{l}({z}_{iq}){k}_{i}^{l}{x}_{i}(k)$$
(9)

The closed-loop system with time-varying delay will be:

$${x}_{i}\left(k+1\right)=\sum_{l=1}^{{r}_{i}}\sum_{m=1}^{{r}_{i}}{\mu }_{i}^{l}\left({z}_{iq}\right){\mu }_{i}^{m}\left({z}_{iq}\right)\left(\left[{A}_{i}^{l}+{B}_{i}^{l}{k}_{i}^{m}\right]{x}_{i}\left(k\right)+{A}_{id}^{l}{x}_{id}\left(k\right)+{w}_{i}^{l}{d}_{i}\left(k\right)\right)+\sum_{l=1}^{{r}_{i}}\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{\mu }_{i}^{l}\left({z}_{iq}\right){f}_{ij}{x}_{j}\left(k\right)$$
(10)

2.3 Model Predictive Control

The prediction model and cost function are proposed here. The prediction form is:

$${x}_{i}\left(k+n+1|k\right)={A}_{i\mu }{x}_{i}\left(k+t|k\right)+{A}_{id\mu }{x}_{id}\left(k+t|k\right)+{B}_{i\mu }{u}_{i}\left(k+t|k\right)+{w}_{i\mu }{d}_{i}\left(k+t|k\right)+\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{f}_{ij\mu }{x}_{j}\left(k+t|k\right)$$
(11)

and the cost function is:

$$J\left(k\right)=\sum_{i=1}^{N}{j}_{i}\left(k\right)=\sum_{i=1}^{N}\left({\Pi }_{i}\left(K\right)+\sum_{n=0}^{T-1}{\Pi }_{i}\left(k+t|k\right)+{V}_{it}\left({x}_{i}\left(k+T|k\right)\right)\right)$$
(12)

where \({\Pi }_{i}\left(k+n|k\right)\) and \({V}_{it}\left({x}_{i}\left(k+T|k\right)\right)\) are stage cost at the predicted time instant and terminal cost, and \(T\) is the prediction length. It is highly notable that \({V}_{it}\left( \cdot \right)\) must be a positive function [33], and the stage cost is chosen as:

$$\Pi \left(k\right)=\sum_{i=1}^{N}{\Pi }_{i}\left(k\right)=\sum_{i=1}^{N}\left({x}_{i}^{T}\left(k+t|k\right)Q{x}_{i}\left(k+t|k\right)+{u}_{i}^{T}\left(k+t|k\right)R{u}_{i}\left(k+t|k\right)-{\tau }_{i}{d}_{i}^{T}\left(k+t|k\right){d}_{i}\left(k+t|k\right)\right)$$
(13)

where \(R\) and \(Q\) are real fixed matrices, and \({\tau }_{i}\) is a positive scalar. It is evident the cost function includes the disturbance, and is influenced by the \({H}_{\infty }\) control [34]. Consequently, the cost function cannot be optimized directly as the disturbance is included. Instead, a min–max technique is chosen that minimizing the worst-case cost function [35]. Furthermore, at the end of the prediction, it is mostly required that the states enter a terminal constraint set to attain asymptotic stability, since it is almost impossible task to achieve the asymptotic stability to the origin, in the presence of persistent disturbance [36]. \({x}_{i}\left(k+T|k\right) \in {\Omega }_{is}\) shows the terminal constraint set. The online optimization problem is:

$$\underset{{\mathit{ }u}_{i}\left(k+t|k\right)}{\mathrm{min}}\mathit{ }\underset{{d}_{i}\left(k+t|k\right)}{\mathrm{max}}\mathit{ }{J}_{i}\left(k\right),$$
$${s.t. u}_{i} \,\left(k+t|k\right)\in {U}_{i}$$
$${d}_{i} \,\left(k+t|k\right)\in {D}_{i}$$
$${x}_{i} \,\left(k+T|k\right)\in {\Omega }_{is}$$

where \(t\in {Z}_{[1,T-1]}\) and \({\Omega }_{is}\) shows the terminal constraint set. Here\({d}_{i}\in {D}_{i}:=\left\{{d}_{i}|{d}_{i}^{T}{d}_{i}\le {{\gamma }_{i}}^{2}\right\}\), \({u}_{i}\in {U}_{i}:=\left\{{u}_{i}||{u}_{im}|\le {u}_{im.max}\right\}\) and \({{\gamma }_{i}}^{2}\) is a positive scalar, \({u}_{im}\) is the \(m\)-th element of the inputs, \(m\in {Z}_{[1,w]}\).

As it was mentioned, the model predictive control is a kind of controller which computes the input vector by minimizing a specified cost function. In this manuscript, the considered system is large-scale and the form of the controller is decentralized. Thus, to avoid the uncertainty in cost function and reach the best solution, the hierarchical scheme is applied to the optimization problem [37].

3 Main Results

3.1 RPI Set for Fuzzy Takagi–Sugeno Large-Scale System with Time-Varying Delay

First, the RPI property and terminal constraint set are defined. Second, the recursive feasibility is analyzed. Finally, the ISS will be provided. The RPI set is illustrated by \({\Omega }_{is},\) and \({\Lambda }_{i}\left(x\right)\) as the corresponding control law. The RPI set is defined as, \(\forall {x}_{i},{x}_{id}\in {\Omega }_{is} ,\) the control effort assures that \({x}_{i}^{+}\in {\Omega }_{is},\) for all allowable disturbance. For the fuzzy Takagi–Sugeno large-scale system \(\left(3\right)\) with time-varying delay, define \({\Omega }_{is}\) as:

$${\Omega }_{is}:=\left\{\left\{{x}_{i},{x}_{id}\right\}\left|max\left\{\left(\sum_{l=1}^{{r}_{i}}{{\mu }_{i}^{l}\left({z}_{iq}\right)x}_{i}^{T}{P}_{i\mu }{x}_{i}\right),\left(\sum_{l=1}^{{r}_{i}}{{\mu }_{i}^{l}\left({z}_{iq}\right)x}_{id}^{T}{P}_{i\mu }{x}_{id}\right)\right\}\le {\varsigma }_{i}\right.\right\}$$
(14)

where \({P}_{i\mu }=\sum_{l=1}^{{r}_{i}}{\mu }_{i}^{l}({z}_{iq}){P}_{i}\) and \({\varsigma }_{i}\) is a positive scalar and the corresponding control law is \({\Lambda }_{i}\left({x}_{i},{x}_{id}\right)=\sum_{l=1}^{{r}_{i}}{\mu }_{i}^{l}({z}_{iq}){k}_{i}^{l}{x}_{i}(k)\).

Lemma 2

[28]: The set \({\Omega }_{is}\) is an RPI set if there exists a positive scalar \({\lambda }_{i}, \left({0<\lambda }_{i}<1\right),\) such that,

$$\sum_{i=1}^{N}\left\{\frac{1}{{\varsigma }_{i}}{x}_{i}^{{+}^{T}}{P}_{i\mu }^{+}{x}_{i}^{+}-\frac{1-{\lambda }_{i}}{{\varsigma }_{i}}max\left\{{x}_{i}^{T}{P}_{i\mu }{x}_{i},{x}_{id}^{T}{P}_{i\mu }{x}_{id}\right\}-\frac{{\lambda }_{i}}{{{\gamma }_{i}}^{2}}{d}_{i}^{T}{d}_{i}\right\}\le 0$$
(15)

with \({P}_{i\mu }^{+}=\sum_{l=1}^{{r}_{i}}{\mu }_{i}^{l}({x}_{i}^{+}){P}_{i},\) for all \({x}_{i}^{+}\in {A}_{i\mu }{x}_{i}+{A}_{id\mu }{x}_{id}+{B}_{i\mu }{u}_{i}+{w}_{i\mu }{d}_{i}+\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{f}_{ij\mu }{x}_{j}, {u}_{i}\in {U}_{i},\) and \({d}_{i}\in {D}_{i}.\)

Remark 1

Here, by proposing the Theorem 1, two LMIs are solved and if they are feasible, two results are achieved. (1) It will be proved that the considered large-scale fuzzy system with time-varying delay is stable in the sense of Lyapunov. (2) Controller’s gains are computed in the restricted bound, optimally.

Theorem 1

Consider the fuzzy system \(\left(5\right),\) if there exist positive definite matrices \({X}_{i}\) \({X}_{j},\) and \({X}_{i\mu }\), positive scalar \({\lambda }_{i} \left(0<{\lambda }_{i}<1\right)\) such that the following matrix inequalities are met:

(16)
$$\left[\begin{array}{cc}{Z}_{i}& {k}_{i\mu }^{T}\\ {k}_{i\mu }& 1\end{array}\right]\ge 0 , {Z}_{iss}\le {u}_{is,max}^{2}, s\in {Z}_{\left[1,m\right]}$$
(17)

then the set \({\Omega }_{is}:=\left\{\left\{{x}_{i},{x}_{id}\right\}\left|max\left\{\left(\sum_{l=1}^{{r}_{i}}{{\mu }_{i}^{l}\left({z}_{iq}\right)x}_{i}^{T}{P}_{i\mu }{x}_{i}\right),\left(\sum_{l=1}^{{r}_{i}}{{\mu }_{i}^{l}\left({z}_{iq}\right)x}_{id}^{T}{P}_{i\mu }{x}_{id}\right)\right\}\le {\varsigma }_{i}\right.\right\}\) is a RPI set for the fuzzy system \(\left(5\right)\) corresponding to the feedback control law \({\Lambda }_{i}\left({x}_{i},{x}_{id}\right)=\sum_{l=1}^{{r}_{i}}{\mu }_{i}^{l}({z}_{iq}){k}_{i}^{l}{x}_{i}(k).\) Where \({\uptheta }_{i}=\left({A}_{i\mu }+{B}_{i\mu }{k}_{i\mu }\right), {Z}_{iww}\) is \(w-th\) diagonal element of matrix \({Z}_{i},\) \(N\) represents the number of subsystems, \({\varrho }_{i}\) and \({\varrho }_{id}\) are fixed positive values which satisfy \({\varrho }_{i}+{\varrho }_{id}=1\), \({X}_{i}={\varsigma }_{i}{P}_{i}, {X}_{i\mu }={\varsigma }_{i}{P}_{i\mu }, {X}_{j}={\varsigma }_{i}{P}_{j},\) \({\varpi }_{i}=\frac{{\varsigma }_{i}}{{{\gamma }_{i}}^{2}}, \alpha \ge 2\),and \(i,j, l, N,\alpha \in {R}_{+}.\)

Proof

See Appendix A.

3.2 The Coordination Sub-Network

In this section the hierarchical optimization will be carried on. For this, the Hamiltonian function is defined as:

$$H\left( \cdot \right)=\sum_{i=1}^{N}{H}_{i}\left( \cdot \right)=\sum_{i=1}^{N}\left\{{\Pi }_{i}\left(K\right)+\sum_{k=0}^{K-1}\left({x}_{i}^{T}\left(k\right)Q{x}_{i}\left(k\right)+{u}_{i}^{T}\left(k\right)R{u}_{i}\left(k\right)-{{\tau }_{i}d}_{i}^{T}\left(k\right){d}_{i}\left(k\right)+{\delta }_{i}^{T}\left(k\right)\left({z}_{i}\left(k\right)-\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{f}_{ij}{x}_{j}\left(k\right)\right)+{\overline{p} }_{i}^{T}\left(k+1\right)\left(-{x}_{i}\left(k+1\right)+{g}_{i}\left({x}_{i}\left(k\right) ,{u}_{i}\left(k\right), {z}_{i}\left(k\right)\right)\right)\right)\right\}$$
(18)

where \({g}_{i}\left({x}_{i}\left(k\right) ,{u}_{i}\left(k\right), {z}_{i}\left(k\right)\right)={A}_{i\mu }{x}_{i}+{A}_{id\mu }{x}_{id}+{B}_{i\mu }{u}_{i}+{w}_{i\mu }{d}_{i}+{C}_{i}{z}_{i}\left(k\right)\) and \({\Pi }_{i}\left(K\right)\) is continuously differentiable.

Here, \({\delta }_{i},\) \({\overline{p} }_{i},\) and \({C}_{i}\) are the Hamiltonian multipliers, the co-state variables, and fixed value matrix. By the interaction prediction strategy, the coordination for hierarchical optimization is to find an approach for updating the coordination values \({\delta }_{i}\) and \({z}_{i}\left(k\right)\):

$$\frac{\partial H\left( \cdot \right)}{\partial {z}_{i}\left(k\right)}=0, \frac{\partial H\left( \cdot \right)}{\partial {\delta }_{i}\left(k\right)}=0, i=1,\dots ,N ; k=0,\dots K-1$$
(19)

thereby,

$${\delta }_{i}\left(k\right)=-\frac{\partial {g}_{i}\left({x}_{i},{u}_{i},{z}_{i}\right)}{\partial {z}_{i}} {\overline{p} }_{i}\left(k+1\right)=-{C}_{i}^{T}{\overline{p} }_{i}\left(k+1\right), i=1,\dots ,N ; k=0,\dots K-1$$
(20)
$${z}_{i}\left(k\right)=\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{f}_{ij}{x}_{j}(k) ,i=1,\dots ,N ; k=0,\dots K-1$$
(21)

3.3 The Local Optimization Sub-Networks

In local sub-networks, exchanging the information between the coordination and local sub-networks will make up the hierarchical scheme as shown in Fig. 1. In this figure \({\delta }_{i}={ \sigma }_{i}\).

Fig. 1
figure 1

Architectures of interactions

in the final step of hierarchical optimization, the convergence must be checked by evaluating the overall interaction error:

$$e\left(k\right)=\sum_{i=1}^{N}\sum_{k=1}^{K-1}{\left\{{z}_{i}\left(k\right)-\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{f}_{ij}{x}_{j}\left(k\right)\right\}}^{T}\left\{{z}_{i}\left(k\right)-\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{f}_{ij}{x}_{j}\left(k\right)\right\}$$
(22)

now, provided the desire convergence obtained, stop. Otherwise, repeat the steps again.

3.4 The Terminal Constraint Set

The terminal constraint set \({\Omega }_{is}\) should satisfy two conditions. First, it should be RPI set. Second, a positive definite function (terminal cost function)\({V}_{i}\left(x\right)\) should exist such that \(\forall {x}_{i}\in {\Omega }_{is},\)

$${\gamma }_{3}(\parallel {x}_{i}\parallel )\le {V(x}_{i})\le {\gamma }_{4}(\parallel {x}_{i}\parallel )$$
(23)
$$\sum_{i=1}^{N}V\left({x}_{i}^{+}\right)-V\left({x}_{i}\right)<-\sum_{i=1}^{N}{H}_{i}\left(.\right)$$
(24)

where \({\varsigma }_{3}\) and \({\varsigma }_{4}\) are \({H}_{\infty }\) functions, \(V\left(x\right)\) is given as:

$$V\left(x\right)=\sum_{i=1}^{N}{V}_{i}\left(x\right)=\sum_{i=1}^{N}\sum_{l=1}^{{r}_{i}}{\mu }_{i}^{l}{({z}_{iq})x}_{i}^{T}{P}_{i}{x}_{i}$$
(25)

Remark 2

In this section, by proposing the Theorem 2, the concept of the robust positively invariant and constraint set are achieved through solving a LMI, and it is ensured that trajectories of the large-scale system with time-varying delay are stable robustly.

For convenience, a table of notations is provided:

Table 1

Table 1 Abbreviations and notation

Theorem 2

Consider the fuzzy Takagi–Sugeno system \(\left(5\right),\) if \(\left(16\right),(17)\) and the following matrix inequality are feasible,

(26)

then \({\Omega }_{is}\) is a terminal constraint set considering the terminal cost function. Where \(N\) represents the number of subsystems,\({\uptheta }_{i}=\left({A}_{i\mu }+{B}_{i\mu }{k}_{i\mu }\right)\),\({\overline{X} }_{i}^{T}\left(k\right)={\varsigma }_{i}{\overline{p} }_{i}^{T}\left(k\right)\), \({\mathrm{\rm H}}_{i}={\varsigma }_{i}R\), \({X}_{i}={\varsigma }_{i}{P}_{i}, {X}_{i\mu }={\varsigma }_{i}{P}_{i\mu }, {X}_{j}={\varsigma }_{i}{P}_{j}\), \(\alpha \ge 2,\) and \(i,j, l, N,\alpha \in {R}_{+}.\)

Proof

See Appendix B.

3.5 Control Algorithm

Based on the recent results and to finalize the control design, now, the online control algorithm is studied. Therefore, the terminal constraint set \(V\left(x\left(k\right)\right),\) see \(\left(24\right),\) should satisfy the following condition,

$$V\left(x\left(k\right)\right)=\sum_{i=1}^{N}{\overline{V} }_{i}\left(x\left(k\right)\right)=\sum_{i=1}^{N}\sum_{l=1}^{{r}_{i}}{\mu }_{i}^{l}{({z}_{iq})x}_{i}^{T}\left(k+d\right){P}_{i}{x}_{i}\left(k+d\right)\le {\varsigma }_{i} ; \, d\left(k\right)\in {\mathbb{Z}}_{\left[-h,-1\right]}.$$
(27)

the following optimization problem is for minimizing \({\varsigma }_{i} :\)

$${\text{min }} {\varsigma}_{i} \,\;{\text{subject to }} {\overline{V}_{i}}\left(x\left(k\right)\right)\le {\varsigma }_{i}$$

furthermore, a proper condition for \({\overline{V} }_{i}\left(x\left(k\right)\right)\le \, {\varsigma }_{i}\) is:

$${x}_{i}^{T}{\left(k+d\right)P}_{i}{x}_{i}\left(k+d\right)\le {\xi }_{i} ; \, i=1,\dots ,N$$
(28)

which is \({\varsigma }_{i}-{x}_{i}^{T}{\left(k+d\right)P}_{i}{x}_{i}\left(k+d\right)\ge 0\) and equal to \({\varsigma }_{i}-{x}_{i}^{T}\left(k+d\right)\frac{{{\varsigma }_{i}P}_{i}}{{\varsigma }_{i}}{x}_{i}\left(k+d\right)\ge 0\) by defining symmetrical matrix \({X}_{i}={\varsigma }_{i}{P}_{i}\), is guaranteed by the following LMIs,

$$\left[\begin{array}{cc}{\varsigma }_{i}& {x}_{i}^{T}\left(k+d\right)\\ {x}_{i}\left(k+d\right)& {X}_{i}^{-1}{\varsigma }_{i}\end{array}\right]\ge 0 ; \, d\left(k\right)\in {\mathbb{Z}}_{\left[-h,-1\right]}.$$
(29)
  • Note 1: The symmetric matrix \({X}_{i}\) is defined as a matrix with an achievable inversion. So, the inversion of \({X}_{i}\) must be considered in \((29)\).

  • Note 2: Since the first Example is a mathematical Example, the interconnection between two subsystems are considered as mathematical matrices. But in a real example, the interconnection between two subsystems must be modeled while the engineer is modeling the dynamic of the system. Interconnections in a large-scale system are same as system matrices like \({A}_{i}^{l} \, and \, {B}_{i}^{l}\). These are computed when the system’s dynamic is linearized.

Remark 3

As mentioned in introductory section, an important issue in MPC is optimization problem. The considered system in this paper is large-scale and values of the LMIs are computed and controlled the closed-loop system based on the minimized value of \({\varsigma }_{i}.\) In the previous theorems, the bilinear matrix inequalities have been proposed. Since the significant parts of this paper are gains and minimized \({\varsigma }_{i},\) the values of \({X}_{i}, {X}_{i\mu }, {\lambda }_{i}{, \varpi }_{i},\) and \({H}_{i}\) are defined until the feasible solutions for gains and minimized value of \({\varsigma }_{i}.\)

figure e

One of the significant part of the model predictive control is recursive feasibility. In the following theorem, it will be proven that the recursive feasibility can be achieved and the constrained optimization problem has the solution at any time.

Theorem 3

For the system \((5)\), it will always be solvable if the solution of the optimization problem can be achievable at time \(0\), and the recursive feasibility can be obtained.

Proof

If \((30)\) is feasible at time \(k\), then resorting to the \({x}_{i},{x}_{id}\in {\Omega }_{is},\) and according to the Theorem 1, which has been implied that \({x}_{i}\left(k+1\right)\in {\Omega }_{is},\) then it can be concluded that \((30)\) can be solvable at time \(k+1\). Besides, the solution which achieved at the time \(k\), is feasible at time \(k+1\), and also the optimization problem is feasible at all time.

Theorem 4

should the constrained optimization problem is feasible at the initial time \(0,\) system \((5)\) is ISS due to the disturbance \(d\).

Proof

Assume the Lyapunov-Razumikhin function , consider that it gets its maximum values at the delayed states \({x}_{i}\left(k+\Delta \right)\), \(\Delta \in \left\{d\left(k\right),0\right\}\), and \(d\left(k\right)\) defined in (2). in which , here, is an optimal value of \({P}_{i}\left(k\right)\) at time \(k,\) then it can be obtained:

(31)

in which . where \({\psi }_{max}\left( \cdot \right)\) and \({\psi }_{min}\left( \cdot \right)\) are, respectively, the maximal and minimal eigenvalues.

In addition, \(\left(24\right)\) implies that,

$${V}_{k}\left({x}_{i}\left(k+1\right)\right)-\overline{V }\left({x}_{i}\left(k\right)\right)<-\left({x}_{i}^{T}\left(k\right)Q{x}_{i}\left(k\right)+{u}_{i}^{T}\left(k\right)R{u}_{i}\left(k\right)-{{\tau }_{i}d}_{i}^{T}\left(k\right){d}_{i}\left(k\right)+{\delta }_{i}^{T}\left(k\right)\left({z}_{i}\left(k\right)-\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{f}_{ij}{x}_{j}\left(k\right)\right)+{\overline{p} }_{i}^{T}\left(k+1\right)\left(-{x}_{i}\left(k+1\right)+{g}_{i}\left({x}_{i}\left(k\right) ,{u}_{i}\left(k\right), {z}_{i}\left(k\right)\right)\right)\right)$$
(32)

where , and if

$${V}_{k}\left({x}_{i}\left(k+1\right)\right)-V\left({k,x}_{i}\right)<{-x}_{i}^{T}\left(k\right)Q{x}_{i}\left(k\right)+{{\tau }_{i}d}_{i}^{T}\left(k\right){d}_{i}\left(k\right)$$
(33)

due to the \(\left(30\right)\) at time \(\left(k+1\right)\), it will be:

$${V}_{k+1}\left({x}_{i}\left(k+1\right)\right)\le {V}_{k}\left({x}_{i}\left(k+1\right)\right)$$
(34)

finally, it achieves that

$${V}_{k+1}\left({x}_{i}\left(k+1\right)\right)-V\left({k,x}_{i}\right)<{-x}_{i}^{T}\left(k\right)Q{x}_{i}\left(k\right)+{{\tau }_{i}d}_{i}^{T}\left(k\right){d}_{i}\left(k\right)$$
(35)

corresponding definition \((3)\), Lemma 1, \((31)\), and \((35)\), the \(V\left({x}_{i}\left(k\right)\right)\) is an ISS Lyapunov function. And the closed-loop system is ISS due to disturbances. So, the proof is completed.

4 Numerical Example

In this section, to prove the effectiveness of the proposed algorithm, two examples are illustrated, the mentioned algorithm is applied, and results with explanations are depicted. Consider a large-scale system \(S\) composed of three fuzzy subsystems \({S}_{i}, i=\mathrm{1,2},3 \left(N=3\right),\) as follows, in which each state of each subsystem has two dimensions. Let \({\varpi }_{i}=0.5,\) \(Q=diag\left\{5, 5\right\},{\mathrm{\rm H}}_{i}=5.\) In this example, the delay time is considered as \(1.\) The membership function, \({\mu }_{i}^{1}={cos}^{2}\left({x}_{i2}\left(k\right)\right)\), \({\mu }_{i}^{2}=1-{\mu }_{i}^{1}.\) The T-S fuzzy large-scale model with time-varying delay is:

$$\left\{\begin{array}{l} \mathbf{IF} \, {z}_{i1} \, is \, {F}_{i1}^{l} \, and \, \dots \, and \, {z}_{ig} \, is \, {F}_{ig}^{l} \\ \mathbf{THEN} \, {x}_{i}\left(k+1\right)={A}_{i}^{l}{x}_{i}\left(k\right)+{B}_{i}^{l}{u}_{i}\left(k\right)+{A}_{id}^{l}{x}_{id}\left(k\right)+{w}_{i}^{l}{d}_{i}\left(k\right)+\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{f}_{ij}{x}_{j}(k)\end{array}\right.$$

Subsystem \({{\varvec{S}}}_{1}\)

$$\begin{array}{c}{A}_{11}=\left[\begin{array}{c}\begin{array}{cc}0.55& 0.05\end{array}\\ \begin{array}{cc} 0 & 0.42\end{array}\end{array}\right], {A}_{11d}=0.5{A}_{11}, {B}_{11}=\left[\begin{array}{c}1\\ 0\end{array}\right], {w}_{11}=\left[\begin{array}{c}0.1\\ 0\end{array}\right], {f}_{12}=\left[\begin{array}{c}\begin{array}{cc}0.08& 0.05\end{array}\\ \begin{array}{cc}0.05& 0.05\end{array}\end{array}\right], \\ {f}_{13}=\left[\begin{array}{c}\begin{array}{cc}0.09& 0.06\end{array}\\ \begin{array}{cc}0.06& 0.09\end{array}\end{array}\right], {\lambda }_{1}=0.5, {X}_{1}=\left[\begin{array}{c}\begin{array}{cc}0.015& 0 \end{array}\\ \begin{array}{cc} 0& 0.015\end{array}\end{array}\right]\end{array}$$
$${A}_{12}=\left[\begin{array}{c}\begin{array}{cc}0.4& 0\end{array}\\ \begin{array}{cc} 0& 0.08\end{array}\end{array}\right], {A}_{12d}=0.5{A}_{12}, {B}_{12}=\left[\begin{array}{c}0\\ 1\end{array}\right], {w}_{12}=\left[\begin{array}{c}0\\ 0.1\end{array}\right].$$

Subsystem \({{\varvec{S}}}_{2}\)

$${A}_{21}=\left[\begin{array}{c}\begin{array}{cc}0.325& 0 \end{array}\\ \begin{array}{cc}0.4 & 0\end{array}\end{array}\right], {A}_{21d}=0.5{A}_{21}, {B}_{21}=\left[\begin{array}{c}1\\ -1\end{array}\right], {w}_{21}=\left[\begin{array}{c}-0.1\\ 0\end{array}\right], {f}_{21}=\left[\begin{array}{c}\begin{array}{cc}0.1& 0.1\end{array}\\ \begin{array}{cc}0 & 0\end{array}\end{array}\right]$$
$${f}_{23}=\left[\begin{array}{c}\begin{array}{cc}0 & 0\end{array}\\ \begin{array}{cc}0.1& 0.1\end{array}\end{array}\right], {\lambda }_{2}=0.488, {X}_{2}=\left[\begin{array}{c}\begin{array}{cc}0.018& 0 \end{array}\\ \begin{array}{cc} 0& 0.018\end{array}\end{array}\right]$$
$${A}_{22}=\left[\begin{array}{c}\begin{array}{cc}0.6& 0.2\end{array}\\ \begin{array}{cc}0.1& 0\end{array}\end{array}\right], {A}_{22d}=0.5{A}_{22},{B}_{22}=\left[\begin{array}{c}-1\\ 1\end{array}\right], {w}_{22}=\left[\begin{array}{c}0\\ -0.2\end{array}\right].$$

Subsystem \({{\varvec{S}}}_{3}\)

$${A}_{31}=\left[\begin{array}{c}\begin{array}{cc}0.2& 0.4\end{array}\\ \begin{array}{cc}0.2 & 0\end{array}\end{array}\right], {A}_{31d}=0.5{A}_{31}, {B}_{31}=\left[\begin{array}{c}1\\ 1\end{array}\right], {w}_{31}=\left[\begin{array}{c}-0.3\\ 0\end{array}\right], {f}_{31}=\left[\begin{array}{c}\begin{array}{cc}0.03& 0\end{array}\\ \begin{array}{cc} 0& 0.02\end{array}\end{array}\right],$$
$${f}_{32}=\left[\begin{array}{c}\begin{array}{cc}0.1& 0\end{array}\\ \begin{array}{cc}0.1& 0\end{array}\end{array}\right], {\lambda }_{3}=0.487, {X}_{3}=\left[\begin{array}{c}\begin{array}{cc}0.027& 0 \end{array}\\ \begin{array}{cc} 0& 0.027\end{array}\end{array}\right]$$
$${A}_{32}=\left[\begin{array}{c}\begin{array}{cc}0.3& 0\end{array}\\ \begin{array}{cc}0& 0.4\end{array}\end{array}\right], {A}_{32d}=0.5{A}_{32}, {B}_{32}=\left[\begin{array}{c}-2\\ 1\end{array}\right], {w}_{32}=\left[\begin{array}{c}0\\ -0.4\end{array}\right]$$

where \({x}_{1}\left(k\right)={\left[\begin{array}{cc}{x}_{11}\left(k\right)& {x}_{12}\left(k\right)\end{array}\right]}^{T},\) \({x}_{2}\left(k\right)={\left[\begin{array}{cc}{x}_{21}\left(k\right)& {x}_{22}\end{array}\left(k\right)\right]}^{T},\) \({x}_{3}\left(k\right)={\left[\begin{array}{cc}{x}_{31}\left(k\right)& {x}_{32}\left(k\right)\end{array}\right]}^{T}\), and we have:

$${S}_{i}={x}_{i}\left(k+1\right)=\sum_{l=1}^{{r}_{i}}\sum_{m=1}^{{r}_{i}}{\mu }_{i}^{l}\left({z}_{iq}\right){\mu }_{i}^{m}\left({z}_{iq}\right)\left[{A}_{i\mu }^{l}+{B}_{i\mu }^{l}{k}_{i}^{m}\right]{x}_{i}\left(k\right)+{\mu }_{i}^{l}\left({z}_{iq}\right){A}_{id\mu }^{l}{x}_{id}\left(k\right)+{w}_{i\mu }^{l}{d}_{i}\left(k\right)+\sum_{l=1}^{{r}_{i}}\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{\mu }_{i}^{l}\left({z}_{iq}\right){f}_{ij}{x}_{j}\left(k\right).$$

Remark 4

As it is evident in simulation results, Fig. 2, the trajectories of the subsystem 1, at first, the states of the two rules went to instability due to the existence of the time-varying delay, which is 1 s. But as they go on, the proposed controller has learned to compute efficient gains, which converged the states to 0. Same as Fig. 2, in Figs. 3, and 4, it is clear that the states of the subsystems were attempting to have overshoot and undershoot. But controllers have successfully stabilized states as time keeps going. The settling time in the first subsystem is before 10, but in subsystems 2 and 3 is before 5, which means the subsystem 1 is more complex than two others.

Fig. 2
figure 2

Trajectories of subsystem 1

Fig. 3
figure 3

Trajectories of subsystem 2

Fig. 4
figure 4

Trajectories of subsystem 3

Remark 5

According to Figs. 5, 6, and 7, it will be noticeable that the trajectories of the input vector \({u}_{i}\in {R}^{n}\) are converged to 0. Therefore, there is no need to input vector and costs will be reduced. And also by referring to Fig. 8, after \(2\) or \(3\) iterations, the interaction error leads to \(0,\) means that, local subsystems are fully optimized. Finally, Figs. 9, 10, and 11, show 3 cost functions corresponding to 3 subsystems and it is obvious that despite of the fact that cost functions involve negative terms, they are definitely positive during the time. It would be necessary to say that the mentioned numerical example is illustrated to show the effectiveness and usefulness of the proposed method, and it will be definitely a practical method in real conditions.

Fig. 5
figure 5

Trajectories of controller 1

Fig. 6
figure 6

Trajectories of controller 2

Fig. 7
figure 7

Trajectories of controller 3

Fig. 8
figure 8

Interaction error

Fig. 9
figure 9

Trajectory of cost function 1

Fig. 10
figure 10

Trajectory of cost function 2

Fig. 11
figure 11

Trajectory of cost function 3

Remark 6

In comparison, in [23] and [28], the model predictive control is applied to a usual system with persistent disturbances and uncertainties. But a noticeable issue is that the considered system is a usual one and the proposed algorithm is not efficient for large-scale systems. On the other hand, the mentioned paper has not considered the time-varying delay that is an inevitable part in real systems. In [21], an algorithm is proposed for typical systems with time-varying delay. The model predictive control is assumed for the mentioned research but as mentioned above, it is not considered for large-scale systems. In this research, model predictive control is applied to a fuzzy large-scale system with time-varying delay and persistence disturbances, which the proposed algorithm covers the weaknesses of the mentioned researches.

Example 2:

A double inverted pendulum is considered due to the [38] to show the effectiveness of the method. All configurations and parameters are chosen same as [38], but a 0.5 s time-varying delay is considered for the system. All other configurations and considered assumptions are same as the previous example.

Subsystem \({{\varvec{S}}}_{1}\)

$${A}_{11}={A}_{13}=\left[\begin{array}{cc}1& 0.005\\ 0.0262& 1\end{array}\right], {A}_{13}=\left[\begin{array}{cc}1& 0.005\\ 0.0441& 1\end{array}\right], {A}_{11d}=0.5{A}_{11}, {A}_{12d}=0.5{A}_{12},{A}_{13d}=0.5{A}_{13}, {B}_{11}={B}_{12}={B}_{13}=\left[\begin{array}{c}1\\ 0\end{array}\right], {w}_{11}={w}_{12}={w}_{13}=\left[\begin{array}{c}0.1\\ 0\end{array}\right], {g}_{12}=\left[\begin{array}{c}\begin{array}{cc}0.08& 0.05\end{array}\\ \begin{array}{cc}0.05& 0.05\end{array}\end{array}\right], {\lambda }_{1}=0.5, {X}_{1}=\left[\begin{array}{c}\begin{array}{cc}0.015& 0 \end{array}\\ \begin{array}{cc} 0& 0.015\end{array}\end{array}\right], {H}_{1}=\left[1 0\right].$$

Subsystem \({{\varvec{S}}}_{2}\)

$${A}_{21}={A}_{23}=\left[\begin{array}{cc}1& 0.005\\ 0.0272& 1\end{array}\right], {A}_{23}=\left[\begin{array}{cc}1& 0.005\\ 0.0451& 1\end{array}\right], {A}_{21d}=0.5{A}_{21}, {A}_{22d}=0.5{A}_{22}, {A}_{23d}=0.5{A}_{23}, {B}_{21}={B}_{22}={B}_{23}=\left[\begin{array}{c}1\\ 1\end{array}\right], {w}_{11}={w}_{22}={w}_{23}=\left[\begin{array}{c}0.1\\ 0\end{array}\right], {g}_{21}=\left[\begin{array}{c}\begin{array}{cc}0.08& 0.05\end{array}\\ \begin{array}{cc}0.05& 0.05\end{array}\end{array}\right], {\lambda }_{2}=0.448, {X}_{2}=\left[\begin{array}{c}\begin{array}{cc}0.018& 0 \end{array}\\ \begin{array}{cc} 0& 0.018\end{array}\end{array}\right], {H}_{2}=\left[1 0\right].$$
$${x}_{i}\left(k+1\right)=\sum_{l=1}^{{r}_{i}}\sum_{m=1}^{{r}_{i}}{\mu }_{i}^{l}\left({z}_{iq}\right){\mu }_{i}^{m}\left({z}_{iq}\right)\left[{A}_{i\mu }^{l}+{B}_{i\mu }^{l}{k}_{i}^{m}\right]{x}_{i}\left(k\right)+{\mu }_{i}^{l}\left({z}_{iq}\right){A}_{id\mu }^{l}{x}_{id}\left(k\right)+{w}_{i\mu }^{l}{d}_{i}\left(k\right)+\sum_{l=1}^{{r}_{i}}\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{\mu }_{i}^{l}\left({z}_{iq}\right){f}_{ij}{x}_{j}\left(k\right).$$
$${y}_{i}(k)={H}_{i}{x}_{i}\left(k\right)$$

Subsequently, we can obtain the feedback gains:

k11 = [−4.54 −6.06] k12 = [−6.009 −8.79] k13 = [−15.15 −19.585].

k21 = [−5.14 −.01] k22 = [−1.049 −4.14] k23 = [−28.255 −12.252].

Remark 7

Figure 12 shows output responses of the closed-loop discrete-time nonlinear large-scale system with time-varying delay. It is evident that the proposed controller in this paper based on the fuzzy dynamic model not only stabilizes the original nonlinear large-scale system but also effectively attenuates the disturbances as expected. And as it is clear, for the first few seconds there is no response due to the delay that is in the nature of the system.

Fig. 12
figure 12

Output responses of the closed-loop system

Remark 8

To prove the effectiveness of the proposed method, a comparison is made due to [39]. In [39], the PI controller is considered for a large-scale system. At first, a control scheme using PI controllers, which are currently used in industry, is used. although this scheme is very simple to implement, its performance has often limitation and tuning the P and I gains are a tedious process. Besides, the mentioned controlled is applied to a system without time-varying delay. Todays, many systems in industry and academic face many delays in their natures and the proposed method in [39] is not able to stabilize the system.

Remark 9

By referring to gains, a notable point in this example is results of gains. To stabilize the system with time-varying delay, the computed gains have little values and this means the double inverted pendulum can be stabilized with lower cost and this is the efficient of the proposed approach.

5 Conclusion

Here, fuzzy model predictive control and hierarchical optimization for a class of discrete large-scale systems with time-varying delay and disturbance is investigated. The considered method in this paper is Razumikhin for time-varying delay systems, in which it includes a Lyapunov function associated with the original non-augmented state space of system dynamics. The Razumikhin method has the perfect potential to avoid the inherent complexity of the Krasovskii method especially in the presence of large delays and disturbances. Model predictive control is applied to the fuzzy Takagi–Sugeno large-scale system, and the system is composed of several subsystems to consider the decentralized scheme for controller. Many similar methods have been done before in this area and Razumikhin approach was applied to. But the novelty of this paper is that this method is applied to a large-scale system and simultaneously the optimization problem is considered. On the other hand, by robust performance and LMI algorithm, the stability of the results is guaranteed. In this paper, the uncertainties and disturbances that are always seen in systems are considered. As the assumed system in this paper is large-scale and there are some subsystems, the hierarchical approach is applied to the optimization problem which divides the system into two upper and lower orders. At the upper order, coordination, and at the lower order, some units are established. The exchange information between two orders, finally, leads to the most efficient in the optimization problem. This algorithm is assumed for systems with known time-varying delay systems and in future works it can be applied to nonlinear dynamics with unknown time-varying delay. At least two examples are submitted to investigate the effectiveness of the proposed method.

6 Appendix A

Proof

Applying Schur complement, the inequality \(\left(16\right)\) can be written as:

(36)

where \({\varphi }_{i}=N{\Theta }_{i}^{T}{X}_{i}{\Theta }_{i}+N\sqrt{a}\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{f}_{ij}^{T}{X}_{j}{f}_{ij}-{\varrho }_{i}\left({-\lambda }_{i}+1\right){X}_{i\mu }\). By considering \({X}_{i}={\varsigma }_{i}{P}_{i}, {X}_{i\mu }={\varsigma }_{i}{P}_{i\mu }, {X}_{j}={\varsigma }_{i}{P}_{j},\) \({\varpi }_{i}=\frac{{\varsigma }_{i}}{{{\gamma }_{i}}^{2}}\), now, the inequality (36) is:

(37)

\({\varphi }_{i}=\frac{1}{{\varsigma }_{i}}N{\Theta }_{i}^{T}{P}_{i}{\Theta }_{i}+\frac{1}{{\varsigma }_{i}}N\sqrt{a}\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{f}_{ij}^{T}{P}_{j}{f}_{ij}-\frac{1}{{\varsigma }_{i}}{\varrho }_{i}\left({-\lambda }_{i}+1\right){P}_{i\mu },\) now, if the inequality (37) multiplied from both sides by \(\left[\begin{array}{cccccc}{d}_{i}^{T}& {x}_{i}^{T}& {x}_{id}^{T}& {x}_{j}^{T}& \cdots & {x}_{N}^{T}\end{array}\right]\) and its transpose, respectively,

(38)

according to [40], the inequality (38) is equivalent to:

(39)

The inequality (39) can be written:

(40)

the inequality (40) is equivalent to:

$$\frac{1}{{\varsigma }_{i}}{\left(\left({A}_{i\mu }+{B}_{i\mu }{k}_{i\mu }\right){x}_{i}+{A}_{id\mu }{x}_{id}+{w}_{i\mu }{d}_{i}+\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{f}_{ij}{x}_{j}\right)}^{T}{P}_{i\mu }^{+}\left(\left({A}_{i\mu }+{B}_{i\mu }{k}_{i\mu }\right){x}_{i}+{A}_{id\mu }{x}_{id}+{w}_{i\mu }{d}_{i}+\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{f}_{ij}{x}_{j}\right)-\frac{{\lambda }_{i}}{{{\gamma }_{i}}^{2}}{d}_{i}^{T}{d}_{i}\le \frac{1-{\lambda }_{i}}{{\varsigma }_{i}}\left({\varrho }_{i}{x}_{i}^{T}{P}_{i\mu }{x}_{i}+{\varrho }_{id}{x}_{id}^{T}{P}_{i\mu }{x}_{id}\right)$$
(41)

the equation \({x}_{i}^{+}={A}_{i\mu }{x}_{i}+{A}_{id\mu }{x}_{id}+{B}_{i\mu }{u}_{i}+{w}_{i\mu }{d}_{i}+\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{f}_{ij}{x}_{j}\) is results:

$$\sum_{i=1}^{N}\left\{\frac{1}{{\varsigma }_{i}}{x}_{i}^{{+}^{T}}{P}_{i\mu }^{+}{x}_{i}^{+}-\frac{{\lambda }_{i}}{{{\gamma }_{i}}^{2}}{d}_{i}^{T}{d}_{i}\le \frac{1-{\lambda }_{i}}{{\varsigma }_{i}}\left({\varrho }_{i}{x}_{i}^{T}{P}_{i\mu }{x}_{i}+{\varrho }_{id}{x}_{id}^{T}{P}_{i\mu }{x}_{id}\right)\right\}$$
(42)

it is assumed that \({\varrho }_{i}+{\varrho }_{id}=1,\) so it will be easily confirmed that \({\varrho }_{i}{x}_{i}^{T}{P}_{i\mu }{x}_{i}+{\varrho }_{id}{x}_{id}^{T}{P}_{i\mu }{x}_{id}\le max\left\{{x}_{i}^{T}{P}_{i\mu }{x}_{i}, \, {x}_{id}^{T}{P}_{i\mu }{x}_{id}\right\}.\) Substituting into (42), thus, \((15)\) is obtained. Furthermore, the input constraint can be acknowledged by \(\left(17\right),\) and the proof is shown here:

By multiplying \(diag\left\{I, {x}_{i}\right\}\) and its transpose from both sides of \(\left(17\right):\)

$$\left[\begin{array}{cc}{{x}_{i}^{T}Z}_{i}{x}_{i}& {{x}_{i}^{T}k}_{i\mu }^{T}\\ {k}_{i\mu }{x}_{i}& 1\end{array}\right]\ge 0$$
(43)

applying Schur complement to (43), then,

$${{x}_{i}^{T}Z}_{i}{x}_{i}-{\left({k}_{i\mu }{x}_{i}\right)}^{T}\left({k}_{i\mu }{x}_{i}\right)\ge 0$$
(44)

as \({u}_{i}={k}_{i\mu }{x}_{i},\) the following is got:

$${\left({u}_{i}\right)}^{T}\left({u}_{i}\right)\le { {x}_{i}^{T}Z}_{i}{x}_{i}={H}_{i}=positive value$$
(45)

thus \({u}_{i}^{T}{u}_{i}\le {H}_{i}\). The proof is, thereby, completed.

7 Appendix B

Proof

Applying schur compliment to the inequality \((36)\):

(46)

where \({\upchi }_{i}=N{{\uptheta }_{i}}^{T}{X}_{i}{\uptheta }_{i}+N\sqrt{a}\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{f}_{ij}^{T}{X}_{j}{f}_{ij}-{\varrho }_{i}{X}_{i\mu }+{\varsigma }_{i}Q+{k}_{i\mu }^{T}{H}_{i}{k}_{i\mu }\). The inequality (46) is equivalent to:

(47)

where \({\Xi }_{i}=N{{\uptheta }_{i}}^{T}{P}_{i}{\uptheta }_{i}+N\sqrt{a}\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{f}_{ij}^{T}{P}_{j}{f}_{ij}-{\varrho }_{i}{P}_{i\mu }+Q+{k}_{i\mu }^{T}R{k}_{i\mu }\).by multiplying \(\left[\begin{array}{cccc}{d}_{i}^{T}& {x}_{i}^{T}& {x}_{id}^{T}& \begin{array}{ccc}{x}_{j}^{T}& \cdots & \begin{array}{cc}{x}_{N}^{T}& {I}_{i}^{T}\end{array}\end{array}\end{array}\right]\) and its transpose from both sides of the matrix in the inequality (47), respectively, the following inequality is obtained:

(48)

according to [40], the inequality (48) is equivalent to:

(49)

the inequality (49) is equivalent to:

$$\sum_{i=1}^{N}\left\{{\left[\left({A}_{i\mu }+{B}_{i\mu }{k}_{i\mu }\right){x}_{i}+{A}_{id\mu }{x}_{id}+{w}_{i\mu }{d}_{i}+\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{f}_{ij}{x}_{j}\right]}^{T}{P}_{i\mu }^{+}\left[\left({A}_{i\mu }+{B}_{i\mu }{k}_{i\mu }\right){x}_{i}+{A}_{id\mu }{x}_{id}+{w}_{i\mu }{d}_{i}+\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{f}_{ij}{x}_{j}\right]-\left({\varrho }_{i}{x}_{i}^{T}{P}_{i\mu }{x}_{i}+{\varrho }_{id}{x}_{id}^{T}{P}_{i\mu }{x}_{id}\right)+{x}_{i}^{T}Q{x}_{i}+{u}_{i}^{T}{k}_{i\mu }^{T}R{k}_{i\mu }{u}_{i}-{\tau }_{i}{d}_{i}^{T}{d}_{i}+{I}_{i}^{T}\left({\delta }_{i}^{T}\left(k\right){z}_{i}\left(k\right){+\overline{p} }_{i}^{T}\left(k+1\right){z}_{i}\left(k\right)\right){I}_{i}-\frac{1}{2}\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{I}_{i}^{T}\left({\overline{p} }_{i}^{T}\left(k+1\right)+{\delta }_{j}^{T}\left(k\right)\right){f}_{ij}{x}_{j}-\frac{1}{2}\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{x}_{j}^{T}{f}_{ij}^{T}{\left({\overline{p} }_{i}^{T}\left(k+1\right)+{\delta }_{j}^{T}\left(k\right)\right)}^{T}{I}_{i}-\left(a-1\right)\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{x}_{j}^{T}{f}_{ij}^{T}{P}_{i}{f}_{ij}{x}_{j}+\left(a-1\right)\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{x}_{j}^{T}{f}_{ij}^{T}{P}_{i}{f}_{ij}{x}_{j}+{x}_{i}^{T}\sqrt{\alpha }{\left({A}_{i\mu }+{B}_{i\mu }{k}_{i\mu }\right)}^{T}{P}_{i}\left(\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{f}_{ij}{x}_{j}\right)-{x}_{i}^{T}\sqrt{\alpha }{\left({A}_{i\mu }+{B}_{i\mu }{k}_{i\mu }\right)}^{T}{P}_{i}\left(\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{f}_{ij}{x}_{j}\right)+\sqrt{\alpha }\left(\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{x}_{j}^{T}{f}_{ij}^{T}\right){P}_{i}\left({A}_{i\mu }+{B}_{i\mu }{k}_{i\mu }\right){x}_{i}-\sqrt{\alpha }\left(\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{x}_{j}^{T}{f}_{ij}^{T}\right){P}_{i}\left({A}_{i\mu }+{B}_{i\mu }{k}_{i\mu }\right){x}_{i}\right\} <0$$
(50)

the inequality (50) can be written as:

$$\sum_{i=1}^{N}\left\{{\left[\left({A}_{i\mu }+{B}_{i\mu }{k}_{i\mu }\right){x}_{i}+{A}_{id\mu }{x}_{id}+{w}_{i\mu }{d}_{i}+\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{f}_{ij}{x}_{j}\right]}^{T}{P}_{i\mu }^{+}\left[\left({A}_{i\mu }+{B}_{i\mu }{k}_{i\mu }\right){x}_{i}+{A}_{id\mu }{x}_{id}+{w}_{i\mu }{d}_{i}+\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{f}_{ij}{x}_{j}\right]-\left({\varrho }_{i}{x}_{i}^{T}{P}_{i\mu }{x}_{i}+{\varrho }_{id}{x}_{id}^{T}{P}_{i\mu }{x}_{id}\right)+{x}_{i}^{T}Q{x}_{i}+{u}_{i}^{T}R{u}_{i}-{\tau }_{i}{d}_{i}^{T}{d}_{i}+{\delta }_{i}^{T}\left(k\right){z}_{i}\left(k\right)-\sum_{\begin{array}{c}j=1\\ i\ne j\end{array}}^{N}{\delta }_{j}^{T}\left(k\right){f}_{ij}{x}_{j}+{\overline{p} }_{i}^{T}\left(k+1\right)\left(-{x}_{i}\left(k+1\right)+{g}_{i}\left({x}_{i}\left(k\right) ,{u}_{i}\left(k\right), {z}_{i}\left(k\right)\right)\right)\right\}<0$$
(51)

if it is assumed that \(\varrho_{i} + \varrho_{id} = 1,\) it will be obvious \(\varrho_{i} x_{i}^{T} P_{i\mu } x_{i} + \varrho_{id} x_{id}^{T} P_{i\mu } x_{id} \le V\left( {x_{i} } \right)\), therefore:where \(\mathop \sum \limits_{i = 1}^{N} V\left( {x_{i}^{ + } } \right) - V\left( {x_{i} } \right) < - \mathop \sum \limits_{i = 1}^{N} H_{i} \left( \cdot \right).\) Thereby, the Proof is completed.