1 Introduction

Fault tolerant control systems with actuator failure compensation have received many interests from the researchers of industrial control field over decades [1,2,3,4,5,6,7]. Serious studies in computer science have been dedicated to address important theoretical and practical questions, raised in adaptive nonlinear control systems, where dynamic surface control (DSC) method served as a novel useful tool for designing adaptive control systems, especially for nonlinear strict-feedback [8], [9], and fractional-order [10] systems.

An important research question, which was not addressed in those studies [11,12,13,14,15,16,17,18,19,20,21], is effect of stochastic behaviors and Prandtl–Ishlinskii (PI) hysteresis on the system performance. PI or backlash-like hysteresis and dead-zone phenomena are considered as the two important general nonlinearities, seen in the literature. However, a general adaptive control method with the capability of incorporating both stochastic and nonlinear behaviors of the control system, including the joint Prandtl–Ishlinskii hysteresis and dead-zone phenomena, cannot be seen in those studies in an objective way. One of the problems in developing such a generalized method corresponds to stability of the methods at the presence of an unknown nonlinearity.

Dynamic surface method has been employed by several neural network-based methods for nonlinear control systems [9, 12, 21,22,23,24]. However, this is not true for stochastic nonlinear systems, when general nonlinearities such as PI hysteresis and dead zone appear in the actuators. To the best of our knowledge, the presented methods are mainly based on the backstepping method, which makes this method an appropriate baseline study [25]. To a lesser extent, a nonlinear stochastic system was studied, under the condition of actuator dead-zone, which considers either the time-delay [17, 18], or pure-feedback control design method [20]. It is important to note that in most of practical cases, the control systems, i.e., autonomous vehicle systems, nonlinear stochastic conditions are involved [26, 27]. In addition to these conditions, nonlinear behaviors such as dead-zone and hysteresis are typically seen in the actuators [11,12,13,14,15,16,17,18,19, 21]. Ignoring such the conditions can lead to serious flaws like internal instability and physical damages. However, recently adaptive dynamic surface control for uncertain nonstrict-feedback systems is investigated in [28, 29].

In this paper, neural network in conjunction with dynamic surface control design is employed to introduce a novel method of adaptive control design for nonlinear stochastic systems with a general class of different actuator nonlinearities, including PI hysteresis and dead-zone. These nonlinearities might be a result of actuator aging, a faulty condition of the actuator, or its intrinsical characteristic. The unknown dynamics of the system are innovatively approximated using a Radial Basis Function (RBF) neural network, where the universal approximation capability of the method makes it possible to approximate a wide range of nonlinear Lipschitz functions. Furthermore, the minimal-learning-parameters algorithm is elaboratively employed to reduce the number of adaptive parameters in an online updating way, which effectively reduces the calculational complexities. In order to show effectiveness of the RBF in both the parameter approximation, and in the nonlinearity compensation of the actuators, a sophistication of the method is also proposed as a baseline method for comparison. In this baseline, compensation of the actuator nonlinearity is performed using an adaptive eliminating term.

The stability analysis of the proposed method along with the baseline are theoretically proved and confirmed by simulation. Performance of the direct method of backstepping is also investigated as another baseline for comparison. It is shown that the proposed controller guarantees the boundedness of all the closed-loop signals, where the tracking error remains in an arbitrary small vicinity of the origin, in terms of the mean quartic value. It is shown that the proposed method exhibits superior performance both in the failure-free condition and in different cases of the actuator nonlinearity, compared to the baselines.

The presented method offers extensive applications in a broad range of the engineering and industrial fields such as flight control [30], autonomous vehicle control systems [31, 32], turbo-machine design [33,34,35], piezo-actuators [36] and micro-electro-mechanical-systems (MEMSs) [37], and also in various military applications [38].

The main contributions of the paper are: (1) presenting a novel neural network-based method for designing adaptive controller for nonlinear stochastic system with broad range of the actuator nonlinearity, (2) presenting a sophistication of the method as a baseline for the study, in which nonlinearity of the actuator is directly compensated without using the neural network, (3) analytically proving stability of the mentioned methods in failure free condition and also at the presence of the actuator nonlinearities, i.e., PI hysteresis, and dead-zone, (4) exploring performance of the direct backstepping method, detailed in [12], for a broad range of the actuator nonlinearity, as the second baseline study, (5) comparing the proposed method along with the two baselines using different cases of actuator nonlinearities, and studying privileges and limitations of each of the methods.

The paper is organized as follows. Section 1, presents a literature review on the previously published studies. In Section 3, preliminaries and problem statements are described. In Section 4, the methods along with the theorems are presented, which contains the main contributions of the paper. Simulation examples are presented in Section 5. In Section 6 and 7, discussion and conclusion of the paper are presented, respectively. In addition to the main sections of the paper, there are also five appendices, in which details of the theorem proofs are included accordingly.

2 Related studies

Actuator failure can occur in many practical systems, named plants, that may lead to the plant instability and even sometime catastrophic events [1,2,3,4,6,−7, 27, 39,40,41,42,43,44]. Systematic design methods for different nonlinear control systems have been studied in the form of the strict-feedback, pure feedback, and block-strict-feedback [45], where various direct methods have been investigated for the purpose of actuator failure compensation [39,40,41,42,43,44]. Backstepping design method was proposed as a systematic adaptive controller design, which is still considered as one of the mostly used methods for nonlinear systems. Backstepping-based methods for compensation of the actuator failures such as sliding-mode control [42], and adaptive failure compensation [5, 39, 41, 43, 44, 46,47,48,49] have been proposed for several practical and theoretical systems. Among these methods, the problem of accommodating infinite number of actuator failures/faults in control systems has been investigated in [5]. Backstepping method was theoretically studied to be employed for adaptive control design for the parameter-strict-feedback systems [43], and its capabilities in compensating actuator nonlinearities for a flight control system were investigated [11]. Radial Bases Function (RBF) neural network has been integrated with backstepping method to overcome the problem of uncertain nonlinear systems in pure-feedback form with PI hysteresis [21]. Backstepping controller design method using adaptive neural networks was proposed in conjunction with variable separation and minimal-learning-parameters algorithm technique for stochastic nonlinear single–input–single–output systems in the form of nonstrict-feedback with unknown backlash-like hysteresis [21], strict-feedback [20], and pure-feedback [50], [51].

Although the backstepping design technique has many useful benefits for the designers, it suffers from an inherent problem, called ‘explosion of complexity’, that occurs with increasing system order, due to the continuously differentiation of virtual control signal and system states. Dynamic Surface Control (DSC) method was introduced as another alternative method, which resolves explosion of complexity [8,9,10, 23, 32, 36,37,38, 52, 53]. It avoids continuous differentiation of virtual control inputs leading to ‘explosion of terms’. DSC has been privileged over backstepping in several studies [9]. Integration of adaptive neural network and DSC was studied in nonlinear strict-feedback systems [12], and also in time-delayed nonlinear systems [15], under failure-free condition of the actuators, as well as a certain form of the PI hysteresis [16]. The effect of actuator dead-zone in nonlinear systems was separately studied for adaptive DSC method [23].

Application of dynamic surface method has been studied in several applied researches, such as controlling pneumatic servo system [32], trajectory tracking control of underactuated surface vehicles [36], suppressing chatter in a micro-milling machine with piezo-actuators [37], controlling micro-electro mechanical gyroscope systems [38], controlling process of continuous heavy cargo airdrop of nonlinear transport aircraft [52], controlling of spacecraft terminal safe approach with actuator saturation [53], and precise position tracking problem of permanent magnet synchronous motors [54].

In many practical systems their parameters, and dynamics, as well as the corresponding disturbances are unknown, but can likely show stochastic and mostly nonlinear characteristics. Details of a complete course of stochastic systems and stochastic differential equation are found in [17], [18].

Adaptive neural networks were employed in conjunction with the dynamic surface technique for nonlinear stochastic systems with either time-delays or dead-zone in the actuators [15]. A certain class of nonlinear systems, but not stochastic, with unknown Prandtl–Ishlinskii hysteresis was studied by X. Zhang et al. and the performance of the design method was investigated [22]. In this study, an adaptive neural DSC controller was constructed to eliminate the effect of unknown actuator hysteresis. The adaptive neural network was utilized in DSC design method to stabilize nonlinear time-delay systems with unknown disturbances [9]. Adaptive neural network control systems have been investigated for specific cases of uncertain nonlinear strict-feedback systems [12], and also a class of time-delay nonlinear systems with PI hysteresis with dynamic uncertainties [16,17,18]. Nevertheless, for nonlinear stochastic cases, the adaptive neural network dynamic surface design was studied only under the condition of time delayed and actuator dead-zone [25].

3 Preliminaries and problem statement

A stochastic nonlinear system with strict-feedback can be defined by its state variable \(x={\left[{x}_{1},{x}_{2},\dots , {x}_{n}\right]}^{T}\in {R}^{n}\):

$$\left\{\begin{array}{l}d{x}_{1}=\left({g}_{1}{x}_{2}+{f}_{1}\right)dt+{\psi }_{1}d\psi \\ \vdots \\ d{x}_{i}=\left({g}_{i}{x}_{i+1}+{f}_{i}\right)dt+{\psi }_{i}d\psi\, {b}_{m}\le {g}_{i}\le {b}_{M}\\ \vdots 1\le i\le n\\ d{x}_{n}=\left({g}_{n}u+{f}_{n}\right)dt+{\psi }_{n}d\psi , \\ y={x}_{1}, \end{array}\right.$$
(1)

where \(\psi\) is an r-dimensional variable introduced as standard Brownian motion defined on a complete probability space,Footnote 1 and \(f_{i} \left( \cdot \right), g_{i} \left( \cdot \right):R^{i} \times R^{ + } \to R\), \(\psi_{i}^{T} : R^{i} \times R^{ + } \to R^{i \times r}\) are unknown smooth functions in \(x_{i} \in R^{i}\) with zero initial conditions [25]. It should be noted that \(u\) in Eq. (1) is the control input that is by itself the output of an actuator, which can be subjected to different nonlinearities such as Prandtl–Ishlinskii (PI) hysteresis, or dead-zone.

Prandtl–Ishlinskii (PI) hysteresis is a nonlinearity defined as follows:

$$u\left(t\right)={p}_{0}v\left(t\right)-\underset{0}{\overset{R}{\int }}p\left(r\right){F}_{r}\left[v\right]\left(t\right)dr$$
(2)

where \(u(t)\) is the output of the actuator, \(v(t)\) is the input signal to the actuator, \(p(r)\) is the density function, \({p}_{0}= {\int }_{0}^{R}p(r)dr\) is a constant which depends on the density function \(p(r)\), and \({F}_{r}\left[v\right]\left(t\right)\) is a function, describing the nonlinearity behavior, and named the “play operator” [13]. It should be noted that Eq. (2) decomposes the hysteretic action into two terms, describing the linear reversible part and the nonlinear hysteretic behavior, at its first and the second terms, respectively. This decomposition is crucial since it facilitates utilization of the currently available control techniques for the controller design [15]. An actuator with PI hysteresis is a component with memory, and therefore its value depends on its previous outputs in time. Consequently, for an input \(v\left(t\right)\in {C}_{m}\left[0,{t}_{E}\right]\), where \({C}_{m}\left[0,{t}_{E}\right]\) is the space of piecewise monotone continuous functions, and the play operator is defined by:

$$\begin{gathered} F_{r} \left[ {v;u_{ - 1} } \right]\left( 0 \right) = f_{r} \left( {v\left( 0 \right),u_{ - 1} } \right) \hfill \\ F_{r} \left[ {v;u_{ - 1} } \right]\left( t \right) = f_{r} \left( {v\left( t \right),F_{r} \left[ {v;u_{ - 1} } \right]\left( {t_{i} } \right) } \right), \hfill \\ {\text{for}} \,t_{i} < t \le t_{i + 1} {\text\,{and }}\,0 \le i \le N - 1, \hfill \\ \end{gathered}$$
(3)

with

$${f}_{r}\left(v,u\right)=\mathrm{max}\left(v-r,\mathrm{min}\left(v+r,u\right)\right)$$
(4)

where \(0 = t_{0} < t_{1} < \ldots < t_{N} = t_{E}\) is a partition of \(\left[ {0,t_{E} } \right]\) such that the function v is monotone on each of the sub-interval \(\left[ {t_{i} , t_{i + 1} } \right]\), and \(u_{ - 1} \in R\) is the general initial condition [13].

Consider the PI-model expressed by the play operator in (7), the hysteresis output \(u(t)\) can be expressed as [14]:

$$u\left(t\right)={p}_{0}v\left(t\right)-d\left[v\right]\left(t\right)$$
(5)

where

$$d\left[v\right]\left(t\right)= \underset{0}{\overset{R}{\int }}p\left(r\right){F}_{r}\left[v\right]\left(t\right)dr,\mathrm{ with }\,{p}_{0}= \underset{0}{\overset{R}{\int }}p\left(r\right)dr$$
(6)

It should be noted that (11) is bounded, and the detailed description of its boundedness is discussed in [1620]. Furthermore, in this paper it is assumed that the characteristics of PI hysteresis nonlinearity in the actuator is unknown and should be estimated by the controller.

Actuator dead-zone is another form of the nonlinearity model can be described as follows:

$$u=D\left(v\right)= \left\{\begin{array}{c}{g}_{r}\,\left(v\right), v\le {b}_{l}\\ 0,\, { b}_{l}<v<{b}_{r} \\ {g}_{r}\,\left(v\right), v\ge {b}_{r}\end{array}\right.$$
(7)

where \(u\) is the output of the dead-zone, \(v\) is the input of the dead-zone, \({b}_{l}<0\) and \({b}_{r}>0\) are unknown parameters of the dead-zone, which should be estimated by the control system, named as the start and end of the dead-zone, respectively. The output of the dead-zone is not measurable, and therefore the smooth and bounded first derivative functions \({g}_{l}(v)\) and \({g}_{r}\left(v\right)\) are employed to express the output. In order to achieve a pseudolinear relationship between the input and output of the dead-zone, the following expression is often employed:

$$u\left(t\right)=D\left(v\right)={K}^{T}(t)\Phi \left(t\right)v+d(v)$$
(8)

where detailed description of functions \({K}^{T}\left(t\right),\Phi (t)\), and \(d(v)\) can be found in [16,17,18,]–[19]. However, \({K}^{T}\left(t\right)\Phi (t)\) is bounded, \(\left|d\left(v\right)\right|\le {p}^{*}\), and \({p}^{*}\) is an unknown positive constant [19].

One way to approximate the unknown dynamic of the actuator nonlinearity is the use of a Radial Basis Function Neural Network (RBF). It provides universal approximating capability, by which any unknown continuous function \(f\left(Z\right):{R}^{n}\to R\) can be approximated as follows:

$$f\left(Z\right)={W}^{*T}{\zeta }^{T}\left(Z\right)+\delta \left(Z\right)$$
(9)

where \(Z\in {\Omega }_{\mathrm{Z}}\subset {R}^{q}\) is the input vector with \(q\) being the neural networks input dimension, \(W={\left[{\fancyscript{w}}_{1},{\fancyscript{w}}_{2},\cdots , {\fancyscript{w}}_{l}\right]}^{T}\in {R}^{l}\), is the weight vector of neural networks with \(l>1\), the neural networks node number, and \(\zeta \left(Z\right)=\left[{\varsigma }_{1}\left(Z\right),{\varsigma }_{2}\left(Z\right),\cdots ,{\varsigma }_{l}\left(Z\right)\right]\) is the basis function vector with \({\varsigma }_{i}(Z)\) being chosen as Gaussian function following the form:

$${\varsigma }_{i}\left(Z\right)=\mathrm{exp}\left(-\frac{{\left(Z-{\mu }_{i}\right)}^{T}\left(Z-{\mu }_{i}\right)}{{\eta }_{i}^{2}}\right),\, i=1,\cdots ,l$$
(10)

The \({\mu }_{i}=[{\mu }_{i1},\cdots ,{\mu }_{iq}]\) is the center of the respective field and \({\mu }_{i}\) is the width of the Gaussian function [25]. \(\delta (Z)\) is the approximation error and satisfies \(\left|\delta \left(Z\right)\right|\le \varepsilon\), \(\varepsilon >0\). W*T is the ideal constant weight vector [25] and is defined as:

$${W}^{*T}=\mathrm{arg}\underset{W\in {R}^{l}}{\mathrm{min}} \left\{\underset{x\in {\Omega }_{x}}{\mathrm{sup}}\left|f\left(x\right)-{W}^{T}\zeta \left(x\right)\right|\right\}$$
(11)

For simplicity, by using the minimal-learning-parameters algorithm an unknown constant \(\theta\) is introduced as:

$$\theta =\mathrm{max}\left\{\frac{1}{{b}_{m}} \Vert {W}_{j}^{*T}\Vert ;j=\mathrm{1,2}, \dots , n\right\}$$
(12)

We consider a stochastic nonlinear system in strict-feedback form with unknown dynamics where the actuator is subjected to a nonlinearity. The method proposed by the following sequels employs a radial basis neural network to estimate unknown dynamics of the system, and hence to design the adaptive control method.

4 Methods

4.1 Overview

The proposed control design method is based on using the dynamic surface as a systematic controller design in conjunction with an adaptive RBF neural network to serve as a global approximator meant for unknown dynamics, nonlinearities, and stochastic behaviors of the system. The method which we call Adaptive Neural Network Dynamic Surface Control (ANNDSC) is independently investigated for nonlinear stochastic strict-feedback systems using three different actuator characteristics: linear, nonlinear with dead-zone, and nonlinear with hysteresis characteristics. The probability boundedness of all the closed-loop signals will be proven via stability analysis in an analytic way, and the simulations support the theories for all the three cases of the actuator nonlinearity.

In order to demonstrate effectiveness of RBFNN in compensating actuator nonlinearity, a modification of the ANNDSC method is introduced as a baseline for comparison. This baseline method is named Adaptive PI Compensation using Dynamic Surface Control (APIC-DSC), where the adaptive term is employed to directly compensate the PI hysteresis nonlinearity. The stability analysis of the last method is also analytically proven. In order to show effectiveness of both the ANNDSC and the APIC-DSC, compared to the existing design method, another baseline is defined based on the direct implementation of the backstepping design method. Technical details of the design method for this baseline are found in [25]. For further clarity, and meanwhile maintaining continuity of the subjects, proofs of the presented theorems are included in the appendices.

4.2 The proposed method ANNDSC

Consider a nonlinear stochastic strict-feedback system defined by Eq. (1), ANNDSC offers an iterative procedure with n steps (n is order of the system) for designing the control system. At each step of the method, the error surface is firstly calculated by subtracting state variables from the desired output. Then, the calculated error is passed through a first-order filter, for all the steps, but the last step. An RBF neural network employs the filtered error to approximate dynamic of the system. The error surface \(\left\{{S}_{i}\right\}\), for step i, \(1\le i\le n\), is defined as:

$${S}_{i}={x}_{i}-{z}_{i}, 1\le i\le n,$$
(13)

where \(x_{i}\) and \(z_{i}\) are the corresponding state variable and desired state value, respectively. For \(i = 1\), \(z_{1} = y_{r}\), where \(y_{r}\) is the reference input, the desired output of the system. The proposed procedure involves \(n\) successive steps of computation, as depicted in Fig. 1.

Fig. 1
figure 1

Flowchart of the proposed method, ANNDSC, for nonlinear stochastic systems in the form of strict-feedback

A virtual control input \(\overline{x}_{i + 1}\) is defined at each step:

$$\overline{x}_{i + 1} = - k_{i} S_{i} - \frac{1}{{2a_{i} }}S_{i}^{3} \hat{\theta }\zeta_{i}^{T} \left( {Z_{i} } \right)\zeta_{i} \left( {Z_{i} } \right), \,1 \le i \le n,\, Z_{i} = \left[ {\overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\smile}$}}{x}_{i} ,\hat{\theta }} \right], \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\smile}$}}{x}_{i} = \left[ {x_{1} , \ldots ,x_{i} } \right],$$
(14)
$$v = \overline{x}_{n + 1} = - k_{n} S_{n} - \frac{1}{{2a_{n} }}S_{n}^{3} \hat{\theta }\zeta_{n}^{T} \left( {Z_{n} } \right)\zeta_{n} \left( {Z_{n} } \right),$$
(15)

where the ai, \({k}_{i}\) are the design parameters, \(\zeta_{i} \left( {Z_{i} } \right)\) are the radial basis functions of the corresponding neural network, and \(\widehat{\theta }\) is an estimation of \(\theta\). The virtual control input is passed through a low-pass filter to obtain the desired value for the next state:

$${\epsilon }_{i+1}{\dot{z}}_{i+1}+{z}_{i+1}={\stackrel{-}{x}}_{i+1}, 1\le i\le n-1$$
(16)

where the \({\epsilon }_{i+1}, 1\le i\le n-1\) are the design constants. Finally, the RBF neural network weights are approximated using the following expression:

$$\dot{\widehat{\theta }}=\sum_{j=1}^{n}\frac{\lambda }{2{a}_{j}^{2}}{S}_{j}^{6}{\zeta }_{j}^{T}\left({Z}_{j}\right){\zeta }_{j}\left({Z}_{j}\right)-{k}_{0}\widehat{\theta }$$
(17)

where \(\lambda\) is a design constant, and \({\zeta }_{j}(\cdot )\), \(1\le j\le n\) are the basis functions of the neural network. The RBF neural network is indeed composed of two layers. The first layer incorporates \(l\) nodes. Each node \(i\) (\(1\le i\le l\)) corresponds to a Gaussian function of center \({\eta }_{i}\), and width \({\mu }_{i}\). The three parameters \((l,{\eta }_{i}, {\mu }_{i})\) are treated as the design parameters. The second layer is a linear superposition of the Gaussian functions, using the learning weight \(W\). Norm of the learning weights is employed for the approximation.

Theorem 1

Applying the ANNDSC controller design method, to a nonlinear stochastic strict-feedback system with a linear actuator and any unknown dynamics, Eq. (1), guarantees the boundedness in probability of all closed-loop signals of the system.

Proof 1

The comprehensive proof of this theorem is explained in Appendix 2.

Theorem 2

Applying the ANNDSC controller design method, to nonlinear stochastic strict-feedback systems with any unknown dynamics, Eq. (1), which is subjected to a hysteresis nonlinearity in its actuator, guarantees that all the closed-loop signals of the system remain bounded in probability.

Proof 2

The comprehensive proof of this theorem is explained in Appendix 4.

Theorem 3

Applying the ANNDSC controller design method, to a nonlinear stochastic system in strict-feedback form with any unknown dynamics, Eq. (1), subjected to actuator dead-zone nonlinearity, guarantees the boundedness in probability of all closed-loop signals of the system.

Proof 3

The comprehensive proof of this theorem is explained in Appendix 5.

Experimentation of ANNDSC is herein described through a practical example of a hypersonic aircraft cruising at a Mach number of 15 and an altitude of 110,000 ft, which is subjected to a nonlinear stochastic condition. The control system comprises two separate controllers, for the velocity and the flight path angle [24]. Dynamic of the flight path angle can be expressed by the state equation system (Eq. 18) using three state variables: \(x={\left[{x}_{1},{x}_{2},{x}_{3}\right]}^{T}={\left[\gamma , \theta , q\right]}^{T}\):

$$\begin{gathered} {\varvec{dx}}_{1} = \left( {{\varvec{f}}_{1} \left( {{\varvec{x}}_{1} } \right) + {\varvec{g}}_{1} \left( {{\varvec{x}}_{1} } \right){\varvec{x}}_{2} } \right){\varvec{dt}} + {\varvec{\psi}}_{1} \user2{d\psi } \hfill \\ {\varvec{dx}}_{2} = {\varvec{x}}_{3} {\varvec{dt}} + {\varvec{\psi}}_{2} \user2{d\psi } \hfill \\ {\varvec{dx}}_{3} = \left( {{\varvec{f}}_{3} \left( {{\varvec{x}}_{1} ,{\varvec{x}}_{2} ,{\varvec{x}}_{3} } \right) + {\varvec{g}}_{3} \left( {{\varvec{x}}_{1} ,{\varvec{x}}_{2} } \right){\varvec{u}}} \right){\varvec{dt}} + {\varvec{\psi}}_{3} \user2{d\psi } \hfill \\ {\varvec{y}} = {\varvec{x}}_{1} \hfill \\ \end{gathered}$$
(18)

The three state variables \(\gamma\), \(\theta\) and \(q\) are the flight path angle, the altitude, and the pitch rate, respectively. The functions \({{\varvec{f}}}_{1}\left({{\varvec{x}}}_{1}\right)\), \({{\varvec{f}}}_{3}\left({{\varvec{x}}}_{1},{{\varvec{x}}}_{2},{{\varvec{x}}}_{3}\right)\), \({{\varvec{g}}}_{1}\left({{\varvec{x}}}_{1}\right)\), and \({{\varvec{g}}}_{3}\left({{\varvec{x}}}_{1},{{\varvec{x}}}_{2}\right)\) are the nonlinear functions describing dynamic of the system. The \({\psi }_{i}, 1\le i\le 3\) are unknown smooth functions, and \(\mu\) is a constant number. Details of finding dynamic model of the system are found in [24]. The system defined in Eq. (18) demonstrates dynamics of a nonlinear stochastic strict-feedback plant. The flight path \({\varvec{y}}\) can be controlled using the ANNDSC.

4.3 Adaptive PI compensation DSC (APIC-DSC)

In ANNDSC, the proposed neural network provided sufficient tools both for the control design and for compensating the actuator nonlinearity. The proposed baseline of APIC-DSC is introduced to investigate effect of using neural network for compensating the actuator nonlinearity, proposed by ANNDSC. In this baseline study, an adaptive PI hysteresis compensator is proposed using direct method, in conjunction with the adaptive RBF neural network to compensate actuator nonlinearity of kind Prandtl–Ishlinskii (PI) hysteresis, in contrast with ANNDSC in which the neural network undertook the compensation task. as defined in Eqs. (2) to (5). In this situation, for each step i, \(i\le i\le n\), the error surface and the virtual control inputs are defined as in Eq. (13) and Eq. (14). Figure 2 illustrates flowchart of the method.

Fig. 2
figure 2

Flowchart of the baseline method, named APIC-DSC, for nonlinear stochastic systems in strict-feedback form

The virtual control input is passed through a low-pass filter to obtain the desired value for the next state:

$${\epsilon }_{i+1}{\dot{z}}_{i+1}+{z}_{i+1}={\stackrel{-}{x}}_{i+1}, 1\le i\le n-1$$
(19)

where \({\epsilon }_{i+1}, 1\le i\le n-1\) are design parameters. The control input is:

$$v= {\stackrel{-}{x}}_{n+1}=-{k}_{n}{S}_{n}-\frac{1}{2{a}_{n}^{2}}{S}_{n}^{3}\widehat{\theta }{\zeta }_{n}^{T}\left({Z}_{n}\right)\zeta \left({Z}_{n}\right)+\underset{0}{\overset{R}{\int }}{\widehat{p}}_{{p}_{0}}\left(t,r\right)\left|{F}_{r}\left[v\right]\left(t\right)\right|dr$$
(20)

where the ai, \({k}_{i}\) are design parameters, \(\zeta_{i} \left( {Z_{i} } \right)\) are the radial basis functions, \(\widehat{\theta }\) and \({\widehat{p}}_{{p}_{0}}\) are the estimations of \(\theta\) and \({p}_{{p}_{0}}\), respectively. The \({\widehat{p}}_{{p}_{0}}\) is approximated using an adaptive law:

$$\frac{\partial }{\partial t}{\widehat{p}}_{{p}_{0}}\left(t,r\right)= \left\{\begin{array}{l}-{\gamma }_{pr}\left[\left|{S}_{n}^{3}\right|\left|{F}_{r}\left[u\right]\left(t\right)\right|+\sigma {\widehat{p}}_{{p}_{0}}\left(t,r\right)\right], 0\le {\widehat{p}}_{{p}_{0}}\le {p}_{max},\\ -\sigma {\widehat{p}}_{{p}_{0}}\left(t,r\right), {\widehat{p}}_{{p}_{0}}>{p}_{max},\end{array}\right.$$
(21)

where the\(\sigma\) and \({p}_{max}\) are positive design parameters, \({p}_{{p}_{0}}\left(t,r\right)=p(t,r)/{p}_{0}\), and \(p_{{p_{\max } }} : = \left( {p_{\max } /p_{0} } \right)\). Finally, the RBF weights are approximated using the following adaptive law:

$$\dot{\widehat{\theta }}=\sum_{j=1}^{n}\frac{\lambda }{2{a}_{j}^{2}}{S}_{j}^{6}{\zeta }_{j}^{T}\left({Z}_{j}\right){\zeta }_{j}\left({Z}_{j}\right)-{k}_{0}\widehat{\theta }$$
(22)

where \(\lambda\) is a design constant, and \({\zeta }_{j}(\cdot )\), \(1\le j\le n\) are the basis functions of the neural network.

It can be seen from (22) and (21) that both the nonlinearities and the system dynamics are approximated using the adaptive law in Eq. (22) resulted from the neural network weights; however, the density function of the PI integral is directly approximated using the adaptive law in Eq. (21).

Theorem 4

Applying the APIC-DSC controller design method, to a nonlinear stochastic system with any unknown dynamics, subjected to actuator PI hysteresis nonlinearity guarantees the boundedness in probability of all closed-loop signals of the system.

Proof 4

The comprehensive proof of this theorem is explained in Appendix 6.

5 Simulation results

Performance of the proposed ANNDSC method, along with the two baseline methods, is evaluated and compared in a tracking problem using a 3rd-order benchmark system. Details of the benchmark system are found in [25]. Another alternative benchmark of 2rd-order system can be found in [25], but we used the 3rd-order ones with more complexities to explore performance of the methods and hence provide a better comparison, under a rather complex condition. This benchmark for study considers a stochastic nonlinear system in strict-feedback form:

$$\begin{gathered} dx_{1} = \left( {\left( {0.3 + x_{1}^{2} } \right)x_{2} - 0.8\sin \left( {x_{1} } \right)} \right)dt + x_{1} \sin \left( {x_{1} } \right)d\omega , \hfill \\ dx_{2} = \left( {\left( {1 + x_{2}^{2} } \right)x_{3} - x_{2} - 0.5{ }x_{2}^{3} - x_{1}^{3} - \sqrt {x_{1} } } \right)dt + x_{1} \cos \left( {x_{2} } \right)d\omega , \hfill \\ dx_{3} = \left( {\left( {1.5 + \sin \left( {x_{1} x_{2} } \right)} \right)u - 0.5x_{3} - \frac{1}{3}x_{3}^{2} - x_{2}^{2} x_{3} - \frac{{x_{1} }}{{\left( {1 + x_{1}^{2} } \right)}}} \right)dt + 3x_{1} e^{{ - x_{2}^{2} }} d\omega , \hfill \\ y = x_{1} \hfill \\ y_{r} = {\text{sin}}\left( t \right) \hfill \\ \end{gathered}$$
(23)

The simulation study considers a failure-free condition together with two other cases of the actuator nonlinearity, i.e., actuator dead-zone, and the actuator PI hysteresis for the proposed method, ANNDSC, along with the two baseline design methods, named APIC-DSC and backstepping design, respectively. The ANNDSC-based controller for the failure-free, and the two cases of nonlinearity, is designed using Eqs. (13)–(17) as follows:

$$\left\{\begin{array}{l}{S}_{1}={x}_{1}-{y}_{r}, \\ {\stackrel{-}{x}}_{2}= -{k}_{1}{S}_{1}-\frac{1}{2{a}_{1}^{2}}{S}_{1}^{3}\widehat{\theta }{\zeta }_{1}^{T}\left({Z}_{1}\right){\zeta }_{1}\left({Z}_{1}\right),\\ {\epsilon }_{2}{\dot{z}}_{2}+{z}_{2}={\stackrel{-}{x}}_{2}, \\ {S}_{2}={x}_{2}-{z}_{2}, \\ {\stackrel{-}{x}}_{3}=-{k}_{2}{S}_{2}-\frac{1}{2{a}_{2}^{2}}{S}_{2}^{3}\widehat{\theta }{\zeta }_{2}^{T}\left({Z}_{2}\right){\zeta }_{2}\left({Z}_{2}\right),\\ {\epsilon }_{3}{\dot{z}}_{3}+{z}_{3}={\stackrel{-}{x}}_{3}, \\ {S}_{3}={x}_{3}-{z}_{3}, \\ v= -{k}_{3}{S}_{3}-\frac{1}{2{a}_{3}^{2}}{S}_{3}^{3}\widehat{\theta }{\zeta }_{3}^{T}\left({Z}_{3}\right){\zeta }_{3}\left({Z}_{3}\right),\\ \dot{\widehat{\theta }}= \sum_{k=1}^{3}\frac{\lambda }{2{a}_{k}^{6}}{S}_{k}^{6}{\zeta }_{k}^{T}\left({Z}_{k}\right){\zeta }_{k}\left({Z}_{k}\right)-{k}_{0}\widehat{\theta },\end{array}\right.$$
(24)

where \({Z}_{1}={S}_{1},\) \({Z}_{2}=[{S}_{1},{S}_{2},\widehat{\theta }]\), \({Z}_{3}=[{S}_{1},{S}_{2},{S}_{3},\widehat{\theta }]\), \(p\left(r\right)={e}^{-0.067{\left(r-1\right)}^{2}}, r\in \left[\mathrm{0,10}\right], v\left(t\right)=\frac{7\mathrm{sin}\left(3t\right)}{1+t}, t\in [\mathrm{0,2}\pi ]\), and \({u}_{-1}=0\) are the input vectors of neural networks. Equation (24) shows the design steps of a control system of order 3, which clearly involves three design steps. At each of the step, firstly the error surface \({S}_{i}\) is calculated by subtracting the desired value \({z}_{i}\) from the actual value of state \({x}_{i}\). Using the error surface, along with the neural network weights, \(\widehat{\theta }\), the virtual control input of the step, \({x}_{i+1}\), is estimated. The desired value is calculated for each step by passing the virtual control input signal through a first-order filter, except for the final step, where the actual control input is directly generated. Subsequently, the neural network weights are found using the adaptive law, by which the unknown dynamics and nonlinearities of the system will be approximated.

The design parameter set for the simulation is \({[{k}_{0},k}_{1},{k}_{2},{k}_{3}]\), [\({a}_{1},{a}_{2},{a}_{3}]\), \({\epsilon }_{2}=0.006,\) and \({\epsilon }_{3}=0.008\), which are obtained empirically considering the transient performance, the limitations on control effort growth, the closed-loop internal stability of the system, and improvement in the tracking error. The simulation runs under the initial condition of \({\left[{x}_{1}\left(0\right),{x}_{2}\left(0\right),{x}_{3}(0)\right]}^{T}={\left[\mathrm{0,0.6,0.4}\right]}^{T}\), \(\widehat{\theta }(0)=0.1\). The three RBF neural networks \({W}_{1}^{T}{\zeta }_{1}({Z}_{1})\), \({W}_{2}^{T}{\zeta }_{2}({Z}_{2})\), and \({W}_{3}^{T}{\zeta }_{3}({Z}_{3})\) are chosen to contain eleven nodes with the centers spaced evenly within the intervals \([-\mathrm{5,5}]\), \(\left[-\mathrm{5,5}\right]\times \left[-\mathrm{5,5}\right]\times [-\mathrm{5,5}]\), and \(\left[-\mathrm{5,5}\right]\times \left[-\mathrm{5,5}\right]\times \left[-\mathrm{5,5}\right]\times [-\mathrm{5,5}]\), respectively.

The APIC-DSC baseline controller for the system in Eq. (23) is designed using Eqs. (13), (14), (19)–(22). This controller is completely similar to Eq. (24), unless the controller input signal:

$$v= -{k}_{3}{S}_{3}-\frac{1}{2{a}_{3}^{2}}{S}_{3}^{3}\widehat{\theta }{\zeta }_{3}^{T}\left({Z}_{3}\right){\zeta }_{3}\left({Z}_{3}\right)+\underset{0}{\overset{R}{\int }}{\widehat{p}}_{{p}_{0}}\left(t,r\right)\left|{F}_{r}\left[v\right]\left(t\right)\right|dr,$$
$$\frac{\partial }{\partial t}{\widehat{p}}_{{p}_{0}}\left(t,r\right)= \left\{\begin{array}{c}-{\gamma }_{pr}{S}_{n}^{3}{F}_{r}\left[u\right]\left(t\right)-{\gamma }_{pr}\sigma {\widehat{p}}_{{p}_{0}}\left(t,r\right), {\widehat{p}}_{{p}_{0}}>{p}_{max}\\ -{\gamma }_{pr}\sigma {\widehat{p}}_{{p}_{0}}\left(t,r\right), {\widehat{p}}_{{p}_{0}}<{p}_{max}\end{array}\right.,$$
(25)

where \({[{k}_{0},k}_{1},{k}_{2},{k}_{3}]\), [\({a}_{1},{a}_{2},{a}_{3}]\) are design parameters, obtained empirically likewise the ANNDSC case. \({p}_{max}=0.2, R=2, { \gamma }_{pr}=1, \sigma =2\), and other parameters are similar to the previous simulation step. The backstepping case is entirely implemented as detailed in [25].

Simulation results of these three cases are depicted in Figs. 3, 4, 5, 6 and 7. The system output is depicted against the reference sinusoidal input, for all the three methods, under the three actuator conditions.

Fig. 3
figure 3

Output of the three control systems and the reference input, for three different characteristics of the actuators

Fig. 4
figure 4

Integrated Mean Square Error (IMSE) of a tracking problem for the proposed method (ANNDSC), the baseline methods 1 (APIC-DSC), and the baseline method 2 (backstepping), under different conditions of failure-free, actuator dead-zone and Prandtl–Ishlinskii (PI) hysteresis

Fig. 5
figure 5figure 5figure 5

Comparison of the closed-loop states of the proposed control system (ANNDSC) along with the baseline methods APIC-DSC and backstepping in the tracking problem. The three actuator nonlinearities are separately illustrated for the third order system with the three state variables

Fig. 6
figure 6

Control input of the three systems in the tracking problem, described in the sequel

Fig. 7
figure 7

Adaptive law of the three methods for the tracking problem

Minimal deflection from the desired form of sinusoidal wave is seen for all the methods and conditions. In order to quantitate the deflection from the desired output, the Integrated Mean Square Error of the actual outputs is calculated with respect to the inputs. Figure 4 demonstrates the IMSE for the all methods and conditions.

Outperformance of ANNDSC is seen for all the case with the minimal IMSE. In the tracking problem, lower IMSE is regarded as an indication of the better performance. For the failure free cases, the relative depression in IMSE of the proposed method is observed to be 25% and 11% as compared to the backstepping and APIC-DSC, respectively. Nevertheless, effectiveness of the method is further highlighted when there is a nonlinearity condition in the actuator. For the PI hysteresis, the proposed method improves the tracking performance by 76% and 38% as reflected by the relative IMSE for the backstepping and APIC-DSC method, respectively. For the dead-zone nonlinearity, this relative outperformance is, however, 32% and 49%, showing a good improvement in the tracking performance. For the dead-zone condition, the APIC-DSC offers the worst IMSE, implying that improvement in the PI hysteresis is served at the expense of impairing the performance for other condition, when direct method is employed. It is also seen that all the methods offer their optimal performance at the absence of the actuator nonlinearity.

In order to investigate internal stability of the control methods, closed-loop signals and states of the three control systems are plotted with different actuator nonlinearities. Figure 5 shows the closed-loop states.

The close-loop signals of the APIC-DSC are considerably higher than the ANNDSC and the backstepping, showing further tendency to internal instability in practical situations, even though the values are bounded. This is confirmed by the control input signal, depicted in Fig. 6.

The ANNDSC method exhibits smaller control effort, compared to the two baselines. The control input signal of ANNDSC shows smoother and low oscillatory waveform, which provides a more reliable functionality in practice. The risk of the internal stability is by far highest for the APIC-DSC, even though the outputs are not far different for all the methods. It is important to note that high amplitude of the control input signal can practically put the system into the risk of actuator saturation. These conditions sometimes make finding a control strategy impractical, despite showing acceptable tracking.

Figure 7 demonstrates adaptive law of the three methods.

As seen in Fig. 7, the adaptive law, \(\widehat{\theta }\), damps quicker for the ANNDSC and APIC-DSC, revealing faster convergence for the neural network-based methods compared to the backstepping one. Figures 8 and 9 show the system outputs and the control inputs, for a case of the joint dead-zone and PI hysteresis nonlinearities, occurring at two different time instances.

Fig. 8
figure 8

Output of the three control systems and the reference input, for the joint dead-zone and hysteresis nonlinearity, occurring at the seconds 1 and 4

Fig. 9
figure 9

Control input of the three systems in the tracking problem, for the joint dead-zone and hysteresis nonlinearity, occurring at the seconds 1 and 4

All the three methods show good performance in tracking the output. However, the APIC-DSC dramatically increases the control inputs on the occurrence of the dead-zone. This makes the APIC-DSC an inappropriate candidate for the practical situations, where such the large value of the control input put the system into the risk of saturation.

6 Discussion

The paper suggested an adaptive control design method for nonlinear stochastic systems with a general class of the actuator nonlinearity. In contrast to the existing techniques relying on the backstepping design method [11,12,13,14,15,16,17,18,19,20,]–[21], the proposed method employed dynamic surface control design, along with neural networks through an algorithm of minimal learning parameters, to avoid the “explosion of complexity” and decline the computational efforts. This favorable feature which cannot be seen in the backstepping-based methods will become especially important for the systems with increased order. Such the implementation improves agility of the design method to be suitable for an online application. The paper proved boundedness of all the closed-loop signals and convergence of all the error signals to a small vicinity of the origin at the presence of two different nonlinearities, commonly seen at the actuators, dead-zone and hysteresis, in both analytic and simulation manners.

Although certain nonlinearities have been investigated in recent studies [25], the joint dead-zone and hysteresis were not included in the studies. In many practical applications, actuators can accidentally encounter with any of the dead-zone and hysteresis, due to the aging. It is sometimes critically important to consider such the conditions in the design method.

We introduced a baseline method for nonlinear stochastic system, named APIC-DSC, sophisticated for compensating the actuator hysteresis. In this baseline method, adaptive neural network is not invoked for the compensation. It is analytically proved that the closed-loop signals remain bounded in probability. This method although shows acceptable performance for the failure-free and also for the hysteresis conditions, but dramatically increases the control input at the presence of the dead-zone.

Considering Figs. 5 and 9, the control effort of the ANNDSC is much less than the two other baseline methods. It is possible to improve the tracking at the cost of increasing the control effort. It might, however, lead to actuator saturation or internal instability of the system. It was observed that the control effort is by far lower for ANNDSC than the two baseline methods.

In this study, the proposed method was empirically optimized by jointly considering the tracking performance and the control effort. Among the design parameters, the set of \(\left[ {k_{0} ,k_{1} ,k_{2} ,k_{3} } \right]\) and \(\left[ {a_{1} ,a_{2} ,a_{3} } \right]\) have more effect on the transient and the steady state characteristics of the system where the \({k}_{3},\) and \({a}_{3}\) directly affect the control input of the system. However, the proposed method can be well-integrated with the genetic algorithm for finding an optimal set of the design parameters. This is also true for other metaheuristic methods, or natural-based algorithms, such as ant colony algorithm. For our baseline study of backstepping method, we used the same set of the design parameters described in [25] as an initial set, and followed similar empirical procedure for improving the performance, as was done for ANNDSC and APIC-DSC.

Selecting an appropriate sampling rate plays an important role in efficient performance of any control system. A low sampling rate can lead to system instability, while on the other hand, an excessive sampling rate increases redundant complexities. A recent study proposed an interesting systematic method, named FIRCEP, that can be easily employed for finding an optimal sampling rate [55].

We used MATLAB R2017b for the simulations and analysis. Nowadays, there are various platforms, commercially available for efficient implementation in the practical situations and real plants, such as PLC systems with strong computational power. It is obvious that such the implementations demand a level of the practical considerations.

7 Conclusion

This paper proposed a novel adaptive design method for nonlinear stochastic control systems using neural network. The proposed method was investigated under joint conditions of the actuator nonlinearities, defined as the dead-zone and the Prandtl–Ishlinskii hysteresis. Stability analysis was analytically studied and confirmed by the simulation results in a tracking problem. Performance of the proposed method was compared to a baseline of widely used method, the backstepping method. It is observed that using the proposed neural network in conjunction with the dynamic surface method, considerably enhances performance of the control design method, and meanwhile decreases the computational complexities as well as the control effort.