1 Introduction

The main inspiration behind the introduction of fuzzy sets theory was the necessity for modeling real-world phenomena, which are inherently vague and ambiguous. Human knowledge about complex problems can be successfully represented using the imprecise terms of natural language. The theories of fuzzy sets and fuzzy logic provide formal tools for mathematical representation and efficient processing of such information.

The term “system” is usually understood as a set of interacting components with well-defined structure and organized as an intricate whole that can be distinguished from the “external” environment. A system communicates with the environment through so-called inputs and outputs. Fuzzy systems are structures based on fuzzy techniques oriented towards information processing, where the usage of classical sets theory and binary logic is impossible or difficult. In the literature, terms such as fuzzy system, fuzzy model, system based on fuzzy rules, fuzzy controller, or fuzzy associative memory are used interchangeably depending on the application type [16]. Their main characteristic involves symbolic knowledge representation in a form of fuzzy conditional (if-then) rules.

Fig. 2.1
figure 1

The typical structure of a fuzzy system

The typical structure of a fuzzy system (Fig. 2.1) consists of four functional blocks: the fuzzifier, the fuzzy inference engine, the knowledge base, and the defuzzifier. Both linguistic values (defined by fuzzy sets) and crisp (numerical) data can be used as inputs for a fuzzy system. If crisp data are applied, then the inference process is preceded by fuzzification, which assigns the appropriate fuzzy set to the nonfuzzy input. The values of input variables are mapped into linguistic values of the output variable by means of the appropriate method of approximate reasoning (inference engine) using expert knowledge, which is represented as a collection of fuzzy conditional rules (knowledge base). In addition to the linguistic values, the numerical data may be required as the fuzzy system output. In such cases defuzzification methods are used, which assign the representative crisp data to the resultant output fuzzy set.

Practical applications of fuzzy systems include problems for which the complete mathematical description is unavailable, or where the usage of the precise (nonfuzzy) model is uneconomical or highly inconvenient. The ability to process inaccurate information makes a fuzzy system an excellent tool, for example, for control processes [12, 19], system identification [11, 20], decision support [24, 33], and signal and image processing [4, 23].

In the following sections only static fuzzy systems (i.e., systems where the outputs are determined only on the basis of the current input values) are considered. Included are concepts of knowledge representation in the form of fuzzy conditional rules, the idea of approximate reasoning, and the description of basic structures of fuzzy systems.

2 Fuzzy Conditional Rules

One of the fundamental concepts of fuzzy sets theory is a linguistic variable [34]. Its values are the statements of natural language (terms), which are the labels (descriptions) of fuzzy sets defined on a given universe (space) of discourse. Formally, a linguistic variable is defined as a quintuple [35]:

$$\begin{aligned} X =\left( \mathscr {N},\mathscr {L}\left( \mathsf {G}\right) ,\mathbb {X} ,\mathsf {G},\mathsf {S}\right) , \end{aligned}$$
(2.1)

where \(\mathscr {N}\) is a name of the linguistic variable, \(\mathscr {L}\left( \mathsf {G}\right) \) denotes the family of values of the linguistic variable being a collection of labels of the fuzzy sets defined on the universe \(\mathbb {X}\), \(\mathsf {G}\) is the set of syntactic rules defined by a grammar determining all terms in \(\mathscr {L}(\mathsf {G})\), and \(\mathsf {S}\) represents the semantics of the variable X, that defines the meaning of all labels.

As an example we can use a linguistic variable describing the fetal heart rate (FHR). The name of the variable can be defined as \(\mathscr {N} = \text {``mean FHR''}\). According to FIGO guidelines [21], the set of possible linguistic values is a collection of three labels describing the fetal state as: \(\mathscr {L} = \) {“normal,” “suspicious,” “pathological”}. To each of the labels we can assign a fuzzy set \(A_{i}: i = 1, 2, \ldots , 5,\) defined on \(\mathbb {X} = [0, 250]\) bpm, which represents the range of possible number of heart beats per min [3]. The examples of membership functions \(\mu _{A_{i}}(x)\) of the fuzzy sets \(A_{i}\) are shown in Fig. 2.2.

Fig. 2.2
figure 2

Examples of membership functions of fuzzy sets defining the values of the linguistic variable X \(=\) “mean FHR”

An elementary statement for the linguistic variable X is the fuzzy expression:

$$\begin{aligned} X\,\mathbf {is }\ L_A, \end{aligned}$$
(2.2)

where \(L_A\) is a label from the collection \(\mathscr {L}(\mathsf {G})\), defined by a fuzzy set A on the universe \(\mathbb {X}\). The logical value of the expression is determined on the basis of membership function \(\mu _{A}\left( x\right) \) of the fuzzy set A. In the preceding example, an elementary statement is:

$$\begin{aligned} \text {``mean FHR'' } \mathbf {is} \text { ``normal''}, \end{aligned}$$

which value for the measurement 110 bpm is equal to \(\mu _{A_3}\left( x\right) = 0.5\) (see Fig. 2.2).

A more complex fuzzy expression can be obtained by combining two or more elementary expressions. It can be presented in the conjunctive:

$$\begin{aligned} (X_{1}\ \mathbf {is }\ L_{A_{1}})\ \mathbf { and }\ (X_{2}\,\mathbf { is }\ L_{A_{2}}), \end{aligned}$$
(2.3)

or the disjunctive form:

$$\begin{aligned} (X_{1}\ \mathbf {is }\ L_{A_{1}})\,\mathbf { or }\ (X_{2}\,\mathbf { is }\ L_{A_{2}}), \end{aligned}$$
(2.4)

where \(X_{1},X_{2}\) are linguistic variables with labels \(L_{A_{1}},L_{A_{2}}\) defined by the fuzzy sets \(A_{1}\) and \(A_{2}\), respectively, on the universes \(\mathbb {X}_{1}\) and \(\mathbb {X}_{2}\).

The value of a complex fuzzy expression for \(x_{1}\in \mathbb {X}_{1}\) and \(x_{2}\in \mathbb {X}_{2}\) is determined on the basis of the membership functions of fuzzy sets \(A_{1}\) and \(A_{2}\) [16]:

$$\begin{aligned} \mu _{A_{1}}\left( x_{1}\right) \star _{T}\mu _{A_{2}}\left( x_{2}\right) , \end{aligned}$$
(2.5)

for the conjunctive form, and

$$\begin{aligned} \mu _{A_{1}}\left( x_{1}\right) \star _{S}\mu _{A_{2}}\left( x_{2}\right) , \end{aligned}$$
(2.6)

for the disjunctive form, where \(\star _{T}\) denotes a t-norm, and \(\star _{S}\) an s-norm.

An elementary fuzzy statement can also be expressed in the form of an implication forming a fuzzy if-then rule (fuzzy conditional statement):

$$\begin{aligned} \mathbf {if }\ \left( X\ \mathbf { is }\ L_A \right) ,\ \mathbf { then }\ \left( Y \ \mathbf { is }\ L_B \right) , \end{aligned}$$
(2.7)

defining a relationship between linguistic variables. The statement “X \(\mathbf {is}\) \(L_A\)” is called the antecedent (premise), and the statement “Y \(\mathbf {is}\) \(L_B \)” is called the consequent (conclusion).

A generalized form of the fuzzy conditional statement can be defined as an implication of complex fuzzy expressions. For the conjunctive form it can be written as:

$$\begin{aligned}&\mathbf { if }\ \left( X_{1}\ \mathbf { is }\ L_{A_{1}} \right) \ \mathbf { and } \ \left( X_{2}\ \mathbf { is }\ L_{A_{2}} \right) \ \mathbf { and }\ \cdots \ \mathbf { and }\ \left( X_{N}\ \mathbf { is }\ L_{A_{N}} \right) , \\&\mathbf { then } \ \left( Y_{1}\ \mathbf { is }\ L_{B_{1}} \right) \text {, }\left( Y_{2}\ \mathbf { is }\ L_{B_{2}} \right) , \ldots , \left( Y_{M}\ \mathbf { is }\ L_{B_{M}} \right) , \nonumber \end{aligned}$$
(2.8)

and for the disjunctive form as:

$$\begin{aligned}&\mathbf { if }\ \left( X_{1}\ \mathbf { is }\ L_{A_{1}} \right) \ \mathbf { or } \ \left( X_{2}\ \mathbf { is }\ L_{A_{2}} \right) \ \mathbf { or } \ \cdots \ \mathbf { or }\ \left( X_{N}\ \mathbf { is }\ L_{A_{N}} \right) , \\&\mathbf { then } \left( Y_{1}\ \mathbf { is }\ L_{B_{1}} \right) \text {, }\left( Y_{2}\ \mathbf { is }\ L_{B_{2}} \right) , \ldots , \left( Y_{M}\ \mathbf { is }\ L_{B_{M}} \right) , \nonumber \end{aligned}$$
(2.9)

where \(X_{1},X_{2},\ldots ,X_{N}\) are the input linguistic variables; \(Y_{1},Y_{2},\ldots ,Y_{M}\) are the output linguistic variables; \(L_{A_{1}},L_{A_{2}}, \ldots ,\) \(L_{A_{N}}\), and \(L_{B_{1}},L_{B_{2}}, \ldots ,\) \(L_{B_{M}}\) are their linguistic values, defined with fuzzy sets \(A_{1},A_{2}, \ldots ,\) \(A_{N}\) and \(B_{1}, B_{2}, \ldots ,\) \(B_{M}\) on universes \(\mathbb {X}_{1}, \mathbb {X}_{2}, \ldots ,\) \(\mathbb {X}_{N}\), and \(\mathbb {Y}_{1}, \mathbb {Y}_{2}, \ldots , \mathbb {Y}_{M}\), respectively.

Both implications are the fuzzy if-then rules with multiple inputs and multiple outputs (MIMO). The MIMO fuzzy rule can be decomposed into the corresponding set of canonical fuzzy if-then rules [16], which are the MISO (multiple inputs and single output) type of fuzzy conditional statements with conjunctive antecedent:

$$\begin{aligned} \mathbf {if }\ \underset{n=1}{\overset{N}{\mathbf {and}}} \left( X_{n}\ \mathbf { is }\ L_{A_{n}}\right) , \ \mathbf { then }\ Y\ \mathbf { is }\ L_{B}. \end{aligned}$$
(2.10)

Canonical fuzzy conditional statements are the basics for representing expert knowledge in a fuzzy system. Using pseudo-vector notation, the canonical fuzzy if-then rule can be written as

$$\begin{aligned} \mathbf {if }\ \left( \mathbf {X}\ \mathbf {is }\ \mathbf {L}_\mathbf {A}\right) ,\ \mathbf { then }\left( Y\ \mathbf { is }\ L_B\right) , \end{aligned}$$
(2.11)

which is an \(N + 1\)-nary fuzzy relation [4]:

$$\begin{aligned} R = \left( \left( A_{1} \times A_{2} \times \cdots \times A_{N}\right) \Longrightarrow B\right) \, = \, \left( \mathbf {A} \Longrightarrow B\right) , \end{aligned}$$
(2.12)

defined on \(\mathbb {X}_{1}\times \mathbb {X}_{2}\times \cdots \times \mathbb {X}_{N}\times \mathbb {Y}\), with the membership function:

$$\begin{aligned} \mu _{R}\left( x_{1},\ldots ,x_{N},y\right) =\varPhi \left( \mu _{\mathbf {A}} \left( \mathbf {x}\right) , \mu _{B}\left( y\right) \right) , \end{aligned}$$
(2.13)

where \(\mathbf {x} = {\left[ x_{1},\ldots ,x_{N}\right] }^T \in \mathbb {X}_{1}\times \mathbb {X}_{2}\times \cdots \times \mathbb {X}_{N}\), \(y \in \mathbb {Y}\), and depending on the interpretation of the fuzzy if-then rule, \(\varPhi \left( \cdot ,\cdot \right) \) denotes a t-norm (a conjunctive interpretation) [8, 16] or fuzzy implication (logical interpretation) [8, 9, 16].

If the conjunction “and” in the antecedents of the fuzzy if-then rules is represented by a t-norm T, then:

$$\begin{aligned} \mu _{\mathbf {A}}\left( \mathbf {x}\right) = \mu _{A_{1}}\left( x_{1}\right) \star _{T}\mu _{A_{2}}\left( x_{2}\right) \star _{T}\cdots \star _{T}\mu _{A_{N}}\left( x_{N}\right) , \end{aligned}$$
(2.14)

where \(A_{1},A_{2},\ldots ,A_{N}\) are fuzzy sets representing the values of linguistic variables in the antecedent of the canonical fuzzy rule.

Hence, for the conjunctive interpretation we get:

$$\begin{aligned}&\mu _{R}\left( \mathbf {x},y\right) =\mu _{R}\left( x_{1} ,\ldots ,x_{N},y\right) =\mu _{\mathbf {A}}\left( \mathbf {x}\right) \star _{T_{r}}\mu _{B}\left( y\right) =\nonumber \\&\mu _{A_{1}}\left( x_{1}\right) \star _{T}\mu _{A_{2}}\left( x_{2}\right) \star _{T}\cdots \star _{T}\mu _{A_{N}}\left( x_{N}\right) \star _{T_{r}}\mu _{B}\left( y\right) , \end{aligned}$$
(2.15)

where \(\star _{T_{r}}\) is a t-norm representing the fuzzy if-then rule, whereas for logical interpretation:

$$\begin{aligned}&\mu _{R}\left( \mathbf {x},y\right) =\mu _{R}\left( x_{1} ,\ldots ,x_{N},y\right) =\varPsi \left( \mu _{\mathbf {A}}\left( \mathbf {x}\right) , \mu _{B}\left( y\right) \right) =\nonumber \\&\varPsi \left( \mu _{A_{1}}\left( x_{1}\right) \star _{T}\mu _{A_{2}}\left( x_{2}\right) \star _{T}\ldots \star _{T}\mu _{A_{N}}\left( x_{N}\right) ,\mu _{B}\left( y\right) \right) , \end{aligned}$$
(2.16)

where \(\varPsi \left( \cdot ,\cdot \right) \) denotes a fuzzy implication.

Fuzzy implication is usually introduced using an axiomatic approach [9], where it is defined as a continuous function \(\varPsi :[0,1]\times [0,1]\rightarrow [0,1]\), which for each \(a,b,c\in [0,1]\) fulfills five necessary (general) conditions:

  1. P1:

    \(\quad \text {if } a\le c, \text {then } \varPsi \left( a,b\right) \ge \varPsi \left( c,b\right) \),

  2. P2:

    \(\quad \text {if } b\le c, \text {then } \varPsi \left( a,b\right) \le \varPsi \left( a,c\right) \),

  3. P3:

    \(\quad \varPsi \left( 0,b\right) = 1\),

  4. P4:

    \(\quad \varPsi \left( a,1\right) = 1\),

  5. P5:

    \(\quad \varPsi \left( 1,0\right) = 0\),

and eight recommended (specific) conditions [4]. Properties P3, P4, and P5 are called falsity, neutrality, and Booleanity, respectively [4, 22]. As examples we can use Lukasiewicz:

$$\begin{aligned} \varPsi \left( a,b\right) = \min \left( 1-a+b,1\right) , \end{aligned}$$
(2.17)

Reichenbach:

$$\begin{aligned} \varPsi \left( a,b\right) =1-a+ab, \end{aligned}$$
(2.18)

and Zadeh fuzzy implication:

$$\begin{aligned} \varPsi \left( a,b\right) = \max \left( 1-a,\min \left( a,b\right) \right) . \end{aligned}$$
(2.19)

A single fuzzy rule describes a local relationship between the input and output variables of the fuzzy system within the limits defined by the domain of fuzzy sets in the rule antecedent. The complete input–output mapping is represented by the whole collection of fuzzy if-then rules from the knowledge (rule) base. For further considerations we assume a base consisting of I rules in the form:

$$\begin{aligned} \mathscr {R}=\left\{ R^{(i)}\right\} _{i =1}^{I}= \left\{ \mathbf {if }\ \underset{n=1}{\overset{N}{\mathbf { and}}} \left( X_{n}\ \mathbf { is }\ L_{A_{n}}^{(i)}\right) , \ \mathbf { then }\ Y\ \mathbf { is }\ L_{B}^{(i)} \right\} _{i =1}^{I}. \end{aligned}$$
(2.20)

A well-defined fuzzy rule base should be complete, consistent, and continuous [31]. The completeness means that for each value from the input space at least one rule is activated, that is \(\exists _{i = 1,2, \ldots , I} \quad \mu _{\mathbf {A}^{(i)}}(\mathbf {x}) \ne 0\). The knowledge base is consistent if there are no rules with the same antecedent but different consequents. And finally, the knowledge base is continuous if there are no neighboring rules, for which the result of intersection of fuzzy sets in their consequents is an empty set.

The knowledge base is constructed first by acquiring knowledge about the modeled phenomenon, and next by representing it in a form of fuzzy conditional rules. In practice, there are three basic methods to create a fuzzy rule base [16]:

  • by using knowledge of a human expert or based on the physical laws describing the phenomenon (white box modeling),

  • by automatically extracting the rules based on numerical data representing the relationship between inputs and outputs of the phenomenon (black box modeling),

  • mixed, where part of the knowledge is derived from a human expert and part from automated extraction (grey box modeling).

The possible applications of a fuzzy system depend, however, not only on the properly defined knowledge base, but also on the appropriate design of an inference engine.

3 Approximate Reasoning

Inference methods originating from classical logic are based on so-called rules of inference. A rule of inference is a pattern of reasoning that explains how a conclusion may be logically derived from a given premise previously assumed to be true. One of the most commonly used rules of inference is the rule of detachment, often referred to as modus ponendo ponens (“the way that affirms by affirming”). Modus ponendo ponens (MPP) is based on two premises. The first is the conditional statement \(p \Longrightarrow q\), namely that “p implies q”. The second assumes that the antecedent p of the conditional statement is true. From these two premises it can be concluded that the consequent q is true. The MPP rule can be written as [4]:

 \(\text {Premise I (fact):}\)

p

 \(\text {Premise II (rule):}\)

\(p \Longrightarrow q\)

 \(\text {Conclusion:}\)

q

or symbolically:

$$\begin{aligned} \left( p\wedge \left( p\Longrightarrow q\right) \right) \Longrightarrow q. \end{aligned}$$
(2.21)

Binary logic assumes only two possibilities: total compliance or total noncompliance of the fact with the implication antecedent. In contrast, fuzzy inference engines use an approximate reasoning based on the generalized rules of inference. The generalized modus ponendo ponens (GMPP) may be written as [34]:

 \(\text {Premise I (fact):}\)

\(p^{\prime }\)

 \(\text {Premise II (rule):}\)

\(p \Longrightarrow q\)

 \(\text {Conclusion:}\)

\(q^{\prime }\)

or:

$$\begin{aligned} \left[ p^{\prime }\wedge \left( p\Longrightarrow q\right) \right] \Longrightarrow q^{\prime }, \end{aligned}$$
(2.22)

where statements \(p^{\prime }\) and \(q^{\prime }\) are similar, respectively, to p and q.

A conditional fuzzy rule can be defined as a fuzzy relation, and hence, the statements in antecedents and consequents as fuzzy sets. The statement X \(\mathbf {is}\) \(L_{A^{\prime }}\) is a fact, where \(L_{A^{\prime }}\) denotes the label of a linguistic variable X defined by a fuzzy set \(A^{\prime }\) on the universe \(\mathbb {X}\). The knowledge is represented by the fuzzy conditional rule “\(\mathbf {if}\) X \(\mathbf {is}\) \(L_A\), \(\mathbf {then}\) Y \(\mathbf {is}\) \(L_B\),” where \(L_A\) and \(L_B\) are the linguistic values of linguistic variables X and Y, defined by fuzzy sets A and B, on the universes \(\mathbb {X}\) and \(\mathbb {Y}\), respectively. Consequently, the inference scheme of GMPP takes the form:

 \(\text {Premise I (fact):}\)

X \(\mathbf {is}\) \(L_{A^{\prime }}\)

 \(\text {Premise II (rule):}\)

\(\mathbf {if}\) X \(\mathbf {is}\) \(L_A\), \(\mathbf {then}\) Y \(\mathbf {is}\) \(L_B\)

 \(\text {Conclusion:}\)

Y \(\mathbf {is}\) \(L_{B^{\prime }}\)

or:

$$\begin{aligned} \left[ \left( X\ \mathbf { is }\ L_{A^{\prime }} \right) \wedge \left( X\ \mathbf { is }\ L_{A}\Longrightarrow Y\ \mathbf { is }\ L_{B}\right) \right] \Longrightarrow Y\ \mathbf { is }\ L_{B^{\prime }}. \end{aligned}$$
(2.23)

The fuzzy set \(B^{\prime }\) is determined using Zadeh’s compositional rule of inference [34].

3.1 Compositional Rule of Inference

The compositional rule of inference (CRI), also known as supremum-star composition [34], is a generalization of an operation for determining the function value. The first stage of CRI is to construct a cylindrical extension of a fuzzy set \(A^{\prime }\left( x\right) \) from the universe \(\mathbb {X}\) to \(\mathbb {X}\times \mathbb {Y}\):

$$\begin{aligned} \underset{\left( x,y\right) \in \mathbb {X}\times \mathbb {Y}}{\forall }\qquad \mu _{\text {Ce}\left( A^{\prime }\right) }\left( x,y\right) =\mu _{A^{\prime }}\left( x\right) . \end{aligned}$$
(2.24)

Secondly, an intersection (logical product) of cylindrical extension \(\text {Ce}\left( A^{\prime }\right) \) and fuzzy relation R is constructed using t-norm T:

(2.25)

The final CRI outcome is a result of the \({\text {Ce}\left( A^{\prime }\right) \cap R}\) projection on \(\mathbb {Y}\):

$$\begin{aligned} \underset{y\in \mathbb {Y}}{\forall }\qquad \mu _{B^{\prime }}\left( y\right) =\,\underset{x\in \mathbb {X}}{\sup }\left[ \mu _{A^{\prime }}\left( x\right) \star _{T}\mu _{R}\left( x,y\right) \right] . \end{aligned}$$
(2.26)

The fuzzy set \(B^{\prime }\) can also be presented as a composition of a fuzzy set \(A^{\prime }\), which is an unary fuzzy relation, with conditional fuzzy rule R being a binary fuzzy relation:

$$\begin{aligned} B^{\prime }=A^{\prime }\circ R, \end{aligned}$$
(2.27)

where \(\circ \) is the operator of the supremum-t-norm composition.

The GMPP for the ith canonical fuzzy if-then rule (2.20) can be written as [16]:

$$\begin{aligned} B^{\prime (i)}=\mathbf {A}^{\prime }\circ R^{(i)}=\mathbf {A}^{\prime }\circ \left( \mathbf {A}^{(i)}\Longrightarrow B^{(i)}\right) , \end{aligned}$$
(2.28)

where \(\mathbf {A}^{\prime } = A_{1}^{\prime }\times A_{2}^{\prime }\times \cdots \times A_{N}^{\prime }\) is a multidimensional fuzzy set that defines the value of the multidimensional input linguistic variable on the space \(\underline{\mathbb {X}} = \mathbb {X}_{1}\times \mathbb {X}_{2}\times \cdots \times \mathbb {X}_{N}\).

The membership function of the conclusion \(B^{\prime (i)}\) is calculated as follows.

$$\begin{aligned}&\mu _{B^{\prime (i)}}\left( y\right) =\,\underset{\mathbf {x}\in \underline{\mathbb {X}}}{\sup }\left[ \mu _{\mathbf {A}^{\prime }}\left( \mathbf {x}\right) \star _{T_{s}}\mu _{R^{(i)}}\left( \mathbf {x},y\right) \right] =\nonumber \\&\underset{\mathbf {x}\in \underline{\mathbb {X}}}{\sup }\left[ \mu _{A_{1}^{\prime }}\left( x_{1}\right) \star _{T}\mu _{A_{2}^{\prime }}\left( x_{2}\right) \star _{T}\cdots \star _{T}\mu _{A_{N}^{\prime }}\left( x_{N}\right) \star _{T_{s}}\mu _{R^{(i)}}\left( x_{1},\ldots ,x_{N},y\right) \right] , \end{aligned}$$
(2.29)

where \(T_{s}\) is a t-norm of the supremum-t-norm composition. In the case of the conjunctive interpretation (2.15) we can write:

$$\begin{aligned}&\mu _{B^{\prime (i)}}\left( y\right) =\,\underset{\mathbf {x}\in \underline{\mathbb {X}}}{\sup }\left[ \mu _{\mathbf {A}^{\prime }}\left( \mathbf {x}\right) \star _{T_{s}}\mu _{\mathbf {A}^{(i)}}\left( \mathbf {x} \right) \star _{T_{r}}\mu _{B^{(i)}}\left( y\right) \right] =\nonumber \\&\underset{\mathbf {x}\in \underline{\mathbb {X}}}{\sup }\left[ \left( \mu _{A_{1}^{\prime }}\left( x_{1}\right) \star _{T}\mu _{A_{2}^{\prime }}\left( x_{2}\right) \star _{T}\cdots \star _{T}\mu _{A_{N}^{\prime }}\left( x_{N}\right) \right) \star _{T_{s}}\right. \nonumber \\&\left. \left( \mu _{A_{1}^{(i)}}\left( x_{1}\right) \star _{T}\mu _{A_{2}^{(i)}}\left( x_{2}\right) \star _{T}\cdots \star _{T}\mu _{A_{N}^{(i)} }\left( x_{N}\right) \right) \star _{T_{r}}\mu _{B^{(i)}}\left( y\right) \right] . \end{aligned}$$
(2.30)

And for logical interpretation (2.16) we get:

$$\begin{aligned}&\mu _{B^{\prime (i)}}\left( y\right) =\,\underset{\mathbf {x}\in \underline{\mathbb {X}}}{\sup }\left[ \mu _{\mathbf {A}^{\prime }}\left( \mathbf {x}\right) \star _{T_{s}} \varPsi \left( \mu _{\mathbf {A}^{(i)}}\left( \mathbf {x}\right) ,\mu _{B^{(i)}}\left( y\right) \right) \right] =\nonumber \\&\underset{\mathbf {x}\in \underline{\mathbb {X}}}{\sup }\left[ \left( \mu _{A_{1}^{\prime }}\left( x_{1}\right) \star _{T}\mu _{A_{2}^{\prime }}\left( x_{2}\right) \star _{T}\cdots \star _{T}\mu _{A_{N}^{\prime }}\left( x_{N}\right) \right) \star _{T_{s}}\right. \nonumber \\&\left. \varPsi \left( \mu _{A_{1}^{(i)}}\left( x_{1}\right) \star _{T}\mu _{A_{2}^{(i)}}\left( x_{2}\right) \star _{T}\cdots \star _{T}\mu _{A_{N}^{(i)} }\left( x_{N}\right) ,\mu _{B^{(i)}}\left( y\right) \right) \right] . \end{aligned}$$
(2.31)

Under certain conditions [5], logical and conjunctive interpretation of fuzzy conditional rules leads to equivalent inference results.

Equations (2.30) and (2.31) define the membership function of a fuzzy set representing the resulting conclusion of an inference using only one fuzzy if-then rule. For a knowledge base consisting of many fuzzy conditional statements it is necessary to combine conclusions from all individual rules.

3.2 Approximate Reasoning with Knowledge Base

Generally, there are two methods of approximate reasoning that can be applied to determine the outcome fuzzy set \(B^{\prime }\) on the basis of a collection of fuzzy if-then rules [4]:

  • composition-based inference (first aggregate then infer: FATI), where first a combination of all rules from the knowledge base is constructed, and then inference using the supremum-star composition is conducted,

  • individual rule-based inference (first infer then aggregate: FITA), in which the first step involves inference using the supremum-star composition for each of the rules individually and then, a combination of inference results is performed.

The FATI process of combining the rules, as well as the stage in the FITA schema of determining the resulting conclusion, is called aggregation [10]. The aggregation can be defined by introduction of the concept of the aggregation operator [16], which for I values \(x_{1},x_{2},\ldots ,x_{I}\in \left[ 0,1\right] \) represents a mapping \(\oplus :[0,1]^{I}\Rightarrow [0,1]\):

$$\begin{aligned} x=\bigoplus \limits _{i=1}^{I}x_{i}=\oplus \left( x_{1},x_{2},\ldots ,x_{I}\right) . \end{aligned}$$
(2.32)

There are various definitions of aggregation operator including logical sum, represented by an s-norm (Mamdani combination [19]), logical product, represented by a t-norm (Gödel combination [16]), as well as nonmonotonic fuzzy operations that allow conducting the inference even if part of the knowledge is missing [32]. Most of them can be defined as special cases of the generalized average operator [4]:

$$\begin{aligned} \uplus _{(\alpha )}\left( x_{1},\ldots ,x_{I}\right) =\,\underset{i=1\quad }{\overset{I\quad }{\biguplus \nolimits _{(\alpha )}}}x_{i}=\left[ \frac{1}{I}\sum \limits _{i=1}^{I}\left( x_{i}\right) ^{\alpha }\right] ^{\frac{1}{\alpha }}, \end{aligned}$$
(2.33)

for \(\alpha \in \mathbb {R}\setminus \left\{ 0\right\} \).

Consequently, the first stage of the FATI method can be defined as:

$$\begin{aligned} \mathscr {R}=\bigoplus \limits _{i=1}^{I}R^{(i)}, \end{aligned}$$
(2.34)

where \(R^{(i)}\) is the ith fuzzy relation.

Next, the outcome fuzzy set \(B_{FATI}^{\prime }\) is determined for an input fuzzy set \(\mathbf {A}^{\prime }\) using the GMPP:

$$\begin{aligned} B_{FATI}^{\prime }= \mathbf {A}^{\prime } \circ \mathscr {R} = \mathbf {A}^{\prime }\circ \left[ \bigoplus \limits _{i=1} ^{I}R^{(i)}\right] , \end{aligned}$$
(2.35)

the membership function of which is defined as

$$\begin{aligned} \mu _{B_{FATI}^{\prime }}\left( y\right)&=\,\underset{\mathbf {x} \in \underline{\mathbb {X}}}{\sup }\left[ \mu _{\mathbf {A}^{\prime }}\left( \mathbf {x}\right) \star _{T_{s}}\mu _{\mathscr {R}}\left( \mathbf {x},y\right) \right] \nonumber \\&=\,\underset{\mathbf {x}\in \underline{\mathbb {X}}}{\sup }\left\{ \mu _{\mathbf {A}^{\prime }}\left( \mathbf {x}\right) \star _{T_{s}}\left[ \bigoplus \limits _{i=1}^{I}\mu _{R^{(i)}}\left( \mathbf {x},y\right) \right] \right\} . \end{aligned}$$
(2.36)

In the case of the FITA method, first the conclusion of each fuzzy if-then rule is determined:

$$\begin{aligned} \underset{i=1,2,\ldots ,I}{\forall }\qquad B^{\prime (i)}=\mathbf {A}^{\prime }\circ R^{(i)}, \end{aligned}$$
(2.37)

the membership function of which is written as:

$$\begin{aligned} \mu _{B^{\prime (i)}}\left( y\right) =\,\underset{\mathbf {x}\in \underline{\mathbb {X}}}{\sup }\left[ \mu _{\mathbf {A}^{\prime }}\left( \mathbf {x}\right) \star _{T_{s}}\mu _{R^{(i)}}\left( \mathbf {x},y\right) \right] . \end{aligned}$$
(2.38)

During the next stage, these partial results of the inference are aggregated forming the outcome fuzzy set:

$$\begin{aligned} B_{FITA}^{\prime }=\bigoplus \limits _{i=1}^{I}\;\left( \mathbf {A}^{\prime }\circ R^{(i)}\right) , \end{aligned}$$
(2.39)

defined by the membership function:

$$\begin{aligned} \mu _{B_{FITA}^{\prime }}\left( y\right) =\bigoplus \limits _{i=1} ^{I}\;\underset{\mathbf {x}\in \underline{\mathbb {X}}}{\sup }\left[ \mu _{\mathbf {A}^{\prime }}\left( \mathbf {x}\right) \star _{T_{s}}\mu _{R^{(i)} }\left( \mathbf {x},y\right) \right] . \end{aligned}$$
(2.40)

It can be proven [7], that the results of the FATI method are a subset of those obtained using the FITA procedure:

$$\begin{aligned} B_{FATI}^{\prime }\subseteq B_{FITA}^{\prime }, \end{aligned}$$
(2.41)

that is:

$$\begin{aligned} \underset{y\in \mathbb {Y}}{\forall }\quad \mu _{B_{FATI}^{^{\prime }}}\left( y\right) \le \mu _{B_{FITA}^{^{\prime }}}\left( y\right) . \end{aligned}$$
(2.42)

Usually, for simplicity of calculations, the \(B_{FATI}^{\prime }\) is used instead of \(B_{FITA}^{\prime }\), under the assumption that the difference is insignificant [4].

3.3 Fuzzification and Defuzzification

In many applications inputs of the fuzzy systems are defined as crisp numerical data. However, approximate reasoning requires inputs to be represented as fuzzy sets. The process of mapping real values \(\mathbf {x}_{0} = \left[ x_{01},x_{02},\ldots ,x_{0N}\right] ^{\top } \in \underline{\mathbb {X}} \subset \mathbb {R}^{N}\) to an N-dimensional fuzzy set \(\mathbf {A^{\prime }}\) defined on \(\underline{\mathbb {X}}\) is called fuzzification. The fuzzification can be symbolically expressed as a transformation of N-dimensional space into a multitude of fuzzy sets [16]:

$$\begin{aligned} \underline{\mathbb {X}}\Rightarrow \mathscr {F}(\underline{\mathbb {X}}). \end{aligned}$$
(2.43)

Using membership functions we can write:

$$\begin{aligned} \underline{\mathbb {X}}\Rightarrow \left\{ \left. \mu _{\mathbf {A^{\prime }}}\left( \mathbf {x}\right) \right| \;\mathbf {x}\in \underline{\mathbb {X}},\mu _{\mathbf {A^{\prime }}}\left( \mathbf {x}\right) \in \left[ 0,1\right] \right\} . \end{aligned}$$
(2.44)

Among many definitions of a fuzzification operator, the singleton fuzzifier can be distinguished:

$$\begin{aligned} \mu _{\mathbf {A^{\prime }}}\left( \mathbf {x}\right) = \;\delta _{ \mathbf {x},\mathbf {x}_{0}} = {\left\{ \begin{array}{ll} 1, &{} \mathbf {x}=\mathbf {x}_{0},\\ 0, &{} \mathbf {x}\ne \mathbf {x}_{0}, \end{array}\right. } \end{aligned}$$
(2.45)

for which both methods of approximate reasoning (FATI and FITA) provide equivalent inference results [5].

The result of approximate reasoning is a fuzzy set \(B^{\prime }\left( y\right) \), which can be associated with a specific linguistic label. However, there are applications that require a crisp numerical inference outcome. The process of calculating a representative numerical output \(y_{0}\in \mathbb {Y}\) from the outcome fuzzy set \(B^{\prime }\left( y\right) \) on \(\mathbb {Y}\) is called defuzzification. Defuzzification is a mapping of a multitude of fuzzy sets defined on the space \(\mathbb {Y}\) to a single numerical value from \(\mathbb {Y}\) [16]:

$$\begin{aligned} \mathscr {F}\left( \mathbb {Y}\right) \rightarrow \mathbb {Y}. \end{aligned}$$
(2.46)

Using membership functions we get:

$$\begin{aligned} \left\{ \left. \mu _{B^{\prime }}\left( y\right) \right| \,y\in \mathbb {Y},\mu _{B^{\prime }}\left( y\right) \in \left[ 0,1\right] \right\} \rightarrow \mathbb {Y}. \end{aligned}$$
(2.47)

Due to the different criteria for determining which element \(y_0\) of the fuzzy set \(B^{\prime }\left( y\right) \) should be regarded as the most representative one, there are many definitions of the defuzzification procedure [6, 14, 31]. One of the most popular is a center of gravity method (COG), which specifies the result as a center of the area under the membership function \(\mu _{B^{\prime }}\left( y\right) \):

$$\begin{aligned} y_{0}=\frac{\int \limits _{\mathbb {Y}}y\,\mu _{B^{\prime }}\left( y\right) dy}{\int \limits _{\mathbb {Y}}\mu _{B^{\prime }}\left( y\right) dy}. \end{aligned}$$
(2.48)

4 Basic Types of Fuzzy Systems

Due to a wide range of possible applications there are many different types of fuzzy systems that have been proposed in the literature thus far [4, 16, 22, 23, 31]. But new solutions characterized by decreased computation complexity, improved modeling quality, or greater ease of the linguistic interpretation of the inference results are still the topic of research. The model proposed by E.H. Mamdani and S. Assilan [19] is generally regarded as the first fuzzy system presented in the literature. Currently, it can be considered as the foundation of the fuzzy models family based on if-then rules with fuzzy sets in antecedents as well as consequents.

4.1 Mamdani–Assilan Fuzzy Model

The Mamdani–Assilan fuzzy system (MAFS) uses a set of conditional fuzzy rules in the canonical form (2.20), which can be determined by a human expert. The MAFS is based on the conjunctive interpretation of fuzzy rules, where the conjunctive “and” of a rule antecedent is defined with the t-norm minimum \((\wedge )\). The inference results from individual rules are aggregated by applying the s-norm maximum \((\vee )\). The numerical inputs \(\mathbf {x}_{0} = \left[ x_{01},x_{02},\ldots ,x_{0N}\right] ^{\top }\) are mapped into fuzzy sets with the singleton fuzzifier, and the numerical outcome is calculated using the COG method. The approximate reasoning schema is realized on the basis of Eq. (2.40), which takes the form:

$$\begin{aligned} \mu _{B^{\prime }}\left( y\right) =\bigvee \limits _{i=1}^{I}\;\left[ \mu _{\mathbf {A}^{(i)}}\left( \mathbf {x}_{0}\right) \wedge \mu _{B^{(i)} }\left( y\right) \right] , \end{aligned}$$
(2.49)

where

$$\begin{aligned} \mu _{\mathbf {A}^{\left( i\right) }}\left( \mathbf {x}_{0}\right) =\mu _{A_{1}^{\left( i\right) }}\left( x_{01}\right) \wedge \mu _{A_{2}^{\left( i\right) }}\left( x_{02}\right) \wedge \cdots \wedge \mu _{A_{N}^{\left( i\right) }}\left( x_{0N}\right) . \end{aligned}$$
(2.50)

The above equation defines the so-called firing strength of the ith rule, denoted as \(F^{\left( i\right) }\left( \mathbf {x}_{0}\right) \). Hence, the formula (2.49) can also be written as

$$\begin{aligned} \mu _{B^{\prime }}\left( y\right) =\bigvee \limits _{i=1}^{I}\;\left[ F^{\left( i\right) }\left( \mathbf {x}_{0}\right) \wedge \mu _{B^{(i)} }\left( y\right) \right] . \end{aligned}$$
(2.51)

Using the COG defuzzification we get:

$$\begin{aligned} y_{0}=\frac{\int \limits _{\mathbb {Y}}y\, \mu _{B^{\prime }}\left( y\right) dy}{\int \limits _{\mathbb {Y}}\mu _{B^{\prime }}\left( y\right) dy}. \end{aligned}$$
(2.52)

Figure 2.3 shows an example of fuzzy inference using MAFS with two inputs and the knowledge base consisting of two conditional fuzzy rules.

Fig. 2.3
figure 3

Example of fuzzy inference using the Mamdani–Assilan fuzzy system with two inputs and the knowledge base consisting of two conditional fuzzy rules

The defuzzification requires high computational complexity, however, some simplifications can be applied. Using the algebraic product t-norm and the arithmetic mean as the aggregation operator we obtain a Larsen fuzzy system, which is defined as [16]:

$$\begin{aligned} \mu _{B^{\prime }}\left( y\right) =\frac{1}{I}\sum \limits _{i=1}^{I}F^{\left( i\right) }\left( \mathbf {x}_{0}\right) \mu _{B^{(i)}}\left( y\right) . \end{aligned}$$
(2.53)

By substitution of (2.53) into (2.52) we get:

$$\begin{aligned} y_{0}=\frac{\sum \limits _{i=1}^{I}F^{\left( i\right) }\left( \mathbf {x} _{0}\right) \int \limits _{\mathbb {Y}}y\,\mu _{B^{\left( i\right) }}\left( y\right) dy}{\sum \limits _{j=1}^{I}F^{\left( j\right) }\left( \mathbf {x}_{0}\right) \int \limits _{\mathbb {Y}}\mu _{B^{\left( j\right) } }\left( y\right) dy}. \end{aligned}$$
(2.54)

Denoting the area under a membership function of the fuzzy set \({B^{\left( i\right) }}\left( y\right) \) as

$$\begin{aligned} \mathscr {A}\left( \mu _{B^{\left( i\right) }}\left( y\right) \right) = \int \limits _{\mathbb {Y}}\mu _{B^{\left( i\right) }}\left( y\right) dy, \end{aligned}$$
(2.55)

and its center of gravity as \(y^{\left( i\right) }\), we can write:

$$\begin{aligned} y_{0}=\frac{\sum \limits _{i=1}^{I}F^{\left( i\right) }\left( \mathbf {x} _{0}\right) \;\mathscr {A}\left( \mu _{B^{\left( i\right) }}\left( y\right) \right) \;y^{\left( i\right) }}{\sum \limits _{j=1}^{I}F^{\left( i\right) }\left( \mathbf {x}_{0}\right) \;\mathscr {A}\left( \mu _{B^{\left( j\right) }}\left( y\right) \right) }. \end{aligned}$$
(2.56)

The above solution requires only a single calculation of the areas under the membership functions and centers of gravity locations for all fuzzy rules. By assuming additionally that \(\mathscr {A}\left( \mu _{B^{\left( i\right) }}\left( y\right) \right) \) are the same for all I consequents, we get the Sugeno–Yasukawa fuzzy model [26].

Approximate reasoning without the defuzzification necessity was presented in papers by Takagi and Sugeno [27] and Sugeno and Kang [25]. The proposed model, called the Takagi–Sugeno–Kang fuzzy system (TSKFS), is described in the following subsection.

4.2 Takagi–Sugeno–Kang Fuzzy System

The knowledge base of the TSKFS consists of conditional fuzzy rules with the consequents in the form of classical functions, the arguments of which are the input numerical data:

$$\begin{aligned} \mathscr {R}=\left\{ {R}^{(i)}\right\} _{i=1}^{I}=\left\{ \mathbf {if }\bigwedge \limits _{n=1}^{N}\left( x_{0n}\ \mathbf { is }\ L_{A_{n}}^{(i)}\right) , \mathbf { then }\ y=y^{\left( i\right) }\left( \mathbf {x}_{0}\right) \right\} _{i=1}^{I}, \end{aligned}$$
(2.57)

where \(x_{0n}\) is an input singleton, \(\mathbf {x}_{0} = \left[ x_{01},x_{02},\ldots ,x_{0N}\right] ^{\top }\), and \(y^{\left( i\right) }\left( \mathbf {x}\right) \) is the function in the ith consequent.

The output of each fuzzy rule is a crisp numerical datum \(y = y^{\left( i\right) }\left( \mathbf {x}_{0}\right) \), and the TSKFS outcome is calculated as a weighted average of individual outputs:

$$\begin{aligned} y_{0}=\frac{\sum \limits _{i=1}^{I}F^{\left( i\right) }\left( \mathbf {x} _{0}\right) \;y^{\left( i\right) }\left( \mathbf {x}_{0}\right) }{\sum \limits _{j=1}^{I}F^{\left( j\right) }\left( \mathbf {x}_{0}\right) }, \end{aligned}$$
(2.58)

where

$$\begin{aligned} F^{\left( i\right) }\left( \mathbf {x}_{0}\right) =\mu _{A_{1}^{\left( i\right) }}\left( x_{01}\right) \star _{T}\mu _{A_{2}^{\left( i\right) } }\left( x_{02}\right) \star _{T}\cdots \star _{T}\mu _{A_{N}^{\left( i\right) }}\left( x_{0N}\right) , \end{aligned}$$
(2.59)

is the firing strength and \(\star _{T}\) is a t-norm (usually a minimum or algebraic product).

Equation (2.58) can be interpreted as a mixture of experts, each modeled by a single fuzzy rule. Each rule defines the relationship between outputs and inputs of the system in the relevant input range. The weighted average of statements from all local experts (rules) determines the reasoning result. The weight, represented by the firing strength of the rule, specifies the influence level of a single expert on the final inference outcome.

The consequent of the ith TSKFS fuzzy rule can also be understood as a singleton [4], the location of which is determined by the function \(y^{\left( i\right) }\left( \mathbf {x}\right) \):

$$\begin{aligned} \mu _{B^{\left( i\right) }}\left( y\right) = \;\delta _{y,y^{\left( i\right) }} = \left\{ \begin{array} [c]{ll} 1, &{} y=y^{\left( i\right) }\left( \mathbf {x}_{0}\right) ,\\ 0, &{} y\ne y^{\left( i\right) }\left( \mathbf {x}_{0}\right) . \end{array} \right. \end{aligned}$$
(2.60)

Hence, the TSKFS is usually referred to as the fuzzy system with “moving” singletons. The term “moving” relates to the relationship between a singleton location and the input numerical data. The amplitude (height) of the singleton after the approximate reasoning is defined by the firing strength of a rule.

The TSKFS consequents are frequently defined as linear functions (first-order polynomials):

$$\begin{aligned} y^{\left( i\right) }\left( \mathbf {x}_{0}\right) =p_{0}^{\left( i\right) }+p_{1}^{\left( i\right) }x_{01}+p_{2}^{\left( i\right) }x_{02} +\cdots +p_{N}^{\left( i\right) }x_{0N}=\mathbf {p}^{\left( i\right) \top }\mathbf {x}_{0}^{\prime }, \end{aligned}$$
(2.61)

where \(\mathbf {p}^{\left( i\right) }\) is the \((N + 1)\)-dimensional vector of parameters of the function \(y^{\left( i\right) }\left( \mathbf {x}\right) \), and \(\mathbf {x}_{0}^{\prime }\) denotes the extended input vector:

$$\begin{aligned} \mathbf {x}_{0}^{\prime }=\left[ \begin{array} [c]{ll} 1 &{} \mathbf {x}_{0}\\ \end{array} \right] ^{\top } . \end{aligned}$$
(2.62)

A collection of simple linear functions \(y^{\left( i\right) }\left( \mathbf {x}\right) \) allows for modeling the most complex input–output relationships. Overlapping areas of antecedents in neighboring rules ensure smooth switching between the local models.

Fig. 2.4
figure 4

An example of approximate reasoning with Takagi–Sugeno–Kang fuzzy system with two inputs and two fuzzy if-then rules

An example of TSKFS inference with two inputs and two conditional fuzzy rules is shown in Fig. 2.4. The main advantage of the TSKFS is the low computational effort required to determine the numerical output of the system as the inference process does not involve defuzzification. However, it does not allow for the application of different interpretations of the fuzzy rules and different types of aggregation operators. This is due to the application of singletons in the rules consequents. The artificial neural network based fuzzy inference system (ANNBFIS) [17] is devoid of such disadvantages. The ANNBFIS combines the benefits of the usage of a fuzzy set in the rule consequent (as in the MAFS) together with the dependency of the consequent location on system inputs (as in the TSKFS) [4, 15, 16]. Another extension of the TSKFS is the Tsukamoto fuzzy system (TFS) [28]. The main difference between TSKFS and TFS is the method of determining the singleton location in the consequent of the fuzzy rule. In TFS it is defined using a monotonic function as well as a firing strength of the rule.

4.3 Tsukamoto Fuzzy System

The knowledge base of TFS is a collection of fuzzy conditional statements in the form:

$$\begin{aligned} {R}^{(i)} = \mathbf {if }\bigwedge \limits _{n=1}^{N}\left( x_{0n}\ \mathbf { is }\ L_{A_{n}}^{(i)}\right) ,\mathbf { then }\ y=f_{i}^{-1}\left( F^{\left( i\right) }\left( \mathbf {x}_{0}\right) \right) , \end{aligned}$$
(2.63)

where \(f_{i}\left( y\right) \) is a monotonic function in the ith consequent.

For the firing strength equal to \(F^{\left( i\right) }\left( \mathbf {x}_{0}\right) \) the consequent is a singleton with the amplitude \(F^{\left( i\right) }\left( \mathbf {x}_{0}\right) \) and the location \(y^{\left( i\right) }\) such that \(F^{\left( i\right) }\left( \mathbf {x}_{0}\right) = f_{i}\left( y^{\left( i\right) }\right) \):

$$\begin{aligned} \mu _{B^{\prime \left( i\right) }}\left( y\right) =F^{\left( i\right) }\left( \mathbf {x}_{0}\right) \;\delta _{y,y^{\left( i\right) }}=\left\{ \begin{array} [c]{ll} F^{\left( i\right) }\left( \mathbf {x}_{0}\right) , &{} y=y^{\left( i\right) },\\ 0, &{} y\ne y^{\left( i\right) }, \end{array} \right. \end{aligned}$$
(2.64)

where \(y^{\left( i\right) } = f_{i}^{-1}\left( F^{\left( i\right) }\left( \mathbf {x}_{0}\right) \right) \).

The inference outcome of the TFS is calculated as a weighted average of singleton locations from all rules, with weights defined as the rules firing strengths:

$$\begin{aligned} y_{0}=\frac{\sum \limits _{i=1}^{I}F^{\left( i\right) }\left( \mathbf {x} _{0}\right) \;y^{\left( i\right) }}{\sum \limits _{j=1}^{I}F^{\left( j\right) }\left( \mathbf {x}_{0}\right) }=\frac{\sum \limits _{i=1} ^{I}F^{\left( i\right) }\left( \mathbf {x}_{0}\right) \;f_{i}^{-1}\left( F^{\left( i\right) }\left( \mathbf {x}_{0}\right) \right) }{\sum \limits _{j=1}^{I}F^{\left( j\right) }\left( \mathbf {x}_{0}\right) }. \end{aligned}$$
(2.65)

An example of the Tsukamoto approximate reasoning with two inputs and two fuzzy if-then rules is shown in Fig. 2.5.

Fig. 2.5
figure 5

Example of the Tsukamoto approximate reasoning with two inputs and two fuzzy if-then rules

The TFS is rarely used due to the difficulty in obtaining the conditional fuzzy rules from a human expert in the form (2.63). For the same reasons the Baldwin fuzzy system (BFS) [1, 2] is difficult to apply in practice. The BFS represents a different approach to fuzzy modeling, which is not based on Zadeh’s compositional rule of inference but on reasoning using fuzzy truth value restrictions. The literature describes many other interesting proposals of fuzzy models, including those based on interval-valued fuzzy sets and type-2 fuzzy sets. A detailed overview can be found, for example, in [13, 18, 29, 30].

5 Summary

In this chapter we discussed basic problems related to the idea of fuzzy systems based on the Zadeh compositional rule of inference. The presentation started with explaining the concepts of the linguistic variable and fuzzy conditional statement. Next, different types of the fuzzy if-then rules and various methods of their mathematical representation were presented. Also, an overview of the compositional rule of inference proposed by Zadeh was introduced. General theoretical considerations on approximate reasoning were supplemented with examples of elementary fuzzy models. We described the basic solutions being the foundation of many modern constructions including fuzzy systems of Mamdani–Assilan, Takagi–Sugeno–Kang, and Tsukamoto.