Skip to main content

Advertisement

Log in

Obtaining linguistic fuzzy rule-based regression models from imprecise data with multiobjective genetic algorithms

  • Focus
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

Backfitting of fuzzy rules is an Iterative Rule Learning technique for obtaining the knowledge base of a fuzzy rule-based system in regression problems. It consists in fitting one fuzzy rule to the data, and replacing the whole training set by the residual of the approximation. The obtained rule is added to the knowledge base, and the process is repeated until the residual is zero, or near zero. Such a design has been extended to imprecise data for which the observation error is small. Nevertheless, when this error is moderate or high, the learning can stop early. In this kind of algorithms, the specificity of the residual might decrease when a new rule is added. There may happen that the residual grows so wide that it covers the value zero for all points (thus the algorithm stops), but we have not yet extracted all the information available in the dataset. Focusing on this problem, this paper is about datasets with medium to high discrepancies between the observed and the actual values of the variables, such as those containing missing values and coarsely discretized data. We will show that the quality of the iterative learning degrades in this kind of problems, because it does not make full use of all the available information. As an alternative to sequentially obtaining rules, we propose a new multiobjective Genetic Cooperative Competitive Learning (GCCL) algorithm. In our approach, each individual in the population codifies one rule, which competes in the population in terms of maximum coverage and fitting, while the individuals in the population cooperate to form the knowledge base.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Notes

  1. The results of FGCCL on the two dropped problems building and elec can be found in Table 6. In both cases, FGCCL is better than all GFSs in reference (Sánchez et al. 2006), but the difference is not relevant in this context.

References

  • Alcala J et al (2008) KEEL: a software tool to assess evolutionary algorithms for data mining problems. Soft Comput (in press)

  • Cordón O, Herrera F (2000) A proposal for improving the accuracy of linguistic modeling. IEEE Trans Fuzzy Syst 8(3):335–344

    Article  Google Scholar 

  • Cordón O, Herrera F, Sánchez L (1999) Solving electrical distribution problems using hybrid evolutionary data analysis techniques. Appl Intell 10(1):5–24

    Article  Google Scholar 

  • Cornelis C, Kerre E (2003) A fuzzy inference methodology based on the fuzzification of set inclusion. In: Recent advances in intelligent paradigms and applications, Physica-Verlag, pp 71–89

  • Couso I, Sánchez L (2008) Higher order models for fuzzy random variables. Fuzzy Sets Syst 159:237–258

    Article  Google Scholar 

  • Deb K, Pratap A, Agarwal S, Meyarevian T (2002) A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans Evol Comput 6(2):182–197

    Article  Google Scholar 

  • del Jesus MJ, Hoffmann F, Junco L, Sánchez L (2004) Induction of fuzzy-rule-based classifiers with evolutionary boosting algorithms. IEEE Trans Fuzzy Syst 12(3):296–308

    Article  Google Scholar 

  • Dubois D, Prade H (1987) The mean value of a fuzzy number. Fuzzy Sets Syst 24(3):279–300

    Article  MATH  Google Scholar 

  • Ein-Dor P, Feldmesser J (1987) Attributes of the performance of central processing units: a relative performance prediction model. Commun ACM 30(4):308–317

    Article  Google Scholar 

  • Ferson S, Kreinovich V, Hajagos J, Oberkampf W, Ginzburg L (2007) Experimental uncertainty estimation and statistics for data having interval uncertainty. Technical Report SAND2007-0939, Sandia National Laboratories

  • Friedman J (1991) Multivariate adaptive regression splines. Ann Stat 19:1–141

    Article  MATH  Google Scholar 

  • Friedman J, Hastie T, Tibshirani R (1998) Additive logistic regression: a statistical view of boosting. Mach Learn

  • Greene DP, Smith SF (1993) Competition-based induction of decision models from examples. Mach Learn 3:229–257

    Article  Google Scholar 

  • Herrera F (2005) Genetic fuzzy systems: status, critical considerations and future directions. Int J Comput Intell Res 1(1):59–67

    Google Scholar 

  • Herrera F (2008) Genetic fuzzy systems: taxonomy, current research trends and prospects. Evol Intell 1:27–46

    Article  Google Scholar 

  • Ishibuchi H, Nakashima T, Murata T (1999) Performance evaluation of fuzzy classifier systems for multidimensional pattern classification problems. IEEE Trans Syst Man Cybern Cybern 29(5):601–618

    Article  Google Scholar 

  • Juang CF, Lin JY, Lin CT (2000) Genetic reinforcement learning through symbiotic evolution for fuzzy controller design. IEEE Trans Syst Man Cybern B Cybern 30(2):290–302

    Article  MathSciNet  Google Scholar 

  • Koeppen M, Franke K, Nickolay B (2003) Fuzzy-Pareto-dominance driven multi-objective genetic algorithm. In: Proceedings of 10th international fuzzy systems assotiation world congress (IFSA), Istanbul, pp 450–453

  • Limbourg P (2005) Multi-objective optimization of problems with epistemic uncertainty. EMO 2005:413–427

  • Mallat S, Zhang Z (1993) Matching pursuits with time–frequency dictionaries. IEEE Trans Signal Process 41:3397–3415

    Article  MATH  Google Scholar 

  • Marín E, Sánchez L (2004) Supply estimation using coevolutionary genetic algorithms in the Spanish electrical market. Appl Intell 21(1):7–24

    Article  Google Scholar 

  • Nozaki K, Ishibuchi H, Tanaka H (1997) A simple but powerful heuristic method for generating fuzzy rules from numerical data. Fuzzy Sets Syst 86:251–270

    Article  Google Scholar 

  • Otero J, Sanchez L (2006) Induction of descriptive fuzzy classifiers with the Logitboost algorithm. Soft Comput 10(9):825–835

    Google Scholar 

  • Prechelt L (1994) PROBEN1—a set of benchmarks and benchmarking rules for neural network training algorithms. Tech. Rep. 21/94, Fakultat fur Informatik, Universitat Karlsruhe

  • Press W et al (1992) Numerical recipes in C. The art of scientific computing. Cambridge University Press, New York

  • Sánchez L, Couso I (2007) Advocating the use of imprecisely observed data in genetic fuzzy systems. IEEE Trans Fuzzy Syst 15(4):551–562

    Article  Google Scholar 

  • Sánchez L, Otero J (2004) A fast genetic method for inducting descriptive fuzzy models. Fuzzy Sets Syst 141(1):33–46

    Article  MATH  Google Scholar 

  • Sánchez L, Otero J (2007) Boosting fuzzy rules in classification problems under single-winner inference. Int J Intell Syst 22(9):1021–1034

    Article  MATH  Google Scholar 

  • Sánchez L, Villar JR (2008) Obtaining transparent models of chaotic systems with multiobjective simulated annealing algorithms. Inform Sci 178(4):952–970

    Article  MATH  Google Scholar 

  • Sánchez L, Casillas J, Cordón O et al (2002) Some relationships between fuzzy and random set-based classifiers and models. Int J Approx Reason 29(2):175–213

    Article  MATH  Google Scholar 

  • Sánchez L, Otero J, Villar JR (2006) Boosting of fuzzy models for high-dimensional imprecise datasets. In: Proceedings of IPMU 2006, Paris, pp 1965–1973

  • Sánchez L, Couso I, Casillas J (2007) Modelling vague data with genetic fuzzy systems under a combination of crisp and imprecise criteria. In: Proceedings of 2007 IEEE symposium on Computational Intellignece in multicriteria decision making, Honolulu, pp 30–37

  • Sánchez L, Couso I, Casillas J (2009) Genetic learning of fuzzy rules based on low quality data. Fuzzy Sets Syst (submitted)

  • Takagi T, Sugeno M (1985) Fuzzy identification of systems and its application to modeling and control. IEEE Trans Syst Man Cybern 15(1):116–132

    MATH  Google Scholar 

  • Teich J (2001) Pareto-front exploration with uncertain objectives. EMO 2001:314–328

  • Wang LX, Mendel J (1992) Generating fuzzy rules by learning from examples. IEEE Trans Syst Man Cybern 25(2):353–361

    MathSciNet  Google Scholar 

Download references

Acknowledgments

This work was supported by the Spanish Ministry of Education and Science, under grants TIN2005-08036-C05-05 and TIN2005-08036-C05-01.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Luciano Sánchez.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Sánchez, L., Otero, J. & Couso, I. Obtaining linguistic fuzzy rule-based regression models from imprecise data with multiobjective genetic algorithms. Soft Comput 13, 467–479 (2009). https://doi.org/10.1007/s00500-008-0362-4

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-008-0362-4

Keywords

Navigation