A classification and regression technique to handle heterogeneous and imperfect information

Garrido, M. Carmen; Cadenas, Jose M.; Bonissone, Piero P.

doi:10.1007/s00500-009-0509-y

A classification and regression technique to handle heterogeneous and imperfect information

Original Paper
Published: 22 October 2009

Volume 14, pages 1165–1185, (2010)
Cite this article

Soft Computing Aims and scope Submit manuscript

M. Carmen Garrido¹,
Jose M. Cadenas¹ &
Piero P. Bonissone²

184 Accesses
11 Citations
Explore all metrics

Abstract

Imperfect information inevitably appears in real situations for a variety of reasons. Although efforts have been made to incorporate imperfect data into learning and inference methods, there are many limitations as to the type of data, uncertainty and imprecision that can be handled. In this paper, we propose a classification and regression technique to handle imperfect information. We incorporate the handling of imperfect information into both the learning phase, by building the model that represents the situation under examination, and the inference phase, by using such a model. The model obtained is global and is described by a Gaussian mixture. To show the efficiency of the proposed technique, we perform a comparative study with a broad baseline of techniques available in literature tested with several data sets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A survey of transfer learning

Article Open access 28 May 2016

Learning from imbalanced data: open challenges and future directions

Article Open access 22 April 2016

Data clustering: application and trends

Article 27 November 2022

References

Asuncion A, Newman DJ (2007) UCI machine learning repository (http://www.ics.uci.edu/mlearn/MLRepository.html). University of California, School of Information and Computer Science, Irvine
Cadenas JM, Garrido MC (2002) Imperfección Explícita versus adaptación de la información en MFGN extendido, In: Proceedings of the XI Congreso Español sobre Tecnologías y Lógica Fuzzy, León, Spain, pp 547–552
Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc Ser B 39(19):1–38
MATH MathSciNet Google Scholar
Dubois D, Prade H (1980) Fuzzy sets and systems: theory and applications. Academic Press, Cambridge
MATH Google Scholar
Dubois D, Prade H (1988) Possibility theory: an approach to computerized processing of uncertainty. Plenum Press, New York
MATH Google Scholar
Duda RO, Hart PE, Stork DG (1991) Pattern classification. Wiley, NY
Google Scholar
Fukunaga K (1990) Introduction to statistical pattern recognition. Academic Press, Boston
MATH Google Scholar
Garrido MC, Cadenas JM, Ruiz A (1999) Visual object detection using fuzzy concepts. In: Proceedings of the International ICSC Congress on computational intelligence methods and applications, Rochester, USA, pp 1–6
Grabisch M, Disport F (1992) A comparison of some methods of fuzzy classification on real data. In: Proceedings of the 2nd international conference on fuzzy logic and neural network. Iizuka, Japan, pp 659–662
Guan JW, Bell DA (1991) Evidence theory and its applications. Studies in computer science and artificial intelligence, vol 1. Elsevier Science Inc., North-Holland
Google Scholar
Guan JW, Bell DA (1992) Evidence theory and its applications. Studies in computer science and artificial intelligence, vol 2. Elsevier Science Inc., North-Holland
Google Scholar
Hu Y-C (2008) A novel fuzzy classifier with Choquet integral-based grey relational analysis for pattern classfication problems. Soft Comput 12:523–533
Article MATH Google Scholar
Janikow CZ (1996) Exemplar learning in fuzzy decision trees. In: Proceeding of the FUZZ-IEEE, New Orleans, USA, pp 1500–1505
Janikow CZ (1998) Fuzzy decision trees: issues and methods. IEEE Trans Man Systems Cybern 28:1–14
Article Google Scholar
Klir GJ (1992) Probabilistic versus possibilistic conceptualization of uncertainty. In: Ayyub BM, Gupta MM, Kanal LN (eds) Analysis and management of uncertainty: theory and applications. Elsevier, Amsterdam, pp 13–25
Google Scholar
Klir GJ, Wierman MJ (1998) Uncertainty-based information. Physica-Verlag, Heidelberg
MATH Google Scholar
Kotsiantis SB, Kanellopoulos D, Pintelas PE (2006) Local boosting of decision stumps for regression and classification problems. J Comput 1(4):30–37
Google Scholar
Mackay DJC (2003) Information theory, inference and learning algorithms. Cambridge University Press, Cambridge
MATH Google Scholar
McLachlan GJ, Krishnan T (1997) The EM algorithm and extensions. Wiley, New York
MATH Google Scholar
McLachlan GJ, Peel D (2000) Finite mixtures models. Wiley, New York
Book Google Scholar
Mitra S, Pal SK (1995) Fuzzy multi-layer perceptron, inferencing and rule generation. IEEE Trans Neural Netw 6:51–63
Article Google Scholar
Negoita CV, Ralescu DA (1975) Representation theorems for fuzzy concepts. Kybernetes 4:169–174
Article MATH Google Scholar
Quinlan JR (1993) C4.5: programs for machine learning. The Morgan Kaufmann Series in Machine Learning, California
Ruiz A, López de Teruel PE, Garrido MC (1998) Probabilistic inference from arbitrary uncertainty using mixtures of factorized generalized Gaussians. J Artif Intell Res 9:167–217
MATH Google Scholar
Ruthven I, Lalmas M (2002) Using Dempster–Shafer’s theory of evidence to combine aspects of information use. J Intell Inf Syst 19(3):267–301
Article Google Scholar
Witten IH, Frank E (2000) Data mining. Morgan Kaufmann Publishers, San Francisco
Google Scholar
Yager RR, Kacprzyk J, Fedrizzi M (eds) (1994) Advances in the Dempster–Shafer theory of evidence. Wiley, New York

Download references

Acknowledgments

This study was supported by the Project TIN2008-06872-C04-03 of the MICINN of Spain and European Fund for Regional Development. We also thank the Funding Program for Research Groups of Excellence with code 04552/GERM/06 granted by the “Fundación Séneca” (Spain).

Author information

Authors and Affiliations

Department of Ingeniería de la Información y las Comunicaciones, Universidad de Murcia, Murcia, Spain
M. Carmen Garrido & Jose M. Cadenas
GE Global Research, One Research Circle, Niskayuna, NY, 12309, USA
Piero P. Bonissone

Authors

M. Carmen Garrido
View author publications
You can also search for this author in PubMed Google Scholar
Jose M. Cadenas
View author publications
You can also search for this author in PubMed Google Scholar
Piero P. Bonissone
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jose M. Cadenas.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Garrido, M.C., Cadenas, J.M. & Bonissone, P.P. A classification and regression technique to handle heterogeneous and imperfect information. Soft Comput 14, 1165–1185 (2010). https://doi.org/10.1007/s00500-009-0509-y

Download citation

Published: 22 October 2009
Issue Date: September 2010
DOI: https://doi.org/10.1007/s00500-009-0509-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A classification and regression technique to handle heterogeneous and imperfect information

Abstract

Access this article

Similar content being viewed by others

A survey of transfer learning

Learning from imbalanced data: open challenges and future directions

Data clustering: application and trends

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A classification and regression technique to handle heterogeneous and imperfect information

Abstract

Access this article

Similar content being viewed by others

A survey of transfer learning

Learning from imbalanced data: open challenges and future directions

Data clustering: application and trends

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation