Classification of incomplete data integrating neural networks and evidential reasoning

Choudhury, Suvra Jyoti; Pal, Nikhil R.

doi:10.1007/s00521-021-06267-1

Classification of incomplete data integrating neural networks and evidential reasoning

S.I. : Neuro, fuzzy and their Hybridization
Published: 25 October 2021

Volume 35, pages 7267–7281, (2023)
Cite this article

Neural Computing and Applications Aims and scope Submit manuscript

302 Accesses
3 Citations
Explore all metrics

Abstract

When missing data are imputed by any method, there is some uncertainty associated with the imputed value. Consequently, when such imputed data are classified, some uncertainty will be propagated to the classifier output. This leads to two issues to address. First, reducing the uncertainty in the imputed value. Second, modeling and processing of the uncertainty associated with the classifier output to arrive at a better decision. To deal with the first issue, we use a latent space representation, while for the second issue we use Dempster-Shafer evidence theory. First, we train a neural network using the data without any missing value to generate a latent space representation of the input. The complete data set is now extended by deleting every feature once. These missing values are estimated using a nearest neighbor-based scheme. The network is then refined using this extended dataset to obtain a better latent space. This mechanism is expected to reduce the effect of the missing data on the latent space representation. Using the latent space representation of the complete data, we train two classifiers, support vector machines and evidential t-nearest neighbors. To classify an input with a missing value, we make a rough estimate of the missing value using the nearest neighbor rule and generate its latent space representation for classification by the classifiers. Using each classifier output, we generate a basic probability assignment (BPA) and all BPAs are combined to get an overall BPA. Final classification is done using Pignistic probabilities computed on the overall BPA. We use three different ways to defining BPAs. To avoid some problems of Dempster’s rule of aggregation, we also use several alternative aggregations including some T-norm-based methods. Note that, T-norm has been used for combination of belief function in Pichon and Denœux (in: NAFIPS 2008: 2008 annual meeting of the North American fuzzy information processing society, pp 1–6, 2008). To demonstrate the superiority of the proposed method, we compare its performance with four state-of-the-art techniques using both artificial and real datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Systematic Review on Supervised and Unsupervised Machine Learning Algorithms for Data Science

Imbalanced data preprocessing techniques for machine learning: a systematic mapping study

Article 09 November 2022

A survey on ensemble learning

Article 30 August 2019

References

Allison PD (2001) Missing data: Sage university papers series on quantitative applications in the social sciences (07–136), Thousand Oaks, CA
Choudhury SJ, Pal NR (2019) Classification of incomplete data using autoencoder and evidential reasoning. In: IFIP international conference on artificial intelligence applications and innovations. Springer, pp 167–177
Choudhury SJ, Pal NR (2021) Deep and structure-preserving autoencoders for clustering data with missing information. IEEE Trans Emerg Top Comput Intell 5(4):639–650. https://doi.org/10.1109/TETCI.2019.2949264
Article MathSciNet Google Scholar
Choudhury SJ, Pal NR (2019) Imputation of missing data with neural networks for classification. Knowl Based Syst. https://doi.org/10.1016/j.knosys.2019.07.009
Article Google Scholar
Chung D, Merat FL (1996) Neural network based sensor array signal processing. In: IEEE/SICE/RSJ international conference on multisensor fusion and integration for intelligent systems, 1996. IEEE, pp 757–764
Cobb BR, Shenoy PP (2003) A comparison of methods for transforming belief function models to probability models. In: European conference on symbolic and quantitative approaches to reasoning and uncertainty. Springer, pp 255–266
DENOEUX T (1995) A k-nearest neighbor classification rule based on Dempster–Shafer theory. IEEE Trans Syst Man Cybern 25(5):804–813
Article Google Scholar
Dixon JK (1979) Pattern recognition with partly missing data. IEEE Trans Syst Man Cybern 9(10):617–621
Article Google Scholar
Dubois D, Prade H (1988) Representation and combination of uncertainty with belief functions and possibility measures. Comput Intell 4(3):244–264
Article Google Scholar
Fessant F, Midenet S (2002) Self-organising map for data imputation and correction in surveys. Neural Comput Appl 10(4):300–310
Article MATH Google Scholar
García-Laencina PJ, Sancho-Gómez JL, Figueiras-Vidal AR (2010) Pattern classification with missing data: a review. Neural Comput Appl 19(2):263–282
Article Google Scholar
Gautam C, Ravi V (2015) Counter propagation auto-associative neural network based data imputation. Inf Sci 325:288–299
Article Google Scholar
Gautam C, Ravi V (2015) Data imputation via evolutionary computation, clustering and a neural network. Neurocomputing 156:134–142
Article Google Scholar
Kalton G (1983) Compensating for missing survey data. Inst for Social Research the Univ
Kofman P, Sharpe IG (2003) Using multiple imputation in the analysis of incomplete observations in finance. J Financ Econom 1(2):216–249
Google Scholar
Krstulovic J, Miranda V, Costa AJS, Pereira J (2013) Towards an auto-associative topology state estimator. IEEE Trans Power Syst 28(3):3311–3318
Article Google Scholar
Kumar S (2004) Neural networks: a classroom approach. Tata McGraw-Hill Education, New York
Google Scholar
Lefevre E, Colot O, Vannoorenberghe P (2002) Belief function combination and conflict management. Inf fusion 3(2):149–162
Article Google Scholar
Little RJ, Rubin DB (2014) Statistical analysis with missing data. Wiley, Hoboken
MATH Google Scholar
Liu Z.g, Dezert J, Pan Q, Mercier G (2011) Combination of sources of evidence with different discounting factors based on a new dissimilarity measure. Decis Support Syst 52(1):133–141
Article Google Scholar
Liu ZG, Pan Q, Mercier G, Dezert J (2015) A new incomplete pattern classification method based on evidential reasoning. IEEE Trans Cybern 45(4):635–646
Article Google Scholar
Marseguerra M, Zoia A (2005) The autoassociative neural network in signal analysis: II. Application to on-line monitoring of a simulated BWR component. Ann Nucl Energy 32(11):1207–1223
Article Google Scholar
Marwala T, Chakraverty S (2006) Fault classification in structures with incomplete measured data using autoassociative neural networks and genetic algorithm. Curr Sci 90:542–548
Google Scholar
Miranda V, Krstulovic J, Keko H, Moreira C, Pereira J (2012) Reconstructing missing data in state estimation with autoencoders. IEEE Trans Power Syst 27(2):604–611
Article Google Scholar
Morin R, Raeside B (1981) A reappraisal of distance-weighted \( k \)-nearest neighbor classification for pattern recognition with missing data. IEEE Trans Syst Man Cybern 3:241–243
Article MathSciNet Google Scholar
Narayanan S, Marks R, Vian JL, Choi J, El-Sharkawi M, Thompson BB (2002) Set constraint discovery: missing sensor data restoration using autoassociative regression machines. In: Proceedings of the 2002 international joint conference on neural networks, 2002. IJCNN’02, vol 3. IEEE, pp 2872–2877
Narayanan S, Vian JL, Choi J, Marks R, El-Sharkawi M, Thompson BB (2003) Missing sensor data restoration for vibration sensors on a jet aircraft engine. In: Proceedings of the international joint conference on neural networks, 2003, vol 4. IEEE, pp 3007–3010
Nowicki R (2009) Rough neuro-fuzzy structures for classification with missing data. IEEE Trans Syst Man Cybern Part B (Cybern) 39(6):1334–1347
Article Google Scholar
Pichon F, Denœux T (2008) T-norm and uninorm-based combination of belief functions. In: NAFIPS 2008: 2008 annual meeting of the North American fuzzy information processing society, pp 1–6
Platt J et al (1999) Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. Adv Large Margin Classif 10(3):61–74
Google Scholar
Qiao W, Gao Z, Harley RG, Venayagamoorthy GK (2008) Robust neuro-identification of nonlinear plants in electric power systems with missing sensor measurements. Eng Appl Artif Intell 21(4):604–618
Article Google Scholar
Samad T, Harp S.A (1992) Self-organization with partial data. Netw Comput Neural Syst 3(2):205–212
Article Google Scholar
Schafer JL (1997) Analysis of incomplete multivariate data. CRC Press, Cambridge
Book MATH Google Scholar
Sentz K, Ferson S et al (2002) Combination of evidence in Dempster–Shafer theory, vol 4015. Citeseer, Princeton
Book Google Scholar
Shafer G (1976) A mathematical theory of evidence, vol 42. Princeton University Press, Princeton
Book MATH Google Scholar
Silva-Ramírez EL, Pino-Mejías R, López-Coello M (2015) Single imputation with multilayer perceptron and multiple imputation combining multilayer perceptron and k-nearest neighbours for monotone patterns. Appl Soft Comput 29:65–74
Article Google Scholar
Silva-Ramírez EL, Pino-Mejías R, López-Coello M, Cubiles-de-la Vega MD (2011) Missing value imputation on missing completely at random data using multilayer perceptrons. Neural Netw 24(1):121–129
Article Google Scholar
Smarandache F, Dezert J (2009) Advances and Applications of DSmT for Information Fusion Collected works. American Research Press, vol 3, p 760
Smets P (1990) The combination of evidence in the transferable belief model. IEEE Trans Pattern Anal Mach Intell 12(5):447–458
Article Google Scholar
Smets P (2007) Analyzing the combination of conflicting belief functions. Inf Fusion 8(4):387–412
Article Google Scholar
Thompson BB, Marks R, El-Sharkawi MA (2003) On the contractive nature of autoencoders: application to missing sensor restoration. In: Proceedings of the international joint conference on neural networks, 2003, vol 4. IEEE, pp 3011–3016
Tsang I, Kwok J, Cheung P, Cristianini N (2005) Core vector machines: fast SVM training on very large data sets. J Mach Learn Res 6:363–392
MathSciNet MATH Google Scholar
Westin LK (2004) Missing data and the preprocessing perception, page 3, Umea University, ISSN-0348-0542
Yager RR (1987) Quasi-associative operations in the combination of evidence. Kybernetes 16(1):37–41
Article MathSciNet MATH Google Scholar
Yang JB, Xu DL (2013) Evidential reasoning rule for evidence combination. Artif Intell 205:1–29
Article MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Electronics and Communication Sciences Unit, Indian Statistical Institute, Calcutta, 700108, India
Suvra Jyoti Choudhury & Nikhil R. Pal

Authors

Suvra Jyoti Choudhury
View author publications
You can also search for this author in PubMed Google Scholar
Nikhil R. Pal
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Suvra Jyoti Choudhury.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest regarding the publication of this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Choudhury, S.J., Pal, N.R. Classification of incomplete data integrating neural networks and evidential reasoning. Neural Comput & Applic 35, 7267–7281 (2023). https://doi.org/10.1007/s00521-021-06267-1

Download citation

Received: 13 October 2020
Accepted: 26 June 2021
Published: 25 October 2021
Issue Date: April 2023
DOI: https://doi.org/10.1007/s00521-021-06267-1

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Classification of incomplete data integrating neural networks and evidential reasoning

Abstract

Access this article

Similar content being viewed by others

A Systematic Review on Supervised and Unsupervised Machine Learning Algorithms for Data Science

Imbalanced data preprocessing techniques for machine learning: a systematic mapping study

A survey on ensemble learning

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Classification of incomplete data integrating neural networks and evidential reasoning

Abstract

Access this article

Similar content being viewed by others

A Systematic Review on Supervised and Unsupervised Machine Learning Algorithms for Data Science

Imbalanced data preprocessing techniques for machine learning: a systematic mapping study

A survey on ensemble learning

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation