Abstract
When missing data are imputed by any method, there is some uncertainty associated with the imputed value. Consequently, when such imputed data are classified, some uncertainty will be propagated to the classifier output. This leads to two issues to address. First, reducing the uncertainty in the imputed value. Second, modeling and processing of the uncertainty associated with the classifier output to arrive at a better decision. To deal with the first issue, we use a latent space representation, while for the second issue we use Dempster-Shafer evidence theory. First, we train a neural network using the data without any missing value to generate a latent space representation of the input. The complete data set is now extended by deleting every feature once. These missing values are estimated using a nearest neighbor-based scheme. The network is then refined using this extended dataset to obtain a better latent space. This mechanism is expected to reduce the effect of the missing data on the latent space representation. Using the latent space representation of the complete data, we train two classifiers, support vector machines and evidential t-nearest neighbors. To classify an input with a missing value, we make a rough estimate of the missing value using the nearest neighbor rule and generate its latent space representation for classification by the classifiers. Using each classifier output, we generate a basic probability assignment (BPA) and all BPAs are combined to get an overall BPA. Final classification is done using Pignistic probabilities computed on the overall BPA. We use three different ways to defining BPAs. To avoid some problems of Dempster’s rule of aggregation, we also use several alternative aggregations including some T-norm-based methods. Note that, T-norm has been used for combination of belief function in Pichon and Denœux (in: NAFIPS 2008: 2008 annual meeting of the North American fuzzy information processing society, pp 1–6, 2008). To demonstrate the superiority of the proposed method, we compare its performance with four state-of-the-art techniques using both artificial and real datasets.
Similar content being viewed by others
References
Allison PD (2001) Missing data: Sage university papers series on quantitative applications in the social sciences (07–136), Thousand Oaks, CA
Choudhury SJ, Pal NR (2019) Classification of incomplete data using autoencoder and evidential reasoning. In: IFIP international conference on artificial intelligence applications and innovations. Springer, pp 167–177
Choudhury SJ, Pal NR (2021) Deep and structure-preserving autoencoders for clustering data with missing information. IEEE Trans Emerg Top Comput Intell 5(4):639–650. https://doi.org/10.1109/TETCI.2019.2949264
Choudhury SJ, Pal NR (2019) Imputation of missing data with neural networks for classification. Knowl Based Syst. https://doi.org/10.1016/j.knosys.2019.07.009
Chung D, Merat FL (1996) Neural network based sensor array signal processing. In: IEEE/SICE/RSJ international conference on multisensor fusion and integration for intelligent systems, 1996. IEEE, pp 757–764
Cobb BR, Shenoy PP (2003) A comparison of methods for transforming belief function models to probability models. In: European conference on symbolic and quantitative approaches to reasoning and uncertainty. Springer, pp 255–266
DENOEUX T (1995) A k-nearest neighbor classification rule based on Dempster–Shafer theory. IEEE Trans Syst Man Cybern 25(5):804–813
Dixon JK (1979) Pattern recognition with partly missing data. IEEE Trans Syst Man Cybern 9(10):617–621
Dubois D, Prade H (1988) Representation and combination of uncertainty with belief functions and possibility measures. Comput Intell 4(3):244–264
Fessant F, Midenet S (2002) Self-organising map for data imputation and correction in surveys. Neural Comput Appl 10(4):300–310
García-Laencina PJ, Sancho-Gómez JL, Figueiras-Vidal AR (2010) Pattern classification with missing data: a review. Neural Comput Appl 19(2):263–282
Gautam C, Ravi V (2015) Counter propagation auto-associative neural network based data imputation. Inf Sci 325:288–299
Gautam C, Ravi V (2015) Data imputation via evolutionary computation, clustering and a neural network. Neurocomputing 156:134–142
Kalton G (1983) Compensating for missing survey data. Inst for Social Research the Univ
Kofman P, Sharpe IG (2003) Using multiple imputation in the analysis of incomplete observations in finance. J Financ Econom 1(2):216–249
Krstulovic J, Miranda V, Costa AJS, Pereira J (2013) Towards an auto-associative topology state estimator. IEEE Trans Power Syst 28(3):3311–3318
Kumar S (2004) Neural networks: a classroom approach. Tata McGraw-Hill Education, New York
Lefevre E, Colot O, Vannoorenberghe P (2002) Belief function combination and conflict management. Inf fusion 3(2):149–162
Little RJ, Rubin DB (2014) Statistical analysis with missing data. Wiley, Hoboken
Liu Z.g, Dezert J, Pan Q, Mercier G (2011) Combination of sources of evidence with different discounting factors based on a new dissimilarity measure. Decis Support Syst 52(1):133–141
Liu ZG, Pan Q, Mercier G, Dezert J (2015) A new incomplete pattern classification method based on evidential reasoning. IEEE Trans Cybern 45(4):635–646
Marseguerra M, Zoia A (2005) The autoassociative neural network in signal analysis: II. Application to on-line monitoring of a simulated BWR component. Ann Nucl Energy 32(11):1207–1223
Marwala T, Chakraverty S (2006) Fault classification in structures with incomplete measured data using autoassociative neural networks and genetic algorithm. Curr Sci 90:542–548
Miranda V, Krstulovic J, Keko H, Moreira C, Pereira J (2012) Reconstructing missing data in state estimation with autoencoders. IEEE Trans Power Syst 27(2):604–611
Morin R, Raeside B (1981) A reappraisal of distance-weighted \( k \)-nearest neighbor classification for pattern recognition with missing data. IEEE Trans Syst Man Cybern 3:241–243
Narayanan S, Marks R, Vian JL, Choi J, El-Sharkawi M, Thompson BB (2002) Set constraint discovery: missing sensor data restoration using autoassociative regression machines. In: Proceedings of the 2002 international joint conference on neural networks, 2002. IJCNN’02, vol 3. IEEE, pp 2872–2877
Narayanan S, Vian JL, Choi J, Marks R, El-Sharkawi M, Thompson BB (2003) Missing sensor data restoration for vibration sensors on a jet aircraft engine. In: Proceedings of the international joint conference on neural networks, 2003, vol 4. IEEE, pp 3007–3010
Nowicki R (2009) Rough neuro-fuzzy structures for classification with missing data. IEEE Trans Syst Man Cybern Part B (Cybern) 39(6):1334–1347
Pichon F, Denœux T (2008) T-norm and uninorm-based combination of belief functions. In: NAFIPS 2008: 2008 annual meeting of the North American fuzzy information processing society, pp 1–6
Platt J et al (1999) Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. Adv Large Margin Classif 10(3):61–74
Qiao W, Gao Z, Harley RG, Venayagamoorthy GK (2008) Robust neuro-identification of nonlinear plants in electric power systems with missing sensor measurements. Eng Appl Artif Intell 21(4):604–618
Samad T, Harp S.A (1992) Self-organization with partial data. Netw Comput Neural Syst 3(2):205–212
Schafer JL (1997) Analysis of incomplete multivariate data. CRC Press, Cambridge
Sentz K, Ferson S et al (2002) Combination of evidence in Dempster–Shafer theory, vol 4015. Citeseer, Princeton
Shafer G (1976) A mathematical theory of evidence, vol 42. Princeton University Press, Princeton
Silva-Ramírez EL, Pino-Mejías R, López-Coello M (2015) Single imputation with multilayer perceptron and multiple imputation combining multilayer perceptron and k-nearest neighbours for monotone patterns. Appl Soft Comput 29:65–74
Silva-Ramírez EL, Pino-Mejías R, López-Coello M, Cubiles-de-la Vega MD (2011) Missing value imputation on missing completely at random data using multilayer perceptrons. Neural Netw 24(1):121–129
Smarandache F, Dezert J (2009) Advances and Applications of DSmT for Information Fusion Collected works. American Research Press, vol 3, p 760
Smets P (1990) The combination of evidence in the transferable belief model. IEEE Trans Pattern Anal Mach Intell 12(5):447–458
Smets P (2007) Analyzing the combination of conflicting belief functions. Inf Fusion 8(4):387–412
Thompson BB, Marks R, El-Sharkawi MA (2003) On the contractive nature of autoencoders: application to missing sensor restoration. In: Proceedings of the international joint conference on neural networks, 2003, vol 4. IEEE, pp 3011–3016
Tsang I, Kwok J, Cheung P, Cristianini N (2005) Core vector machines: fast SVM training on very large data sets. J Mach Learn Res 6:363–392
Westin LK (2004) Missing data and the preprocessing perception, page 3, Umea University, ISSN-0348-0542
Yager RR (1987) Quasi-associative operations in the combination of evidence. Kybernetes 16(1):37–41
Yang JB, Xu DL (2013) Evidential reasoning rule for evidence combination. Artif Intell 205:1–29
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest regarding the publication of this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Choudhury, S.J., Pal, N.R. Classification of incomplete data integrating neural networks and evidential reasoning. Neural Comput & Applic 35, 7267–7281 (2023). https://doi.org/10.1007/s00521-021-06267-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-021-06267-1