Abstract
One-class classification is a special multi-class approach where data from only a single class are available for classifier training. It is an approach with several applications in real-world scenarios, for instance, for outlier or novelty detection. This paper presents a new one-class classifier based on principal curves. The method exploits the good capacity of data representation of the principal curves to build a compact data representation of the known class. The use of principal curves gives the proposed method a good capacity for dealing with different shapes of the feature space, leading to better performance rates. The results showed high performances of the proposed method for synthetic and real data sets, outperforming other known one-class learning algorithms. Moreover, it builds decision boundaries more uniform around the known class than other models and is a fast method during the operating stage since classification is performed by simply mapping the Euclidean distances from data to the principal curve.
Similar content being viewed by others
Data availability
The real data sets used in this work are available in the Outlier Detection Datasets (ODDS) Library [30], which may be accessed by the link http://odds.cs.stonybrook.edu/. All synthetic data used in this work are available in the GitHub link https://github.com/fmborges2/PC_Classifier.
References
Mishra PK, Gautam C, Aruna T (2021) Minimum variance embedded auto-associative kernel extreme learning machine for one-class classification. Neural Comput Appl 33:12973–12987
Irigoien I, Sierra B, Arenas C (2014) Towards application of one-class classification methods to medical data. Sci World J 730712
Gözüaçık Ömer, Can F (2021) Concept learning using one-class classifiers for implicit drift detection in evolving data streams. Artif Intell Rev 54:3725–3747
Dzhoha A, Rozora I (2023) Multi-armed bandit problem with online clustering as side information. J Comput Appl Math 427:115132. https://doi.org/10.1016/j.cam.2023.115132
Slivkins A (2014) Contextual bandits with similarity information. J Mach Learn Res 15:2533–2568
Liu FT, Ting KM, Zhou Z-H (2008) Isolation forest. In: 2008 Eighth IEEE international conference on data mining. IEEE, pp 413–422
Schölkopf B, Platt JC, Shawe-Taylor J, Smola AJ, Williamson RC (2001) Estimating the support of a high-dimensional distribution. Neural Comput 13(7):1443–1471
Deng F, Chen Y, Dou J, Liu C, Chen Z, Blaabjerg F (2022) Isolation forest based submodule open-circuit fault localization method for modular multilevel converters. IEEE Trans Ind Electron 1–12
Zhang K, Kang X, Li S (2019) Isolation forest for anomaly detection in hyperspectral images. In: IGARSS 2019-2019 IEEE international geoscience and remote sensing symposium. IEEE, pp 437–440
Saari J, Strömbergsson D, Lundberg J, Thomson A (2019) Detection and identification of windmill bearing faults using a one-class support vector machine (svm). Measurement 137:287–301
Parras J, Zazo S (2019) Using one class svm to counter intelligent attacks against an sprt defense mechanism. Ad Hoc Netw 94:101946–101954
Borges FEM, Pinto AWM, Ribeiro DA, Barbosa TS, Pereira DA, Barbosa BHG, Magalhães RR, Ferreira DD (2020) An unsupervised method based on support vector machines and higher-order statistics for mechanical faults detection. IEEE Latin Am Trans 18(6):1093–1101
Alam S, Sonbhadra SK, Agarwal S, Nagabhushan P (2020) One-class support vector classifiers: a survey. Knowl Based Syst 196:105754–105772
Breunig MM, Kriegel H-P, Ng RT, Sander J (2000) Lof: identifying density-based local outliers. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, pp 93–104
Li A, Xu W, Liu Z, Shi Y (2021) Improved incremental local outlier detection for data streams based on the landmark window model. Knowl Inf Syst 63(8):2129–2155
Qiu Y, Dong T, Lin D, Zhao B, Cao W, Jiang F (2022) Fault diagnosis for lithium-ion battery energy storage systems based on local outlier factor. J Energy Storage 55:105470
Hastie T, Stuetzle W (1989) Principal curves. J Am Stat Assoc 84(406):502–516
Chang K-y, Ghosh J (1998) Principal curve classifier-a nonlinear approach to pattern classification. In: 1998 IEEE international joint conference on neural networks proceedings. IEEE world congress on computational intelligence (Cat. No. 98CH36227), vol 1. IEEE, pp 695–700
Chang K-Y, Ghosh J (1998) Principal curves for nonlinear feature extraction and classification. In: Applications of artificial neural networks in image processing III, vol 3307. SPIE, pp 120–129
Ferreira DD, de Seixas JM, Cerqueira AS, Duque CA, Bollen MHJ, Ribeiro PF (2015) A new power quality deviation index based on principal curves. Electr Power Syst Res 125:8–14
Sousa LPO, Fukushima KL, Scagion VP, Facure MHM, Correa DS, Oliveira JE, Ferreira DD (2021) A principal curves-based method for electronic tongue data analysis. IEEE Sens J 21(4):4957–4965. https://doi.org/10.1109/JSEN.2020.3031737
Moraes ECC, Ferreira DD, Vitor GB, Barbosa BHG (2020) Data clustering based on principal curves. Adv Data Anal Classif 14(1):77–96
Verbeek JJ, Vlassis N, Kröse B (2002) A k-segments algorithm for finding principal curves. Pattern Recognit Lett 23(8):1009–1017
Campos GO, Zimek A, Sander J, Campello RJ, Micenková B, Schubert E, Assent I, Houle ME (2016) On the evaluation of unsupervised outlier detection: measures, datasets, and an empirical study. Data Min Knowl Discov. 30:891–927
Quinlan JR, Compton PJ, Horn K, Lazarus L (1987) Inductive knowledge acquisition: a case study. In: Proceedings of the Second Australian conference on applications of expert systems, pp 137–156
Wolberg WH, Mangasarian OL (1990) Multisurface method of pattern separation for medical diagnosis applied to breast cytology. Proc Natl Acad Sci 87(23):9193–9196
Woods KS, Solka JL, Priebe CE, W Philip Kegelmeyer J, Doss CC, Bowyer KW (1994) Comparative evaluation of pattern recognition techniques for detection of microcalcifications in mammography, pp 213–231
Bandaragoda TR, Ting KM, Albrecht D, Liu FT, Wells JR (2014) Efficient anomaly detection by isolation using nearest neighbour ensemble. In: 2014 IEEE international conference on data mining workshop. IEEE, pp 698–705
Koko RRZ, Yassine IA, Wahed MA, Madete JK, Rushdi MA (2023) Dynamic construction of outlier detector ensembles with bisecting k-means clustering. IEEE Access 11:24431–24447
Rayana S (2016) ODDS Library. http://odds.cs.stonybrook.edu
Acknowledgements
This work has been supported by the Brazilian agencies CNPq and FAPEMIG.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
de Melo Borges, F.E., Mota, O.F., Ferreira, D.D. et al. One-class classifier based on principal curves. Neural Comput & Applic 35, 19015–19024 (2023). https://doi.org/10.1007/s00521-023-08721-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-023-08721-8