One-Class Classification by Combining Density and Class Probability Estimation

Hempstalk, Kathryn; Frank, Eibe; Witten, Ian H.

doi:10.1007/978-3-540-87479-9_51

Kathryn Hempstalk¹,
Eibe Frank¹ &
Ian H. Witten¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5211))

Included in the following conference series:

Joint European Conference on Machine Learning and Knowledge Discovery in Databases

6779 Accesses
71 Citations
1 Altmetric

Abstract

One-class classification has important applications such as outlier and novelty detection. It is commonly tackled using density estimation techniques or by adapting a standard classification algorithm to the problem of carving out a decision boundary that describes the location of the target data. In this paper we investigate a simple method for one-class classification that combines the application of a density estimator, used to form a reference distribution, with the induction of a standard model for class probability estimation. In this method, the reference distribution is used to generate artificial data that is employed to form a second, artificial class. In conjunction with the target class, this artificial class is the basis for a standard two-class learning problem. We explain how the density function of the reference distribution can be combined with the class probability estimates obtained in this way to form an adjusted estimate of the density function of the target class. Using UCI datasets, and data from a typist recognition problem, we show that the combined model, consisting of both a density estimator and a class probability estimator, can improve on using either component technique alone when used for one-class classification. We also compare the method to one-class classification using support vector machines.

Download to read the full chapter text

Chapter PDF

One-class classifier based on principal curves

Article 16 June 2023

Assessing the Reliability of a Multi-Class Classifier

A Fast k-Nearest Neighbor Classifier Using Unsupervised Clustering

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Abe, N., Zadrozny, B., Langford, J.: Outlier detection by active learning. In: Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 767–772. ACM Press, New York (2006)
Chapter Google Scholar
Barnett, V., Lewis, T.: Outliers in Statistical Data. John Wiley & Sons, West Sussex (1994)
MATH Google Scholar
Chang, C., Lin, C.: LIBSVM: A Library for Support Vector Machines (2001), http://www.csie.ntu.edu.tw/~cjlin/libsvm
Dowland, P., Furnell, S., Papadaki, M.: Keystroke analysis as a method of advanced user authentication and response. In: Proceedings of the IFIP TC11 17th International Conference on Information Security, Deventer, The Netherlands, pp. 215–226. Kluwer, Dordrecht (2002)
Google Scholar
Gunetti, D., Picardi, C.: Keystroke analysis of free text. ACM Transactions on Information and System Security 8(3), 312–347 (2005)
Article Google Scholar
Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning. Springer, New York (2001)
MATH Google Scholar
Monrose, F., Rubin, A.: Keystroke dynamics as a biometric for authentication. In: Future Generation Computer Systems, vol. 16, pp. 351–359. Elsevier Science, Amsterdam (2000)
Google Scholar
Nisenson, M., Yariv, I., El-Yaniv, R., Meir, R.: Towards behaviometric security systems: Learning to identify a typist. In: Lavrač, N., Gamberger, D., Todorovski, L., Blockeel, H. (eds.) PKDD 2003. LNCS (LNAI), vol. 2838, pp. 363–374. Springer, Heidelberg (2003)
Google Scholar
Pearson, R.: Mining Imperfect Data. Society for Industrial and Applied Mechanics, USA (2005)
MATH Google Scholar
Provost, F., Domingos, P.: Tree induction for probability-based ranking. Machine Learning 52(3), 199–215 (2003)
Article MATH Google Scholar
Roth, V.: Kernel fisher discriminants for outlier detection. Neural Computing 18(4), 942–960 (2006)
Article MATH Google Scholar
Schölkopf, B., Williamson, R., Smola, A., Shawe-Taylor, J., Platt, J.: Support vector method for novelty detection. In: Advances in Neural Information Processing Systems, vol. 12, pp. 582–588. MIT Press, Cambridge (2000)
Google Scholar
Tarassenko, L., Hayton, P., Cerneaz, N., Brady, M.: Novelty detection for the identification of masses in mammograms. In: Proceedings of the Fourth International IEEE Conference on Artificial Neural Networks, London, pp. 442–447. IEEE, Los Alamitos (1995)
Chapter Google Scholar
Tax, D.: One-class Classification, Concept-learning in the Absence of Counter-examples. PhD thesis, Delft University of Technology, Netherlands (2001)
Google Scholar
Tax, D., Duin, R.: Combining one-class classifiers. In: Kittler, J., Roli, F. (eds.) MCS 2001. LNCS, vol. 2096, pp. 299–308. Springer, Heidelberg (2001)
Chapter Google Scholar
Ypma, A., Duin, R.: Support objects for domain approximation. In: Proceedings of the 8th International Conference on Artificial Neural Networks, pp. 719–724. Springer, Berlin (1998)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Waikato, Hamilton, NZ,
Kathryn Hempstalk, Eibe Frank & Ian H. Witten

Authors

Kathryn Hempstalk
View author publications
You can also search for this author in PubMed Google Scholar
Eibe Frank
View author publications
You can also search for this author in PubMed Google Scholar
Ian H. Witten
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Walter Daelemans Bart Goethals Katharina Morik

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hempstalk, K., Frank, E., Witten, I.H. (2008). One-Class Classification by Combining Density and Class Probability Estimation. In: Daelemans, W., Goethals, B., Morik, K. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2008. Lecture Notes in Computer Science(), vol 5211. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87479-9_51

Download citation

DOI: https://doi.org/10.1007/978-3-540-87479-9_51
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87478-2
Online ISBN: 978-3-540-87479-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

One-Class Classification by Combining Density and Class Probability Estimation

Abstract

Chapter PDF

Similar content being viewed by others

One-class classifier based on principal curves

Assessing the Reliability of a Multi-Class Classifier

A Fast k-Nearest Neighbor Classifier Using Unsupervised Clustering

Keywords

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

One-Class Classification by Combining Density and Class Probability Estimation

Abstract

Chapter PDF

Similar content being viewed by others

One-class classifier based on principal curves

Assessing the Reliability of a Multi-Class Classifier

A Fast k-Nearest Neighbor Classifier Using Unsupervised Clustering

Keywords

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation