Abstract
We propose a novel method for detecting outliers based on the leave-one-out density. The leave-one-out density of a datum is defined as a ratio of the number of data inside a region to the volume of the region after the datum is removed from an original data set. We propose an efficient algorithm that evaluates the leave-one-out density of each datum on a set of regions around the datum by using binary decision diagrams. The time complexity of the proposed method is near linear with respect to the size of a data set, while the outlier detection accuracy is still comparable to other methods. Experimental results show the usefulness of the proposed method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bache, K., Lichman, M.: UCI machine learning repository (2013), http://archive.ics.uci.edu/ml
Beckmann, N., Kriegel, H., Schneider, R., Seeger, B.: The R*-tree: an efficient and robust access method for points and rectangles. SIGMOD Rec. 19(2), 322–331 (1990)
Brace, K., Rudell, R., Bryant, R.: Efficient implementation of a BDD package. In: The 27th ACM/IEEE Design Automation Conference, pp. 40–45 (1990)
Breunig, M., Kriegel, H., Ng, R., Sander, J.: LOF: Identifying density-based local outliers. In: SIGMOD Conference, pp. 93–104 (2000)
Bryant, R.: Graph-based algorithms for boolean function manipulation. IEEE Trans. Computers 35(8), 677–691 (1986)
Caputo, B., Sim, K., Furesjo, F., Smola, A.: Appearance-based object recognition using svms: Which kernel should I use? In: NIPS Workshop on Statistical Methods for Computational Experiments in Visual Processing and Computer Vision (2002)
Chandola, V., Banerjee, A., Kumar, V.: Anomaly detection: A survey. ACM Computing Surveys (CSUR)Â 41(3), 15 (2009)
Fawcett, T.: An introduction to ROC analysis. Pattern Recognition Letters 27(8), 861–874 (2006)
Karatzoglou, A., Smola, A., Hornik, K., Zeileis, A.: kernlab – an S4 package for kernel methods in R. Journal of Statistical Software 11(9), 1–20 (2004)
Kutsuna, T.: A binary decision diagram-based one-class classifier. In: The 10th IEEE International Conference on Data Mining, pp. 284–293 (December 2010)
Kutsuna, T., Yamamoto, A.: A parameter-free approach for one-class classification using binary decision diagrams. Intelligent Data Analysis 18(5) (to appear, 2014)
Lazarevic, A., Ertoz, L., Kumar, V., Ozgur, A., Srivastava, J.: A comparative study of anomaly detection schemes in network intrusion detection. In: Proceedings of SIAM Conference on Data Mining (2003)
Schölkopf, B., Platt, J., Shawe-Taylor, J., Smola, A., Williamson, R.: Estimating the support of a high-dimensional distribution. Neural Computation 13(7), 1443–1471 (2001)
Somenzi, F.: CUDD: CU decision diagram package, http://vlsi.colorado.edu/~fabio/CUDD/
Somenzi, F.: Binary decision diagrams. In: Calculational System Design, vol. 173, pp. 303–366. IOS Press (1999)
Torgo, L.: Data Mining with R, learning with case studies (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Kutsuna, T., Yamamoto, A. (2014). Outlier Detection Based on Leave-One-Out Density Using Binary Decision Diagrams. In: Tseng, V.S., Ho, T.B., Zhou, ZH., Chen, A.L.P., Kao, HY. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2014. Lecture Notes in Computer Science(), vol 8444. Springer, Cham. https://doi.org/10.1007/978-3-319-06605-9_40
Download citation
DOI: https://doi.org/10.1007/978-3-319-06605-9_40
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-06604-2
Online ISBN: 978-3-319-06605-9
eBook Packages: Computer ScienceComputer Science (R0)