Abstract
This paper proposes a new approach for handwritten digit image classification using a nonparametric Bayesian probabilistic model, called multinomialized subset infinite relational model (MSIRM). MSIRM realizes a three-way clustering, i.e., a simultaneous clustering of digit images, pixel columns, and pixel rows, where the numbers of clusters are adjusted automatically with Chinese restaurant process (CRP). We obtain MSIRM as a modification of subset infinite relational model (SIRM) by Ishiguro et al [4] While this modification is straightforward, our application of MSIRM to handwritten digit image classification leads to an impressive result. To represent a large number of training digit images in a compact form, we cluster the training images and then classify a test image to the class of the cluster most similar to the test image. By extending this line of thought, MSIRM clusters not only digit images but also pixel columns and pixel rows to obtain a more compact representation. With this three-way clustering, we achieved 2.95% and 5.38% test error rates for MNIST and USPS datasets, respectively.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Ciresan, D.C., Meier, U., Schmidhuber, J.: Multi-column deep neural networks for image classification. In: Proc. of CVPR 2012, 3642–3649 (2012)
Dhillon, I.S.: Co-clustering documents and words using bipartite spectral graph partitioning. In: Proc. of KDD 2001, pp. 269–274 (2001)
Halkias, X., Paris, S., Glotin, H.: Sparse penalty in deep belief networks: using the mixed norm constraint. In: Proc. of ICLR 2013 (2013)
Ishiguro, K., Ueda, N., Sawada, H.: Subset infinite relational models. JMLR W&CP 22, 547–555 (2012); AISTATS 2012
Keglevic, M., Sablatnig, R.: Digit recognition in handwritten weather records. In: Proc. of OAGM/AAPR Workshop (2013)
Keysers, D., Dahmen, J., Theiner, T., Ney, H.: Experiments with an extended tangent distance. In: Proc. of ICPR 2000, pp. 38–42 (2000)
Maji, S., Malik, J.: Fast and Accurate Digit Classification. Tech. Rep. No. UCB/EECS-2009-159 (2009)
Masada, T., Takasu, A.: Trimming prototypes of handwritten digit images with subset infinite relational model. In: Proc. of MUE 2013, pp. 129–134 (2013)
Minka, T.P.: Estimating a Dirichlet distribution (2000), http://research.microsoft.com/en-us/um/people/minka/papers/dirichlet/
Neal, R.M.: Markov chain sampling methods for Dirichlet process mixture models. J. Comput. Graph. Statist. 9(2), 249–265 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Masada, T., Takasu, A. (2013). Three-Way Nonparametric Bayesian Clustering for Handwritten Digit Image Classification. In: Lee, M., Hirose, A., Hou, ZG., Kil, R.M. (eds) Neural Information Processing. ICONIP 2013. Lecture Notes in Computer Science, vol 8228. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-42051-1_20
Download citation
DOI: https://doi.org/10.1007/978-3-642-42051-1_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-42050-4
Online ISBN: 978-3-642-42051-1
eBook Packages: Computer ScienceComputer Science (R0)