Skip to main content

Three-Way Nonparametric Bayesian Clustering for Handwritten Digit Image Classification

  • Conference paper
  • 4313 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8228))

Abstract

This paper proposes a new approach for handwritten digit image classification using a nonparametric Bayesian probabilistic model, called multinomialized subset infinite relational model (MSIRM). MSIRM realizes a three-way clustering, i.e., a simultaneous clustering of digit images, pixel columns, and pixel rows, where the numbers of clusters are adjusted automatically with Chinese restaurant process (CRP). We obtain MSIRM as a modification of subset infinite relational model (SIRM) by Ishiguro et al [4] While this modification is straightforward, our application of MSIRM to handwritten digit image classification leads to an impressive result. To represent a large number of training digit images in a compact form, we cluster the training images and then classify a test image to the class of the cluster most similar to the test image. By extending this line of thought, MSIRM clusters not only digit images but also pixel columns and pixel rows to obtain a more compact representation. With this three-way clustering, we achieved 2.95% and 5.38% test error rates for MNIST and USPS datasets, respectively.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ciresan, D.C., Meier, U., Schmidhuber, J.: Multi-column deep neural networks for image classification. In: Proc. of CVPR 2012, 3642–3649 (2012)

    Google Scholar 

  2. Dhillon, I.S.: Co-clustering documents and words using bipartite spectral graph partitioning. In: Proc. of KDD 2001, pp. 269–274 (2001)

    Google Scholar 

  3. Halkias, X., Paris, S., Glotin, H.: Sparse penalty in deep belief networks: using the mixed norm constraint. In: Proc. of ICLR 2013 (2013)

    Google Scholar 

  4. Ishiguro, K., Ueda, N., Sawada, H.: Subset infinite relational models. JMLR W&CP 22, 547–555 (2012); AISTATS 2012

    Google Scholar 

  5. Keglevic, M., Sablatnig, R.: Digit recognition in handwritten weather records. In: Proc. of OAGM/AAPR Workshop (2013)

    Google Scholar 

  6. Keysers, D., Dahmen, J., Theiner, T., Ney, H.: Experiments with an extended tangent distance. In: Proc. of ICPR 2000, pp. 38–42 (2000)

    Google Scholar 

  7. Maji, S., Malik, J.: Fast and Accurate Digit Classification. Tech. Rep. No. UCB/EECS-2009-159 (2009)

    Google Scholar 

  8. Masada, T., Takasu, A.: Trimming prototypes of handwritten digit images with subset infinite relational model. In: Proc. of MUE 2013, pp. 129–134 (2013)

    Google Scholar 

  9. Minka, T.P.: Estimating a Dirichlet distribution (2000), http://research.microsoft.com/en-us/um/people/minka/papers/dirichlet/

  10. Neal, R.M.: Markov chain sampling methods for Dirichlet process mixture models. J. Comput. Graph. Statist. 9(2), 249–265 (2000)

    MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Masada, T., Takasu, A. (2013). Three-Way Nonparametric Bayesian Clustering for Handwritten Digit Image Classification. In: Lee, M., Hirose, A., Hou, ZG., Kil, R.M. (eds) Neural Information Processing. ICONIP 2013. Lecture Notes in Computer Science, vol 8228. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-42051-1_20

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-42051-1_20

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-42050-4

  • Online ISBN: 978-3-642-42051-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics