Supervised Multiclass Classifier for Autocoding Based on Partition Coefficient

  • Conference paper
Intelligent Decision Technologies 2018 (KES-IDT 2018 2018)

Abstract

Classifying objects by classification codes is an important data-processing task in official statistics. In a previous study, we developed a supervised multiclass classifier for autocoding that has the advantages of simplicity and practical calculation time. However, that algorithm classified a few objects incorrectly. To address this problem, we propose a new supervised multiclass classifier that extends the previous algorithm by applying the idea of the partition coefficient or partition entropy. Numerical evaluation shows that the proposed algorithm performs better than the previous one.
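
For readers unfamiliar with the two measures named in the abstract, the following is a minimal sketch of the partition coefficient and partition entropy as classically defined for a fuzzy membership matrix (each row holds one object's degrees of membership across classes and sums to 1). It is not the authors' classifier, whose details are not given on this page; the function names and sample membership values are illustrative assumptions.

```python
import math


def partition_coefficient(memberships):
    """Mean over objects of the sum of squared memberships.

    Equals 1 for a crisp assignment and 1/K when memberships are
    spread uniformly over K classes.
    """
    n = len(memberships)
    return sum(u * u for row in memberships for u in row) / n


def partition_entropy(memberships):
    """Mean over objects of -sum(u * log(u)), using the natural log.

    Equals 0 for a crisp assignment and log(K) when memberships are
    spread uniformly over K classes. Zero memberships contribute 0.
    """
    n = len(memberships)
    return -sum(u * math.log(u) for row in memberships for u in row if u > 0) / n


# Hypothetical membership matrices: two objects, three candidate codes.
crisp = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
uniform = [[1 / 3, 1 / 3, 1 / 3], [1 / 3, 1 / 3, 1 / 3]]

if __name__ == "__main__":
    print(partition_coefficient(crisp), partition_entropy(crisp))
    print(partition_coefficient(uniform), partition_entropy(uniform))
```

Both measures summarize how clear-cut an assignment is: a high partition coefficient (or low partition entropy) indicates that the memberships concentrate on one class, while values near 1/K (or log K) indicate an ambiguous assignment.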

Acknowledgements

We would like to thank Kaggle for making the Stack Overflow dataset available.

Author information

Corresponding author

Correspondence to Yukako Toko.

Copyright information

© 2019 Springer International Publishing AG, part of Springer Nature

About this paper

Cite this paper

Toko, Y., Wada, K., Iijima, S., Sato-Ilic, M. (2019). Supervised Multiclass Classifier for Autocoding Based on Partition Coefficient. In: Czarnowski, I., Howlett, R., Jain, L., Vlacic, L. (eds) Intelligent Decision Technologies 2018. KES-IDT 2018 2018. Smart Innovation, Systems and Technologies, vol 97. Springer, Cham. https://doi.org/10.1007/978-3-319-92028-3_6
