Abstract
The classification of objects based on classification codes is an important task for data processing in the field of official statistics. In our previous study, the supervised multiclass classifier was developed for autocoding, which has the advantages of simplicity and practical calculation time. However, the previous algorithm classified a few objects incorrectly. To address this problem, a new supervised multiclass classifier is proposed that extends the previously proposed classifier algorithm by applying the idea of partition coefficient or partition entropy. Numerical evaluation shows that the proposed algorithm has a better performance as compared to the previously proposed algorithm.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York (1981)
Bezdek, J.C., Keller J., Krisnapuram, R., Pal, N.R.: Fuzzy Models and Algorithms for Pattern Recognition and Image Processing. Kluwer Academic Publishers, Dordrecht (1999)
Gweon, H., Schonlau, M., Kaczmirek, L., Blohm, M., Steiner, S.: Three methods for occupation coding based on statistical learning. J. Off. Stat. 33(1), 101–122 (2017)
Hacking, W., Willenborg, L.: Method series theme: coding; interpreting short descriptions using a classification. In: Statistics Methods. Statistics Netherlands (2012). https://www.cbs.nl/en-gb/our-services/methods/statistical-methods/throughput/throughput/coding. Accessed 16 Jan 2018
Kudo, T., Yamamoto, K., Matsumoto, Y.: Applying conditional random fields to Japanese morphological analysis. In: The 2004 Conference on Empirical Methods in Natural Language Processing on Proceedings, Barcelona, Spain, pp. 230–237 (2004)
Shimono, T., Wada, K., Toko, Y.: A supervised multiclass classifier using machine learning algorithm for autocoding. Res. Mem. Off. Stat. 75, 41–60 (2018). (in Japanese)
Toko, Y., Wada, K., Kawano, M.: A supervised multiclass classifier for an autocoding system. J. Rom. Stat. Rev. 4, 29–39 (2017)
Tsubaki, H., Wada, K., Toko, T.: An extension of Taguchi’s T method and standardized misclassification rate for supervised classification with only binary inputs. In: Proceedings of the ANQ Congress, Kathmandu, Nepal (2017)
Xu, J., Wang, P., Tian, G., Xu, B., Zhao, J., Wang, F., Hao, H.: Short text clustering via convolutional neural networks. In: NAACL-HLT on Proceedings, Denver, Colorado, USA, pp. 62–69 (2015)
Acknowledgements
We would like to thank Kaggle for making the Stack Overflow dataset available.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Toko, Y., Wada, K., Iijima, S., Sato-Ilic, M. (2019). Supervised Multiclass Classifier for Autocoding Based on Partition Coefficient. In: Czarnowski, I., Howlett, R., Jain, L., Vlacic, L. (eds) Intelligent Decision Technologies 2018. KES-IDT 2018 2018. Smart Innovation, Systems and Technologies, vol 97. Springer, Cham. https://doi.org/10.1007/978-3-319-92028-3_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-92028-3_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-92027-6
Online ISBN: 978-3-319-92028-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)