Abstract
Deep neural networks can perform complex transformations for classification and automatic feature extraction. Their training can be time consuming and can require a large number of numerical calculations, so it is important to choose good initial learning settings. The results depend, among other factors, on the loss function. This paper proposes a new loss function for multiclass, single-label classification. Experiments were conducted with convolutional neural networks trained on several popular data sets; tests with a multilayer perceptron were also carried out. The results indicate that the proposed loss may be a good alternative to categorical cross-entropy.
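The abstract does not reproduce the definition of the proposed loss, so no attempt is made to sketch it here. As a point of reference, the categorical cross-entropy baseline it is compared against can be written out in a few lines of NumPy; the function name and the toy predictions below are illustrative, not taken from the paper:

```python
import numpy as np

def categorical_cross_entropy(y_true, y_pred, eps=1e-12):
    """Mean over samples of -sum_k y_true[k] * log(y_pred[k]),
    for one-hot targets y_true and predicted class probabilities y_pred."""
    y_pred = np.clip(y_pred, eps, 1.0)  # avoid log(0)
    return float(-np.mean(np.sum(y_true * np.log(y_pred), axis=1)))

# Two samples, three classes, one-hot targets
y_true = np.array([[1.0, 0.0, 0.0],
                   [0.0, 1.0, 0.0]])
y_pred = np.array([[0.7, 0.2, 0.1],
                   [0.1, 0.8, 0.1]])

loss = categorical_cross_entropy(y_true, y_pred)
# -(log 0.7 + log 0.8) / 2 ≈ 0.2899
```

For one-hot targets the inner sum reduces to the negative log-probability assigned to the true class, which is why the loss rewards confident correct predictions and penalizes confident wrong ones.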
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
Cite this paper
Halawa, K. (2021). New Loss Function for Multiclass, Single-Label Classification. In: Zamojski, W., Mazurkiewicz, J., Sugier, J., Walkowiak, T., Kacprzyk, J. (eds) Theory and Engineering of Dependable Computer Systems and Networks. DepCoS-RELCOMEX 2021. Advances in Intelligent Systems and Computing, vol 1389. Springer, Cham. https://doi.org/10.1007/978-3-030-76773-0_15
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-76772-3
Online ISBN: 978-3-030-76773-0
eBook Packages: Intelligent Technologies and Robotics (R0)