Introduction

Yin, Xu-Cheng; Yang, Chun; Liu, Chang

doi:10.1007/978-981-97-0361-6_1

Part of the book series: SpringerBriefs in Computer Science (BRIEFSCOMPUTER)

Abstract

In real-world applications, data, patterns, and categories not covered by the training data frequently emerge, so a recognizer must be able to detect novel characters and adapt to them incrementally. Researchers refer to this problem as the Open-Set Text Recognition (OSTR) task, which has in recent years become one of the prominent issues in text recognition. In this chapter, we first introduce the evolution and main trends of earlier work on novel (unseen) character identification and recognition. Then, we briefly discuss three main challenges in OSTR. Finally, we introduce the overall structure and main content of the book.

Author information

Authors and Affiliations

School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China
Xu-Cheng Yin, Chun Yang & Chang Liu

Corresponding author

Correspondence to Xu-Cheng Yin.

About this chapter

Cite this chapter

Yin, XC., Yang, C., Liu, C. (2024). Introduction. In: Open-Set Text Recognition. SpringerBriefs in Computer Science. Springer, Singapore. https://doi.org/10.1007/978-981-97-0361-6_1

DOI: https://doi.org/10.1007/978-981-97-0361-6_1
Published: 02 April 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-0360-9
Online ISBN: 978-981-97-0361-6
eBook Packages: Computer Science, Computer Science (R0)
