Abstract
In real-world applications, new data, patterns, and categories that were not covered by the training data can frequently emerge, necessitating the capability to detect and adapt to novel characters incrementally. Researchers refer to these challenges as the Open-Set Text Recognition (OSTR) task, which has, in recent years, emerged as one of the prominent issues in the field of text recognition. In this chapter, we first introduce the evolution and several main trends of preliminary works on novel (unseen) character identification and recognition. Then, we briefly discuss three main challenges in OSTR. Finally, we introduce the overall structure and main content of our book.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Borisyuk, F., Gordo, A., Sivakumar, V.: Rosetta: Large scale system for text detection and recognition in images. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery Data Mining, KDD 2018, August 19–23, pp. 71–79. ACM, London, UK (2018)
Shi, B., Yang, M., Wang, X., Lyu, P., Yao, C., Bai, X.: ASTER: an attentional scene text recognizer with flexible rectification. IEEE Trans. Pattern Anal. Mach. Intell. 41(9), 2035–2048 (2019)
Wang, T., Zhu, Y., Jin, L., Luo, C., Chen, X., Wu, Y., Wang, Q., Cai, M.: Decoupled attention network for text recognition. In: The Thirty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2020, The Thirty-Second Innovative Applications of Artificial Intelligence Conference, IAAI 2020, The Tenth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2020, February 7–12, pp. 12 216–12 224. AAAI Press, New York, NY, USA (2020)
Chen, J., Li, B., Xue, X.: Zero-shot Chinese character recognition with stroke-level decomposition. In: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, IJCAI 2021, Virtual Event/Montreal, Canada, August 19–27, pp. 615–621. (2021). www.ijcai.org
Wang, T., Xie, Z., Li, Z., Jin, L., Chen, X.: Radical aggregation network for few-shot offline handwritten Chinese character recognition. Pattern Recognit. Lett. 125, 821–827 (2019)
Zhang, J., Du, J., Dai, L.: Radical analysis network for learning hierarchies of Chinese characters. Pattern Recognit. 103, 107305 (2020)
Qi, H., Brown, M., Lowe, D.G.: Low-shot learning with imprinted weights. In: 2018 IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2018, June 18–22, pp. 5822–5830. IEEE Computer Society, Salt Lake City, UT, USA (2018)
Ao, X., Zhang, X., Yang, H., Yin, F., Liu, C.: Cross-modal prototype learning for zero-shot handwriting recognition. In: 2019 International Conference on Document Analysis and Recognition. ICDAR 2019, September 20–25, pp. 589–594. IEEE, Sydney, Australia (2019)
Zhang, C., Gupta, A., Zisserman, A.: Adaptive text recognition through visual matching. In: Computer Vision-ECCV 2020–16th European Conference, August 23–28,: Proceedings, Part XVI, ser. Lecture Notes in Computer Science, vol. 12361, pp. 51–67. Springer, Glasgow, UK (2020)
Souibgui, M.A., Fornés, A., Kessentini, Y., Megyesi, B.: Few shots is all you need: A progressive few shot learning approach for low resource handwriting recognition (2021). [Online] Available: https://arxiv.org/abs/2107.10064
Huang, Y., Jin, L., Peng, D.: Zero-shot Chinese text recognition via matching class embedding. In: 16th International Conference on Document Analysis and Recognition, ICDAR 2021, September 5–10, 2021, Proceedings, Part III, ser. Lecture Notes in Computer Science, vol. 12823, pp. 127–141. Springer, Lausanne, Switzerland (2021)
Baek, J., Kim, G., Lee, J., Park, S., Han, D., Yun, S., Oh, S.J., Lee, H.: What is wrong with scene text recognition model comparisons? dataset and model analysis. In: 2019 IEEE/CVF International Conference on Computer Vision. ICCV 2019, October 27-November 2, pp. 4714–4722. IEEE, Seoul, Korea (South) (2019)
Geng, C., Huang, S., Chen, S.: Recent advances in open set recognition: A survey. IEEE Trans. Pattern Anal. Mach. Intell. 43(10), 3614–3631 (2021)
Fei, G., Liu, B.: Breaking the closed world assumption in text classification. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, pp. 506–514 (2016)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Yin, XC., Yang, C., Liu, C. (2024). Introduction. In: Open-Set Text Recognition. SpringerBriefs in Computer Science. Springer, Singapore. https://doi.org/10.1007/978-981-97-0361-6_1
Download citation
DOI: https://doi.org/10.1007/978-981-97-0361-6_1
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-0360-9
Online ISBN: 978-981-97-0361-6
eBook Packages: Computer ScienceComputer Science (R0)