Text-aware balloon extraction from manga

Liu, Xueting; Li, Chengze; Zhu, Haichao; Wong, Tien-Tsin; Xu, Xuemiao

doi:10.1007/s00371-015-1084-0

Text-aware balloon extraction from manga

Original Article
Published: 11 April 2015

Volume 32, pages 501–511, (2016)
Cite this article

The Visual Computer Aims and scope Submit manuscript

Xueting Liu^1,2,
Chengze Li²,
Haichao Zhu^1,2,
Tien-Tsin Wong^1,2 &
…
Xuemiao Xu³

939 Accesses
14 Citations
1 Altmetric
Explore all metrics

Abstract

Manga, a Japanese word for comics, is a worldwide popular visual entertainment. Nowadays, electronic devices boost the fast development of motion manga for the purpose of visual richness and manga promotion. To convert static manga images into motion mangas, text balloons are usually animated individually for better story telling. This needs the artists to cut out each text balloon meticulously, and therefore it is quite labor-intensive and time-consuming. In this paper, we propose an automatic approach that can extract text balloons from manga images both accurately and effectively. Our approach starts by extracting white areas that contain texts as text blobs. Different from existing text blob extraction methods that only rely on shape properties, we incorporate text properties in order to differentiate text blobs from texture blobs. Instead of heuristic parameter thresholding, we achieve text blob classification via learning-based classifiers. Along with the novel text blob classification method, we also make the first attempt in trying to tackle the boundary issue in balloon extraction. We apply our method on various styles of mangas and comics with texts in different languages, and convincing results are obtained in all cases.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Revolutionizing animation: unleashing the power of artificial intelligence for cutting-edge visual effects in films

Article 16 December 2023

HP_DocPres: a method for classifying printed and handwritten texts in doctor’s prescription

Article 13 November 2020

Scene Text Detection and Recognition: The Deep Learning Era

Article 27 August 2020

References

Arai, K., Tolle, H.: Automatic e-comic content adaptation. Int. J. Ubiquitous Comput. 1(1), 1–11 (2010)
Article Google Scholar
Arai, K., Tolle, H.: Method for real time text extraction of digital manga comic. Int. J. Image Process. 4(6), 669–676 (2011)
Google Scholar
Chen, X., Yuille, A.L.: Detecting and reading text in natural scenes. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 366–373 (2004)
Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2963–2970 (2010)
Friedman, J., Hastie, T., Tibshirani, R.: Additive logistic regression: a statistical view of boosting (with discussion and a rejoinder by the authors). Ann. Stat. 28(2), 337–407 (2000)
Article MATH MathSciNet Google Scholar
Gllavata, J., Ewerth, R., Freisleben, B.: Text detection in images based on unsupervised classification of high-frequency wavelet coefficients. In: International Conference on Pattern Recognition (ICPR), pp. 425–428 (2004)
Ho, A.K.N., Burie, J., Ogier, J.: Panel and speech balloon extraction from comic books. In: IAPR International Workshop on Document Analysis Systems (DAS). IEEE Computer Society, Washington, DC, USA, pp. 424–428 (2012)
Jain, A.K., Yu, B.: Automatic text location in images and video frames. Pattern Recogn. 31(12), 2055–2076 (1998)
Article Google Scholar
Kim, H.K.: Efficient automatic text location method and content-based indexing and structuring of video database. J. Vis. Commun. Image Represent. 7(4), 336–344 (1996)
Article Google Scholar
Li, H., Doermann, D.S., Kia, O.E.: Automatic text detection and tracking in digital video. IEEE Trans. Image Process. 9(1), 147–156 (2000)
Article Google Scholar
Lienhart, R., Wernicke, A.: Localizing and segmenting text in images and videos. IEEE Trans. Circuits Syst. Video Technol. 12(4), 256–268 (2002)
Article Google Scholar
Liu, Y., Goto, S., Ikenaga, T.: A contour-based robust algorithm for text detection in color images. IEICE Trans. 89–D(3), 1221–1230 (2006)
Google Scholar
Sundaresan, M., Ranjini, S.: Text extraction from digital english comic image using two blobs extraction method. In: International Conference on Pattern Recognition, Informatics and Medical Engineering (PRIME), pp. 449–452 (2012)
Suzuki, K., Horiba, I., Sugie, N.: Linear-time connected-component labeling based on sequential local operations. Comput. Vis. Image Underst. 89(1), 1–23 (2003)
Article MATH Google Scholar
Tolle, H., Arai, K.: Manga content extraction method for automatic mobile comic content creation. In: International Conference on Advanced Computer Science and Information Systems (ICACSIS), pp. 321–328 (2013)
Vapnik, V.: The nature of statistical learning theory. Springer, Berlin (2000)
Book MATH Google Scholar
Yamada, M., Budiarto, R., Endo, M., Miyazaki, S.: Comic image decomposition for reading comics on cellular phones. IEICE Trans. 87–D(6), 1370–1376 (2004)
Google Scholar
Ye, Q., Huang, Q., Gao, W., Zhao, D.: Fast and robust text detection in images and video frames. Image Vis. Comput. 23(6), 565–576 (2005)
Article Google Scholar
Zhang, X., Lin, Z., Sun, F., Ma, Y.: Transform invariant text extraction. Vis. Comput. 30(4), 401–415 (2014)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, Hong Kong
Xueting Liu, Haichao Zhu & Tien-Tsin Wong
Shenzhen Research Institute, The Chinese University of Hong Kong, Shenzhen, China
Xueting Liu, Chengze Li, Haichao Zhu & Tien-Tsin Wong
School of Computer Science and Engineering, South China University of Technology, Guangzhou, China
Xuemiao Xu

Authors

Xueting Liu
View author publications
You can also search for this author in PubMed Google Scholar
Chengze Li
View author publications
You can also search for this author in PubMed Google Scholar
Haichao Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Tien-Tsin Wong
View author publications
You can also search for this author in PubMed Google Scholar
Xuemiao Xu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Xueting Liu or Tien-Tsin Wong.

Additional information

This project is supported by NSFC (Project No. 61272293), Shenzhen Basic Research Project (Project No. JCYJ20120619152326448), Shenzhen Nanshan Innovative Institution Establishment Fund (Project No. KC2013ZDZJ0007A), the Research Grants Council of the Hong Kong Special Administrative Region, under RGC General Research Fund (Project No. CUHK 417913), NSFC (Project No. 61103120), and Guangzhou Novo Program of Science and Technology (Project No. 0501-330).

Rights and permissions

Reprints and permissions

About this article

Cite this article

Liu, X., Li, C., Zhu, H. et al. Text-aware balloon extraction from manga. Vis Comput 32, 501–511 (2016). https://doi.org/10.1007/s00371-015-1084-0

Download citation

Published: 11 April 2015
Issue Date: April 2016
DOI: https://doi.org/10.1007/s00371-015-1084-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Text-aware balloon extraction from manga

Abstract

Access this article

Similar content being viewed by others

Revolutionizing animation: unleashing the power of artificial intelligence for cutting-edge visual effects in films

HP_DocPres: a method for classifying printed and handwritten texts in doctor’s prescription

Scene Text Detection and Recognition: The Deep Learning Era

References

Author information

Authors and Affiliations

Corresponding authors

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Text-aware balloon extraction from manga

Abstract

Access this article

Similar content being viewed by others

Revolutionizing animation: unleashing the power of artificial intelligence for cutting-edge visual effects in films

HP_DocPres: a method for classifying printed and handwritten texts in doctor’s prescription

Scene Text Detection and Recognition: The Deep Learning Era

References

Author information

Authors and Affiliations

Corresponding authors

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation