Text-aware balloon extraction from manga

  • Original Article
The Visual Computer

Abstract

Manga, the Japanese word for comics, is a form of visual entertainment popular worldwide. The spread of electronic devices has driven rapid development of motion manga, which adds visual richness and helps promote manga. To convert static manga images into motion manga, text balloons are usually animated individually for better storytelling. This requires artists to cut out each text balloon meticulously, which is labor-intensive and time-consuming. In this paper, we propose an automatic approach that extracts text balloons from manga images both accurately and effectively. Our approach starts by extracting white areas that contain text as text blobs. Unlike existing text blob extraction methods, which rely only on shape properties, we incorporate text properties to differentiate text blobs from texture blobs. Instead of heuristic parameter thresholding, we classify text blobs with learning-based classifiers. Along with this novel text blob classification method, we make the first attempt to tackle the boundary issue in balloon extraction. We apply our method to manga and comics of various styles with text in different languages, and obtain convincing results in all cases.
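The first stage described in the abstract (collecting white areas as candidate blobs and then describing each one by more than its shape) can be illustrated with a minimal sketch. The abstract does not give the paper's actual features or classifiers, so everything here is an assumption made for illustration: the function names `white_blobs` and `blob_features`, the binary-image input (1 = white, 0 = dark), 4-connectivity, and the `dark_ratio` feature, which only stands in for a "text property" by measuring how much dark ink lies inside a blob's bounding box.

```python
from collections import deque


def white_blobs(image, white=1):
    """Return the 4-connected regions of white pixels as lists of (row, col)."""
    height, width = len(image), len(image[0])
    seen = [[False] * width for _ in range(height)]
    blobs = []
    for y in range(height):
        for x in range(width):
            if image[y][x] != white or seen[y][x]:
                continue
            # Breadth-first flood fill from an unvisited white pixel.
            queue = deque([(y, x)])
            seen[y][x] = True
            pixels = []
            while queue:
                cy, cx = queue.popleft()
                pixels.append((cy, cx))
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and not seen[ny][nx] and image[ny][nx] == white):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            blobs.append(pixels)
    return blobs


def blob_features(image, pixels):
    """Compute illustrative features for one blob (not the paper's feature set)."""
    rows = [p[0] for p in pixels]
    cols = [p[1] for p in pixels]
    top, bottom, left, right = min(rows), max(rows), min(cols), max(cols)
    box_area = (bottom - top + 1) * (right - left + 1)
    # Dark pixels inside the bounding box: a crude stand-in for enclosed text.
    dark_inside = sum(
        1
        for y in range(top, bottom + 1)
        for x in range(left, right + 1)
        if image[y][x] == 0
    )
    return {
        "area": len(pixels),                    # shape property
        "fill_ratio": len(pixels) / box_area,   # shape property
        "dark_ratio": dark_inside / box_area,   # hypothetical text property
    }
```

Feature dictionaries like these could then be passed to any learned classifier in place of hand-tuned thresholds; which classifier and which features the authors actually use is described in the full article, not in this abstract.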

Author information

Corresponding authors

Correspondence to Xueting Liu or Tien-Tsin Wong.

Additional information

This project is supported by NSFC (Project No. 61272293), Shenzhen Basic Research Project (Project No. JCYJ20120619152326448), Shenzhen Nanshan Innovative Institution Establishment Fund (Project No. KC2013ZDZJ0007A), the Research Grants Council of the Hong Kong Special Administrative Region, under RGC General Research Fund (Project No. CUHK 417913), NSFC (Project No. 61103120), and Guangzhou Novo Program of Science and Technology (Project No. 0501-330).

About this article

Cite this article

Liu, X., Li, C., Zhu, H. et al. Text-aware balloon extraction from manga. Vis Comput 32, 501–511 (2016). https://doi.org/10.1007/s00371-015-1084-0

  • DOI: https://doi.org/10.1007/s00371-015-1084-0
