Skip to main content

Adaptive Contour Classification of Comics Speech Balloons

  • Conference paper
  • First Online:
Graphics Recognition. Current Trends and Challenges (GREC 2013)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 8746))

Included in the following conference series:

Abstract

Comic books digitization combined with subsequent comic book understanding give rise to a variety of new applications, including content reflowing, mobile reading and multi-modal search. Document understanding in this domain is challenging as comics are semi-structured documents, with semantic information shared between the graphical and textual parts. Speech balloon contour analysis reveals the speech tone which is an essential step towards a fully automatic comics understanding. In this paper we present the first approach for classifying speech balloon in scanned comic books where we separate and analyze their contour variations to classify them as “smooth” (normal speech), “wavy” (thought) or “zigzag” (exclamation). The experiments show a global accuracy classification of 85.2 % on a wide variety of balloons from the eBDtheque dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Abbasi, S., Mokhtarian, F., Kittler, J.: Curvature scale space image in shape similarity retrieval. Multimedia Syst. 7(6), 467–476 (1999)

    Article  Google Scholar 

  2. Arai, K., Tolle, H.: Method for real time text extraction of digital manga comic. Int. J. Image Process. (IJIP) 4(6), 669–676 (2011)

    Google Scholar 

  3. Bader, T., Räpple, R., Beyerer, J.: Fast invariant contour-based classification of hand symbols for HCI. In: Jiang, X., Petkov, N. (eds.) CAIP 2009. LNCS, vol. 5702, pp. 689–696. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  4. Bober, M.: Mpeg-7 visual shape descriptors. IEEE Trans. Circ. Syst. 11(6), 716–719 (2001)

    Article  Google Scholar 

  5. Cenkery, C.: Wavelet contour classification. In: Proceedings of the 20th Workshop of the Austrian Association for Pattern Recognition (OAGM/AAPR) on Pattern Recognition, 1996, Leibnitz, Austria, pp. 263–271. R. Oldenbourg Verlag GmbH, Munich, Germany (1996)

    Google Scholar 

  6. Grigoriu, A., Vonwiller, J., King, R.: An automatic intonation tone contour labelling and classification algorithm. In: 1994 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP-94, vol. 2, pp. II-181. IEEE (1994)

    Google Scholar 

  7. Guérin, C., Rigaud, C., Mercier, A., et al.: eBDtheque: a representative database of comics. In: Proceedings of International Conference on Document Analysis and Recognition (ICDAR), Washington DC (2013)

    Google Scholar 

  8. Ho, A.K.N., Burie, J.C., Ogier, J.M.: Panel and Speech Balloon Extraction from Comic Books. In: 2012 10th IAPR International Workshop on Document Analysis Systems, pp. 424–428, Mar 2012

    Google Scholar 

  9. Hu, M.K.: Visual pattern recognition by moment invariants. IRE Trans. Inf. Theory 8(2), 179–187 (1962)

    Article  MATH  Google Scholar 

  10. Keogh, E., Wei, L., Xi, X., Hee Lee, S., Vlachos, M.: Lb keogh supports exact indexing of shapes under rotation invariance with arbitrary representations and distance measures. In: VLDB, pp. 882–893 (2006)

    Google Scholar 

  11. Kühne, G., Richter, S., Beier, M.: Motion-based segmentation and contour-based classification of video objects. In: Proceedings of the Ninth ACM International Conference on Multimedia, pp. 41–50. ACM (2001)

    Google Scholar 

  12. Leung, W.H., Chen, T.: Trademark retrieval using contour-skeleton stroke classification. In: Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, ICME’02, vol. 2, pp. 517–520. IEEE (2002)

    Google Scholar 

  13. Liu, H.C., Srinath, M.D.: Partial shape classification using contour matching in distance transformation. IEEE Trans. Pattern Anal. Mach. Intell. 12(11), 1072–1079 (1990)

    Article  Google Scholar 

  14. Lopatka, M., Houten, W.V.: Science and justice automated shape annotation for illicit tablet preparations: a contour angle based classification from digital images. Sci. Justice 53(1), 60–66 (2013)

    Article  Google Scholar 

  15. Mitchell, T.M.: Mach. Learn., 1st edn. McGraw-Hill Inc., New York (1997)

    Google Scholar 

  16. Mokhtarian, F., Abbasi, S.: Shape similarity retrieval under affine transforms. Pattern Recogn. 35(1), 31–41 (2002). doi:10.1016/S0031-3203(01)00040-1

    Article  MATH  Google Scholar 

  17. Mukundan, R., Ramakrishnan, K.: Moment Functions in Image Analysis: Theory and Applications, vol. 100. World Scientific, Singapore (1998)

    Book  MATH  Google Scholar 

  18. Richter, S., Kühne, G., Schuster, O.: Contour-based classification of video objects. In: Proceedings of SPIE, vol. 4315, p. 608 (2001)

    Google Scholar 

  19. Rigaud, C., Karatzas, D., Van de Weijer, J., Burie, J.C., Ogier, J.M.: An active contour model for speech balloon detection in comics. In: Proceedings of the 12th International Conference on Document Analysis and Recognition (ICDAR). IEEE (2013)

    Google Scholar 

  20. Sun, K.B., Super, B.J.: Classification of contour shapes using class segment sets. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005, vol. 2, pp. 727–733. IEEE (2005)

    Google Scholar 

  21. Veltkamp, R.C., Tanase, M.: Content-based image retrieval systems: A survey. Technical report (2000)

    Google Scholar 

  22. Wang, Z., Chi, Z., Feng, D.: Shape based leaf image retrieval. IEEE Proc. Vis. Image Signal Process. 150(1), 34–43 (2003)

    Article  Google Scholar 

  23. Zahn, C.T., Roskies, R.Z.: Fourier descriptors for plane closed curves. IEEE Trans. Comput. c–21(3), 269–281 (1972)

    Article  MathSciNet  Google Scholar 

  24. Zhang, D., Lu, G.: Review of shape representation and description techniques. PR 37(1), 1–19 (2004)

    Article  MATH  Google Scholar 

Download references

Acknowledgment

This work was supported by the European Doctorate founds of the University of La Rochelle, the European Regional Development Funds, the region Poitou-Charentes (France), the General Council of Charente Maritime (France), the town of La Rochelle (France) and the Spanish research projects TIN2011-24631, RYC-2009-05031. The authors would like to thanks Audrey Adam for her tedious work on the construction of the pixel level ground truth.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Christophe Rigaud .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Rigaud, C., Karatzas, D., Burie, JC., Ogier, JM. (2014). Adaptive Contour Classification of Comics Speech Balloons. In: Lamiroy, B., Ogier, JM. (eds) Graphics Recognition. Current Trends and Challenges. GREC 2013. Lecture Notes in Computer Science(), vol 8746. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-44854-0_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-662-44854-0_5

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-44853-3

  • Online ISBN: 978-3-662-44854-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics