Abstract
Comic book digitization, combined with subsequent comic book understanding, gives rise to a variety of new applications, including content reflowing, mobile reading and multi-modal search. Document understanding in this domain is challenging because comics are semi-structured documents in which semantic information is shared between the graphical and textual parts. Speech balloon contour analysis reveals the tone of speech, which makes it an essential step towards fully automatic comics understanding. In this paper we present the first approach for classifying speech balloons in scanned comic books: we separate and analyze their contour variations to classify each balloon as “smooth” (normal speech), “wavy” (thought) or “zigzag” (exclamation). Experiments show an overall classification accuracy of 85.2% on a wide variety of balloons from the eBDtheque dataset.
Acknowledgment
This work was supported by the European Doctorate funds of the University of La Rochelle, the European Regional Development Funds, the region Poitou-Charentes (France), the General Council of Charente Maritime (France), the town of La Rochelle (France) and the Spanish research projects TIN2011-24631 and RYC-2009-05031. The authors would like to thank Audrey Adam for her tedious work on the construction of the pixel-level ground truth.
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Rigaud, C., Karatzas, D., Burie, JC., Ogier, JM. (2014). Adaptive Contour Classification of Comics Speech Balloons. In: Lamiroy, B., Ogier, JM. (eds) Graphics Recognition. Current Trends and Challenges. GREC 2013. Lecture Notes in Computer Science, vol 8746. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-44854-0_5
DOI: https://doi.org/10.1007/978-3-662-44854-0_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-44853-3
Online ISBN: 978-3-662-44854-0
eBook Packages: Computer Science, Computer Science (R0)