Estimating Comic Content from the Book Cover Information Using Fine-Tuned VGG Model for Comic Search

Park, Byeongseon; Matsushita, Mitsunori

doi:10.1007/978-3-030-05716-9_58

Byeongseon Park¹⁹ &
Mitsunori Matsushita¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11296))

Included in the following conference series:

International Conference on Multimedia Modeling

2155 Accesses
1 Citations

Abstract

The purpose of this research is to realize retrieval of comic based on content information. Resources of the contents information of existing comics were only the comics itself and review. However, these pieces of information have drawbacks that they can not sufficiently extract information necessary for searching, and that they contain a lot of unnecessary information. In order to solve this problem, we proposed to use the book cover of comics as a resource to grasp the contents of comics. In the proposed method, we estimate the age and cultural background of comics expressed by clothes and belongings written on the cover of comics from the reasoning model which performed fine-tuning from the VGG-16 model. Also, we associated comics with each other based on the obtained semantic vectors and tags. As a result of the experiment, the accuracy of the model was 0.693, and the reproducibility of the tag to the correct data was 0.918. Furthermore, we observed unity in the comics related by the obtained information.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
\(\copyright \) Yuzuru Shimazaki, Kodansha Ltd.
2.
\(\copyright \) Ken Yagami, Kadokawa Publishing Ltd.
3.
\(\copyright \) Yuka Kuniki, Takeshobo Ltd.

References

Park, B., Okamoto, K., Yamashita, R., Matsushita, M.: Designing a comic exploration system using a hierarchical topic classification of reviews. Inf. Eng. Express Int. Instit. Appl. Inform. 3(2), 45–57 (2017)
Google Scholar
Rigaud, C., Gurin, C., Karatzas, D., Burie, J.C., Ogier, J.M.: Knowledge-driven understanding of images in comic books. IJDAR 18(3), 199–221 (2015)
Article Google Scholar
Tanaka, T., Shoji, K., Toyama, F., Miyamichi, J.: Layout analysis of tree-structured scene frames in comic images. In: Proceedings of the 20th International Joint Conference on Artificial Intelligence, IJCAI, Hyderabad, pp. 2885–2890 (2007)
Google Scholar
Arai, K., Tolle H.: Method for automatic E-comic scene frame extraction for reading comic on mobile devices. In: Seventh International Conference on Information Technology, pp. 370–375. IEEE, Las Vegas (2010)
Google Scholar
Chu, W-T., Li, W-W.: Manga FaceNet: face detection in manga based on deep neural network. In: 17th Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, ICMR, Bucharest, pp. 412–415 (2017)
Google Scholar
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. In: Proceedings of the IEEE, pp. 2278–2324. IEEE (1998)
Google Scholar
Blei, D.M., Andrew, Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
MATH Google Scholar
Blei, D.M.: Probabilistic topic models. Comun. ACM 55(4), 77–84 (2012)
Article Google Scholar
Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255. IEEE, Miami (2009)
Google Scholar
Lin, M., Chen, Q., Yan, S.: Network in network. In: International Conference on Learning Representations, San Diego, (2014)
Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: International Conference on Learning Representations, San Diego (2015)
Google Scholar
Saito, M., Matsui, Y.: Illustration2Vec: a semantic vector representation of illustrations. In: SIGGRAPH ASIA 2015 Technical Briefs, No. 5 (2015)
Google Scholar
Islam, A., Inkpen, D.: Semantic text similarity using corpus-based word similarity and string similarity. ACM Trans. Knowl. Discov. Data 2(2), 10 (2008)
Article Google Scholar
Cha, S.H.: Comprehensive survey on distance/similarity measures between probability density functions. Int. J. Math. Models Methods Appl. Sci. 1(4), 300–307 (2007)
Google Scholar

Download references

Acknowledgments

The authors would like to thank S. Inoue, Y. Baba and Y. Higuchi for assistance with the data collection.

Author information

Authors and Affiliations

Graduate School of Informatics, Kansai University, 2-1-1, Reizanji-cho, Takatsuki-shi, Osaka, 569-1052, Japan
Byeongseon Park & Mitsunori Matsushita

Authors

Byeongseon Park
View author publications
You can also search for this author in PubMed Google Scholar
Mitsunori Matsushita
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Byeongseon Park .

Editor information

Editors and Affiliations

Information Technologies Institute, Centre for Research and Technology Hellas, Thessaloniki, Greece
Ioannis Kompatsiaris
EURECOM, Sophia Antipolis, France
Benoit Huet
Information Technologies Institute, Centre for Research and Technology Hellas, Thessaloniki, Greece
Vasileios Mezaris
Dublin City University, Dublin, Ireland
Cathal Gurrin
National Chiao Tung University, Hsinchu, Taiwan
Wen-Huang Cheng
Information Technologies Institute, Centre for Research and Technology Hellas, Thessaloniki, Greece
Stefanos Vrochidis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Park, B., Matsushita, M. (2019). Estimating Comic Content from the Book Cover Information Using Fine-Tuned VGG Model for Comic Search. In: Kompatsiaris, I., Huet, B., Mezaris, V., Gurrin, C., Cheng, WH., Vrochidis, S. (eds) MultiMedia Modeling. MMM 2019. Lecture Notes in Computer Science(), vol 11296. Springer, Cham. https://doi.org/10.1007/978-3-030-05716-9_58

Download citation

DOI: https://doi.org/10.1007/978-3-030-05716-9_58
Published: 11 December 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-05715-2
Online ISBN: 978-3-030-05716-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics