Abstract
The purpose of this research is to realize retrieval of comic based on content information. Resources of the contents information of existing comics were only the comics itself and review. However, these pieces of information have drawbacks that they can not sufficiently extract information necessary for searching, and that they contain a lot of unnecessary information. In order to solve this problem, we proposed to use the book cover of comics as a resource to grasp the contents of comics. In the proposed method, we estimate the age and cultural background of comics expressed by clothes and belongings written on the cover of comics from the reasoning model which performed fine-tuning from the VGG-16 model. Also, we associated comics with each other based on the obtained semantic vectors and tags. As a result of the experiment, the accuracy of the model was 0.693, and the reproducibility of the tag to the correct data was 0.918. Furthermore, we observed unity in the comics related by the obtained information.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
\(\copyright \) Yuzuru Shimazaki, Kodansha Ltd.
- 2.
\(\copyright \) Ken Yagami, Kadokawa Publishing Ltd.
- 3.
\(\copyright \) Yuka Kuniki, Takeshobo Ltd.
References
Park, B., Okamoto, K., Yamashita, R., Matsushita, M.: Designing a comic exploration system using a hierarchical topic classification of reviews. Inf. Eng. Express Int. Instit. Appl. Inform. 3(2), 45–57 (2017)
Rigaud, C., Gurin, C., Karatzas, D., Burie, J.C., Ogier, J.M.: Knowledge-driven understanding of images in comic books. IJDAR 18(3), 199–221 (2015)
Tanaka, T., Shoji, K., Toyama, F., Miyamichi, J.: Layout analysis of tree-structured scene frames in comic images. In: Proceedings of the 20th International Joint Conference on Artificial Intelligence, IJCAI, Hyderabad, pp. 2885–2890 (2007)
Arai, K., Tolle H.: Method for automatic E-comic scene frame extraction for reading comic on mobile devices. In: Seventh International Conference on Information Technology, pp. 370–375. IEEE, Las Vegas (2010)
Chu, W-T., Li, W-W.: Manga FaceNet: face detection in manga based on deep neural network. In: 17th Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, ICMR, Bucharest, pp. 412–415 (2017)
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. In: Proceedings of the IEEE, pp. 2278–2324. IEEE (1998)
Blei, D.M., Andrew, Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Blei, D.M.: Probabilistic topic models. Comun. ACM 55(4), 77–84 (2012)
Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255. IEEE, Miami (2009)
Lin, M., Chen, Q., Yan, S.: Network in network. In: International Conference on Learning Representations, San Diego, (2014)
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: International Conference on Learning Representations, San Diego (2015)
Saito, M., Matsui, Y.: Illustration2Vec: a semantic vector representation of illustrations. In: SIGGRAPH ASIA 2015 Technical Briefs, No. 5 (2015)
Islam, A., Inkpen, D.: Semantic text similarity using corpus-based word similarity and string similarity. ACM Trans. Knowl. Discov. Data 2(2), 10 (2008)
Cha, S.H.: Comprehensive survey on distance/similarity measures between probability density functions. Int. J. Math. Models Methods Appl. Sci. 1(4), 300–307 (2007)
Acknowledgments
The authors would like to thank S. Inoue, Y. Baba and Y. Higuchi for assistance with the data collection.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Park, B., Matsushita, M. (2019). Estimating Comic Content from the Book Cover Information Using Fine-Tuned VGG Model for Comic Search. In: Kompatsiaris, I., Huet, B., Mezaris, V., Gurrin, C., Cheng, WH., Vrochidis, S. (eds) MultiMedia Modeling. MMM 2019. Lecture Notes in Computer Science(), vol 11296. Springer, Cham. https://doi.org/10.1007/978-3-030-05716-9_58
Download citation
DOI: https://doi.org/10.1007/978-3-030-05716-9_58
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-05715-2
Online ISBN: 978-3-030-05716-9
eBook Packages: Computer ScienceComputer Science (R0)