Abstract
One main goal of paleographers is to identify the different writers who wrote a given manuscript. Recently, paleographers are starting to use digital tools which provide new and more objective ways to analyze ancient documents. On the other hand, in the last few years, deep learning techniques have been applied to many domains and to overcome its requirement of a large amount of labeled data, transfer learning has been used. This latter approach uses previously trained large deep networks as starting points to solve specific classification problems. In this paper, we present a novel approach based on deep transfer learning to implement a reject option for the recognition of the writers in medieval documents. The implemented option is page-based and considers the row labels provided by the trained deep network to estimate the class probabilities. The proposed approach has been tested on a set of digital images from a Bible of the XII century. The achieved results confirmed the effectiveness of the proposed approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Afzal, M.Z., et al.: Deepdocclassifier: document classification with deep convolutional neural network. In: 2015 13th International Conference on Document Analysis and Recognition (ICDAR), pp. 1111–1115, August 2015
Antonacopoulos, A., Downton, A.C.: Special issue on the analysis of historical documents. IJDAR 9(2–4), 75–77 (2007)
Bria, A., et al.: Deep transfer learning for writer identification in medieval books. In: 2018 IEEE International Conference on Metrology for Archaeology and Cultural Heritage (2018). (in press)
Bria, A., Marrocco, C., Molinara, M., Tortorella, F.: A ranking-based cascade approach for unbalanced data. In: 2012 21st International Conference on Pattern Recognition (ICPR), pp. 3439–3442. IEEE (2012)
Ciresan, D.C., Meier, U., Schmidhuber, J.: Transfer learning for latin and chinese characters with deep neural networks. In: The 2012 International Joint Conference on Neural Networks (IJCNN), pp. 1–6 (2012)
De Stefano, C., Maniaci, M., Fontanella, F., di Scotto Freca, A.: Layout measures for writer identification in mediaeval documents. Measurement 127, 443–452 (2018)
De Stefano, C., Maniaci, M., Fontanella, F., di Freca, A.S.: Reliable writer identification in medieval manuscripts through page layout features: the avila bible case. Eng. Appl. Artif. Intell. 72, 99–110 (2018)
De Stefano, C., Fontanella, F., Maniaci, M., Scotto di Freca, A.: A method for scribe distinction in medieval manuscripts using page layout features. In: Maino, G., Foresti, G.L. (eds.) ICIAP 2011. LNCS, vol. 6978, pp. 393–402. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-24085-0_41
Granet, A., Morin, E., Mouchère, H., Quiniou, S., Viard-Gaudin, C.: Transfer learning for handwriting recognition on historical documents. In: International Conference on Pattern Recognition Applications and Methods, Madeira, Portugal, January 2018. https://hal.archives-ouvertes.fr/hal-01681126
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. CoRR arXiv:1512.03385 (2015). http://dblp.uni-trier.de/db/journals/corr/corr1512.html#HeZRS15
Kölsch, A., Mishra, A., Varshneya, S., Liwicki, M.: Recognizing challenging handwritten annotations with fully convolutional networks. CoRR arXiv:1804.00236 (2018)
Lin, T.-Y., et al.: Microsoft COCO: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8693, pp. 740–755. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1_48
Ni, K., Callier, P., Hatch, B.: Writer identification in noisy handwritten documents. In: 2017 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 1177–1186, March 2017. https://doi.org/10.1109/WACV.2017.136
Oliveira, S.A., Seguin, B., Kaplan, F.: dhsegment: A generic deep-learning approach for document segmentation. CoRR arXiv:1804.10371 (2018)
Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., Chen, L.C.: Mobilenetv 2: inverted residuals and linear bottlenecks. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2018
Simistira, F., Seuret, M., Eichenberger, N., Garz, A., Liwicki, M., Ingold, R.: Diva-hisdb: a precisely annotated large dataset of challenging medieval manuscripts. In: 2016 15th International Conference on Frontiers in Handwriting Recognition(ICFHR), pp. 471–476, October 2016
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. CoRR arXiv:1409.1556 (2014). http://dblp.uni-trier.de/db/journals/corr/corr1409.html#SimonyanZ14a
Szegedy, C., Ioffe, S., Vanhoucke, V.: Inception-v4, inception-resnet and the impact of residual connections on learning. CoRR arXiv:1602.07261 (2016). http://dblp.uni-trier.de/db/journals/corr/corr1602.html#SzegedyIV16
Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., Wojna, Z.: Rethinking the inception architecture for computer vision. CoRR arXiv:1512.00567 (2015). http://dblp.uni-trier.de/db/journals/corr/corr1512.html#SzegedyVISW15
Trovini, G., et al.: A deep learning framework for micro-calcification detection in 2d mammography and c-view. In: Progress in Biomedical Optics and Imaging - Proceedings of SPIE.,vol. 10718 (2018)
Tushar, A.K., Ashiquzzaman, A., Afrin, A., Islam, M.R.: A novel transfer learning approach upon hindi, arabic, and bangla numerals using convolutional neural networks. CoRR arXiv:1707.08385 (2017). http://arxiv.org/abs/1707.08385
Zoph, B., Vasudevan, V., Shlens, J., Le, Q.V.: Learning transferable architectures for scalable image recognition. CoRR arXiv:1707.07012 (2017). http://dblp.uni-trier.de/db/journals/corr/corr1707.html#ZophVSL17
Acknowledgment
The authors gratefully acknowledge the support of NVIDIA Corporation for the donation of the Titan Xp GPUs.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Cilia, N.D., De Stefano, C., Fontanella, F., Marrocco, C., Molinara, M., Scotto di Freca, A. (2019). A Page-Based Reject Option for Writer Identification in Medieval Books. In: Cristani, M., Prati, A., Lanz, O., Messelodi, S., Sebe, N. (eds) New Trends in Image Analysis and Processing – ICIAP 2019. ICIAP 2019. Lecture Notes in Computer Science(), vol 11808. Springer, Cham. https://doi.org/10.1007/978-3-030-30754-7_19
Download citation
DOI: https://doi.org/10.1007/978-3-030-30754-7_19
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-30753-0
Online ISBN: 978-3-030-30754-7
eBook Packages: Computer ScienceComputer Science (R0)