Introduction to Document Analysis and Recognition

Marinai, Simone

doi:10.1007/978-3-540-76280-5_1

Simone Marinai⁴

Part of the book series: Studies in Computational Intelligence ((SCI,volume 90))

2841 Accesses
24 Citations

Document Analysis and Recognition (DAR) aims at the automatic extraction of information presented on paper and initially addressed to human comprehension. The desired output of DAR systems is usually in a suitable symbolic representation that can subsequently be processed by computers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Lyman, P., Varian, H.R.: How much information. Technical Report Retrieved from http://www.sims.berkeley.edu/how-much-info-2003 on 6-1-2007 (2003)
Sellen, A.J., Harper, R.: The myth of the paperless office. MIT press (2001)
Google Scholar
Mori, S., Suen, C., Yamamoto, K.: Historical review of OCR research and de-velopment. Proc. IEEE 80 (1992) 1029-1058
Article Google Scholar
O’Gorman, L., Kasturi, R.: Document Image Analysis. IEEE Computer Society Press, Los Alamitos, California (1995)
Google Scholar
Casey, R.G., Lecolinet, E.: A survey of methods and strategies in character segmentation. IEEE Transaction on PAMI 18(7) (1996) 690-706
Google Scholar
Nagy, G.: Twenty years of document image analysis in PAMI. IEEE Transaction on PAMI 22(1) (2000) 38-62
Google Scholar
Plamondon, R., Srihari, S.N.: On-line and off-line handwriting recognition: a comprehensive survey. IEEE Transaction on PAMI 22(1) (2000) 63-84
Google Scholar
Marinai, S., Gori, M., Soda, G.: Artificial neural networks for document analysis and recognition. IEEE Transactions on PAMI 27(1) (2005) 23-35
Google Scholar
Ha, T., Bunke, H.: Image processing methods for document image analysis. In: Handbook of character recognition and document image analysis. World Scientific (1997) 1-47
Google Scholar
Hornik, K., Stinchcombe, M., White, H.: Multilayer feedforward networks are universal approximators. Neural Networks 2 (1989) 359-366
Article Google Scholar
Taxt, T., Flynn, P.J., Jain, A.K.: Segmentation of document images. IEEE Transaction on PAMI 11(12) (1989) 1322-1329
Google Scholar
Nagy, G., Seth, S.: Hierarchical representation of optically scanned documents. In: Int’l Conference on Pattern Recognition. (1984) 347-349
Google Scholar
Watanabe, T., Luo, Q., Sugie, N.: Structure recognition methods for various types of documents. MVA 6(6) (1993) 163-176
Article Google Scholar
Nagy, G., Seth, S., Viswanathan, M.: A prototype document image analysis system for technical journals. IEEE Computer 25(7) (1992) 10-22
Google Scholar
Kim, J.H., Kim, K.K., Suen, C.Y.: An HMM-MLP hybrid model for cursive script recognition. PAA 3(4) (2000) 314-324
MathSciNet Google Scholar
Gilloux, M., Lemarié, B., Leroux, M.: A hybrid radial basis function network/ hidden Markov model handwritten word recognition system. In: Int’l Conference on Document Analysis and Recognition. (1995) 394-397
Google Scholar
Fu, L.M.: Neural networks in computer intelligence. McGraw-Hill, New York, NY (1994)
Google Scholar
Jain, A.K., Zhong, Y.: Page segmentation using texture analysis. Pattern Recog- nition 29(5) (1996) 743-770
Article Google Scholar
Shih, F.Y., Chen, S.S.: Adaptive document block segmentation and classifica-tion. IEEE Trans. SMC 26(5) (1996) 797-802
Google Scholar
Strouthopoulos, C., Papamarkos, N.: Text identification for document image analysis using a neural network. Image and Vision Computing 16(12/13) (1998) 879-896
Article Google Scholar
Wang, J., Jean, J.: Segmentation of merged characters by neural networks and shortest path. Pattern Recognition 27(5) (1994) 649-658
Article Google Scholar
Lu, Z.K., Chi, Z., Siu, W.C.: Length estimation of digits strings using a neu-ral network with structure based features. SPIE/IS&T Journal of Electronic Imaging 7(1) (1998) 79-85
Article Google Scholar
You, D., Kim, G.: An approach for locating segmentation points of handwritten digit strings using a neural network. In: Int’l Conference on Document Analysis and Recognition. (2003) 142-146
Google Scholar
Pal, U., Sinha, S., Chaudhuri, B.: Multi-script line identification from indian documents. In: Int’l Conference on Document Analysis and Recognition. (2003) 880-884
Google Scholar
Ishitani, Y.: Flexible and robust model matching based on association graph for form image understanding. Pattern Analysis and Applications 3(2) (2000) 104-119
Article Google Scholar
Dengel, A., Dubiel, F.: Clustering and classification of document structure -a machine learning approach-. In: Int’l Conference on Document Analysis and Recognition. (1995) 587-591
Google Scholar
Cesarini, F., Lastri, M., Marinai, S., Soda, G.: Encoding of modified X-Y trees for document classification. In: Int’l Conference on Document Analysis and Recognition. (2001) 1131-1136
Google Scholar
van Beusekom, J., Keysers, D., Shafait, F., Breuel, T.M.: Distance measures for layout-based document image retrieval. In: Proc. Second Int’l Workshop on Document Image Analysis for Libraries. (2006) 232-242
Google Scholar
Kaski, S., Honkela, T., Lagus, K., Kohonen, T.: Websom self-organizing maps of document collections. Neurocomputing 21(1-3) (1998) 101-118
Article MATH Google Scholar
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proceedings of the IEEE 86(11) (1998) 2278-2324
Article Google Scholar
Simard, P.Y., Steinkraus, D., Platt, J.C.: Best practices for convolutional neural networks applied to visual document analysis. In: Int’l Conference on Document Analysis and Recognition. (2003) 958-963
Google Scholar
Suzuki, M., Uchida, S., Nomura, A.: A ground-truthed mathematical character and symbol image database. In: Int’l Conference on Document Analysis and Recognition. (2005) 675-679
Google Scholar
Pechwitz, M., Maddouri, S.S., Magner, V., Ellouze, N., Amiri, H.: IFN/ENIT- database of handwritten arabic words. In: 7th Colloque International Francophone sur l’Ecrit et le Document. (2002)
Google Scholar
Ford, G., Thoma, G.: Ground truth data for document image analysis. In: Sym-posium on Document Image Understanding and Technology. (2003) 199-205
Google Scholar
Marti, U., Bunke, H.: A full english sentence database for off-line handwriting recognition. In: Int’l Conference on Document Analysis and Recognition. (1999) 705-708
Google Scholar
Antonacopoulos, A., Gatos, B., Bridson, D.: ICDAR 2005 page segmentation competition. In: Int’l Conference on Document Analysis and Recognition. (2005) 75-79
Google Scholar
Margner, V., Pechwitz, M., Abed, H.E.: ICDAR 2005 arabic handwriting recog-nition competition. In: Int’l Conference on Document Analysis and Recognition. (2005) 70-74
Google Scholar

Download references

Author information

Authors and Affiliations

Dipartimento di Sistemi e Informatica (DSI), University of Florence, Via S. Marta, 3, I-50139, Firenze, Italy
Simone Marinai

Authors

Simone Marinai
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dipartimento di Sistemi e Informatica, University of Florence, Via S. Marta, 3, 50139, Firenze, Italy
Simone Marinai
Hitachi Central Research Laboratory, 1-280, Higashi-Koigakubo, Kokubunji-shi, Tokyo, 185-8601, Japan
Hiromichi Fujisawa

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Marinai, S. (2008). Introduction to Document Analysis and Recognition. In: Marinai, S., Fujisawa, H. (eds) Machine Learning in Document Analysis and Recognition. Studies in Computational Intelligence, vol 90. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-76280-5_1

Download citation

DOI: https://doi.org/10.1007/978-3-540-76280-5_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-76279-9
Online ISBN: 978-3-540-76280-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics