Document Analysis and Recognition (DAR) aims at the automatic extraction of information presented on paper and initially addressed to human comprehension. The desired output of DAR systems is usually in a suitable symbolic representation that can subsequently be processed by computers.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Lyman, P., Varian, H.R.: How much information. Technical Report Retrieved from http://www.sims.berkeley.edu/how-much-info-2003 on 6-1-2007 (2003)
Sellen, A.J., Harper, R.: The myth of the paperless office. MIT press (2001)
Mori, S., Suen, C., Yamamoto, K.: Historical review of OCR research and de-velopment. Proc. IEEE 80 (1992) 1029-1058
O’Gorman, L., Kasturi, R.: Document Image Analysis. IEEE Computer Society Press, Los Alamitos, California (1995)
Casey, R.G., Lecolinet, E.: A survey of methods and strategies in character segmentation. IEEE Transaction on PAMI 18(7) (1996) 690-706
Nagy, G.: Twenty years of document image analysis in PAMI. IEEE Transaction on PAMI 22(1) (2000) 38-62
Plamondon, R., Srihari, S.N.: On-line and off-line handwriting recognition: a comprehensive survey. IEEE Transaction on PAMI 22(1) (2000) 63-84
Marinai, S., Gori, M., Soda, G.: Artificial neural networks for document analysis and recognition. IEEE Transactions on PAMI 27(1) (2005) 23-35
Ha, T., Bunke, H.: Image processing methods for document image analysis. In: Handbook of character recognition and document image analysis. World Scientific (1997) 1-47
Hornik, K., Stinchcombe, M., White, H.: Multilayer feedforward networks are universal approximators. Neural Networks 2 (1989) 359-366
Taxt, T., Flynn, P.J., Jain, A.K.: Segmentation of document images. IEEE Transaction on PAMI 11(12) (1989) 1322-1329
Nagy, G., Seth, S.: Hierarchical representation of optically scanned documents. In: Int’l Conference on Pattern Recognition. (1984) 347-349
Watanabe, T., Luo, Q., Sugie, N.: Structure recognition methods for various types of documents. MVA 6(6) (1993) 163-176
Nagy, G., Seth, S., Viswanathan, M.: A prototype document image analysis system for technical journals. IEEE Computer 25(7) (1992) 10-22
Kim, J.H., Kim, K.K., Suen, C.Y.: An HMM-MLP hybrid model for cursive script recognition. PAA 3(4) (2000) 314-324
Gilloux, M., Lemarié, B., Leroux, M.: A hybrid radial basis function network/ hidden Markov model handwritten word recognition system. In: Int’l Conference on Document Analysis and Recognition. (1995) 394-397
Fu, L.M.: Neural networks in computer intelligence. McGraw-Hill, New York, NY (1994)
Jain, A.K., Zhong, Y.: Page segmentation using texture analysis. Pattern Recog- nition 29(5) (1996) 743-770
Shih, F.Y., Chen, S.S.: Adaptive document block segmentation and classifica-tion. IEEE Trans. SMC 26(5) (1996) 797-802
Strouthopoulos, C., Papamarkos, N.: Text identification for document image analysis using a neural network. Image and Vision Computing 16(12/13) (1998) 879-896
Wang, J., Jean, J.: Segmentation of merged characters by neural networks and shortest path. Pattern Recognition 27(5) (1994) 649-658
Lu, Z.K., Chi, Z., Siu, W.C.: Length estimation of digits strings using a neu-ral network with structure based features. SPIE/IS&T Journal of Electronic Imaging 7(1) (1998) 79-85
You, D., Kim, G.: An approach for locating segmentation points of handwritten digit strings using a neural network. In: Int’l Conference on Document Analysis and Recognition. (2003) 142-146
Pal, U., Sinha, S., Chaudhuri, B.: Multi-script line identification from indian documents. In: Int’l Conference on Document Analysis and Recognition. (2003) 880-884
Ishitani, Y.: Flexible and robust model matching based on association graph for form image understanding. Pattern Analysis and Applications 3(2) (2000) 104-119
Dengel, A., Dubiel, F.: Clustering and classification of document structure -a machine learning approach-. In: Int’l Conference on Document Analysis and Recognition. (1995) 587-591
Cesarini, F., Lastri, M., Marinai, S., Soda, G.: Encoding of modified X-Y trees for document classification. In: Int’l Conference on Document Analysis and Recognition. (2001) 1131-1136
van Beusekom, J., Keysers, D., Shafait, F., Breuel, T.M.: Distance measures for layout-based document image retrieval. In: Proc. Second Int’l Workshop on Document Image Analysis for Libraries. (2006) 232-242
Kaski, S., Honkela, T., Lagus, K., Kohonen, T.: Websom self-organizing maps of document collections. Neurocomputing 21(1-3) (1998) 101-118
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proceedings of the IEEE 86(11) (1998) 2278-2324
Simard, P.Y., Steinkraus, D., Platt, J.C.: Best practices for convolutional neural networks applied to visual document analysis. In: Int’l Conference on Document Analysis and Recognition. (2003) 958-963
Suzuki, M., Uchida, S., Nomura, A.: A ground-truthed mathematical character and symbol image database. In: Int’l Conference on Document Analysis and Recognition. (2005) 675-679
Pechwitz, M., Maddouri, S.S., Magner, V., Ellouze, N., Amiri, H.: IFN/ENIT- database of handwritten arabic words. In: 7th Colloque International Francophone sur l’Ecrit et le Document. (2002)
Ford, G., Thoma, G.: Ground truth data for document image analysis. In: Sym-posium on Document Image Understanding and Technology. (2003) 199-205
Marti, U., Bunke, H.: A full english sentence database for off-line handwriting recognition. In: Int’l Conference on Document Analysis and Recognition. (1999) 705-708
Antonacopoulos, A., Gatos, B., Bridson, D.: ICDAR 2005 page segmentation competition. In: Int’l Conference on Document Analysis and Recognition. (2005) 75-79
Margner, V., Pechwitz, M., Abed, H.E.: ICDAR 2005 arabic handwriting recog-nition competition. In: Int’l Conference on Document Analysis and Recognition. (2005) 70-74
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Marinai, S. (2008). Introduction to Document Analysis and Recognition. In: Marinai, S., Fujisawa, H. (eds) Machine Learning in Document Analysis and Recognition. Studies in Computational Intelligence, vol 90. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-76280-5_1
Download citation
DOI: https://doi.org/10.1007/978-3-540-76280-5_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-76279-9
Online ISBN: 978-3-540-76280-5
eBook Packages: EngineeringEngineering (R0)