Skip to main content

Part of the book series: Studies in Computational Intelligence ((SCI,volume 90))

Document Analysis and Recognition (DAR) aims at the automatic extraction of information presented on paper and initially addressed to human comprehension. The desired output of DAR systems is usually in a suitable symbolic representation that can subsequently be processed by computers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Lyman, P., Varian, H.R.: How much information. Technical Report Retrieved from http://www.sims.berkeley.edu/how-much-info-2003 on 6-1-2007 (2003)

  2. Sellen, A.J., Harper, R.: The myth of the paperless office. MIT press (2001)

    Google Scholar 

  3. Mori, S., Suen, C., Yamamoto, K.: Historical review of OCR research and de-velopment. Proc. IEEE 80 (1992) 1029-1058

    Article  Google Scholar 

  4. O’Gorman, L., Kasturi, R.: Document Image Analysis. IEEE Computer Society Press, Los Alamitos, California (1995)

    Google Scholar 

  5. Casey, R.G., Lecolinet, E.: A survey of methods and strategies in character segmentation. IEEE Transaction on PAMI 18(7) (1996) 690-706

    Google Scholar 

  6. Nagy, G.: Twenty years of document image analysis in PAMI. IEEE Transaction on PAMI 22(1) (2000) 38-62

    Google Scholar 

  7. Plamondon, R., Srihari, S.N.: On-line and off-line handwriting recognition: a comprehensive survey. IEEE Transaction on PAMI 22(1) (2000) 63-84

    Google Scholar 

  8. Marinai, S., Gori, M., Soda, G.: Artificial neural networks for document analysis and recognition. IEEE Transactions on PAMI 27(1) (2005) 23-35

    Google Scholar 

  9. Ha, T., Bunke, H.: Image processing methods for document image analysis. In: Handbook of character recognition and document image analysis. World Scientific (1997) 1-47

    Google Scholar 

  10. Hornik, K., Stinchcombe, M., White, H.: Multilayer feedforward networks are universal approximators. Neural Networks 2 (1989) 359-366

    Article  Google Scholar 

  11. Taxt, T., Flynn, P.J., Jain, A.K.: Segmentation of document images. IEEE Transaction on PAMI 11(12) (1989) 1322-1329

    Google Scholar 

  12. Nagy, G., Seth, S.: Hierarchical representation of optically scanned documents. In: Int’l Conference on Pattern Recognition. (1984) 347-349

    Google Scholar 

  13. Watanabe, T., Luo, Q., Sugie, N.: Structure recognition methods for various types of documents. MVA 6(6) (1993) 163-176

    Article  Google Scholar 

  14. Nagy, G., Seth, S., Viswanathan, M.: A prototype document image analysis system for technical journals. IEEE Computer 25(7) (1992) 10-22

    Google Scholar 

  15. Kim, J.H., Kim, K.K., Suen, C.Y.: An HMM-MLP hybrid model for cursive script recognition. PAA 3(4) (2000) 314-324

    MathSciNet  Google Scholar 

  16. Gilloux, M., Lemarié, B., Leroux, M.: A hybrid radial basis function network/ hidden Markov model handwritten word recognition system. In: Int’l Conference on Document Analysis and Recognition. (1995) 394-397

    Google Scholar 

  17. Fu, L.M.: Neural networks in computer intelligence. McGraw-Hill, New York, NY (1994)

    Google Scholar 

  18. Jain, A.K., Zhong, Y.: Page segmentation using texture analysis. Pattern Recog- nition 29(5) (1996) 743-770

    Article  Google Scholar 

  19. Shih, F.Y., Chen, S.S.: Adaptive document block segmentation and classifica-tion. IEEE Trans. SMC 26(5) (1996) 797-802

    Google Scholar 

  20. Strouthopoulos, C., Papamarkos, N.: Text identification for document image analysis using a neural network. Image and Vision Computing 16(12/13) (1998) 879-896

    Article  Google Scholar 

  21. Wang, J., Jean, J.: Segmentation of merged characters by neural networks and shortest path. Pattern Recognition 27(5) (1994) 649-658

    Article  Google Scholar 

  22. Lu, Z.K., Chi, Z., Siu, W.C.: Length estimation of digits strings using a neu-ral network with structure based features. SPIE/IS&T Journal of Electronic Imaging 7(1) (1998) 79-85

    Article  Google Scholar 

  23. You, D., Kim, G.: An approach for locating segmentation points of handwritten digit strings using a neural network. In: Int’l Conference on Document Analysis and Recognition. (2003) 142-146

    Google Scholar 

  24. Pal, U., Sinha, S., Chaudhuri, B.: Multi-script line identification from indian documents. In: Int’l Conference on Document Analysis and Recognition. (2003) 880-884

    Google Scholar 

  25. Ishitani, Y.: Flexible and robust model matching based on association graph for form image understanding. Pattern Analysis and Applications 3(2) (2000) 104-119

    Article  Google Scholar 

  26. Dengel, A., Dubiel, F.: Clustering and classification of document structure -a machine learning approach-. In: Int’l Conference on Document Analysis and Recognition. (1995) 587-591

    Google Scholar 

  27. Cesarini, F., Lastri, M., Marinai, S., Soda, G.: Encoding of modified X-Y trees for document classification. In: Int’l Conference on Document Analysis and Recognition. (2001) 1131-1136

    Google Scholar 

  28. van Beusekom, J., Keysers, D., Shafait, F., Breuel, T.M.: Distance measures for layout-based document image retrieval. In: Proc. Second Int’l Workshop on Document Image Analysis for Libraries. (2006) 232-242

    Google Scholar 

  29. Kaski, S., Honkela, T., Lagus, K., Kohonen, T.: Websom self-organizing maps of document collections. Neurocomputing 21(1-3) (1998) 101-118

    Article  MATH  Google Scholar 

  30. LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proceedings of the IEEE 86(11) (1998) 2278-2324

    Article  Google Scholar 

  31. Simard, P.Y., Steinkraus, D., Platt, J.C.: Best practices for convolutional neural networks applied to visual document analysis. In: Int’l Conference on Document Analysis and Recognition. (2003) 958-963

    Google Scholar 

  32. Suzuki, M., Uchida, S., Nomura, A.: A ground-truthed mathematical character and symbol image database. In: Int’l Conference on Document Analysis and Recognition. (2005) 675-679

    Google Scholar 

  33. Pechwitz, M., Maddouri, S.S., Magner, V., Ellouze, N., Amiri, H.: IFN/ENIT- database of handwritten arabic words. In: 7th Colloque International Francophone sur l’Ecrit et le Document. (2002)

    Google Scholar 

  34. Ford, G., Thoma, G.: Ground truth data for document image analysis. In: Sym-posium on Document Image Understanding and Technology. (2003) 199-205

    Google Scholar 

  35. Marti, U., Bunke, H.: A full english sentence database for off-line handwriting recognition. In: Int’l Conference on Document Analysis and Recognition. (1999) 705-708

    Google Scholar 

  36. Antonacopoulos, A., Gatos, B., Bridson, D.: ICDAR 2005 page segmentation competition. In: Int’l Conference on Document Analysis and Recognition. (2005) 75-79

    Google Scholar 

  37. Margner, V., Pechwitz, M., Abed, H.E.: ICDAR 2005 arabic handwriting recog-nition competition. In: Int’l Conference on Document Analysis and Recognition. (2005) 70-74

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Marinai, S. (2008). Introduction to Document Analysis and Recognition. In: Marinai, S., Fujisawa, H. (eds) Machine Learning in Document Analysis and Recognition. Studies in Computational Intelligence, vol 90. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-76280-5_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-76280-5_1

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-76279-9

  • Online ISBN: 978-3-540-76280-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics