Advertisement

Table of contents

  1. Front Matter
    Pages i-xxi
  2. Introduction, Background, Fundamentals

    1. Front Matter
      Pages 1-2
    2. Professor Henry S. Baird
      Pages 3-10
    3. PhDElisa H. Barney Smith
      Pages 11-61
    4. Henry S. Baird, Karl Tombre
      Pages 63-71
  3. Page Analysis

    1. Front Matter
      Pages 133-134
    2. Andreas Dengel, Faisal Shafait
      Pages 177-222
    3. Simone Marinai
      Pages 223-253
  4. Text Recognition

    1. Front Matter
      Pages 255-256
    2. Nicola Nobile, Ching Y. Suen
      Pages 257-290
    3. Umapada Pal, Niladri Sekhar Dash
      Pages 291-330
    4. Huaigu Cao, Prem Natarajan
      Pages 331-358
    5. Sergey Tulyakov, Venu Govindaraju
      Pages 359-389
    6. Volkmar Frinken, Horst Bunke
      Pages 391-425
    7. Abdel Belaı̈d, Mohamed Imran Razzak
      Pages 427-457
    8. Srirangaraj Setlur, Zhixin Shi
      Pages 459-486
  5. Processing of Non-textual Information

    1. Front Matter
      Pages 487-488
    2. Josep Lladós, Marçal Rusiñol
      Pages 489-521

About this book

Introduction

The Handbook of Document Image Processing and Recognition provides a consistent, comprehensive resource on the available methods and techniques in document image processing and recognition. It includes unified comparison and contrast analysis of algorithms in standard table formats. Thus, it educates the reader in order to help them to make informed decisions on their particular problems.

The handbook is divided into several parts. Each part starts with an introduction written by the two editors. These introductions set the general framework for the main topic of each part and introduces the contribution of each chapter within the framework. The introductions are followed by several chapters written by established experts of the field.

Each chapter provides the reader with a clear overview of the topic and of the state of the art in techniques used (including elements of comparison between them). Each chapter is structured in the same way: It starts with an introductory text, concludes with a summary of the main points addressed in the chapter and ends with a comprehensive list of references. Whenever appropriate, the authors include specific sections describing and pointing to consolidated software and/or reference datasets. Numerous cross-references between the chapters ensure this is a truly integrated work, without unnecessary duplications and overlaps between chapters.

This reference work is intended for the use by a wide audience of readers from around the world such as graduate students, researchers, librarians, lecturers, professionals, and many other people.

Keywords

Computer Vision Document Image Processing Human Computer Interaction Pattern Recognition

Editors and affiliations

  • David Doermann
    • 1
  • Karl Tombre
    • 2
  1. 1.University of MarylandCollege ParkMDUSA
  2. 2.Université de LorraineNancyFrance

About the editors

Dr. David Doermann is Senior Research Scientist and Director of the Laboratory for Language and Media Processing at the University of Maryland Institute for Advanced Computer Studies, College Park, MD, USA. He is also President and co-founder of Applied Media Analysis, Inc., and Editor-in-Chief of the International Journal on Document Analysis and Recognition.

Dr. Karl Tombre is Professor at Université de Lorraine, France, one of the lagest French universities, where he currently is vice-president in charge of partnerships and international affairs. He was one of the founders, and for many years an editor-in-chief of the International Journal on Document Analysis and Recognition. From 2007 to 2012 he was director of the Inria Nancy - Grand Est research center, a large public research center in computer science and applied mathematics. From 2006 to 2008 he was President of the International Association for Pattern Recognition (IAPR).

Bibliographic information

Reviews

From the book reviews:

“This edited compendium of chapters represents the largest effort to date to bring together the breadth and depth of image processing research for document text extraction, segmentation of document image into picture and text zones, and general optical character recognition (OCR) of the international family of foreign languages. … will appeal to the widest audience possible, including academicians, practitioners, library science and legal professionals, and all who are interested in the efficient storage and retrieval of vast numbers of documents.” (R. Goldberg, Computing Reviews, October, 2014)