Skip to main content
  • Conference proceedings
  • © 2021

Document Analysis and Recognition – ICDAR 2021

16th International Conference, Lausanne, Switzerland, September 5–10, 2021, Proceedings, Part I

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 12821)

Part of the book sub series: Image Processing, Computer Vision, Pattern Recognition, and Graphics (LNIP)

Conference series link(s): ICDAR: International Conference on Document Analysis and Recognition

Conference proceedings info: ICDAR 2021.

Buying options

eBook USD 109.00
Price excludes VAT (USA)
  • ISBN: 978-3-030-86549-8
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
Softcover Book USD 139.99
Price excludes VAT (USA)

This is a preview of subscription content, access via your institution.

Table of contents (40 papers)

  1. Front Matter

    Pages i-xix
  2. Historical Document Analsyis 1

    1. Front Matter

      Pages 1-1
    2. Pho(SC)Net: An Approach Towards Zero-Shot Word Image Recognition in Historical Documents

      • Anuj Rai, Narayanan C. Krishnan, Sukalpa Chanda
      Pages 19-33
    3. Versailles-FP Dataset: Wall Detection in Ancient Floor Plans

      • Wassim Swaileh, Dimitrios Kotzinos, Suman Ghosh, Michel Jordan, Ngoc-Son Vu, Yaguan Qian
      Pages 34-49
    4. Context Aware Generation of Cuneiform Signs

      • Kai Brandenbusch, Eugen Rusakov, Gernot A. Fink
      Pages 65-79
    5. Adaptive Scaling for Archival Table Structure Recognition

      • Xiao-Hui Li, Fei Yin, Xu-Yao Zhang, Cheng-Lin Liu
      Pages 80-95
  3. Document Analysis Systems

    1. Front Matter

      Pages 97-97
    2. LGPMA: Complicated Table Structure Recognition with Local and Global Pyramid Mask Alignment

      • Liang Qiao, Zaisheng Li, Zhanzhan Cheng, Peng Zhang, Shiliang Pu, Yi Niu et al.
      Pages 99-114
    3. VSR: A Unified Framework for Document Layout Analysis Combining Vision, Semantics and Relations

      • Peng Zhang, Can Li, Liang Qiao, Zhanzhan Cheng, Shiliang Pu, Yi Niu et al.
      Pages 115-130
    4. LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis

      • Zejiang Shen, Ruochen Zhang, Melissa Dell, Benjamin Charles Germain Lee, Jacob Carlson, Weining Li
      Pages 131-146
    5. Understanding and Mitigating the Impact of Model Compression for Document Image Classification

      • Shoaib Ahmed Siddiqui, Andreas Dengel, Sheraz Ahmed
      Pages 147-159
    6. Hierarchical and Multimodal Classification of Images from Soil Remediation Reports

      • Korlan Rysbayeva, Romain Giot, Nicholas Journet
      Pages 160-175
    7. Competition and Collaboration in Document Analysis and Recognition

      • Daniel Lopresti, George Nagy
      Pages 176-187
  4. Handwriting Recognition

    1. Front Matter

      Pages 189-189
    2. 2D Self-attention Convolutional Recurrent Network for Offline Handwritten Text Recognition

      • Nam Tuan Ly, Hung Tuan Nguyen, Masaki Nakagawa
      Pages 191-204
    3. Mix-Up Augmentation for Oracle Character Recognition with Imbalanced Data Distribution

      • Jing Li, Qiu-Feng Wang, Rui Zhang, Kaizhu Huang
      Pages 237-251

About this book

This four-volume set of LNCS 12821, LNCS 12822, LNCS 12823 and LNCS 12824, constitutes the refereed proceedings of the 16th International Conference on Document Analysis and Recognition, ICDAR 2021, held in Lausanne, Switzerland in September 2021. The 182 full papers were carefully reviewed and selected from 340 submissions, and are presented with 13 competition reports.

The papers are organized into the following topical sections: historical document analysis, document analysis systems, handwriting recognition, scene text detection and recognition, document image processing, natural language processing (NLP) for document understanding, and graphics, diagram and math recognition.


Keywords

  • artificial intelligence
  • character recognition
  • computational linguistics
  • computer science
  • computer systems
  • computer vision
  • data mining
  • databases
  • image analysis
  • image processing
  • image segmentation
  • information retrieval
  • linguistics
  • machine learning
  • Natural Language Processing (NLP)
  • natural languages
  • optical character recognition
  • pattern recognition
  • semantics
  • text processing

Editors and Affiliations

  • Universitat Autònoma de Barcelona, Barcelona, Spain

    Josep Lladós

  • Lehigh University, Bethlehem, USA

    Daniel Lopresti

  • Kyushu University, Fukuoka-shi, Japan

    Seiichi Uchida

Bibliographic Information

Buying options

eBook USD 109.00
Price excludes VAT (USA)
  • ISBN: 978-3-030-86549-8
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
Softcover Book USD 139.99
Price excludes VAT (USA)