Table Structure Recognition in Scanned Images Using a Clustering Method

Van Nguyen, Nam; Vu, Hanh; Zucker, Arthur; Belkada, Younes; Van Do, Hai; Nguyen, Doanh Ngoc-; Nguyen Le, Thanh Tuan; Van Hoang, Dong

doi:10.1007/978-3-030-63083-6_12

Nam Van Nguyen¹⁷,
Hanh Vu¹⁸,
Arthur Zucker¹⁹,
Younes Belkada¹⁹,
Hai Van Do¹⁷,
Doanh Ngoc- Nguyen¹⁷,
Thanh Tuan Nguyen Le¹⁷ &
…
Dong Van Hoang¹⁷

Part of the book series: Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering ((LNICST,volume 334))

Included in the following conference series:

International Conference on Industrial Networks and Intelligent Systems

527 Accesses
1 Citations

Abstract

Optical Character Recognition (OCR) for scanned paper invoices is very challenging due to the variability of 19 invoice layouts, different information fields, large data tables, and low scanning quality. In this case, table structure recognition is a critical task in which all rows, columns, and cells must be accurately positioned and extracted. Existing methods such as DeepDeSRT, TableNet only dealt with high-quality born-digital images (e.g., PDF) with low noise and apparent table structure. This paper proposes an efficient method called CluSTi (Clustering method for recognition of the Structure of Tables in invoice scanned Images). The contributions of CluSTi are three-fold. Firstly, it removes heavy noises in the table images using a clustering algorithm. Secondly, it extracts all text boxes using state-of-the-art text recognition. Thirdly, based on the horizontal and vertical clustering algorithm with optimized parameters, CluSTi groups the text boxes into their correct rows and columns, respectively. The method was evaluated on three datasets: i) 397 public scanned images; ii) 193 PDF document images from ICDAR 2013 competition dataset; and iii) 281 PDF document images from ICDAR 2019’s numeric tables. The evaluation results showed that CluSTi achieved an \(\textit{F}_1\textit{-score}\) of 87.5%, 98.5%, and 94.5%, respectively. Our method also outperformed DeepDeSRT with an \(\textit{F}_1\textit{-score}\) of 91.44% on only 34 images from the ICDAR 2013 competition dataset. To the best of our knowledge, CluSTi is the first method to tackle the table structure recognition problem on scanned images.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

ClusTi: Clustering Method for Table Structure Recognition in Scanned Images

Article 30 April 2021

Table Structure Recognition Using Top-Down and Bottom-Up Cues

A Clustering Approach Combining Lines and Text Detection for Table Extraction

References

Table-detection-dataset. https://github.com/sgrpanchal31/table-detection-dataset
Baek, Y., Lee, B., Han, D., Yun, S., Lee, H.: Character region awareness for text detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 9365–9374 (2019)
Google Scholar
Clinchant, S., Déjean, H., Meunier, J.L., Lang, E.M., Kleber, F.: Comparing machine learning approaches for table recognition in historical register books. In: 2018 13th IAPR International Workshop on Document Analysis Systems (DAS), pp. 133–138. IEEE (2018)
Google Scholar
Deng, D., Liu, H., Li, X., Cai, D.: Pixellink: detecting scene text via instance segmentation. In: 32nd AAAI Conference on Artificial Intelligence (2018)
Google Scholar
Ester, M., Kriegel, H.P., Sander, J., Xu, X., et al.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: KDD, pp. 226–231 (1996)
Google Scholar
Farahmand, A., Sarrafzadeh, H., Shanbehzadeh, J.: Document image noises and removal methods. In: Proceedings of the International MultiConference of Engineers and Computer Scientists, pp. 436–440. Newswood Ltd. (2013)
Google Scholar
Hartigan, J.A., Wong, M.A.: Algorithm as 136: a k-means clustering algorithm. J. Roy. Stat. Soc.: Ser. C (Appl. Stat.) 28(1), 100–108 (1979)
MATH Google Scholar
He, T., Tian, Z., Huang, W., Shen, C., Qiao, Y., Sun, C.: An end-to-end textspotter with explicit alignment and attention. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5020–5029 (2018)
Google Scholar
He, W., Zhang, X.Y., Yin, F., Liu, C.L.: Deep direct regression for multi-oriented scene text detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 745–753 (2017)
Google Scholar
Hu, J., Kashi, R.S., Lopresti, D.P., Wilfong, G.: Table structure recognition and its evaluation. In: Document Recognition and Retrieval VIII, vol. 4307, pp. 44–55. International Society for Optics and Photonics (2000)
Google Scholar
Kboubi, F., Chabi, A.H., Ahmed, M.B.: Table recognition evaluation and combination methods. In: 8th International Conference on Document Analysis and Recognition (ICDAR 2005), pp. 1237–1241. IEEE (2005)
Google Scholar
Liu, W., et al.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2
Chapter Google Scholar
Liu, X., Liang, D., Yan, S., Chen, D., Qiao, Y., Yan, J.: FOTS: fast oriented text spotting with a unified network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5676–5685 (2018)
Google Scholar
Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431–3440 (2015)
Google Scholar
Paliwal, S.S., Vishwanath, D., Rahul, R., Sharma, M., Vig, L.: Tablenet: deep learning model for end-to-end table detection and tabular data extraction from scanned document images. In: 2019 International Conference on Document Analysis and Recognition (ICDAR), pp. 128–133. IEEE (2019)
Google Scholar
Qasim, S.R., Mahmood, H., Shafait, F.: Rethinking table recognition using graph neural networks. In: 2019 International Conference on Document Analysis and Recognition (ICDAR), pp. 142–147. IEEE (2019)
Google Scholar
Rashid, S.F., Akmal, A., Adnan, M., Aslam, A.A., Dengel, A.: Table recognition in heterogeneous documents using machine learning. In: 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 1, pp. 777–782. IEEE (2017)
Google Scholar
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, pp. 91–99 (2015)
Google Scholar
Sasaki, Y., et al.: The truth of the f-measure. Teach Tutor mater 1(5), 1–5 (2007)
Google Scholar
Satopaa, V., Albrecht, J., Irwin, D., Raghavan, B.: Finding a “kneedle” in a haystack: detecting knee points in system behavior. In: 2011 31st International Conference on Distributed Computing Systems Workshops, pp. 166–171. IEEE (2011)
Google Scholar
Scholkmann, F., Boss, J., Wolf, M.: An efficient algorithm for automatic peak detection in noisy periodic and quasi-periodic signals. Algorithms 5(4), 588–603 (2012)
Article Google Scholar
Schreiber, S., Agne, S., Wolf, I., Dengel, A., Ahmed, S.: Deepdesrt: deep learning for detection and structure recognition of tables in document images. In: 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 1, pp. 1162–1167. IEEE (2017)
Google Scholar
Soille, P.: Morphological Image Analysis: Principles and Applications. Springer Science & Business Media, Heidelberg (2013)
MATH Google Scholar
Sudana, O., Putra, D., Sudarma, M., Hartati, R.S., Wirdiani, A.: Image clustering of complex balinese character with dbscan algorithm. J. Eng. Technol. 6(1), 548–558 (2018)
Google Scholar
Xu, R., Wunsch, D.: Survey of clustering algorithms. IEEE Trans. Neural Netw. 16(3), 645–678 (2005)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Thuyloi University, 175 Tay Son, Dong Da, Hanoi, Vietnam
Nam Van Nguyen, Hai Van Do, Doanh Ngoc- Nguyen, Thanh Tuan Nguyen Le & Dong Van Hoang
Viettel CyberSpace Center, 41st floor, Keangnam Landmark 72, Hanoi, Vietnam
Hanh Vu
Sorbonne University, Polytech Sorbonne, 75005, Paris, France
Arthur Zucker & Younes Belkada

Authors

Nam Van Nguyen
View author publications
You can also search for this author in PubMed Google Scholar
Hanh Vu
View author publications
You can also search for this author in PubMed Google Scholar
Arthur Zucker
View author publications
You can also search for this author in PubMed Google Scholar
Younes Belkada
View author publications
You can also search for this author in PubMed Google Scholar
Hai Van Do
View author publications
You can also search for this author in PubMed Google Scholar
Doanh Ngoc- Nguyen
View author publications
You can also search for this author in PubMed Google Scholar
Thanh Tuan Nguyen Le
View author publications
You can also search for this author in PubMed Google Scholar
Dong Van Hoang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Nam Van Nguyen .

Editor information

Editors and Affiliations

Faculty of Electrical and Electronics Engineering, Duy Tan University, Da Nang, Vietnam
Nguyen-Son Vo
Le Quy Don Technical University, Hanoi, Vietnam
Van-Phuc Hoang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Van Nguyen, N. et al. (2020). Table Structure Recognition in Scanned Images Using a Clustering Method. In: Vo, NS., Hoang, VP. (eds) Industrial Networks and Intelligent Systems. INISCOM 2020. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 334. Springer, Cham. https://doi.org/10.1007/978-3-030-63083-6_12

Download citation

DOI: https://doi.org/10.1007/978-3-030-63083-6_12
Published: 21 November 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-63082-9
Online ISBN: 978-3-030-63083-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Table Structure Recognition in Scanned Images Using a Clustering Method

Abstract

Access this chapter

Similar content being viewed by others

ClusTi: Clustering Method for Table Structure Recognition in Scanned Images

Table Structure Recognition Using Top-Down and Bottom-Up Cues

A Clustering Approach Combining Lines and Text Detection for Table Extraction

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Table Structure Recognition in Scanned Images Using a Clustering Method

Abstract

Access this chapter

Similar content being viewed by others

ClusTi: Clustering Method for Table Structure Recognition in Scanned Images

Table Structure Recognition Using Top-Down and Bottom-Up Cues

A Clustering Approach Combining Lines and Text Detection for Table Extraction

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation