Handwritten Artefact Identification Method for Table Interpretation with Little Use of Previous Knowledge

  • Luiz Antônio Pereira Neves
  • João Marques de Carvalho
  • Jacques Facon
  • Flávio Bortolozzi
  • Sérgio Aparecido Ignácio
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3872)

Abstract

An artefact identification method for handwritten filled table-forms is presented. Artefacts in table-forms are smudges and overlaps between handwritten data and line segments which increase the complexity of table-form interpretation. After reviewing some knowledge-based methods, a novel artefact identification method to improve table-form interpretation is presented. The proposed method aims to detect, identify and remove table-form artefacts with little use of previous knowledge. Experiments show the significance of using the proposed artefact identification method to improve table-form interpretation rates.

References

  1. 1.
    Arias, J.F., Chhabra, A., Misra, V.: Finding Straight Lines in Drawings. In: ICDAR 1997. IEEE, Proceedings of the Fourth International Conference on Document Analysis and Recognition, pp. 788–791 (1997)Google Scholar
  2. 2.
    Arias, J.F., Kasturi, R., Chhabra, A.: Efficient Techniques for Telephone Company Line Drawing Interpretation. In: Proceedings of the Third International Conference on Document Analysis and Recognition. ICDAR 1995, pp. 795–798. IEEE, Los Alamitos (1995)CrossRefGoogle Scholar
  3. 3.
    Couasnon, B.: Dmos: A generic document recognition method, application to an automatic generator of musical scores, mathematical formulae and table structures recognition systems. In: Proceedings of the Sixth International Conference on Document Analysis and Recognition. ICDAR 2001, pp. 215–220 (2001)Google Scholar
  4. 4.
    Fan, K.C., Lu, J.M., Wang, L.S., Liao, H.Y.: Extraction of characters from form documents by feature point clustering. Pattern Recognition Letters (1995)Google Scholar
  5. 5.
    Hu, J., Kashi, R.S., Lopresti, D., Wilfong, G.T.: Evaluating the performance of table processing algorithms. International Journal on Document Analysis and Recognition 4, 140–153 (2002)CrossRefGoogle Scholar
  6. 6.
    Thom, R.T.V.: Modélisation de Tableaux pour le traitement Automatique des Formulaires. Laboratoire PSI, Université de Rouen (1997)Google Scholar
  7. 7.
    Watanabe, T., Luo, Q., Sugie, N.: Structure recognition methods for various types of documents. Machine Vision and Applications (1993)Google Scholar
  8. 8.
    Watanabe, T., Luo, Q., Sugie, N.: Layout recognition of multi-kinds of table-form documents. IEEE Transactions on Pattern Analysis and Machine Intelligence (1995)Google Scholar
  9. 9.
    Kieninger, T., Dengel, A.: The t-recs table recognition and analysis system. In: Lee, S.-W., Nakano, Y. (eds.) DAS 1998. LNCS, vol. 1655, pp. 255–269. Springer, Heidelberg (1999)CrossRefGoogle Scholar
  10. 10.
    Liang, J., Ha, J., Haralick, R.M., Phillips, I.T.: Document layout structure extraction using bounding boxes of different entities. In: Proceedings of the Third IEEE Workshop on Applications of Computer Vision. WACV 1996, pp. 278–283 (1996)Google Scholar
  11. 11.
    Hori, O., Doermann, D.S.: Robust table-form structure analysis based on box-driven reasoning. In: Proceedings of the Third International Conference on Document Analysis and Recognition. ICDAR 1995, pp. 218–221 (1995)Google Scholar
  12. 12.
    Hirano, T., Okada, Y., Yoda, F.: Field extraction method from existing forms transmitted by facsimile. In: Proceedings of the Sixth International Conference on Document Analysis and Recognition. ICDAR 2001, pp. 738–742 (2001)Google Scholar
  13. 13.
    Shinjo, H., Hadano, E., Marukawa, K., Shima, Y., Sako, H.: A recursive analysis for form cell recognition. In: Proceedings of the Sixth International Conference on Document Analysis and Recognition. ICDAR 2001 (2001)Google Scholar
  14. 14.
    Shimotsuji, S., Asano, M.: Form Identification based on Cell Structure. In: Proceedings of the 12th IAPR International Conference on Pattern Recognition. ICPR 1996, pp. 793–797. IEEE, Los Alamitos (1996)CrossRefGoogle Scholar
  15. 15.
    Pizano, A.: Extracting line features from images of business forms and tables. In: Proceedings of the 11th International Conference on Pattern Recognition. IAPR, vol. 3, pp. 399–403 (1992)Google Scholar
  16. 16.
    Tukey, J.W.: Exploratory Data Analysis. Addison-Wesley, Reading (1977)MATHGoogle Scholar
  17. 17.
    Kazmier, L.J.: Estatística Aplicada a Economia e Administração. Editora McGraw-Hill do Brasil, São Paulo - SP (1982)Google Scholar
  18. 18.
    Neves, L.A.P.: Extração de células de dados manuscritos em tabelas. Master’s thesis, Pontifícia Universidade Católica do Paraná - PUCPR (1999)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Luiz Antônio Pereira Neves
    • 1
  • João Marques de Carvalho
    • 1
  • Jacques Facon
    • 2
  • Flávio Bortolozzi
    • 2
  • Sérgio Aparecido Ignácio
    • 2
  1. 1.UFCG-Universidade Federal de Campina GrandeCampina GrandeBrasil
  2. 2.PUCPR-Pontifícia Universidade Católica do ParanáCuritibaBrazil

Personalised recommendations