A method of table detection in metafiles
A method is proposed for detecting statistical tables that takes metafiles as input data, which allows it to be applied to documents of different formats. In this method, table detection is treated as a bottom-up segmentation of a document page, i.e., segmentation that proceeds from the simple elements of a page to more complicated ones. The experimental evaluation of the method shows that it is efficient when applied to a wide class of statistical tables.
Key words: document analysis and recognition, table extraction from documents, table detection
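The abstract does not spell out the individual segmentation levels, so the following is only an illustrative sketch of what a bottom-up pass over metafile text records might look like, not the authors' algorithm. It assumes that each text record yields a word with a bounding box; the `Word` type, the function names, and the thresholds `y_tol` and `gap` are all hypothetical. Words are merged into lines, lines are split into cells at wide horizontal gaps, and runs of consecutive lines with the same number of cells are reported as table candidates.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Word:
    """Hypothetical text fragment with its bounding box (page coordinates)."""
    text: str
    x0: float
    y0: float
    x1: float
    y1: float


def group_into_lines(words, y_tol=2.0):
    """Level 1: merge words whose top edges lie within y_tol into text lines."""
    lines = []
    for w in sorted(words, key=lambda w: (w.y0, w.x0)):
        if lines and abs(lines[-1][0].y0 - w.y0) <= y_tol:
            lines[-1].append(w)
        else:
            lines.append([w])
    return [sorted(line, key=lambda w: w.x0) for line in lines]


def split_into_cells(line, gap=15.0):
    """Level 2: start a new cell wherever the horizontal gap exceeds `gap`."""
    cells = [[line[0]]]
    for prev, cur in zip(line, line[1:]):
        if cur.x0 - prev.x1 > gap:
            cells.append([cur])
        else:
            cells[-1].append(cur)
    return cells


def detect_tables(words, min_cols=2, min_rows=2):
    """Level 3: collect runs of consecutive lines with the same cell count."""
    rows = [split_into_cells(line) for line in group_into_lines(words)]
    tables, run = [], []
    for row in rows:
        if len(row) >= min_cols and (not run or len(row) == len(run[-1])):
            run.append(row)
        else:
            if len(run) >= min_rows:
                tables.append(run)
            run = [row] if len(row) >= min_cols else []
    if len(run) >= min_rows:
        tables.append(run)
    return tables


# Made-up sample page: a heading, two table rows, and a footnote.
sample = [
    Word("Population", 10, 10, 60, 20), Word("by", 64, 10, 74, 20),
    Word("region", 78, 10, 110, 20),
    Word("North", 10, 30, 40, 40), Word("120", 100, 30, 120, 40),
    Word("135", 160, 30, 180, 40),
    Word("South", 10, 45, 40, 55), Word("98", 100, 45, 115, 55),
    Word("101", 160, 45, 180, 55),
    Word("Source:", 10, 70, 45, 80), Word("census", 49, 70, 85, 80),
]
tables = detect_tables(sample)
first_row = [[w.text for w in cell] for cell in tables[0][0]]
```

On this sample the heading and the footnote each collapse into a single cell and are skipped, while the two middle lines form one table candidate. A practical detector would also have to handle ruling lines, multi-line cells, and column alignment, which this sketch deliberately ignores.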