White-Box Evaluation of Computer Vision Algorithms through Explicit Decision-Making

Zanibbi, Richard; Blostein, Dorothea; Cordy, James R.

doi:10.1007/978-3-642-04667-4_30

Richard Zanibbi¹⁹,
Dorothea Blostein²⁰ &
James R. Cordy²⁰

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 5815))

Included in the following conference series:

International Conference on Computer Vision Systems

1901 Accesses
2 Citations

Abstract

Traditionally computer vision and pattern recognition algorithms are evaluated by measuring differences between final interpretations and ground truth. These black-box evaluations ignore intermediate results, making it difficult to use intermediate results in diagnosing errors and optimization. We propose “opening the box,” representing vision algorithms as sequences of decision points where recognition results are selected from a set of alternatives. For this purpose, we present a domain-specific language for pattern recognition tasks, the Recognition Strategy Language (RSL). At run-time, an RSL interpreter records a complete history of decisions made during recognition, as it applies them to a set of interpretations maintained for the algorithm. Decision histories provide a rich new source of information: recognition errors may be traced back to the specific decisions that caused them, and intermediate interpretations may be recovered and displayed. This additional information also permits new evaluation metrics that include false negatives (correct hypotheses that the algorithm generates and later rejects), such as the percentage of ground truth hypotheses generated (historical recall), and the percentage of generated hypotheses that are correct(historical precision). We illustrate the approach through an analysis of cell detection in two published table recognition algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Cesarini, F., Marinai, S., Sarti, L., Soga, G.: Trainable Table Location in Document Images. In: Proc. Sixteenth Int’l Conf. Pattern Recognition, vol. 3, pp. 236–240 (2002)
Google Scholar
Competitions in Document Image Analysis Organized by ICDAR and GREC (2008), http://www.icdar2007.org/competition.html
Cordy, J.: The TXL Source Transformation Language. Science of Computer Programming 61(3), 190–210 (2006)
Article MATH MathSciNet Google Scholar
Embley, D., Hurst, M., Lopresti, D., Nagy, G.: Table-processing Paradigms: A Research Survey. Int’l J. Document Analysis and Recognition 8, 66–86 (2006)
Article Google Scholar
Handley, J.: Table Analysis for Multi-line Cell Identification. In: Proc. Doc. Rec. and Retrieval VIII (IS&T/SPIE Electronic Imaging), vol. 4307, pp. 34–43 (2001)
Google Scholar
Hu, J., Kashi, R., Lopresti, D., Nagy, G., Wilfong, G.: Why Table Ground-truthing is Hard. In: Proc. Sixth Int’l Conf. Document Analysis and Recognition, pp. 129–133 (2001)
Google Scholar
Hu, J., Kashi, R., Lopresti, D., Wilfong, G.: Table Structure Recognition and its Evaluation. In: Proc. Document Rec. and Retrieval VIII, SPIE, vol. 4307, pp. 44–55 (2001)
Google Scholar
Hu, J., Kashi, R., Lopresti, D., Wilfong, G.: Evaluating the Performance of Table Processing Algorithms. Int’l J. Document Analysis and Recognition 4(3), 140–153 (2002)
Article Google Scholar
Hurst, M.: Towards a Theory of Tables. Int’l J. Document Analysis and Recognition 8, 123–131 (2006)
Article Google Scholar
Kieninger, T., Dengel, A.: Applying the T-RECS Table Recognition System to the Business Letter Domain. In: Proc. Sixth ICDAR, pp. 518–522 (2001)
Google Scholar
Liang, J.: Document Structure Analysis and Performance Evaluation. PhD dissertation, Univ. Washington (1999)
Google Scholar
Lopresti, D.: Exploiting WWW Resources in Experimental Document Analysis Research. In: Lopresti, D.P., Hu, J., Kashi, R.S. (eds.) DAS 2002. LNCS, vol. 2423, p. 532. Springer, Heidelberg (2002)
Chapter Google Scholar
Lopresti, D.P., Nagy, G.: A tabular survey of automated table processing. In: Chhabra, A.K., Dori, D. (eds.) GREC 1999. LNCS, vol. 1941, pp. 93–120. Springer, Heidelberg (2000)
Chapter Google Scholar
Lopresti, D., Wilfong, G.: Evaluating Document Analysis Results via Graph Probing. In: Proc. Sixth Int’l Conf. Document Analysis and Recognition, pp. 116–120 (2001)
Google Scholar
Chen, P.S., Haralick, R.: CD-ROM Document Database Standard. In: Proc. Second Int’l Conf. Document Analysis and Recognition, pp. 478–483 (1993)
Google Scholar
Phillips, Chhabra, A.: Empirical Performance Evaluation of Graphics Recognition Systems. IEEE Trans. Pattern Analysis and Machine Intelligence 21(9), 849–870 (1999)
Article Google Scholar
Silva, A.E., Jorge, A., Torgo, L.: Design of an End-to-end Method to Extract Information from Tables. Int’l J. Document Analysis and Recognition 8, 144–171 (2006)
Article Google Scholar
Wang, X.: Tabular Abstraction, Editing and Formatting. PhD dissertation, Univ. Waterloo, Canada (1996)
Google Scholar
Wenyin, L., Dori, D.: A Protocol for Performance Evaluation of Line Detection Algorithms. Machine Vision and Applications 9(5/6), 240–250 (1997)
Article Google Scholar
Zanibbi, R.: A Language for Specifying and Comparing Table Recognition Strategies. PhD dissertation, School of Computing, Queen’s Univ., Canada (2004)
Google Scholar
Zanibbi, R., Blostein, D., Cordy, J.R.: A Survey of Table Recognition: Models, Observations, Transformations, and Inferences. IJDAR 7(1), 1–16 (2004)
Article Google Scholar
Zanibbi, R., Blostein, D., Cordy, J.R.: Historical Recall and Precision: Summarizing Generated Hypotheses. In: Proc. Eighth ICDAR, vol. 2, pp. 202–206 (2005)
Google Scholar
Zanibbi, R., Blostein, D., Cordy, J.R.: Decision-Based Specification and Comparison of Table Recognition Algorithms. In: Machine Learning in Document Analysis and Recognition, pp. 71–103. Springer, Heidelberg (2008)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Dept. Computer Science, Rochester Institute of Technology, Rochester, NY, 14623-5608
Richard Zanibbi
School of Computing, Queen’s University, Kingston, Ontario, Canada, K7L 3N6
Dorothea Blostein & James R. Cordy

Authors

Richard Zanibbi
View author publications
You can also search for this author in PubMed Google Scholar
Dorothea Blostein
View author publications
You can also search for this author in PubMed Google Scholar
James R. Cordy
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

ICSI & UC Berkeley, 1947 Center Street, Suite 600, 94704, Berkeley, CA, USA
Mario Fritz
Department of Computer Science, TU Darmstadt, Hochschulstrasse 10, 64289, Darmstadt, Germany
Bernt Schiele
Department of Electrical Engineering and Computer Science, Montefiore Institute, University of Liège, B28, Sart Tilman Campus, Grand Traverse 10, 4000, Liège, Belgium
Justus H. Piater

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zanibbi, R., Blostein, D., Cordy, J.R. (2009). White-Box Evaluation of Computer Vision Algorithms through Explicit Decision-Making. In: Fritz, M., Schiele, B., Piater, J.H. (eds) Computer Vision Systems. ICVS 2009. Lecture Notes in Computer Science, vol 5815. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04667-4_30

Download citation

DOI: https://doi.org/10.1007/978-3-642-04667-4_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04666-7
Online ISBN: 978-3-642-04667-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics