Advertisement

General diagram-recognition methodologies

  • Dorothea Blostein
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1072)

Abstract

The field of diagram recognition faces many challenges, including the great diversity in diagrammatic notations, and the presence of noise and ambiguity during the recognition process. To help address these problems, research is needed into methods for acquiring, representing, and exploiting notational conventions. We review several frameworks for diagram recognition: blackboard systems, schema-based systems, syntactic methods, and graph rewriting. Next we discuss the need for a computationally-relevant characterization of diagrammatic notations, the need to exploit soft constraints during diagram recognition, and the possibility that diagram generators may provide a useful source of information about notational conventions.

Keywords

diagram recognition notational conventions diagrammatic notations contextual information diagram classification blackboard systems schema-based systems graph rewriting 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. [Ande77]
    R. Anderson, “Two Dimensional Mathematical Notation,” in Syntactic Pattern Recognition, Applications, K. S. Fu editor, Springer 1977, pp. 147–177.Google Scholar
  2. [BaIt94]
    H. Baird and D. Ittner, “Data Structures for Page Readers” Proc. IAPR Workshop on Document Analysis Systems, Kaiserslautern, Germany, Oct. 1994, pp. 323–334.Google Scholar
  3. [Belk94]
    A. Belkin, “Macintosh Notation Software: Present and Future,” Computer Music Journal, Vol. 18, No. 1, pp. 53–69, Spring 1994.Google Scholar
  4. [Bert83]
    J. Bertin, Semiology of Graphics: Diagrams, Networks, and Maps, University of Wisconsin Press, 1983.Google Scholar
  5. [BlBa92]
    D. Blostein and H. Baird, “A Critical Survey of Music Image Analysis,” in Structured Document Image Analysis, Eds. H. Baird, H. Bunke, and K. Yamamoto, Springer Verlag, 1992, pp. 405–434.Google Scholar
  6. [BlFG95]
    D. Blostein, H. Fahmy, and A. Grbavec, “Practical Use of Graph Rewriting,” Technical Report No. 95-373, Computing and Information Science, Queen's University, January, 1995.Google Scholar
  7. [BlGr95]
    D. Blostein, A. Grbavec, “Recognition of Mathematical Notation,” in Handbook of Character Recognition and Document Image Analysis, Eds. H. Bunke and P. Wang, World Scientific, to appear.Google Scholar
  8. [Bunk82]
    H. Bunke, “Attributed Programmed Graph Grammars and Their Application to Schematic Diagram Interpretation,” IEEE Trans. Pattern Analysis and Machine Intelligence 4(6), pp. 574–582, Nov. 1982.Google Scholar
  9. [Chou89]
    P. Chou, “Recognition of Equations Using a Two-Dimensional Stochastic Context-Free Grammar,” Proc. SPIE Visual Communications and Image Processing IV, Philadelphia PA, pp. 852–863, Nov. 1989.Google Scholar
  10. [CoTV93]
    S. Collin, K. Tombre, and P. Vaxiviere, “Don't Tell Mom I'm Doing Document Analysis; She Believes I'm in the Computer Vision Field,” Proc. Second Intl. Conf. on Document Analysis and Recognition, Tsukuba, Japan, Oct. 1993, pp. 619–622.Google Scholar
  11. [DiCM91]
    Y. Dimitriadis, J. Coronado, and C. de la Maza, “A New Interactive Mathematical Editor, Using On-line Handwritten Symbol Recognition, and Error Detection-Correction with an Attribute Grammar,” in Proc. First Intl. Conf. on Document Analysis and Recognition, Saint Malo, France, September 1991, pp. 242–250.Google Scholar
  12. [FaBl93]
    H. Fahmy and D. Blostein, “A Graph Grammar Programming Style for Recognition of Music Notation,” Machine Vision and Applications, Vol. 6, No. 2, pp. 83–99, 1993.Google Scholar
  13. [Fahm95]
    H. Fahmy, “Reasoning in the Presence of Uncertainty via Graph Rewriting,” PhD Thesis, Computing and Information Science, Queen's University, March 1995. (TR 95-382)Google Scholar
  14. [Fu82]
    K. S. Fu, Syntactic Pattern Recognition and Applications, Prentice Hall 1982.Google Scholar
  15. [GrBl95]
    A. Grbavec and D. Blostein, “Mathematics Recognition Using Graph Rewriting,” Third International Conference on Document Analysis and Recognition, Montreal, Canada, August 1995.Google Scholar
  16. [HaMa83]
    W. Havens and A. Mackworth, “Representing Knowledge of the Visual World,” IEEE Computer, October 1983, pp. 90–96.Google Scholar
  17. [JaDB89]
    V. Jagannathan, R. Dodhiawala, L. Baum, Editors, Blackboard Architectures and Applications, Academic Press, 1989.Google Scholar
  18. [JoPr92]
    S. Joseph and T. Pridmore, “Knowledge-Directed Interpretation of Mechanical Engineering Drawings,” IEEE PAMI, Vol. 14, No. 9, Sept. 1992, pp. 928–940.Google Scholar
  19. [KaIn90a]
    H. Kato and S. Inokuchi, “The Recognition System of Printed Piano Music using Musical Knowledge and Constraints,” Proc. IAPR Workshop on Syntactic and Structural Pattern Recognition., Murray Hill NJ, June 1990, pp. 231–248.Google Scholar
  20. [KaIn90b]
    H. Kato and S. Inokuchi, “The Recognition Method for Roughly Hand-Drawn Logical Diagrams Based on Utilization of Multi-Layered Knowledge,” Proc. 10th Intl. Conf. on Pattern Recognition, Atlantic City NJ, June 1990, pp. 443–473.Google Scholar
  21. [Knut79]
    D. Knuth, “Mathematical Typography,” Bulletin of the American Mathematical Society, Vol. 1, No. 2, March 1979.Google Scholar
  22. [KoCh94]
    G. Kopec and P. Chou, “Document Image Decoding Using Markov Source Models,” IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 16, No. 6, June 1994, pp. 602–617.Google Scholar
  23. [LSMS85]
    X. Lin, S. Shimotsuji, M. Minoh, T. Saki, “Efficient Diagram Understanding with Chraracteristic Pattern Detection,” Computer Vision, Graphics, and Image Processing, Vol. 30, 1985, pp. 84–106.Google Scholar
  24. [LiFu89]
    W. Lin and K.S. Fu, “A Syntactic Approach to Three-Dimensional Object Recognition,” IEEE Trans. Systems Man and Cybernetics, Vol. 16, No. 3, May 1986, pp. 405–422.Google Scholar
  25. [LBWR94]
    G. Lohse, K. Biolsi, N. Walker, H. Ruetter, “A Classification of Visual Representations,” Communications of the ACM, Vol. 37, No. 4, December 1994, pp. 36–49.Google Scholar
  26. [MuMH88]
    I. Mulder, A. Mackworth, W. Havens, “Knowledge Structuring and Constraint Satisfaction: The Mapsee Approach,” IEEE Pattern Analysis and Machine Intelligence, Vol. 10, No. 6, November 1988, pp. 866–879.Google Scholar
  27. [NoBu93]
    G. Novak and W. Bulko, “Diagrams and Text as Computer Input,” J. Visual Languages and Computing, Vol. 4, 1993, pp. 161–175.Google Scholar
  28. [OkMi91]
    M. Okamoto and B. Miao, “Recognition of Mathematical Expressions by Using the Layout Structure of Symbols,” in Proc. First Intl. Conference on Document Analysis and Recognition, Saint Malo, France, September 1991, pp. 242–250.Google Scholar
  29. [Past94]
    B. Pasternak, “Processing Imprecise and Structural Distorted Line Drawings by and Adaptable Drawing Interpretation Kernel,” Proc. IAPR Workshop on Document Analysis Systems, Kaiserslautern, Germany, Oct. 1994, pp. 349–363.Google Scholar
  30. [Read79]
    G. Read, Music Notation: A Manual of Modern Practice (Second Edition), Taplinger Publishing, New York, NY, 1979.Google Scholar
  31. [Rous88]
    D. Roush, “Music Formatting Guidelines,” Technical Report OSU-3/88-TR10, Department of Computer and Information Science, The Ohio State University, 1988.Google Scholar
  32. [Senn94]
    R. Sennhauser, “Integration of Contextual Knowledge Sources Into a Blackboard-based Text Recognition System,” IAPR Workshop on Document Analysis Systems, Kaiserslautern, Germany, Oct. 1994, pp. 211–228.Google Scholar
  33. [Slob81]
    J. Sloboda, “The Uses of Space in Music Notation,” Visual Language, Vol. XV, No. 1, pp. 86–112, 1981.Google Scholar
  34. [Srih93]
    S. Srihari, “From Pixels to Paragraphs: the Use of Contextual Models in Text Recognition,” Proc. Second Intl. Conf. Document Analysis and Recognition, Tsukuba, Japan, Oct. 1993, pp. 416–423.Google Scholar
  35. [Truv90]
    S. Truvé, “Image Interpretation Using Multi-Relational Grammars,” Proc. Third International Conference on Computer Vision, pp. 146–155, December 1990.Google Scholar
  36. [Tuft83]
    E. Tufte, The Visual Display of Quantitative Information, Graphics Press, 1983.Google Scholar
  37. [VaTo94]
    P. Vaxivière and K. Tombre, “Knowledge Organization and Interpretation Process in Engineering Drawing Interpretation,” Proc. IAPR Workshop on Document Analysis Systems, Kaiserslautern, Germany, Oct. 1994, pp. 313–321.Google Scholar
  38. [WaSr89]
    C. Wang and S. Srihari, “A Framework for Object Recognition in a Visually Complex Environment and its Application to Locating Address Blocks on Mail Pieces,” International Journal of Computer Vision, Vol. 2, 1989, pp. 125–151.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 1996

Authors and Affiliations

  • Dorothea Blostein
    • 1
  1. 1.Computing and Information ScienceQueen's UniversityKingstonCanada

Personalised recommendations