Searching in the “Real World”

(Abstract of Invited Plenary Talk)
  • Ophir Frieder
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5831)

Abstract

For many, "searching" is considered a mostly solved problem.  In fact, for text processing, this belief is factually based.  The problem is that most "real world" search applications involve "complex documents", and such applications are far from solved.  Complex documents, or less formally, "real world documents", comprise of a mixture of images, text, signatures, tables, logos, water-marks, stamps, etc, and are often available only in scanned hardcopy formats. Search systems for such document collections are currently unavailable.

We describe our efforts at building a complex document information processing (CDIP) prototype. This prototype integrates "point solution" (mature) technologies, such as OCR capability, signature matching and handwritten word spotting techniques, search and mining approaches, among others, to yield a system capable of searching "real world documents". The described prototype demonstrates the adage that "the whole is greater than the sum of its parts".

References

  1. 1.
    Frieder, O.: On Searching in the ‘Real World’. In: Argamon, S., Howard, N. (eds.) Computational Methods for Counterterrorism, ch.  1. Springer, Heidelberg (2009) ISBN: 978-3-642-01140-5Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Ophir Frieder
    • 1
  1. 1.Information Retrieval Laboratory, Department of Computer ScienceIllinois Institute of TechnologyUSA

Personalised recommendations