Skip to main content

Human-Computer Interaction for Complex Pattern Recognition Problems

  • Chapter
Data Complexity in Pattern Recognition

Part of the book series: Advanced Information and Knowledge Processing ((AI&KP))

Summary

We review some applications of human-computer interaction that alleviate the complexity of visual recognition by partitioning it into human and machine tasks to exploit the differences between human and machine capabilities. Human involvement offers advantages, both in the design of automated pattern classification systems, and at the operational level of some image retrieval and classification tasks. Recent development of interactive systems has benefited from the convergence of computer vision and psychophysics in formulating visual tasks as computational processes. Computer-aided classifier design and exploratory data analysis are already well established in pattern recognition and machine learning, but interfaces and functionality are improving. On the operational side, earlier recognition systems made use of human talent only in preprocessing and in coping with rejects. Most current content-based image retrieval systems make use of relevance feedback without direct image interaction. In contrast, some visual object classification systems can exploit such interaction. They require, however, a domain-specific visible model that makes sense to both human and computer.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. L. von, M. Blum, J. Langford. Telling humans and computers apart automatically. Communications of ACM, 47(2), 57–60, 2004.

    Article  Google Scholar 

  2. P. Aigrain, H. Zhang, D. Petkovic. Content-based representation and retrieval of visual media: a state of the art review. Multimedia Tools and Applications, 3, 179–202, 1996.

    Article  Google Scholar 

  3. F.G. Ashby, N.A. Perrin. Toward a unified theory of similarity and recognition. Psychological Review, 95(1), 124–150, 1988.

    Article  Google Scholar 

  4. F. Attneave. Some informational aspects of visual perception. Psychological Review, 61, 183–193, 1954.

    Article  Google Scholar 

  5. G.H. Ball, D.J. Hall. Some implications of interactive graphic computer systems for data analysis and statistics. Technometrics, 12, 17–31, 1970.

    Article  Google Scholar 

  6. J.C. Baird, E. Noma. Fundamentals of scaling and psychophysics. New York: John Wiley & Sons, 1978.

    Google Scholar 

  7. J. Bach, C. Fuller, A. Gupta et al. The Virage image search engine: an open framework for image management. Proc. SPIE Storage and Retrieval for Image and Video Databases IV, San Jose, CA, pages 76–87, 1996

    Google Scholar 

  8. I. Biederman. Recognition-by-components: a theory of human image understanding. Psychological Review, 94, 115–147, 1987.

    Article  Google Scholar 

  9. I. Biederman, P.C. Gerhardstein. Recognizing depth-rotated objects: evidence and conditions for three-dimensional viewpoint invariance. Journal of Experimental Psychology: Human Perception and Performance, 19, 1162–1182, 1993.

    Article  Google Scholar 

  10. I, Biederman, P.C. Gerhardstein. Viewpoint-dependent mechanisms in visual object recognition: reply to Tarr and Bülthoff (1995). Journal of Experimental Psychology: Human Perception and Performance, 21, 1506–1514, 1995.

    Article  Google Scholar 

  11. C. Carson, S. Belongie, H. Greenspan, J. Malik. Blobworld: image segmentation using expection-maximization and its application to image querying. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(8), 1026–1038, 2002.

    Article  Google Scholar 

  12. Y.T. Chien. Interactive Pattern Recognition. New York: Marcel Dekker, 1970.

    Google Scholar 

  13. C.K. Chow. On optimum recognition error and reject tradeoff. IEEE Transactions Information Theory 16, 41–46, 1970.

    Article  Google Scholar 

  14. G. Ciocca, R. Schettini. Using a relevance feedback mechanism to improve content-based image retrieval. Proc. Visual’ 99: Information and Information Systems, pages 107–114, 1999.

    Google Scholar 

  15. A.L. Coates, H.S. Baird, R.J. Fateman. Pessimal print: a reverse Turing test. Proc. Int. Conf. on Document Analysis and Recognition, pages 1154–1159, 2001.

    Google Scholar 

  16. I.J. Cox, M.L. Miller, T.P. Minka, T.V. Papathomas, P.N. Yianilos. The Bayesian image retrieval system, PicHunter: theory, implementation, and psychophysical experiments. IEEE Transactions on Image Processing, 9(1), 20–37, 2000.

    Article  Google Scholar 

  17. R.O. Duda, P.E. Hart, D.G. Stork. Pattern classification. New York: John Wiley & Sons, 2001.

    MATH  Google Scholar 

  18. M. Flickner, H. Sawhney, W. Niblack. Query by image and video content: the QBIC system. IEEE Computer, 28(9), 23–32, 1995.

    Google Scholar 

  19. J.H. Friedman. On bias, variance, 0/1-loss, and the curse-of-dimensionality. Data Mining and Knowledge Discorvery, 1(1), 55–77, 1997.

    Article  Google Scholar 

  20. S. Geman, E. Bienenstock, R. Doursat. Neural networks and the bias/variance dilemma. Neural Networks, 4(1), 1–58, 1992.

    Google Scholar 

  21. T. Gevers, A.W.M. Smeulders. PicToSeek, combining color and shape invariant features for image retrieval. IEEE Transactions on Image Processing, 9(1), 102–118, 2000.

    Article  Google Scholar 

  22. I. Haritaoglu. Scene text extraction and translation for handheld devices. Proc. IEEE conf. on Computer Vision and Pattern Recognition, 2, 408–413, 2001

    Google Scholar 

  23. T.K. Ho. Exploratory analysis of point proximity in subspaces. Proc. 16th Int. Conf. on Pattern Recognition, Quebec City, Canada, 2, pages 196–199, August 11–15, 2002.

    Google Scholar 

  24. T.K. Ho. Mirage: a tool for interactive pattern recognition from multimedia data. Proc. of Astronomical Data Analysis Software and Systems XII, Baltimore, MD, pages 339–342, October 13–16, 2002.

    Google Scholar 

  25. A. Hopgood. Artificial intelligence: hype or reality? IEEE Computer, 36(5), 24–28, 2003.

    Google Scholar 

  26. T.S. Huang, S. Mehrotra, K. Ramachandran. Multimedia analysis and retrieval system (MARS) project. Proc. 33rd Annual Clinic on Library Application of Data Processing Digital Image Access and Retrieval, 1996.

    Google Scholar 

  27. R. Jain. US NSF workshop on visual information management systems, 1992.

    Google Scholar 

  28. D. Marr. Vision: a computational investigation into the human presentation and processing of visual information. New York: W.H. Freeman, 1982.

    Google Scholar 

  29. R.J. Mericsko. Introduction of 27th AIPR workshop-advances in computer-assisted recognition. Proc. of SPIE 3584, Washington, DC, October 14, 1988.

    Google Scholar 

  30. G. Miller. The magical number seven plus or minus two; some limits on our capacity for processing information. Psychological Review, 63, 81–97, 1956.

    Article  Google Scholar 

  31. F. Mosteller, J. Tukey. Data analysis and regression. Reading, MA: Addison-Wesley, 1977.

    Google Scholar 

  32. G. Nagy. Image database. Image and Vision Computing, 3, 111–117, 1986.

    Article  Google Scholar 

  33. S.E. Palmer. Vision science, Photons to Phenomenology. Cambridge, MA: MIT Press, 1999.

    Google Scholar 

  34. A. Pentland, R.W. Picard, S. Sclaroff. Photobook: content-based manipulation of image database. International Journal of Computer Vision, 18(3), 233–254, 1996.

    Article  Google Scholar 

  35. Y. Rui, T.S. Huang, M. Ortega, S. Mehrotra. Relevance feedback: a power tool for interactive content-based image retrieval. IEEE Transactions on Circuits and Systems for Video Technology, 8(5), 644–655, 1998.

    Article  Google Scholar 

  36. Y. Rui, T.S. Huang, S.F. Chang. Image retrieval: current techniques, promising directions, and open issues. Journal of Visual Communication and Image Representation, 10, 39–62, 1999.

    Article  Google Scholar 

  37. J.W. Sammon. Interactive pattern analysis and classification. IEEE Transactions on Computers, 19, 594–616, 1970.

    Google Scholar 

  38. W. Siedlecki, K. Siedlecka, J. Sklansky. An overview of mapping techniques for exploratory pattern analysis. Pattern Recognition, 21, 411–429, 1988.

    Article  MathSciNet  Google Scholar 

  39. A.W.M. Smeulders, M. Worring, S. Santini, A. Gupta, R. Jain. Content-based image retrieval at the end of the early years. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(12), 1349–1380, 2000.

    Article  Google Scholar 

  40. M.J. Tarr, H.H. Bulthoff. Is human object recognition better described by geonstructural-descriptions or by multiple-views? comment on Biederman and Gerhardstein (1993). Journal of Experimental Psychology: Human Perception and Performance, 21, 1494–1505, 1995.

    Article  Google Scholar 

  41. J. Tukey. Exploratory data analysis. Reading, MA: Addison-Wesley, 1977.

    MATH  Google Scholar 

  42. A.M. Turing. On computable numbers with an application to the entscheidungs problem. Proc. of the London Mathematical Society, 42, 230–265, 1936.

    Google Scholar 

  43. R. Willing. Airport anti-terror systems flub tests face-recognition technology fails to flag “ suspects”. USA Today, 3A, September 2, 2003. http://www.usatoday.com/usatonline/20030902/5460651s.htm.

    Google Scholar 

  44. D.H. Wolpert. The relationship between PAC, the statistical physics framework, the Bayesian framework, and the VC framework. In: D.H. Wolpert, eds. The Mathematics of Generalization. 117–214, Reading, MA: Addison-Wesley, 1995.

    Google Scholar 

  45. J. Yang, X. Chen, W. Kunz. A PDA-based face recognition system. Proc. the 6th IEEE Workshop on Applications of Computer Vision, 19–23, 2002.

    Google Scholar 

  46. J. Vesanto. SOM-based data visualization methods. J. Intelligent Data Analysis, 3, 111–126, 1999.

    Article  Google Scholar 

  47. J. Zhang, X. Chen, J. Yang, A. Waibel. A PDA-based sign translator. Proc. the 4th IEEE Int. Conf. on Multimodal Interfaces, pages 217–222, 2002.

    Google Scholar 

  48. J. Zou. Computer assisted visual interactive Recognition. Ph.D. thesis, Rensselaer Polytechnic Institute, Troy, NY, 2004.

    Google Scholar 

  49. J. Zou, G. Nagy. Evaluation of model-based interactive flower recognition. Proc. of 17th Int. Conf. Pattern Recognition, Cambridge, UK, volume 2, pages 311–314, 2004.

    Google Scholar 

  50. J. Zou. A model-based interactive object segmentation procedure. Proc. of IEEE 2005 Workshop on Applications of Computer Vision, Breckenridge, CO, pages 522–527, 2005.

    Google Scholar 

  51. J. Zou, A. Gattani. Computer-assisted visual interactive recognition and its prospects of implementation over the Internet. IS&T/SPIE 17th Annual Symposium, Internet Imaging VI, Proceedings of the SPIE, San Jose, CA, volume 5670, pages 76–87, 2005.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer Verlag London Limited

About this chapter

Cite this chapter

Zou, J., Nagy, G. (2006). Human-Computer Interaction for Complex Pattern Recognition Problems. In: Basu, M., Ho, T.K. (eds) Data Complexity in Pattern Recognition. Advanced Information and Knowledge Processing. Springer, London. https://doi.org/10.1007/978-1-84628-172-3_14

Download citation

  • DOI: https://doi.org/10.1007/978-1-84628-172-3_14

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-84628-171-6

  • Online ISBN: 978-1-84628-172-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics