Summary
We review some applications of human-computer interaction that alleviate the complexity of visual recognition by partitioning it into human and machine tasks to exploit the differences between human and machine capabilities. Human involvement offers advantages, both in the design of automated pattern classification systems, and at the operational level of some image retrieval and classification tasks. Recent development of interactive systems has benefited from the convergence of computer vision and psychophysics in formulating visual tasks as computational processes. Computer-aided classifier design and exploratory data analysis are already well established in pattern recognition and machine learning, but interfaces and functionality are improving. On the operational side, earlier recognition systems made use of human talent only in preprocessing and in coping with rejects. Most current content-based image retrieval systems make use of relevance feedback without direct image interaction. In contrast, some visual object classification systems can exploit such interaction. They require, however, a domain-specific visible model that makes sense to both human and computer.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
L. von, M. Blum, J. Langford. Telling humans and computers apart automatically. Communications of ACM, 47(2), 57–60, 2004.
P. Aigrain, H. Zhang, D. Petkovic. Content-based representation and retrieval of visual media: a state of the art review. Multimedia Tools and Applications, 3, 179–202, 1996.
F.G. Ashby, N.A. Perrin. Toward a unified theory of similarity and recognition. Psychological Review, 95(1), 124–150, 1988.
F. Attneave. Some informational aspects of visual perception. Psychological Review, 61, 183–193, 1954.
G.H. Ball, D.J. Hall. Some implications of interactive graphic computer systems for data analysis and statistics. Technometrics, 12, 17–31, 1970.
J.C. Baird, E. Noma. Fundamentals of scaling and psychophysics. New York: John Wiley & Sons, 1978.
J. Bach, C. Fuller, A. Gupta et al. The Virage image search engine: an open framework for image management. Proc. SPIE Storage and Retrieval for Image and Video Databases IV, San Jose, CA, pages 76–87, 1996
I. Biederman. Recognition-by-components: a theory of human image understanding. Psychological Review, 94, 115–147, 1987.
I. Biederman, P.C. Gerhardstein. Recognizing depth-rotated objects: evidence and conditions for three-dimensional viewpoint invariance. Journal of Experimental Psychology: Human Perception and Performance, 19, 1162–1182, 1993.
I, Biederman, P.C. Gerhardstein. Viewpoint-dependent mechanisms in visual object recognition: reply to Tarr and Bülthoff (1995). Journal of Experimental Psychology: Human Perception and Performance, 21, 1506–1514, 1995.
C. Carson, S. Belongie, H. Greenspan, J. Malik. Blobworld: image segmentation using expection-maximization and its application to image querying. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(8), 1026–1038, 2002.
Y.T. Chien. Interactive Pattern Recognition. New York: Marcel Dekker, 1970.
C.K. Chow. On optimum recognition error and reject tradeoff. IEEE Transactions Information Theory 16, 41–46, 1970.
G. Ciocca, R. Schettini. Using a relevance feedback mechanism to improve content-based image retrieval. Proc. Visual’ 99: Information and Information Systems, pages 107–114, 1999.
A.L. Coates, H.S. Baird, R.J. Fateman. Pessimal print: a reverse Turing test. Proc. Int. Conf. on Document Analysis and Recognition, pages 1154–1159, 2001.
I.J. Cox, M.L. Miller, T.P. Minka, T.V. Papathomas, P.N. Yianilos. The Bayesian image retrieval system, PicHunter: theory, implementation, and psychophysical experiments. IEEE Transactions on Image Processing, 9(1), 20–37, 2000.
R.O. Duda, P.E. Hart, D.G. Stork. Pattern classification. New York: John Wiley & Sons, 2001.
M. Flickner, H. Sawhney, W. Niblack. Query by image and video content: the QBIC system. IEEE Computer, 28(9), 23–32, 1995.
J.H. Friedman. On bias, variance, 0/1-loss, and the curse-of-dimensionality. Data Mining and Knowledge Discorvery, 1(1), 55–77, 1997.
S. Geman, E. Bienenstock, R. Doursat. Neural networks and the bias/variance dilemma. Neural Networks, 4(1), 1–58, 1992.
T. Gevers, A.W.M. Smeulders. PicToSeek, combining color and shape invariant features for image retrieval. IEEE Transactions on Image Processing, 9(1), 102–118, 2000.
I. Haritaoglu. Scene text extraction and translation for handheld devices. Proc. IEEE conf. on Computer Vision and Pattern Recognition, 2, 408–413, 2001
T.K. Ho. Exploratory analysis of point proximity in subspaces. Proc. 16th Int. Conf. on Pattern Recognition, Quebec City, Canada, 2, pages 196–199, August 11–15, 2002.
T.K. Ho. Mirage: a tool for interactive pattern recognition from multimedia data. Proc. of Astronomical Data Analysis Software and Systems XII, Baltimore, MD, pages 339–342, October 13–16, 2002.
A. Hopgood. Artificial intelligence: hype or reality? IEEE Computer, 36(5), 24–28, 2003.
T.S. Huang, S. Mehrotra, K. Ramachandran. Multimedia analysis and retrieval system (MARS) project. Proc. 33rd Annual Clinic on Library Application of Data Processing Digital Image Access and Retrieval, 1996.
R. Jain. US NSF workshop on visual information management systems, 1992.
D. Marr. Vision: a computational investigation into the human presentation and processing of visual information. New York: W.H. Freeman, 1982.
R.J. Mericsko. Introduction of 27th AIPR workshop-advances in computer-assisted recognition. Proc. of SPIE 3584, Washington, DC, October 14, 1988.
G. Miller. The magical number seven plus or minus two; some limits on our capacity for processing information. Psychological Review, 63, 81–97, 1956.
F. Mosteller, J. Tukey. Data analysis and regression. Reading, MA: Addison-Wesley, 1977.
G. Nagy. Image database. Image and Vision Computing, 3, 111–117, 1986.
S.E. Palmer. Vision science, Photons to Phenomenology. Cambridge, MA: MIT Press, 1999.
A. Pentland, R.W. Picard, S. Sclaroff. Photobook: content-based manipulation of image database. International Journal of Computer Vision, 18(3), 233–254, 1996.
Y. Rui, T.S. Huang, M. Ortega, S. Mehrotra. Relevance feedback: a power tool for interactive content-based image retrieval. IEEE Transactions on Circuits and Systems for Video Technology, 8(5), 644–655, 1998.
Y. Rui, T.S. Huang, S.F. Chang. Image retrieval: current techniques, promising directions, and open issues. Journal of Visual Communication and Image Representation, 10, 39–62, 1999.
J.W. Sammon. Interactive pattern analysis and classification. IEEE Transactions on Computers, 19, 594–616, 1970.
W. Siedlecki, K. Siedlecka, J. Sklansky. An overview of mapping techniques for exploratory pattern analysis. Pattern Recognition, 21, 411–429, 1988.
A.W.M. Smeulders, M. Worring, S. Santini, A. Gupta, R. Jain. Content-based image retrieval at the end of the early years. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(12), 1349–1380, 2000.
M.J. Tarr, H.H. Bulthoff. Is human object recognition better described by geonstructural-descriptions or by multiple-views? comment on Biederman and Gerhardstein (1993). Journal of Experimental Psychology: Human Perception and Performance, 21, 1494–1505, 1995.
J. Tukey. Exploratory data analysis. Reading, MA: Addison-Wesley, 1977.
A.M. Turing. On computable numbers with an application to the entscheidungs problem. Proc. of the London Mathematical Society, 42, 230–265, 1936.
R. Willing. Airport anti-terror systems flub tests face-recognition technology fails to flag “ suspects”. USA Today, 3A, September 2, 2003. http://www.usatoday.com/usatonline/20030902/5460651s.htm.
D.H. Wolpert. The relationship between PAC, the statistical physics framework, the Bayesian framework, and the VC framework. In: D.H. Wolpert, eds. The Mathematics of Generalization. 117–214, Reading, MA: Addison-Wesley, 1995.
J. Yang, X. Chen, W. Kunz. A PDA-based face recognition system. Proc. the 6th IEEE Workshop on Applications of Computer Vision, 19–23, 2002.
J. Vesanto. SOM-based data visualization methods. J. Intelligent Data Analysis, 3, 111–126, 1999.
J. Zhang, X. Chen, J. Yang, A. Waibel. A PDA-based sign translator. Proc. the 4th IEEE Int. Conf. on Multimodal Interfaces, pages 217–222, 2002.
J. Zou. Computer assisted visual interactive Recognition. Ph.D. thesis, Rensselaer Polytechnic Institute, Troy, NY, 2004.
J. Zou, G. Nagy. Evaluation of model-based interactive flower recognition. Proc. of 17th Int. Conf. Pattern Recognition, Cambridge, UK, volume 2, pages 311–314, 2004.
J. Zou. A model-based interactive object segmentation procedure. Proc. of IEEE 2005 Workshop on Applications of Computer Vision, Breckenridge, CO, pages 522–527, 2005.
J. Zou, A. Gattani. Computer-assisted visual interactive recognition and its prospects of implementation over the Internet. IS&T/SPIE 17th Annual Symposium, Internet Imaging VI, Proceedings of the SPIE, San Jose, CA, volume 5670, pages 76–87, 2005.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer Verlag London Limited
About this chapter
Cite this chapter
Zou, J., Nagy, G. (2006). Human-Computer Interaction for Complex Pattern Recognition Problems. In: Basu, M., Ho, T.K. (eds) Data Complexity in Pattern Recognition. Advanced Information and Knowledge Processing. Springer, London. https://doi.org/10.1007/978-1-84628-172-3_14
Download citation
DOI: https://doi.org/10.1007/978-1-84628-172-3_14
Publisher Name: Springer, London
Print ISBN: 978-1-84628-171-6
Online ISBN: 978-1-84628-172-3
eBook Packages: Computer ScienceComputer Science (R0)