Abstract
Recent developments in image classification have focused on efficient preprocessing of visual data to improve the performances of neural networks and other learning algorithms when dealing with content-based classification tasks. Given the high dimensionality and redundancy of visual data, the primary goal of preprocessing is to transfer the original data to a low-dimensional representation that preserves the information relevant for the classification. This contribution reviews modern preprocessing (dimension-reduction) techniques and discusses their advantages and disadvantages. The performance of the techniques is assessed on a difficult painting-classification task that requires painter-specific features to be retained in the low-dimensional representation. Evaluation of the results shows that domain-specific knowledge provides a rough albeit indispensable guideline for determining the appropriate type of preprocessing. Furthermore, the evaluation shows that neural-network techniques are most suitable for executing and fine-tuning the preprocessing and subsequent classification. It is argued that further improvements can be gained by the use of a content-based attentional selection procedure. Our conclusion is that preprocessing should be tailored to the task at hand by combining domain knowledge with neural-network techniques, and that within fifty years the visual signature of painters is as recognizable as is any handwritten signature.
Keywords
- Image recognition
- neural networks
- visual art recognition
This is a preview of subscription content, access via your institution.
Buying options
Preview
Unable to display preview. Download preview PDF.
References
Barnsley, M.F. (1992). Fractals Everywhere. San Diego: Academic Press.
Beck, J. (1982). Textural segmentation. In J. Beck (Ed.), Organization and Representation in Perception (pp. 285–317). Hillsdale, NJ: Erlbaum.
Biederman, I., Rabinowitz, J.C., Glass, A.L., and Stacey, E.W., Jr. (1974). On the information extracted from a glance at a scene. Journal of Experimental Psychology, 103, 597–600.
Fourier, J. (1888). Theorie Analytique de la Chaleur. Gauthiers-Villars.
Freeman, W.T. and Adelson, E.H. (1991). The design and use of steerable filters. IEEE Transactions in Pattern Analysis and Machine Intelligence, 13, 891–906.
Funt, B.V. and Finlayson, G.D. (1995). Color constant color indexing. IEEE Transactions on Pattern Analysis and Machine Intelligence, 17, 522–528.
Gage, J. (1999). Colour and Meaning. Art, Science and Symbolism. London: Thames and Hudson.
Gonzalez, R.C. and Woods, R.E. (1993). Digital Image Processing. Reading, MA: Addison-Wesley Publishing Company.
Herik, H.J. van den (1995). How to model thoughts and actions. Nieuw Archiefvoor Wiskunde, Part IV, 13 (3), 363–380.
Hyvarinen, A. (1999). Fast and robust fixed-point algorithms for independent component analysis. IEEE Transactions on Neural Networks, 10, 626–634.
Hyvarinen, A. and Oja, E. (1999). Independent Component Analysis: A Tutorial. http://www.cis.hut.fi/projects/ica.
Jain, A.K. (1989). Fundamentals of Digital Image Processing. Prentice Hall.
Julesz, B. and Bergen, J.R. (1983). Textons, the fundamental elements in preattentive vision and perception of textures. The Bell Systems Technical Journal, 62, 1619–1645.
Koenderink, J.J. and Doom, A.J. Van (1988).The basic geometry of a vision system. In R. Trappl (Ed.), Cybernetics and Systems’88 (pp. 481–485). Dordrecht: Kluwer Academic Publishers.
Lu, N. (1997). Fractal Imaging. San Diego: Academic Press.
Mandelbrot, B.B. (1977). The Fractal Geometry of Nature. W.H. Freeman and Company.
Mel, B. (1997). SEEMORE: Combining color, shape, and texture histogramming in a neurally-inspired approach to visual object recognition. Neural Computation, 9, 111–804.
Parker, J.R. (1997). Algorithms for Image Processing and Computer Vision. New York: John Wiley & Sons, Inc.
Phillips, D. (1995). How do forgers deceive art critics? In R. Gregory, J. Harris, P. Heard, and D. Rose (Eds.), The Artful Eye (pp. 372–388). Oxford: Oxford University Press.
Pioch, N. (1996). The Webmuseum, Paris, http://sunsite.doc.ic.ac.uk/wm/
Postma, E.O., Herik, H.J. van den, and Hudson, P.T.W. (1997a). Image Recognition by Brains and Machines. In S. Amari and N. Kasabov (Eds), Brain-like Computing and Intelligent Information Systems (pp. 25–47). Singapore: Springer-Verlag.
Postma, E.O., Herik, HJ. van den, and Hudson, P.T.W. (1997b). SCAN: A scalable model of covert attention. Neural Networks, 10, 993–1015.
Postma, E.O, Herik, H.J. van den, and Hudson, P.T.W. (1998). Spatio-chromatic Features for Image Recognition. In H. Prade (Ed.), Proceedings of the European Conference on Artificial Intelligence, ECAIV8 (pp. 637–641). John Wiley & Sons, Chichester.
Rao, R.P.N, and Ballard, D.H. (1995). An active vision architecture based on iconic representations. Artificial Intelligence, 78, 461–505.
Reed, R.D. and Marks II, R.J. (1999). Neural Smithing. Supervised Learning in Feedforward Artificial Neural Networks. Cambridge, MA: MIT Press.
Rumelhart, D.E., Hinton, G.E., and Williams, R.J. (1986). Learning internal representations by error propagation. In D.E. Rumelhart and J.L. McClelland (Eds.), Parallel Distributed Processing: Explorations in the microstructure of cognition, vol. I: Foundations, (pp.318–362). Cambridge, MA: MIT Press.
Russ, J.C. (1990). Surface characterisation: Fractal Dimensions, Hurst Coefficients, and Frequency Transforms. Journal of Computer Assisted Microscopy, 2, 249–257.
Schiele, B. and Crowley, J.L. (1996). Object recognition using multidimensional receptive field histograms. In B. Buxton and R. Cipolla (Eds.) Proceedings of the ECCVV6, 610–619. Berlin: Springer-Verlag.
Schmid, C. and Mohr, R. (1997). Local gray value invariants for image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 19, 530–534.
Swain, M. and Ballard, D.H. (1991). Color indexing. International Journal of Computer Vision, 7, 11–32.
Taylor, R.P., Micolich, A.P., and Jonas, D. (1999). Fractal analysis of Pollock’s drip paintings. Nature, 399,422.
Treisman, A.M. (1982). Perceptual grouping and attention in visual search for features and objects. Journal of Experimental Psychology: Human Perception and Performance, 8 (2), 194–214.
Weiss, S. M. and Kulikowski, C. A. (1991). Computer systems that learn: classification and prediction methods from statistics, neural nets, machine learning and expert systems. San Mateo, CA: Morgan Kaufmann.
Wetering, E. van de (1997). Rembrandt: The painter at work. Amsterdam: Amsterdam University Press.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
van den Herik, H.J., Postma, E.O. (2000). Discovering the Visual Signature of Painters. In: Kasabov, N. (eds) Future Directions for Intelligent Systems and Information Sciences. Studies in Fuzziness and Soft Computing, vol 45. Physica, Heidelberg. https://doi.org/10.1007/978-3-7908-1856-7_7
Download citation
DOI: https://doi.org/10.1007/978-3-7908-1856-7_7
Publisher Name: Physica, Heidelberg
Print ISBN: 978-3-7908-2470-4
Online ISBN: 978-3-7908-1856-7
eBook Packages: Springer Book Archive