A computational feature binding model of human texture perception
Abstract
We present a computational model for human texture perception which assigns functional principles to the Gestalt laws of similarity and proximity. Motivated by early vision mechanisms, in the first stage, local texture features are extracted by utilizing multi-scale filtering and nonlinear spatial pooling. In the second stage, features are grouped according to the spatial feature binding model of the competitive layer model (CLM; Wersing et al. 2001). The CLM uses cooperative and competitive interactions in a recurrent network, where binding is expressed by the layer-wise coactivation of feature-representing neurons. The Gestalt law of similarity is expressed by a non-Euclidean distance measure in the abstract feature space with proximity being taken into account by a spatial component. To choose the stimulus dimensions which allow the most salient similarity-based texture segmentation, the feature similarity metrics is reduced to the directions of maximum variance. We show that our combined texture feature extraction and binding model performs segmentation in strong conformity with human perception. The examples range from classical microtextures and Brodatz textures to other classical Gestalt stimuli, which offer a new perspective on the role of texture for more abstract similarity grouping.
Keywords
Perceptual grouping Texture perception Segmentation PsychophysicsReferences
- Albrecht DG, Hamilton DB (1982) Striate cortex of monkey and cat: contrast response function. J Neurophysiol 48(1):217–237PubMedGoogle Scholar
- Bergen JR, Adelson EH (1988) Early vision and texture perception. Nature 333:363–364CrossRefPubMedGoogle Scholar
- Besag J, Green P, Higdon D, Mengersen K (1995) Bayesian computation and stochastic systems. Stat Sci 1:3–66Google Scholar
- Bovik AC, Clark M, Geisler WS (1990) Multichannel texture analysis using localized spatial filters. IEEE Trans Pattern Anal Mach Intel 12(1):55–73CrossRefGoogle Scholar
- Brodatz P (1966) Texture: a photographic album for artists and designers. Dover, New YorkGoogle Scholar
- Cesmeli E Wang DL (2001) Texture segmentation using Gaussian–Markov random fields and neural oscillator networks. IEEE Trans Neural Netw 12:394–404CrossRefGoogle Scholar
- Chubb C Landy MS (1991) Orthogonal distribution analysis: a new approach to the study of texture perception. In: Landy MS, Movshon JA (eds) Computational models of visual processing. MIT Press, Boston, pp 291–301Google Scholar
- Daugman JG (1985) Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters. J Optic Soc Am A 2(7):1160–1169Google Scholar
- Daugman JG (1988) Complete discrete 2D Gabor transforms by neural networks for image analysis and compression. IEEE Trans Acoust Speech Signal Process 36(7):1169–1179CrossRefGoogle Scholar
- Dunn D, Higgins WE, Wakeley J (1994) Texture segmentation using 2D Gabor elementary functions. IEEE Trans Pattern Anal Mach Intel 16(2):130–149CrossRefGoogle Scholar
- Feng J (1997) Lyapunov functions for neural nets with nondifferentiable input–output characteristics. Neural Comput 9:43–49PubMedGoogle Scholar
- Fogel I, Sagi D (1989) Gabor filters as texture discriminator. Biol Cyber 61:103–113Google Scholar
- Giese MA (1999) Dynamic neural field theory for motion perception. Kluwer, DordrechtGoogle Scholar
- Graham N, Beck J, Sutter A (1992) Nonlinear processes in spatial-frequency channel models of perceived texture segregation: effects of sign and amount of contrast. Vision Res 32(4):719–743CrossRefPubMedGoogle Scholar
- Grossberg S, Williamson JR (1999) A self organizing neural system for learning to recognize textured scenes. Vision Res 39(7):1385–1406CrossRefPubMedGoogle Scholar
- Hahnloser RHR, Sarpeshkar R, Mahowald MA, Douglas RJ, Seung HS (2000) Digital selection and analogue amplification coexist in a cortex-inspired silicon circuit. Nature 405:947–951CrossRefPubMedGoogle Scholar
- Hancock PJB, Baddeley RJ, Smith LS (1992) The principal components of natural images. Network 3:61–70CrossRefGoogle Scholar
- Hofmann T, Puzicha J, Buhmann J (1996) A deterministic annealing framework for unsupervised texture segmentation. Tech Rep IAI-TR-96. University of BonnGoogle Scholar
- Hofmann T, Puzicha J, Buhmann JM (1998) Unsupervised texture segmentation in a deterministic annealing framework. IEEE Trans Pattern Anal Mach Intel 20(8):803–818CrossRefGoogle Scholar
- Hupe JM, James AC, Girard P, Bullier J (2001) Response modulations by static textures surround in area V1 of the macaque monkey do not depend on feedback connections from V2. J Neurophysiol 85(1):146–163PubMedGoogle Scholar
- Jähne B (1993) Digitale Bildverarbeitung. Springer, Berlin Heidelberg New YorkGoogle Scholar
- Jain AK, Farrokhina F (1991) Unsupervised texture segmentation using Gabor filters. Pattern Recog 24(12):1167–1186CrossRefGoogle Scholar
- Jones J, Palmer L (1987) An evaluation of the two-dimensional Gabor filter model of simple receptive fields in cat striate cortex. J Neurophysiol 58:1233–1258PubMedGoogle Scholar
- Julesz B (1981) Textons, the elements of texture perception and their interaction. Nature 290:91–97PubMedGoogle Scholar
- Kreiter AK, Singer W (1996) Stimulus-dependent synchronization of neural responses in the visual cortex of the awake macaque monkey. J Neurosci 16:2381–2396PubMedGoogle Scholar
- Kröse BJ (1987) Local structure analyzers as determinants of preattentive pattern discrimination. Biol Cybernetics 55:289–298Google Scholar
- Lamme VAF, Spekreijse H (1998) Neuronal synchrony does not represent texture segregation. Nature 396:362–366PubMedGoogle Scholar
- Landy MS, Bergen JR (1991) Texture segregation and orientation gradient. Vision Res 31(4):679–691CrossRefPubMedGoogle Scholar
- Lee TS (1996) Image representation using 2D Gabor wavelets. IEEE Trans Pattern Anal Mach Intel 18(10):959–971CrossRefGoogle Scholar
- Ma WY, Manjunath BS (1996) Texture features and learning similarity. In: Proc IEEE Int Conf on Computer Vision and Pattern Recognition (CVPR), San Francisco, CaliforniaGoogle Scholar
- Malik J, Perona P (1990) Preattentive texture discrimination with early vision mechanisms. J Optic Soc Am A 7(5)Google Scholar
- Malsburg von der C, Buhmann J (1992) Sensory segmentation with coupled oscillators. Biol Cybernetics 54:29–40Google Scholar
- Manjunath BS, Chellappa R (1993) A unified approach to boundary perception: edges, textures, and illusory contours. IEEE Trans Neural Netw 4(1):96–107CrossRefGoogle Scholar
- Manjunath BS, Ma WY (1996) Texture features for browsing and retrieval of image data. IEEE Trans Pattern Anal Mach Intel 18(8):837–842CrossRefGoogle Scholar
- Nattkemper TW, Wersing H, Schubert W, Ritter H (2000) Fluorescence micrograph segmentation by gestalt-based feature binding. In: Proc IEEE Int Joint Conf on Neural Networks (IJCNN), Como, Italy, pp 248–254Google Scholar
- Nothdurft HC (1985) Sensitivity for structure gradient in texture discrimination tasks. Vision Res 25(12):1957–1968CrossRefPubMedGoogle Scholar
- Nothdurft HC (1991) Different effects from spatial frequency masking in texture segregation and texton detection tasks. Vision Res 31:299–320CrossRefPubMedGoogle Scholar
- Nothdurft HC, Gallant JL, Van Essen DC (2000) Response profiles to texture border patterns in area V1. J Neurophysiol 16(3):421–436Google Scholar
- Pichler O, Teuner A, Hosticka BJ (1996) A comparison of texture feature extraction using adaptive Gabor filtering, pyramidal and tree structured transforms. Pattern Recognition 29(5):733–742CrossRefGoogle Scholar
- Randen T, Husøy JH (1999) Filtering for texture classification: a comparative study. IEEE Trans Pattern Anal Mach Intel 21(4):291–310CrossRefGoogle Scholar
- Rentschler I, Hubner M, Caelli T (1988) On the discrimination of compound Gabor signals and textures. Vision Res 28:279–291CrossRefPubMedGoogle Scholar
- Ritter H (1990) A spatial approach to feature linking. In: Proc Int Neural Network Conf, Paris, vol 2, pp 898–901Google Scholar
- Robert A (1997) From contour completion to image schemas: a modern perspective on Gestalt psychology. Tech Rep. Department of Cognitive Science, University of California, San DiegoGoogle Scholar
- Roelfsema PR, Lamme VAF, Spekreisje H, Bosch H (2002) Figure-ground segregation in a recurrent network architecture. J Cognitive Neurosci 14:525–537CrossRefGoogle Scholar
- Rosenblatt F (1962) Principles of neurodynamics: perceptions and the theory of brain mechanics. Spartan Books, Washington, DCGoogle Scholar
- Sakai K, Tanaka S (2000) Spatial pooling in the second order spatial structure of cortical complex cells. Vision Res 40:855–871CrossRefPubMedGoogle Scholar
- Schillen TB, Knig P (1994) Binding by temporal structure in multiple feature domains of an oscillatory network. Biol Cybernetics 70:397–405CrossRefGoogle Scholar
- Singer W (1999) Neuronal synchrony: a versatile code for the definition of relations? Neuron 24:49–65PubMedGoogle Scholar
- Super H, van der Togt C, Spekreijse H, Lamme VAF (2003) Internal state of monkey primary visual cortex(V1) predicts figure-ground perception. J Neurosci 23(8):3407–3414PubMedGoogle Scholar
- Sutter A, Sperling G, Chubb C (1995) Measuring the spatial frequency selectivity of second order texture mechanisms. Vision Res 35(7):915–924CrossRefPubMedGoogle Scholar
- Terman D, Wang DL (1995) Global competition and local cooperation in a network of neural oscillators. Physica D 81:148–176CrossRefGoogle Scholar
- Treisman A, Schmidt H (1982) Illusory conjunctions in the perception of objects. Cognitive Psychol 14:107–141Google Scholar
- Turner MR (1986) Texture discrimination by Gabor functions. Biol Cybernetics 55:71–82Google Scholar
- Usrey WM, Reid RC (1999) Synchronous activity in the visual system. Annu Rev Neurosci 61:194–214Google Scholar
- De Valois RL, De Valois KK (1988) Spatial vision. Oxford University Press, New YorkGoogle Scholar
- De Valois RL, Yund EW, Hepler N (1982a) The orientation and direction selectivity of cells in macaque visual cortex. Vision Res 22:531–544PubMedGoogle Scholar
- De Valois RL, Albrecht DG, Thorell LG (1982b) Spatial frequency selectivity of cells in macaque visual cortex. Vision Res 22:545–559PubMedGoogle Scholar
- Vernon D (1991) Machine vision. Prentice Hall, New YorkGoogle Scholar
- Watson AB (1987) Efficiency of a model human image code. J Optic Soc Am A 4(12):2401–2417Google Scholar
- Wersing H (2000) Spatial feature binding and learning in competitive neural layer architectures. Cuvillier, GöttingenGoogle Scholar
- Wersing H, Steil JJ, Ritter H (2001) A competitive layer model for feature binding and sensory segmentation. Neural Comput 13(2):357–387CrossRefPubMedGoogle Scholar
- Wolfe JM, Cave KR (1999) The psychophysical evidence for a binding problem in human vision. Neuron 24:11–17PubMedGoogle Scholar
- Wolfson SS, Landy MS (1998) Examining edge- and region-based texture analysis mechanisms. Vision Res 38(3):439–446CrossRefPubMedGoogle Scholar