Abstract
This paper reviews our recent fMRI and psychophysical finding on: 1) perceived size representation in V1; 2) border ownership representation in V2; and 3) neural processing of partially occluded face. These findings demonstrate that the human early visual cortex not only performs local feature analyses, but also contributes significantly to high-level visual computation with assistance of attention-enabled cortical feedback. Moreover, by taking advantage of recent findings on early visual cortex from neuroscience and cognitive science, we build a biologically plausible attention model that can well predict human scanpaths on natural images.
This is a preview of subscription content, access via your institution.
References
Wandell B A, Dumoulin S O, Brewer A A. Visual field maps in human cortex. Neuron, 2007, 56(2): 366–383
Fang F, Boyaci H, Kersten D, Murray S O. Attention-dependent representation of a size illusion in human V1. Current Biology, 2008, 18(21): 1707–1712
Wojciulik E, Kanwisher N, Driver J. Covert visual attention modulates face-specific activity in the human fusiform gyrus: fMRI study. Journal of Neurophysiology, 1998, 79(3): 1574–1578
Murray S O, He S. Contrast invariance in the human lateral occipital complex depends on attention. Current Biology, 2006, 16(6): 606–611
Nakayama K, Shimojo S, Silverman G H. Stereoscopic depth: Its relation to image segmentation, grouping, and the recognition of occluded objects. Perception, 1989, 18(1): 55–68
Driver J, Baylis G C. Edge-assignment and figure-ground segmentation in short-term visual matching. Cognitive Psychology, 1996, 31(3): 248–306
Lee T S, Mumford D, Romero R, Lamme V A F. The role of the primary visual cortex in higher level vision. Vision Research, 1998, 38(15–16): 2429–2454
Zhou H, Friedman H S, von der Heydt R. Coding of border ownership in monkey visual cortex. Journal of Neuroscience, 2000, 20(17): 6594–6611
Qiu F T, von der Heydt R. Figure and ground in the visual cortex: V2 combines stereoscopic cues with Gestalt rules. Neuron, 2005, 47(1): 155–166
Kourtzi Z, Kanwisher N. Representation of perceived object shape by the human lateral occipital complex. Science, 2001, 293(5534): 1506–1509
Andrews T J, Schluppeck D, Homfray D, Matthews P, Blakemore C. Activity in the fusiform gyrus predicts conscious perception of Rubin’s vase-face illusion. NeuroImage, 2002, 17(2): 890–901
Fang F, Murray S O, Kersten D J, He S. Orientation-tuned fMRI adaptation in human visual cortex. Journal of Neurophysiology, 2005, 94(6): 4188–4195
Fang F, Boyaci H, Kersten D. Border ownership selectivity in human early visual cortex and its modulation by attention. Journal of Neuroscience, 2009, 29(2): 460–465
Albright T D, Stoner G R. Contextual influences on visual processing. Annual Review of Neuroscience, 2002, 25(1): 339–379
Grill-Spector K, Malach R. The human visual cortex. Annual Review of Neuroscience, 2004, 27(1): 649–677
Kanwisher N. Functional specificity in the human brain: A window into the functional architecture of the mind. Proceedings of the National Academy of Sciences of the United States of America, 2010, 107(25): 11163–11170
Stanley D A, Rubin N. fMRI activation in response to illusory contours and salient regions in the human lateral occipital complex. Neuron, 2003, 37(2): 323–331
Hegde J, Fang F, Murray S O, Kersten D. Preferential responses to occluded objects in the human visual cortex. Journal of Vision, 2008, 8(4): 16-1–16-16
Grutzner C, Uhlhaas P J, Genc E, Kohler A, Singer W, Wibral M. Neuroelectromagnetic correlates of perceptual closure processes. Journal of Neuroscience, 2010, 30(24): 8342–8352
Sugita Y. Grouping of image fragments in primary visual cortex. Nature, 1999, 401(6750): 269–272
Pillow J, Rubin N. Perceptual completion across the vertical meridian and the role of early visual cortex. Neuron, 2002, 33(5): 805–813
Bakin J S, Nakayama K, Gilbert C D. Visual responses in monkey areas V1 and V2 to three-dimensional surface configurations. Journal of Neuroscience, 2000, 20(21): 8188–8198
Lerner Y, Hendler T, Malach R. Object-completion effects in the human lateral occipital complex. Cerebral Cortex, 2002, 12(2): 163–177
Murray R F, Sekuler A B, Bennett P J. Time course of amodal completion revealed by a shape discrimination task. Psychonomic Bulletin & Review, 2001, 8(4): 713–720
Chen J, Liu B, Chen B, Fang F. Time course of amodal completion in face perception. Vision Research, 2009, 49(7): 752–758
Bar M, Tootell R B, Schacter D, Greve D, Fischl B, Mendola J, Rosen B, Dale A. Cortical mechanisms specific to explicit visual object recognition. Neuron, 2001, 29(2): 529–535
Lamme V A F, Zipser K, Spekreijse H. Masking interrupts figure-ground signals in V1. Journal of Cognitive Neuroscience, 2002, 14(7): 1044–1053
Chen J, Zhou T, Yang H, Fang F. Cortical dynamics underlying face completion in human visual system. Journal of Neuroscience, 2010, 30(49): 16692–16698
Fang F, He S. Viewer-centered object representation in the human visual system revealed by viewpoint aftereffect. Neuron, 2005, 45(5): 793–800
Wang W, Chen C, Wang Y, Jiang T, Fang F, Yao Y. Simulating human saccadic scanpath on natural images. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. 2011, 441–448
Olshausen B, Field D. Emergence of simple-cell receptive field properties by learning a sparse code for natural images. Nature, 1996, 381(6583): 607–609
Wang W, Wang Y, Huang Q, Gao W. Measuring visual saliency by site entropy rate. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. 2010, 2368–2375
Itti L, Koch C, Niebur E. A model of saliency based visual attention for rapid scene analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1998, 20(11): 1254–1259
Yao J G, Gao X, Yan H M, Li C Y. Field of attention for instantaneous object recognition. PLoS ONE, 2011, 6(1): e16343
Hou X, Zhang L. Dynamic visual attention: Searching for coding length increments. Advances in Neural Information Processing Systems, 2008, 21: 681–688
Sauer T, Yorke J, Casdagli M. Embedology. Journal of Statistical Physics, 1991, 65(3–4): 579–616
Bruce N, Tsotsos J. Saliency based on information maximization. Advances in Neural Information Processing Systems, 2006, 18: 155–162
Author information
Authors and Affiliations
Corresponding authors
Additional information
Fang FANG is a professor in the Department of Psychology at Peking University, Beijing, China. He earned his B.S. degree in psychology and M.E. degree in signal and information processing at Peking University. From 2001 to 2007, he did his Ph.D. and postdoctoral training in cognitive and biological psychology at the University of Minnesota at Twin Cities. He then moved back to Peking University and started his own lab as principle investigator. His research has been dedicated to human visual perception, attention and awareness, using functional brain imaging, psychophysics and computational modeling. He has authored or co-authored about 40 research papers published in prestigious journals, including Nature Neuroscience, Neuron, Current Biology, Proceedings of the National Academy of Sciences of the United States of America (PNAS), Journal of Neuroscience. He received the National Distinguished Young Scientist Award from the National Natural Science Foundation of China in 2009 and the National Award for Youth in Science and Technology from the China Association for Science and Technology in 2011. He serves as editorial board member of Experimental Brain Research and Frontiers in Perception Science.
Yizhou WANG is a professor of Department of Computer Science at Peking University, Beijing, China. He is a vice director of Institute of Digital Media at Peking University, and the director of New Media Lab of National Engineering Lab for Video Technology. He received his Bachelor’s degree in electrical engineering from Tsinghua University, Beijing, China, in 1996, and his Ph.D. degree in computer science from University of California at Los Angeles (UCLA) in 2005. He worked at Xerox Palo Alto Research Center (Xerox PARC) as a research scientist from 2005 to 2007. Dr. Wang’s research interests include computer vision, statistical modeling and learning, and digital visual arts.
About this article
Cite this article
Fang, F., Wang, Y. Image understanding, attention and human early visual cortex. Front. Electr. Electron. Eng. 7, 85–93 (2012). https://doi.org/10.1007/s11460-012-0184-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11460-012-0184-0