Abstract
Many have argued that early visual processing is encapsulated from the influence of higher-level goals, expectations, and knowledge of the world. (Early vision is thought to result in perception of three-dimensional shapes and surfaces, prior to object recognition and categorization.) Here we confront the main arguments offered in support of such a view, showing that they are unpersuasive. We also present evidence of top–down influences on early vision, emphasizing data from cognitive neuroscience. Our conclusion is that encapsulation is not a defining feature of visual processing. But we take this conclusion to be quite modest in scope, readily incorporated into mainstream vision science.
Similar content being viewed by others
Notes
See Masrour et al. (2015) on the methodological limitations of behavioral evidence for resolving the encapsulation debate.
Firestone and Scholl (forthcoming) identify six pitfalls that befall studies purporting to show top-down effects. We think that each of the six confounds is a special case of the three discussed by Pylyshyn (1999).
We should note that there are many others besides Fodor and Pylyshyn who defend the encapsulation of vision, at least in some form. Deroy (2013), for example, argues that the kinds of color-discrimination effects discussed by Macpherson (2012) (that is, cases where objects with characteristic colors are perceived differently from color-neutral objects) can be explained with a more enriched model of perceptual processing. But Deroy’s focus is on just this one strand of evidence, which we ourselves don’t rely on here. More ambitiously, Firestone and Scholl (forthcoming) provide a wide-ranging critique of claims of top–down penetration of vision. But as we have already noted, most of the claims they discuss rely on purely behavioral studies, and they pay scant attention to findings from cognitive neuroscience. Finally, Raftopoulos (2009) draws on an extensive body of empirical research to defend and elaborate the Fodor–Pylyshyn view. We do not have space in this paper for an adequate critique of his account, which would require a paper on its own.
Strikingly, visual illusions like the Müller-Lyer illusion are present in children functionally-blind from birth (who were previously only capable of perceiving gross motion effects such as a hand waved close in front of the face) as soon as they are enabled to see for the first time following cataract surgery and intraocular lens implant (Ghandi et al. 2015).
Recall that Pylyshyn (1999) explicitly allows that intra-modular top–down effects fail to qualify as forms of cognitive penetration. So even staunch advocates of encapsulation allow that perceptual processing involves an interaction between incoming signals and stored information.
A tuning-curve represents a neuron’s response profile to changes in a stimulus along a particular dimension. (Roughly, it tells us what, along that dimension, the neuron represents.) A flat tuning-curve in response to some change in the stimulus indicates that the neuron is not displaying sensitivity to the manipulated stimulus property.
Note that it is quite possible that the macaques acquired some perceptual expertise over the course of the training periods. Perceptual learning is thought to be a long-term change to how visual information is processed. But recall that the effect here is context sensitive. Any expertise gained from, for example, performing the side-flank task must be flexibly deployed in accordance with the current goal. So even if perceptual learning occurs in this experiment, there must be some sort of contentful relation between the high-level goal and visual processing.
How is it possible for goals to modulate visual processing in this way? Li et al. (2004) explain that in addition to multiple feed-forward and feed-back connections between V1 neurons and higher areas, there are also strong lateral connections among neurons in V1 itself. These can have either excitatory or inhibitory roles, making the activity of one neuron partly dependent upon the activities of many of its neighbors. So a plausible explanation is that a change in task modulates how a given V1 neuron is influenced by some of these neighbors rather than others. That is, the effect of the side-flank task on a V1 neuron amounts to telling it something like, “be more influenced in your response by these nearby neurons [which code for the distances between the center line and the side-flanks] than by those ones [which code for the alignment of the center line with the end-flanks].”
Post-experiment interviews revealed that 80 % of participants suspected no relationship between the auditory cue and orientation of motion. Of the remaining 20 %, one participant was aware of the true significance of the cues, one was aware of a relationship, but had their predictive character reversed; and the remaining three participants suspected a relationship between just one of the auditory cues and presented orientation.
This kind of implicit biasing of visual processing doesn’t involve conscious beliefs or desires penetrating vision, of course. As a result, many supporters of visual modularity may not regard such cases as particularly interesting counterexamples. Given that much of cognition operates at an implicit level, however, we fail to see why this should make the effects any less important.
Continuous flash suppression is a form of binocular rivalry, in which different stimuli are presented to each eye simultaneously. But in continuous flash suppression the stimuli presented to one eye consist of high-contrast dynamically changing Mondrian-like colored patterns. These tend to dominate conscious experience, with the stimulus presented to the other eye taking considerable time to become visible.
References
Albers, A., P. Kok, I. Toni, C. Dijkerman, and F. de Lange. 2013. Shared representations for working memory and mental imagery in early visual cortex. Current Biology 23: 1427–1431.
Anderson, M.L. 2010. Neural reuse: A fundamental organizational principle of the brain. Behavioral and Brain Sciences 33: 245–266.
Balcetis, E., and R. Dale. 2007. Conceptual set as a top-down constraint on visual object identification. Perception 36: 581–595.
Bar, M., K. Kassam, A. Ghuman, J. Boshyan, A. Schmid, A. Dale, M. Hämäläinen, K. Marinkovic, D. Schacter, B. Rosen, and E. Halgren. 2006. Top-down facilitation of visual recognition. Proceedings of the National Academy of Sciences 103: 449–454.
Barrett, H., and R. Kurzban. 2006. Modularity in cognition. Psychological Review 113: 628–647.
Boutonnet, B., and Lupyan, G. 2015. Words jump-start vision: A label advantage in object recognition. Journal of Neuroscience.
Brewer, W.F., and L. Loschky. 2005. Top-down and bottom-up influences on observation: Evidence from cognitive psychology and the history of science. In Cognitive penetrability of perception, ed. A. Raftopoulos, 31–47. New York: Nova Science.
Bridge, H., S. Harrold, E. Holmes, M. Stokes, and C. Kennard. 2012. Vivid visual mental imagery in the absence of the primary visual cortex. Journal of Neurology 259: 1062–1070.
Brogaard, B., K. Marlow, and K. Rice. 2014. The long-term potentiation model for grapheme-color binding in synesthesia. In Sensory integration and the unity of consciousness, ed. D. Bennett and C. Hill, 37–72. Cambridge, MA: MIT Press.
Carruthers, P. 2006. The architecture of the mind: Massive modularity and the flexibility of thought. Oxford: Oxford University Press.
Chaumon, M., K. Kveraga, L. Feldman Barrett, and M. Bar. 2014. Visual predictions in the orbitofrontal cortex rely on associative content. Cerebral Cortex 24: 2899–2907.
Churchland, P.M. 1988. Perceptual plasticity and theoretical neutrality: A reply to Jerry Fodor. Philosophy of Science 55: 167–187.
Churchland, P.S., V. Ramachandran, and T. Sejnowski. 1994. A critique of pure vision. In Large-scale neuronal theories of the brain, ed. T. Sejnowski, C. Koch, and J. Davis, 23–60. Cambridge: Bradford.
Clark, A. 2013. Whatever next? Predictive brains, situated agents, and the future of cognitive science. Behavioral and Brain Sciences 36: 181–204.
Costello, P., Y. Jiang, B. Baartman, K. McGlennen, and S. He. 2009. Semantic and subword priming during binocular suppression. Consciousness and Cognition 18: 375–382.
Craver-Lemley, C., and A. Reeves. 1992. How visual imagery interferes with vision. Psychological Review 99: 633–649.
Davis, T., and R. Poldrack. 2013. Measuring neural representations with fMRI: Practices and pitfalls. Annals of the New York Academy of Sciences 1296(1): 108–134.
Dehaene, S. 2014. Consciousness and the brain. New York: Viking.
Deroy, O. 2013. Object-sensitivity versus cognitive penetrability of perception. Philosophical Studies 162: 87–107.
Felleman, D., and D. Van Essen. 1991. Distributed hierarchical processing in the primate cerebral cortex. Cerebral Cortex 1: 1–47.
Firestone, C., and B. Scholl. 2014. “Top-down” effects where none should be found: The El Greco fallacy in perception research. Psychological Science 25: 38–46.
Firestone, C., and B.J. Scholl. 2015. Can you experience “top-down” effects on perception?: The case of race categories and perceived lightness. Psychonomic Bulletin & Review 22: 694–700.
Firestone, C. and Scholl, B. forthcoming. Cognition does not affect perception: Evaluating the evidence for “top–down” effects. Behavioral and Brain Sciences.
Fodor, J. 1983. The modularity of mind. Cambridge: MIT Press.
Ghandi, T., A. Kalia, S. Ganesh, and P. Sinha. 2015. Immediate susceptibility to visual illusions after sight onset. Current Biology 25: R359.
Ghose, G., and D. Bearl. 2010. Attention directed by expectations enhances receptive fields in area MT. Vision Research 50: 441–451.
Gilbert, D., and W. Li. 2013. Top–down influences on visual processing. Nature Reviews Neuroscience 14: 350–363.
Goldenberg, G., W. Müllbacher, and A. Nowak. 1995. Imagery without perception: A case study of anosognosia for cortical blindness. Neuropsychologia 33: 1373–1382.
Grill-Spector, K., and N. Kanwisher. 2005. As soon as you know it is there, you know what it is. Psychological Science 16: 152–160.
Gutteling, T., N. Petridou, S. Dumoulin, B. Harvey, E. Aarnoutse, J. Kenemans, and S. Neggers. 2015. Action preparation shapes processing in early visual cortex. Journal of Neuroscience 35: 6472–6480.
Hanson, N. 1965. Patterns of discovery. Cambridge: Cambridge University Press.
Haynes, J.D. 2015. A primer on pattern-based approaches to fMRI: Principles, pitfalls, and perspectives. Neuron 87: 257–270.
Hegde, J., and D. Kersten. 2010. A link between visual disambiguation and visual memory. Journal of Neuroscience 30: 15124–15133.
Hohwy, J. 2014. The predictive mind. Oxford: Oxford University Press.
Hsieh, P., E. Vul, and N. Kanwisher. 2010. Recognition alters the spatial pattern of fMRI activation in early retinotopic cortex. Journal of Neurophysiology 103: 1501–1507.
Jeannerod, M. 2006. Motor cognition. Oxford: Oxford University Press.
Kok, P., J. Jehee, and F. de Lange. 2012. Less is more: Expectation sharpens representations in the primary visual cortex. Neuron 75: 265–270.
Kok, P., G. Brouwer, M. van Gerven, and F. de Lange. 2013. Prior expectations bias sensory representations in visual cortex. Journal of Neuroscience 33: 16275–16284.
Kok, P., M. Failing, and F. de Lange. 2014. Prior expectations evoke stimulus templates in the primary visual cortex. Journal of Cognitive Neuroscience 26: 1546–1554.
Kosslyn, S. 1994. Image and brain. Cambridge: MIT Press.
Kosslyn, S., W. Thompson, and G. Ganis. 2006. The case for mental imagery. Oxford: Oxford University Press.
Krieman, G., I. Fried, and C. Koch. 2002. Single-neuron correlates of subjective vision in the human medial temporal lobe. Proceedings of the National Academy of Sciences 99: 8378–8383.
Kveraga, K., J. Boshyan, and M. Bar. 2007. Magnocellular projections as the trigger of top-down facilitation in recognition. Journal of Neuroscience 27: 13232–13240.
Li, W., V. Piëch, and C. Gilbert. 2004. Perceptual learning and top-down influences in primary visual cortex. Nature Neuroscience 7: 651–657.
Lin, Z., and S. He. 2009. Seeing the invisible: The scope and limits of unconscious processing in binocular rivalry. Progress in Neurobiology 87: 195–211.
Lupyan, G., and E. Ward. 2013. Language can boost otherwise unseen objects into visual awareness. Proceedings of the National Academy of Sciences 110: 14196–14201.
Lyons, J. 2011. Circularity, reliability, and the cognitive penetrability of perception. Philosophical Issues 21: 289–311.
Mack, M., and T. Palmeri. 2010. Decoupling object detection and categorization. Journal of Experimental Psychology: Human Perception and Performance 36: 1067–1079.
Macpherson, F. 2012. Cognitive penetration of color experience: Rethinking the issue in light of an indirect mechanism. Philosophy and Phenomenological Research 84: 24–62.
Masrour, F., G. Nirshberg, M. Schon, J. Leardi, and E. Barrett. 2015. Revisiting the empirical case against perceptual modularity. Frontiers in Psychology 6, doi: 10.3389/fpsyg.2015.01676.
McManus, J., W. Li, and C. Gilbert. 2011. Adaptive shape processing in primary visual cortex. Proceedings of the National Academy of Sciences 108: 9739–9746.
Mechelli, A., C. Price, K. Friston, and A. Ishai. 2004. Where bottom–up meets top–down: Neuronal interactions during perception and imagery. Cerebral Cortex 14: 1256–1265.
Mo, L., G. Xu, P. Kay, and L.H. Tan. 2011. Electrophysiological evidence for the left-lateralized effect of language on preattentive categorical perception of color. Proceedings of the National Academy of Sciences 108: 14026–14030.
Moore, C., and P. Cavanagh. 1998. Recovery of 3D volume from 2-tone images of novel objects. Cognition 67: 45–71.
Moro, V., G. Berlucchi, J. Lerch, F. Tomaiuolo, and S. Aglioti. 2008. Selective deficit in mental visual imagery with intact primary visual cortex and visual perception. Cortex 44: 109–118.
Murray, E., D. Gaffan, and M. Mishkin. 1993. Neural substrates of visual stimulus–stimulus association in Rhesus monkeys. Journal of Neuroscience 13: 4549–4561.
Neri, P. 2014. Semantic control of feature extraction from natural scenes. Journal of Neuroscience 34: 2374–2388.
Pearson, J., C. Clifford, and F. Tong. 2008. The functional impact of mental imagery on conscious perception. Current Biology 18: 982–986.
Pinto, Y., S. van Gaal, F. de Lange, V. Lamme, and A. Seth. 2015. Expectations accelerate entry of visual stimuli into awareness. Journal of Vision 15, doi: 10.1167/15.8.13.
Prinz, J. 2006. Is the mind really modular? In Contemporary debates in cognitive science, ed. J. Stainton, 22–36. Malden: Blackwell.
Pylyshyn, Z. 1999. Is vision continuous with cognition?: The case for cognitive impenetrability of visual perception. Behavioral and Brain Sciences 22: 341–365.
Pylyshyn, Z. 2003. Seeing and visualizing. Cambridge: MIT Press.
Raftopoulos, A. 2009. Cognition and perception: How do psychology and neural science inform philosophy? Cambridge MA: Bradford Books.
Ramalingam, N., J. Mcmanus, W. Li, and C. Gilbert. 2013. Top-down modulation of lateral interactions in visual cortex. Journal of Neuroscience 33: 1773–1789.
Reddy, L., N. Tsuchiya, and T. Serre. 2010. Reading the mind’s eye: Decoding category information during mental imagery. NeuroImage 50: 818–825.
Rockland, K., and D. Pandya. 1981. Cortical connections of the occipital lobe in the rhesus monkey: interconnections between areas 17, 18, 19 and the superior temporal sulcus. Brain Research 212: 249–270.
Schapiro, A., E. Gregory, B. Landau, M. McCloskey, and N. Turk-Browne. 2014. The necessity of medial temporal lobe for statistical learning. Journal of Cognitive Neuroscience 26: 1736–1747.
Sklar, A., N. Levy, A. Goldstein, R. Mandel, A. Maril, and R. Hassin. 2012. Reading and doing arithmetic nonconsciously. Proceedings of the National Academy of Sciences 109: 19614–19619.
Stokes, D. 2011. Perceiving and desiring: a new look at the cognitive penetrability of experience. Philosophical Studies 158: 477–492.
Summerfield, C., and T. Egner. 2009. Expectation (and attention) in visual cognition. Trends in Cognitive Sciences 13: 403–409.
Teufel, C., et al. forthcoming. Talk delivered at the cognitive penetration workshop, Bergen, June 2015.
Thierry, G., P. Athanasopoulos, A. Wiggett, B. Dering, and J.R. Kuipers. 2009. Unconscious effects of language-specific terminology on preattentive color perception. Proceedings of the National Academy of Sciences 106: 4567–4570.
Tsuchiya, N., and C. Koch. 2005. Continuous flash suppression reduces negative afterimages. Nature Neuroscience 8: 1096–1101.
Vetter, P., F. Smith, and L. Muckli. 2014. Decoding sound and imagery content in early visual cortex. Current Biology 24: 1256–1262.
Wu, W. 2013. Visual spatial constancy and modularity: Does intention penetrate vision? Philosophical Studies 165: 647–669.
Wyart, V., A. Nobre, and C. Summerfield. 2012. Dissociable prior influences of signal probability and relevance on visual contrast sensitivity. Proceedings of the National Academy of Sciences 109: 3593–3598.
Yang, E., and R. Blake. 2012. Deconstructing continuous flash suppression. Journal of Vision 12: 1–14.
Acknowledgments
The authors are grateful to Tom Carlson, Chaz Firestone, Zoe Jenkins, Peter Kok, Eric Mandelbaum, Jake Quilty-Dunn, and an anonymous referee for their comments on earlier drafts of this paper.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Ogilvie, R., Carruthers, P. Opening Up Vision: The Case Against Encapsulation. Rev.Phil.Psych. 7, 721–742 (2016). https://doi.org/10.1007/s13164-015-0294-8
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13164-015-0294-8