Skip to main content
Log in

Primate errors in transitive ‘inference’: a two-tier learning model

  • Original Article
  • Published:
Animal Cognition Aims and scope Submit manuscript

Abstract

Transitive performance (TP) is a learning-based behaviour exhibited by a wide range of species, where if a subject has been taught to prefer A when presented with the pair AB but to prefer B when presented with the pair BC, then the subject will also prefer A when presented with the novel pair AC. Most explanations of TP assume that subjects recognize and learn an underlying sequence from observing the training pairs. However, data from squirrel monkeys (Saimiri sciureus) and young children contradict this, showing that when three different items (a triad) are drawn from the sequence, subjects’ performance degrades systematically (McGonigle and Chalmers, Nature 267:694–696, 1977; Chalmers and McGonigle, Journal of Experimental Child Psychology 37:355–377, 1984; Harris and McGonigle, The Quarterly Journal of Experimental Psychology 47B:319–348, 1994). We present here the two-tier model, the first learning model of TP which accounts for this systematic performance degradation. Our model assumes primate TP is based on a general-purpose task learning system rather than a special-purpose sequence-learning system. It supports the hypothesis of Heckers et al. (Hippocampus 14:153–162, 2004) that TP is an expression of two separate general learning elements: one for associating actions and contexts, another for prioritising associations when more than one context is present. The two-tier model also provides explanations for why phased training is important for helping subjects learn the initial training pairs and why some subjects fail to do so. It also supports the Harris and McGonigle (The Quarterly Journal of Experimental Psychology 47B:319–348, 1994) explanation of why, once the training pairs have been acquired, subjects perform transitive choice automatically on two-item diads, but not when exposed to triads from the same sequence.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Notes

  1. This is a subset of all reported TI effects, see further Wynne (1998) or Shultz and Vogel (2004).

  2. The psychological literature is not consistent about whether A or E is the ‘higher’ (rewarded) end. This paper uses A as high.

  3. If X is a stimulus, Y is one of the other stimuli present chosen randomly, unless the rule was avoid in which case it is the item actually grasped. If X is an action then Y is the second-highest priority action for that stimulus.

  4. The term agent is actually intended to refer to any actor, artificial or not, but it has become associated with AI software systems.

  5. Normally in AI, agents are considered to have fully learned a task only when their weights have stabilised (stopped changing). This of course can not map directly to the animal research, where learning must be judged by expressed behaviour.

  6. We use the term strategy here to mean an innate, evolved solution, not something intentionally selected by the subjects.

  7. Shultz and Vogel refer to this layer as a Cascade Correlation (CC) network (Fahlman and Lebiere 1990). However, the single-layer TI problem is so simple (as we demonstrate in Experiment 1) that the CC algorithm never adds any hidden units.

  8. In animals, the contiguity effect can only be expressed in terms of favouring the rewarded end of a series.

  9. A personal edition of LispWorks (which runs on all platforms) is available for free download from its manufacturers, and all software used in this paper is available from the authors.

References

  • Alvarado MC, Bachevalier J (2000) Revisiting the maturation of medial temporal lobe memory functions in primates. Learn Mem 7:244–256

    Google Scholar 

  • Anderson JR (1993) Rules of the mind. Lawrence Erlbaum Associates, Hillsdale, NJ

  • Baxter MG, Murray EA (2001) Opposite relationship of hippocampal and rhinal cortex damage to delayed nonmatching-to-sample deficits in monkeys. Hippocampus 11:61–71

    Google Scholar 

  • Bermúdez-Rattoni F, Rusiniak KW, Garcia J (1983) Flavor-illness aversions: potentiation of odor by taste is disrupted by application of novocaine into amygdala. Behav Neural Biol 37:61–75

    Google Scholar 

  • Bryant PE, Trabasso T (1971) Transitive inferences and memory in young children. Nature 232:456–458

    Google Scholar 

  • Bryson JJ (2001) Intelligence by design: principles of modularity and coordination for engineering complex adaptive agents. PhD thesis, MIT, Department of EECS Cambridge, MA. AI Technical Report 2001–003

  • Bryson JJ, Lowe W (1997) Cognition without representational redescription. Behav Brain Sci 20:743–744. Commentary on Ballard et al., Deictic codes for the embodiment of cognition

    Google Scholar 

  • Bryson JJ, Stein LA (2001) Architectures and idioms: making progress in agent design. In: Castelfranchi C, Lespérance Y (eds) The seventh international workshop on agent theories, architectures, and languages (ATAL2000). Springer, Berlin Heidelberg New York

  • Buckmaster CA, Eichenbaum H, Amaral DG, Suzuki WA, Rapp PR (2004) Entorhinal cortex lesions disrupt the relational organization of memory in monkeys. J Neurosci 24:9811–9825

    Google Scholar 

  • Chalmers M, McGonigle BO (1984) Are children any more logical than monkeys on the five term series problem? J Exp Child Psychol 37:355–377

    Google Scholar 

  • Cowan N (2001) The magical number 4 in short-term memory: a reconsideration of mental storage capacity. Brain Behav Sci 24:87–114

    Google Scholar 

  • de Boysson-Bardies B, O'Regan K (1973) What children do in spite of adults’ hypotheses. Nature 246:531–534

    Google Scholar 

  • De Lillo C, Floreano D, Antinucci F (2001) Transitive choices by a simple, fully connected, backpropagation neural network: implications for the comparative study of transitive inference. Anim Cogn 4:61–68

    Google Scholar 

  • Delius JD, Siemann M (1998) Transitive responding in animals and humans: exaptation rather than adaptation? Behav Processes 42:107–137

    Google Scholar 

  • Dusek JA, Eichenbaum H (1997) The hippocampus and memory for orderly stimuls relations. Proc Nat Acad Sci USA 94:7109–7114

    Google Scholar 

  • Fahlman SE, Lebiere C (1990) The cascade-correlation learning architecture. In: Touretzky DS (ed) Advances in neural information processing systems 2 (NIPS'90). Morgan-Kaufmann, San Mateo, CA, pp 524–532

  • Fersen L, Wynne CDL, Delius J, Staddon JER (1991) Transitive inference formation in pigeons. J Exp Psychol Anim Behav Processes 17:334–341

    Google Scholar 

  • Frank MJ, Rudy JW, O'Reilly RC (2003) Transitivity, flexibility, conjunctive representations, and the hippocampus: II. A computational analysis. Hippocampus 13:341–354

    Google Scholar 

  • Gallistel C, Brown AL, Carey S, Gelman R, Keil FC (1991) Lessons from animal learning for the study of cognitive development. In: Carey S, Gelman R (eds) The epigenesis of mind. Lawrence Erlbaum Hillsdale, NJ, pp 3–36

  • Gilovich T, Vallone R, Tversky A (1985) The hot hand in basketball: on the misperception of random sequences. Cognit Psychol 17:295–314

    Google Scholar 

  • Glasspool DW (1995) Competitive queuing and the articulatory loop. In: Levy J, Bairaktaris D, Bullinaria J, Cairns P (eds) Connectionist models of memory and language. UCL Press, London

  • Harris MR (1988) Computational modelling of transitive inference: a micro analysis of a simple form of reasoning. PhD thesis, University of Edinburgh

  • Harris MR, McGonigle BO (1994) A model of transitive choice. Q J Exp Psychol 47B:319–348

    Google Scholar 

  • Heckers S, Zalesak M, Weiss AP, Ditman T, Titone D (2004) Hippocampal activation during transitive inference in humans. Hippocampus 14:153–162

    Google Scholar 

  • Hertz J, Krogh A, Palmer RG (1991) Introduction to the theory of neural computation. Addison-Wesley, Redwood City, CA

  • Hurliman E, Nagode JC, Pardo JV (2005) Double dissociation of exteroceptive and interoceptive feedback systems in the orbital and ventromedial prefrontal cortex of humans. J Neurosci 25:4641–4648

    Google Scholar 

  • McClelland JL, McNaughton BL, O'Reilly RC (1995) Why there are complementary learning systems in the hippocampus and neocortex: insights from the successes and failures of connectionist models of learning and memory. Psychol Rev 102:419–457

    Google Scholar 

  • McDonald S, Lowe W (1998) Modelling functional priming and the associative boost. In: Gernsbacher MA, Derry SD (eds) Proceedings of the 20th annual meeting of the cognitive science society. Lawrence Erlbaum Associates, New Jersey, pp 675–680

  • McGonigle BO Chalmers M (1977) Are monkeys logical? Nature 267:694–696

    Google Scholar 

  • McGonigle BO, Chalmers M (1992) Monkeys are rational! Q J Exp Psychol 45B:189–228

    Google Scholar 

  • McGonigle BO, Chalmers M (1996) The ontology of order. In: Smith L (ed) Critical readings on piaget. Routledge London, chapter 14

  • Newell A (1990) Unified theories of cognition. Harvard University Press, Cambridge, MA

  • O'Reilly RC, Rudy JW (2001) Conjunctive representations in learning and memory: principles of cortical and hippocampal function. Psychol Rev 108:311–345

    Google Scholar 

  • Piaget J (1928) Judgment and reasoning in the child. Routledge and Kegan Paul, London

  • Piaget J (1954) The construction of reality in the child. Basic Books, New York

  • Rapp PR, Kansky MT, Eichenbaum H (1996) Learning and memory for hierarchical relationships in the monkey: effects of aging. Behav Neurosci 110:887–897

    Google Scholar 

  • Rescorla RA, Wagner AR (1972) A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement. In: Black AH, Prokasy WF (eds) Classical conditioning, vol II. Appleton, New York, chapter 3, pp 64–99

  • Shultz TR, Vogel A (2004) A connectionist model of the development of transitivity. In: The 26th annual meeting of the cognitive science society (CogSci 2004). Lawrence Erlbaum Associates, Chicago, pp 1243–1248

  • Siemann M, Delius JD (1993) Implicit deductive reasoning in humans. Naturwissenshaften 80:364–366

    Google Scholar 

  • Siemann M, Delius JD, Dombrowski D, Daniel S (1996) Value transfer in discriminative conditioning with pigeons. Psychol Record 46:707–728

    Google Scholar 

  • Steele, GL, Jr (1990) Common Lisp: the language, 2nd edn. Digital Press, Bedford, MA

  • Terrace HS, McGonigle BO (1994) Memory and representation of serial order by children, monkeys and pigeons. Curr Dir Psychol Sci 3:180–185

    Google Scholar 

  • Tyrrell T (1993) Computational mechanisms for action selection. PhD thesis, University of Edinburgh. Centre for Cognitive Science

  • von Fersen L, Wynne CDL, Delius JD, Staddon JER (1991) Transitive inference formation in pigeons. J Exp Psychol: Anim Behav Processes 17:334–341

    Google Scholar 

  • Van Elzakker M, O'Reilly RC, Rudy JW (2003) Transitivity, flexibility, conjunctive representations, and the hippocampus: an empirical analysis. Hippocampus 13:334–340

    Google Scholar 

  • Waelti P, Dickinson A, Schultz W (2001) Dopamine reponses comply with basic assumptions of formal learning theory. Nature 412:43–48

    Google Scholar 

  • Wolpert DH (1996) The lack of a priori distinctions between learning algorithms. Neural Comput 8:1341–1390

    Google Scholar 

  • Wood MA, Leong JCS, Bryson JJ (2004) ACT-R is almost a model of primate task learning: experiments in modelling transitive inference. In: The 26th annual meeting of the cognitive science society (CogSci 2004). Lawrence Erlbaum Associates, Chicago, pp 1470–1475

  • Wright BC (2001) Reconceptualizing the transitive inference ability: a framework for existing and future research. Dev Rev 21:375–422

    Google Scholar 

  • Wynne CDL (1998) A minimal model of transitive inference. In: Wynne CDL, Staddon JER (eds) Models of action. Lawrence Erlbaum Associates Mahwah, NJ, pp 269–307

  • Xanalys (2001) Lispworks Professional Edition 4.1.20. Waltham, MA (formerly Harlequin)

  • Zentall TR, Sherburne LM (1994) Transfer of value from S\( +\) to S\( -\) in a simultaneous discrimination. J Exp Psychol: Anim Behav Processes 20:176–183

    Google Scholar 

Download references

Acknowledgements

We would like to thank Will Lowe, Mark Baxter, Juan Delius, Brendan McGonigle, Lynn Andrea Stein, Olin Shivers, John Mann, Emily Korvin, Mark Wood, Marc Hauser and the denizens of the Harvard Primate Cognitive Neuroscience Lab, particularly Roian Egnor. We would also like to thank our anonymous reviewers for many helpful comments and criticisms. All of the subjects used for the novel results in this article were computer programs and as such are not subject to any experimental ethics regulation either in the UK or elsewhere. However, the validity of AI models is entirely dependent on data from live subjects, and we fully support our colleagues involved in responsible animal research.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Joanna J. Bryson.

Appendices

Appendix A: the binary sampling model

In one of the earliest responses to Bryant and Trabasso (1971), McGonigle and Chalmers (1977) not only demonstrated non-human animal learning of TP, but also proposed a model to account for the errors the animals made. Their subjects were squirrel monkeys (Saimiri sciureus). Like human children, these monkeys tend to score only around 90% on the pair \( BD\). To explain this, McGonigle and Chalmers proposed the binary-sampling theory. This theory assumes that:

  • Subjects consider not only the items visible, but also items that might be expected to be visible. That is, they take into account elements associated with the current stimulus, especially intervening stimuli associated with both.

  • Subjects consider only two of these possible elements, choosing the pair at random. If they were trained on that pair, they perform as trained; otherwise they perform at chance, unless one of the items is an end item, A or E, in which case they perform by selecting or avoiding the item, respectively.

  • If the subject chooses an item that is only expected, not actually present, it obviously cannot act on that selection (e.g. grasp the item). However, selection reinforces consideration of that item, which makes it likely the next pair the animal considers includes one of the higher-valued of the items displayed.

Thus for the pair \( BD\), this model assumes an equal chance the monkey will focus on \( BC\), \( CD\), or \( BD\). Either established training pair results in the monkey selecting B, while the pair \( BD\) results in an even (therefore 17%) chance of either element being chosen. This predicts that the subjects would select B about 83% of the time, which is near to the average actual performance of 85%.

The binary-sampling theory can be viewed as a naive probabilistic model—it incorporates the concept of expectation, but not in a full-fledged probabilistic framework. It proved controversial both because of lack of parsimony (or explanation) for the ‘imagining’ the extra items, and for apparently contradicting the SDE, since further-apart pairs may require more operations (though see McGonigle and Chalmers 1992). What is significant to the present model is that it motivated McGonigle and Chalmers to generate a data set showing the results of testing monkeys (and later children Chalmers and McGonigle 1984) on triads of three items. The binary-sampling theory predicts that for the triad \( BCD\) there is a 17% chance D will be chosen (half of the times \( BD\) is attended to), a 33% chance C will be chosen (all of the times \( CD\) is attended to) and a 50% chance of B (all of \( BC\) plus half of \( BD\)). Any model using a fully sequential representation, or indeed true TP, would of course predict 0, 0 and 100%. In fact, the monkeys showed 3, 36 and 61%, respectively. Six-year-old human children showed a similar pattern on triad results (Chalmers and McGonigle 1984). While this is a fairly good match, the Harris (1988) production-rule model provides a significantly better one.

Appendix B: details of the simulation

The ALife agents were written in the Common Lisp Object System (CLOS) (Steele 1990), a standard programming language frequently used for artificial intelligence. The code was developed under the Behavior Oriented Design methodology, developed by one of the authors (Bryson 2001), and implemented on top of a graphical software development environment for CLOS called LispWorks (Xanalys 2001).Footnote 9

The artificial intelligence (AI) program that runs the simulations controls not only the learning agents but also the operation of the test apparatus including the recording of results. The program is modular, and the knowledge in the modules representing different real-world agents (the subjects, apparatus and operator) is kept isolated from the other agents’ knowledge. That is, although the testing-apparatus modules contain knowledge about the correct solution of the experiment, the test subject has no direct access to this knowledge except as evidenced by the reward.

On each trial, the testing agent (the apparatus) generates an n-gram (either diad or triad) as appropriate to the current phase of the experiment and places its element in the test-board. The learning agent (the subject) then selects one of the options by transferring it from the test-board to its hand. The apparatus determines whether the subject has chosen correctly and provides reinforcement with either a ‘peanut or a ‘buzzer token. The apparatus also records the trial results, and ends the trial by clearing the testing area.

When the subject is presented with the test-board it selects an object as described (see Fig. 1). When the subject receives reinforcement, it applies the learning rule in Eq. (1) based on its current variable state (its expectations), its current context (the contents of the test board and its hand, and the rules to which it is attending) and the presence or absence of the ‘peanut reward.’

In Experiments 1 and 2, an indefinite number of trials were run until the simulation was terminated by the human experimenter. In Experiment 3, the apparatus was enhanced to run the training and testing procedure shown in Table 1.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bryson, J.J., Leong, J.C.S. Primate errors in transitive ‘inference’: a two-tier learning model. Anim Cogn 10, 1–15 (2007). https://doi.org/10.1007/s10071-006-0024-9

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10071-006-0024-9

Keywords

Navigation