Maintenance of responding when reinforcement becomes delayed

Costa, Daniel S. J.; Boakes, Robert A.

doi:10.3758/BF03193044

Maintenance of responding when reinforcement becomes delayed

Published: May 2007

Volume 35, pages 95–105, (2007)
Cite this article

Download PDF

Learning & Behavior Aims and scope Submit manuscript

Maintenance of responding when reinforcement becomes delayed

Download PDF

Daniel S. J. Costa¹ &
Robert A. Boakes¹

649 Accesses
11 Citations
1 Altmetric
Explore all metrics

Abstract

In four experiments with rats, we examined the persistence of behavior when reinforcement was switched from immediate to delayed. In Experiment 1, lever pressing elicited by instrumental training with immediate reinforcement continued when a 20-sec delay of reinforcement was introduced (easy-to-hard condition), whereas when the delay condition was introduced from the start (hard-to-hard condition), responding remained low throughout. A similar result was obtained in Experiment 2, in which lever pressing was elicited by a classical conditioning (autoshaping) procedure. In Experiment 3, rats initially trained with delayed reinforcement continued to respond at a low rate when switched to immediate reinforcement (hard-to-easy condition). By measuring magazine entry (goal tracking) as well as lever pressing (sign tracking) in Experiment 4, we confirmed that such transfer effects at least partly involve the persistence of whatever type of behavior was initially dominant.

Article PDF

A comparison of renewal, spontaneous recovery, and reacquisition after punishment and extinction

Article 07 November 2022

Training reinforcement rates, resistance to extinction, and the role of context in reinstatement

Article 03 July 2015

Retention period differentially attenuates win–shift/lose–stay relative to win–stay/lose–shift performance in the rat

Article Open access 22 September 2017

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

References

Boakes, R. A. (1977). Performance on learning to associate a stimulus with positive reinforcement. In H. Davis & H. M. B. Hurwitz (Eds.),Operant—Pavlovian interactions (pp 67–101). Hillsdale, NJ: Erlbaum.
Google Scholar
Bonardi, C. &Ong, S. Y. (2003). Learned irrelevance: A contemporary overview.Quarterly Journal of Experimental Psychology,56B, 80–89.
Google Scholar
Brown, B. L, Hemmes, N. S, Cabeza de Vaca, S. &Pagano, C. (1993). Sign and goal tracking during delay and trace autoshaping in pigeons.Animal Learning & Behavior,21, 360–368.
Google Scholar
Costa, D. S. J. (2004).The momentum effect: Maintenance of responding when reinforcement is delayed. Unpublished honors thesis, School of Psychology, University of Sydney.
Dickinson, A. (1980).Contemporary animal learning theory. Cambridge: Cambridge University Press.
Google Scholar
Dickinson, A, Watt, A. &Griffiths, W. J. (1992). Free-operant acquisition with delayed reinforcement.Quarterly Journal of Experimental Psychology,45B, 241–258.
Google Scholar
Harker, G. S. (1956). Delay of reward and performance of an instrumental response.Journal of Experimental Psychology,51, 303–310.
Article PubMed Google Scholar
Holland, P. C. (1977). Conditioned stimulus as a determinant of the form of the Pavlovian conditioned response.Journal of Experimental Psychology: Animal Behavior Processes,3, 77–104.
Article PubMed Google Scholar
Kehoe, E. J. &Holt, P. E. (1984). Transfer across CS#x2014;US intervals and sensory modalities in classical conditioning of the rabbit.Animal Learning & Behavior,12, 122–128.
Google Scholar
Lawrence, D. H. (1952). The transfer of discrimination along a continuum.Journal of Comparative & Physiological Psychology,45, 511–516.
Article Google Scholar
Lieberman, D. A, McIntosh, D. C. &Thomas, G. V. (1979). Learning when reward is delayed: A marking hypothesis.Journal of Experimental Psychology: Animal Behavior Processes,5, 224–242.
Article PubMed Google Scholar
Lieberman, D. A. &Thomas, G. V. (1986). Marking, memory, and superstition in the pigeon.Quarterly Journal of Experimental Psychology,38B, 449–459.
Google Scholar
Logan, F. A. (1952). The role of delay of reinforcement in determining reaction potential.Journal of Experimental Psychology,43, 393–399.
Article PubMed Google Scholar
Mackintosh, N. J. (1974).The psychology of animal learning. London: Academic Press.
Google Scholar
Mackintosh, N. J. (1975). A theory of attention: Variations in the associability of stimuli with reinforcement.Psychological Review,82, 276–298.
Article Google Scholar
Mackintosh, N. J. &Little, L. (1970). An analysis of transfer along a continuum.Canadian Journal of Psychology,24, 362–369.
PubMed Google Scholar
McGreevy, P. &Boakes, R. A. (2007).Carrots and sticks: Principles of animal training. Cambridge: Cambridge University Press.
Google Scholar
Messing, R. B, Kleven, M. S. &Sparber, S. B. (1986). Delaying reinforcement in an autoshaping task generates adjunctive and superstitious behaviors.Behavioural Processes,13, 327–338.
Article Google Scholar
Midgley, M, Lea, S. E. G. &Kirby, R. M. (1989). Algorithmic shaping and misbehavior in the acquisition of token deposit by rats.Journal of the Experimental Analysis of Behavior,52, 27–40.
Article PubMed Google Scholar
Myer, J. S. &Hull, J. H. (1974). Autoshaping and instrumental learning in the rat.Journal of Comparative & Physiological Psychology,86, 724–729.
Article Google Scholar
Nevin, J. A. &Grace, R. C. (2000). Behavioral momentum and the Law of Effect.Behavioral & Brain Sciences,23, 73–130.
Article Google Scholar
Pavlov, I. P. (1927).Conditioned reflexes: An investigation of the physiological activity of the cerebral cortex (V. Anrep, Trans.). London: Oxford University Press.
Google Scholar
Pear, J. J. &Legris, J. A. (1987). Shaping by automated tracking of an arbitrary operant response.Journal of the Experimental Analysis of Behavior,47, 241–247.
Article PubMed Google Scholar
Perin, C. T. (1943). A quantitative investigation of the delay of reinforcement gradient.Journal of Experimental Psychology,32, 37–52.
Article Google Scholar
Prokasy, W. F, Ebel, H. C. &Thompson, D. D. (1963). Response shaping at long interstimulus intervals in classical eyelid conditioning.Journal of Experimental Psychology,66, 138–141.
Article PubMed Google Scholar
Prokasy, W. F. &Papsdorf, J. D. (1965). Effects of increasing the interstimulus interval during classical conditioning of the albino rabbit.Journal of Comparative & Physiological Psychology,60, 249–252.
Article Google Scholar
Reilly, S, Schachtman, T. R. &Reid, P. (1996). Signaled delay of reinforcement: Effects of postconditioning manipulation of context associative strength on instrumental performance.Learning & Motivation,27, 451–463.
Article Google Scholar
Rescorla, R. A. (1989). Redundant treatments of neutral and excitatory stimuli in autoshaping.Journal of Experimental Psychology: Animal Behavior Processes,15, 212–223.
Article Google Scholar
Rescorla, R. A, Durlach, P. J. &Grau, J. W. (1985). Contextual learning in Pavlovian conditioning. In P. D. Balsam & A. Tomie (Eds.),Context and learning (pp 23–56). Hillsdale, NJ: Erlbaum.
Google Scholar
Scahill, V. L. &Mackintosh, N. J. (2004). The easy to hard effect and perceptual learning in flavor aversion conditioning.Journal of Experimental Psychology: Animal Behavior Processes,30, 93–103.
Article PubMed Google Scholar
Schmidt, R. A. &Bjork, R. A. (1992). New conceptualizations of practice: Common principles in three paradigms suggest new concepts for training.Psychological Science,3, 207–217.
Article Google Scholar
Stiers, M. &Silberberg, A. (1974). Lever-contact responses in rats: Automaintenance with and without negative response-reinforcer dependency.Journal of the Experimental Analysis of Behavior,22, 497–506.
Article PubMed Google Scholar
Thorndike, E. L. (1911).Animal intelligence: Experimental studies. New York: Macmillan.
Google Scholar
Westbrook, R. F. &Homewood, J. (1982). The effects of a flavour— toxicosis pairing upon long-delay, flavour aversion learning.Quarterly Journal of Experimental Psychology,34B, 59–75.
Google Scholar
Williams, B. A. (1999). Associative competition in operant conditioning: Blocking the response—reinforcer association.Psychonomic Bulletin & Review,6, 618–623.
Google Scholar

Download references

Author information

Authors and Affiliations

School of Psychology (A18), University of Sydney, 2006, Sydney, NSW, Australia
Daniel S. J. Costa & Robert A. Boakes

Authors

Daniel S. J. Costa
View author publications
You can also search for this author in PubMed Google Scholar
Robert A. Boakes
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Daniel S. J. Costa.

Additional information

Experiments 1 and 2A were reported in D.S.J.C.’s unpublished honors thesis at the University of Sydney (Costa, 2004). Experiments 2B, 3, and 4 were partially supported by an Australian Research Council grant to R.A.B. and by an Australian Postgraduate Award to D.S.J.C.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Costa, D.S.J., Boakes, R.A. Maintenance of responding when reinforcement becomes delayed. Learning & Behavior 35, 95–105 (2007). https://doi.org/10.3758/BF03193044

Download citation

Received: 06 September 2006
Accepted: 12 December 2006
Issue Date: May 2007
DOI: https://doi.org/10.3758/BF03193044

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Maintenance of responding when reinforcement becomes delayed

Abstract

Article PDF

Similar content being viewed by others

A comparison of renewal, spontaneous recovery, and reacquisition after punishment and extinction

Training reinforcement rates, resistance to extinction, and the role of context in reinstatement

Retention period differentially attenuates win–shift/lose–stay relative to win–stay/lose–shift performance in the rat

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Maintenance of responding when reinforcement becomes delayed

Abstract

Article PDF

Similar content being viewed by others

A comparison of renewal, spontaneous recovery, and reacquisition after punishment and extinction

Training reinforcement rates, resistance to extinction, and the role of context in reinstatement

Retention period differentially attenuates win–shift/lose–stay relative to win–stay/lose–shift performance in the rat

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation