Behavioral momentum and resurgence: Effects of time in extinction and repeated resurgence tests

Sweeney, Mary M.; Shahan, Timothy A.

doi:10.3758/s13420-013-0116-8

Behavioral momentum and resurgence: Effects of time in extinction and repeated resurgence tests

Published: 28 August 2013

Volume 41, pages 414–424, (2013)
Cite this article

Download PDF

Learning & Behavior Aims and scope Submit manuscript

Behavioral momentum and resurgence: Effects of time in extinction and repeated resurgence tests

Download PDF

Mary M. Sweeney¹ &
Timothy A. Shahan¹

2263 Accesses
48 Citations
1 Altmetric
Explore all metrics

Abstract

Resurgence is an increase in a previously extinguished operant response that occurs if an alternative reinforcement introduced during extinction is removed. Shahan and Sweeney (2011) developed a quantitative model of resurgence based on behavioral momentum theory that captures existing data well and predicts that resurgence should decrease as time in extinction and exposure to the alternative reinforcement increases. Two experiments tested this prediction. The data from Experiment 1 suggested that without a return to baseline, resurgence decreases with increased exposure to alternative reinforcement and to extinction of the target response. Experiment 2 tested the predictions of the model across two conditions, one with constant alternative reinforcement for five sessions, and the other with alternative reinforcement removed three times. In both conditions, the alternative reinforcement was removed for the final test session. Experiment 2 again demonstrated a decrease in relapse across repeated resurgence tests. Furthermore, comparably little resurgence was observed at the same time point in extinction in the final test, despite dissimilar previous exposures to alternative reinforcement removal. The quantitative model provided a good description of the observed data in both experiments. More broadly, these data suggest that increased exposure to extinction may be a successful strategy to reduce resurgence. The relationship between these data and existing tests of the effect of time in extinction on resurgence is discussed.

Classical and Operant Conditioning—Ivan Pavlov; Burrhus Skinner

Fear of missing out: prevalence, dynamics, and consequences of experiencing FOMO

Article 17 March 2018

Motivation: Introduction to the Theory, Concepts, and Research

Resurgence is relapse that occurs following the removal of alternative reinforcement introduced during the extinction of an operant response. Resurgence has practical implications for treatments using alternative reinforcement to reduce problem behaviors, because it suggests that the removal or reduction of alternative reinforcement following treatment can result in an increase in the problem behavior. Many popular behavioral treatments involve alternative reinforcement, such as contingency management for substance abuse (e.g., Higgins et al., 2010; Silverman et al., 2007) and differential reinforcement of alternative behavior (DRA) in individuals with intellectual or developmental disabilities (Petscher, Rey, & Bailey, 2009). Although treatments that use alternative reinforcement are often effective at reducing problem behavior during treatment, the risk of relapse when alternative reinforcement is reduced or removed has led to a recent revival in basic (Lieving & Lattal, 2003; Quick, Pyszczynski, Colston, & Shahan, 2011; Winterbauer & Bouton, 2010) and applied (Volkert, Lerman, Call, & Trosclaire-Lasserre, 2009) research on resurgence (see Lattal & St. Peter Pipkin, 2009, for a review).

Behavioral momentum theory has been useful for understanding the persistence (e.g., Nevin, Mandell, & Atak, 1983) and relapse (Podlesnik & Shahan, 2010) of operant behavior—as such, the application of behavioral momentum theory to resurgence could shed light on important determinants of resurgence magnitude. In an effort to integrate resurgence into behavioral momentum theory, Shahan and Sweeney (2011) proposed a quantitative model of resurgence based on the augmented-extinction model (Nevin & Grace, 2000). The augmented-extinction model suggests that experience with higher rates of reinforcement within a discriminative-stimulus context prior to extinction renders an operant response more resistant to the disruptive effects of extinction. The model suggests:

$$ \log \left(\frac{B_t}{B_0}\right)=\frac{-t\left(c+ dr\right)}{r^b} $$

(1)

where B _t is the response rate at time t in extinction and B ₀ is the baseline response rate before extinction, c is the suppressive effect of breaking the response–reinforcer contingency, d scales the suppression associated with the elimination of reinforcers from the situation (i.e., generalization decrement), r is the rate of reinforcement within the context at baseline, and b is sensitivity to the reinforcement rate. As time in extinction increases, the disruptive impact increases (in the numerator) but is counteracted by previous experience with higher reinforcement rates in the discriminative context (in the denominator). The reinforcement experienced in the context includes all sources of reinforcement, regardless of whether they are contingent upon the target response, independent of the target response, or even contingent on an alternative response. This prediction stems from behavioral momentum theory’s suggestion that resistance to disruption is governed by the Pavlovian discriminative-stimulus–reinforcer relation, which has been supported by research with species ranging from fish to humans (e.g., Ahearn, Clark, Gardenier, Chung, & Dube, 2003; Cohen, 1996; Igaki & Sakagami, 2004; Nevin, Tota, Torquato, & Shull, 1990; Shahan & Burke, 2004). Nevin, McLean, and Grace (2001) have shown that the c and d parameters are independent, vary as expected with experimental manipulations, and combine additively, as suggested by the model. Equation 1 also accounts for the partial-reinforcement extinction effect, because at very high rates of reinforcement, the stimulus change associated with removal of the reinforcers from the situation (i.e., generalization decrement—dr) serves as a larger disruptor than does removal of reinforcers arranged on a schedule of partial reinforcement (Nevin & Grace, 2005). Equation 1 has provided a successful account of extinction of operant behavior in basic research and in applied settings (Nevin & Shahan, 2011, for a review).

Shahan and Sweeney (2011) extended Eq. 1 to resurgence by suggesting that alternative reinforcement during extinction of a target behavior has two effects. First, alternative reinforcement further disrupts the target behavior. Second, such reinforcement contributes to the strength of the target behavior by serving as an additional source of reinforcement in the context. Thus, the model suggests:

$$ \log \left(\frac{{ B}_t}{B_0}\right)=\frac{-t\left(k{R}_a+c+ dr\right)}{{\left(r+{R}_a\right)}^b} $$

(2)

where all terms are as in Eq. 1. The added variable R _a is the rate of alternative reinforcement during extinction, and the added parameter k scales the disruptive impact of the alternative reinforcement during extinction. The inclusion of kR _a increases the suppressive impact in the numerator, with higher rates of alternative reinforcement producing more suppression of the target behavior. When alternative reinforcement is removed, kR _a is zero, and the target behavior increases as a result of the decrease in disruption. In addition, because R _a is included in the denominator, alternative reinforcement experienced in the context during extinction also contributes to the future strength of the target behavior.

Equation 2 describes several known findings in the resurgence literature and fits existing data well (Shahan & Sweeney, 2011). One such finding is that less resurgence occurs following longer exposure to extinction plus alternative reinforcement (Leitenberg, Rawson, & Mulick, 1975, Exp. 4). Equation 2 captures the effect of extended exposure to extinction plus alternative reinforcement through its use of time in extinction as a factor that increases the impact of disruption over time. As time in extinction increases, t becomes larger, and consequently the larger numerator predicts that the removal of alternative reinforcement after extended periods of extinction will result in less resurgence.

A related prediction of Eq. 2 is that resurgence should decrease across repeated tests. In other words, when subjects are not returned to baseline contingencies of reinforcement for the target response, t continues to grow as exposure to extinction plus alternative reinforcement increases, and thus the model predicts that resurgence should decrease across each removal of alternative reinforcement. Figure 1 shows a simulation of this prediction using the exponentiated version of Eq. 2, which avoids logarithmic transformation of response rates and permits the inclusion zero values. The exponentiated version is

$$ \begin{array}{l}\frac{B_t}{B_0}={10}^{\frac{-t\left({ kR}_a+c+ dr\right)}{{\left(r+{ R}_a\right)}^b}}\end{array} $$

(3)

where all terms are as in Eq. 2. This simulation in Fig. 1 is supported by evidence from two studies, Quick et al. (2011) and Wacker et al. (2011). Quick et al. investigated resurgence of cocaine seeking in rats following the removal of alternative food reinforcement for nose pokes during extinction. The researchers introduced and removed alternative food reinforcement twice while keeping the extinction of cocaine seeking in place. Relapse during the second resurgence test was significantly less than in the first resurgence test—consistent with the predictions of Eq. 3. In an applied study with children with developmental disabilities, Wacker et al. alternated extinction of the problem behavior with extinction plus alternative reinforcement, in the form of functional communication training (FCT). The resurgence of the problem behavior that occurred following FCT generally decreased with each removal of alternative reinforcement. In fits of Eq. 3 to the data, Wacker et al. found that the model accurately described the decreased resurgence seen following repeated FCT. Although the percentage of variance accounted for was relatively low as compared with fits of the model to data from basic laboratories, the fits were compelling given the variability inherent in the data set collected in children’s homes with their mothers serving as therapists.

The purpose of the present experiments was to examine resurgence across repeated tests under conditions explicitly designed to test Eq. 3. Experiment 1 was designed to establish that resurgence decreases across repeated tests with simple food-maintained behavior, in a manner consistent with the predictions of Eq. 3 and with existing data from more complex situations (Quick et al., 2011; Wacker et al., 2011). Though the data from Quick et al. and Wacker et al. are consistent with the predictions of Eq. 3 displayed in Fig. 1, there are no data comparing a condition with repeated resurgence tests to a condition with constant alternative reinforcement. Equation 3 predicts that not only should resurgence decrease across repeated resurgence tests as time in extinction increases, but resurgence should be comparably low at a given time t in extinction in a condition in which the first removal of alternative reinforcement occurs at time t and a condition that includes alternative-reinforcement lapses prior to time t. Experiment 2 tested these predictions.

Experiment 1

In Experiment 1, we assessed the effects of repeated implementations of extinction plus alternative reinforcement on subsequent resurgence under conditions designed to test the predictions of Eq. 3. The experimental parameters used in Experiment 1 provided the basis for the simulation in Fig. 1.

Method

Subjects

Twelve unsexed homing pigeons (Double T Farm, Glenwood, IA) with varied previous experimental histories served as the subjects. The pigeons were maintained at approximately 80 % of their free-feeding weight (±15 g) via postsession feedings in the home cage and adjustments of the hopper duration across subjects ranging from 1.3 to 2 s. The colony room was on a 12:12-h light:dark cycle with lights on at 0700 h. Experimental sessions occurred in three squads of four pigeons each, with each squad running at approximately the same time each day.

Apparatus

The experimental sessions took place in four Lehigh Valley Electronics pigeon operant chambers that measure 350 mm long, 350 mm high, and 300 mm wide. Three response keys, 83 mm apart, each 25 mm in diameter, were centered on the front panel of the chamber. The keys were transilluminated via back-mounted in-line projectors and could display yellow, blue, and red homogeneous hues, as well as three separate white shapes (circle, horizontal line, and vertical line) on a black background. About 0.1 N of force was required to operate the keys. A house light located 76 mm above the center key provided general illumination directed toward the chamber ceiling. When the hopper was elevated, a miniature bulb illuminated the available Purina Pigeon Chow in a 50-mm wide × 55-mm tall aperture located 130 mm below the center key. A fan mounted to the outside of each chamber provided ventilation. The fan and white noise helped to mask extraneous sounds. Med Associates (St. Albans, VT) programming and interfacing were used to control the execution and recording of experimental events.

Procedure

Experiment 1 involved three phases: baseline, extinction, and test. Because the subjects had previous experimental histories, no shaping or pretraining was necessary before the baseline phase. Baseline consisted of ten sessions, during which only the center key was illuminated and displayed a white vertical line on a black background. Pecks to the center key (the target response) produced food on a variable-interval (VI) 60-s schedule of reinforcement. When a food delivery was arranged, the next target response turned off the house light and response key and produced access to the illuminated hopper aperture. Following the hopper presentation, the key and the house light were relit, and the VI timer restarted. Sessions were 45 min long, excluding hopper time.

During extinction (EXT), pecks to the center key (vertical line) no longer produced food, but pecks to the right key (blue hue) produced food on a VI 30-s schedule. As during baseline, hopper time was excluded from the session time, and the only stimulus illuminated during hopper delivery was the food aperture light. EXT lasted for three days.

Next, in the test phase, both the center key and the right key remained illuminated with their respective stimuli, but neither produced food. The test phase lasted for three days. Next, EXT and test were repeated (EXT 2, Test 2) for three days each, without returning to baseline.

Results

The means and standard deviations of the target key response rate, alternative key response rate, inactive key response rate, and obtained food delivery rate during each phase of Experiment 1 are presented in Table 1. Acquisition of the target response (i.e., pecks to the vertical line) proceeded normally during baseline. Pecks to the inactive (unlit) response keys were negligible for all pigeons.

Table 1 Experiment 1 response rates and food rates

Full size table

The target response rate decreased during the three days of EXT. Acquisition of the alternative response was rapid; pigeons earned close to the maximum food delivery rate on the first day of EXT (M = 1.88 foods/min, SD = 0.08). Responding on the inactive key continued to be negligible in all subjects. The average target response rate on the last day of EXT was 0.08 pecks/min, SD = 0.16.

Figure 2 displays target response rates during the final session of each exposure to the repeated EXT and the first session of each test phase. During the first session of the test phase, target response rates increased relative to the last session of EXT. The increase from the last session of EXT 2 to the first session of Test 2 was smaller than the increase from the last session of EXT to the first session of Test. We conducted a 2 × 2 within-subjects repeated measures analysis of variance (ANOVA) with the factors Transition and R _a. Target response rates for the sessions making up the first transition (i.e., EXT to Test) were coded as part of the first level of transition, whereas target response rates for sessions EXT 2 to Test 2 were coded as part of the second transition. The level of the factor R _a was determined by whether alternative reinforcement was present during the session (EXT and EXT 2) or absent (Test and Test 2). We found significant main effects of transition [F(1, 11) = 11.98, p < .01] and of R _a [F(1, 11) = 15.15, p < .01], as well as a Transition × R _a interaction [F(1, 11) = 11.85, p < .01.], capturing that the effect of removing alternative reinforcement on target response rates was different from the first to the second resurgence test. In simple-effects analyses, we found a statistically significant increase in target response rate on the first day of Test relative to the last day of EXT [F(1, 11) = 16.24, p < .01], and also a significant decrease in target response rate on the first day of Test 2 relative to the first day of Test [F(1, 11) = 11.92, p < .01]. No significant difference emerged between target response rates on the last day of EXT and the last day of EXT 2 [F(1, 11) = 2.55, p = .14], nor a significant difference between the last day of EXT 2 and the first day of Test 2 [F(1, 11) = 2.85, p = .12].

Given the visual increase from the last day of EXT 2 to the first day of Test 2, the data were examined for consistent patterns at the individual-subject level. Figure 3 displays the transitions for Test and Test 2. It is clear that for the first test, all but one subject showed an increased target response rate when alternative reinforcement was removed. For Test 2, only two subjects showed notable increases in target response rate and were driving the visual difference between mean target response rate on the last day of EXT 2 and the first day of Test 2.

Equation 3 was fitted to the mean subject data across all sessions, which is displayed in Fig. 4. As in Shahan and Sweeney (2011), the d parameter was fixed to a value of 0.001, b was fixed to 0.5, and the values of the variables t, R _a, and r were determined from the experimental parameters of time in extinction, alternative-reinforcement rate, and baseline reinforcement rate, respectively. Because the design included no return to baseline conditions, t increased daily by a value of 1. During EXT and EXT 2, the value of R _a in the numerator was set to 120 (i.e., foods per hour), and during Test and Test 2, R _a in the numerator was set to 0 because alternative reinforcement as a disruptor was not present. Consistent with the usual treatment of the previously experienced response-strengthening effects of reinforcement in the denominator of the augmented model during extinction (i.e., r in Eq. 1; see Nevin et al., 2001), the value of R _a in the denominator was 120 throughout all EXT and test phases. Only the parameters k and c were free to vary. The least squares regression fit of Eq. 3 to the mean subject data accounted for 99 % of the variance, with c = 2.56 and k = 0.09. Table 2 shows the parameter estimates obtained in the fits of Eq. 3 to the individual pigeon data. The median of the individual R ² values was .99 (M = .98, SD = .02). The median value of parameter c was 2.86 (M = 2.90, SD = 0.99), and the median value of k was 0.10 (M = 0.11, SD = 0.04).

Table 2 Equation 3 individual subject parameter values for the Experiment 1 fits

Full size table

Discussion

As in previous experiments in more complex situations (Quick et al., 2011; Wacker et al., 2011), the present experiment showed that resurgence appears to decrease across repeated tests. One could argue that for all but two subjects, resurgence did not occur upon the second removal of alternative reinforcement. Furthermore, the quality of the least squares regression fit to the data from Experiment 1 suggests that, on average, Eq. 3 adequately describes the repeated-resurgence phenomenon, although considerable variability occurred in the individual parameter estimates. It is important to note that decreased resurgence across tests is not contradictory to previous findings that repeated examinations of resurgence within subjects result in similar relapses (da Silva, Maxwell, & Lattal, 2008; Lieving & Lattal, 2003), because in these previous experiments baseline responding was reestablished before the second examination of extinction and resurgence. In these cases, Eq. 3 requires that the value of t be reset to zero following each baseline, and as such, the model would predict similar resurgence across repeated tests rather than reduced resurgence.

The results of Experiment 1 are also consistent with data from Leitenberg et al. (1975, Exp. 4) in which groups that experienced lengthier extinction plus alternative reinforcement showed less resurgence than did a group that experienced only three sessions of extinction plus alternative reinforcement. On the other hand, a recent failure to replicate the findings of Leitenberg et al. was reported by Winterbauer, Lucke, and Bouton (2013). This discrepancy will be addressed in the General Discussion.

Experiment 2

The data from Experiment 1 supported the model prediction that resurgence should decrease with repeated resurgence tests. Equation 3 also predicted that resurgence should be similar at time t in extinction in a condition with no previous lapses in alternative reinforcement and at t in a condition with previous removals of alternative reinforcement. Because no data exist that speak to this prediction, Experiment 2 was designed to assess it. As such, in Experiment 2 we compared target responding on the sixth session of extinction across two conditions. In one condition, alternative reinforcement was removed at Sessions 2, 4, and 6 of extinction. In the second condition, the alternative reinforcement was removed only during Session 6 of extinction. A model simulation of Experiment 2, using the values of c and k obtained in the fit to the data from Experiment 1, is displayed in Fig. 5.