Impulsive choice has been defined as choosing a smaller immediate reinforcer over a larger but delayed reinforcer (Ainslie, 1974; Rachlin & Green, 1972), and delay discounting is the process by which the efficacy (value) of the larger reinforcer decreases with increasing delay to its delivery (Myerson & Green, 1995). In choice situations in which the smaller immediate reinforcer (the smaller-sooner reinforcer, or SSR) and the larger delayed reinforcer (the larger-later reinforcer, or LLR) are concurrently available, choosing the LLR is an instance of self-controlled choice (Logue, 1988), and choosing the SSR is a case of impulsive choice (Mazur, 2000). The hyperbolic-decay model describes the degree to which the value of the LLR decays with increasing delay (Mazur, 1987):

$$ V=\frac{A}{1+kD}, $$
(1)

where V is the reinforcer value, A is reinforcer amount, D is the reinforcer delay, and k is a free parameter estimating how fast the value of the LLR decays with increasing D.

The accuracy of the hyperbolic-decay model in describing the data from numerous studies with human (e.g., Myerson & Green, 1995; Rachlin, Raineri, & Cross, 1991) and nonhuman (e.g., Aparicio, Hughes, & Pitts, 2013; Farrar, Kieres, Hausknecht, de Wit, & Richards, 2003; Green, Myerson, Shah, Estle, & Holt, 2007; Mazur, 2012; Stein, Pinkston, Brewer, Francisco, & Madden, 2012) animals is remarkably general, and this model does so with a single free parameter (k). Studies of impulsivity using changes in k to assess the effects of drugs or other neurobiological variables on impulsive choice have often used the adjusting-delay (i.e., Mazur 1987) and the adjusting-amount titration procedures (i.e., Richards, Mitchell, de Wit, & Seiden, 1997; Green et al., 2007), or the method developed by Evenden and Ryan (1996). One important distinction between these procedures is that the former generate graded discounting functions between sessions (e.g., Stein et al., 2012), and the latter generates an entire delay-of-reinforcement function within each session (e.g., Evenden & Ryan, 1999); these methods serve different purposes when examining variables affecting impulsive choice. All of the procedures, however, share the following characteristics: (1) Discrete trials offer a choice between the SSR and the LLR; (2) forced-choice trials precede free-choice trials, exposing subjects to the contingencies associated with the SSR and the LLR; (3) forced- and free-choice trials require a single response to produce either the SSR or the LLR; and (4) an intertrial interval follows each SSR and LLR, keeping constant the time between choices. Some advantages and disadvantages of each of these procedures in generating graded discount functions have been reviewed (Madden & Johnson, 2010), warranting other procedures to examine impulsive choice.

In the present study, we used a concurrent-chains procedure, recently developed to study impulsive choice in rats (Aparicio et al., 2013). In this technique, the delay to deliver the LLR is manipulated dynamically within sessions, capitalizing on Evenden and Ryan’s (1996) procedure to obtain an entire delay-of-reinforcement function within each session. Choice is estimated by analyzing the distribution of responses on two levers concurrently available in the initial link (e.g., Grace, 1994), preventing the development of an exclusive preference for one or the other alternative, which is observed when only one response is required to choose and produce the SSR and the LLR (Mazur, 1987, 2010). Another characteristic of the present procedure is that the presentation order of delays to deliver the LLR was manipulated across conditions (i.e., ascending, descending, or random presentation orders), avoiding the possibility of creating a single history of increasing delay to deliver the LLR; this characteristic is important because, in studies in which discrete trials have been used to create a history of increasing delay to LLR delivery, choice for the LLR decreased across blocks of trials during probe sessions in which the delay to LLR delivery remained at 0 s throughout the session (e.g., Evenden & Ryan, 1996; Pitts & McKinney, 2005; Slezak & Anderson, 2009). In addition, the present procedure required locomotion; rats traveled from the front to the back wall of the chamber to press a lever to restart each cycle of the concurrent-chains procedure, which implied effort, representing some cost to reach the choice point. This is important because effort plays a crucial role in preference (Salamone & Correa, 2009), determining sensitivity to relative changes in the frequency, amount, and immediacy of reinforcement (e.g., Aparicio, 2001; Aparicio & Cabrera, 2001).

The primary goal of the present study was to examine the impulsive choices of Lewis (LEW) and Fischer 344 (F344) rats, analyzing and representing the entire data set instead of using the data from the final sessions of each condition. The adjustments in the concurrent-chains procedure developed to study impulsive choice (i.e., Aparicio et al., 2013) and the use of contemporary measures to analyze impulsivity are the contributions of the present study. One objective was to investigate the acquisition and maintenance of impulsive choice, studying changes in discounting functions and sensitivity to the immediacy of reinforcement as a function of training (Aparicio et al., 2013). Another objective was to analyze the role of presentation orders of the delays to LLR in determining the slope of the discounting function; this objective is important to ratify the findings of studies with human (e.g., Robles & Vargas, 2007, 2008; Robles, Vargas, & Bejarano, 2009) and nonhuman (e.g., Fox, Hand, & Reilly, 2008; Slezak & Anderson, 2009) animals showing that the slope of the discounting function is affected by the order in which delays to LLR are presented in the choice situation. The last objective was to compare the impulsive choices of LEWs and F344s, given the growing interest of the scientific community in using these rat strains to assess the effects of drugs and neurobiological variables in determining the slope of the delay-discounting function; this objective is significant because researchers have suggested that genetic and neurochemical differences between LEWs and F344s account for the dissimilarities in impulsive choice between these inbred rats (e.g., Anderson & Diller, 2010; Garcia-Lecumberri et al., 2011).

Evidence suggesting genetic differences between LEWs and F344s has come from models of dependence on alcohol (Suzuki, George, & Meisch, 1988), nicotine (Brower, Fu, Matta, & Sharp, 2002), cocaine (Kosten et al., 1997), etonitazene (Suzuki, George, & Meisch, 1992), and morphine (Martin et al., 2003), showing that self-administration of these substances occurred more readily in the former than in the latter strain of rats. This cumulative body of data indicates that LEWs possess a phenotype highly susceptible to drug addictions (Garcia-Lecumberri et al., 2011), providing a genetic model of human drug abuse (Kosten & Ambrosio, 2002). Neurochemical variations between LEWs and F344s were documented in parametric analyses of neurotransmitter systems such as the serotonergic (Chaouloff, Kulikov, Sarrieau, Castanon, & Mormede, 1995), dopaminergic (Flores, Wood, Barbeau, Quiron, & Srivastava, 1998; Lindley, Bengoechea, Wong, & Schatzberg, 1999), and noradrenergic (Sziraki et al., 2001) systems. Studies comparing and contrasting the dopamine (DA) and serotonin (5-HT) systems of LEWs with those of F344s have reported that the former strain has lower levels of DA and 5-HT in many brain areas (Burnet, Mefford, Smith, Gold, & Sternberg, 1996), fewer D2 receptors in the striatum and nucleus accumbens core, fewer D3 receptors in the nucleus accumbens shell and olfactory tubercule (Flores et al., 1998), and fewer 5-HT binding sites in the hippocampus and frontal cortex (Selim & Bradberry, 1996) than F344s. The LEWs also display higher levels of basal N-methyl-D-aspartate receptors in many regions of the brain and reduced proenkephalin mRNA levels in the dorsal striatum and nucleus accumbens than do F344s (Martin et al., 1999, Martin et al., 2003).

Behavioral and physiological differences between LEWs and F344s have been reported in various laboratories (see Kosten & Ambrosio, 2002). For instance, studies using an autoshaping technique (Brown & Jenkins, 1968) to establish lever pressing for food found that the LEWs acquired that behavior more rapidly and performed it at higher rates than the F344s (Kearns, Gomez-Serrano, Weiss, & Riley, 2006); similar results were documented by Anderson and Elcoro (2007) using a tandem fixed-ratio one, fixed-time 20 s schedule of reinforcement to establish lever pressing for food. Moreover, studies of impulsive choice in LEWs and F344s have shown steeper discounting functions for the former than for the latter strain of rats (e.g., Anderson & Diller, 2010; Anderson & Woolverton, 2005; Huskinson, Krebs, & Anderson, 2012; Madden, Smith, Brewer, Pinkston, & Johnson, 2008). Nonetheless, the generality of this finding has been compromised by research that failed to show differences in discounting functions between LEWs and F344s (e.g., Stein et al., 2012; Wilhelm & Mitchell, 2009).

The discrepancies in the discounting functions of the LEWs and F344s are probably due to methodological differences in the ways to obtain graded discounting functions. Studies using steady-state procedures, arranging a delay/amount combination for at least ten sessions before replacing it for a different combination, found steeper discount functions in LEWs than in F344s (e.g., Anderson & Diller, 2010; Anderson & Woolverton, 2005; Huskinson et al., 2012; Madden et al., 2008). In contrast, studies that have used a rapid-determination adjusting-amount procedure to control each delay/amount combination in series of five sessions have yielded no differences in discounting functions between LEWs and F344s (Stein et al., 2012; Wilhelm & Mitchell, 2009), suggesting that more than five sessions are required for the rapid-determination adjusting-amount procedure to detect a strain difference that is mainly driven by the LEWs. Perhaps the F344s need several sessions (ten or more) to detect rapid changes in the contingencies of reinforcement and to show steady-state performance. If so, initial differences in discounting rates and higher values of k for the LEWs than for the F344s (estimated by Eq. 1) should appear after ten or more sessions, showing that measures of delay discounting can change systematically as a function of training. A recent study by Aparicio et al. (2013) provided a positive answer to this question by using a concurrent-chains procedure. In that study, the performance of LEWs and F344s was compared at different points of training (i.e., blocks of 15 days each) under conditions of extended exposure (i.e., 225 sessions) that varied the delay to the LLR in the terminal links of a concurrent-chains procedure. Early in training (Blocks 1–2), there were no differences between the LEWs and the F344s, with both strains showing flat discounting functions. In Blocks 4 and 5, the LEWs showed more sensitivity to the effects of delay to the LLR (i.e., steeper discounting functions) than did the F344s; this difference in discounting functions between strains remained for ten blocks of sessions. However, it faded by Blocks 14 and 15, with results showing no differences between the LEWs and F344s in proportions of LL choices, discounting rates (k in Eq. 1), and other dependent variables derived from the general form of the matching law (i.e., Baum, 1974; Baum & Rachlin, 1969). It was concluded that (1) measures of delay discounting change systematically as a function of training, (2) the F344s require more training than the LEWs to detect dynamic changes in the contingencies of reinforcement, and (3) with extended training, the F344s develop patterns of impulsive choice that are similar (i.e., comparable ks) to those observed in the LEWs (Aparicio et al., 2013).

Although in the Aparicio et al.’s (2013) study the proportion of LL choices was reasonably high (i.e., .8) when 5 s was the minimal delay to obtain either the SSR or the LLR, a 0-s delay to deliver these reinforcers was not included in their choice procedure. Another potential drawback of Aparicio et al.’s study is that fixed-interval (FI) schedules were used in the terminal links to delay the LLR, requiring one response at the end of the interval to obtain the reinforcer (i.e., compromising the effect of delay to LLR on choice); this might explain the low estimates of k found in Aparicio et al.’s study.

In the present study, we corrected the above drawbacks by (1) including a delay close to 0 s (0.1 s) to deliver both the SSR and the LLR, and (2) using fixed-time (FT) schedules in the terminal links to manipulate delays to the LLR, eliminating the response requirement at the end of the interval. The general goal was to use a range of measures (described in detail below) analyzing impulsive choice, and the specific objectives were to (1) examine the acquisition and maintenance of impulsive behavior, (2) analyze whether the presentation orders of the delays to the LLR is important in determining the slope of the discounting function, and (3) contribute to the study of the impulsive choices of LEWs and F344s.

Method

Subjects

Experimentally naïve inbred LEW (n = 8) and F344 (n = 8) male rats (Harlan Sprague-Dawley, Indianapolis, IN), between 108 and 122 days old at the start of training, served as the subjects. An ad-libitum feeding regimen of Purina Lab Chow was used at first to allow habituation to the laboratory. On the day before training, the feeders of all cages were emptied and the rats were placed on a regimen of food restriction; postsession feedings of approximately 10 g of Purina Lab Chow were provided, such that the weights of the LEWs and F344s at the beginning of the study ranged from 253 to 282 g and from 209 to 234 g, respectively, and at the end of it, their weights ranged from 409 to 459 g and from 324 to 380 g, respectively. Between sessions, the animals were individually housed in plastic cages with water permanently available in a temperature-controlled colony room providing a 12:12-h light/dark cycle (lights on at 0600).

Apparatus

Eight modular chambers (Coulbourn E10-11R TC) for rats measuring 295 mm long, 250 mm wide, and 285 mm high (inside) were enclosed in isolation cubicles (E10-23) that from the outside measured 794 mm long, 533 mm wide, and 514 mm high. The front and back walls of each chamber were made of stainless steel, and the sidewalls of Plexiglas. The floor of each chamber was a square metal grid (E10-18NS). A food cup (E14-01R), 30 mm long by 40 mm wide, was centered between the left-side and right-side walls 20 mm from the floor. Two retractable levers (E23-17RA), requiring a force of 0.2 N to operate, were mounted on the front wall of each chamber 70 mm above the floor; the levers were 30 mm wide, and the edge of each lever was 25 mm from its respective left and right sidewalls. Two 24-V DC stimulus lights (H11-03R) were installed 35 mm above the levers. A food dispenser (H14-23R) located behind the front wall delivered 45-mg grain pellets (BioServ) into the food cup. A third, nonretractable lever (H21-03R), requiring a force of 0.2 N to operate, was centered on the rear wall of each chamber, 60 mm above the floor. A 24-V DC houselight (H11-01R) centered on the rear wall 20 mm below the ceiling provided the illumination of the chamber. A 26 × 40 mm speaker (H12-01R) mounted on the rear wall, 10 mm from the left sidewall and 65 mm from the house light, was connected to a white noise generator (E12-08) providing a constant white noise at 20 kHz (±3 dB). All experimental events were programmed and data recorded in a separate room by two Windows-controlled computers using Coulbourn Instruments software (Graphic State Notation, version 3.03) and interfacing equipment operating at a 0.01-s resolution.

Procedure

Training

Four 45-mg pellets were placed into the food cup, and training began with the left lever being extended into the chamber. A fixed-ratio schedule of one response (FR 1) was associated with the extended lever, but there were no attempts to shape manually the response of pressing on it. The FR 1 was operative until the rats had pressed on the extended lever to produce 60 food pellets, or until 30 min had elapsed, whichever happened first. When the rats pressed consistently on that lever to produce 60 food pellets in two consecutive sessions, it was retracted from the chamber and the right lever was extended into the chamber; responding on the right lever was trained in a similar fashion over the following days. Finally, the rear lever was reinstalled in the chamber with an FR 1 schedule associated with it, and the other two levers were retracted from the chamber. Once the rats had pressed on the rear lever to produce 60 food pellets in two consecutive sessions, the front levers were extended into the chamber, and training ended in a 30-min session with the rear and the two front levers all providing food pellets according to a concurrent FR 1 FR 1 FR 1 schedule of reinforcement.

General procedure

A concurrent-chains procedure similar to that described in detail elsewhere (Aparicio et al., 2013) was used, the only difference being that the choice link used two random-interval (RI) schedules instead of one RI schedule, and the terminal links used fixed-time (FT) response-independent schedules to deliver the SSR and the LLR. Each session consisted of a series of 60 choice cycles, organized in six delay components of ten cycles each that initiated with the houselight turning on. A single response on the rear lever (a) turned off the houselight, (b) extended the front levers into the chamber, and (c) turned on the stimulus lights above them, advancing the procedure to the choice link; two nonindependent RI 12-s schedules, one associated with the left lever and the other with the right lever, arranged equal numbers of left and right terminal-link entries (Alsop & Davison, 1986; Stubbs & Pliskoff, 1969). Once an entry was set up for the left or the right lever, the first response on that lever advanced the procedure to its terminal link and retracted the other, inoperative lever from the chamber, turning off the light above it. The second press on the available lever started the FT; that lever was not retracted from the chamber, in order to avoid signaling the delay to the LLR (i.e., its function as a conditioned reinforcement). Further presses on the same lever during the delay or at the end of it had no programmed consequences. For half of the rats within each strain, the terminal link delivering one pellet (the SSR) was associated with the left lever, and the terminal link delivering four pellets (the LLR) with the right lever; these relations were reversed for the other half of the rats within each strain (i.e., the LLR was associated with the left lever and the SSR with the right lever). The terminal link delivering the SSR remained constant at a FT 0.1 s (i.e., the minimal unit of time for the program to move from one state to another), and the terminal link delivering the LLR was a FT that every ten cycles took on a different value (0.1, 5, 10, 20, 40, or 80 s). Upon delivery of either the SSR or the LLR, each cycle ended by (a) retracting the operative lever, (b) turning off the stimulus light above it, and (c) turning on the houselight to start a new cycle. Each delay component of ten cycles was followed by a 60-s blackout during which responses on the rear lever were not effective, all lights were extinguished, and the front levers were retracted from the chamber. Sessions were programmed to end after 60 cycles (i.e., six delay components) or 60 min, whichever occurred first. Most sessions, however, ended after 60 cycles, not requiring the 60-min maximum time to end the session. Delay components were presented within sessions in ascending, descending, or random order, with each presentation order of delay conditions lasting for 105 sessions. All rats were exposed to the same sequence of ascending, descending, and random conditions. Then, a replication of the ascending presentation order of delay conditions (reascending) was scheduled for another 105 sessions. Sessions were conducted seven days per week at about the same time each day.

Data analysis

For each data set of 105 sessions, corresponding to the ascending, descending, random, and redetermination-of-ascending-order conditions, data were analyzed across blocks of sessions and also across reinforcers within a component. All computations used choice-link responses emitted on the LL lever and the SS lever. The analysis across blocks of sessions organized each data set into seven blocks of 15 days each, which in our experience is a large enough number of days to observe consistent changes in choice. Responses on the LL and SS levers were counted separately for each delay and aggregated across sessions of the same block. Then, the corresponding proportions of LL choice [LL responses / (LL responses + SS responses)] and ratios of responses (LL/SS) were calculated using computations for individuals within each strain and the group’s mean. For the analysis across reinforcers within a component, we pooled the data across all 105 sessions of the same condition, counting responses on the LL and SS levers separately for each reinforcer within the same delay component, regardless of whether it was a SSR or LLR; accordingly, for each delay component ten proportions of LL choices and ten ratios of responses could be computed, or 60 proportions of LL choices and 60 response ratios for the six delay components. These computations were obtained for each rat within a strain and used to calculate the means and standard deviations of the groups.

The distribution of responses across the two levers in the initial or choice link was analyzed with the general form of the matching law (Baum, 1974; Baum & Rachlin, 1969):

$$ \log \left(\frac{B_1}{B_2}\right)=s* \log \left(\frac{r_1}{r_2}\right)+ \log b, $$
(2)

where B 1 and B 2 are the behavior allocations, measured in time or responses (i.e., LL responses and SS responses, respectively), to Alternatives 1 and 2; r 1 and r 2 are the rates of reinforcement, amounts of reinforcement, or immediacies of reinforcement (reciprocal of delay of reinforcement) produced by responding at Alternatives 1 and 2; b is a measure of bias toward one alternative or the other arising from factors other than r 1 and r 2; and s is the sensitivity of the behavior ratio to the ratio of the values (i.e., rates, amounts, or immediacies of reinforcement) of the activities (Baum & Rachlin, 1969). The ratios of responses (LL/SS) and delays to food delivery (SS/LL) were transformed into base-2 logarithms. Linear regression analysis (the least squares method) was conducted with the log2 of the response ratio plotted against the log2 of the delay ratio (SS/LL). The resulting slope was used to estimate the sensitivity of choices to changes in the immediacy of the reinforcement (i.e., the reciprocal of the delay to reinforcement), and the y-intercept to estimate bias (Baum, 1974; Baum & Rachlin, 1969).

Equation 1 was entered into the Origin software (version 8.5) as a user-defined equation, providing nonlinear curve fitting to the proportions of LL choices; A was free to vary (i.e., it was not assumed to be 1.0 LL choice at the y-intercept), and the value of the parameter k was used to estimate the rate of discounting.

The area under the empirical discounting curve (AUC) was computed using Myerson, Green, and Warusawitharana’s (2001) method, providing a theory-free estimate of delay discounting; it was expressed as a proportion of 1.0, with values close to 1.0 indicating minimal or no discounting, and values close to zero maximal discounting. Nonparametric statistical Mann–Whitney U tests and Wilcoxon signed ranks tests were used to examine the differences between strains in estimates of k (Eq. 1), bias, slopes from regression lines (Eq. 2), and AUC computed for the data of the individual LEWs and F344s. Linear curve fitting and nonparametric statistical tests, at the alpha level of .05, were implemented using Origin.

Results

Hyperbolic-decay functions

The group means of proportions of LL choices are plotted in Figs. 1 and 2 as a function of delay to the LLR. From top to bottom, the graphs show the data for each block of sessions (Fig. 1) and each food delivery (Fig. 2); from left to right, the figures show the data obtained in the ascending, descending, random, and reascending conditions. Continuous and dashed lines are the best fits using Eq. 1 to the data points of the LEWs (circles) and F344s (squares), respectively. Generally, the groups’ means of the proportions of LL choices decreased with increasing delays to LLR. All delay conditions show that delay discounting increased across blocks of sessions and food deliveries, with a tendency of the LEWs to choose the SS lever more that the F344s, but this difference was unreliable.

Fig. 1
figure 1

Proportions of LL choices [LL / (LL + SS)] as a function of delay (in seconds) to the LLR. Panels show the group means among the Lewis (LEW, circles) and Fischer 344 (F344, squares) rats for the ascending, descending, random, and reascending delay-order conditions. Lines that extend from the squares and circles correspond to standard deviations. Bk stands for blocks of sessions

Fig. 2
figure 2

Proportions of LL choices [LL / (LL + SS)] as a function of delay (in seconds) to the LLR. F stands for food delivery. Other details are as in Fig. 1

The resulting parameters for the y-intercept (A), k, and R 2, computed for blocks of sessions, are presented in Table 1, and those same parameters computed for food deliveries are in Table 2. The delay-discounting functions of the ascending condition show that Eq. 1 provided excellent fits to the data of the LEWs and F344s across blocks of sessions, means R 2s = .978 and .950, respectively, and food deliveries, mean R 2s = .985 and .912, respectively; y-intercepts (A) for the data of the LEWs across blocks and food deliveries (means of .703 and .711, respectively) were higher than those for the data of the F344s (means of .587 and .657, respectively).

Table 1 Hyperbolic discounting, parameters (Eq. 1) computed for blocks of sessions
Table 2 Hyperbolic discounting, parameters (Eq. 1) computed for food deliveries

As is shown in Table 1, Blocks 1–2 of the ascending condition show steeper discounting functions for the data of the LEWs (k = .020 and .049) than for the F344s (k = .013 and .021), indicating that the LEWs chose more impulsively than the F344s. Both strains, however, produced similar discounting functions in Blocks 3–7. Estimates of k, ranging from .020 to .065 (M = .048) for the data of the LEWs, were not significantly different (W = 22, p > .05) from those for the data of the F344s, ranging from .013 to .082 (M = .041). Figure 2 shows that the discounting functions generated by the LEWs for each food delivery of the ascending condition were similar to those of the F344s (i.e., the continuous and dotted lines are close to one another). Yet, the proportions of LL choices generated by the F344s with delays to LLR higher than 20 s were slightly higher than those produced by the LEWs with the same delays, indicating that food-by-food, the F344s chose less impulsively across delays to LLR. Estimates of k for the group data of the LEWs, ranging from .038 to .062 (M = .048) were significantly higher (W = 55, p < .05) than the estimates for the F344s, ranging from .021 to .032 (M = .028).

As is shown in the top graph of Fig. 1 for the descending condition, the pattern of choices that both strains established in ascending order was disrupted by delays occurring within sessions in descending order. For both strains, Block 1 of the descending condition shows flat slopes (k = .0) and aberrant fits (R 2 = –.250). Block 2 shows a steeper discounting function for the LEWs (k = .017) than for the F344s (.008); the former strain recovered the pattern of choice shown in the ascending condition faster (R 2 = .853) than the latter (R 2 = .373). In Blocks 3–5, the discounting functions of the F344s are steeper (k = .062, .066, and .098, respectively) than those of the LEWs (k = .048, .058, and .071, respectively), indicating that the former chose more impulsively than the latter strain. Differences in the discounting functions between both strains vanished in Block 6 (k = .080 vs. .083, respectively). Still, in Block 7 the LEWs generated the steepest discounting function (k = .121). Estimates of k ranging from .000 to .121 (M = .057) for the data of the LEWs were not significantly different (W = 13, p > .05) from those for the F344s, ranging from .000 to .098 (M = .052), as is shown in Table 1. Even though Fig. 2 shows similar discounting functions for each food delivery of the descending condition (i.e., the continuous and dotted lines are close to one another), the discounting functions of the LEWs were steeper than those of the F344s. Following Table 2, estimates of k for the data of the LEWs, ranging from .044 to .064 (M = .057) were significantly higher (W = 50, p < .05) than those for the F344s, ranging from .016 to .060 (M = .044), showing that with delays being presented within sessions in descending order, food by food, the LEWs chose more impulsively than the F344s.

As is shown in Tables 1 and 2, with delay components presented within sessions in random order, Eq. 1 also provided good fits to the group data of both strains across blocks of sessions (mean R 2s = .956 and .772, respectively) and food deliveries (mean R 2s = .851 and .679, respectively). The y-intercept (A) was higher for the data of the F344s than for the LEWs across blocks of sessions (mean As = .610 and .567, respectively) and food deliveries (mean As = .612 and .575, respectively).

Block 1 shows that both strains generated similar discounting functions (k = .008 and .005, respectively) responding to delays to the LLR presented in a random order (see Table 1). Blocks 2–7, however, show steeper discounting functions for the group data of the LEWs; estimates of k for the LEWs, ranging from .008 to .025 (M = .018) were significantly higher (W = 28, p < .05) than those for the F344s, ranging from .005 to .009 (M = .008). For the first food delivery of the random condition, Fig. 2 shows flat discounting functions (k = 0), resulting in aberrant fits (R 2 = .251) to data points of the group data of both strains. Consistent with the results for blocks of sessions, Food Deliveries 2–10 show steeper discounting functions for the group data of the LEWs (see Table 2); estimates of k for the data of the LEWs, ranging from .000 to .041 (M = .022), were significantly higher (W = 45, p < .05) than those corresponding to the F344s, ranging from .000 to .017 (M = .009).

The results of the reascending condition were close to those obtained during the first ascending determination. As is shown in Tables 1 and 2, Eq. 1 provided excellent fits to the group data of both strains across blocks of sessions (mean R 2s = .955 and .957, respectively) and food deliveries (mean R 2s = .946 and .968, respectively); the y-intercepts for the data of the LEWs across blocks of sessions (mean A = .771) and food deliveries (mean A = .776) were similar to those corresponding to the data of the F344s (mean As = .768 and .773, respectively). The reascending condition shows steeper discounting functions for both strains, but the LEWs made more impulsive choices than the F344s. Estimates of k for the data of the LEWs, ranging from .036 to .093 (M = .071), were significantly higher (W = 28, p > .05) than those for the F344s, ranging from .017 to .049 (M = .033). The food-by-food analysis revealed similar results, with Food Deliveries 1–9 showing steeper discounting functions for the LEWs than for the F344s (see Fig. 2). Nevertheless, the discounting function (see Table 2) that the F344s showed in Food Delivery 10 was steeper (k = .106) than that of the LEWs (k = .096); estimates of k for the LEWs, ranging from .050 to .096 (M = .072), were significantly higher (W = 54, p < .05) than those for the F344s, ranging from .020 to .106 (M = .043).

In summary, the LEWs in the ascending condition responded more impulsively than did F344s for the first two blocks, and with further training this difference in impulsive choices disappeared. From all of the orders of delay to LLR presentation, the descending order generated the most disruption of response patterns, as evidenced by the lowest sensitivity and goodness of fit across delay presentation orders; the LEWs adapted faster to the descending condition. In the random condition, strain differences were not as evident in the first two blocks, but with training the LEWs made more impulsive choices than did the F344s. The same trend was observed during the reascending condition, replicating what had been found in the ascending condition; such differences between strains were maintained across blocks and food deliveries.

General form of the matching law

The log2 of response ratios (LL/SS) is plotted in Figs. 3 and 4 against the log2 of delay ratios (SS/LL). From top to bottom, the graphs show the data for each block of sessions (Fig. 3) and each food delivery (Fig. 4); from left to right, the data obtained in the ascending, descending, random, and reascending presentation-order-of-delays conditions. The dotted lines intercepting the y-axis at zero are the indifference lines. The continuous and dashed lines are the best fits using Eq. 2 to the group data points of the LEW (circles) and F344 (squares) rats, respectively. The resulting parameters for bias (b), slope (s), and R 2 computed for each block of sessions are presented in Table 3, and the same parameters calculated for each food delivery are in Table 4.

Fig. 3
figure 3

Log2 of response ratios (LL/SS) as a function of the log2 of delay ratios (SS/LL), presented for each block of sessions. Other details are as in Fig. 1

Fig. 4
figure 4

Log2 of response ratios (LL/SS) as a function of the log2 of delay ratios (SS/LL). F stands for food delivery. Other details are as in Fig. 1

Table 3 General form of the matching law (Eq. 2), parameters for each block of sessions
Table 4 General form of the matching law (Eq. 2), parameters for each food delivery

Figure 3 shows a positive relation between the log of response ratios and the log of delay ratios, indicating sensitivity of preference to within-session changes in the ratio of immediacy to reinforcement (i.e., the reciprocal of the delay to reinforcement). Equation 2 provided good fits to the response ratios that both strains of rats produced with delays to LLR being presented in ascending order, accounting for changes in preference that occurred as a function of rapid changes in the delay ratio (mean R 2s = .679 and .625, respectively). Estimates of bias for the data of the LEWs, ranging from 0.873 to 2.059 (M = 1.480), were not significantly different (W = 22, p > .05) from those for the F344s, ranging from 0.808 to 2.033 (M = 1.400). In Blocks 1 and 2, the data points of the F344s are close to or above the indifference line, indicating indifference or a slight preference for the LL; in contrast, most data points of the LEWs fall below the indifference line, showing preference for the SS lever. Blocks 3–7 show that both strains established similar patterns of choice responding to delays to LLR presented in ascending order. Indifference occurred when the LLR was delayed 10 s (delay ratio –6.64), preference for the SS lever when it was delayed 20 s or longer (delay ratios –7.64, –8.64, and –9.64), and preference for the LL lever when the LLR was delayed 5 or 0.1 s (i.e., delay ratios –5.64 and 0). Estimates of sensitivity to the immediacy of reinforcement (s) for the data of the LEWs, ranging from .225 to .415 (M = .306), were not significantly different (W = 22, p > .05) from those for the F344s, ranging from .131 to .406 (M = .268).

With delays to LLR presented in descending order (see Table 3), estimates of bias ranging from –1.650 to 1.732 (M = 0.746) for the data of the LEWs were not significantly different (W = 25, p > .05) from those for the F344s, ranging from –2.097 to 1.365 (M = 0.041). Equation 2 provided good fits to group data points of both strains, accounting for changes in preference as a function of changes in the delay ratio (mean R 2 = .588 and .606, respectively). For both rat strains, Block 1 of the descending condition shows preference for the SS lever across delay ratios, resulting in aberrant fits (R 2 = –.129 and –.221) and poor estimates of sensitivity (s = –.069 and .027, respectively). Block 2 shows a positive relation between response ratios and delay ratios, with the LEWs showing higher sensitivity to immediacy to reinforcement (s = .206) than the F344s (s = .055). The goodness of fit for the group data of the LEWs in Block 2 (R 2 = .943) suggested that this strain recovered the pattern of choice shown in the ascending condition faster than the F344s (R 2 = –.042). For the LEWs, Blocks 3–7 show results consistent with those of the ascending condition; indifference occurred when the LLR was delayed 10 s, preference for the SS lever when it was delayed 20 s or longer, and preference for the LL lever when the LLR was delayed either 0.1 or 5 s. In contrast, the F344s show a strong preference for the SS lever across delay ratios. The F344s showed a preference for the LL lever when it delivered the LLR with a 0.1-s delay (delay ratio = 0). Note that, as can be seen in Fig. 3, most data points of the LEWs are above those of the F344s, indicating than the LEWs responded more to the LL lever than the F344s did across delay ratios. Yet, estimates of sensitivity to immediacy of reinforcement, ranging from –.069 to .500 for the data of the LEWs (M = .307), were not significantly different (W = 14, p > .05) from those for the data of the F344s, ranging from .027 to .513 (M = .315).

Following Fig. 3, Blocks 1 and 2 of the random condition show similar choices in both strains; preference for responding on the SS lever is evident across delay ratios. In Blocks 3–7, the F344s emitted more responses on the LL lever than did the LEWs with delays to LLR occurring in random order; note that across delays, most of the data points of the F344s are close to the indifference line. The LEWs show a slight preference for responding on the SS lever across delay ratios, with indifference occurring with delays to LLR of 5 and 10 s (delay ratios of –5.64 and –6.64, respectively). For both rat strains, a clear preference for the LL lever occurred when the LLR was delayed 0.1 s (delay ratio = 0). With delays to LLR of 5 and 10 s, the F344s showed preference for the LL lever and the LEWs showed indifference. Again, estimates of sensitivity to the immediacy of reinforcement (see Table 3), ranging from .114 to .222 (M = .171) for the data of LEWs, were not significantly different (W = 25, p > .05) from those for the F344s, ranging from .105 to .182 (M = .156).

In the reascending condition, following the corresponding parameters presented in Table 3, the choices of the LEWs and F344s generated the steepest slopes (mean ss = .437 and .344, respectively), suggesting that sensitivity to the immediacy of reinforcement increased with extensive training in the choice situation. Equation 2 accounted for a good proportion of the variability in the choices of the LEWs and F344s (mean R 2s = .622 and .786, respectively, as is shown in Table 3) that occurred with within-session changes in the delay ratio. Both rat strains showed a bias for responding on the LL lever, with estimates of bias for the data of the LEWs, ranging from 1.428 to 2.563 (M = 2.126), not being significantly different (W = 15, p > .05) from those of the F344s, ranging from 1.891 to 2.341 (M = 2.144).

In Blocks 1 and 2, the F344s emitted more responses on the LL lever that did the LEWs across delay ratios, showing less sensitivity to within-session changes in the immediacy of reinforcement. All blocks show that both strains chose indifferently between LL and SS levers with a 10-s delay to LLR (i.e., –6.64 delay ratio); again, preference for the SS lever occurred when the LLR was delayed 20 s or longer (delay ratios –7.64, –8.64, and –9.64), and preference for the LL lever when it delivered the LLR with delays of 5 and 0.1 s (delay ratios –5.64 and 0, respectively); note that in Blocks 3–7, most data points of the LEWs overlap with those of the F344s, suggesting similar choices across delay ratios. Estimates of sensitivity for the data of the LEWs, however, ranging from .261 to .574 (M = .437), were significantly higher (W = 27, p < .05) than those for the F344s, ranging from .274 to .447 (M = .344), showing in the former strain more sensitivity to changes in the immediacy of reinforcement.

Figure 4 shows food-by-food sensitivity of choices to within-session changes in the immediacy of reinforcement. Except for the first food delivery of the random condition, showing for the data of both strains poor estimates of sensitivity (s = –.026 and –.018), all food deliveries across conditions show a positive relation between the log of the response ratio and the log of the delay ratio. Food by food, Eq. 2 provided better fits to the data points of the F344s than to those of the LEWs in the ascending (mean R 2 = .708 vs. .678), descending (mean R 2 = .951 vs. .772), random (mean R 2 = .882 vs. .654), and reascending (mean R 2 = .791 vs. .623) conditions. Following the parameters presented in Table 4, both rat strains developed a bias for responding on the LL lever; in the ascending condition and its replication, estimates of bias for the group data of the LEWs, ranging from 1.321 to 1.532 (M = 1.415) and from 1.635 to 2.417 (M = 2.221), respectively, were not significantly different (W = 14, p > .05, and W = 35, p > .05, respectively) from those for the group data of the F344s, ranging from 1.166 to 1.742 (M = 1.479) and from 1.254 to 2.754 (M = 2.182), respectively. However, the descending and random conditions generated estimates of bias for the data of the LEWs, ranging from 0.424 to 1.297 (M = 1.043) and from –0.973 to 1.304 (M = 0.798), respectively, that were significantly different (W = 53, p < .05, and W = 10, p < .05) from those corresponding to the F344s, ranging from –0.688 to 1.365 (M = 0.535) and from 0.001 to 1.454 (M = 1.100), respectively.

Sensitivity of preference to within-session changes in the delay ratio increased with each consecutive food delivery, with both strains showing similar patterns of choice. Food deliveries of the ascending condition show that the LEWs and F344s preferred the LL lever when the LLR was delayed 0.1 or 5 s in that lever, indifference when the LLR was delayed 10 s, and preference for the SS lever when the LLR was delayed 20 or more seconds (delay ratios of –7.64, –8.64, and –9.64). Estimates of sensitivity for the data of the LEWs, ranging from .249 to .328 (M = .286) were not significantly different (W = 42, p > .05) from those corresponding to the F344s, ranging from .212 to .307 (M = .273).

The descending condition shows, food by food, that the F344s preferred the SS lever when the log delay ratios were different from zero. When the SS and LL levers delivered the LLR with the same 0.1-s delay (i.e., delay ratio = 0), the F344s preferred the LL lever. Food by food, the LEWs also preferred the LL lever when both levers delivered the LLR with the same 0.1-s delay (i.e., delay ratio = 0), but with delay ratios of –5.64 and –6.64, the LEWs showed indifference, and a preference for the SS lever with delay ratios of –7.64, –8.64, and –9.64. Estimates of sensitivity for the data of the LEWs, ranging from .285 to .381 (M = .350), were not significantly different (W = 18, p > .05) from those for the data of the F344s, ranging from .214 to .480 (M = .375).

Food Deliveries 1–2 of the random condition show data points of the LEWs and F344s that fall on the indifference line across delay ratios, suggesting a lack of sensitivity of the behavior ratio to within-session changes in delay ratio (s = –.026 and .063 vs. –.018 and .059, respectively). Sensitivity to the immediacy of reinforcement gradually emerged with Food Deliveries 3–10 of the random condition, with the F344s emitting more responses on the LL lever than did the LEWs across delay ratios. Estimates of sensitivity for the data of the LEWs, ranging from –.026 to .267 (M = .174), were significantly higher (W = 51, p < .05) than those of the F344 rats, ranging from –.018 to .242 (M = .159). Both strains preferred the SS lever when the LLR was delayed 40 or 80 s; indifference occurred when the LLR was delayed 20, 10, or 5 s; and preference for the LL lever when the LLR was delivered in both levers within a 0.1-s delay.

Again, the reascending condition shows the steepest slopes, with estimates of sensitivity for the data of the LEWs ranging from .376 to .515 (M = .471) that are significantly higher (W = 52, p < .05) than those for the F344s, ranging from .205 to .517 (M = .368). The pattern of choices that both strains established in the ascending condition was recovered in the replication of this condition. All food deliveries show preference for the LL lever when it delivered the LLR with delays of 0.1 and 5 s, indifference with a 10-s delay, and preference for the SS lever when the LL lever delayed the LLR by 20 or more seconds (i.e., for delay ratios of –7.64, –8.64, and –9.64).

To summarize, the analysis using the generalized matching law confirmed what had previously been found using the hyperbolic-decay function; there were no consistent differences in impulsive choices between strains during the ascending presentation-order-of-delays condition. During the descending condition, the LEWs recovered the pattern of choice established in the ascending condition faster than the F344s, and showed higher sensitivity to the immediacy of reinforcement. In the random condition, both strains preferred the LL lever when the delay to LLR was 0.1 s; with longer delays, the F344s maintained this preference for the LL lever, whereas LEWs shifted to indifference. The reascending condition showed that sensitivity to the immediacy of reinforcement increased as a function of training, with the LEWs showing higher sensitivity to immediacy than did the F344s.

Hyperbolic-decay function and the general form of the matching law

The results in Figs. 1, 2, 3 and 4 suggest some regularity between Eqs. 1 and 2: Both models of choice provided good fits to the data of both strains, with the former generating estimates of discounting rate (k) and the latter generating estimates of sensitivity to the immediacy of reinforcement (s) that might be positively correlated to one another (e.g., Aparicio et al., 2013). To explore this possibility, estimates of k (Eq. 1) are plotted in Fig. 5 against estimates of s (Eq. 2), computed for blocks of sessions (upper graphs) and food deliveries (lower graphs) of the ascending, descending, random, and replication to ascending presentation-order-of-delays conditions. The continuous and dashed lines are the best fits using linear regression; resulting values of Pearson’s r appear in the left corner of each graph for the data corresponding to the LEWs (L r) and F344s (F r).

Fig. 5
figure 5

Values of k (Eq. 1) against values of slope (s in Eq. 2), obtained across blocks of sessions (top panels) and food deliveries (bottom panels). The panels show values obtained for the ascending, descending, random, and redetermination of the ascending delay-order condition. Pearson’s r values, computed using the means of the groups (LEW in circles and F344 in squares), appear near the data points

All graphs in Fig. 5 show a positive correlation between estimates of k and estimates of s. For blocks of sessions of the ascending condition, the regression lines show a higher correlation for the group data of the LEWs (r = .912) than for the group data of the F344s (r = .194), resulting in a better fit to estimates of k and s (R 2s = .799 and –.155, respectively). Regression lines were fitted to the data that individual LEWs and F344s generated in blocks of sessions (graphs not shown). The resulting Pearson’s rs, ranging from .539 to .988 (M = .989) for the LEWs, were not significantly different (U = 43, p > .05) from the corresponding Pearson’s rs for the F344s, ranging from .747 to .926 (M = .860). For food deliveries, the regression lines show a higher correlation (r = .960) and better fit (R 2 = .913) for the group data of the LEWs than for the group data of the F344s (r = .453, R 2 = .106). Regression lines were also fitted to the data that individual LEW and F344 rats generated for each food delivery (graphs not shown). The resulting Pearson’s rs, ranging from .106 to .798 (M = .295) for the data of the individual LEWs, were not significantly different (U = 29, p > .05) from those of the F344s, ranging from –.208 to .981 (M = .620).

For both rat strains, blocks of sessions of the descending condition show parallel correlations (rs = .926 and .964, respectively), but the fit to the data of the F344s was better (R 2 = .916) than that to the data of the LEWs (R 2 = .830). Pearson’s rs, ranging from –.101 to .562 (M = .215) for correlations with the data of the individual LEWs, were not significantly different (U = 23, p > .05) from those correlations with the data of the individual F344s, ranging from –.262 to .781 (M = .340). Consistent with the results of blocks of sessions, food deliveries showed a higher correlation (r = .925) and better fit (R 2 = .840) for the data of the F344s than for the data of the LEWs (r = .787, R 2 = .572). Pearson’s rs, ranging from –.223 to .959 (M = .340) for the data of the individual LEWs, were not significantly different (U = 23, p > .05) from those for the data of individual F344s, ranging from –.754 to .944 (M = .558).

The random condition shows data points that are clustered near the origin, due to low estimates of k and s, with similar correlations (r = .926 and .918) and goodnesses of fit (R 2 = .818 and .754) for the data of the LEWs and F344s, respectively. Pearson’s rs, ranging from .890 to .993 (M = .947) for the data of the individual LEWs, were significantly higher (U = 64, p < .05) from those corresponding to the data of the individual F344s, ranging from .065 to .825 (M = .468). Food deliveries show a remarkable result: The correlation and goodness of fit for the data of the LEWs (r = .979 and R 2 = .954) are identical to those computed for the F344s. Pearson’s rs, ranging from .928 to .976 (M = .949) for the data of the individual LEWs, were not significantly different (U = 43, p > .05) from those for the data of the individual F344s, ranging from .863 to .973 (M = .931).

For the data of the LEWs and F344s, blocks of sessions of the replication of the ascending condition show positive correlations (r = .937 and .891) and good fits (R 2 = .854 and .754, respectively). Pearson’s rs, ranging from .320 to .958 (M = .708), for the data of the individual LEWs were significantly different (U = 12, p < .05) from those for the data of the individual F344s, ranging from .855 to .957 (M = .922). Food deliveries show that the correlation for the data of the LEWs was higher (r = .846) than that for the data of the F344s (r = .364), resulting in a better fit to estimates of k and s (R 2s = .681 and .024, respectively). Pearson’s rs, ranging from –.293 to .880 (M = .653), for the data of the individual LEWs were not significantly different (U = 13, p > .05) from those for the data of the individual F344s, ranging from –.493 to .959 (M = .686).

Area under the curve

Computations of the AUC (Myerson et al., 2001) for the group data of the LEWs (circles) and F344s (squares) are plotted in Fig. 6 as a function of blocks of sessions (upper graphs) and food deliveries (lower graphs).

Fig. 6
figure 6

Values of area under the curve (AUC) as a function of block of sessions (top panels) and food deliveries (bottom panels). Other details are as in Fig. 1

Blocks 1 and 2 of the ascending condition show that at the beginning of the study, the LEWs discounted the LLR more than the F344s, but that between-strain difference in discounting the LLR decreased in Blocks 3–4 and disappeared in Blocks 5–7, showing similar discountings of the LLR for both rat strains. Estimates of the AUC, ranging from .379 to .577 (M = .423), for the data of the LEWs were not significantly different (W = 5, p > .05) from those for the data of the F344s, ranging from .381 to .734 (M = .485). Food by food, both rat strains show moderate discounting of the LLR, with the F344s discounting the LLR less than the LEWs across food deliveries; estimates of the AUC for the LEWs, ranging from .364 to .466 (M = .411), were significantly lower (W = 0, p < .05) than those for the F344s, ranging .464 to .568 (M = .520).

For both rat strains, the descending condition shows that proportions decreased with increasing blocks of sessions, with Block 1 showing no discounting, and Block 7 showing the maximum discounting reached (.300) in that condition (but see the data for the F344s). Estimates of the AUC for the data of the LEWs, ranging from .315 to 1.00 (M = .497), were not significantly different (W = 12, p > .05) from those for the data of the F344s, ranging from .281 to 1.00 (M = .522). Food deliveries of the descending condition show that, food by food, the LEWs and F344s discounted the LLR with similar proportions (about .350). Estimates of the AUC for the data of the LEWs, ranging from .365 to .421 (M = .387), were not significantly different (W = 29, p < .05) from those for the data of the F344 rats, ranging from .270 to .530 (M = .383).

In the random condition, the LEWs and F344s show the highest proportions (about .650) across blocks of sessions, indicating little discounting of the LLR with delays to its delivery occurring within sessions in random order. Blocks 5–7, however, show that the LEWs discounted the LLR more than the F344s (AUCs = .500 vs. .650, respectively). Yet, estimates of the AUC for the data of the LEWs, ranging from .541 to .742 (M = .607), were not significantly different (W = 3, p > .05) from those for the data of the F344s, ranging from .621 to .736 (M = .698). Food deliveries show that the AUC decreased with each food delivery of the random condition, with the LEWs showing a decrease in AUC values from .850 with the first food delivery to about .400 with the last food delivery, and the F344s also showing a decrease, from .850 with the first food delivery to about .600 with the last food delivery. Estimates of the AUC for the LEWs, ranging from .453 to 1.00 (M = .628) were significantly lower (W = 2, p < .05) than those for the F344 rats, ranging from .568 to 1.00 (M = .707).

The reascending condition generated estimates of the AUC that were comparable to those computed for the first determination of this condition, with the LEWs discounting the LLR more than the F344s across blocks of sessions. Estimates of the AUC for the data of the LEWs, ranging from .327 to .464 (M = .362), were significantly lower (W = 0, p < .05) than those for the data of the F344 rats, ranging from .367 to .590 (M = .458). Food deliveries of the reascending condition show similar estimates of the AUCs for the data of the LEWs and F344s across food deliveries, with the LEWs showing more discounting of the LLR than the F344s show, food by food. Estimates of the AUC for the LEWs, ranging from .294 to .470 (M = .361), were significantly lower (W = 0, p < .05) than those for the data of the F344s, ranging .363 to .568 (M = .437).

In sum, the AUC analysis confirmed that strain differences were not consistent throughout the seven blocks of sessions and food deliveries of the ascending condition; only during the first two blocks of training did the LEWs show steeper discounting. The variability of AUC measures within strains increased the most when the presentation order of delays to LLR changed from ascending to descending; this is additional evidence supporting the disruptive effect of the descending presentation order of delays to LLR. In the random condition, both strains showed the highest AUCs across blocks of sessions and food deliveries, showing no consistent differences in impulsive choices between strains. The results from the two first blocks of the ascending condition were replicated in the reascending condition, with the LEWs showing more discounting than the F344s.

Discussion

The present study has clearly showed that impulsive choice can change as a function of learning and is not a static property of behavior that is mainly determined by genetic and neurochemical mechanisms; thus, when examining impulsive choice, we deal with a dynamic system. We contend that a working definition of impulsivity is a complicated task and that the evidence from animal models to examine impulsivity is warranted by studies that have incorporated novel procedures and several ways to estimate impulsive choice. Accordingly, we developed a concurrent-chains procedure in which response allocation in two levers concurrently available in the initial link was used to assess preference (Grace, 1994). One terminal link varied the delay to the LLR within sessions, capitalizing on Evenden and Ryan’s (1996) method for generating an entire delay-of-reinforcement function within each session. Because more than one response on the same lever was allowed to choose between the SSR and the LLR, the probability to develop an exclusive preference for one or the other lever was reduced; this is important, because exclusive preference occurs when a single response serves to choose and produce either the SSR or the LLR (Mazur, 1987, 2010). Our procedure required locomotion to travel from the front to the back wall of the chamber and press on a lever there to start each cycle of the concurrent-chains schedule; these activities implied some effort representing a cost to reach the choice link; effort plays an important role in preference (Aparicio, 2001; Aparicio & Cabrera, 2001; Aparicio & Otero, 2004; Salamone & Correa, 2009), and cost influences delay discounting (Paglieri, 2013). We also implemented multiple analytical tools, providing an ideal scenario to generate other critical measures and further examine strain differences in impulsive choice.

The study of impulsivity tends to be predominantly linked to pathologies, maladaptive behavior, and personality (Odum, 2011), when in actuality it is a pattern of behavior that may also be adaptive in humans and nonhuman animals (Fawcett, McNamara, & Houston, 2012). In the present study, we applied a behavioral analytical approach, defined impulsivity functionally rather than structurally, and provided a better understanding of such a concept by keeping the definition of impulsivity open to empirical evidence (Winstanley, Eagle, & Robbins, 2006).

Historical variables are worth further exploring within the context of impulsivity. In the present study, LEWs and F344s were exposed to the same sequences of ascending, descending, random, and reascending presentation orders of delays to LLR; hence, varying the order in which this sequence of conditions was presented is a question that deserves further examination. Also, we studied the impulsive choices of the LEWs and F344s for most of their life span, and aging might have contributed to the result showing that the F344s chose the LL lever at higher proportions than did the LEWs during the random and the replication of the ascending presentation-order-of-delays conditions; this finding resembles that obtained with aged F344 rats that, regardless of their experience with the choice situation, preferred the LLR over the SSR (Simon et al., 2010). Nonetheless, we did not examine in isolation the role of aging in determining the preference for the LLR. Still, though, as in Aparicio et al. (2013), the results confirmed that the impulsive choices of LEWs and F344s changed with learning; early in training, the LEWs showed more impulsive choices (as estimated by k) than the F344s, but this between-strain difference in impulsive choices disappeared with experience in the choice situation, showing parallel discounting functions (similar estimates of k for the LEWs and F344s) in the last blocks of sessions of the ascending condition. This finding calls for a reevaluation of studies that have exposed these rat strains to procedures designed to examine impulsivity for shorter lengths of time (Stein et al., 2012; Wilhelm & Mitchell, 2009).

On the basis of the present and previous discoveries (Aparicio et al., 2013), it is fair to conclude that the behavior pattern labeled impulsivity, frequently implicated in maladaptive behavior, can actually change as a function of learning. Additional evidence of the effect of learning on preference was observed in the descending condition; presenting the delays to LLR in descending order disrupted the patterns of choice that the LEWs and F344s established responding to delays to LLR in ascending order. It could be said that in adjusting to the transition from the ascending to the descending presentation order of delays, the LEWs did so more rapidly than the F344s, choosing the LL lever at higher proportions during most delays to LLR. This result is in dispute with previous findings (Anderson & Diller, 2010; Anderson & Woolverton, 2005; Huskinson et al., 2012; Madden et al., 2008; Stein et al., 2012) that showed more impulsivity in the LEWs; the assertion that the LEWs choose more impulsively than the F344s is not supported by the present study, challenging this strain difference and confirming that it may be due to procedural issues (Stein et al., 2012) and the analytical tools used to characterize impulsive choice (Madden & Johnson, 2010).

We conclude that strain differences in impulsivity are not fundamentally a direct function of their neurochemical characteristics (Burnet, Mefford, Smith, Gold, & Sternberg, 1992; Chaouloff et al., 1995; Flores et al., 1998; Lindley et al., 1999; Sziraki et al., 2001), but as we have shown, differences in impulsive choice between LEWs and F344s change as a function of learning (Aparicio et al., 2013). Overall, the discounting functions of the LEWs across delay conditions seem to suggest that they are more adaptable to dynamic changes in the contingencies of reinforcement than are F344s. Particularly, the present results seem to indicate that the LEWs are more sensitive to rapid changes in delay to reinforcement, in the sense that their preferences changed sooner than those of the F344s. It could be concluded that impulsivity is an advantage in dynamic changing environments that might be adaptive for human and nonhuman animals (Fawcett et al., 2012).

The present study extended the generality of the hyperbolic-decay model (Mazur, 1987) and the general form of the matching law (Baum, 1974; Baum & Rachlin, 1969) to the study of impulsive choice. The former model has been successful in describing the data from several experiments on delay discounting in which human and nonhuman animals have been exposed to a variety of procedures examining impulsivity (e.g., Mazur, 2000; Richards et al., 1997; Stein et al., 2012); in agreement with these outcomes, the present results showed that delay discounting in LEWs and F344s was well described by Mazur’s (1987) hyperbolic-decay model. This result, first reported by Stein et al. (2012), using a rapid version of a steady-state adjusting-amount procedure developed by Mazur (2000) and adapted from Richards et al.’s (1997) method, was also documented in a study using a concurrent-chains procedure similar to that used here (Aparicio et al., 2013). Although the general form of the matching law (Baum, 1974; Baum & Rachlin, 1969) has not been extensively used to analyze impulsive choice (e.g., Ainslie, 1992; Aparicio et al., 2013), it accurately described the data from the present study. Consistent with estimates of k obtained with Eq. 1, estimates of s using Eq. 2 also changed across conditions, with the random presentation-order-of-delays condition showing the lowest estimates of s and the redetermination to the ascending condition the highest estimates, suggesting that both the presentation order of the delays to the LLR and the length of training are important factors in determining the sensitivity of preference to dynamic changes in delay to reinforcement (Aparicio et al., 2013).

We demonstrated that using both the hyperbolic-decay model and the general form of the matching law in conjunction to describe impulsive choice provided the possibility of observing compatibility between models of choice, which in turn might offer different ways to analyze impulsivity in nonhuman animals (Aparicio et al., 2013); we observed prevalent high and positive correlations between estimates of discounting rates (k in Eq. 1) and estimates of sensitivity (s in Eq. 2) across blocks of sessions and food deliveries, confirming that the matching law (Herrnstein, 1970) accurately describes impulsive choice in concurrent-chains procedures (Aparicio et al., 2013).

The analyses of the AUC using Myerson et al.’s (2001) method confirmed results showing that between-rat-strain differences in preference changed as a function of the presentation order of delays to LLR and experience in the choice situation. The analysis of blocks of sessions showed no clear between-strain differences in the AUC. For both rat strains, the AUC decreased with progressive training and consecutive food deliveries within delay components. In the descending condition, the AUC was higher for the LEWs across blocks of sessions and food deliveries, but the random condition showed the opposite result: The AUC was higher for the F344s than for the LEWs, and this result was confirmed in the redetermination of the ascending presentation order of delays. Interestingly, for both rat strains the analysis across food deliveries revealed values of the AUC that gradually decreased with consecutive food deliveries, showing a gradual shift from self-controlled choice to impulsive choice with extended training (Aparicio et al., 2013).

An important aspect of the present study was using FT instead of FI schedules in the terminal links of the concurrent-chains procedure to remove the response producing the LLR. Accordingly, we analyzed the distribution of responses during the FT terminal link (computations not shown) to assess the possibility of an accidental reinforcement at the end of the FT schedule (Skinner, 1948). Responses during delays of 5, 10, 20, 40, and 80 s to LLR were counted and sorted in bins of 1 s each across sessions of the same condition. These computations revealed that FT responses decreased with increasing delays to the LLR; the LEWs and F344s emitted more responses at the start of each delay, showing no evidence of the scalloped pattern of responding characterizing FI schedules and discarding the possibility of fortuitous reinforcement at the end of the FT (Skinner, 1948). This finding is consistent with the idea that nonhuman animals are indifferent to FI and FT terminal links of concurrent-chains schedules (Neuringer, 1969).

In conclusion, through the present study we investigated both the acquisition and maintenance of the impulsive behavior of LEWs and F344s, and found more dynamic changes in choice characterizing impulsivity than those previously reported in studies using a rapid-redetermination assessment of indifference points (e.g., Stein et al., 2012; Wilhelm & Mitchell, 2009) and steady-state procedures (e.g., Anderson & Diller, 2010; Anderson & Woolverton, 2005; Huskinson et al., 2012; Madden et al., 2008; Stein et al., 2012). Consistent with findings obtained in nonhuman (Fox et al., 2008) and human (Robles & Vargas, 2007, 2008; Robles et al., 2009) animals, we found that the presentation order of delays to the LLR is important in determining the discounting rate (k in Eq. 1) and sensitivity to the immediacy of reinforcement (s in Eq. 2). We found that the descending presentation order of delay to LLR disrupted the pattern of choices that LEWs and F344s established responding to delays to LLR in the ascending and random conditions. Our results supported the notion that strain differences in impulsivity change as a function of training (Aparicio et al., 2013).

Differences in impulsive choice between LEWs and F344s should be studied carefully and for longer periods of time, because they are not static properties of behavior solely determined by genetic and neurochemical brain mechanisms. We concluded that LEWs are more adaptable to dynamic changes in the contingencies of reinforcement than are F344s, and that impulsivity is an advantage in dynamic reinforcing environments that might be adaptive for both human and nonhuman animals (Fawcett et al., 2012). As has recently been noted, concurrent-chains procedures have distinct advantages over the techniques typically used to study delay discounting, and they can be used without worrying that new discoveries can be accredited to the use of innovative techniques (Oliveira, Green, & Myerson, 2014).