The Near-Miss Effect in Slot Machines: A Review and Experimental Analysis Over Half a Century Later

Pisklak, Jeffrey M.; Yong, Joshua J. H.; Spetch, Marcia L.

doi:10.1007/s10899-019-09891-8

Original Paper
Open access
Published: 14 September 2019

Volume 36, pages 611–632 (2020)
Journal of Gambling Studies

Jeffrey M. Pisklak (ORCID: orcid.org/0000-0002-5465-7978), Joshua J. H. Yong & Marcia L. Spetch

Abstract

In games of chance, a near miss is said to occur when feedback for a loss approximates a win. For instance, obtaining “cherry–cherry–lemon” on a slot machine could be considered a near miss. Sixty-six years ago, B.F. Skinner first proposed the idea that near-miss events might reinforce continued play in slot machines, and despite some inconsistencies in the experimental literature, belief in this “near-miss effect” has remained strong. In the present manuscript, we will review this literature and present experimental assessments of the near-miss effect on the frequency of the gambling response. Experiment 1 used a tightly controlled resistance-to-extinction procedure in pigeons to evaluate the putative reinforcing effect of near misses relative to a control “far-miss” reel pattern. Experiment 2 extended Experiment 1’s procedure to human participants. The results of both experiments failed to support the near-miss effect hypothesis. Experiment 3 used a further simplified procedure to assess the validity of the resistance-to-extinction paradigm when a probable conditional reinforcer was present on the reel stimuli. Although a clear conditional response was obtained from the reel, subsequent testing in extinction revealed no conditionally reinforcing function of this stimulus on operant response frequency.

Introduction

Near misses, also called near hits or near wins, are said to occur when the elements of a game or task “suggest” to a player that they have almost achieved a favourable result. A good example is provided by Witts et al. (2015): consider a novice player making repeated free throws in basketball. With each successive throw, variations in the throwing technique that get the ball closer towards the hoop increase in probability (i.e., are selected for). This is an instance where the “near miss” has a clear reinforcing function on the player’s free-throw behaviour.

Near misses also occur in games of chance, but the crucial difference is that in a game of chance the outcome is a random event. On a standard slot machine, if a win is signalled by “cherry–cherry–cherry,” then “cherry–cherry–lemon” would be considered a near miss. Modern slot machines contain a pseudo-random number generator (RNG) that cycles through about 4.3 billion distinct values continuously at approximately 1000 values per second. For each bet, the machine selects the cycle’s current position and outputs the correlated reel positions onto the display (Schüll 2012). Unlike the free throw in basketball, no amount of practice will improve the odds of winning at the slot machine. Moreover, receiving a near miss is no more informative about an upcoming win than any other type of miss. This raises an important question: if the near miss inside a game of chance is independent of a win and cannot be used to increase the chance of a win, then why is it considered a “near” miss?
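The independence claim can be illustrated with a small simulation. This is only a sketch under assumptions that are not in the source: a hypothetical three-reel machine on which each reel independently shows a cherry with probability 0.5, with Python's `random` module standing in for the machine's RNG. If spins are independent, the chance of a win on the next spin should be about the same after a near miss as after any other miss.

```python
import random

def spin(rng, p_cherry=0.5):
    """One 3-reel outcome; p_cherry is a hypothetical value, not from the source."""
    return tuple("cherry" if rng.random() < p_cherry else "lemon" for _ in range(3))

def classify(outcome):
    """Label an outcome as a win, a near miss (cherry-cherry-lemon), or another miss."""
    if outcome == ("cherry", "cherry", "cherry"):
        return "win"
    if outcome == ("cherry", "cherry", "lemon"):
        return "near_miss"
    return "other_miss"

def win_rate_after(n_spins=200_000, seed=1):
    """Estimate P(win on next spin | type of miss on the current spin)."""
    rng = random.Random(seed)
    labels = [classify(spin(rng)) for _ in range(n_spins)]
    counts = {"near_miss": [0, 0], "other_miss": [0, 0]}  # [next-spin wins, occurrences]
    for prev, nxt in zip(labels, labels[1:]):
        if prev in counts:
            counts[prev][1] += 1
            counts[prev][0] += (nxt == "win")
    return {kind: wins / n for kind, (wins, n) in counts.items()}

rates = win_rate_after()
print(rates)
```

Under these assumptions both estimates should sit close to the unconditional win probability (0.5 cubed, or 0.125), which is the sense in which a near miss carries no information about the next outcome.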

The answer seems to be that the near miss is visually “nearly” a win. For example, cherry–cherry–lemon looks more similar than cherry–lemon–lemon to a win signalled by three cherries. In the case of basketball free throws, visual aspects of the situation provide helpful feedback to the player. Near-miss feedback on slot machines, however, offers no practical use for improving performance. One possibility is that the visual aspect of the near miss is exploiting learning processes—most notably conditional reinforcement—that evolved to detect contingent (i.e., non-random) outcomes. In this manuscript, we will first review the existing literature pertaining to the reinforcing effect of near misses on gambling persistence, and then we will present the results of experiments specifically designed to test possible reinforcing effects of near-miss stimuli on gambling persistence in both humans and pigeons.

Conditional Reinforcement

Conditional reinforcement is thought to play a significant role in gambling behaviour. For example, audio-visual stimuli correlated with winning on slot machines may acquire conditionally reinforcing properties that encourage further play (Cherkasova et al. 2018). As early as 1953, B.F. Skinner discussed conditional reinforcement in what is likely the first scientific account of near misses. Skinner’s (1953) account drew upon conditional reinforcement as one of the plausible methods casinos were using to exploit their patrons and it is still cited as an explanation of the near-miss effect—the belief that near-miss events contribute to an increased frequency of gambling. More formally, we define the near-miss effect as a reinforcing function of near-miss events on total frequency of play in games of chance. Skinner’s original hypothesis rested on two critical factors: (1) that conditional reinforcement is based on pairing (i.e., contiguity), and (2) that near misses do in fact increase the frequency of betting responses. When Skinner proposed this, the available evidence largely supported a pairing account of conditional reinforcement, and the reinforcing effect of near misses was a sensible a priori prediction. Subsequent research, however, has shown that the pairing account is inadequate since it ignores nuances that influence and characterize this type of learning (Lattal 2013; Rescorla 1988), and because pairing is not sufficient to produce a conditionally reinforcing effect (e.g., Schoenfeld et al. 1950; Stubbs 1971). Contemporary accounts of conditional reinforcement now emphasize contingency, with two prominent mathematical models being delay reduction theory and the hyperbolic decay model (Fantino 1977; Mazur 1997). Despite wide acceptance of these models in behaviour analysis, the near-miss effect continues to be conceptualized through the pairing formulation (e.g., Kassinove and Schare 2001).

Experimental tests of conditional reinforcement often employ chained schedules in which two or more schedules of reinforcement, signalled by unique exteroceptive stimuli, are presented successively. Conditionally reinforcing effects of a stimulus within the chain can then be assessed by instituting extinction and comparing responding in the presence and absence of the putative conditional reinforcer. As noted by Kelleher and Gollub (1962), extinction procedures avoid the confounding effects of unconditional reinforcers in testing but can inadvertently introduce other problems. For example, extinction may alter the context such that stimuli function differently than in training (i.e., are “viewed” differently by the organism). Kelleher and Gollub also noted that extinction often produces only small effects—presumably because the conditional reinforcer is being simultaneously tested and extinguished. Finally, a problem specific to near misses is that conventional chained procedures that successfully produce conditional reinforcement have a logical predictability (i.e., contingency) between the putative conditional reinforcer and subsequent unconditional reinforcer. The classic slot machine, however, provides random outcomes with no contingency between the occurrence of near-miss stimuli and the subsequent occurrence of a winning outcome. It is not clear, therefore, why near-miss stimuli should be assumed to have conditionally reinforcing effects.

Furthermore, near misses can be conceptualized globally (i.e., cherry–cherry–lemon could be viewed as a single stimulus) or locally (i.e., each element as a separate stimulus). From a global view, the conditionally reinforcing effect of win-related stimuli may generalize better to near misses than to other, more dissimilar, misses (Belisle and Dixon 2016; Daly et al. 2014). Evidence consistent with a stimulus generalization account comes from findings that reel outcomes more visually similar to wins will generate longer latencies that are more like the latencies which occur after a win. However, it has been found that latencies are sometimes shorter—not longer—for stimuli that approximate a win (Dixon et al. 2013). Most problematic is that these studies have focussed only on latency and have not reported on overall rates of responding. Reinforcers increase response rates, leading to more cumulative responses across an interval of time despite post-reinforcement pausing (Ferster and Skinner 1957). Thus, evidence that the reinforcing function of winning stimuli generalizes to near misses also requires a demonstration that near misses increase the overall number or rate of bets. Near-miss events could also potentially function as conditional reinforcers by operating locally. For example, if cherry–cherry–cherry signals a win, then a cherry in the first reel position signals that the odds of winning have increased. However, the outcome of the spin is resolved quickly, so it is not clear that this brief change in probability signalled by the stimuli would be sufficient to conditionally reinforce gambling persistence.

Animal Research

Despite calls for increased experimental analysis of gambling behaviour employing non-human animals (Weatherly and Phelps 2006), animal research on the near-miss effect is fairly sparse. This is surprising given the historical precedence of animal research on questions of reinforcement generally. Nevertheless, a few studies warrant discussion.

Using a refined slot-machine analogue (Peters et al. 2010; Weatherly and Derenne 2007), Winstanley et al. (2011) measured rates of extinction for two groups of rats’ responses on a roll lever. Alongside various neuropharmacological treatments, one group of eight rats received trials that contained near misses and another group of eight received no near-miss trials. The rats obtained food they won by responding on a collect lever. Pressing this lever on loss trials incurred a timeout penalty, thus making it advantageous to only press when winning cues were present. Rates of extinction on the roll lever were not significantly different between the two groups. Collect lever responses increased linearly as a function of win similarity and were more frequent after a near miss than, for instance, a full loss, which the researchers suggested may reflect “a process similar to a ‘near-miss’ effect” (p. 917). While this finding is consistent with a stimulus generalization account of near misses, it should not be ignored that the critical extinction measure on the more relevant roll lever produced no such effect. Earlier research using similar methods likewise found no increase in persistence on the roll lever (Peters et al. 2010).

Other work has explored the reinforcing effect of near misses in choice paradigms. Two notable studies examined pigeons’ persistence on concurrently available alternatives (Fortes et al. 2017; Stagner et al. 2015). The pigeons consistently preferred alternatives containing no near misses across various manipulations. The findings are consistent with both delay reduction theory and the hyperbolic decay model’s interpretation of conditional reinforcement.

Another choice study analyzed matching (Davison and McCarthy 1988) between rates of responding and reinforcer value (Rice and Kyonka 2017). Results showed a consistent bias, across three pigeons, towards a “gambling” option containing near misses relative to a certain option that contained no losses (and therefore no near misses). However, this same bias was not observed when the gambling option containing near misses was tested against a similarly probabilistic option that contained no near misses.

Human Research

Although there is considerable evidence that near-miss events can affect subjective measures (e.g., self-reports) and physiological responses (e.g., skin conductance, heart rate, or brain activity), there have been surprisingly few direct experimental tests of the presumed reinforcing function of near misses on gambling persistence. Effects on subjective motivation or physiological responses are consistent with a reinforcing effect on gambling frequency, but without a direct behavioral measure they do not actually demonstrate such an effect. For example, Dymond et al. (2014) found that increased activity in win-related brain regions was correlated with near misses (a similar effect was seen in pigeons’ neurophysiological responses to near misses; see Scarf et al. 2011) and with a trait measure of gambling propensity. They claimed to find “convincing evidence of a role for reward-related brain responses to near-miss outcomes, particularly in the insula, in maintaining PG [problem gambling]” (p. 216). Although the inference that near misses enhanced the activity in win-related brain regions is justified, the contribution of these near misses to problem gambling was based on a questionnaire with no direct assessment of the relationship between the near misses and the players’ actual gambling behaviour. Similar limitations exist in other studies (Billieux et al. 2012; Clark et al. 2009, 2013; Dixon et al. 2013; Habib and Dixon 2010).

A few experimental studies are frequently cited for providing evidence that near misses have a reinforcing function. Strickland and Grote (1967) found that significantly more participants who saw winning symbols occur more frequently on the earlier presented reels of a slot machine (i.e., more near misses than far misses), opted to continue playing than participants who saw more far misses than near misses. However, the average number of trials played by participants who opted to keep playing did not significantly differ between the two groups. Reid (1986) attempted two systematic replications of Strickland and Grote’s study. One used a card-based version of the slot machine task and the other used simulated slot machines. Only the latter showed the same pattern of results obtained by Strickland and Grote but neither replication obtained significant effects.

More recently, Kassinove and Schare (2001) manipulated the frequency of near misses on a four-reel slot machine simulation that participants played for money. Near misses had a 15%, 30%, or 45% chance of occurring for different groups. Participants were then put onto an extinction condition that removed both wins and near misses and could continue to play for as long as they desired. The 30% group showed the most persistence during extinction. A self-report administered after extinction showed no group differences on willingness to return to play in the future. The authors accounted for their findings in terms of conditional reinforcement, assuming that the 45% near miss frequency was presented too frequently (leading to respondent extinction), whereas the 15% condition was presented too infrequently (not paired enough). Across the 15%, 30%, and 45% groups, means (SD) of 5.88 (8.06), 10.26 (11.47), and 6.66 (8.22) responses in extinction were obtained respectively. This is one of the most frequently cited studies of evidence for a reinforcing function of near misses. However, it has some potential problems. First, Kassinove and Schare analyzed their extinction data using parametric statistics, which can be problematic for highly skewed data. In extinction, the probability of responding declines after each subsequent unreinforced response, making the data heavily bounded by zero. Additionally, if one assumes their data is symmetric then, based on their obtained variances, approximately 25% of their data points in the 30% group would fall below zero, which is not possible. Consequently, the obtained means and variances may not represent the population’s true value, particularly in the 30% condition which had substantially higher variance than the other groups, and thus a stronger relative pull of the mean under conditions of skewness. A similar criticism could potentially be made for other extinction- or persistence-based studies (Ghezzi et al. 2006; Reid 1986; Strickland and Grote 1967). A second concern is that extinction was tested without near misses, i.e., without the presence of the putative conditional reinforcer, which makes the observed difference puzzling. Finally, across three experiments, Ghezzi et al. (2006) attempted to replicate the findings of both Kassinove and Schare and Strickland and Grote by having participants play a simulated slot machine for points. Despite marginally larger group sample sizes than Kassinove and Schare, only one of their three experiments revealed a main effect of near-miss density on gambling persistence, and the most effective density was 66%. Overall, Ghezzi et al.’s findings were not commensurate with the original papers. Although there were notable procedural differences from the original two studies, this failure to replicate is concerning for the robustness of the near-miss effect on gambling persistence.

In another well-cited study, Cote et al. (2003) had participants play a three-reel video slot machine for money. The first phase contained 48 trials, programmed to give 12 near misses and 9 wins. In the second phase participants were, unknowingly, placed onto extinction. For an experimental group, extinction removed wins only. For a control group, extinction removed both wins and near misses. Participants could stop playing at any point and keep their earnings. Non-parametric analyses revealed that participants in the experimental group played significantly more during extinction than the control group. Although this study is cited as strong evidence of a near-miss effect, we believe there is a noteworthy concern that precludes it from providing evidence that near misses in a typical slot machine would increase gambling persistence. Specifically, in the first phase of the experiment, each win was preceded by a near miss. This contingency meant that each time a near miss was encountered, a win would be 75% certain to occur on the next trial, and thus near misses actually predicted an increased chance of winning. This contingency would be expected to produce reinforcing effects, but it does not exist in typical slot machines.
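The 75% figure follows from the trial counts reported above. The short calculation below is a sketch that assumes each of the 9 wins immediately followed a different one of the 12 near misses, and compares the resulting conditional probability with the overall win rate across the 48 trials. The variable names are illustrative only.

```python
TRIALS = 48       # phase-one trials
NEAR_MISSES = 12  # programmed near misses
WINS = 9          # programmed wins

# Assumption: every win was immediately preceded by a distinct near miss,
# so 9 of the 12 near misses were followed by a win.
p_win_after_near_miss = WINS / NEAR_MISSES

# Overall (unconditional) proportion of winning trials, for comparison.
p_win_overall = WINS / TRIALS

print(p_win_after_near_miss, p_win_overall)  # 0.75 0.1875
```

On these numbers a near miss raised the chance of a win on the following trial from roughly 19% to 75%, which is the predictive contingency that a typical slot machine lacks.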

An experiment by MacLin et al. (2007) gave recreational slot machine gamblers three concurrently available three-reel slot machines to play for points. Each machine had a specific frequency of near-miss presentations: 15%, 30%, or 45%. After 100 required trials, participants could choose to keep playing for a chance to earn a cash prize for the highest score among a pool of participants. However, at this point, the machines ceased to pay out wins (i.e., extinction). Participants persisted most with a 45% frequency and least with a 15% frequency on average, but the difference was not statistically significant.

Other studies that did not employ persistence-based methodologies have found similarly varied results ranging from support (Kurucz and Koermendi 2012; Tan et al. 2015) to lack of support (Witts et al. 2015) for a near-miss effect. Interestingly, Sundali et al. (2012) examined real casino data from electronic roulette terminals. They modelled the data of 36 players using regression analyses and found no evidence that near misses had reinforced the gamblers’ playing in terms of time spent playing or number of bets placed.

The variance across the aforementioned studies does not appear to be a function of the population sampled. Most did not specifically recruit problem gamblers and some used the South Oaks Gambling Screen (Lesieur and Blume 1987) to exclude probable pathological gamblers (Cote et al. 2003; Kassinove and Schare 2001; MacLin et al. 2007; Witts et al. 2015). Of these, only MacLin et al. (2007) sampled recreational gamblers. Strickland and Grote (1967) recruited rural high-school students and other studies recruited undergraduate students (Ghezzi et al. 2006; Kurucz and Koermendi 2012; Reid 1986; Tan et al. 2015).

The number of training trials ranged between 20 and 100 trials across studies. Of the studies showing increased persistence, Cote et al. (2003) used 48 trials, Kassinove and Schare (2001) used 50 trials, and Strickland and Grote (1967) used 100 trials. Interestingly, Ghezzi et al. (2006) varied the number of training trials (25, 50, 75, and 100) and found a significant effect on persistence only after 25 trials, with no trends as a function of training trials, and no interaction with near-miss density. This raises an important question: What is an appropriate amount of training in these studies? Generally, more training seems preferable to less training and this may be especially true for participants with less experience playing games of chance. Their expectations about gambling may conflict with the real-life gambling contingencies that experiments try to model, thus creating biases that may go unchallenged at low levels of training. Therefore, the results of novice gamblers may be different from more experienced gamblers, particularly with low levels of training.

Perhaps because of Skinner’s influence, the suggestion that near misses are functioning as conditional reinforcers has been largely taken for granted, yet our review of experimental studies has revealed inconsistent results and inconclusive evidence for an effect of near misses on gambling persistence. Many human studies have used either real lottery terminals or programs designed to mimic casino slot machines, which enhance external validity but can complicate the analyses of basic effects and comparisons between studies. Motivation levels can differ widely between individuals, and basic parameters such as the population sampled and the number of trials often vary across experiments. Arguably, the pursuit of external validity has hindered the establishment of internal validity of near-miss research. What seems required is a more controlled analysis reminiscent of the work Skinner favoured.

Pigeons have had a long and storied history in the study of conditional reinforcement and learning in general (Logue 2002). Many properties of pigeons make them suitable for a controlled analysis of near-miss effects: they have excellent visual acuity, easily regulated motivation levels, and exhibit steep rates of temporal discounting relative to other common laboratory animals (Stevens and Stephens 2009), which is pertinent for gambling research because problem gamblers have been found to discount delayed rewards more than non-gamblers (Dixon et al. 2003; Petry and Madden 2010; Reynolds 2006). Some have theorized that steep delay discounting, together with reinforcement schedules, is responsible for much of the maladaptive behaviour exhibited by problem gamblers (Rachlin et al. 2015). Enhanced experimental control can make behavioural data obtained through animal models more reliable than studying humans, although results should be verified in humans whenever feasible.

Overview of Experiments

Experiments 1 and 2 used a resistance-to-extinction paradigm to test the reinforcing function of a near-miss stimulus pattern against a far-miss control pattern on both pigeons and humans. Using logic similar to the two notable experimental demonstrations of a near-miss effect (Cote et al. 2003; Kassinove and Schare 2001), if near-miss events function as conditional reinforcers more than other types of misses, then greater resistance to extinction should occur in conditions with a higher frequency of near-miss events than far-miss events.

Experiment 3 was designed to address the lack of a near-miss effect in Experiments 1 and 2. This experiment used pigeons and did not involve near-miss events. Instead, one of two single reel stimuli was contingently paired with food and its ability to elicit a conditional response was verified. The presence of that conditional stimulus was then tested during extinction to assess the validity of an extinction-based test of conditional reinforcement on behavioural persistence.

Experiments 1a and 1b

Methods

Subjects and Apparatus

For both Experiment 1a and 1b, homing pigeons (Columba livia) were randomly selected from a University of Alberta colony room for the experiment. A sample of eight pigeons was used, which is the same number employed in Stagner et al. (2015), and larger than other near-miss studies with pigeons (Fortes et al. 2017; Rice and Kyonka 2017; Scarf et al. 2011); the higher levels of experimental control in animal research and the balanced within-subject design increase statistical power. Subjects were housed in 65 × 27 × 70 in. flight cages in a colony room maintained at 20 °C and a 12-h daylight cycle from 6:00 a.m. to 6:00 p.m. MST. All birds had free access to vitamin-enriched water and crushed oyster shell grit in the colony room. Subjects were maintained at 80% of their free-feeding weight by adjusting their post-experiment feeding of Mazuri Gamebird food pellets (PMI Nutrition International).

Six custom-built operant boxes were equipped with Carrol Touch infrared touchscreens (Elo Touch Systems, Inc., Menlo Park, CA) to detect pecking responses. Stimuli were presented on a centrally-mounted 17” Viewsonic LCD monitor located at the back wall of each chamber. Speakers in the operant boxes continuously played white noise to mask external sounds. The sound pressure levels were equalized in each operant box at 65 dB via A-weighting filter with a Brüel and Kjær Type 2239 Integrating Sound Level Meter. Two 2 × 2 in. feeding ports equipped with food hoppers flanked the monitor. Access to food was controlled by Colbourne H20-94 photocell sensors that detected entry into the ports. Stimuli were presented and responses were logged using E-Prime® 2.0 Professional software. A ⅛ in. thick white plastic barrier was mounted in front of the screen to prevent errant behaviours (e.g., subject’s wings, bodies, and feet contacting the screen) from interfering with the touch screen’s ability to record pecking responses. The barrier had four holes cut in it: three horizontal circles were cut to 1½ in. diameter near the top to allow for visual identification of the reel stimuli and a 1 in. diameter hole was cut beneath the middle of the three circles to allow for pecking responses.

Stimuli were presented on a black background and were aligned behind the holes cut in the barrier. A white pecking circle was presented at the smaller 1 in. hole. The win and miss patterns (i.e., reel patterns) were presented at the upper three circles. For Experiment 1a, each reel pattern consisted only of those elements comprising the winning reel pattern, which was displayed as three red circles. A win occurred when all three circles were red. A near miss occurred when only the left and middle circles were red. A flanked miss occurred when only the left and right circles were red. A far miss occurred when only the middle and right circles were red. Finally, a single miss occurred when only one circle was red. On single misses, each of the three locations had an equal chance of turning red. Non-illuminated circles were left black. Experiment 1b was identical but with the exception that the non-illuminated circles were coloured blue. For example, a trial displaying a far-miss would be presented as blue–red–red, with the blue signalling non-reinforcement. We denote red circles as putative S^D stimuli (i.e., stimuli that occasion reinforcement) and blue circles as putative S^Δ stimuli (i.e., stimuli that occasion nonreinforcement).

Reels were always presented sequentially from left to right with a 600 ms interval between each presentation. For instance, this was the sequence of events on a ‘winning’ trial: following a peck to the white circle, the white circle disappeared. After 600 ms, the left reel appeared. Then, the middle and right reel each appeared in sequence with a 600 ms interval between each presentation. In total, the sequence was 2.4 s long. Then, the right or left hopper (randomly chosen) rose to provide 1 s access to food pellets (i.e., unconditional reinforcement) with the reels remaining on screen during this period. Finally, the reels reset to black and the white pecking circle returned to occasion the start of the next trial. On all loss trials, each of the intervals still occurred whether or not a red circle appeared so that the total sequence was always 2.4 s long. The reels then reset back to black and the white pecking circle reappeared following the last 600 ms interval. Thus, with the exception of the initial autoshaping phase in this and the subsequent experiments, there were no intertrial intervals following either the win or the loss trials. Figure 1 diagrams this sequence for both win and far-miss outcomes.
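The trial timing described above can be laid out as a simple event list. This is only a sketch: the function and event labels are hypothetical, and the final 600 ms interval after the right reel is inferred from the stated 2.4 s total rather than quoted directly from the text.

```python
INTERVAL_MS = 600
REELS = ("left", "middle", "right")

def trial_timeline(is_win):
    """Return (milliseconds after the peck, event) pairs for a single trial."""
    events = [(0, "peck: white circle disappears")]
    # Each reel appears one interval after the previous event.
    for i, reel in enumerate(REELS, start=1):
        events.append((i * INTERVAL_MS, f"{reel} reel appears"))
    # Inferred: one further interval follows the right reel, giving 2400 ms in total.
    end_ms = (len(REELS) + 1) * INTERVAL_MS
    if is_win:
        events.append((end_ms, "hopper raised for 1 s of food access"))
    else:
        events.append((end_ms, "reels reset to black, white circle returns"))
    return events

for time_ms, event in trial_timeline(is_win=True):
    print(time_ms, event)
```

On loss trials the same four intervals elapse even when a reel position shows no red circle, so wins and losses differ only in what happens at the 2400 ms mark.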

Ethical Statement

All procedures for the care and use of animals were in accordance with the ethical standards of the University of Alberta, Canadian Council on Animal Care, and were approved by the Bioscience Animal Care and Use Committee (Protocol AUP00002018).

Procedure

The experiment was structured as a repeated-measures design with two treatments: the near-miss and far-miss treatments. The order of these treatments for each subject was randomly determined according to a Latin square design (i.e., subject × treatment). Each treatment was preceded by a pre-exposure phase comprised of three components. Only one handler conducted the experimental sessions to further reduce sources of unsystematic variation that can occur in extinction-based designs. Additionally, because the noise of hoppers in adjacent operant chambers might confound the measures taken during periods of extinction, all pigeons had their extinction sessions yoked to occur at the same time and day as other pigeons concurrently running in the experiment.

In the pre-exposure phase, the subjects began on a basic autoshaping procedure (Schwartz and Gamzu 1977), with a fixed-ratio 1 (FR1) contingency built in. Here, a white circle was presented behind the lower response hole for 10 s or until the bird pecked it. After the 10 s or after one peck on the white circle, the circle disappeared and then either the left or right feeding port light illuminated and its hopper raised. When the bird’s head entered the port, it had 1 s to eat from the hopper. Then, the port light extinguished, the hopper lowered, and a 240 s interval began. This process repeated over a 90 min session. All subjects remained on this component for three consecutive sessions.

Following autoshaping, the subjects were put on a FR1 schedule. Responses were made on the lower response hole as before. Following one peck, the white pecking circle disappeared and then the subject had to wait 2400 ms until they were given access to food from either the left or right hopper. The FR1 schedule lasted for one 90 min session. On the next session, the schedule was extended to a FR3 (i.e., three pecks were required to gain access to food). The 2400 ms interval was still implemented after every response. Following one session of FR3, the schedule was extended to a FR6 for one more session. This procedure was followed for all birds with the exception of Bird 76 in Experiment 1b, who displayed unusually low responses after being moved to the FR1 schedule. Autoshaping was reinstated until high rates of responding on the FR1 schedule occurred. Due to the necessity of yoking the extinction sessions, the FR3 and FR6 conditions were skipped in the first half of Bird 76’s sessions to allow for maximal exposure to the contingency in the last pre-exposure component.

In the final pre-exposure component, subjects were put on a random-ratio 5 (RR5) schedule. Similar to a variable-ratio (VR) schedule, an RR schedule typically contains an average response requirement. However, unlike a VR schedule, an RR schedule is determined by pseudo-random number generation as opposed to a predetermined number of reinforced and nonreinforced trials (Hurlburt et al. 1980; Zeiler 1977). In this pre-exposure component, each peck was separated by at least a 2400 ms interval, but during this interval the reel stimuli were now presented at the upper three circles. Birds only gained access to food following wins. All reel outcomes were equiprobable. Specifically, wins, near misses, and all other loss types each occurred 20% of the time, and these probabilities were equated every 30 responses. Each bird received fifteen 90-min-long sessions, apart from Bird 76, who only received nine sessions of the RR5 schedule in the first half of Experiment 1b.
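The equating of outcome probabilities every 30 responses can be illustrated with a short sketch. This is not the authors' control software; it is one plausible reading in which each block of 30 responses contains exactly six instances of each of five outcome types in shuffled order. The outcome labels other than the win and the near miss, the treatment of the far miss as one of the pre-exposure loss types, and the function names are assumptions made for illustration.

```python
import random

# Labels are illustrative. The text names wins and near misses; treating the
# far miss as a pre-exposure loss type and the names "loss_c" and "loss_d"
# are assumptions, not taken from the paper.
OUTCOMES = ["win", "near_miss", "far_miss", "loss_c", "loss_d"]
BLOCK_SIZE = 30  # probabilities are equated every 30 responses


def make_block(rng):
    """Return one shuffled block of 30 outcomes with 6 of each type (20% each)."""
    per_type = BLOCK_SIZE // len(OUTCOMES)
    block = [outcome for outcome in OUTCOMES for _ in range(per_type)]
    rng.shuffle(block)
    return block


def outcome_stream(n_responses, seed=None):
    """Yield one reel outcome per response, drawing a new block when needed."""
    rng = random.Random(seed)
    block = []
    for _ in range(n_responses):
        if not block:
            block = make_block(rng)
        yield block.pop()


if __name__ == "__main__":
    first_block = list(outcome_stream(BLOCK_SIZE, seed=1))
    print({outcome: first_block.count(outcome) for outcome in OUTCOMES})
```

Under this reading, a win (and therefore food) follows one response in five on average, which matches the RR5 requirement, while the exact position of wins within a block remains unpredictable.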

In the treatment phase, which followed the pre-exposure phase, all subjects were put on either the near-miss or far-miss treatment. In either treatment, responses on the white key were put on extinction. Specifically, all win reels and unconditional reinforcement were replaced by either the near-miss or far-miss reel patterns. For instance, a subject in the near-miss treatment experienced near misses 40% of the time and no wins, and the proportions of the other loss feedback were left unchanged. The subjects were not presented with any additional cues to signal this change in the schedule. After completing the first treatment phase, all subjects were returned to the pre-exposure phase and then completed the other treatment condition (i.e., a pigeon that completed the near-miss treatment first completed the far-miss treatment next, and vice versa).
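The arithmetic behind the 40% figure can be made explicit with a minimal sketch. It assumes the five equiprobable pre-exposure outcomes described earlier and moves the win probability onto the treatment's reel pattern. The labels `far_miss`, `loss_c`, and `loss_d` and the function name are hypothetical and serve only to illustrate the bookkeeping.

```python
from fractions import Fraction

# Pre-exposure proportions: five outcome types at 20% each.
# Labels other than "win" and "near_miss" are illustrative assumptions.
PRE_EXPOSURE = {
    name: Fraction(1, 5)
    for name in ["win", "near_miss", "far_miss", "loss_c", "loss_d"]
}


def treatment_probabilities(pre_exposure, treatment):
    """Replace every win with the treatment's reel pattern (extinction)."""
    if treatment == "win" or treatment not in pre_exposure:
        raise ValueError(f"unknown treatment pattern: {treatment!r}")
    probs = dict(pre_exposure)
    probs[treatment] += probs["win"]  # wins become the treatment pattern
    probs["win"] = Fraction(0)        # no wins, hence no food, in extinction
    return probs


if __name__ == "__main__":
    for name in ("near_miss", "far_miss"):
        probs = treatment_probabilities(PRE_EXPOSURE, name)
        print(name, {k: float(v) for k, v in probs.items()})
```

In the near-miss treatment this gives near misses a proportion of 2/5 and wins a proportion of 0, with every other loss type remaining at 1/5. The far-miss treatment is the mirror image.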

Results and Discussion

Statistical analyses were conducted using R 3.5.0 (R Core Team 2018). The mean difference in resistance to extinction between the near-miss and far-miss treatments was assessed using a paired t test on the cumulative number of responses made during extinction. An effect size was calculated using an unbiased estimate of Cohen’s d (see equation 11.13 in Cumming 2012). To best account for the reduction in variability offered by paired designs, the d estimate was standardized using the standard deviation of the difference scores. To obtain the relative odds in favour of an alternative hypothesis against the null, a JZS Bayes Factor (BF₁₀) using a medium prior was calculated (Morey and Rouder 2011, 2015).
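For readers who wish to see the shape of these computations, the following is a minimal sketch in Python rather than the R code actually used. It computes the paired t statistic and a Cohen's d standardized by the standard deviation of the difference scores, then applies the common small-sample approximation 1 − 3/(4df − 1) as the bias correction. That this approximation is the form given in Cumming's equation 11.13 is an assumption, the input numbers are made up for illustration, and the p value and Bayes Factor are not reproduced here.

```python
import math
import statistics


def paired_summary(x, y):
    """Paired t statistic and bias-corrected d based on difference scores."""
    if len(x) != len(y):
        raise ValueError("paired samples must have equal length")
    diffs = [a - b for a, b in zip(x, y)]
    n = len(diffs)
    df = n - 1
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)        # sample SD of the differences
    t = mean_diff / (sd_diff / math.sqrt(n))
    d = mean_diff / sd_diff                  # standardized by SD of differences
    correction = 1 - 3 / (4 * df - 1)        # assumed small-sample correction
    return {"t": t, "df": df, "d": d, "d_unbiased": correction * d}


if __name__ == "__main__":
    # Hypothetical cumulative response counts for eight subjects, NOT study data.
    near_miss = [520, 610, 700, 480, 830, 590, 660, 540]
    far_miss = [600, 640, 690, 560, 900, 610, 720, 650]
    print(paired_summary(near_miss, far_miss))
```

Because d is standardized by the standard deviation of the differences, it is tied to the test statistic by d = t/√n before the correction is applied.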

The results of Experiment 1a are depicted in the top row of Fig. 2. Contrary to the predictions of a near-miss effect, the near-miss treatment showed less overall responding (M = 613.25, 95% CI [407.26, 819.24]) in extinction than the corresponding far-miss control treatment (M = 784.12, 95% CI [537.73, 1030.52]). However, the difference between the two treatments was not statistically significant, nor was the obtained Bayes Factor meaningfully large, t(7) = −1.68, 95% CI [−410.75, 69.00], p = .136, d = 0.53, BF₁₀ = 0.92.

The results of Experiment 1b are depicted in the bottom row of Fig. 2 and, like those of Experiment 1a, are inconclusive. The near-miss treatment (M = 677.88, 95% CI [436.94, 918.81]) produced marginally more cumulative responses during extinction than the corresponding far-miss control treatment (M = 573.12, 95% CI [353.40, 792.85]), t(7) = 1.44, 95% CI [−67.31, 276.81], p = .193, d = 0.45, BF₁₀ = 0.73.
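As a rough consistency check, the reported t and d values can be recovered from the reported 95% confidence intervals on the mean difference. The sketch below assumes n = 8 per experiment (implied by the 7 degrees of freedom), uses the standard two-tailed critical value of t for df = 7, and applies the same approximate bias correction, 1 − 3/(4df − 1), which is assumed rather than confirmed to be the one the authors used. The function name is hypothetical.

```python
import math

T_CRIT_DF7 = 2.364624  # two-tailed 95% critical value of t with df = 7
N = 8                  # subjects per experiment, implied by t(7)


def recover_from_ci(ci_low, ci_high, n=N, t_crit=T_CRIT_DF7):
    """Recover mean difference, t, and bias-corrected d from a 95% CI."""
    mean_diff = (ci_low + ci_high) / 2        # CI midpoint
    se = (ci_high - ci_low) / 2 / t_crit      # half-width divided by t critical
    t = mean_diff / se
    sd_diff = se * math.sqrt(n)               # SD of the difference scores
    d = mean_diff / sd_diff
    d_unbiased = (1 - 3 / (4 * (n - 1) - 1)) * d
    return {"mean_diff": mean_diff, "t": t, "d_unbiased": d_unbiased}


if __name__ == "__main__":
    for label, (low, high) in {"1a": (-410.75, 69.00),
                               "1b": (-67.31, 276.81)}.items():
        result = recover_from_ci(low, high)
        print(label, {k: round(v, 2) for k, v in result.items()})
```

The recovered values are approximately t = −1.68 and |d| = 0.53 for Experiment 1a and t = 1.44 and d = 0.45 for Experiment 1b, in agreement with the reported statistics. The recovered d for Experiment 1a is negative, which suggests that the reported effect sizes are magnitudes.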

Overall, the results of Experiments 1a and 1b failed to provide evidence that near misses have a conditionally reinforcing function on the gambling response. However, it may be the case that the supposed near-miss effect is a uniquely human phenomenon and does not apply to non-human or non-primate organisms. While the literature on operant conditioning seems at odds with such a hypothesis, given the strong reliability of reinforcement processes across numerous species, the possibility should nonetheless be considered. The animal-based literature has, in most cases, failed to straightforwardly demonstrate the reinforcing nature of near-miss stimuli, whereas the human-based literature has been, at least occasionally, successful. Given this, Experiment 2 adapted the procedure of Experiments 1a and 1b and applied it to human participants.