The paradox of preference for unreliable reinforcement: The role of context and conditioned reinforcement.
Signals during delays can make unreliable reinforcement feel better than sure reinforcement.
01Research in Context
What this study did
The authors looked at old pigeon data. The birds picked schedules that paid off only half the time.
They asked why any creature would choose the worse deal. Their answer: signals during the wait act like extra candy.
What they found
Pigeons leaned toward the unreliable side if two things happened. A light or sound came on during the delay. That signal itself became reinforcing.
The setup made bad odds feel good. The birds worked harder for 50% pay than for sure pay.
How this fits with other research
Geckeler et al. (2000) later tested kids with ID. Same trick worked. The children also chased the shaky schedule when a fun stimulus marked the wait.
Perez et al. (2015) pushed it further. They used signals that sometimes meant no food at all. The pigeons still picked that side.
McDevitt et al. (2016) reviewed many bird studies. They concluded that "good-news" cues are so powerful they can wreck optimal choice.
Together these papers show the effect jumps species. From pigeons to children, signals can override the real payoff rate.
Why it matters
Your client may stick with problem behavior that only pays sometimes. Check what cues sit between the response and the reinforcer. A fun song, a flashing toy, or even your own smile can act as conditioned reinforcement. Strip those signals or move them to the better response. You might shift preference toward the reliable, adaptive choice.
Want CEUs on This Topic?
The ABA Clubhouse has 60+ free CEUs — live every Wednesday. Ethics, supervision & clinical topics.
Join Free →Remove or swap the fun cue that marks the wait for problem behavior; pair it only with the adaptive response.
02At a glance
03Original abstract
We discuss Belke and Spetch's (1994) work on choice between reliable and unreliable reinforcement. The studies by Belke and Spetch extend a line of basic research demonstrating that under certain experimental conditions in a concurrent chains procedure, pigeons prefer an alternative that produces unreliable reinforcement. The authors describe the variables that influence preference for unreliable reinforcement, including the signaling and the duration of the reinforcement schedules, the context in which the signaling stimuli occur, and the effects of conditioned reinforcement. Hypothetical applied examples that address these variables are provided, and their influence on preference for unreliable reinforcement in humans is discussed. We conclude by suggesting a line of applied research to examine the relationship between these variables and a preference for unreliable reinforcement.
Journal of applied behavior analysis, 1995 · doi:10.1901/jaba.1995.28-389