ABA Fundamentals

The paradox of preference for unreliable reinforcement: The role of context and conditioned reinforcement.

Lalli et al. (1995) · Journal of applied behavior analysis

★ The Verdict

Signals during delays can make unreliable reinforcement feel better than sure reinforcement.

✓ Read this if BCBAs who use token boards, timers, or praise during DRO or DRA.

✗ Skip if Clinicians working only with immediate, continuous reinforcement.

01Research in Context

What this study did

The authors looked at old pigeon data. The birds picked schedules that paid off only half the time.

They asked why any creature would choose the worse deal. Their answer: signals during the wait act like extra candy.

What they found

Pigeons leaned toward the unreliable side if two things happened. A light or sound came on during the delay. That signal itself became reinforcing.

The setup made bad odds feel good. The birds worked harder for 50% pay than for sure pay.

How this fits with other research

Geckeler et al. (2000) later tested kids with ID. Same trick worked. The children also chased the shaky schedule when a fun stimulus marked the wait.

Perez et al. (2015) pushed it further. They used signals that sometimes meant no food at all. The pigeons still picked that side.

McDevitt et al. (2016) reviewed many bird studies. They concluded that "good-news" cues are so powerful they can wreck optimal choice.

Together these papers show the effect jumps species. From pigeons to children, signals can override the real payoff rate.

Why it matters

Your client may stick with problem behavior that only pays sometimes. Check what cues sit between the response and the reinforcer. A fun song, a flashing toy, or even your own smile can act as conditioned reinforcement. Strip those signals or move them to the better response. You might shift preference toward the reliable, adaptive choice.

FREE CEUs

Get CEUs on This Topic — Free

The ABA Clubhouse has 60+ on-demand CEUs including ethics, supervision, and clinical topics like this one. Plus a new live CEU every Wednesday.

✓ 60+ on-demand CEUs (ethics, supervision, general)

✓ New live CEU every Wednesday

✓ Community of 500+ BCBAs

✓ 100% free to join

Join The ABA Clubhouse — Free →

→ Action — try this Monday

Remove or swap the fun cue that marks the wait for problem behavior; pair it only with the adaptive response.

02At a glance

Intervention

not applicable

Design

theoretical

Finding

not reported

03Original abstract

We discuss Belke and Spetch's (1994) work on choice between reliable and unreliable reinforcement. The studies by Belke and Spetch extend a line of basic research demonstrating that under certain experimental conditions in a concurrent chains procedure, pigeons prefer an alternative that produces unreliable reinforcement. The authors describe the variables that influence preference for unreliable reinforcement, including the signaling and the duration of the reinforcement schedules, the context in which the signaling stimuli occur, and the effects of conditioned reinforcement. Hypothetical applied examples that address these variables are provided, and their influence on preference for unreliable reinforcement in humans is discussed. We conclude by suggesting a line of applied research to examine the relationship between these variables and a preference for unreliable reinforcement.

Journal of applied behavior analysis, 1995 · doi:10.1901/jaba.1995.28-389