ABA Fundamentals

The power of one reinforcer: The effect of a single reinforcer in the context of shaping

Hunter et al. (2019) · Journal of the Experimental Analysis of Behavior 2019
★ The Verdict

One lone reinforcer after extinction can pull behavior back as strongly as a long payoff history.

✓ Read this if BCBAs who use shaping or extinction with humans in clinic or school.
✗ Skip if Practitioners who only run maintenance programs with dense schedules.

01Research in Context

01

What this study did

College students played a computer game with two objects. One object had a long history of paying off with points. The other object had just been put on extinction.

After the player stopped touching the dead object, the experimenter gave one lone reinforcer. Then they watched where the player spent time next.

02

What they found

That single point was enough to make players hang around the dead object as long as they did with the rich one.

One accidental payoff erased the extinction effect and pulled behavior back to the wrong target.

03

How this fits with other research

Hachiga et al. (2014) first saw this in rats. After one pellet, the rats returned to the lever that was now in extinction. The new study shows the same bounce-back happens with humans.

Cohen et al. (1993) said one reinforcer only gives a brief local spike. Their birds stayed on the rich key in the long run. Hunter’s humans, however, kept preferring the once-dead object for the rest of the session. The difference is timing: L looked at steady concurrent schedules, while Hunter used shaping and a long extinction pause before the lone payoff.

Jimenez-Gomez et al. (2025) add that reinforcers work like traffic signals. They tell the learner what is likely to pay next. The desperation-driven click is powerful because the learner reads it as "this spot is hot again."

04

Why it matters

If you are shaping a new response, one mistimed treat can lock in the wrong form. Wait for the exact topography you want before you click again. Check your data after each session; a sudden jump in old errors may be the footprint of an accidental reinforcer.

Free CEUs

Want CEUs on This Topic?

The ABA Clubhouse has 60+ free CEUs — live every Wednesday. Ethics, supervision & clinical topics.

Join Free →
→ Action — try this Monday

Track the last five responses before each reinforcer; if the wrong form sneaks in, withhold and reset instead of delivering the token.

02At a glance

Intervention
shaping
Design
single case other
Population
neurotypical
Finding
positive

03Original abstract

During shaping, if the organism is engaged in behaviors other than the current approximation, the amount of time between reinforcers increases. In these situations, the shaper may resort to what is referred to as a "desperation-driven click." That is, after a period of no reinforcement, the shaper delivers one reinforcer for a nontarget approximation. Reports from professional animal trainers suggest that the animal may continue performing this new behavior, even if it is reinforced only once. This study attempted to model this phenomenon with college students. Results from the study demonstrated that a desperation-driven click situation can be reliably produced in a controlled setting. When participants received one reinforcer for interacting with a new object following a period of no reinforcement, they interacted with the new object for a longer or equal amount of time as compared to an object that had a longer history of reinforcement. The results of this study have implications for the understanding of how reinforcement controls behavior.

Journal of the Experimental Analysis of Behavior, 2019 · doi:10.1002/jeab.517