ABA Fundamentals

Immediate reinforcement in delayed reward learning in pigeons.

Winter et al. (1982) · Journal of the experimental analysis of behavior 1982
★ The Verdict

A half-second cue right after the response locks in learning even when the real payoff is still far away.

✓ Read this if BCBAs teaching new skills to kids or staff who can't get instant edible rewards.
✗ Skip if Clinicians already using micro-second clicker timing with flawless results.

01Research in Context

01

What this study did

Researchers worked with pigeons in a small lab box. The birds pecked a key for food, but the food came after a long wait.

Right after each peck, a brief light or sound flashed. The flash lasted either 0.33 seconds or 10 seconds. The team asked: does the length of this tiny cue change how fast the birds learn?

02

What they found

Birds with the 0.33-second cue learned the game in fewer sessions. Birds with the 10-second cue took longer.

The short cue acted like a tiny piece of candy. It kept the pigeon working even though the real food was still far away.

03

How this fits with other research

Griesi-Oliveira et al. (2013) ran a similar test with rats. They also saw that longer delays between response and cue slowed learning. The pigeon and rat data line up: brief cues work best.

Bacon-Prue et al. (1980) showed that pigeons forget two-step sequences after just 2-4 seconds. The 1982 study adds that even a single cue loses power if it lasts too long.

Together, these papers say the same thing: keep the bridge short. Whether it's a flash, click, or spoken "good," deliver it fast.

04

Why it matters

When you shape a new skill, the praise or click must come right away. If you must delay food, slip in a tiny immediate signal first. A half-second "good" or point card can hold the learner's behavior until the bigger reward arrives. Think of it as a behavioral bridge you build in 0.33 seconds.

Free CEUs

Want CEUs on This Topic?

The ABA Clubhouse has 60+ free CEUs — live every Wednesday. Ethics, supervision & clinical topics.

Join Free →
→ Action — try this Monday

Cut your praise to under one second when you must delay the edible or token.

02At a glance

Intervention
not applicable
Design
single case other
Population
not specified
Finding
positive

03Original abstract

Pigeons were trained on simultaneous red-green discrimination procedures with delayed reward and sequences of stimuli during the delay. In Experiment 1, three stimuli appeared during the 60-second intervals between the correct responses and reward, and the incorrect responses and nonreward. The stimulus that immediately followed a correct response also preceded nonreward, and the stimulus that followed an incorrect response preceded reward. These stimuli were 10 or .33 second in duration for different groups. Stimuli during the remainder of the delay interval differed following correct and incorrect responses. Group 10 initially persisted in the nonrewarded choice, but shifted to a preponderance of rewarded responses after further training. Group .33 rapidly acquired the correct response. Similar results were obtained in Experiment 2 where delay intervals consisted of opposite sequences of two stimuli of equal duration and total delays were 6, 20, or 60 seconds. Early in training, generalization of differential conditioned-reinforcing properties from the conditions preceding reward and nonreward to postchoice conditions had a greater effect relative to backchaining than it did later. It was concluded that delayed-reward learning is best analyzed in terms of the conditioned-reinforcing value of the patterns of cues that follow immediately after rewarded and nonrewarded responses.

Journal of the experimental analysis of behavior, 1982 · doi:10.1901/jeab.1982.38-169