Conditioning of two-response patterns of key pecking in pigeons.
Reinforcement can weld two responses into one new operant unit.
01Research in Context
What this study did
The team worked with pigeons in a small lab chamber. Each bird had one round key to peck.
Food followed only when the pigeon pecked twice in a set order, like left-then-right. All other pairs paid nothing.
The study asked: can reinforcement glue two pecks into one new operant, or is each peck always separate?
What they found
Every pigeon soon produced the reinforced two-peck pattern far more than any other pair.
The exact sequence became a new unit of behavior, not just two lucky single pecks that happened to land close in time.
How this fits with other research
Bacon-Prue et al. (1980) extends the idea. They showed pigeons can remember two-item sequences, but only if the test comes within half a second. Together the papers say: teach the chain quickly, then test it quickly.
Adams (1980) used the same birds and key to show pigeons can also count their own pecks. The 1979 study adds order to that number skill.
Stolz (1977) split key pecks by duration and punished only the long ones. The 1979 paper splits the same peck by place in a sequence and rewards only the right order. Both prove the key peck is not one blob; it can be carved by time, place, or count.
Why it matters
If you want a client to lock two responses into one smooth chain, reinforce the exact order right away. Do not wait to reinforce each move on its own. The chain itself can become the operant, just like the pigeons’ two-peck unit. Use tight timing and clear cues so the whole sequence, not the parts, earns the payoff.
Get CEUs on This Topic — Free
The ABA Clubhouse has 60+ on-demand CEUs including ethics, supervision, and clinical topics like this one. Plus a new live CEU every Wednesday.
Pick a two-step chain your learner already does halfway and deliver the reinforcer only when the steps occur in the exact order within three seconds.
02At a glance
03Original abstract
On discrete trials, two response keys were made available to hungry pigeons and food reinforcement depended on the order in which the required two key pecks occurred. In different phases, only one of the four possible two-peck sequences (left-left, left-right, right-left, and right-right) produced food reinforcement. In each case, the pigeons learned to perform the correct two-peck sequence more often than the incorrect sequences. Furthermore, the course of differentiation mastery indicated that both reinforcement history and response-reinforcer contiguity influenced performance. These results reveal that response patterns comprising two instances of the same response left-left and right-right) or instances of two different responses (left-right and right-left) may function as operants, thereby extending the generality of conditioning principles from discrete responses to structured sequences of behavior. These and other results are discussed in terms of contiguity-based and memory-based models of learning.
Journal of the experimental analysis of behavior, 1979 · doi:10.1901/jeab.1979.31-23