Matching in a network.
Old reward patterns can keep steering choice after the payoffs stop.
01Research in Context
What this study did
Glover et al. (1976) worked with pigeons in a lab network. Birds first pecked on two keys that paid off at different rates.
After training, the team turned off all rewards. They watched whether the birds' response splits still followed the matching law.
What they found
During training, response ratios lined up with reward ratios — classic matching.
When rewards stopped, the birds did not drift toward equal pecking. They kept favoring the key that had paid more, even though it now paid nothing.
How this fits with other research
Shimp (1967) showed matching can appear without maximizing by reinforcing rare four-peck chains. Glover et al. (1976) go further: once matching forms, the bias can outlast the payoff.
Stubbs et al. (1970) found that local burst patterns drive matching. The 1976 study reveals those bursts leave a lasting footprint; birds still lean toward the once-rich side after extinction.
White (1979) seems to disagree. Adding an extinction key inside one alternative did not shift matching. The gap is procedural: M kept some rewards flowing; J removed them all. Remove every payoff and the old rich side keeps the edge.
Why it matters
Your client's history can lock in preference even when current rewards change. If a child always got bigger praise for math than reading, that bias may survive when you thin reinforcement. Probe unreinforced trials to see the true leftover preference, then re-balance your delivery rates to offset the historical pull.
Want CEUs on This Topic?
The ABA Clubhouse has 60+ free CEUs — live every Wednesday. Ethics, supervision & clinical topics.
Join Free →Run five unreinforced probe trials; note which alternative still wins and adjust your reinforcement rates to counter the historical bias.
02At a glance
03Original abstract
Pigeons were given practice choosing between pairs of alternatives yielding different frequencies of reinforcement. Four individual alternatives were set into four pairwise choices. Averaged over subjects, the distribution of responses in each choice approximated matching. The four individual alternatives were then presented, two by two, in two pairwise choices for which there had been no initial practice. No further reinforcement was given during the tests with the new pairs. Transfer to the two test pairs deviated systematically from matching in most cases by exaggerating the preference for the alternative that had had the higher frequency of reinforcement.
Journal of the experimental analysis of behavior, 1976 · doi:10.1901/jeab.1976.26-143