FURTHER EXPERIMENTS ON PROBABILITY-MATCHING IN THE PIGEON.
A single procedural tweak—like adding an error-correction trial—can flip choice from probability matching to exclusive maximizing.
01Research in Context
What this study did
Researchers put pigeons in a box with two keys.
Each key gave grain on a different schedule.
Some birds got a correction trial after wrong pecks.
Others saw both keys light up at the same time.
The team changed these small rules across days.
What they found
Birds did not stick to one rule.
With correction trials they almost always picked the richer key.
Without correction they split pecks to match the payoff odds.
A tiny switch—do-over or no do-over—flipped the whole pattern.
How this fits with other research
Mellitz et al. (1983) later showed one hill-climbing rule can explain both looks.
The birds just took the best local option moment to moment.
Schenk et al. (2020) found the same math fits kids shooting 2- or 3-point shots in a video game.
Same equation, feathers or fingers.
Frank-Crawford et al. (2018) also saw choice swing when work demands changed.
Together the papers say: watch the task details, not just the numbers.
Why it matters
Your client may “match” or “maximize” depending on tiny setup cues.
If you give an error-correction trial after a wrong response, you may push the child toward the richer side every time.
Remove that trial and you might see a 70-30 split that mirrors the payoff rate.
Check your procedure before you label a choice pattern as a trait.
Want CEUs on This Topic?
The ABA Clubhouse has 60+ free CEUs — live every Wednesday. Ethics, supervision & clinical topics.
Join Free →Turn off error-correction for one block and on for another; graph if the learner’s split shifts toward matching or maximizing.
02At a glance
03Original abstract
In a discrete-trials, two-key choice situation, probability-learning by pigeons was studied under a variety of training conditions. Matching was found in simultaneous and in successive problems, but a spatial problem produced only maximizing. In the simultaneous problem, noncorrection produced maximizing, while correction produced matching. Guidance produced maximizing when the animals were required to earn each opportunity for choice by pecking a center key on FR-5, but matching when the center key was not used. In a discrete-trials one-key situation, with latency as the measure, frequency and probability of reinforcement were varied independently. Differences in probability produced differences in latency of response, but differences in frequency did not.
Journal of the experimental analysis of behavior, 1964 · doi:10.1901/jeab.1964.7-151