Preference for qualitatively different reinforcers.
One simple choice equation predicts how any learner will split time between wildly different rewards.
01Research in Context
What this study did
Researchers let pigeons peck two keys. One key gave food pellets. The other gave mild brain stimulation.
Both rewards were set on the same VI schedule. The birds could switch keys at any time.
What they found
The birds spread their pecks the same way no matter which reward they were working for.
The matching law still fit. Different rewards behaved like different flavors of the same thing.
How this fits with other research
Smith et al. (1997) later asked children with autism to pick one top toy or a rotating box of so-so toys. About half the kids chose the variety, showing the same unified preference rule in people.
Norris et al. (2024) used the same logic to rank the social, escape, and sensory pay-offs that keep problem behavior alive. One quick choice test showed which reinforcer really drove each child.
Allan et al. (1994) swapped reward type for reward reliability. Pigeons still followed the matching law, proving the rule holds across quality, variety, and certainty.
Why it matters
You can treat any reinforcer—cookies, praise, screen time, break from work—as points on the same scale. Run a quick concurrent choice test, graph the split, and you have a single preference score you can trust. No need to build separate systems for edible, social, or sensory rewards.
Want CEUs on This Topic?
The ABA Clubhouse has 60+ free CEUs — live every Wednesday. Ethics, supervision & clinical topics.
Join Free →Place two options on the table—one edible, one activity—count the first 10 spontaneous approaches, and let the split guide your next reinforcement schedule.
02At a glance
03Original abstract
Three pigeons were studied under two-key concurrent variable-interval schedules with food as the reinforcer on one key and ectostriatal brain stimulation as the reinforcer on the other. Brain-stimulation parameters were kept constant while the rate of food reinforcement availability was varied. The results showed that qualitatively different reinforcers could be handled in the same theoretical framework that applies when choice is between different rates, immediacies, and amounts of a single reinforcer.
Journal of the experimental analysis of behavior, 1971 · doi:10.1901/jeab.1971.16-375