Graded differential reinforcement: Response-dependent reinforcer amount.
Let the learner earn bigger rewards with more responses—and keep a clear "collect" action— to boost effort without hurting value.
01Research in Context
What this study did
The team worked with pigeons in a small lab chamber.
Each peck on the key added 0.25 seconds of grain.
More pecks meant longer grain time.
The birds could control how big the snack was.
A second group got the same grain times no matter how much they pecked.
What they found
Birds worked harder when grain time grew with every peck.
They pecked more than the yoked birds who got identical grain no matter what.
Linking reinforcer size to response count pushed the behavior up.
How this fits with other research
King et al. (1990) later tested adult humans with points instead of grain.
They saw the same boost when people had to press a button to collect the points.
The idea crosses species: let the learner do a quick "collect" action and the amount matters more.
Pliskoff et al. (1972) also used pigeons and differential reinforcement.
They showed that reinforcing high rates in one setting can speed up responding in other settings.
Both papers tell us to watch for spill-over when we tie bigger rewards to faster or more responses.
Fortes et al. (2015) seems to disagree at first.
They found that making birds peck a lot during a delay made the big reward less attractive.
The key difference is timing: D et al. tied amount to the main response, while Inês et al. piled extra work into the wait.
Effort during the task helps; effort during the delay hurts.
Why it matters
You can use graduated reinforcement when you want more effort from a client.
Let each correct response earn a slightly bigger slice of preferred item.
Keep the collection step visible: have the learner drop a token in a jar or press a button to hear the tally.
Watch for induction: if you reinforce fast math facts, check that other tasks don’t speed up too much.
Finally, don’t load extra responses into wait time; save the work for the target behavior itself.
Want CEUs on This Topic?
The ABA Clubhouse has 60+ free CEUs — live every Wednesday. Ethics, supervision & clinical topics.
Join Free →Count correct responses out loud and add one extra Lego piece to the pile for each one, then let the client drop them into the box.
02At a glance
03Original abstract
After key pecking had been autoshaped, six pigeons were exposed to a condition in which the duration of grain availability at the end of an 8-second trial depended on the number of responses emitted during the trial (0.25-second access to grain per response). This procedure, called correlated reinforcement, alternated across conditions with the automaintenance baseline in which the 8-second trial terminated with a constant 2.5-second access to grain. Two control procedures were run; in both, the reinforcer durations were yoked to those obtained in the last correlated session. In the yoked control no responses were required, but in the single-response yoked control at least one response was required to receive the yoked duration. The correlated condition maintained response rates above those produced by the two control conditions. These results may be accounted for by differential reinforcement.
Journal of the experimental analysis of behavior, 1984 · doi:10.1901/jeab.1984.41-27