Two-component schedules of differential-reinforcement-of-low-rate.
Under two-component DRL, prior reinforced vs non-reinforced responses become discriminative stimuli that control the length of the next pause.
01Research in Context
What this study did
Brinker et al. (1975) worked with pigeons on a two-part DRL schedule. Part one required a short pause between pecks. Part two required a longer pause.
The birds switched back and forth between the two parts many times per session. The key color told them which pause rule was active.
What they found
The pigeons matched their pause times to whichever rule was currently in force. Their previous reinforced, or non-reinforced, pecks acted like traffic lights that told them how long to wait next.
In plain words, the birds used their own reinforcement history as a cue to time the next response.
How this fits with other research
Hart et al. (1974) saw the opposite: when DRL requirements got longer, lever pressing and licking dropped while wheel running shot up. The two studies seem to clash, but the difference is the task. B et al. used one long DRL value at a time and measured side behaviors. P et al. rapidly alternated two short DRL values and looked at timing control.
LATIES et al. (1965) showed rats grow extra behaviors like tail nibbling to fill long DRL gaps. P et al. add the idea that the outcome of the last response itself becomes the cue for the next pause.
Newman et al. (1991) later used the same fine-grained pause analysis to separate drug effects. They built on P et al.'s method to see whether medicines improve timing or just scatter responses.
Why it matters
If you run DRL to reduce rapid calling or hand flapping, remember the client's last response is now a signal. Reinforce the correct pause immediately so that "success" becomes the cue for the next correct pause. Switching between two DRL values within one session, signaled by color or place, can sharpen this stimulus control and speed up learning.
Want CEUs on This Topic?
The ABA Clubhouse has 60+ free CEUs — live every Wednesday. Ethics, supervision & clinical topics.
Join Free →Begin a two-color DRL session: green card for 3-s pause, red card for 6-s pause, and deliver praise right after the target pause so the last success cues the next wait.
02At a glance
03Original abstract
Two-component schedules of differential-reinforcement-of-low-rate were presented, where the contingencies specified separately two minimum interresponse times, t(1) and t(2), required for reinforcement, depending on whether the interresponse time was initiated by, in one case, a reinforced response (t(1)) or, in the other, a nonreinforced response (t(2)). A distinctive pattern of responding developed on each of the two contingencies. Duration of an interresponse time approximated t(1) when the t(1) contingency was in effect, and t(2) when the t(2) contingency was in effect. This relationship persisted even when t(2) was shorter than t(1), and responding at a higher rate on the t(1) contingency would have greatly increased the rate of reinforcement. Increasing the value of t(2) resulted in both longer interresponse times on the t(1) contingency, and a higher probability of a response-burst on those occasions when the contingency switched from t(1) to t(2). The results indicated that both reinforced and nonreinforced responses functioned as discriminative events in determining the duration of following interresponse times.
Journal of the experimental analysis of behavior, 1975 · doi:10.1901/jeab.1975.24-33