The role of adventitious reinforcement during differential reinforcement of other behavior: A systematic replication
DRO works because it accidentally makes other, safer responses pay off, not just because it stops paying for problem behavior.
01Research in Context
What this study did
Rey et al. (2020) asked why DRO cuts problem behavior. They ran three conditions with neurotypical adults. One group got DRO. One got food on a fixed clock no matter what. One got no food at all.
The team counted both the target response and any other responses people made. They wanted to see if DRO accidentally strengthens other, safer moves.
What they found
DRO gave the lowest rate of the problem response. It also gave the highest rate of other, incompatible responses.
The plain clock-food schedule did little. Extinction did less. The authors say DRO works partly because it sneaks extra value onto other behavior, not just because it stops paying for the target.
How this fits with other research
Rey et al. (2020) is a direct replication of their own 2020 long-duration study. Both labs show the same pattern: DRO beats non-contingent food. The second paper adds that the boost in other behavior fades once people spot the clock.
Attwood et al. (1988) ran the first pigeon test of DRO versus yoked VT. They also saw DRO win, but they credited the longer delay between response and food. The new study keeps that delay idea and adds the extinction group, updating the story.
Fantino (1969) looks like a clash. That old DRL paper found no power in accidental links. The gap is simple: DRL rewards long pauses, DRO rewards any pause. Species and schedule differ, so both findings can stand.
Why it matters
If you run DRO, remember it secretly pays for other, safer acts. You can boost this by picking reinforcers the client already loves and by making the timing easy to notice. When fading, watch for the drop in other behavior the lab saw after long exposure.
Want CEUs on This Topic?
The ABA Clubhouse has 60+ free CEUs — live every Wednesday. Ethics, supervision & clinical topics.
Join Free →Track any new, low-effort responses that pop up during DRO and give them extra praise so the accidental boost becomes intentional.
02At a glance
03Original abstract
Differential reinforcement of other behavior (DRO) is commonly used to decrease problem behavior by presenting reinforcers contingent upon the absence of a target response. Although it is well demonstrated that DROs decrease response rates, the processes producing these decreases are not well understood. The present study systematically replicated previous research assessing whether adventitious reinforcement of alternative behavior contributes to the effectiveness of DRO. We presented university students with two options on a computer and reinforced target responding on a variable-ratio schedule. Next, we compared decreases in target-response rates and any increases in alternative responding during DRO schedules versus yoked variable-time schedules or extinction probes. DRO schedules resulted in the lowest target-response rate and highest alternative-response rate. These findings generally provide some support for the adventitious reinforcement of "other" behavior.
Journal of Applied Behavior Analysis, 2020 · doi:10.1002/jaba.678