Reinforced behavioral variability: Working towards an understanding of its behavioral mechanisms.
Reinforcing novelty itself produces more variation than reinforcing mere change.
01Research in Context
What this study did
The team worked with pigeons in an operant chamber. Birds had to peck four times on each trial. The order of left and right pecks had to meet a variability rule to earn food.
Some birds got food only when their four-peck pattern was new. Other birds got food just for switching patterns. The study asked which rule made the birds vary more.
What they found
Pigeons produced far more varied four-peck sequences when the rule said 'be different' than when it only said 'change'. Direct reinforcement of variability beat simple changeover rules.
The result supports the idea that variability itself can be reinforced, not just the act of switching.
How this fits with other research
Moxley (1989) set the stage with a percentile schedule that raised or lowered variability on cue. Doughty et al. (2015) sharpen the tool by showing a simple threshold rule works just as well.
Nergaard et al. (2020) push back. They argue that 'reinforced' variability is really the product of quick extinction of old forms, not true reinforcement. The two papers seem to clash, but they test different questions. H et al. look at what the bird does next; Nergaard et al. look at why the old form stops. Both can be true.
Dugdale et al. (2000) and Hopkinson et al. (2003) extend the same logic to people. Adolescents with autism and college students with depression both varied more when the contingency told them to. The animal rule travels well to human clinics.
Why it matters
If you want a client to try new responses—new play schemes, new sentences, new problem-solving steps—reinforce the novelty itself, not just the absence of the old response. Use a lag schedule or a simple 'show me something you haven’t done' rule. Pair it with a signal as Reed (2023) shows and the effect gets even stronger. Start small, count response forms, and deliver praise or tokens only when the form is new.
Want CEUs on This Topic?
The ABA Clubhouse has 60+ free CEUs — live every Wednesday. Ethics, supervision & clinical topics.
Join Free →Set a lag-2 rule during play: deliver praise only if the child stacks blocks in a new order each time.
02At a glance
03Original abstract
There is disagreement about how to characterize the environment-behavior relations involved in the reinforcement of behavioral variability. The present research examined some of these issues using food-maintained, 4-peck sequences in pigeons. Experiment 1 evaluated the claim that behavioral variability is not reinforced directly but, rather, is the byproduct of changing over within sequences. Considerably higher levels of behavioral variation occurred under a relative-frequency threshold contingency than under a contingency that required a changeover but not variability per se. These results are consistent with the argument that behavioral variability is reinforced directly. Experiment 2 assessed the effects on variation levels of manipulating inter-trial and inter-response intervals. Variability increased with longer inter-response intervals but not with longer inter-trial intervals. These results are consistent with multiple explanations, including the notion that remembering past behavior interferes with the emission of reinforced variation. Consequently, Experiment 3 examined more directly the relation between remembering and reinforced variation. Variation levels were not affected by a concurrent contingency that encouraged pigeons to remember their past behavior. The implications of this research are presented in the context of working towards an understanding of the environment-behavior relations involved in the reinforcement of behavioral variability.
Journal of the experimental analysis of behavior, 2015 · doi:10.1002/jeab.171