Reinforcement contingencies as discriminative stimuli: II. Effects of changes in stimulus probability.
The schedule that just paid off can itself become the cue that bends the next choice.
01Research in Context
What this study did
Pigeons pecked keys in a matching-to-sample task. The sample was not a light or a shape. It was the schedule that had just ended: DRL or DRO. The birds had to pick the comparison key that matched the schedule they had just worked under. The twist: the chance of seeing DRL next was changed across conditions.
What they found
When DRL became more likely, pigeons biased their picks toward the DRL side. The schedule itself acted like a color or a shape, guiding the next choice. The data showed a clean probability-based bias, even though the task stayed the same.
How this fits with other research
Lattal (1975) ran the same DRL/DRO matching task first. That paper proved the idea worked; this 1979 paper asks what happens when you mess with the odds. Reberg et al. (1979) also worked with pigeons and bias, but used signal-detection instead of matching-to-sample. Both labs saw the same pattern: more reinforcement for one choice pulls responses that way, even when the real signal does not change. Burgio et al. (1986) looks like a clash at first—they say delays hurt discriminability. No conflict: they tested memory gaps, while Crowley (1979) tested probability gaps. Different variables, different stories.
Why it matters
Your client may pick the red card more often not because red is easier to see, but because red has won more often lately. Before you add extra prompts or new teaching steps, check the payoff odds across choices. Balance the reinforcement rates and watch the bias fade.
Want CEUs on This Topic?
The ABA Clubhouse has 60+ free CEUs — live every Wednesday. Ethics, supervision & clinical topics.
Join Free →Count the last 20 reinforcers for each choice in your matching task; rebalance if one side tops 70%.
02At a glance
03Original abstract
Three pigeons were trained on a matching procedure involving a sample component and a choice component. Responding in the sample component, according to either a differential-reinforcement-of-low-rate schedule on some trials or a differential-reinforcement-of-other-behavior schedule on other trials, produced access to the choice component in which each of two keys was illuminated with a unique color. The correct choice response was defined by the contingency that was met to produce the choice. The food hopper operated for 1.5 seconds following an appropriate sample response and for 3 seconds following a correct choice response. A signal-detection analysis showed that variations in the probability of presentation of the different contingencies systematically affected response bias but not sensitivity to the contingencies as stimuli. Substitution of a blackout for food at the end of the sample component did not differentially affect performance, but elimination of the delay between sample and choice components generally increased the sensitivity measure. The findings suggest a role for reinforcement contingency discrimination in schedule-controlled responding.
Journal of the experimental analysis of behavior, 1979 · doi:10.1901/jeab.1979.31-15