Choice between reliable and unreliable outcomes: mixed percentage-reinforcement in concurrent chains.
Learners pick the path that gives faster, surer reinforcement, so keep delays short and payoff odds high in your programs.
01Research in Context
What this study did
Green et al. (1987) let pigeons pick between two keys in a two-part chain.
One key always paid off. The other key paid off only half the time.
The team then changed how long each part of the chain lasted to see if delay altered the birds’ preference.
What they found
The birds strongly favored the sure-thing key.
When the wait times in the chain grew, the preference for the reliable key grew with them.
The results fit an extended delay-reduction model: shorter wait to sure food equals stronger choice.
How this fits with other research
Mazur et al. (1992) ran a near-copy of the task but mixed up the number of links instead of the payoff odds. They still saw a pull toward the risky side, yet adding a green light cue wiped the pull out. Together the studies show both probability and signals steer choice.
Martin et al. (1997) kept the 50 % vs 100 % comparison and simply inserted a five-second gap before the food signal. That small gap cut the birds’ risky preference in half, proving the signal itself acts as a conditioned reinforcer.
Anonymous (1995) also used the 50 % vs 100 % setup and found the risky key won only when the delays were short. Once delays lengthened or a gap was added, preference flipped to the sure key. The pattern lines up with Green et al. (1987) and underlines the same rule: delay controls the strength of preference.
Why it matters
For BCBAs the lesson is plain: clients will gravitate toward the option that cuts wait time and delivers sure reinforcement. If you want a student to pick the new task, keep the delay to reinforcement short and the payoff odds high. Check your schedules—adding extra steps or uncertain rewards can quietly weaken the power of your program.
Want CEUs on This Topic?
The ABA Clubhouse has 60+ free CEUs — live every Wednesday. Ethics, supervision & clinical topics.
Join Free →Audit your token system: replace any 50 % payoff slots with 100 % payoff and watch engagement rise.
02At a glance
03Original abstract
Pigeons' choices between alternatives that provided different percentages of reinforcement in mixed schedules were studied using the concurrent-chains procedure. In Experiment 1, the alternatives were terminal-link schedules that were equal in delay and magnitude of reinforcement, but that provided different percentages of reinforcement, with one schedule providing, reinforcement twice as reliably as the other. All pigeons preferred the more reliable schedule, and their level of preference was not systematically affected by variation in the absolute percentage values, or in the magnitude of reinforcement. In Experiment 2, preference for a schedule providing 100% reinforcement over one providing 33% reinforcement increased systematically with increases in the duration of the terminal links. In contrast, preference decreased systematically with increases in the duration of the initial links. Experiment 3 examined choice with equal percentages of reinforcement but unequal delays to reinforcement. Preference for the shorter delay to reinforcement was not systematically affected by variation in the absolute percentage of reinforcement. The overall pattern of results supported predictions based on an extension of the delay-reduction hypothesis to choice procedures involving mixed schedules of percentage reinforcement.
Journal of the experimental analysis of behavior, 1987 · doi:10.1901/jeab.1987.47-57