ABA Fundamentals

Conditioned reinforcement value and choice.

Preston et al. (1991) · Journal of the experimental analysis of behavior 1991
★ The Verdict

A stimulus becomes a stronger conditioned reinforcer when it sharply reduces the time to the next backup reinforcer.

✓ Read this if BCBAs who use tokens, points, or praise to bridge delays to primary reinforcement.
✗ Skip if Practitioners who deliver only immediate, primary reinforcers with no bridging stimuli.

01Research in Context

01

What this study did

The team used pigeons in a two-key choice box. One key led to food after a short delay. The other key led to food after a longer delay.

A light or sound marked each path. The birds could peck either key at any time. The researchers watched which key the birds chose most.

02

What they found

The birds switched their pick as the overall rate of food changed. When food came faster on one side, the signal for that side became stronger conditioned reinforcement.

The results matched the delay-reduction rule: a signal is valuable when it cuts wait time for food.

03

How this fits with other research

Emmelkamp et al. (1986) ran a near-copy test and saw the same shift, giving extra proof for the rule.

Eisenmajer et al. (1998) added unsignaled 3-s delays and saw choice drop. That seems to clash, but the delay was hidden, so the signal lost its power—no contradiction.

Vos et al. (2013) showed that even short, signaled delays hurt fixed-ratio response rate. They moved the idea from choice tasks to everyday work schedules.

04

Why it matters

The rule is simple: the worth of a praise, token, or light depends on how much it shortens the wait for backup reinforcers. In your session, keep praise immediate or make sure it clearly predicts fast reinforcement. If reinforcement must be delayed, insert a brief signal and keep the overall rate of reinforcement high. This keeps conditioned reinforcers strong and learner engagement steady.

Free CEUs

Want CEUs on This Topic?

The ABA Clubhouse has 60+ free CEUs — live every Wednesday. Ethics, supervision & clinical topics.

Join Free →
→ Action — try this Monday

Track how many seconds each praise statement saves the learner from waiting for the real item; keep that saved time as short as possible.

02At a glance

Intervention
not applicable
Design
other
Finding
positive

03Original abstract

The delay-reduction hypothesis of conditioned reinforcement states that the reinforcing value of a food-associated stimulus is determined by the delay to primary reinforcement signaled by the onset of the stimulus relative to the average delay to primary reinforcement in the conditioning situation. In contrast, most contemporary models of conditioned reinforcement strength posit that the reinforcing strength of a stimulus is some simple function only of the delay to primary reinforcement in the presence of stimulus. The delay-reduction hypothesis diverges from other conditioned reinforcement models in that it predicts that a fixed-duration food-paired stimulus will have different reinforcing values depending on the frequency of its presentation. In Experiment 1, pigeons' key pecks were reinforced according to concurrent-chains schedules with variable-interval 10-second and variable-interval 20-second terminal-link schedules. The initial-link schedule preceding the shorter terminal link was always variable-interval 60 seconds, and the initial-link schedule requirement preceding the longer terminal link was varied between 1 second and 60 seconds across conditions. In Experiment 2, the initial-link schedule preceding the longer of two terminal links was varied for each of three groups of pigeons. The terminal links of the concurrent chains for the three groups were variable-interval 10 seconds and 20 seconds, variable-interval 10 seconds and 30 seconds, and variable-interval 30 seconds and 50 seconds. In both experiments, preference for the shorter terminal link was either a bitonic function or an inverse function of the initial-link schedule preceding the longer terminal-link schedule. Consistent with the predictions of the delay-reduction hypothesis, the relative values of the terminal-link stimuli changed as a function of the overall frequency of primary reinforcement. Vaughan's (1985) melioration model, which was shown to be formally similar to Squires and Fantino's (1971) delay-reduction model, can be modified so as to predict these results without changing its underlying assumptions.

Journal of the experimental analysis of behavior, 1991 · doi:10.1901/jeab.1991.55-155