ABA Fundamentals

Probability of response and probability of reinforcement in a response-defined analogue of an interval schedule.

Millenson (1966) · Journal of the experimental analysis of behavior 1966
★ The Verdict

VI schedule quirks come from the learner’s response chain habit crossed with the arranged payoff odds.

✓ Read this if BCBAs who write or adjust reinforcement schedules in skill-acquisition programs.
✗ Skip if Clinicians focused only on token boards or fixed-ratio fluency drills.

01Research in Context

01

What this study did

Nigro (1966) built a tiny computer model of a pigeon. The model pecked in chains. Each chain length had its own chance of food.

By changing those chances, the team made the model act like real birds on a variable-interval (VI) schedule. They wanted to see if VI patterns pop out from two simple things: the bird’s built-in chain bias and the set food odds.

02

What they found

The model did copy real VI performance. Long chains plus low food odds made steady, slow responding. Short chains plus higher odds made faster bursts.

The study says VI patterns are not magic. They are the product of the animal’s own response style mixed with the experimenter’s reinforcement probabilities.

03

How this fits with other research

Iwata et al. (1990) later showed the same math breaks if extra food waits outside the session. Their open-economy birds ignored the model’s neat rate curve, proving the interaction only holds in a closed economy.

Lecavalier et al. (2006) went further. They gave rats inverted-U feedback functions and found the animals climbed to the peak. This optimization view keeps the probability idea but adds a hill-climbing rule, so it partly supersedes the 1966 account.

Adkins et al. (1997) added history power. Birds trained on DRH first kept their high rate even when switched to VI. The 1966 model missed that past probabilities can override current ones.

04

Why it matters

You now have a quick lens for puzzling response patterns. Ask two questions: What chain length is this learner prone to emit? And what are the true odds of payoff right now? If the rate looks weird, check economy, history, and feedback shape before you tweak the schedule. Use closed economies during teaching, probe for DRH/DRL history in assessments, and remember that optimization beats simple probability when reinforcers are scarce.

Free CEUs

Want CEUs on This Topic?

The ABA Clubhouse has 60+ free CEUs — live every Wednesday. Ethics, supervision & clinical topics.

Join Free →
→ Action — try this Monday

Count the client’s average response run length, then match your VI timer so the payoff odds fit that chain length.

02At a glance

Intervention
not applicable
Design
single case other
Population
not specified
Finding
positive

03Original abstract

Variable interval (VI) responding was hypothesized to be a function of differential reinforcement susceptibilities of various unspecified behavior chains that mediate interresponse times (IRTs). To test this hypothesis, probabilities of reinforcement were regulated for the lengths of chains of key pecking responses of pigeons, analogous to the way that VI regulates probabilities of reinforcement for IRTs. This procedure generated a number of VI-like effects, supporting the notion that VI behavior can be construed as a special case of an interaction between the organism's function relating reinforcement susceptibilities to chain length and the experimenter's function relating probabilities of reinforcement to chain length.

Journal of the experimental analysis of behavior, 1966 · doi:10.1901/jeab.1966.9-87