Acquisition of a spatially defined operant with delayed reinforcement.
Delayed reinforcement alone can create new lever pressing in rats; an immediate bridging stimulus speeds the process but is not required.
01Research in Context
What this study did
Researchers put rats in a box with one lever. Each lever press earned food, but the food arrived 30 seconds later. A tone could sound right after the press. The team asked: Will the rats learn the lever press at all with this long wait?
They also tested if the immediate tone helps the rats connect press to payoff. Some rats got the tone every press. Others got only the delayed food. The scientists counted how fast each group learned to press.
What they found
Every rat learned to press, even without the tone. The 30-second delay did not block new learning. It just took more sessions than typical immediate-reinforcement studies.
The group with the immediate tone picked up the skill faster. The tone acted like a tiny bridge across the delay. Still, the rats without any tone reached the same final performance.
How this fits with other research
Davis et al. (1994) repeated the idea with Siamese fighting fish. The fish swam through a ring for delayed visual reinforcement. Both studies show the same core point: new responses can lock in even when payoff is late.
LeSage et al. (1996) added d-amphetamine to the same rat setup. The drug did not stop learning under 8-s delays unless the dose was so high it disrupted all movement. Together the papers stretch the finding across species and drug states.
Byrne et al. (2019) moved past acquisition. They showed that once the response exists, its duration also comes under delayed-reinforcement control. The 1993 study opened the door; later work walked through it.
Why it matters
If you run early intervention, remember that a reinforcer can still work even when it cannot follow the target within seconds. A brief praise or token delivered 20–30 s later can shape a brand-new skill, especially if you give an immediate bridge like a click or “nice.” The rat data say the bridge helps, but the absence of it does not doom the program. Keep teaching; the learner can still connect the dots.
Want CEUs on This Topic?
The ABA Clubhouse has 60+ free CEUs — live every Wednesday. Ethics, supervision & clinical topics.
Join Free →Try adding a brief click or praise right after the target response while keeping the actual treat delivery at 20–30 s; measure if acquisition speeds up versus baseline.
02At a glance
03Original abstract
Two experiments investigated the role of an immediate, response-produced auditory stimulus during acquisition, via delayed reinforcement, of a response selected to control for possible unprogrammed, operandum-related sources of response feedback. Experimentally naive rats were exposed to a delayed-food reinforcement condition, specifically a tandem fixed-ratio 1 differential-reinforcement-of-other-behavior 30-s schedule. The response was defined as breaking a photocell beam located near the ceiling at the rear of the operant conditioning chamber. In Experiment 1, rates of photobeam breaking by each rat increased from near zero, regardless of the presence or absence of a tone that immediately followed the response initiating the delay interval. Though not essential, the tone facilitated response acquisition and resulted in more efficient response patterns at stability. Experiment 2 demonstrated that photobeam-breaking response rates under the delayed reinforcement contingency exceeded those in a preceding baseline condition in which no food was delivered. In addition, upon introduction of the delayed reinforcement procedure, correspondence between response patterns and the requirements of the reinforcement schedule increased over baseline levels in the absence of a food contingency. Together with a previous report of Lattal and Gleeson (1990), the present results suggest that response acquisition with delayed reinforcement is a robust phenomenon that may not depend on a mechanically defined response or an immediate external stimulus change to mediate the temporal gap between response and reinforcer.
Journal of the experimental analysis of behavior, 1993 · doi:10.1901/jeab.1993.59-373