Token Economy: A Practitioner's Guide to Token Reinforcement, Exchange Schedules, and Schedule Thinning
A token economy is a generalized conditioned reinforcement system in which a learner earns neutral, portable items — points, chips, stickers, app icons, level-system marks — contingent on target behavior, then exchanges accumulated tokens for backup reinforcers from a pre-defined menu degli Espinosa et al. (2024). It is one of the oldest and most heavily evidenced reinforcement arrangements in applied behavior analysis, descending in a direct line from Ayllon and Azrin's 1968 hospital ward work and now spanning classrooms, outpatient clinics, residential schools, day programs, prisons, zoo employee training, and home-based parent programs Hackenberg (2018) Pritchard et al. (2018) Vergason & Gravina (2020) Pascale et al. (2025) degli Espinosa et al. (2020). The practical job for a BCBA, RBT, or classroom team is not "deploy tokens" — it is to specify a token-production schedule, an exchange schedule, a backup-reinforcer menu, and a thinning plan tight enough that a second clinician could replicate it without you in the room, then verify experimentally that the tokens you've chosen actually function as reinforcers for that specific learner.
01What the Research Says
What a token economy actually is in 2026 practice
The modern operating definition comes out of the experimental analysis of behavior: tokens are generalized conditioned reinforcers — discriminative stimuli for a delayed exchange — that derive value from a defined contingency between earning, accumulating, and trading them for backup reinforcers Hackenberg (2018). Hackenberg's translational review reframes any token system as a multi-schedule arrangement with three programmable layers: token-production schedule (responses per token), exchange-production schedule (responses or tokens to access cash-in), and exchange ratio (tokens per backup reinforcer) Hackenberg (2018). The dominant failure mode in field implementations is treating the token economy as a monolithic procedure rather than a stack of programmable schedules Hackenberg (2018). The evidence-based recommendations paper translates that frame into two practitioner roadmaps: Part 1 for building from scratch, Part 2 for diagnosing and repairing a system that has drifted degli Espinosa et al. (2024).
The conditioning step is non-optional
Tokens are not inherently reinforcing. They acquire reinforcer value only by reliable pairing with established backup reinforcers, and any system that skips this step is using unconditioned neutral stimuli to shape behavior degli Espinosa et al. (2024). Bonfonte, Bourret, and Lloveras' progressive-ratio comparison is unusually direct: after establishing novel tokens through a tandem-control assessment, response output for the tokens themselves was reliably weaker than for high-preference edible backup reinforcers, indicating even successfully conditioned tokens can lag their best backup items Bonfonte et al. (2020). Fiske and colleagues' multiple-schedule reinforcer assessment found tokens equivalent to primary reinforcers for half of participants and meaningfully lower for the other half Fiske et al. (2020). Empirical reinforcer assessment of the tokens themselves should sit upstream of clinical deployment. When tokens come back weak, the repair levers are denser token-production schedules, more pairing trials, or temporary fallback to direct primary reinforcement Bonfonte et al. (2020) Fiske et al. (2020).
Token selection is itself a procedural variable
The physical token must be deliverable in roughly a second, durable enough to survive the session, unmistakable from non-token stimuli, and difficult to access non-contingently. Beahm and colleagues' multiple-probe study replaced physical chips with an iPad-based ClassDojo token board for adults with developmental disabilities in a day-habilitation program; engagement in daily-living and vocational tasks rose immediately and was maintained after token thinning, and the app removed the materials-management problem that often kills physical systems Beahm et al. (2023). Radogna and colleagues' Italian vocational study used portable physical tokens — exchanged at 5:1 for choice backup items — embedded inside BST for workplace social skills, with tokens portable enough to be delivered the moment a correct, unprompted response occurred at the worksite (Radogna et al., 2024). Token form follows token deployment: the highest-value selection criterion is "can this be delivered at a one-second latency in the actual setting where the behavior happens?"
Token-production and exchange schedules are the two main control surfaces
Every applied token system embeds two ratio schedules in series — token-production (response to token) and exchange-production (token to backup reinforcer). Argueta, Leon, and Brewer compared FR and VR exchange-production schedules (FR 2/5/10 vs VR 2/5/10) for a child with autism, holding token-production at FR 1; response output was statistically equivalent across schedule types, but a concurrent-chains preference probe showed the participant exclusively chose VR Argueta et al. (2019). Falligant, Pence, and Bedell ran the analogous comparison at the exchange-distribution level: accumulated (save-up) versus distributed (immediate) exchanges. Preference is conditional — children preferred accumulated when tokens were earned for easy tasks or on dense schedules, and shifted toward distributed when tasks were hard or token-production was lean Falligant et al. (2020). The clinical implication: response output is largely insensitive to within-band schedule choice, while learner preference is highly sensitive — treat schedule design as a preference variable and probe with concurrent-chains arrangements rather than guess Argueta et al. (2019) Falligant et al. (2020).
Flexible earning requirements solve the post-reinforcement pause problem
Fixed earning requirements — "you need exactly five tokens before you trade" — produce a scallop: behavior accelerates near the terminal token and drops after exchange. Cihon and colleagues' non-concurrent multiple-baseline across three boys with autism evaluated a flexible token system in which the number of tokens needed before exchange varied session-by-session based on prior responding Cihon et al. (2019). The flexible requirement maintained high levels of academic and social-initiation responding without the post-reinforcement pause typical of fixed schedules Cihon et al. (2019). The procedural rule: set each session's requirement at the previous session's mean (or a stretch above it), document on the data sheet, and adjust upward only when responding is stable Cihon et al. (2019).
Tokens behave as economic commodities, not just reinforcers
A growing strand of the corpus treats tokens as commodities subject to demand laws. Andrade and Hackenberg's pigeon study manipulated the response cost ("price") of one token type while leaving an alternative token available; subjects produced fewer of the high-priced token (own-price elasticity) and more of the alternative token (cross-price elasticity), demonstrating that generalized tokens function as substitutable commodities in a closed economy Andrade & Hackenberg (2017). Wan, Tan, and Hackenberg extended this to demand for token-holding itself: across six within-subject reversals, accumulation was highest when both token-production ratio (labor cost) and exchange-production ratio (transaction cost) were low, and accumulation fell as either rose Wan et al. (2026). Levins and Gilroy's conceptual extension argues that the Operant Demand Framework — tracking consumption (responses completed) against price (tokens required) — gives clinicians a quantitative readiness signal for thinning: thin only after consumption stays stable as price rises in small increments Levins & Gilroy (2025). None of these papers is yet a randomized field test, but together they convert "thinning" from an art into a measurable decision rule Hackenberg (2018).
Maintenance is the failure point for most token economies
Regnier and colleagues' systematic review of 50+ token-economy studies (1965–2020) — covering children with ADHD, adults with developmental disabilities, psychiatric inpatients, and classroom students — identified four empirically tested maintenance strategies: schedule thinning, stimulus fading, self-monitoring/self-administration, and combination approaches. Combinations consistently outperformed single techniques Regnier et al. (2022). Pairing token thinning with self-monitoring checklists and explicit fading toward natural reinforcers (teacher praise, completed-task access) produced the best durability Regnier et al. (2022). The recurring failure mode: most primary studies measured neither social validity nor maintenance beyond six months, and several reported visible drops in target behavior when tokens were withdrawn abruptly rather than thinned Regnier et al. (2022).
Response cost is the inverse contingency, and learners do not uniformly prefer reinforcement
Response cost — token removal contingent on undesired behavior — is the procedural mirror image of reinforcement-based token earning. Jowett Hirst, Dozier, and Payne ran an alternating-treatments comparison of reinforcement-only and reinforcement-plus-response-cost token economies across children in group and individual instructional formats Jowett Hirst et al. (2016). Both procedures were similarly effective at increasing on-task behavior for most participants, and participant preference between the two was idiosyncratic rather than systematic Jowett Hirst et al. (2016). The implication for practice is that response cost is not a default escalation move but a configurable variant a clinician can offer when the learner shows preference for it; equal effectiveness with reinforcement-only systems means the choice between them turns on acceptability, ease of staff implementation, and the broader behavior plan rather than on outcome power Jowett Hirst et al. (2016).
Group and visitor-delivered token systems
Token economies are not restricted to dyadic instructional contexts. Pritchard, Penney, and Mace's correlational analysis of the ACHIEVE! program — a school-wide points-and-level token economy at a residential school for students with intellectual and developmental disabilities — found that higher daily point totals were inversely correlated with daily frequency of severe problem behavior (aggression, harmful sexual behavior, property damage) Pritchard et al. (2018). The program awarded points for prosocial skills without punishment, used a transparent level system tied to meaningful privileges, and daily point totals served as a real-time proxy for risk of severe-behavior episodes Pritchard et al. (2018). Vergason and Gravina's ABAB study at a zoo took the idea further: visitors and confederates delivered tokens contingent on a "10-5" greeting protocol (eye contact at 10 ft, verbal greeting at 5 ft), producing 35–45% increases in greeting behavior over baseline and demonstrating that non-trained patrons can function as reliable token dispensers when criteria are distance-anchored and the prize menu is small but desirable Vergason & Gravina (2020).
Establishing the right backup-reinforcer menu
Backup-reinforcer selection is the highest-leverage decision in any token system. The recommendations paper requires that backup options (1) be drawn from a current preference assessment, (2) include enough variety to avoid within-session satiation, and (3) be reliably available the moment a learner reaches the terminal token count degli Espinosa et al. (2024). Bonfonte and colleagues' progressive-ratio data argue for explicit testing: only high-preference edibles or activities sustained breakpoints comparable to primary reinforcers Bonfonte et al. (2020). Letting the learner pick from the menu at exchange time is itself a reinforcer and is part of why VR and accumulated-exchange schedules tend to feel preferable under the right conditions Argueta et al. (2019) Falligant et al. (2020).
Delay tolerance is a learner variable, and you can measure it
Kim, Fienup, Reed, and Jahromi validated a 2-minute delay-discounting task that predicted second-grade students' actual saving and earning patterns inside a class-wide token economy across two single-subject experiments Kim et al. (2024). Students with steeper discounting curves spent tokens immediately and accumulated less; students with shallower curves saved across longer intervals — and the screening task runs in roughly the time it takes to do morning attendance Kim et al. (2024). Identify high-discounting students at the start of the year and program tighter exchange windows, savings-bank routines, or distributed (not accumulated) exchange schedules for them, while the rest of the class operates on a standard accumulated schedule Kim et al. (2024) Falligant et al. (2020).
Token economies for health behaviors and OBM
Two corpus papers extend token-economy mechanics outside academic and reduction targets. Patel, Normand, and Kohn ran a token system contingent on moderate-to-vigorous physical activity (MVPA) in preschoolers, comparing baseline, contingent-token, and non-contingent-token conditions; only the contingent-token phase reliably raised MVPA and lengthened active bouts in three of four children — non-contingent token presence failed to produce comparable effects Patel et al. (2019). Nastasi, Sheppard, and Raiff used Fitbit-recorded daily step counts in a residential group home; an 8-week changing-criterion design produced large step-count increases for three of four adults with developmental disabilities Nastasi et al. (2020). Both papers confirm that token contingency requires actual contingency: tokens delivered non-contingently — even with identical materials, exchange opportunities, and backup menu — do not produce the same effects Patel et al. (2019).
Staff training is cheaper than the field assumes
The other failure mode is implementation drift — staff who deliver tokens late, miss the contingency, or skip exchanges. Gutierrez and colleagues' non-concurrent multiple-baseline across three graduate-level ABA trainees showed that a brief 30-minute manualized instruction (a 10-step task analysis covering materials setup, token delivery, paired praise, contingent token removal, snack access at exchange, and reset) brought novice staff to 98–100% procedural accuracy on a DTI-embedded token economy Gutierrez et al. (2020). A written manual used for 30 minutes can replace lengthy in-vivo modeling, making high-fidelity rollout across new RBTs and classroom teachers a tractable supervision problem Gutierrez et al. (2020).
Cross-setting evidence: corrections, COVID-era home programs, vocational settings
The corpus documents token economies operating well outside the clinic-and-classroom default. Pascale and colleagues' review references prior meta-analytic evidence that 28 of 29 reviewed correctional behavioral studies employed token economies, and notes that prisoners reduced rule-breaking specifically to gain access to the token economy Pascale et al. (2025). degli Espinosa and colleagues' COVID-era case series with eight Italian families converted school-based token boards into 24-hour home-wide token economies, using contingent solo-play access as a backup reinforcer to give parents predictable work-and-respite windows degli Espinosa et al. (2020). Radogna and colleagues' Italian vocational study used a 5:1 exchange ratio with choice backup items inside a BST package to teach job-related social skills to adults with neurodevelopmental disorders during real workplace tasks (Radogna et al., 2024). The shared procedural pattern: small ratios, immediate token delivery, choice at exchange, and pairing with routines that already structure the setting degli Espinosa et al. (2020).
02Evidence Tier Breakdown
A practitioner-facing page should be honest about where the evidence comes from Regnier et al. (2022). The token-economy literature lives mostly at the single-subject experimental design (SCED) and narrative/translational-review layers, with one quasi-experimental school program, one high-quality systematic review of maintenance strategies, a small set of theoretical and behavioral-economics papers, and a basic-laboratory branch in pigeons that grounds the procedural mechanics in operant-economic theory Hackenberg (2018) Regnier et al. (2022) Andrade & Hackenberg (2017) Wan et al. (2026).
Systematic and narrative reviews. Regnier and colleagues' PRISMA-guided systematic review of 50+ token-economy maintenance studies (1965–2020) is the densest review in the corpus and anchors the maintenance-strategy claims on this page Regnier et al. (2022). degli Espinosa and Hackenberg's evidence-based recommendations paper is a narrative review that operationalizes basic and applied token-reinforcement findings into two practitioner roadmaps (build new system / repair existing system) degli Espinosa et al. (2024). Hackenberg's translational review re-analyzes laboratory and field studies through a behavioral-economics lens and grounds the multi-schedule frame used throughout this page Hackenberg (2018). These three sit at the top of the practitioner-evidence pyramid for token economies, but only the Regnier review uses formal systematic-review methodology Regnier et al. (2022).
Quasi-experimental and group studies. Pritchard, Penney, and Mace's analysis of the ACHIEVE! program is the closest the corpus comes to a group-level demonstration: a correlational year-long examination of daily point totals against severe problem behavior across a residential IDD school, showing inverse association with no random assignment Pritchard et al. (2018). The design supports use of a school-wide token economy as a population-level intervention but does not establish causation Pritchard et al. (2018).
Single-subject experimental designs. Most applied token-economy evidence is SCED degli Espinosa et al. (2024). Cihon and colleagues (n=3) demonstrate flexible-earning-requirement effectiveness Cihon et al. (2019). Argueta, Leon, and Brewer (n=1) compare FR vs VR exchange-production schedules Argueta et al. (2019). Falligant, Pence, and Bedell (n=3) map preference between accumulated and distributed exchanges across task difficulty Falligant et al. (2020). Bonfonte and colleagues (n=2) and Fiske and colleagues (n=4) compare token vs primary reinforcer efficacy via progressive-ratio and multiple-schedule reinforcer assessments Bonfonte et al. (2020) Fiske et al. (2020). Jowett Hirst and colleagues compare reinforcement vs response cost across group and individual instruction Jowett Hirst et al. (2016). Patel and colleagues (n=4) and Nastasi and colleagues (n=4) demonstrate token efficacy for physical activity in preschoolers and adults with developmental disabilities Patel et al. (2019) Nastasi et al. (2020). Beahm and colleagues (n=3) demonstrate app-based delivery for adults Beahm et al. (2023). Vergason and Gravina (n=4) demonstrate visitor-delivered tokens in OBM Vergason & Gravina (2020). Gutierrez and colleagues (n=3) demonstrate manualized staff training Gutierrez et al. (2020). Kim and colleagues (n=16) tie delay-discounting to token-saving Kim et al. (2024). Radogna and colleagues (n=4) and degli Espinosa and colleagues (n=8 families) extend SCED to vocational and home contexts (Radogna et al., 2024) degli Espinosa et al. (2020).
Behavioral-economic and basic-laboratory studies. Andrade and Hackenberg's pigeon study (n=4) demonstrates own-price and cross-price elasticity in a generalized token economy Andrade & Hackenberg (2017). Wan, Tan, and Hackenberg (n=6) extend the demand analysis to token accumulation and exchange-production cost Wan et al. (2026). These are non-human and laboratory-bounded but provide the experimental-economics scaffolding the applied translational papers depend on Hackenberg (2018).
Theoretical and conceptual. Levins and Gilroy extend the Operant Demand Framework into clinical thinning decisions without empirical validation; the framework is promising but unverified in classroom or clinic samples Levins & Gilroy (2025). The recommendations paper has substantial conceptual content alongside its applied scaffolding degli Espinosa et al. (2024).
Cross-setting / supporting evidence. Pascale and colleagues' Italian-prison paper cites Gendreau et al.'s correctional meta-analysis as supporting evidence rather than running new token-economy data; treat this as practice-pattern evidence for secure settings rather than primary outcome data Pascale et al. (2025).
Bottom line. The convergent evidence is strong for the operational claims made on this page: tokens function as generalized conditioned reinforcers when properly conditioned; token-production and exchange-schedule design drive output and preference; flexible earning requirements outperform fixed ones; response cost is roughly equivalent to reinforcement-only systems with idiosyncratic preference; staff can be trained to high fidelity in 30 minutes; and combined maintenance strategies outperform abrupt withdrawal Hackenberg (2018) Cihon et al. (2019) Jowett Hirst et al. (2016) Gutierrez et al. (2020) Regnier et al. (2022). Evidence is weaker for any claim that one specific exchange ratio, token form, or thinning rate is universally superior — those are case-level decisions the corpus treats as preference-tested Falligant et al. (2020) Fiske et al. (2020).
03Procedure: Building a Token Economy from Scratch
A six-step build sequence that the recommendations paper organizes and the SCED literature validates piece-by-piece degli Espinosa et al. (2024).
1. Pick target behaviors and data system. Tokens require an unambiguous response definition and a way to count in real time at the same latency as token delivery Hackenberg (2018). Examples in the corpus: 5-second momentary time sampling + accelerometers (Patel), Fitbit step counts (Nastasi), app-tracked engagement intervals (Beahm), staff-recorded daily prosocial skill points (ACHIEVE!) Patel et al. (2019) Nastasi et al. (2020) Beahm et al. (2023) Pritchard et al. (2018).
2. Run a current preference assessment for backup reinforcers. Refresh within 30 days; build a 5–8 item menu with at least one high-preference edible, one preferred activity, and one tangible degli Espinosa et al. (2024). Bonfonte and colleagues' progressive-ratio data are the strongest argument that backup-reinforcer quality directly determines token efficacy at higher response costs Bonfonte et al. (2020). Keep the menu visible at exchange so the learner picks Falligant et al. (2020).
3. Condition the token by pairing. Pair token delivery with immediate (within-second) backup access on a 1:1 schedule across multiple sessions until the learner treats the token as a discriminative stimulus for the backup degli Espinosa et al. (2024). Run a tandem-control or multiple-schedule reinforcer assessment to verify token function before step 4 Bonfonte et al. (2020) Fiske et al. (2020).
4. Specify the three schedules in writing. Token-production, exchange-production, and exchange ratio Hackenberg (2018). Start dense — FR 1 token-production for the first 1–2 sessions, small terminal count (3–5) — and write the actual numbers on the data sheet degli Espinosa et al. (2024). Probe accumulated vs distributed and FR vs VR exchange-production with concurrent-chains arrangements rather than guessing Falligant et al. (2020) Argueta et al. (2019).
5. Train staff with a written 10-step task analysis. Gutierrez and colleagues' task analysis covers materials setup, token delivery (with paired praise), error correction, token storage, exchange access, backup delivery, reset, and data recording Gutierrez et al. (2020). A 30-minute read produced 98–100% accuracy in graduate trainees; a competency check before client contact is the operational guardrail Gutierrez et al. (2020).
6. Schedule a thinning plan from day one. The highest-yield maintenance pattern is combined strategies — schedule thinning + self-monitoring + fade to natural reinforcers; abrupt withdrawal predicts decay Regnier et al. (2022). Operant Demand readiness rule: thin only after consumption stays stable across small price increases, paired with upstream increase in natural reinforcement Levins & Gilroy (2025).
04Core Parameters: What You Are Actually Programming
A token economy is a stack of programmable parameters. The corpus is consistent that practitioners should specify each in writing rather than treat them as defaults Hackenberg (2018) degli Espinosa et al. (2024).
- Token-production schedule (continuous vs ratio). Continuous (FR 1) is appropriate during conditioning and early acquisition. Ratio (FR 2+, VR 2+) is appropriate during maintenance and for older or stronger responders. The Wan et al. accumulation data show that low token-production cost increases token-holding, which is sometimes desirable (savings, self-control practice) and sometimes not (immediate-exchange contexts) Wan et al. (2026).
- Exchange-production schedule (FR vs VR; small vs large). Argueta and colleagues showed equivalent response output across FR and VR within matched ratios, but a clear preference for VR — meaning, all else equal, default to VR exchange when the learner can choose Argueta et al. (2019). Wan et al. add that low exchange-production cost increases accumulation, so a transaction-heavy exchange ramp will push learners toward immediate spending Wan et al. (2026).
- Exchange ratio (tokens per backup item). Small ratios (3–5 tokens per exchange) for naive learners and during conditioning; the recommendations paper warns specifically against thinning the exchange ratio too quickly, which is one of the field's signature drift patterns degli Espinosa et al. (2024). The Italian vocational program ran a 5:1 ratio with choice backup items and small batches that ensured quick pay-offs while limiting satiation (Radogna et al., 2024).
- Immediate vs delayed exchange. Falligant and colleagues' choice data show that under easy tasks or dense token-production schedules, learners prefer accumulated (delayed) exchanges; under hard tasks or lean token-production schedules, they prefer distributed (immediate) exchanges Falligant et al. (2020). Kim and colleagues' delay-discounting screener flags students whose delay tolerance is too low for accumulated systems and who need tighter exchange windows or savings-bank routines Kim et al. (2024).
- Fixed vs flexible earning requirement. Cihon and colleagues' flexible system — terminal token count adjusted session-by-session based on prior performance — eliminated the post-reinforcement pause typical of fixed-requirement designs and is one of the highest-yield procedural variants in the recent literature Cihon et al. (2019).
- Individual vs group contingency. Individual contingencies (each learner's tokens depend only on their own responding) are the default for skill acquisition. Group contingencies (independent — everyone earns on their own; dependent — one learner's behavior earns for the group; interdependent — the group earns together if a criterion is hit) work for classroom-level prosocial behavior and are the structure of school-wide systems like ACHIEVE! and games like Good Behavior Game Pritchard et al. (2018). The Vergason zoo OBM study is effectively an interdependent group contingency in which patrons' tokens reward the staff team as a class rather than individually Vergason & Gravina (2020).
- Reinforcement only vs reinforcement plus response cost. Jowett Hirst and colleagues showed roughly equivalent effectiveness, with idiosyncratic preference Jowett Hirst et al. (2016). Pritchard and colleagues argue, on different evidence, that a reinforcement-only school-wide system (ACHIEVE!) coincides with lower severe-behavior rates than punishment-based programs and is preferable on social-validity grounds Pritchard et al. (2018). The clinical move is to default to reinforcement-only, add response cost only when the learner prefers it or when a specific contingency requires removable consequences, and write the response-cost rule explicitly so staff cannot improvise Jowett Hirst et al. (2016).
05Establishing Operations: Making Tokens Actually Function as Reinforcers
A token system is only as strong as the establishing operations (EOs) operating on its backup reinforcers degli Espinosa et al. (2024). Two EO levers are under direct clinician control:
Pre-session deprivation. If the menu's high-preference backup items are freely available outside the session, the EO is abolished and the tokens lose value. The recommendations paper is explicit that backup items used in token systems should be reserved for the token contingency; non-contingent free access is the most common reason a previously functional token economy stops producing responding degli Espinosa et al. (2024). Patel and colleagues' MVPA study makes the same point empirically: non-contingent token presentation failed to produce the same MVPA increases as contingent token delivery, even with identical materials and exchange opportunities Patel et al. (2019).
Menu rotation and satiation control. Backup-reinforcer satiation within session is a leading cause of mid-session response decay. The Italian vocational study ran a 5:1 ratio with choice at exchange and small batches partly to limit within-session satiation; the recommendations paper argues for explicit menu rotation across sessions and for keeping at least 5–8 items live at any time (Radogna et al., 2024) degli Espinosa et al. (2024). Bonfonte and colleagues' breakpoint data are effectively a satiation-and-value test: the items with the highest progressive-ratio breakpoints are the ones most resistant to within-session satiation effects Bonfonte et al. (2020).
The procedural takeaway: a daily preference probe (15–60 seconds with 3–4 candidate items) at the start of the session is one of the highest-yield procedural additions a token economy can carry, and it costs almost nothing in staff time degli Espinosa et al. (2024).
06Group Token Systems: Good Behavior Game, Mystery Motivator, Lottery, Level Systems
Group contingencies extend token logic to classroom and residential populations. The corpus does not include modern Good Behavior Game or mystery-motivator outcome studies directly, but the operational logic — group earns or loses tokens contingent on a public criterion, exchange at intervals, criterion can be public or hidden — is supported by the same exchange-schedule and contingency-quality variables that govern individual systems Hackenberg (2018). The closest direct evidence is Pritchard and colleagues' ACHIEVE! — a school-wide points-and-level system that awarded points for prosocial skills without punishment, with daily totals inversely correlated with severe-behavior frequency Pritchard et al. (2018). Vergason and Gravina's zoo study is a workable OBM analog: distance-anchored target behavior, visitor-delivered tokens, and a small prize menu produced reliable greeting increases without supervisor presence Vergason & Gravina (2020). Both map onto the group-contingency template for Good Behavior Game and lottery systems — public simple criteria, tokens delivered the moment criterion is met, small menus with quick payoffs — with the same guardrails: probe preference, watch for satiation, write the rule explicitly degli Espinosa et al. (2024).
07Response Cost as the Inverse Contingency
Response cost — token removal contingent on undesired behavior — is the procedural mirror of token earning Jowett Hirst et al. (2016). Jowett Hirst, Dozier, and Payne ran the only direct head-to-head comparison in the recent corpus: alternating-treatments across group and individual instructional formats with reinforcement-only and reinforcement-plus-response-cost token economies, with on-task behavior as the dependent variable and child preference probed independently Jowett Hirst et al. (2016). Both procedures were similarly effective at increasing on-task behavior for most children, and participant preference was idiosyncratic — some children chose reinforcement-only, others chose reinforcement-plus-response-cost, with no systematic pattern Jowett Hirst et al. (2016). The clinical implications are concrete: response cost is not an automatic escalation move, equal effectiveness with reinforcement-only means the choice turns on acceptability and staff capacity, and learner preference between the two formats should be probed (briefly, with the actual tokens) before committing the BIP to one or the other Jowett Hirst et al. (2016). When response cost is selected, the recommendations paper's guardrails apply: pre-specify exactly which behaviors trigger token removal, cap removals so the learner cannot reach negative balance, pair each removal with a brief, neutral, scripted statement, and document the response-cost rule so staff cannot improvise degli Espinosa et al. (2024). The non-punishment ACHIEVE! data are a counterweight to defaulting to response cost on social-validity grounds: a school-wide reinforcement-only system showed inverse correlation with severe behavior across a year of operation Pritchard et al. (2018).
08Across Settings
Classrooms (K-12)
Public-school classrooms are where most token economies operate. The corpus directly supports class-wide token economies with second-graders (Kim et al.) and the embedded use of brief delay-discounting screeners to predict which students will accumulate vs spend tokens Kim et al. (2024). Jowett Hirst and colleagues' group-and-individual instructional comparison maps onto typical classroom delivery and shows reinforcement-only and reinforcement-plus-response-cost tokens are roughly equivalent for on-task behavior Jowett Hirst et al. (2016). The flexible-earning-requirement procedure, while validated with three boys with autism in an outpatient program, transfers cleanly to classroom dyads or small groups: set today's terminal token count from yesterday's data, raise it when responding stabilizes Cihon et al. (2019). The classroom-level practical pattern that emerges from the corpus is small terminal token counts (3–5), a backup-reinforcer menu with at least one high-preference item, daily preference probes at session start, accumulated exchange for easy tasks and distributed exchange for harder ones, and a written thinning plan from day one tied to natural reinforcers (teacher praise, completed-task access) degli Espinosa et al. (2024) Falligant et al. (2020) Regnier et al. (2022).
Outpatient and university clinics
Outpatient clinics are where the SCED-grade token-economy mechanics live. Cihon's flexible-earning study, Argueta's exchange-schedule preference work, Falligant's accumulated-vs-distributed comparison, Bonfonte's progressive-ratio efficacy testing, and Fiske's multiple-schedule reinforcer assessment all run inside outpatient or university clinic settings with children with autism receiving DTT Cihon et al. (2019) Argueta et al. (2019) Falligant et al. (2020) Bonfonte et al. (2020) Fiske et al. (2020). The procedural pattern is similar to classroom but with tighter session-level data: pre-session preference probe, conditioning assessment of the tokens themselves, small exchange ratios (3–5 tokens), and concurrent-chains preference probes when staff are unsure which schedule the learner will tolerate Fiske et al. (2020) Falligant et al. (2020).
Residential schools and adult disability services
Residential settings concentrate severe topographies, dispersed staff, and longer time horizons. Pritchard's ACHIEVE! is a residential-school points-and-level system; Beahm and Nastasi run in residential or day-program settings for adults with developmental disabilities; the recommendations paper is built specifically to handle drift in long-running systems Pritchard et al. (2018) Beahm et al. (2023) Nastasi et al. (2020) degli Espinosa et al. (2024). Plan for: tighter-than-monthly menu refresh cycles to limit satiation across long days; the 30-minute manualized training fix for staff drift; app-based delivery to eliminate physical-token-management problems across shifts; and combined maintenance strategies (schedule thinning + self-monitoring + fade to natural reinforcers) degli Espinosa et al. (2024) Gutierrez et al. (2020) Beahm et al. (2023) Regnier et al. (2022).
Hospitals, psychiatric inpatient, and corrections
The Regnier review documents psychiatric inpatient and adult clinical token economies as one of four populations covered across 50+ studies Regnier et al. (2022). In secure settings, Pascale and colleagues note that 28 of 29 correctional behavioral studies (Gendreau et al., 2014) employed token economies, and prisoners reduced rule-breaking specifically to gain access to the token economy Pascale et al. (2025). The procedural shift in higher-risk settings is toward reinforcement-only systems on social-validity grounds and toward explicitly written response-cost rules when token removal is included, so staff under pressure cannot improvise Pritchard et al. (2018) degli Espinosa et al. (2024).
OBM, staff performance, and home parent training
Vergason and Gravina's zoo study demonstrates token-economy mechanics for adult staff: visitor and confederate-delivered tokens, distance-anchored 10-5 greeting target, small prize menu, 35–45% increases over baseline without supervisor presence Vergason & Gravina (2020). Radogna and colleagues' 5:1 vocational system embedded in BST extends the same logic to job-related social skills (Radogna et al., 2024). degli Espinosa and colleagues' COVID-era case series converted school-based token boards into 24-hour home-wide token economies, using contingent solo-play access as a backup reinforcer to give parents predictable work-and-respite windows degli Espinosa et al. (2020). The procedural rule for home programs: extend an existing school token board rather than build a new system, broaden the menu to household tasks, and tie exchange to predictable parental needs degli Espinosa et al. (2020). Beahm and colleagues' app-based ClassDojo system transfers cleanly because the materials problem is solved by the phone Beahm et al. (2023).
09Common Pitfalls
- Skipping the conditioning step. Tokens introduced as if pre-conditioned will frequently fail to function as reinforcers; the recommendations paper is explicit that pairing tokens with established backup reinforcers must precede any contingent use degli Espinosa et al. (2024). Bonfonte and colleagues' progressive-ratio data show that even properly conditioned tokens can lag the value of high-preference primary reinforcers, which means the conditioning step is necessary but not sufficient — empirical reinforcer assessment of the tokens should follow conditioning Bonfonte et al. (2020).
- Backup-reinforcer drift. Free non-contingent access to the menu items abolishes the EO and silently kills the token contingency; the most common reason a previously functional token system stops working is that staff or family members started giving the backup items away degli Espinosa et al. (2024). Patel and colleagues' contrast between contingent and non-contingent token delivery is the experimental version of the same point Patel et al. (2019).
- Exchange-ratio drift. Thinning the exchange ratio too quickly — going from 3 tokens per item to 10 in a week — is one of the recommendations paper's named drift patterns and a common reason maintenance fails degli Espinosa et al. (2024). The Operant Demand Framework rule is to thin only after consumption stays stable across small price increments Levins & Gilroy (2025).
- Fixed terminal token counts producing post-reinforcement pauses. Cihon and colleagues' flexible-earning-requirement study explicitly addresses this drift; the lever is varying the terminal count session-by-session based on prior responding rather than fixing it Cihon et al. (2019).
- Treating accumulated vs distributed exchange as a fixed program parameter. Falligant and colleagues' data show preference is conditional on task difficulty and token-production density; locking into one schedule at the start of treatment misses the lever Falligant et al. (2020).
- Using tokens that cannot be delivered fast enough. If physical tokens require fishing in a pocket or finding a sticker book, latency exceeds 1–2 seconds and the contingency erodes; app-based delivery (Beahm) and pre-positioned chip jars (Radogna) are the procedural fixes Beahm et al. (2023) (Radogna et al., 2024).
- Skipping a daily preference probe. Backup-reinforcer satiation drives mid-session response decay; a 15–60-second probe at session start catches it before it costs an hour of teaching time degli Espinosa et al. (2024) Bonfonte et al. (2020).
- Assuming staff are implementing as written. Implementation drift is the second most common token-economy failure mode after backup-reinforcer drift; a 30-minute manualized check using Gutierrez and colleagues' 10-step task analysis catches most of it cheaply Gutierrez et al. (2020).
- Withdrawing tokens abruptly at "mastery" rather than thinning. The Regnier review documents this as a common cause of behavior decay; combined maintenance strategies (schedule thinning + self-monitoring + fade to natural reinforcers) outperform single techniques Regnier et al. (2022).
- Confusing response cost with extinction or punishment. Response cost is contingent token removal — a procedurally specific consequence — not a generic "consequence." Jowett Hirst and colleagues' equal-effectiveness finding does not authorize improvised token removal; the rule must be written and the response cost capped to prevent negative balances Jowett Hirst et al. (2016) degli Espinosa et al. (2024).
- Programming a single token form across radically different settings. Beahm's app-based system, Radogna's portable physical chips, and the ACHIEVE! school-wide point system are not interchangeable; token form follows token deployment, and asking a paraprofessional to manage chips at a noisy worksite is a setup for delivery latency errors Beahm et al. (2023) (Radogna et al., 2024) Pritchard et al. (2018).
10Decision Logic: Token Economy vs Alternatives
When a referral arrives with "needs reinforcement system," a token economy is one of several options. The corpus supports the following decision rules:
- Use simple praise + immediate primary reinforcement during acquisition when the learner cannot yet tolerate any delay. Bonfonte and Fiske data show primary reinforcers outperform tokens for some learners, and tokens introduce a delay acquisition-stage responding may not survive Bonfonte et al. (2020) Fiske et al. (2020). Convert to a token economy once responding is stable on direct primary reinforcement.
- Use a token economy when (a) behavior is at maintenance/fluency, (b) the learner tolerates a 1–10 second delay between response and backup access, and (c) you need to bridge that delay across multiple responses degli Espinosa et al. (2024) Hackenberg (2018).
- Choose tokens when you need portability across settings or staff. Beahm's app-based system, Radogna's worksite chips, and degli Espinosa's home-wide system show token economies travel better than direct primary reinforcement Beahm et al. (2023) (Radogna et al., 2024) degli Espinosa et al. (2020).
- Choose tokens over Premack when reinforcing many small responses at low latency rather than a single response with delayed activity access; pure Premack arrangements cannot deliver at one-second latency.
- Choose tokens over DRA + immediate reinforcement when the alternative behavior occurs at low rate and the contingency must support several instances before a backup reinforcer is appropriate. Tokens accumulate; immediate primary reinforcement does not.
- Default to reinforcement-only; add response cost only when the learner prefers it. Jowett Hirst's equal-effectiveness finding plus the ACHIEVE! data argue against defaulting to response cost Jowett Hirst et al. (2016) Pritchard et al. (2018).
- For class-wide systems, screen delay-discounting at the start of the year. Kim's 2-minute task identifies high-discounting students who need tighter exchange windows Kim et al. (2024).
- Match exchange schedule to task difficulty. Accumulated exchanges with easy tasks; distributed exchanges with hard tasks Falligant et al. (2020).
- Thin using the consumption-vs-price readiness rule. Levins and Gilroy: thin only after consumption stays stable as price rises in small increments, paired with upstream increase in natural reinforcement Levins & Gilroy (2025).
- When a token economy stops working, run the Part 2 audit before redesigning. Most failures are backup-reinforcer drift, ratio drift, or implementation drift — not conceptual mismatch degli Espinosa et al. (2024).
11Practitioner Takeaways
- Always condition tokens before using them contingently. Pair tokens with established backup reinforcers on a 1:1 schedule across multiple sessions before tying them to target responses; verify reinforcer function with a tandem-control or multiple-schedule assessment degli Espinosa et al. (2024) Bonfonte et al. (2020) Fiske et al. (2020).
- Pick token form for delivery latency, not aesthetics. App-based delivery beats physical chips when the setting is mobile, noisy, or staffed by paraprofessionals; physical chips beat apps when the learner needs a visible accumulating display Beahm et al. (2023) (Radogna et al., 2024).
- Run a current preference assessment for the backup-reinforcer menu. Refresh within 30 days, include 5–8 items spanning edible/activity/tangible categories, and let the learner choose at exchange degli Espinosa et al. (2024) Falligant et al. (2020).
- Specify all three schedules in writing. Token-production schedule, exchange-production schedule, and exchange ratio go on the data sheet so the next staff member can replicate without guessing Hackenberg (2018) degli Espinosa et al. (2024).
- Use small terminal token counts for naive learners. Start at 3–5 tokens per exchange and thin only after consumption is stable across small price increases degli Espinosa et al. (2024) Levins & Gilroy (2025).
- Probe accumulated vs distributed exchange against task difficulty. Easy tasks or dense schedules favor accumulated; hard tasks or lean schedules favor distributed; concurrent-chains probes settle this in minutes Falligant et al. (2020).
- Default VR over FR for exchange-production when the learner can choose. Output is equivalent; preference is reliable for VR Argueta et al. (2019).
- Use a flexible terminal-token-count rule when post-reinforcement pause is a problem. Set today's count from yesterday's data, raise it when responding stabilizes Cihon et al. (2019).
- Train staff with the 30-minute manualized 10-step task analysis. Read the task analysis, run a competency check, and verify procedural integrity before client contact; this is the cheapest known fix for implementation drift Gutierrez et al. (2020).
- Plan thinning from day one with combined maintenance strategies. Schedule thinning + self-monitoring + fade to natural reinforcers is the systematic-review-supported maintenance pattern; abrupt withdrawal predicts behavior decay Regnier et al. (2022).
- For class-wide systems, screen delay-discounting in 2 minutes. Identify high-discounting students at the start of the year and program tighter exchange windows or savings-bank routines for them Kim et al. (2024).
- Default to reinforcement-only; add response cost only when the learner prefers it or the contingency requires removable consequences. Cap token removal so the learner cannot reach negative balance and write the response-cost rule explicitly Jowett Hirst et al. (2016) degli Espinosa et al. (2024).
12Frequently Asked Questions
Do tokens always work as reinforcers once they're paired with backup items?
No. Bonfonte and colleagues' progressive-ratio comparison showed that even after standard tandem-control conditioning, high-preference edible backup reinforcers sustained higher response rates than the tokens themselves for both participants, indicating tokens can lag the value of their best backup items Bonfonte et al. (2020). Fiske and colleagues' multiple-schedule reinforcer assessment found tokens equivalent to primary reinforcers for half their participants and weaker for the other half Fiske et al. (2020). The implication is that conditioning is necessary but not sufficient — run an empirical reinforcer assessment of the tokens themselves before relying on them clinically degli Espinosa et al. (2024).
What's the difference between token-production schedule, exchange-production schedule, and exchange ratio?
Three different layers of the same system Hackenberg (2018). Token-production schedule = the work required per token earned (FR 1, FR 3, VR 2, etc.). Exchange-production schedule = the work or time required between earning the terminal token and getting access to the backup item (FR 5 of additional responses, fixed-time 10-minute wait, end-of-session). Exchange ratio = the number of tokens needed per backup item (3:1, 5:1, 10:1) Hackenberg (2018). All three are programmable, all three drive output and preference, and the recommendations paper is unambiguous that they should be specified in writing degli Espinosa et al. (2024).
Should I use accumulated (save-up) or distributed (immediate) exchanges?
It depends on task difficulty and token-production schedule density. Falligant and colleagues' concurrent-choice data showed three children strongly preferred accumulated exchanges when tokens were earned for easy tasks or delivered on dense schedules, and shifted to preferring distributed exchanges when tasks were hard or token-production schedules were lean Falligant et al. (2020). The procedural rule is to probe preference with a brief concurrent-chains arrangement and to be prepared to switch as task difficulty changes within a treatment program Falligant et al. (2020).
Is response cost as effective as reinforcement-only token economies?
For most learners, yes — and learner preference is idiosyncratic. Jowett Hirst and colleagues' alternating-treatments comparison found reinforcement-only and reinforcement-plus-response-cost token systems similarly effective at increasing on-task behavior across group and individual instructional formats, with no systematic preference between the two Jowett Hirst et al. (2016). The recommendation is to default to reinforcement-only on social-validity grounds, probe learner preference when response cost is a candidate, and write the response-cost rule explicitly with caps to prevent negative balance Jowett Hirst et al. (2016) degli Espinosa et al. (2024) Pritchard et al. (2018).
How do I know when to thin the token schedule?
Use the Operant Demand Framework readiness rule: thin only after consumption stays stable across small price increases Levins & Gilroy (2025). Operationally, raise the token-production ratio (more responses per token) or the exchange ratio (more tokens per backup item) by one increment, hold it for several sessions, and check whether responding stays at criterion; if it does, thin again, and pair each thinning step with an increase in upstream natural reinforcement (teacher praise, completed-task access) Regnier et al. (2022). The systematic-review evidence is unambiguous that combined maintenance strategies — schedule thinning plus self-monitoring plus fade to natural reinforcers — outperform any single technique Regnier et al. (2022).
Can RBTs run a token economy with high fidelity?
Yes, with a 30-minute written task analysis and a competency check. Gutierrez and colleagues' non-concurrent multiple-baseline showed three graduate-level ABA trainees reached 98–100% accuracy on a 10-step DTI-embedded token-economy task analysis after a brief manualized instruction, and the gain was maintained across replications Gutierrez et al. (2020). The 10 steps cover materials setup, token delivery (paired with praise), error correction, token storage, exchange access, backup-reinforcer delivery, reset, and data recording — the same task analysis can serve as the staff competency check before client contact Gutierrez et al. (2020).
Will a token economy work for adults with developmental disabilities, not just children?
The corpus is direct: yes Beahm et al. (2023) Nastasi et al. (2020). Beahm and colleagues' app-based ClassDojo system increased engagement in daily-living and vocational tasks for three adults aged 21–38 in a day-habilitation program, with gains maintained after thinning Beahm et al. (2023). Nastasi, Sheppard, and Raiff's Fitbit-based contingency-management arrangement produced large daily-step-count increases for three of four adults with developmental disabilities in a residential group home Nastasi et al. (2020). Radogna and colleagues' Italian vocational study extends the same logic to job-related social skills with adults with neurodevelopmental disorders during real workplace tasks (Radogna et al., 2024).
What goes in the BIP that documents the token economy?
At minimum: target-behavior definition; backup-reinforcer menu with preference-assessment date; token-production schedule, exchange-production schedule, and exchange ratio in writing; token form and delivery method; pairing/conditioning protocol; data system; staff task-analysis competency-check date; response-cost rule (if any) with caps on removal; and the thinning plan tied to consumption-vs-price readiness criteria and combined maintenance strategies degli Espinosa et al. (2024) Gutierrez et al. (2020) Levins & Gilroy (2025) Regnier et al. (2022) Jowett Hirst et al. (2016).
13References
Primary research synthesized in this guide. DOIs link to the original source.
- Andrade, L. F., & Hackenberg, T. D. (2017). Substitution effects in a generalized token economy with pigeons. Journal of the Experimental Analysis of Behavior, 107(1), 65–86. https://doi.org/10.1002/jeab.231
- Argueta, T., Leon, Y., & Brewer, A. (2019). Exchange schedules in token economies: A preliminary investigation of second-order schedule effects. Behavioral Interventions, 34(4), 488–501. https://doi.org/10.1002/bin.1661
- Beahm, L. A., Ingvarsson, E. T., Funk, N., Haskins, L., & Frazier, J. (2023). Using an app-based token economy to increase engagement in daily living and vocational tasks with adults with developmental disabilities. Behavior Analysis in Practice, 16(3), 825–838. https://doi.org/10.1007/s40617-023-00774-4
- Bonfonte, S. A., Bourret, J. C., & Lloveras, L. A. (2020). Comparing the reinforcing efficacy of tokens and primary reinforcers. Journal of Applied Behavior Analysis, 53(3), 1462–1474. https://doi.org/10.1002/jaba.675
- Cihon, J. H., Ferguson, J. L., Milne, C. M., Leaf, J. B., McEachin, J., & Leaf, R. (2019). A preliminary evaluation of a token system with a flexible earning requirement. Behavior Analysis in Practice, 12(2), 365–369. https://doi.org/10.1007/s40617-018-00316-3
- degli Espinosa, F., & Hackenberg, T. D. (2024). Token economies: Evidence-based recommendations for practitioners. Behavioral Interventions, 39(2), e2051. https://doi.org/10.1002/bin.2051
- degli Espinosa, F., Metko, A., Raimondi, M., Impenna, M., & Scognamiglio, E. (2020). A model of support for families of children with autism living in the COVID-19 lockdown: Lessons from Italy. Behavior Analysis in Practice, 13(3), 550–558. https://doi.org/10.1007/s40617-020-00438-7
- Falligant, J. M., Pence, S. T., & Bedell, S. B. (2020). Preferences for token exchange-production schedules: Effects of task difficulty and token-production schedules. Behavioral Interventions, 35(4), 549–561. https://doi.org/10.1002/bin.1706
- Fiske, K. E., Isenhower, R. W., Bamond, M. J., & Lauderdale-Littin, S. (2020). An analysis of the value of token reinforcement using a multiple-schedule assessment. Journal of Applied Behavior Analysis, 53(2), 895–907. https://doi.org/10.1002/jaba.613
- Gutierrez, J., Reeve, S. A., Vladescu, J. C., DeBar, R. M., & Giannakakos, A. R. (2020). Evaluation of manualized instruction to train staff to implement a token economy. Behavior Analysis in Practice, 13(2), 422–434. https://doi.org/10.1007/s40617-019-00386-x
- Hackenberg, T. D. (2018). Token reinforcement: Translational research and application. Journal of Applied Behavior Analysis, 51(2), 393–435. https://doi.org/10.1002/jaba.439
- Jowett Hirst, E. S., Dozier, C. L., & Payne, S. W. (2016). Efficacy of and preference for reinforcement and response cost in token economies. Journal of Applied Behavior Analysis, 49(2), 329–345. https://doi.org/10.1002/jaba.294
- Kim, J. Y., Fienup, D. M., Reed, D. D., & Jahromi, L. B. (2024). A rapid assessment of sensitivity to reward delays and classwide token economy savings for school-aged children. Journal of Behavioral Education, 33(4), 730–751. https://doi.org/10.1007/s10864-022-09503-3
- Levins, P., & Gilroy, S. P. (2025). Extending token economy systems with the operant demand framework. Journal of Behavioral Education. https://doi.org/10.1007/s10864-024-09556-6
- Nastasi, J. A., Sheppard, R. D., & Raiff, B. R. (2020). Token-economy-based contingency management increases daily steps in adults with developmental disabilities. Behavioral Interventions, 35(4), 730–742. https://doi.org/10.1002/bin.1711
- Pascale, V., Pritchard, J. K., Iovino, F. P. C., Bassani, S., & Wine, B. (2025). The effects of a DRO and self-monitoring program on prisoners in an Italian prison. Behavior Analysis in Practice. https://doi.org/10.1007/s40617-025-01045-0
- Patel, R. R., Normand, M. P., & Kohn, C. S. (2019). Incentivizing physical activity using token reinforcement with preschool children. Journal of Applied Behavior Analysis, 52(4), 1029–1041. https://doi.org/10.1002/jaba.536
- Pritchard, D., Penney, H., & Mace, F. C. (2018). The ACHIEVE! program: A point and level system for reducing severe problem behavior. Behavioral Interventions, 33(1), 41–55. https://doi.org/10.1002/bin.1506
- Radogna, C., D'Angelo, G., & Lerman, D. C. (2024). Assessing and teaching job-related social skills to adults with neurodevelopmental disorders in Italy. Behavior Analysis in Practice, 17, 393–410. https://doi.org/10.1007/s40617-023-00873-2
- Regnier, S. D., Traxler, H. K., Devoto, A., & DeFulio, A. (2022). A systematic review of treatment maintenance strategies in token economies: Implications for contingency management. Perspectives on Behavior Science, 45(4), 819–861. https://doi.org/10.1007/s40614-022-00358-7
- Vergason, C. M., & Gravina, N. E. (2020). Using a guest- and confederate-delivered token economy to increase employee–guest interactions at a zoo. Journal of Applied Behavior Analysis, 53(3), 1789–1796. https://doi.org/10.1002/jaba.599
- Wan, H., Tan, L., & Hackenberg, T. D. (2026). Behavioral economic analysis of pigeons' token accumulation and reinforcer demand in a laboratory-based token economy. Journal of the Experimental Analysis of Behavior. https://doi.org/10.1002/jeab.70095