Interobserver agreement: A preliminary investigation into how much is enough?
Extra IOA sessions don’t boost agreement—behavior rate does, so train well and move on.
01Research in Context
What this study did
Hausman et al. (2022) re-checked old functional-analysis tapes. They clipped the tapes at different places. Then they asked: does IOA change if we score 5, 10, or 20 sessions?
The team used trained observers and clean clinic rooms. They wanted to see if a magic session number gives better agreement.
What they found
IOA stayed almost the same no matter how many sessions they kept. The only thing that moved the score was how often the behavior happened.
In short, extra sessions did not clean up the numbers. Response rate, not session count, drove the differences.
How this fits with other research
Rolider et al. (2012) already warned that exact-agreement IOA jumps around when behavior is fast. Hausman’s data now show the jump comes from rate, not from grabbing more sessions.
Cox et al. (2025) just gave us new math—precision, recall, F1—to replace old percent agreement. Pair that with Hausman’s finding and you can run fewer sessions and still report sharper metrics.
Essig et al. (2023) counted papers and saw half skip fidelity IOA. Hausman answers the quiet worry behind that gap: you don’t need a pile of extra IOA sessions to feel safe.
Why it matters
Stop collecting “just-in-case” IOA sessions. Train your observer well, control the room, and score until the rate looks stable. You save time and still meet the journal standard.
Want CEUs on This Topic?
The ABA Clubhouse has 60+ free CEUs — live every Wednesday. Ethics, supervision & clinical topics.
Join Free →Pick two IOA sessions per condition instead of five—spend the saved time on observer re-training.
02At a glance
03Original abstract
Interobserver agreement (IOA) is important for research and practice, and supports the consistency of behavioral data (Kahng et al., 2011). Although general parameters for how much IOA is needed have been suggested (Bailey & Burch, 2018), it is unknown if the total number of sessions with IOA might impact the IOA coefficient. In this study, IOA was reanalyzed using functional analysis data at various cutoffs. Obtained IOA from these analyses was then compared to the original IOA. Overall, results suggested that, at least when using highly trained observers in a structured clinical setting, there were no significant differences in IOA across cutoffs. However, IOA was sensitive to overall rate of responding in the functional analysis. These data are encouraging, particularly for practitioners, because they provide preliminary support that the amount of sessions with IOA may not be as important as the consistency of the data.
Journal of Applied Behavior Analysis, 2022 · doi:10.1002/jaba.811