Using the dual-criteria methods to supplement visual inspection: An analysis of nonsimulated data
Count 3 baseline and 5 treatment points before you trust dual-criteria dots.
01Research in Context
What this study did
Lanovaz and his team looked at real single-case graphs from published studies. They wanted to know how many data points you need in each phase before the dual-criteria method gives a safe answer.
They ran the dual-criteria rule on 250 real A-B graphs. They counted how often the rule cried “effect” when the visual picture was really flat.
What they found
If you have at least 3 points in baseline and 5 in treatment, the false-alarm rate stays under a large share. With fewer points, the rule starts yelling “effect” too often.
In short: 3-in-A and 5-in-B is the safety line. Cross it before you trust the dual-criteria dots.
How this fits with other research
Manolov et al. (2022) built a free Brinley-plot tool that uses the same 3-and-5 rule to judge replication across kids. Their tool turns the Lanovaz numbers into a quick picture check.
Manolov (2026) went further and put the rule into one-click web apps. You now upload your file and the site auto-counts your phase points and flags short phases.
Jacobs (2019) pushes randomization tests instead of dual-criteria. Both want to guard visual analysis, but Jacobs uses shuffle stats while Lanovaz uses simple point counts. Pick one; just don’t rely on eyes alone.
Why it matters
Next time you run an A-B design, count your points before you call it a win. If baseline has only two dots or treatment has four, keep collecting. The 3-and-5 rule is fast, free, and keeps your false positives low. Your supervisor and future reviewers will thank you.
Want CEUs on This Topic?
The ABA Clubhouse has 60+ free CEUs — live every Wednesday. Ethics, supervision & clinical topics.
Join Free →Open your last A-B graph and extend the phase that falls short of 3-or-5.
02At a glance
03Original abstract
The purpose of our study was to examine the probability of observing false positives in nonsimulated data using the dual-criteria methods. We extracted data from published studies to produce a series of 16,927 datasets and then assessed the proportion of false positives for various phase lengths. Our results indicate that collecting at least three data points in the first phase (Phase A) and at least five data points in the second phase (Phase B) is generally sufficient to produce acceptable levels of false positives.
Journal of Applied Behavior Analysis, 2017 · doi:10.1002/jaba.394