Quantification of behavioral data with effect sizes and statistical significance tests
Add Tau z and RD effect-size calculations to your next SCED manuscript—visual inspection alone isn’t reliable enough.
01Research in Context
What this study did
Costello et al. (2022) compared how well different ways of reading single-case graphs agree with each other.
They looked at visual inspection, two effect-size numbers (RD and g), and two statistical tests (Tau z and PWD).
The team ran these metrics on sets of graphs that already existed and checked how often they told the same story.
What they found
The numbers matched each other: Tau z, PWD, RD, and g all pointed the same way.
Visual inspection alone was shaky; different viewers did not always agree.
Adding the small math steps made the decision clearer and more repeatable.
How this fits with other research
Carlin et al. (2022) ran a simulation the same year and reached the same rule: use Tau to spot an effect, then RD or g to size it.
The two papers together form a direct replication, one with real data and one with fake data, giving the same advice.
Dowdy et al. (2021) review shows the field is already pooling these exact metrics in meta-analyses, so the practice is ready for journals.
Falligant et al. (2020) tested a different aid (fail-safe k) and also warned: numbers help, but check them before trusting your eyes.
Why it matters
You no longer need to choose between eyes and numbers. Run Tau z for a quick yes/no, add RD to show how big, and write both in your paper. Reviewers expect it, and your graphs will convince faster.
Get CEUs on This Topic — Free
The ABA Clubhouse has 60+ on-demand CEUs including ethics, supervision, and clinical topics like this one. Plus a new live CEU every Wednesday.
Open your last graph, plug the data into a free Tau z calculator, and paste the number into the caption.
02At a glance
03Original abstract
This article describes the use of statistical significance tests and distance-based effect sizes with behavioral data from single case experimental designs (SCEDs). Such data often are interpreted only with visual analysis. However, a growing movement in the field is to quantify results to improve decision-making and communication across studies and sciences. The goal of the present study was to assess the agreement between visual analysis and various statistical tests. We recruited visual analysts to judge 160 pairwise data sets from published articles and compared these analyses to significance tests and effect sizes. One-tailed significance testing of Tau z and the percentage of pairwise differences in the predicted direction (PWD) generally agreed with each other, and complemented the effect sizes of Ratio of Distances (RD) and g. Visual analysis was somewhat unreliable and should be combined with statistical complements to maximize decision accuracy.
Journal of Applied Behavior Analysis, 2022 · doi:10.1002/jaba.938