Assessment & Research

Quantification of behavioral data with effect sizes and statistical significance tests

Costello et al. (2022) · Journal of Applied Behavior Analysis

★ The Verdict

Add Tau z and RD effect-size calculations to your next SCED manuscript—visual inspection alone isn’t reliable enough.

✓ Read this if BCBAs who write or review single-case studies for journals, theses, or grant reports.

✗ Skip if Practitioners who only read graphs for daily clinical choices and never publish.

01Research in Context

What this study did

Costello et al. (2022) compared how well different ways of reading single-case graphs agree with each other.

They looked at visual inspection, two effect-size numbers (RD and g), and two statistical tests (Tau z and PWD).

The team ran these metrics on sets of graphs that already existed and checked how often they told the same story.

What they found

The numbers matched each other: Tau z, PWD, RD, and g all pointed the same way.

Visual inspection alone was shaky; different viewers did not always agree.

Adding the small math steps made the decision clearer and more repeatable.

How this fits with other research

Carlin et al. (2022) ran a simulation the same year and reached the same rule: use Tau to spot an effect, then RD or g to size it.

The two papers together form a direct replication, one with real data and one with fake data, giving the same advice.

Dowdy et al. (2021) review shows the field is already pooling these exact metrics in meta-analyses, so the practice is ready for journals.

Falligant et al. (2020) tested a different aid (fail-safe k) and also warned: numbers help, but check them before trusting your eyes.

Why it matters

You no longer need to choose between eyes and numbers. Run Tau z for a quick yes/no, add RD to show how big, and write both in your paper. Reviewers expect it, and your graphs will convince faster.

FREE CEUs

Get CEUs on This Topic — Free

The ABA Clubhouse has 60+ on-demand CEUs including ethics, supervision, and clinical topics like this one. Plus a new live CEU every Wednesday.

✓ 60+ on-demand CEUs (ethics, supervision, general)

✓ New live CEU every Wednesday

✓ Community of 500+ BCBAs

✓ 100% free to join

Join The ABA Clubhouse — Free →

→ Action — try this Monday

Open your last graph, plug the data into a free Tau z calculator, and paste the number into the caption.

02At a glance

Intervention

not applicable

Design

methodology paper

Finding

not reported

03Original abstract

This article describes the use of statistical significance tests and distance-based effect sizes with behavioral data from single case experimental designs (SCEDs). Such data often are interpreted only with visual analysis. However, a growing movement in the field is to quantify results to improve decision-making and communication across studies and sciences. The goal of the present study was to assess the agreement between visual analysis and various statistical tests. We recruited visual analysts to judge 160 pairwise data sets from published articles and compared these analyses to significance tests and effect sizes. One-tailed significance testing of Tau z and the percentage of pairwise differences in the predicted direction (PWD) generally agreed with each other, and complemented the effect sizes of Ratio of Distances (RD) and g. Visual analysis was somewhat unreliable and should be combined with statistical complements to maximize decision accuracy.

Journal of Applied Behavior Analysis, 2022 · doi:10.1002/jaba.938