Evaluating Tau-U With Oral Reading Fluency Data and the Impact of Measurement Error.
Tau-U effect sizes from reading-fluency data can be inflated by ceiling effects and measurement error—check your data before reporting.
01Research in Context
What this study did
Pitchford et al. (2019) looked at Tau-U, a popular effect-size number for single-case reading data. They asked: do ceiling effects and small measurement errors make the number look bigger than it is?
They ran math checks on oral-reading fluency scores. The paper is a methods study, so no kids were treated; the team just tested the tool itself.
What they found
Tau-U can swell when kids hit the top score on CBM-R passages. Tiny scoring mistakes also nudge the final number up or down.
The authors say: still use Tau-U, but eyeball your data first. If many points hug the ceiling or bounce from error, report the finding with caution.
How this fits with other research
Tarlow (2017) already offered a fix called Baseline Corrected Tau. That metric keeps scores inside clear bounds and irons out baseline drift. Pitchford et al. (2019) now give real reading-fluency proof that such a fix is needed, so the later paper supersedes the older one.
Stephens et al. (2018) likewise warn that small scoring rule changes (like when to stop digit span) can shift validity. Both papers tell you to check the fine print before trusting the number.
Zoia et al. (2019) show cultural context can tilt motor-test norms. Together these studies form a theme: no score is sacred—always validate the tool in your setting.
Why it matters
If you run single-case reading sessions, run a quick ceiling check before you brag about a big Tau-U. Plot the data, look for flat tops, and mention any measurement noise in your report. Better yet, consider switching to Baseline Corrected Tau for a cleaner picture of your intervention’s impact.
Get CEUs on This Topic — Free
The ABA Clubhouse has 60+ on-demand CEUs including ethics, supervision, and clinical topics like this one. Plus a new live CEU every Wednesday.
Graph your latest CBM-R data, flag any scores at or near the ceiling, and recompute Tau-U without those points to see if the effect shrinks.
02At a glance
03Original abstract
Recently, researchers have argued that using quantitative effect sizes in single-case design (SCD) research may facilitate the identification evidence-based practices. Indices to quantify nonoverlap are among the most common methods for quantifying treatment effects in SCD research. Tau-U represents a family of effect size indices that were developed to address criticisms of previously developed measures of nonoverlap. However, more research is necessary to determine the extent to which Tau-U successfully addresses proposed limitations of other nonoverlap methods. This study evaluated Tau-U effect sizes, derived from multiple-baseline designs, where researchers used curriculum-based measures of reading (CBM-R) to measure reading fluency. Specifically, we evaluated the distribution of the summary Tau-U statistic when applied to a large set of CBM-R data and assessed how the variability inherent in CBM-R data may influence the obtained Tau-U values. Findings suggest that the summary Tau-U statistic may be susceptible to ceiling effects. Moreover, the results provide initial evidence that error inherent in CBM-R scores may have a small but meaningful influence on the obtained effect sizes. Implications and recommendations for research and practice are discussed.
Behavior modification, 2019 · doi:10.1177/0145445518760174