Assessment & Research

Temporal dynamics of early child-clinician prosodic synchrony predict one year autism intervention outcomes using AI driven affective computing.

Bertamini et al. (2025) · Scientific Reports 2025
★ The Verdict

Early voice harmony between preschooler and clinician, caught by AI, predicts later developmental gains.

✓ Read this if BCBAs running naturalistic early-intervention sessions with preschoolers with autism.
✗ Skip if Clinicians who only use table-top DTT or work with non-speaking older youth.

01Research in Context

01

What this study did

The team recorded the first 10 minutes of naturalistic therapy sessions with preschoolers who have autism.

An AI tool measured how the child’s and clinician’s voices rose, fell, and paused together.

They asked: does this early “prosodic synchrony” predict developmental gains one year later?

02

What they found

Kids whose voices synced better early on showed stronger language and social growth a year later.

More voice variety, smoother turn-taking, and shared emotional tone all pointed to better outcomes.

03

How this fits with other research

Lin et al. (2026) saw low affective synchrony in babies who were later diagnosed. Bertamini moves the same idea into therapy: low preschool synchrony now flags risk of poor progress.

Au-Yeung et al. (2015) linked parent-child skin-synchrony to emotional attunement. The new study swaps parents for clinicians and skin for voice, showing the pattern still holds.

Schertz et al. (2016) meta-analysis found clinician-plus-parent programs give the biggest language gains. Bertamini adds a timing clue: the very first minutes of clinician talk set the pace.

Trembath et al. (2019) tracked vocal ups and downs across months. Here, one early snapshot of voice harmony forecasts the whole year.

04

Why it matters

You can now “listen ahead” without extra tests. If the child and clinician sound like they are singing the same tune, keep doing what you are doing. If not, pause and add more pauses, imitation, or shared affect right away. The AI gives you an early red flag before months of slow progress show up in your data.

Free CEUs

Want CEUs on This Topic?

The ABA Clubhouse has 60+ free CEUs — live every Wednesday. Ethics, supervision & clinical topics.

Join Free →
→ Action — try this Monday

Record the first 5 minutes of session audio, count smooth turn-taking and shared pitch rises; if low, add more vocal mirroring and shared affect before moving to demands.

02At a glance

Intervention
not applicable
Design
pre post no control
Sample size
25
Population
autism spectrum disorder
Finding
positive

03Original abstract

The patient-therapist interpersonal dynamics is a cornerstone of psychotherapy, yet how it shapes clinical outcomes remains underexplored and difficult to quantify. This is also true in autism, where interpersonal interplay is recognized as an active element of intervention. Moreover, behavioral research is time-consuming and labor-intensive, limiting its translational applications. We studied 25 autistic preschoolers (17 therapists) across two naturalistic 60-minute sessions of developmental intervention at baseline and after three months (50 videos total). Clinical outcomes were assessed at baseline and one year into intervention. We developed a fully automated pipeline combining deep learning and affective computing to: (i) segment full-session audio recordings, (ii) model child-clinician acoustic synchrony using nonlinear metrics grounded in complex systems theory, and (iii) predict long-term response from early synchrony patterns. Changes in early synchrony dynamics predicted clinical response. Better outcomes were associated with synchrony patterns reflecting increased variability, predictability, and self-organization alongside prosodic features linked to emotional engagement. Our scalable, non-invasive system enables large-scale, objective measurement of therapy dynamics. In autism, our findings emphasize the importance of early interpersonal synchrony and emotional engagement as active drivers of developmental change. Our approach captures the full dynamics of entire therapy sessions, providing a richer, ecologically valid view of interpersonal synchrony. The online version contains supplementary material available at 10.1038/s41598-025-17057-3.

Scientific Reports, 2025 · doi:10.1038/s41598-025-17057-3