Tutorial: Lessons Learned for Behavior Analysts from Data Scientists
Clean and lock your session data, then team up with a data scientist to let machine-learning models reveal hidden behavioral patterns.
01Research in Context
What this study did
Neely et al. (2024) wrote a how-to guide for behavior analysts who want to use data-science tools.
The authors explain how to clean, protect, and format large sets of session data.
They show where to find a data-science partner and which steps turn raw numbers into machine-learning models.
What they found
The paper does not test a treatment. Instead, it lists lessons the authors learned while working with big data sets.
Key advice: tidy data first, lock up private information, and let the algorithm look for hidden patterns.
How this fits with other research
Slanzi et al. (2024) also published a 2024 tutorial, but on the Countee app for single-case data entry. Both papers push digital tools, yet Neely aims at huge agency-wide files while Slanzi targets one-client sheets.
Moeyaert et al. (2020) teach multilevel meta-analysis of single-case designs. Neely extends that idea by showing how machine-learning models can spot trends across thousands of cases without hand-coding each graph.
Burney et al. (2023) argue for adding qualitative interviews to behavior analysis. That sounds opposite to Neely’s number-heavy path, but the two calls fit together: numbers tell you what happens, interviews tell you why clients care.
Why it matters
If you store every trial in one messy spreadsheet, you already have big data. This paper gives you a checklist to make that spreadsheet model-ready. Start small: pick one month of session data, remove extra spaces, add a client-ID column, and save a locked copy. Then ask a local data-science student to run a simple clustering script. You may discover response patterns you never saw by eye.
Want CEUs on This Topic?
The ABA Clubhouse has 60+ free CEUs — live every Wednesday. Ethics, supervision & clinical topics.
Join Free →Export last month’s session data, delete blank rows, add one consistent date column, and save the file in a password-protected folder labeled ‘ML-ready.’
02At a glance
03Original abstract
Big data is a computing term used to refer to large and complex data sets, typically consisting of terabytes or more of diverse data that is produced rapidly. The analysis of such complex data sets requires advanced analysis techniques with the capacity to identify patterns and abstract meanings from the vast data. The field of data science combines computer science with mathematics/statistics and leverages artificial intelligence, in particular machine learning, to analyze big data. This field holds great promise for behavior analysis, where both clinical and research studies produce large volumes of diverse data at a rapid pace (i.e., big data). This article presents basic lessons for the behavior analytic researchers and clinicians regarding integration of data science into the field of behavior analysis. We provide guidance on how to collect, protect, and process the data, while highlighting the importance of collaborating with data scientists to select a proper machine learning model that aligns with the project goals and develop models with input from human experts. We hope this serves as a guide to support the behavior analysts interested in the field of data science to advance their practice or research, and helps them avoid some common pitfalls.
Perspectives on Behavior Science, 2024 · doi:10.1007/s40614-023-00376-z