By Matt Harrington, BCBA · Behaviorist Book Club · Research-backed answers for behavior analysts
A BCBA performance scorecard should cover the major responsibility domains of the role at your specific agency: clinical assessment (quality and completeness of FBAs, functional communication training assessments, skill assessments), treatment planning (behavior intervention plan quality, goal alignment with assessment findings, treatment integrity plan documentation), supervision (frequency and quality of technician feedback, supervisee development plan implementation, observation-to-feedback latency), family collaboration (parent training delivery, family communication quality and timeliness, treatment acceptability assessment), documentation (accuracy, timeliness, regulatory compliance), and professional conduct (meeting attendance, ethics compliance, consultation-seeking behavior). Each domain should have specific behavioral indicators, performance criteria, and an assessment frequency. Scorecards that only measure documentation completion miss the clinical dimensions that most directly affect client outcomes.
Appropriate feedback reception includes: acknowledging receipt of the feedback without immediately arguing against it, asking clarifying questions to understand the specific behavior and its impact rather than seeking loopholes, expressing specific commitment to a behavior change rather than general agreement, and following through on that commitment in subsequent observed performance. Teaching this behavior requires modeling the target response, providing practice opportunities with role-play scenarios featuring both easy and emotionally activating feedback content, delivering feedback on the feedback reception behavior itself (behavioral feedback about feedback behavior is recursive but necessary), and reinforcing improvement in feedback reception specifically and consistently. Extinction of defensive responses — not withdrawing the feedback when defensiveness occurs — is the essential extinction component.
Upward and peer feedback requires the same behavioral specificity as downward feedback, with additional attention to the framing that makes the feedback receivable across a power differential. Center the feedback on client welfare or shared professional standards rather than personal criticism: 'I observed something in [situation] that I think has implications for [client name]'s treatment and I want to share it' is different from 'you did this wrong.' Use collaborative framing that positions you and the colleague as both invested in quality rather than adversarial: 'I may be missing context here — can you help me understand...' If the concern is about Ethics Code compliance, the process becomes more formal regardless of organizational hierarchy.
Assessment frequency should be tied to the consequences of performance variation and the pace of development for the role. For new staff in their first 90 days, monthly scorecard completion provides frequent enough data to identify and address gaps while development is most plastic. For established staff, quarterly is typically sufficient to track trends and identify emerging concerns. Self-assessment should be completed by the staff member before or simultaneously with supervisor assessment, and discrepancies between self-ratings and supervisor ratings are among the most valuable supervisory data the process generates. Large consistent discrepancies in either direction — the supervisor rating substantially higher or lower than the self-assessment — are important conversation starters about self-awareness and perceptual accuracy.
Defensive responses to feedback typically serve one of several functions: escape from the aversive private experience of perceived criticism or failure (maintained by negative reinforcement when the supervisor backs down), attention or social validation from peers who witness the exchange, or the automatic function of reducing conditioned physiological arousal associated with evaluation. Understanding the function matters because the intervention differs. Escape-maintained defensiveness requires extinction (the supervisor does not back down) plus functional communication training (what to say instead). Attention-maintained defensiveness requires differential reinforcement of alternative behavior plus reduced social attention for defensive responses. Automatic-function defensiveness may benefit from physiological regulation strategies alongside behavioral training.
Inter-rater reliability is the technical answer, and it requires investment: supervisors should regularly calibration-code the same staff performance against the scorecard and compare their ratings, discuss discrepancies, and refine operational definitions until ratings are consistent. Calibration sessions are not bureaucratic overhead — they are quality assurance for the assessment system. Beyond technical consistency, organizational culture shapes how scorecards are used: when leadership models scorecard use as a development tool rather than a discipline mechanism, supervisors across the agency are more likely to implement it that way. When scorecards are pulled out only for performance improvement plans, the tool becomes associated with punishment rather than development.
Self-monitoring shifts the locus of performance regulation from external to internal stimulus control, which is both more efficient and more professionally sustainable. A BCBA who accurately self-monitors against explicit performance criteria can identify and correct performance gaps in real time, without waiting for a supervisor's observation cycle. This is not the same as avoiding accountability — it is the development of the professional self-regulation that allows competent independent practice. Scorecards support this by providing the operational definitions against which self-monitoring can occur: without specific criteria, self-monitoring produces impressions rather than behavioral data. The combination of specific criteria, regular self-assessment, and supervisor calibration produces the most robust performance feedback system.
Research across organizational contexts consistently shows that feedback quality is among the strongest predictors of employee engagement and retention, outperforming compensation in many studies when a threshold of adequate pay is exceeded. Staff who understand what is expected, who receive timely and specific feedback about their performance, and who can see a clear connection between their behavior and their professional advancement are more likely to stay than staff who feel they are performing in the dark. In ABA specifically, the motivating operations around staff retention are particularly high: turnover disrupts client relationships, degrades treatment integrity during transition periods, and forces agencies into repeated onboarding cycles that consume the resources that could otherwise support clinical quality.
Scorecards for RBTs should focus on the behavioral dimensions most directly relevant to their clinical role: treatment integrity across the procedures they are assigned to implement (measured via systematic observation), data recording accuracy (verified through inter-rater reliability checks against supervisor or peer recordings), session management quality (learner engagement, pacing, transition management), family interaction conduct, and professional reliability metrics (punctuality, communication responsiveness, documentation timeliness). Criteria should be achievable at the expected competency level for the technician's tenure, with differentiated expectations for new, developing, and established staff. Sharing the scorecard at hire and revisiting it in each supervision meeting as a shared reference document — rather than producing it only during formal reviews — embeds it as a normal part of the supervisory conversation.
Feedback describes specific observed behavior and its relationship to a defined standard: 'In this session, you delivered the reinforcer on 7 of 12 correct responses — the standard is 100% of correct responses, and I want to help you get there.' Evaluation renders a judgment about the person: 'You are not consistent enough in your reinforcement delivery.' The distinction matters for feedback reception because evaluation activates threat appraisal — the recipient must defend their self-concept, not just correct a behavior. Feedback invites a behavioral response: what to do differently. Training supervisors to deliver feedback rather than evaluation, and training supervisees to recognize and request specific behavioral feedback rather than character assessments, reduces the defensive responding that makes feedback conversations adversarial rather than developmental.
The ABA Clubhouse has 60+ on-demand CEUs including ethics, supervision, and clinical topics like this one. Plus a new live CEU every Wednesday.
Ready to go deeper? This course covers this topic with structured learning objectives and CEU credit.
Achieving Greatness: Breakfast of Champions — Paula Antonelli · 1 BACB Supervision CEUs · $19.99
Take This Course →1 BACB Supervision CEUs · $19.99 · BehaviorLive
Research-backed educational guide with practice recommendations
Side-by-side comparison with clinical decision framework
All behavior-analytic intervention is individualized. The information on this page is for educational purposes and does not constitute clinical advice. Treatment decisions should be informed by the best available published research, individualized assessment, and obtained with the informed consent of the client or their legal guardian. Behavior analysts are responsible for practicing within the boundaries of their competence and adhering to the BACB Ethics Code for Behavior Analysts.