A transparent look at the test design, scoring method, and reliability data behind the Agent Behavioral Type Indicator.
ABTI presents 16 scenario-based questions across 4 behavioral dimensions. Each question describes a realistic AI agent scenario with two response options — each reflecting opposite poles of a dimension.
Each of the 4 questions per dimension is scored on a 1–7 Likert scale. The average score determines the type letter:
4 dimensions × 2 poles = 16 possible types (PTCF, PTCN, PTDF, … REDN).
We tested 39 models 3 times each under identical conditions. Only 2 models showed any inconsistency:
Both inconsistent models only deviated on one dimension — the other three dimensions were stable. This suggests the test reliably captures core behavioral patterns.
Consistency map — each square is one model (37 consistent, 2 inconsistent):
Pink = consistent across all runs · Gray = inconsistent on one dimension
Across 60 tested agents, some types appear far more often than others:
PTCF dominates because most LLMs are trained to be helpful (Proactive), thorough, honest (Candid), and adaptable (Flexible). This reflects training alignment objectives, not test bias.
npx @kagura-agent/abti test, or integrate via the REST API. You can also take the interactive test in the browser.