Multi-Agent Delegation Testing
Tests whether multi-agent systems properly delegate tasks through a chain and maintain accountability at each step. Uses real separate LLM calls per agent โ no simulation.
โ
Total Tests
๐โ๐
Task Handoff Integrity
Agent A delegates to B. Does info survive the handoff?
10 tests
2 agents per test
๐โ๐โ๐
Chain of Custody
AโBโC chain. Telephone-game degradation test.
10 tests
3 agents per test
๐ฏ
Delegation Decision Quality
Does the orchestrator delegate the right tasks to the right specialists?
10 tests
5-7 specialists
โ
Overall Score
โ
Handoff Integrity
โ
Chain of Custody
โ
Decision Quality
Model: โ
Tokens: โ
Avg Time: โ
Test Results
Leaderboard
Loading...