Token Waste Audit Benchmark

Measures whether a model gets to the answer efficiently or burns tokens on preambles, repetition, hedging, retries, dead-end reasoning, and formatting overhead.

40
Tasks
5
Task Types
8
Waste Categories
0
Judge Calls
--
LLM Status
Run Token Waste Audit