🧠

Cognitive Load Benchmark

Tests LLM performance under cognitive load with 120+ tests from 3 sources: TAB (context saturation + interrupts), ICE-methodology (arXiv 2509.19517), and Working Memory Stress (N-Back + Dual-Task). Measures score degradation under load.

--
Total Tasks
3
Sources
--
Categories
3
Load Levels
--
Paradigms
--
LLM Status
Run Benchmark