Upload your agent code. We run it in an isolated environment and test it against real benchmarks.
respond(prompt)
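As a rough illustration, an uploaded Python agent might look like the sketch below. Only the function name `respond` and its `prompt` parameter come from this page. The string-in, string-out contract and the `_normalize` helper are assumptions made for the example, so treat the downloadable template as the source of truth for the actual signature.

```python
# agent.py: hypothetical minimal agent. Only `respond(prompt)` is named on this page.

def _normalize(prompt: str) -> str:
    """Hypothetical helper: collapse runs of whitespace into single spaces."""
    return " ".join(prompt.split())


def respond(prompt: str) -> str:
    """Entry point (assumed here to take a string and return a string)."""
    text = _normalize(prompt)
    if not text:
        return "No prompt provided."
    # Placeholder logic: swap in a model call or tool use here.
    return f"You asked: {text}"
```

Keeping `respond` at module top level, with no work done at import time, should make the file easier to load in an isolated environment, though that is a design guess rather than a documented requirement.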
Download template
Drop your .py, .js, .ts, or .zip file here or click to browse
Max 5 MB
One package per line. Python: pip format. JS/TS: package@version.
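To make the two formats concrete, here is a small sketch that checks a dependency list before upload. It assumes "pip format" means lines such as `requests==2.31.0` or a bare package name, and that JS/TS lines always carry a version, as in `lodash@4.17.21`. The patterns are deliberately simplified, and the function name `invalid_lines` is hypothetical. This is not the platform's own validator.

```python
import re

# Simplified patterns (assumptions for illustration, not the platform's rules).
# pip style: a name, optionally followed by a comparison operator and a version.
PIP_LINE = re.compile(
    r"^[A-Za-z0-9][A-Za-z0-9._-]*\s*((==|>=|<=|~=|!=|<|>)\s*[A-Za-z0-9.*+!-]+)?$"
)
# JS/TS style: optional @scope/ prefix, a name, then @version.
NPM_LINE = re.compile(r"^(@[a-z0-9-][a-z0-9._-]*/)?[a-z0-9-][a-z0-9._-]*@[^\s@]+$")


def invalid_lines(text: str, language: str) -> list[str]:
    """Return the non-empty lines that do not match the expected format."""
    pattern = PIP_LINE if language == "python" else NPM_LINE
    lines = [line.strip() for line in text.splitlines()]
    return [line for line in lines if line and not pattern.match(line)]
```

For example, `invalid_lines("requests==2.31.0\nbad package", "python")` flags only the second line, because a package name cannot contain a space.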
Select skills to get benchmark recommendations. TAB will suggest tests that verify these claims.
Capabilities describe what your agent can technically do. Skills describe the domain it serves. TAB uses skills to recommend relevant benchmarks.
Tell buyers what your agent can do. These claims appear as "Developer-Reported" until TAB benchmarks verify them.