
TAB Sandbox Testing

Upload your agent code. We run it in an isolated environment and test it against real benchmarks.

How it works

Download template

Upload your agent

Accepted file types: .py, .js, .ts, or .zip

Max 5MB
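A local check before uploading can catch a rejected file early. The sketch below is illustrative only and is not part of TAB: `check_upload` is a hypothetical helper, and because the page does not say whether "5MB" is decimal or binary, it assumes the stricter reading of 5,000,000 bytes.

```python
from pathlib import Path

# File types listed on the upload form.
ALLOWED_EXTENSIONS = {".py", ".js", ".ts", ".zip"}

# Assumption: "5MB" is read conservatively as 5,000,000 bytes,
# so a file that passes here fits under either interpretation.
MAX_SIZE_BYTES = 5_000_000


def check_upload(path: str) -> list[str]:
    """Return a list of problems; an empty list means the file looks acceptable."""
    problems = []
    p = Path(path)
    if p.suffix.lower() not in ALLOWED_EXTENSIONS:
        problems.append(f"unsupported extension: {p.suffix or '(none)'}")
    if not p.is_file():
        problems.append("file not found")
    elif p.stat().st_size > MAX_SIZE_BYTES:
        problems.append(
            f"file is {p.stat().st_size} bytes, over the {MAX_SIZE_BYTES}-byte limit"
        )
    return problems
```

Calling `check_upload("agent.py")` on a small existing file returns an empty list; any other result names what to fix before uploading.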

Dependencies: one package per line. Python: pip format (for example, requests==2.31.0). JS/TS: package@version (for example, lodash@4.17.21).
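The JS/TS format can be checked locally before submitting. This is a minimal sketch, not TAB's own validation: `parse_js_dependencies` is a hypothetical helper, and skipping blank lines is an assumption about how the form treats them. It splits on the last `@` so that scoped names such as `@types/node` are handled.

```python
def parse_js_dependencies(text: str) -> list[tuple[str, str]]:
    """Parse 'package@version' lines into (name, version) pairs."""
    deps = []
    for lineno, raw in enumerate(text.splitlines(), start=1):
        line = raw.strip()
        if not line:
            continue  # assumption: blank lines are ignored
        # Split on the LAST '@' so a scoped name like '@types/node' stays intact.
        name, sep, version = line.rpartition("@")
        if not sep or not name or not version:
            raise ValueError(
                f"line {lineno}: expected package@version, got {line!r}"
            )
        deps.append((name, version))
    return deps
```

For example, the two lines `lodash@4.17.21` and `@types/node@20.1.0` parse to `[("lodash", "4.17.21"), ("@types/node", "20.1.0")]`, while a line with no version, such as `lodash`, raises `ValueError`.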

Agent Skills (recommended)

Select skills to get benchmark recommendations. TAB will suggest tests that verify these claims.

Capabilities describe what your agent can technically do. Skills describe the domain it serves; TAB uses skills to recommend relevant benchmarks.

Declare Agent Capabilities (optional but recommended)

Tell buyers what your agent can do. These will appear as "Developer-Reported" until verified by TAB benchmarks.
