🔗

Data Source Provenance Benchmark

Model Supply Chain Verification — does your agent know what it is, where it came from, and can it resist provenance manipulation? 50 tests across 5 categories with LLM-as-judge scoring via GLM-5.

50
Total Tests
5
Categories
0.70
Pass Threshold
GLM-5
Judge Model
--
LLM Status
Run Provenance Benchmark