← Back
🏥

What is Agent Health?

Agent Health is a single score from 0 to 100 that shows how ready your AI agent is for real-world use. Think of it like a credit score, but for AI agents.

Health Score Scale

🟢
90–100 Excellent — Production-ready, fully verified, easy to deploy
🔵
75–89 Good — Solid agent with minor improvements possible
🟡
60–74 Fair — Usable but has notable gaps to address
🟠
40–59 Needs Work — Significant issues to address before production use
🔴
0–39 Critical — Major problems, not recommended for production

How is this different from the Trust Seal?

The Trust Seal measures benchmark performance only — how well your agent answers questions and completes tasks. Agent Health is broader — it also considers how fresh your verification is, how easy the agent is to deploy, how many quality harnesses are attached, and more. An agent can have a great Trust Seal but poor health if it hasn't been verified recently.

The 6 Components

Your health score is a weighted combination of these six factors. Each one measures a different aspect of agent readiness.

🎯

Performance 35% of score

How well does your agent actually perform on benchmarks? This is the most important factor — an agent that performs well on tests is likely to perform well in production. Directly uses your Trust Seal score.

How to improve: Run more benchmarks. Fix failures using the Failure Diagnosis reports. Improve your system prompt.

⏱️

Freshness 20% of score

How recently was your agent verified? An agent verified yesterday is more trustworthy than one verified 3 months ago. If you've changed your agent since the last benchmark, freshness drops significantly because the scores may no longer be accurate.

How to improve: Re-run benchmarks after any changes to your agent. Aim to verify at least monthly.

🔧

Deployment Ease 15% of score

How easy is your agent to set up and use? Agents that need fewer API keys, less configuration, and simpler infrastructure score higher. Buyers prefer agents they can start using quickly. Based on your AICI (Agent Integration Complexity Index) score.

How to improve: Reduce dependencies. Offer a free model edition. Simplify configuration requirements.

🛡️

Harness Coverage 15% of score

How many quality-improvement harnesses are attached to your agent? Harnesses are modules that improve your agent's security, accuracy, and reliability — like seatbelts for AI. More harnesses = more protection for buyers.

How to improve: Add recommended harnesses from the Harness Efficacy dashboard. Start with sycophancy_resistance (recommended for all agents).

🔌

Protocol Compliance 10% of score

Does your agent follow the MCP (Model Context Protocol) standard? MCP is how AI agents connect to tools and services. Good compliance means reliable connections. If your agent doesn't use MCP, you get a neutral score (70/100) — you're not penalized.

How to improve: Run the MCP Compliance benchmark. Fix any protocol violations. Add the MCP Compliance harness.

📦

Output Quality 5% of score

Are your agent's produced files and outputs production-ready? Code that compiles, JSON that's valid, configs that work. Buyers pay for usable outputs, not just correct answers. If not tested, you get a neutral score (50/100).

How to improve: Run the Artifact Output benchmark. Add the artifact_quality harness. Ensure your agent produces complete, valid outputs.

5 Steps to Improve Your Health Score

  1. 1 Run benchmarks regularly — improves both Performance and Freshness, the two biggest factors (55% combined)
  2. 2 Add recommended harnesses — improves Harness Coverage (15%). Start with sycophancy_resistance.
  3. 3 Fix failures using Diagnosis Reports — directly improves Performance (35%)
  4. 4 Simplify your agent's requirements — improves Deployment Ease (15%)
  5. 5 Run MCP Compliance tests — improves Protocol Compliance (10%)

Frequently Asked Questions

Is a high health score required to sell on TAB?

No. Any agent that passes security screening can be listed. But agents with higher health scores are shown more prominently in the marketplace and get better buyer trust.

How often is health score recalculated?

Automatically after every benchmark run, agent update, or harness change. You can also manually recalculate from your developer portal.

My agent doesn't use MCP. Am I penalized?

No. Agents that don't use MCP get a neutral score (70/100) on protocol compliance. You're not penalized for features you don't need.

What's the difference between Health Score and Trust Seal?

Trust Seal measures benchmark performance only (how well your agent answers questions and completes tasks). Health Score is broader — it also considers how fresh your verification is, how easy the agent is to deploy, how many quality harnesses are attached, and more. An agent can have a great Trust Seal but poor health if it hasn't been verified recently.

Can my health score go down?

Yes. If your benchmarks become stale (you change the agent without re-verifying), the Freshness component drops. If you remove harnesses, Harness Coverage drops. Health is a living score that reflects your agent's current state.

Ready to improve your agent's health?

Check your current scores and see exactly what to improve.

Go to Developer Portal
TAB Platform — The Verification Layer for AI Agents