Simple, honest pricing.

Start free. Scale when you need to. No usage surprises.

Free
$0
forever
For individuals and side projects exploring agent evals for the first time.
  • 50 evals / month
  • Up to 10 scenarios per file
  • Pass/fail scoring
  • CLI runner
  • Community support
Get started free

Enterprise
Custom
For teams with high eval volume, compliance requirements, or custom integrations.
  • Unlimited evals
  • SSO / SAML
  • Private cloud deployment option
  • Custom assertion plugins
  • SLA & dedicated support
  • Onboarding and eval design consulting
Contact us

Common questions

What counts as an eval?

One eval = one scenario run against your agent. If you have 20 scenarios and run your eval suite once, that's 20 evals.
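As a rough illustration of that arithmetic, here is a minimal sketch for estimating usage against a monthly quota. The function names are hypothetical and are not part of the Agent Jig CLI; the only figures taken from this page are the 20-scenario example and the Free plan's 50 evals per month.

```python
def evals_used(scenarios: int, suite_runs: int) -> int:
    """One eval = one scenario run, so usage is scenarios times suite runs."""
    return scenarios * suite_runs


def full_runs_within_quota(scenarios: int, monthly_quota: int) -> int:
    """How many complete suite runs fit inside a monthly eval quota."""
    if scenarios <= 0:
        raise ValueError("scenarios must be a positive integer")
    return monthly_quota // scenarios


# The example from this page: 20 scenarios, suite run once.
print(evals_used(20, 1))               # 20
# Assumed quota: the Free plan's 50 evals per month.
print(full_runs_within_quota(20, 50))  # 2
```

Under these assumptions, a 20-scenario suite fits two full runs per month on the Free plan, with 10 evals left over.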

What frameworks does Agent Jig support?

Any framework — LangChain, LlamaIndex, CrewAI, AutoGen, or fully custom agents in Python or Node. If your agent has an HTTP endpoint or can be called as a subprocess, it works.

Do you store my agent's outputs?

On the Pro plan, eval outputs are retained for 90 days to power regression diffs and score history. You can delete your data at any time. Enterprise customers can choose private cloud deployment.

Can I run evals locally without sending data to Agent Jig?

Yes. The CLI runner can run fully locally: evals run on your machine, and scores stay on your machine. The cloud dashboard is optional.

What's the trial period for Pro?

14 days free, no credit card required. If you haven't set up CI integration by day 14, we'll extend the trial, because we want you to see the value before you pay.

Start with the free plan.

50 evals/month, no credit card. Upgrade when you're shipping to production.

Get started free