Pricing — Agent Jig

Free

forever

For individuals and side projects exploring agent evals for the first time.

50 evals / month
Up to 10 scenarios per file
Pass/fail scoring
CLI runner
Community support

Get started free

Common questions

What counts as an eval?

One eval = one scenario run against your agent. If you have 20 scenarios and run your eval suite once, that's 20 evals.

What frameworks does Agent Jig support?

Any framework — LangChain, LlamaIndex, CrewAI, AutoGen, or fully custom agents in Python or Node. If your agent has an HTTP endpoint or can be called as a subprocess, it works.

Do you store my agent's outputs?

On the Pro plan, eval outputs are retained for 90 days to power regression diffs and score history. You can delete your data at any time. Enterprise customers can choose private cloud deployment.

Can I run evals locally without sending data to Agent Jig?

Yes. The CLI runner can operate fully local — evals run on your machine, scores stay on your machine. The cloud dashboard is optional.

What's the trial period for Pro?

14 days free, no credit card required. If you haven't set up CI integration by day 14, we'll extend it — we want you to see the value before you pay.