Zero-dependency eval harness for LLM and agent regression testing. Scores outputs with exact, contains, regex, JSON, citation, and token-F1 checks. Compares two runs to flag regressions.
[email protected] is safe to use (health: 56/100)
Get this data programmatically — free, no authentication.
curl https://depscope.dev/api/check/pypi/ai-eval-forgeLast updated · 2026-04-24T07:16:54.072489Z