template · Python · Markdown · 11 KB

RAG eval harness starter · Python + Markdown

The eval framework scaffold we drop into RAG projects in week one. Giskard + promptfoo + custom metrics.

format: Python · Markdown
size: 11 KB
service: AI solutions

description

Starter scaffold for RAG system evaluation: 5 metric classes (faithfulness, context precision, answer relevance, bias, injection resistance), CI integration, diff reporting. MIT-licensed.
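To make "metric class" concrete, here is a minimal sketch of what a context-precision metric could look like. It is not the template's actual code: `EvalCase`, `Metric`, and `ContextPrecision` are hypothetical names, and the simple "share of retrieved chunks labeled relevant" definition is an assumption (rank-weighted variants are also common).

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Protocol, Sequence


@dataclass(frozen=True)
class EvalCase:
    """One evaluation example (hypothetical shape, not the template's schema)."""
    question: str
    answer: str
    retrieved_ids: Sequence[str]   # chunk IDs returned by the retriever
    relevant_ids: frozenset[str]   # chunk IDs a labeler marked as relevant


class Metric(Protocol):
    """Assumed common interface: every metric maps a case to a score in [0, 1]."""
    name: str

    def score(self, case: EvalCase) -> float: ...


class ContextPrecision:
    """Fraction of retrieved chunks that are labeled relevant."""
    name = "context_precision"

    def score(self, case: EvalCase) -> float:
        if not case.retrieved_ids:
            return 0.0  # nothing retrieved: treat as worst case
        hits = sum(1 for cid in case.retrieved_ids if cid in case.relevant_ids)
        return hits / len(case.retrieved_ids)
```

Keeping every metric behind one `score(case)` method would let a harness loop over faithfulness, relevance, bias, and injection checks the same way, whatever each one does internally.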


what's inside (5 items)

01 · 5 metric classes with Python code
02 · Giskard + promptfoo integration
03 · Injection-resistance eval suite (80+ prompts)
04 · CI step YAML (GitHub Actions + GitLab CI)
05 · Diff-report generator per build
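As an illustration of the per-build diff report, the sketch below compares metric scores from a baseline build against the current one and flags regressions. The function names, the row format, and the 0.02 tolerance are all assumptions made for the example, not the generator shipped in the template.

```python
from __future__ import annotations


def diff_scores(
    baseline: dict[str, float],
    current: dict[str, float],
    tolerance: float = 0.02,  # assumed noise band; tune per metric
) -> list[dict]:
    """Return one row per metric with its delta and a status label."""
    rows = []
    for name in sorted(set(baseline) | set(current)):
        old, new = baseline.get(name), current.get(name)
        delta = None
        if old is None:
            status = "added"      # metric exists only in the current build
        elif new is None:
            status = "removed"    # metric exists only in the baseline
        else:
            delta = new - old
            if delta < -tolerance:
                status = "regressed"
            elif delta > tolerance:
                status = "improved"
            else:
                status = "unchanged"
        rows.append({"metric": name, "old": old, "new": new,
                     "delta": delta, "status": status})
    return rows


def has_regression(rows: list[dict]) -> bool:
    """True if any metric dropped by more than the tolerance."""
    return any(row["status"] == "regressed" for row in rows)
```

In a CI step, a wrapper script could print the rows and exit non-zero when `has_regression(rows)` is true, so a drop in any metric fails the build.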

bespoke version

Want a custom version?

A tailored audit or template delivered in 2 weeks · DField Solutions, Budapest.
