evaluating-with-promptfoo

star 1

LLM evaluation and red-teaming toolkit using promptfoo. Covers promptfooconfig.yaml configuration, 40+ assertion types (deterministic, model-graded, RAG), provider setup (OpenAI, Anthropic, Google, Ollama, HTTP, custom JS/Python), red teaming (134+ plugins, jailbreak strategies, compliance frameworks), CLI commands, caching, and CI/CD integration. Use when writing promptfooconfig.yaml, designing LLM test suites, running adversarial red team evaluations, or integrating LLM quality gates in CI/CD. Detects: promptfooconfig.yaml or promptfoo in package.json. For general LLMOps operations, use designing-genai-patterns. For general test methodology (TDD/AAA), use testing-code.

sumik5 By sumik5 schedule Updated 6/7/2026

Skill instructions (SKILL.md) could not be loaded from local cache or raw GitHub repository.

Install via CLI
npx skills add https://github.com/sumik5/sumik-claude-plugin --skill evaluating-with-promptfoo
Repository Details
star Stars 1
call_split Forks 0
navigation Branch main
article Path SKILL.md
More from Creator