evaluating-with-promptfoo

star 1

LLM evaluation and red-teaming toolkit using promptfoo. Covers promptfooconfig.yaml configuration, 40+ assertion types (deterministic, model-graded, RAG), provider setup (OpenAI, Anthropic, Google, Ollama, HTTP, custom JS/Python), red teaming (134+ plugins, jailbreak strategies, compliance frameworks), CLI commands, caching, and CI/CD integration. Use when writing promptfooconfig.yaml, designing LLM test suites, running adversarial red team evaluations, or integrating LLM quality gates in CI/CD. Detects: promptfooconfig.yaml or promptfoo in package.json. For general LLMOps operations, use designing-genai-patterns. For general test methodology (TDD/AAA), use testing-code.

By sumik5 schedule Updated 6/7/2026

play_arrow Run Skill in Manus View GitHub

Skill instructions (SKILL.md) could not be loaded from local cache or raw GitHub repository.

Install via CLI

npx skills add https://github.com/sumik5/sumik-claude-plugin --skill evaluating-with-promptfoo

Repository Details

star Stars 1

call_split Forks 0

navigation Branch main

article Path SKILL.md