llm-redteaming

star 2

Use when testing LLM applications or agentic systems for prompt injection, jailbreaks, tool abuse, data leakage, and eval regressions. Uses garak/promptfoo and the configured MCP servers; writes all artifacts under /pentest/.

mayflower

By mayflower schedule Updated 2/7/2026

play_arrow Run Skill in Manus View GitHub

name: llm-redteaming description: Use when testing LLM applications or agentic systems for prompt injection, jailbreaks, tool abuse, data leakage, and eval regressions. Uses garak/promptfoo and the configured MCP servers; writes all artifacts under /pentest/. license: MIT metadata: owner: mayflowergmbh/kali-ai-redteam

LLM Red Teaming Runbook

Inputs To Collect

Target base URL / API endpoint(s)
Auth method (key, cookie, OAuth), rate limits, and scope constraints
Model/provider (if known) and tool/plugin surface (if any)

Baseline

Run /recon <target-url> and save output under /pentest/recon/.
Record request/response schema (including streaming) and any moderation behavior.

Automated Coverage

garak for probe-driven scanning; save to /pentest/scans/garak-*.
promptfoo for eval suites/redteam runs; save to /pentest/scans/promptfoo-*.

Manual Exploitation

Use /probe for quick injections and /extract for system prompt extraction attempts.
Use Playwright MCP for headless flows (login, multi-step prompt injection, indirect injection via UI content).
Persist key artifacts in /pentest/prompts/ and /pentest/exploits/ (full payload + full response).

Reporting

Map findings to OWASP LLM Top 10 categories.
Include exact repro steps and expected/actual behavior.

Install via CLI

npx skills add https://github.com/mayflower/kali-ai-redteam --skill llm-redteaming

Repository Details

star Stars 2

call_split Forks 0

navigation Branch main

article Path SKILL.md

More from Creator

mayflower

mayflower Explore all skills →