Explore AI Agent Skills & Claude Prompts
Discover open-source agent skills for Claude Code, Codex, ChatGPT, and any tool that uses SKILL.md.
Enter through keywords, occupations, creators, and GitHub sources to see what kinds of skills are emerging across domains.
Use the same catalog through the API
Connect 381,784 public skills to your own search, analytics, or agent workflow with the REST API.
Querying local SQLite index...
truenas-ops
by drewid74Use this when: set up an SMB or NFS share, my ZFS pool shows errors, automate dataset snapshots, replicate data to another NAS, fix dataset permissions for Docker containers, my share is not accessible, migrate TrueNAS CORE to SCALE, tune storage for media or databases, add a cloud backup destination, my disk is failing, expand or replace a drive in the pool, set up offsite replication, TrueNAS, ZFS, ix-applications, SCALE 24.10, docker on TrueNAS, pool health check, dataset record size, NAS API, container bind mount permissions
arbitrage-audit
by drewid74Use this when: audit my business model for AI risk, where is my value at risk from AI, which parts of my business does AI threaten, arbitrage gap analysis, how durable is my competitive advantage, AI compression of my market, is my business model AI-proof, what gaps am I depending on, competitive moat analysis, where AI will eat my margin, assess my business against AI disruption, what happens to my business when AI improves, am I on the right side of AI disruption, strategic AI risk assessment
r8-analyzer
by drewid74Analyzes Android build files and R8 keep rules to identify redundancies, broad package-wide rules, and rules that subsume library consumer keep rules. Use when developers want to optimize their app's size, remove redundant or overly broad keep rules, or troubleshoot Proguard configurations.
llm-inference-stack
by drewid74Use this when: run a model locally, which model fits my GPU, my inference is too slow, serve LLM in production, how to quantize a model, too many concurrent users, self-host an AI, set up Ollama, vLLM vs Ollama, route between multiple models, VRAM out of memory, model won't load, tokens per second too low, pick a quantization format, LiteLLM gateway setup, what model fits 8GB VRAM, OpenAI-compatible local endpoint
karpathy-metric-pre
by drewid74Use this when: red-team my optimization metric, find ways to game my metric, metric pre-mortem, adversarial metric evaluation, gaming vectors for my KPI, what could an agent exploit in my metric, metric failure modes, proxy divergence risk, eval contamination, silent degradation from optimization, metric gaming pre-mortem, is my metric robust enough for auto-improvement, build secondary metrics, evaluation diversity plan, metric countermeasures, what does overfitting look like in my system, holdout scenario design, disappearance test, metric gaming attack surface, is my metric gameable, optimization agent failure modes, what could go wrong with my eval, unsupervised optimization risk, metric red team
karpathy-trace-infrastructure
by drewid74Use this when: audit my agent observability, do I have enough logging for auto-improvement, trace infrastructure readiness, can a meta-agent read my agent's reasoning, agent trace audit, do I have full reasoning traces, tool call logging, decision point visibility, structured trace format, session reproducibility for agents, agent harness version control, baseline snapshots for optimization, failure classification in traces, cost and latency tracking per agent step, sandboxed execution environment, agent evaluation harness, LangSmith setup, Braintrust setup, Arize setup, am I logging enough for a meta-agent, what traces does auto-improvement need, agent observability gaps, trace infrastructure gaps, auto-improvement infrastructure readiness, meta-agent trace requirements, agent replay from logs, is my logging sufficient for optimization loop
karpathy-triplet-diag
by drewid74Use this when: am I ready for auto-improvement, is my system ready for the Karpathy Loop, can I run autoresearch on this, evaluate auto-improvement readiness, define the editable surface, define the optimization metric, define the experiment time budget, write a program.md spec, build a program.md, optimization target spec, agent loop readiness, can an agent optimize this, is my pipeline ready for autoresearch, surface metric budget triplet, blocker report for auto-improvement, what's blocking my autoresearch loop, scope an optimization target, can I hand this to an agent loop, is this system loop-ready
homelab-commander
by drewid74Universal homelab infrastructure specialist. Docker self-hosting, TrueNAS ZFS operations, Proxmox VM/LXC management, K3s Kubernetes with FluxCD, 3-2-1 backup strategy, blue-green deployments, internal DNS (Pi-hole/Technitium), VLAN design, Home Assistant + MQTT, and cross-platform migration planning.
proxmox-k3s-infra
by drewid74Use this when: set up a homelab Kubernetes cluster, create a VM from a template, my K3s node won't join, set up GPU passthrough, automate VM provisioning with cloud-init, my IOMMU groups are wrong, back up VMs automatically, install K3s on a new node, deploy persistent workloads in Kubernetes, manage LXC containers, rebuild infrastructure from code, set up GitOps for my cluster, Proxmox, K3s, FluxCD, ArgoCD, Helm, Longhorn, PBS, Terraform proxmox, VLAN design, Ceph vs NFS, MetalLB, NVIDIA passthrough, ACS override
career-gap-map
by drewid74Use this when: audit my career for AI risk, is my job at risk from AI, which of my skills are AI-proof, am I on the right side of AI disruption, career strategy for AI era, future-proof my career, what skills should I develop, is my role being automated, AI impact on my job, career gap analysis, where should I focus my skills, what do I do with AI time savings, am I using AI correctly in my career, career migration plan, am I riding a closing gap, how do I stay relevant, reskill for AI, career readiness assessment
orchestration-scaffold
by drewid74Use this when: scaffold a local AI orchestration stack, set up LangGraph supervisor, connect Ollama to OpenCode, configure MCP servers for agent tools, wire OpenCode as the developer interface for a multi-agent system, set up A2A protocol between LangGraph and OpenCode, configure VS Code for agent development, orchestration stack on TrueNAS / Docker Compose homelab, choose between AG2 and CrewAI sub-frameworks, set up agent harness with PreToolUse hooks, define SLIs for an AI orchestration stack, design generator-evaluator pattern for agent output, local inference routing with Qwen3 or Mistral Small.
docker-selfhost
by drewid74Use this when: set up a self-hosted service, my container keeps restarting, add HTTPS to a container, connect containers to each other, set up a reverse proxy, my volumes are not persisting, migrate a stack to a new server, expose a service to the internet, back up container data, deploy on TrueNAS, run multiple services on one host, containers can't reach each other, Docker, Traefik, Cloudflare Tunnel
Browse Agent Skills by Occupation
23 major groups · 867 SOC occupations
Browse by Category
Explore agent skills organized by their primary use case
Explore the agent skills ecosystem by occupation and creator
SkillMD is not just a keyword search box. It is an open map that organizes public skills by occupation, creator, and repository, helping you see which workflows, judgment criteria, and domain habits people are writing for AI agents.
Then follow creators and GitHub repositories back to the source: compare the skills a team maintains, whether the repo is active, and how the README frames the work before you open, install, or reuse anything.
Use it three ways: learn an unfamiliar field by occupation, study how creators organize skills, then use source context to decide what is worth opening or reusing.
01 Map a field
Browse 23 occupation groups and 867 SOC roles to learn what skills exist in adjacent domains and how they break down real work.
02 Follow creators
Use creator and repository pages to inspect maintained skill collections, recent updates, and source context before trusting a result.
03 Search with sources
Search 1.7M+ collected skills, then use occupation tags, creators, and GitHub source context to decide what is worth opening.
Start with the occupation map, then follow creators and repositories back to real code. SkillMD helps explain why a skill is worth opening, not only what it is named.
Standardizing Agent Capabilities with SKILL.md and Model Context Protocol (MCP)
In the rapidly evolving landscape of artificial intelligence, LLM agents (Large Language Model agents) have transitioned from simple text predictors to autonomous problem solvers. To orchestrate complex, multi-step agentic workflows, developers require a standardized format to specify agent capabilities, prompt instructions, system rules, and database bindings. This is where SKILL.md and the Model Context Protocol (MCP) have emerged as standard developer paradigms. SkillMD serves as the central directory for indexing, exploring, and sharing these critical agent configurations.
Our open-source registry currently tracks over 1.7 million collected SKILL.md configurations and system prompts. By compiling agent configurations from active developers on GitHub, we bridge the gap between prompt engineering research and production execution. Whether you are building agents with Anthropic's Claude Code, OpenAI's GPT-4, Google's Gemini, or local models using Ollama and LlamaIndex, standardized skill definitions ensure your agents behave predictably across different runtime environments.
What is the Model Context Protocol (MCP)?
The Model Context Protocol (MCP) is an open-source standard designed to connect LLMs to data sources, developer tools, and external environments. MCP establishes a bidirectional communication channel between client applications (like Cursor, Claude Desktop, or custom agent systems) and servers hosting data or capabilities. Standardizing instructions via SKILL.md enables LLMs to query databases, read local files, execute terminal commands, and integrate third-party APIs. SkillMD allows you to find ready-to-run MCP servers and prompt instructions for various occupations and technical tasks.
The Structure of a Professional SKILL.md File
A valid SKILL.md configuration is designed to be easily read by humans and parsed by LLMs. It contains precise system instructions, trigger conditions, required parameters, and execution examples. Below is the typical architectural blueprint of a professional agent skill:
- Metadata & Core Scope: Declares the name of the skill, author details, target models, and a description of the capability.
- Triggers & Intent Detection: Details semantic triggers that help the agent decide when to invoke this skill.
- System Prompts: Explicit system-level instructions that direct the agent's behavior, personality, safety guardrails, and formatting preferences.
- Capabilities & Tools: Lists the files, databases, or APIs the agent must access to complete the tasks.
- Few-Shot Examples: Demonstrates real inputs and outputs, helping the model generalize behavior through in-context learning.
Optimizing Agent Workflows for Modern LLMs
Writing effective agent skills requires deep knowledge of prompt engineering. With the release of advanced reasoning models like Claude 3.5 Sonnet, ChatGPT o1, and DeepSeek-V3, prompt templates must focus on structured thinking. Developers are encouraged to use XML tags (e.g., <thought>, <context>, and <rules>) to isolate execution boundaries. Standardized prompts prevent agents from suffering from context drift, ensuring that long-running tasks remain aligned with the initial system parameters.
Exploring by SOC Occupations and Creator Profiles
What makes SkillMD unique is its taxonomy. Instead of simple text search, we parse and organize files according to the Standard Occupational Classification (SOC) system. This means you can discover skills written for Computer and Mathematical roles, Business and Financial operations, Legal, Design, and and Educational Instruction fields. By tracking creator profiles, developers can study how different teams organize their custom instructions, compare version updates, and fork public configs for specialized enterprise use cases.
SkillMD operates as a high-performance index running on a fast Go backend and a highly responsive Astro SSR frontend. All search queries execute in milliseconds, featuring smart debouncing to prevent multiple API requests while keeping user data secure. Join our community of developers to standardize your AI agent instructions and optimize your LLM prompting workflows today.
Frequently Asked Questions
A practical guide to agent skills: what they are, how to inspect them, and how SkillMD helps you explore the ecosystem.