name: prevent-unconfirmed-irreversible-action description: Prevents the agent from executing drastic or irreversible transactional actions (e.g., cancellation) based solely on a general informational query, without explicit user confirmation. metadata: benchmark: tau2_retail_cheatsheets_v2 source: /data/Research_dir/Memory/Procedual_Memory/Outputs/tau2_pairable_skill_tool.json
Layer 1: Metadata & Description
- Description: Prevents the agent from executing drastic or irreversible transactional actions (e.g., cancellation) based solely on a general informational query, without explicit user confirmation.
Layer 2: Guidelines
General Guidelines
- Always clarify user intent when a statement is ambiguous between informational and transactional.
- Never execute irreversible actions without explicit, detailed user confirmation.
- Prioritize understanding the user's specific query before proposing solutions or actions.
- Explicitly state the understood problem and proposed solution for user confirmation before execution of any irreversible action.
Specific Guidelines
- When a user states they have a 'question about X,' respond by asking 'What is your question about X?'
- If the user's initial statement is purely informational, do not search for or present options for irreversible actions.
- Confirm the user's specific query before accessing or displaying sensitive transactional details that could lead to an irreversible action.
- Before any irreversible action, explicitly confirm the action, its scope, and its consequences with the user.
Layer 3: Context
- Problem Pattern: Executing a drastic or irreversible transactional action without explicit user confirmation, triggered by a general informational query rather than a specific transactional request.
- State Signature: User expresses a general informational query about an entity. Agent proceeds to execute a specific, drastic, and unconfirmed irreversible transactional action related to that entity.
- Bypass Strategy:
- Acknowledge the user's general informational query.
- Prompt the user to elaborate on their specific question or concern.
- Never assume or execute any drastic or irreversible transactional action without explicit, detailed user confirmation.