recall

star 45

Search Origin's local memory by query. Targeted lookup, not orientation. Invoked as `/recall <query>`. Use when the user asks "do you remember", "what do you know about", "look up".

7xuanlu By 7xuanlu schedule Updated 5/24/2026

name: recall description: > Search Wenlan's local memory by query. Targeted lookup, not orientation. Invoked as /recall <query>. Use when the user asks "do you remember", "what do you know about", "look up". argument-hint: "" allowed-tools: ["mcp__plugin_wenlan_wenlan__recall"]

/recall

Search Wenlan's memory by natural-language query. Returns matching memories ranked by hybrid vector + FTS search, then re-ordered by the agent if it helps.

Argument parsing

The /recall skill accepts one optional inline token of the form space:<name> anywhere in the argument string. Extract it before treating the rest as the query:

raw_args="<the full argument string passed to /recall>"
space_arg="$(printf '%s\n' "$raw_args" | grep -oE 'space:[A-Za-z0-9_-]+' | head -1 | cut -d: -f2)"
query="$(printf '%s\n' "$raw_args" | sed -E 's/[[:space:]]*space:[A-Za-z0-9_-]+[[:space:]]*/ /g' | sed -E 's/^[[:space:]]+|[[:space:]]+$//g')"

If space_arg is non-empty, pass it to the resolver as --arg "$space_arg".

Resolve the active space

Call the bundled resolver:

resolved="$("$CLAUDE_PLUGIN_ROOT/bin/resolve-space.sh" --cwd "$PWD" \
    ${space_arg:+--arg "$space_arg"} 2>/dev/null)"
space="$(printf '%s\n' "$resolved" | cut -f1)"
source_layer="$(printf '%s\n' "$resolved" | cut -f2)"

Pass space="$space" to the recall MCP tool. Print one line before the call:

Resolved space: <space> (from <source-layer>)

Two phases

When a local model or API key is configured, the daemon can rerank and expand server-side. In local memory mode it cannot. The skill always does agent-side expansion and rerank itself — cheap, makes results good in both modes.

Phase 1 — expand the query (agent-side)

Before calling recall, rewrite the user's query into a more search-friendly form:

  • Replace pronouns with the referent ("it" → the actual thing).
  • Expand abbreviations the embedder is unlikely to know.
  • Add the obvious synonym when the original term is too narrow (e.g. "auth" → "auth OR authentication").

Don't over-expand. If the query is already specific, leave it alone. One recall call per /recall invocation — duplicate calls double embedding load and the merge step is rarely worth it. The daemon's own search_memory_expanded exists for the multi-query case; if it matters, use that endpoint instead of issuing parallel calls here.

Phase 2 — call the MCP tool

recall(query="<expanded query>", space=<resolved>, memory_type=<inferred>)

Inferences (do not ask the user):

  • space: current working directory (e.g. ~/Repos/wenlan/..."wenlan"), the topic being discussed, or whatever space was mentioned in recent turns. Always pass when scope is known; if uncertain, run list_spaces later (post-PR-C) or omit.
  • memory_type: only when the query itself names a type ("decision on X", "lesson about Y", "preference for Z"). Otherwise omit and let hybrid search rank.
  • limit: default 10. Use 3-5 for quick lookups, 10-20 for exploration.

Phase 3 — rerank (agent-side)

The daemon returns hits ranked by hybrid search. That ranking is good but not perfect — it doesn't know the user's exact intent.

Re-read the returned memories against the original query. Promote the ones that directly answer the question; demote ones that just share keywords.

Show the user the top 3-5 reranked hits. Surface the rest only if asked.

Phase 4 — render revision context (per result)

Each memory may carry revision fields: version, pending_revision, merged_from, last_delta_summary. Most memories are fresh (v1, none set) — render nothing extra for those. Only add a tag line when something meaningful is present.

Condition: emit the tag line when any of these holds:

  • version > 1
  • merged_from is non-empty
  • pending_revision == true

Format — one compact line above the memory body:

<id>  v<N> (merged <K> memories)         ← merged_from has K entries
<id>  v<N>, pending revision against <id> ← pending_revision true
<id>  v<N> — <last_delta_summary>         ← version > 1, delta populated
<id>  v<N>                                ← version > 1, no delta

Rules:

  • Merged takes precedence over pending_revision in the label.
  • Omit — <delta> when last_delta_summary is empty or null.
  • Skip the tag line entirely when version == 1 (or null) and no other flag is set. Preserves current output for fresh memories.

When to use

  • "What did I say about X?"
  • "Do you remember the decision on Y?"
  • Need a specific fact before continuing.

When NOT to use

  • Broad session orientation → use /brief instead.
  • Storing a new memory → use /capture.

Hint: write specific queries

"Alice database preference" finds more than "database stuff". The semantic matcher rewards specificity. If too many results return, add filters rather than making the query longer.

Install via CLI
npx skills add https://github.com/7xuanlu/origin --skill recall
Repository Details
star Stars 45
call_split Forks 3
navigation Branch main
article Path SKILL.md
More from Creator