spring-ai-rag-media-pgvector

star 208

Build RAG pipelines for media-asset knowledge bases using Spring AI and PostgreSQL pgvector. Use when Codex needs to design database schema, ingestion/chunking/embedding workflow, and retrieval logic that prioritizes internal media knowledge before falling back to general model knowledge.

microwind

By microwind schedule Updated 3/2/2026

play_arrow Run Skill in Manus View GitHub

name: spring-ai-rag-media-pgvector description: Build RAG pipelines for media-asset knowledge bases using Spring AI and PostgreSQL pgvector. Use when Codex needs to design database schema, ingestion/chunking/embedding workflow, and retrieval logic that prioritizes internal media knowledge before falling back to general model knowledge.

Spring AI RAG Media PgVector

Workflow

Define knowledge scope:

Enumerate media asset types (movie metadata, synopsis, production notes, tags, reviews, scripts).
Define source-of-truth systems and ingestion frequency.

Provision database and extension:

Apply references/pgvector-schema.sql via Flyway.
Confirm vector(1536) dimension matches chosen embedding model.

Build ingestion pipeline:

Normalize source records into JSONL chunks with stable IDs.
Validate chunk payload using scripts/validate_chunks.py.
Persist raw document metadata and vectorized chunks.

Embed and index:

Use Spring AI embedding model to generate vectors.
Upsert chunk vectors in batches.
Rebuild IVFFlat index only after bulk ingestion.

Implement retrieval priority:

Execute vector similarity search against internal chunks first.
If top score is below threshold or result count is low, fallback to model-only answer path.
Keep both retrieved evidence and fallback reason in response metadata.

Enforce grounded generation:

Use retrieval context and source citations in prompt.
If no reliable internal evidence is found, explicitly state uncertainty.

Guardrails

Keep chunk size stable (recommended 300-800 Chinese chars; overlap 50-120).
Version embedding model IDs; avoid mixing vectors from different dimensions in one table.
Use metadata filters (asset_type, year, language) before distance ranking when possible.
Log retrieval scores for online quality monitoring.

Resources

references/pgvector-schema.sql: DB and index template for media RAG.
references/rag-priority-strategy.md: retrieval-first orchestration strategy.
references/application-yml-template.md: Spring AI and datasource baseline config.
scripts/validate_chunks.py: chunk payload validator before embedding ingestion.

Install via CLI

npx skills add https://github.com/microwind/design-patterns --skill spring-ai-rag-media-pgvector

Repository Details

star Stars 208

call_split Forks 44

navigation Branch main

article Path SKILL.md

More from Creator

microwind

microwind Explore all skills →