docling

star 2

Expert guidance on document processing with Docling and audio transcription with Whisper.

FullFran By FullFran schedule Updated 2/2/2026

name: docling description: Expert guidance on document processing with Docling and audio transcription with Whisper.

Docling & Ingestion Expert Skill

This skill provides patterns for processing diverse document formats and audio files.

๐Ÿ“ Document Processing

  • Multi-format support: Use docling for converting PDF, DOCX, PPTX, XLSX, HTML, and Markdown to a clean Markdown representation.
  • HybridChunker: Always use Docling's HybridChunker to preserve document structure (headings, lists, tables).
  • Token Awareness: Configure chunker with max_tokens=512 to align with embedding models (e.g., text-embedding-3-small).

๐Ÿ”Š Audio Transcription

  • Whisper ASR: Use docling's integration with Whisper Turbo for transcribing audio files (.mp3, .wav, .m4a, .flac).
  • Path Handling: Pass pathlib.Path objects to DocumentConverter for audio files.

๐Ÿ—๏ธ Pipeline Architecture

  • Ingestion Pipeline: Centralize logic in src/ingestion/ingest.py.
  • Metadata Extraction: Extract YAML frontmatter and standard document properties.
  • Batching: Use batch embedding generation and insert_many for chunks to optimize performance.
Install via CLI
npx skills add https://github.com/FullFran/Hybrid-RAG-example --skill docling
Repository Details
star Stars 2
call_split Forks 1
navigation Branch main
article Path SKILL.md
More from Creator