c-voice

star 149

Convert speech to text using `sag` (ElevenLabs STT) and synthesize speech using `say` (macOS built-in TTS). Enables voice input transcription and audio output.

daxaur By daxaur schedule Updated 2/28/2026

name: c-voice description: Convert speech to text using sag (ElevenLabs STT) and synthesize speech using say (macOS built-in TTS). Enables voice input transcription and audio output. tags: [voice, speech, tts, stt, elevenlabs, say, sag, audio]

What This Skill Does

Enables Claude to transcribe spoken audio to text via sag (powered by ElevenLabs) and to speak text aloud using the macOS say command.

Available CLI Tools

sag — Speech-to-Text (ElevenLabs)

# Transcribe an audio file
sag transcribe --file recording.mp3

# Record from microphone and transcribe
sag record --output transcript.txt

# Transcribe with a specific language hint
sag transcribe --file audio.wav --language en

# Output transcript to stdout
sag transcribe --file audio.m4a --stdout

say — Text-to-Speech (macOS built-in)

# Speak text aloud
say "Hello, how can I help you today?"

# Use a specific voice
say -v Samantha "Your report is ready."

# Save spoken audio to a file
say -o output.aiff "Text to synthesize"

# List available voices
say -v ?

# Control speaking rate (words per minute)
say -r 180 "Speaking at a custom rate"

Usage Guidelines

  • Use sag to process audio files the user provides or to capture mic input
  • Use say to read back responses, summaries, or alerts aloud
  • Prefer say -v Samantha or -v Alex for natural-sounding output on macOS

Notes

  • sag requires a valid ElevenLabs API key configured in environment
  • say is built into macOS — no installation needed
  • Supported audio input formats for sag: MP3, WAV, M4A, FLAC
Install via CLI
npx skills add https://github.com/daxaur/openpaw --skill c-voice
Repository Details
star Stars 149
call_split Forks 6
navigation Branch main
article Path SKILL.md
More from Creator