name: replicate-studio description: AI generation with Replicate API. Create images, music, upscale photos, transcribe audio using models like Flux, SDXL, MusicGen. Use when user needs AI content generation, image creation, audio processing, or model inference. version: 1.0.0 author: Agent Zero Custom tags: [ai, generation, images, audio, music, upscaling, transcription, replicate] trigger_patterns: - "generate image" - "create image" - "AI image" - "upscale photo" - "transcribe audio" - "generate music" - "create music" - "text to image" allowed_tools: - code_execution_tool - memory_save - response
Replicate Studio — AI Generation
Generate AI content using Replicate API. Supports image generation, audio processing, upscaling, and transcription.
Installation
# Install dependencies
pip install -r requirements.txt
# Or use setup script
bash /a0/usr/skills/setup.sh
# Set API token (get at https://replicate.com)
export REPLICATE_API_TOKEN="your_token_here"
# Or add to /a0/.env: REPLICATE_API_TOKEN=your_token_here
When to Use
Use this skill when you need to:
- Generate images from text prompts
- Upscale low-resolution images
- Create music or audio
- Transcribe speech to text
- Run AI model inference
Supported Models
| Model | Type | Best For |
|---|---|---|
| flux-pro | Image | High-quality images |
| flux-schnell | Image | Fast image generation |
| sdxl | Image | Detailed images |
| musicgen | Audio | Music generation |
| esrgan | Image | 4x image upscaling |
| whisper | Audio | Speech transcription |
Usage
Via Python Script
python /a0/usr/skills/replicate-studio/scripts/replicate_studio.py --model flux-pro --prompt "a beautiful sunset" --output /a0/tmp/sunset.png
Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
--model |
str | required | Model to use (flux-pro/flux-schnell/sdxl/musicgen/esrgan/whisper) |
--prompt |
str | required | Input prompt or description |
--output |
str | required | Output file path |
--input_image |
str | optional | Input image for upscaling |
--input_audio |
str | optional | Input audio for transcription |
Examples
Generate image:
python /a0/usr/skills/replicate-studio/scripts/replicate_studio.py --model flux-pro --prompt "futuristic city at night, neon lights, cyberpunk style" --output /a0/tmp/city.pngUpscale image:
python /a0/usr/skills/replicate-studio/scripts/replicate_studio.py --model esrgan --input_image /a0/tmp/photo.png --output /a0/tmp/photo_4x.pngTranscribe audio:
python /a0/usr/skills/replicate-studio/scripts/replicate_studio.py --model whisper --input_audio /a0/tmp/recording.mp3 --output /a0/tmp/transcription.txt
Requirements
REPLICATE_API_TOKENmust be set in environment or/a0/.envfilereplicate>=0.22.0requests>=2.31.0
Files
/a0/usr/skills/replicate-studio/
├── scripts/
│ └── replicate_studio.py
├── requirements.txt
└── SKILL.md