moltbot

sag

65,001
8,013
# Install this skill:
npx skills add moltbot/moltbot --skill "sag"

Install specific skill from multi-skill repository

# Description

ElevenLabs text-to-speech with mac-style say UX.

# SKILL.md


name: sag
description: ElevenLabs text-to-speech with mac-style say UX.
homepage: https://sag.sh
metadata: {"moltbot":{"emoji":"🗣️","requires":{"bins":["sag"],"env":["ELEVENLABS_API_KEY"]},"primaryEnv":"ELEVENLABS_API_KEY","install":[{"id":"brew","kind":"brew","formula":"steipete/tap/sag","bins":["sag"],"label":"Install sag (brew)"}]}}


sag

Use sag for ElevenLabs TTS with local playback.

API key (required)
- ELEVENLABS_API_KEY (preferred)
- SAG_API_KEY also supported by the CLI

Quick start
- sag "Hello there"
- sag speak -v "Roger" "Hello"
- sag voices
- sag prompting (model-specific tips)

Model notes
- Default: eleven_v3 (expressive)
- Stable: eleven_multilingual_v2
- Fast: eleven_flash_v2_5

Pronunciation + delivery rules
- First fix: respell (e.g. "key-note"), add hyphens, adjust casing.
- Numbers/units/URLs: --normalize auto (or off if it harms names).
- Language bias: --lang en|de|fr|... to guide normalization.
- v3: SSML <break> not supported; use [pause], [short pause], [long pause].
- v2/v2.5: SSML <break time="1.5s" /> supported; <phoneme> not exposed in sag.

v3 audio tags (put at the entrance of a line)
- [whispers], [shouts], [sings]
- [laughs], [starts laughing], [sighs], [exhales]
- [sarcastic], [curious], [excited], [crying], [mischievously]
- Example: sag "[whispers] keep this quiet. [short pause] ok?"

Voice defaults
- ELEVENLABS_VOICE_ID or SAG_VOICE_ID

Confirm voice + speaker before long output.

Chat voice responses

When Peter asks for a "voice" reply (e.g., "crazy scientist voice", "explain in voice"), generate audio and send it:

# Generate audio file
sag -v Clawd -o /tmp/voice-reply.mp3 "Your message here"

# Then include in reply:
# MEDIA:/tmp/voice-reply.mp3

Voice character tips:
- Crazy scientist: Use [excited] tags, dramatic pauses [short pause], vary intensity
- Calm: Use [whispers] or slower pacing
- Dramatic: Use [sings] or [shouts] sparingly

Default voice for Clawd: lj2rcrvANS3gaWWnczSX (or just -v Clawd)

# Supported AI Coding Agents

This skill is compatible with the SKILL.md standard and works with all major AI coding agents:

Learn more about the SKILL.md standard and how to use these skills with your preferred AI coding agent.