transcribe

by @badlogic in Web & API

240

# Install this skill:

npx skills add badlogic/pi-skills --skill "transcribe"

Install specific skill from multi-skill repository

# Description

Speech-to-text transcription using Groq Whisper API. Supports m4a, mp3, wav, ogg, flac, webm.

# SKILL.md

name: transcribe
description: Speech-to-text transcription using Groq Whisper API. Supports m4a, mp3, wav, ogg, flac, webm.

Transcribe

Speech-to-text using Groq Whisper API.

Setup

The script needs GROQ_API_KEY environment variable. Check if already set:

echo $GROQ_API_KEY

If not set, guide the user through setup:
1. Ask if they have a Groq API key
2. If not, have them sign up at https://console.groq.com/ and create an API key
3. Have them add to their shell profile (~/.zshrc or ~/.bashrc):
bash export GROQ_API_KEY="<their-api-key>"
4. Then run source ~/.zshrc (or restart terminal)

Usage

{baseDir}/transcribe.sh <audio-file>

Supported Formats

m4a, mp3, wav, ogg, flac, webm
Max file size: 25MB

Output

Returns plain text transcription with punctuation and proper capitalization to stdout.

# Supported AI Coding Agents

This skill is compatible with the SKILL.md standard and works with all major AI coding agents:

⚡ Amp 🚀 Antigravity 🤖 Claude Code 🦀 Clawdbot 📝 Codex ▶️ Cursor 🤖 Droid 💎 Gemini CLI 🐙 GitHub Copilot 🪿 Goose 📊 Kilo Code 🔧 Kiro CLI 💻 OpenCode 🦘 Roo Code 🌲 Trae 🏄 Windsurf

Learn more about the SKILL.md standard and how to use these skills with your preferred AI coding agent.