ElevenLabs Speech-to-Text transcription workflows with Scribe v1, supporting 99 languages, speaker diarization, and Vercel AI SDK integration. Use when implementing audio transcription, building STT features, integrating speech-to-text, setting up the Vercel AI SDK with ElevenLabs, or when the user mentions transcription, STT, Scribe v1, audio-to-text, speaker diarization, or multi-language transcription.
February 1, 2026
npx add-skill https://github.com/vanman2024/ai-dev-marketplace/blob/main/plugins/elevenlabs/skills/stt-integration/SKILL.md -a claude-code --skill stt-integration

Installation paths:

.claude/skills/stt-integration/

# stt-integration

This skill provides comprehensive guidance for implementing ElevenLabs Speech-to-Text (STT) capabilities using the Scribe v1 model, which supports 99 languages with state-of-the-art accuracy, speaker diarization for up to 32 speakers, and seamless Vercel AI SDK integration.

## Core Capabilities

### Scribe v1 Model Features

- **Multi-language support**: 99 languages with varying accuracy levels
- **Speaker diarization**: Up to 32 speakers with identification
- **Word-level timestamps**: Precise synchronization for video/audio alignment
- **Audio event detection**: Identifies sounds like laughter and applause
- **High accuracy**: Optimized for accuracy over real-time processing

### Supported Formats

- **Audio**: AAC, AIFF, OGG, MP3, Opus, WAV, WebM, FLAC, M4A
- **Video**: MP4, AVI, Matroska, QuickTime, WMV, FLV, WebM, MPEG, 3GPP
- **Limits**: Max 3 GB file size, 10 hours duration

## Skill Structure

### Scripts (scripts/)

1. **transcribe-audio.sh** - Direct API transcription with curl
2. **setup-vercel-ai.sh** - Install and configure @ai-sdk/elevenlabs
3. **test-stt.sh** - Test STT with sample audio files
4. **validate-audio.sh** - Validate audio file format and size
5. **batch-transcribe.sh** - Process multiple audio files

### Templates (templates/)

1. **stt-config.json.template** - STT configuration template
2. **vercel-ai-transcribe.ts.template** - Vercel AI SDK TypeScript template
3. **vercel-ai-transcribe.py.template** - Vercel AI SDK Python template
4. **api-transcribe.ts.template** - Direct API TypeScript template
5. **api-transcribe.py.template** - Direct API Python template
6. **diarization-config.json.template** - Speaker diarization configuration

### Examples (examples/)

1. **basic-stt/** - Basic STT with direct API
2. **vercel-ai-stt/** - Vercel AI SDK integration
3. **diarization/** - Speaker diarization examples
4. **multi-language/** - Multi-language transcription
5. **webhook-integration/** - Async transcription with webhooks
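
The format and size limits listed under Supported Formats can be checked locally before a file is uploaded. The following is a minimal Python sketch of such a pre-flight check. It is not the contents of the skill's `validate-audio.sh`. The function names `check_media` and `check_file` are hypothetical, the mapping from format names to file extensions is an assumption (the skill lists format names only), and "3 GB" is interpreted as binary gigabytes.

```python
from pathlib import Path

# Assumed extension mapping for the formats named above; the skill itself
# lists format names (e.g. "Matroska", "QuickTime"), not file extensions.
SUPPORTED_EXTENSIONS = {
    # audio
    ".aac", ".aiff", ".aif", ".ogg", ".mp3", ".opus", ".wav", ".webm", ".flac", ".m4a",
    # video
    ".mp4", ".avi", ".mkv", ".mov", ".wmv", ".flv", ".mpeg", ".mpg", ".3gp",
}

MAX_FILE_BYTES = 3 * 1024**3          # 3 GB limit (binary gigabytes assumed)
MAX_DURATION_SECONDS = 10 * 60 * 60   # 10 hour limit


def check_media(path, size_bytes, duration_seconds=None):
    """Return a list of problems; an empty list means the file passes."""
    problems = []
    suffix = Path(path).suffix.lower()
    if suffix not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported extension: {suffix or '(none)'}")
    if size_bytes > MAX_FILE_BYTES:
        problems.append(f"file too large: {size_bytes} bytes")
    # Duration is optional because reading it requires a media probe tool.
    if duration_seconds is not None and duration_seconds > MAX_DURATION_SECONDS:
        problems.append(f"duration too long: {duration_seconds} seconds")
    return problems


def check_file(path, duration_seconds=None):
    """Convenience wrapper that reads the size of a file on disk."""
    return check_media(path, Path(path).stat().st_size, duration_seconds)
```

A caller would typically run `check_file("meeting.mp3")` and skip the upload if the returned list is non-empty. Checking by extension is only a heuristic, since it does not inspect the container or codec.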