Production deployment patterns for ElevenLabs API including rate limiting, error handling, monitoring, and testing. Use when deploying to production, implementing rate limiting, setting up monitoring, handling errors, testing concurrency, or when user mentions production deployment, rate limits, error handling, monitoring, ElevenLabs production.
View on GitHubFebruary 1, 2026
Select agents to install to:
npx add-skill https://github.com/vanman2024/ai-dev-marketplace/blob/main/plugins/elevenlabs/skills/production-deployment/SKILL.md -a claude-code --skill production-deploymentInstallation paths:
.claude/skills/production-deployment/# Production Deployment Complete production deployment guide for ElevenLabs API integration including rate limiting patterns, comprehensive error handling strategies, monitoring setup, and testing frameworks. ## Overview This skill provides battle-tested patterns for deploying ElevenLabs API integration to production environments with: - **Rate Limiting**: Concurrency-aware rate limiting respecting plan limits - **Error Handling**: Comprehensive error recovery and retry strategies - **Monitoring**: Real-time metrics, logging, and alerting - **Testing**: Load testing, concurrency validation, and production readiness checks ## Quick Start ### 1. Setup Monitoring Infrastructure ```bash bash scripts/setup-monitoring.sh --project-name "my-elevenlabs-app" \ --log-level "info" \ --metrics-port 9090 ``` This script: - Configures Winston logging with rotation - Sets up Prometheus metrics endpoints - Creates health check endpoints - Initializes error tracking ### 2. Deploy Production Configuration ```bash bash scripts/deploy-production.sh --environment "production" \ --api-key "$ELEVENLABS_API_KEY" \ --concurrency-limit 10 \ --region "us-east-1" ``` This script: - Validates environment variables - Applies rate limiting configuration - Configures error handling middleware - Sets up monitoring integrations - Performs smoke tests ### 3. Test Rate Limiting ```bash bash scripts/test-rate-limiting.sh --concurrency 20 \ --duration 60 \ --plan-tier "pro" ``` This script: - Simulates concurrent requests - Validates queue behavior - Measures latency under load - Generates performance report ## ElevenLabs Concurrency Limits ### Limits by Plan Tier | Plan | Multilingual v2 | Turbo/Flash | STT | Music | |------|-----------------|-------------|-----|-------| | Free | 2 | 4 | 8 | N/A | | Starter | 3 | 6 | 12 | 2 | | Creator | 5 | 10 | 20 | 2 | | Pro | 10 | 20 | 40 | 2 | | Scale | 15 | 30 | 60 | 3 | | Business | 15 | 30 | 60 | 3 | | Enterprise | Elevated | Elev