ElevenLabs Text-to-Speech Premium text-to-speech with 22+ voices via inference.sh CLI. Quick Start Requires inference.sh CLI ( infsh ). Install instructions infsh login

Generate speech with ElevenLabs

infsh app run elevenlabs/tts --input '{"text": "Hello, welcome to our product demo.", "voice": "aria"}' Available Models Model ID Best For Latency Multilingual v2 eleven_multilingual_v2 Highest quality, 32 languages ~250ms Turbo v2.5 eleven_turbo_v2_5 Balance of speed & quality ~150ms Flash v2.5 eleven_flash_v2_5 Ultra-low latency ~75ms Voice Library Female Voices Voice Style aria American, conversational alice British, confident bella American, warm jessica American, expressive laura American, professional lily British, soft sarah American, friendly Male Voices Voice Style george British, authoritative adam American, deep bill American, mature brian American, conversational callum Transatlantic, intense charlie Australian, natural chris American, casual daniel British, commanding eric American, friendly harry American, young liam American, articulate matilda American, warm river American, confident roger American, authoritative will American, bright Examples Basic Speech infsh app run elevenlabs/tts --input '{"text": "Welcome to our quarterly earnings presentation.", "voice": "george"}' Choose a Model

Highest quality

infsh app run elevenlabs/tts --input '{ "text": "This is our premium multilingual model with the best quality.", "voice": "aria", "model": "eleven_multilingual_v2" }'

Ultra-fast for real-time applications

infsh app run elevenlabs/tts --input '{ "text": "Flash model for low-latency applications.", "voice": "brian", "model": "eleven_flash_v2_5" }' Voice Tuning infsh app run elevenlabs/tts --input '{ "text": "Fine-tune the voice characteristics for your use case.", "voice": "bella", "stability": 0.3, "similarity_boost": 0.9, "style": 0.4 }' Parameter Range Effect stability 0-1 Higher = more consistent, lower = more expressive similarity_boost 0-1 Higher = closer to original voice character style 0-1 Higher = more style exaggeration use_speaker_boost true/false Enhances speaker clarity Output Formats

High-quality MP3

infsh app run elevenlabs/tts --input '{ "text": "High quality audio output.", "voice": "daniel", "output_format": "mp3_44100_192" }' Format Description mp3_44100_128 MP3 at 44.1kHz, 128kbps (default) mp3_44100_192 MP3 at 44.1kHz, 192kbps pcm_16000 Raw PCM at 16kHz pcm_22050 Raw PCM at 22.05kHz pcm_24000 Raw PCM at 24kHz pcm_44100 Raw PCM at 44.1kHz Multilingual ElevenLabs supports 32 languages including English, Spanish, French, German, Italian, Portuguese, Chinese, Japanese, Korean, Arabic, Hindi, Russian, and more.

Spanish

infsh app run elevenlabs/tts --input '{ "text": "Hola, bienvenidos a nuestra presentación.", "voice": "aria", "model": "eleven_multilingual_v2" }'

French

infsh app run elevenlabs/tts --input '{ "text": "Bonjour, bienvenue à notre démonstration.", "voice": "alice", "model": "eleven_multilingual_v2" }' Voice + Video Workflow

1. Generate voiceover

infsh app run elevenlabs/tts --input '{ "text": "Introducing the future of AI-powered content creation.", "voice": "george" }'

voiceover.json

2. Create talking head video

infsh app run bytedance/omnihuman-1-5

--input

'{

"image_url": "https://portrait.jpg",

"audio_url": ""

}'

Use Cases

Voiceovers

Product demos, explainer videos, commercials

Audiobooks

Long-form narration with consistent voices

Podcasts

AI hosts with natural delivery

E-learning

Course narration in multiple languages

Accessibility

High-quality screen reader content

IVR

Professional phone system messages
Video Narration: Documentary and social media content

elevenlabs-tts

安装