AI Voice Generation Generate natural AI voices via inference.sh CLI. Quick Start Requires inference.sh CLI ( infsh ). Get installation instructions: npx skills add inference-sh/skills@agent-tools infsh login
Generate speech
infsh app run infsh/kokoro-tts --input '{ "prompt": "Hello! This is an AI-generated voice that sounds natural and engaging.", "voice": "af_sarah" }' Available Models Model App ID Best For Kokoro TTS infsh/kokoro-tts Natural, multiple voices DIA infsh/dia-tts Conversational, expressive Chatterbox infsh/chatterbox Casual, entertainment Higgs infsh/higgs-tts Professional narration VibeVoice infsh/vibevoice Emotional range Kokoro Voice Library American English Voice ID Gender Style af_sarah Female Warm, friendly af_nicole Female Professional af_sky Female Youthful am_michael Male Authoritative am_adam Male Conversational am_echo Male Clear, neutral British English Voice ID Gender Style bf_emma Female Refined bf_isabella Female Warm bm_george Male Classic bm_lewis Male Modern Voice Generation Examples Professional Narration infsh app run infsh/kokoro-tts --input '{ "prompt": "Welcome to our quarterly earnings call. Today we will discuss the financial performance and strategic initiatives for the past quarter.", "voice": "am_michael", "speed": 1.0 }' Conversational Style infsh app run infsh/dia-tts --input '{ "text": "Hey, so I was thinking about that project we discussed. What if we tried a different approach?", "voice": "conversational" }' Audiobook Narration infsh app run infsh/kokoro-tts --input '{ "prompt": "Chapter One. The morning mist hung low over the valley as Sarah made her way down the winding path. She had been walking for hours.", "voice": "bf_emma", "speed": 0.9 }' Video Voiceover infsh app run infsh/kokoro-tts --input '{ "prompt": "Introducing the next generation of productivity. Work smarter, not harder.", "voice": "af_nicole", "speed": 1.1 }' Podcast Host infsh app run infsh/kokoro-tts --input '{ "prompt": "Welcome back to Tech Talk! Im your host, and today we are diving deep into the world of artificial intelligence.", "voice": "am_adam" }' Multi-Voice Conversation
Generate dialogue between two speakers
Speaker 1
infsh app run infsh/kokoro-tts --input '{ "prompt": "Have you seen the latest AI developments? Its incredible how fast things are moving.", "voice": "am_michael" }'
speaker1.json
Speaker 2
infsh app run infsh/kokoro-tts --input '{ "prompt": "I know, right? Just last week I tried that new image generator and was blown away.", "voice": "af_sarah" }'
speaker2.json
Merge conversation
infsh app run infsh/media-merger
--input
'{
"audio_files": ["
Process long text in chunks
TEXT
"Your very long text here..."
Split and generate
Chunk 1
infsh app run infsh/kokoro-tts
--input
'{
"prompt": "
chunk1.json
Chunk 2
infsh app run infsh/kokoro-tts
--input
'{
"prompt": "
chunk2.json
Merge chunks
infsh app run infsh/media-merger
--input
'{
"audio_files": ["
1. Generate voiceover
infsh app run infsh/kokoro-tts --input '{ "prompt": "This stunning footage shows the beauty of nature in its purest form.", "voice": "am_michael" }'
voiceover.json
2. Merge with video
infsh app run infsh/media-merger
--input
'{
"video_url": "https://your-video.mp4",
"audio_url": "
1. Generate speech
infsh app run infsh/kokoro-tts --input '{ "prompt": "Hi, Im excited to share some updates with you today.", "voice": "af_sarah" }'
speech.json
2. Animate with avatar
infsh app run bytedance/omnihuman-1-5
--input
'{
"image_url": "https://portrait.jpg",
"audio_url": "
Slow narration
infsh app run infsh/kokoro-tts --input '{ "prompt": "Take a deep breath. Let yourself relax.", "voice": "bf_emma", "speed": 0.8 }' Punctuation for Pacing Use punctuation to control speech rhythm: Punctuation Effect Period . Full pause Comma , Brief pause ... Extended pause ! Emphasis ? Question intonation - Quick break infsh app run infsh/kokoro-tts --input '{ "prompt": "Wait... Did you hear that? Something is coming. Something big!", "voice": "am_adam" }' Best Practices Match voice to content - Professional voice for business, casual for social Use punctuation - Control pacing with periods and commas Keep sentences short - Easier to generate and sounds more natural Test different voices - Same text sounds different across voices Adjust speed - Slightly slower often sounds more natural Break long content - Process in chunks for consistency Use Cases Voiceovers - Video narration, commercials Audiobooks - Full book narration Podcasts - AI hosts and guests E-learning - Course narration Accessibility - Screen reader content IVR - Phone system messages Content localization - Translate and voice