Nemotron Speech Skills Note: "Nemotron Speech" is the public-facing name for what NVIDIA documents today as Riva / Riva NIM . All commands, container images, gRPC APIs, Python imports, and documentation URLs still use "Riva" — the rename is brand-only. Do not rename commands, images, or doc URLs. Agent: When walking the user through a multi-step workflow, announce each step before presenting it: Step N/M — Step Title (e.g., " Step 1/4 — Deploy the Container "). Purpose Single entry point for all NVIDIA Nemotron Speech (Riva) NIM workflows: ASR (speech-to-text), TTS (text-to-speech), and NMT (translation). Covers cloud-hosted inference via build.nvidia.com, self-hosted Docker deployment, client-protocol choice for ASR (gRPC, HTTP, WebSocket), custom NeMo model deployment via riva-build , ASR pipeline tuning (VAD, diarization, language models), and the prerequisite Docker / NGC / driver setup. When to Use This Skill Use this skill for any Nemotron Speech / Riva NIM task — deployment, testing, custom model build, system requirements check, or model selection across ASR / TTS / NMT modalities. Workflow Identify the user's task type, then load the corresponding reference file from references/ . The reference files contain the detailed per-workflow content; this SKILL.md is a routing surface. Load only the reference relevant to the task at hand. Prerequisites Show more Installs 534 Repository nvidia/skills GitHub Stars 1.3K First Seen Jun 1, 2026 Security Audits Gen Agent Trust Hub Pass Socket Pass Snyk Warn

nemotron-speech

安装