# varg sdk - declarative ai video orchestration

jsx-based ai video generation. describe scenes declaratively, render videos automatically.
## what is this?
varg sdk is a declarative video orchestration framework. instead of manually calling apis, stitching clips, and managing async workflows, you describe what you want in jsx and the runtime handles:
- parallel generation of images/videos/audio
- automatic caching (re-renders reuse cached assets)
- ffmpeg composition under the hood
- provider abstraction (fal, elevenlabs, replicate)
think of it like react for video - you declare the structure, the engine figures out the execution.
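a minimal sketch of what a scene file looks like (component and prop names follow the examples later in this doc; the exact prop values here are our own, not verified defaults):

```tsx
// minimal scene sketch: one 5-second clip with a generated image and a title overlay
// (illustrative only - props follow the patterns used in the full examples)
export default (
  <Render width={1080} height={1920}>
    <Clip duration={5}>
      <Image prompt="sunrise over misty mountains, golden hour, cinematic" model={fal.imageModel("flux-schnell")} />
      <Title position="bottom" color="#ffffff">hello, varg</Title>
    </Clip>
  </Render>
);
```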
## terminology
| term | meaning |
| --- | --- |
| `Render` | root container - sets dimensions (1080x1920 for tiktok, 1920x1080 for youtube) |
| `Clip` | timeline segment with duration; contains visual/audio layers |
| `Image` | static image - generated from prompt or loaded from file |
| `Video` | video clip - text-to-video OR image-to-video animation |
| `Music` | background audio - generated from prompt or loaded from file |
| `Speech` | text-to-speech with voice selection |
| `Title` / `Subtitle` | text overlays with positioning |
| `Captions` | auto-generated captions from a `Speech` element |
| `Grid` | layout helper for multi-image/video grids |
## core concepts
no imports needed - the render runtime auto-imports all components (Render, Clip, Video, Image, Music, Speech, Title, Grid, etc.) and providers (fal, elevenlabs). just write jsx and export default.
### Video component
Video handles both text-to-video and image-to-video:
```tsx
// text-to-video - generate from scratch (prompt string elided)
<Video prompt="..." model={fal.videoModel("wan-2.5")} />

// image-to-video - animate an image with motion description
<Video prompt={{ text: "...", images: [hero] }} model={fal.videoModel("wan-2.5")} />
```
### character consistency
generate an image first, pass it to Video/Animate for consistent characters across scenes:
```tsx
// generate the character once, then reuse it across clips (prompt elided)
const hero = <Image prompt="..." model={fal.imageModel("flux-pro")} />;
```
for reference-based generation (keeping a real person's likeness):
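a hedged sketch, assuming the `nano-banana-pro/edit` model listed under available models accepts reference images the same way `Video` does (the `prompt={{ text, images }}` shape and the file-path form are assumptions, not confirmed API):

```tsx
// sketch: reference-based image generation to preserve a real person's likeness
// (assumes Image also accepts the { text, images } prompt object with an edit model)
const likeness = (
  <Image
    prompt={{
      text: "portrait in soft window light, photorealistic, shallow depth of field",
      images: ["./reference/portrait.jpg"]
    }}
    model={fal.imageModel("nano-banana-pro/edit")}
  />
);
```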
## prompting guide (CRITICAL)
detailed prompts = better results. the wan 2.5 video model responds dramatically better to rich, structured prompts; vague prompts produce generic, low-quality output.
### the 4-dimensional formula
every video prompt should include these dimensions:
```
[Subject Description] + [Scene Description] + [Motion/Action] + [Cinematic Controls]
```
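the formula is just ordered concatenation; a hypothetical helper (not part of the sdk) makes the structure concrete:

```typescript
// hypothetical helper - joins the four prompt dimensions in order
function buildPrompt(subject: string, scene: string, motion: string, cinematic: string): string {
  return [subject, scene, motion, cinematic].join(", ");
}

const prompt = buildPrompt(
  "a young woman with shoulder-length auburn hair, worn leather jacket",
  "rain-slicked tokyo street at night, neon reflections on wet pavement",
  "walking slowly through the crowd, shoulders hunched against the rain",
  "tracking shot, soft light, cinematic, film grain"
);
console.log(prompt);
```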
### subject description
describe the main focus with rich detail:
- appearance, clothing, features, posture
- materials, textures, colors
- emotional state, expression
bad: "a woman"

good: "a young woman with shoulder-length auburn hair, wearing a worn leather jacket over a white t-shirt, her green eyes reflecting determination, jaw set with quiet resolve"
### scene description
layer the environment details:
- location, background, foreground
- lighting type (soft light, hard light, edge light, top light)
- time period (golden hour, daytime, night, moonlight)
- atmosphere (warm tones, low saturation, high contrast)
bad: "in a city"

good: "on a rain-slicked tokyo street at night, neon signs reflecting pink and blue on wet pavement, steam rising from a nearby food stall, warm yellow light spilling from a ramen shop window"
### motion/action description
specify movement with intensity and manner:
- amplitude (small movements, large movements)
- rate (slowly, quickly, explosively)
- effects (breaking glass, hair flowing, dust particles)
bad: "walking"

good: "walking slowly through the crowd, shoulders hunched against the rain, one hand raised to shield her eyes, water droplets catching the neon light as they fall from her fingertips"
### cinematic controls
use film language for professional results:
camera movements:
- camera push-in / camera pull-out
- tracking shot / dolly shot
- pan left/right / tilt up/down
- static camera / fixed camera
- handheld / steadicam
shot types:
- extreme close-up / close-up / medium close-up
- medium shot / medium wide shot
- wide shot / extreme wide shot
- over-the-shoulder shot
composition:
- center composition / rule of thirds
- left-side composition / right-side composition
- symmetrical composition
- low angle / high angle / eye-level
lighting keywords:
- soft light / hard light / edge light
- rim light / backlight / side light
- golden hour / blue hour / moonlight
- practical light / mixed light

### style keywords
add visual style for artistic direction:
- cinematic / film grain / anamorphic
- cyberpunk / noir / post-apocalyptic
- ghibli style / anime / photorealistic
- vintage / retro / futuristic
- tilt-shift photography / time-lapse

### audio/dialogue (for videos with speech)
wan 2.5 supports native audio generation:
- format dialogue: Character says: "Your line here."
- keep lines short (under 10 words per 5-second clip) for best lip-sync
- specify delivery tone: "speaking quietly", "calling out over wind"
- for silence: include "no dialogue" or "actors not speaking"

### image-to-video prompts
when using reference images, focus on motion + camera:
- the image already defines subject/scene/style
- describe dynamics: "running forward, hair flowing behind"
- add camera movement: "slow push-in on face"
- control intensity: "subtle movement" vs "dramatic action"

### example: detailed prompt
bad prompt (vague) vs. good prompt (detailed):
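an illustrative pair (our own strings, assembled with the four-dimensional formula above, not taken from the model docs):

```text
bad:  "a knight fighting a dragon"

good: "an armored knight with a dented steel breastplate and a torn crimson cape,
standing on a storm-lit cliff edge at blue hour, rain streaking sideways, a dragon
circling overhead, he raises his sword slowly as embers drift past his visor,
low angle wide shot, camera push-in, hard light, cinematic, film grain"
```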
## example 1: cinematic character video
consistent character across multiple dramatic scenes with epic music.
`warrior-princess.tsx` (complete file - no imports needed):
```tsx
// Warrior Princess - Cinematic Character Video
// run: bunx vargai@latest render warrior-princess.tsx --verbose

// note: the original scene prompts are elided; "..." marks the gaps
const hero = (
  <Image prompt="..." model={fal.imageModel("flux-pro")} />
);

export default (
  <Render width={1920} height={1080}>
    {/* clips that reuse `hero` via prompt={{ text: "...", images: [hero] }} */}
  </Render>
);
```
## example 2: tiktok product video with talking head
animated influencer character with speech, captions, and background music.
`skincare-promo.tsx` (complete file - no imports needed):
```tsx
// Skincare Product TikTok - Talking Head with Captions
// run: bunx vargai@latest render skincare-promo.tsx --verbose

// note: the original influencer and speech prompts are elided; "..." marks the gaps
const influencer = (
  <Image prompt="..." model={fal.imageModel("flux-pro")} />
);

const speech = (
  <Speech prompt="..." />
);

export default (
  <Render width={1080} height={1920}>
    <Clip duration={3}>
      <Video
        prompt={{
          text: "eyes widen dramatically in genuine surprise, eyebrows shoot up, mouth opens into excited smile, subtle forward lean toward camera as if sharing a secret. natural blinking, authentic micro-expressions",
          images: [influencer]
        }}
        model={fal.videoModel("wan-2.5")}
      />
      {speech}
    </Clip>
    <Clip duration={4}>
      <Video
        prompt="product shot: sleek skincare bottle rotating slowly on marble surface, soft key light from above, pink and gold gradient background, light rays catching the glass, water droplets on bottle suggesting freshness. smooth 360 rotation, professional product photography, beauty commercial aesthetic"
        model={fal.videoModel("kling-v2.5")}
      />
      <Title position="bottom" color="#ffffff">LINK IN BIO</Title>
    </Clip>
    <Clip duration={3}>
      <Video
        prompt={{
          text: "enthusiastic nodding while speaking, hands come into frame gesturing excitedly, genuine smile reaching her eyes, occasional hair flip, pointing at camera on 'go get it'",
          images: [influencer]
        }}
        model={fal.videoModel("wan-2.5")}
      />
    </Clip>
    <Captions src={speech} style="tiktok" activeColor="#ff00ff" />
  </Render>
);
```
## example 3: multi-scene video grid with elements
4-panel nature video grid showcasing different elements.
`four-elements.tsx` (complete file - no imports needed):
```tsx
// Four Elements - 2x2 Video Grid
// run: bunx vargai@latest render four-elements.tsx --verbose

// note: the original 2x2 <Grid> of element videos is elided
export default (
  <Render width={1920} height={1080}>
    <Music prompt="ambient electronic, peaceful synthesizer pads, gentle nature sounds mixed in, meditative atmosphere, spa music vibes but more cinematic, slow build with subtle percussion" duration={8} />
  </Render>
);
```
## render preview first (recommended workflow)
use --preview to generate only images/thumbnails without rendering full videos. this is faster and cheaper for iteration:
```sh
bunx vargai@latest render video.tsx --preview --verbose
```
once you're happy with the preview frames, render the full video:
```sh
bunx vargai@latest render video.tsx --verbose
```
## open the result
after rendering, open the video with the system default player:
```sh
open output/video.mp4   # macOS; on linux, use xdg-open
```
## setup
requires api keys in .env:
- `FAL_KEY` - image/video generation (https://fal.ai/dashboard/keys)
- `ELEVENLABS_API_KEY` - music/voice (https://elevenlabs.io/app/settings/api-keys)

## available models

```tsx
// image generation
fal.imageModel("flux-schnell")         // fast, good quality
fal.imageModel("flux-pro")             // highest quality
fal.imageModel("nano-banana-pro/edit") // reference-based (character consistency)

// video generation
fal.videoModel("wan-2.5")    // balanced speed/quality
fal.videoModel("kling-v2.5") // highest quality, 10s max
fal.videoModel("sync-v2")    // lipsync

// audio
elevenlabs.speechModel("turbo") // fast TTS
elevenlabs.musicModel()         // music generation
```
## key props reference

| component | key props |
| --- | --- |