Generate video from audio
Authentication
Request
Audio file to be used as the soundtrack for the video. See Input Formats for supported formats and size limits.
Input image to be used as the first frame of the video. Required if prompt is not provided. See Input Formats for supported formats and size limits.
Text description of how the video should be generated. Required if image_uri is not provided. Can be empty string when image_uri is provided. If image_uri is provided, this describes how the image should be animated. If no image_uri is provided, this describes the video content.
Optional guidance scale (also known as CFG) for video generation. Higher values make the output more closely follow the prompt but may reduce quality. Defaults to 5 for text-to-video, or 9 when providing an image.
Model to use for video generation. Currently only ltx-2-pro is supported.