Pricing

API pricing for video generation

Video generation is billed per second of output video. Higher resolution and premium models have proportionally higher costs.

Pricing update: Starting April 1, 2026, pricing for LTX 2.3 Text-to-Video and Image-to-Video endpoints will be updated. See the updated rates in the tables below.

Text-to-Video

Generate videos from text prompts. Pricing is based on the duration of the generated video in seconds.

EndpointModelResolutionCost per second
v1/text-to-videoltx-2-fast1920x1080$0.04
2560x1440$0.08
3840x2160$0.16
ltx-2-pro1920x1080$0.06
2560x1440$0.12
3840x2160$0.24
ltx-2-3-fast1920x1080 / 1080x1920$0.04 → $0.06
2560x1440 / 1440x2560$0.08 → $0.12
3840x2160 / 2160x3840$0.16 → $0.24
ltx-2-3-pro1920x1080 / 1080x1920$0.06 → $0.08
2560x1440 / 1440x2560$0.12 → $0.16
3840x2160 / 2160x3840$0.24 → $0.32

Image-to-Video

Generate videos from an input image. Pricing is based on the duration of the generated video in seconds.

EndpointModelResolutionCost per second
v1/image-to-videoltx-2-fast1920x1080$0.04
2560x1440$0.08
3840x2160$0.16
ltx-2-pro1920x1080$0.06
2560x1440$0.12
3840x2160$0.24
ltx-2-3-fast1920x1080 / 1080x1920$0.04 → $0.06
2560x1440 / 1440x2560$0.08 → $0.12
3840x2160 / 2160x3840$0.16 → $0.24
ltx-2-3-pro1920x1080 / 1080x1920$0.06 → $0.08
2560x1440 / 1440x2560$0.12 → $0.16
3840x2160 / 2160x3840$0.24 → $0.32

Audio-to-Video

Generate videos synchronized to audio input. Pricing is based on the duration of the input audio in seconds.

EndpointModelResolutionCost per second
v1/audio-to-videoltx-2-pro1920x1080$0.10
ltx-2-3-pro1920x1080$0.10

Retake (Video Editing)

Edit and regenerate portions of an existing video. Pricing is based on the duration of the input video.

EndpointModelResolutionCost per second
v1/retakeltx-2-pro1920x1080$0.10
ltx-2-3-pro1920x1080$0.10

Extend (Video Extension)

Extend a video by generating additional frames at the beginning or end. Pricing is based on the duration of the extended portion plus context frames from the input video, capped at a total of 505 billed frames. The resulting billed seconds depend on the input video’s frame rate (~21 seconds at 24fps).

EndpointModelResolutionCost per second
v1/extendltx-2-pro1920x1080$0.10
ltx-2-3-pro1920x1080$0.10