For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
Example: “Gentle rustling of leaves, distant bird calls, soft wind”
Longer, descriptive prompts consistently produce better motion, coherence, and audio alignment.
See our Prompting guide for additional tips on writing good prompts.
5. Prompt Enhancement
Locate the Prompt Enhancer node:
Enabled by default: Automatically expands and improves your prompt
Bypass option: Disable when you need exact control over phrasing
Best for: Shorter prompts that need more detail
The enhancer adds:
Visual details and scene elements
Motion descriptions and dynamics
Audio cues and atmospheric details
6. Configure Sampling
Find the KSampler or LTXVSampler node:
Steps
Distilled model: 4-8 steps (optimized for speed)
Full model: 20-50 steps (higher quality)
Start with lower values and increase if quality is insufficient
CFG (Classifier-Free Guidance)
Range: 2.0-5.0
Lower values (2.0-3.0): More creative, less prompt adherence
Higher values (4.0-5.0): Stronger prompt adherence, less variation
Recommended: 3.0-3.5 for balanced results
Sampler Type
euler: Fast, good for testing
dpmpp_2m: Higher quality, slightly slower
Experiment to find your preference
Seed
Fixed seed: Reproducible results for iteration
Random seed: Explore variations
7. Two-Stage Generation
The workflow uses a multi-scale approach:
Base Generation
Generates at your specified resolution
Fast iteration for testing prompts and parameters
Upscale Pass
Increases resolution and refines details
Uses the LTXVUpscale node
Scale factor: Typically 2x
Frame rate: Must match base generation
This approach provides:
Faster experimentation at lower resolution
High-quality final output without generating everything at max resolution from the start
8. Decoding
The workflow uses LTXVDecoder nodes:
Audio Decoder
Processes audio latents separately
Outputs synchronized audio stream
Supports dialogue, music, and ambient sound
Video Decoder
Uses tiled decoding to minimize VRAM usage
Processes video latents in manageable chunks
Maintains quality while reducing memory requirements
Note: Audio and video are generated separately, then merged during decoding for synchronized output.
9. Save Output
Configure the SaveVideo node:
Format: MP4 (default), MOV, or WebM
Codec: H.264 (compatibility) or H.265 (smaller files)
Audio: Automatically embedded from audio decoder
Filename: Use descriptive names for organization
Advanced Techniques
Full Model Variant
The full model workflow provides higher quality at the cost of longer generation time.
Key Differences
Uses the full LTX-2 checkpoint and specialized VAE
Stage 1: 15-20 steps (up to 40 for experimentation)
Uses LTXV Scheduler instead of manual sigmas
Applies distilled LoRA in Stage 2 (recommended strength: 0.6)
Using LoRAs
Add LoRALoader nodes to customize:
Style LoRAs: Apply artistic styles or visual aesthetics
Motion LoRAs: Enhance specific types of movement
Character LoRAs: Maintain consistent character appearance