LTX-2 generates high-fidelity 4K 50FPS video with matched audio in a single pass.
Native resolution, high fidelity, and open source.
π Experience the future of generative media

LTX-2 is a revolutionary single-model solution that generates realistic video and audio simultaneously, eliminating the need for post-processing.
Generates video and audio (ambient, dialogue, music) in a single pass for perfect synchronization.
Supports 3840x2160 resolution at up to 50 FPS for professional-grade output.
Optimized to run on consumer GPUs (RTX series, 12GB+ VRAM) with NVFP8 quantization.
Supports text-to-video, image-to-video, and video-to-video with precise control.
Experience the advantages of a unified audio-visual generation model.
Advanced capabilities for next-generation content creation.
Generate stunning high-resolution videos with smooth motion.
Single-model architecture ensures audio perfectly matches visuals.
Runs locally on NVIDIA RTX GPUs with 12GB+ VRAM.
Fast, Pro, Ultra, and Distilled modes for different needs.
Camera control, keyframes, and reference images supported.
Hugging Face weights, GitHub code, and community extensions.
Everything you need to know about LTX-2.
Download the model and start generating synchronized audio-visual content today.