🎉 LTX-2 Model Released

LTX-2: Synchronized Audio-Video Generation

LTX-2 generates 4K video at up to 50 FPS with matched audio in a single pass.
Native resolution, high fidelity, and open source.

🎁 Experience the future of generative media

LTX-2 Generation Example

The First Native Audio-Video AI

LTX-2 is a single model that generates realistic video and audio together, so no separate audio pass is needed in post-processing.

Synchronized Generation

Generates video and audio (ambient, dialogue, music) in a single pass for perfect synchronization.

Native 4K Resolution

Supports 3840x2160 resolution at up to 50 FPS for professional-grade output.
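
To put those numbers in perspective, here is a minimal sketch of the raw pixel throughput at 3840x2160 and 50 FPS. The 10-second clip length and the 3 bytes per pixel (8-bit RGB, uncompressed) are assumptions chosen for illustration, not LTX-2 settings, and the helper name is hypothetical.

```python
def raw_video_stats(width, height, fps, seconds, bytes_per_pixel=3):
    """Return frame count and uncompressed size figures for a clip.

    bytes_per_pixel=3 assumes 8-bit RGB; this is an illustrative default,
    not a statement about LTX-2's internal format.
    """
    pixels_per_frame = width * height
    frames = fps * seconds
    bytes_per_second = pixels_per_frame * bytes_per_pixel * fps
    return {
        "pixels_per_frame": pixels_per_frame,
        "frames": frames,
        "bytes_per_second": bytes_per_second,
    }


# Hypothetical 10-second clip at the resolution and frame rate quoted above.
stats = raw_video_stats(3840, 2160, fps=50, seconds=10)
print(stats)
```

Under these assumptions, each frame holds about 8.3 million pixels, and the uncompressed stream is roughly 1.24 GB per second before any encoding.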

High Efficiency

Optimized to run on consumer GPUs (RTX series, 12GB+ VRAM) with NVFP8 quantization.
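
As a rough illustration of why 8-bit quantization matters on a 12GB card, the sketch below estimates the memory taken by model weights alone at different precisions. The parameter count is a hypothetical placeholder, not the actual size of LTX-2, and the estimate ignores activations, caches, and framework overhead.

```python
def weight_memory_gib(num_params, bits_per_param):
    """Approximate memory for model weights alone, in GiB."""
    return num_params * bits_per_param / 8 / 2**30


# Hypothetical parameter count, used only to show the scaling.
HYPOTHETICAL_PARAMS = 10_000_000_000

for bits in (16, 8):
    gib = weight_memory_gib(HYPOTHETICAL_PARAMS, bits)
    print(f"{bits}-bit weights: {gib:.2f} GiB")
```

Halving the bits per weight halves the weight footprint, which is the general reason an 8-bit checkpoint can fit where a 16-bit one would not.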

Multimodal Input

Supports text-to-video, image-to-video, and video-to-video with precise control.

Why Choose LTX-2

Experience the advantages of a unified audio-visual generation model.

Unlike pipelines that generate audio in a separate step, LTX-2 produces sound that matches the on-screen action in the same pass, for example rainfall for a rainy scene.

Key Features

Advanced capabilities for next-generation content creation.

Native 4K 50FPS

Generate 3840x2160 video at up to 50 FPS for smooth motion.

Audio-Video Sync

Single-model architecture ensures audio perfectly matches visuals.

Consumer Hardware

Runs locally on NVIDIA RTX GPUs with 12GB+ VRAM.

Multiple Modes

Fast, Pro, Ultra, and Distilled modes for different needs.

Precise Control

Camera control, keyframes, and reference images supported.

Open Ecosystem

Hugging Face weights, GitHub code, and community extensions.

Start Creating with LTX-2

Download the model and start generating synchronized audio-visual content today.

LTX-2 - Synchronous Audio-Video Generation