LTX-2: The First Open-Source Efficient Joint Audio-Visual Foundation Model

LTX-2 is the first open-source model that generates synchronized audio and video together using a joint diffusion process, enabling realistic speech, sound effects, and motion alignment in a single system.