MUMBAI, India, June 24 -- Intellectual Property India has published a patent application (202617064411 A) filed by Google LLC on May 21, 2026, for Scalable Latent Video Diffusion Via Transformers.
Inventors include Gupta, Agrim; Yu, Lijun; Sohn, Kihyuk; Essa, Irfan Aziz; Jiang, Lu; and Lezama Torres De La Llosa, Jos.
The application for the patent was published on June 12, 2026, under issue no. 24/2026.
Abstract: Methods, systems, and apparatus for scalable latent video diffusion via transformers. In one aspect a method includes generating, using an encoder and from respective videos and collections of images, latent tensors, each latent tensor representing a respective one of the videos or a respective one collection of images, wherein each latent tensor is included in a same latent space; and training a transformer backbone using the latent tensors, the transformer backbone comprising: one or more first self-attention layers that each perform spatial self-attention operations modeling spatial relations in each latent tensor; and one or more second self-attention layers that each perform spatiotemporal self-attention operations modeling spatiotemporal dynamics in latent tensors generated from videos.
Disclaimer: Curated by HT Syndication.