Google Lumiere: Everything about multimodal AI model for videos creation

Lumiere video generation AI model lets users apply text-based image editing methods for consistent video editing, said Google

Google Lumiere, Google's AI video Generator
Google Lumiere
Harsh Shivam New Delhi
2 min read Last Updated : Jan 29 2024 | 11:28 AM IST

Google has unveiled a new multimodal AI model ‘Lumiere’ for video generation. Google said, “Lumiere is a text-to-video diffusion model designed for synthesising videos that portray realistic, diverse and coherent motion.” The company touted that the model facilitates content creation tasks and video editing applications such as image-to-video, video in painting, and stylized video generation.

According to Google, Lumiere model uses a Space-Time u-Net (STUNet) architecture to generate videos. Using this architectural design, the model processes all frames in a video at once instead of generating keyframes and then filling the missing frames using temporal super-resolution (TSR) models, which is a common approach for existing video generators.

Google said Lumiere generates the entire temporal duration of the video at once by deploying both spatial and temporal down- and up-sampling. It essentially means the model first generates a full frame rate video in low resolution and later upscales the generated video using a spatial super-resolution (SSR) model to produce the final result. In the research paper previewing Lumiere, Google said that the sample videos generated by the AI model are 80 frames long at 16 frames-per-second, essentially 5 seconds long. The initially generated video is at 128x128 resolution, which is then upscaled to 1024x1024 using SSR.

According to Google, Lumiere video generation model also lets users apply text-based image editing methods for consistent video editing. For example, its Cinemagraphs feature lets users animate a specific region within the image to generate a video. For stylized video generation, Lumiere can generate videos in the target style using a single reference image provided by the user.

*Subscribe to Business Standard digital and get complimentary access to The New York Times

Smart Quarterly

₹900

3 Months

₹300/Month

SAVE 25%

Smart Essential

₹2,700

1 Year

₹225/Month

SAVE 46%
*Complimentary New York Times access for the 2nd year will be given after 12 months

Super Saver

₹3,900

2 Years

₹162/Month

Subscribe

Renews automatically, cancel anytime

Here’s what’s included in our digital subscription plans

Exclusive premium stories online

  • Over 30 premium stories daily, handpicked by our editors

Complimentary Access to The New York Times

  • News, Games, Cooking, Audio, Wirecutter & The Athletic

Business Standard Epaper

  • Digital replica of our daily newspaper — with options to read, save, and share

Curated Newsletters

  • Insights on markets, finance, politics, tech, and more delivered to your inbox

Market Analysis & Investment Insights

  • In-depth market analysis & insights with access to The Smart Investor

Archives

  • Repository of articles and publications dating back to 1997

Ad-free Reading

  • Uninterrupted reading experience with no advertisements

Seamless Access Across All Devices

  • Access Business Standard across devices — mobile, tablet, or PC, via web or app

More From This Section

Topics :GoogleGoogle's AIartifical intelligenceAI systemsTechnology

First Published: Jan 29 2024 | 11:28 AM IST

Next Story