Apple releases AI tool for image editing through text input: Details here

The MGIE model consists of MLLM that provides descriptive instructions to the diffusion model for achieving desired editing results.

Apple iPhone
Image: Apple
Harsh Shivam New Delhi
2 min read Last Updated : Feb 08 2024 | 11:50 AM IST

Researchers at Apple have published a new paper detailing their MLLM-Guided Image Editing (MGIE) AI Model, which can edit an image using text prompts. Apple worked alongside University of California, Santa Barbara researchers to come up with a new model that is capable of handling a wide range of editing scenarios, from simple colour adjustments to more complex object manipulations. 

The MGIE model consists of a Multimodal Large Language Model that expands users request and provides "concise expressive instructions" that the diffusion model can use to edit the input image. According to the research paper, this way of editing allows the MGIE model to address "ambiguous human commands to achieve reasonable editing".

For example, a picture of a pizza with the input "make it more healthy" is understood by the MLLM, which interprets the ambiguous term "healthy" and connects it with "Vegetable toppings on a pizza". The diffusion model then edits the image according to the instructions provided by the MLLM. 

READ: Adopt AI, don't sit on sidelines: Microsoft's Satya Nadella to CEOs

According to the research, existing models such as LLM-Guided Image Editing (LGIE) lack the visual perception of MGIE. The Large Language Model (LLM) is confined to a single modality, while the MLLM, with access to the input image and cross-modal understanding, derives more descriptive instructions. For example, if the user wants the image to be brighter, the MLLM within the MGIE model will let the diffusion model know which regions should be brightened.

MGIE is available as an open-source project on GitHub and can be downloaded with code, data and pre-trained models. According to VentureBeat, the image editing model is also available through a web demo hosted on Hugging Face spaces. However, Apple has not yet confirmed how it plans to utilise this model beyond research projects. 

Earlier this month, During Apple's quarterly earnings call, CEO Tim Cook confirmed that the company is working on AI features for its devices that will be announced later this year. Apple is expected to incorporate gen-AI features into its virtual assistant Siri and Messages app for features like text summarisation, suggestions and more. Similarly, other services across Apple's platform, such as Apple Music, Pages and Keynotes, will likely get the AI treatment, too. 

*Subscribe to Business Standard digital and get complimentary access to The New York Times

Smart Quarterly

₹900

3 Months

₹300/Month

SAVE 25%

Smart Essential

₹2,700

1 Year

₹225/Month

SAVE 46%
*Complimentary New York Times access for the 2nd year will be given after 12 months

Super Saver

₹3,900

2 Years

₹162/Month

Subscribe

Renews automatically, cancel anytime

Here’s what’s included in our digital subscription plans

Exclusive premium stories online

  • Over 30 premium stories daily, handpicked by our editors

Complimentary Access to The New York Times

  • News, Games, Cooking, Audio, Wirecutter & The Athletic

Business Standard Epaper

  • Digital replica of our daily newspaper — with options to read, save, and share

Curated Newsletters

  • Insights on markets, finance, politics, tech, and more delivered to your inbox

Market Analysis & Investment Insights

  • In-depth market analysis & insights with access to The Smart Investor

Archives

  • Repository of articles and publications dating back to 1997

Ad-free Reading

  • Uninterrupted reading experience with no advertisements

Seamless Access Across All Devices

  • Access Business Standard across devices — mobile, tablet, or PC, via web or app

More From This Section

Topics :Apple artifical intelligenceApple iOS

First Published: Feb 08 2024 | 11:50 AM IST

Next Story