This Microsoft bot can sketch an image from caption-like descriptions

The core of this bot is a technology known as a 'Generative Adversarial Network' or GAN

Image
IANS San Francisco
Last Updated : Jan 19 2018 | 12:54 PM IST

Microsoft is developing a bot that can draw what you want it to by leveraging Artificial Intelligence (AI) technology -- programmed to pay close attention to individual words when generating images from caption-like text descriptions.

The technology, which the researchers simply call the drawing bot, can generate images of everything from ordinary pastoral scenes -- such as grazing livestock -- to the absurd and a floating double-decker bus.

Each image contains details that are absent from the text descriptions, indicating that this AI contains an artificial imagination.

"If you go to Bing and you search for a bird, you get a bird picture. But here, the pictures are created by the computer, pixel by pixel, from scratch. These birds may not exist in the real world -- they are just an aspect of our computer's imagination of birds," Xiaodong He from Microsoft's research lab in a blog post late on Thursday.

According to results on an industry standard test, reported in a research paper posted on arXiv.org, the bot produced a nearly three-fold boost in image quality compared to the previous state-of-the-art technique for text-to-image generation.

The core of this bot is a technology known as a "Generative Adversarial Network" or GAN.

The network consists of two Machine Learning models -- one that generates images from text descriptions and another, known as a discriminator, that uses text descriptions to judge the authenticity of generated images.

The researchers said that text-to-image generation technology could find practical applications acting as a sort of sketch assistant to painters and interior designers or as a tool for voice-activated photo refinement.

For now, the technology is imperfect.

"For AI and humans to live in the same world, they have to have a way to interact with each other. The language and vision are the two most important modalities for humans and machines to interact with each other," The blog post explained.

 

*Subscribe to Business Standard digital and get complimentary access to The New York Times

Smart Quarterly

₹900

3 Months

₹300/Month

SAVE 25%

Smart Essential

₹2,700

1 Year

₹225/Month

SAVE 46%
*Complimentary New York Times access for the 2nd year will be given after 12 months

Super Saver

₹3,900

2 Years

₹162/Month

Subscribe

Renews automatically, cancel anytime

Here’s what’s included in our digital subscription plans

Exclusive premium stories online

  • Over 30 premium stories daily, handpicked by our editors

Complimentary Access to The New York Times

  • News, Games, Cooking, Audio, Wirecutter & The Athletic

Business Standard Epaper

  • Digital replica of our daily newspaper — with options to read, save, and share

Curated Newsletters

  • Insights on markets, finance, politics, tech, and more delivered to your inbox

Market Analysis & Investment Insights

  • In-depth market analysis & insights with access to The Smart Investor

Archives

  • Repository of articles and publications dating back to 1997

Ad-free Reading

  • Uninterrupted reading experience with no advertisements

Seamless Access Across All Devices

  • Access Business Standard across devices — mobile, tablet, or PC, via web or app

More From This Section

First Published: Jan 19 2018 | 12:54 PM IST

Next Story