Microsoft develops AI-enabled 'bot artist'

Press Trust of India Washington
Last Updated : Jan 19 2018 | 1:35 PM IST
Microsoft researchers are developing a 'drawing bot', powered by artificial intelligence (AI), that can create images from text descriptions of an object.
The technology can generate images of everything from ordinary pastoral scenes, such as grazing livestock, to the absurd, such as a floating double-decker bus, Microsoft said in a blog post.
Each image contains details that are absent from the text descriptions, indicating that this artificial intelligence contains an artificial imagination, it said.
The technology under development in Microsoft's research labs is programmed to pay close attention to individual words when generating images from caption-like text descriptions, the company said.
This deliberate focus produces a nearly three-fold boost in image quality compared with the previous state-of-the-art technique for text-to-image generation, according to results on an industry-standard test reported in a research paper posted on arXiv.org.
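As a rough illustration of what paying attention to individual words can mean in practice, the sketch below weights each word of a caption by how well it matches the part of the image being drawn. It is a minimal sketch under stated assumptions, not Microsoft's implementation: the function names, the made-up two-dimensional word vectors and the use of dot-product scores with a softmax are all assumptions chosen for illustration.

```python
import math


def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(region_query, word_vectors):
    """Weight each caption word by its relevance to one image region.

    Hypothetical helper: relevance is assumed to be a plain dot product.
    Returns the per-word weights and the weighted average word vector.
    """
    scores = [sum(q * w for q, w in zip(region_query, vec)) for vec in word_vectors]
    weights = softmax(scores)
    dim = len(word_vectors[0])
    context = [
        sum(wt * vec[i] for wt, vec in zip(weights, word_vectors))
        for i in range(dim)
    ]
    return weights, context


# Toy caption with made-up 2-D word vectors (illustrative values only).
caption = ["yellow", "bird", "black", "wings"]
word_vectors = [[1.0, 0.0], [0.5, 0.5], [0.0, 1.0], [0.2, 0.8]]

# A made-up query vector standing in for the region currently being drawn.
region_query = [2.0, 0.0]

weights, context = attend(region_query, word_vectors)
best_word = caption[weights.index(max(weights))]
print(best_word, [round(w, 3) for w in weights])
```

In this toy setup the region's query lines up most closely with the vector for "yellow", so that word receives the largest weight and dominates the context passed on to the image generator.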
"If you go to Bing and you search for a bird, you get a bird picture. But here, the pictures are created by the computer, pixel by pixel, from scratch," said Xiaodong He, a principal researcher at Microsoft's research lab in Washington.
He and his colleagues started with technology that automatically writes photo captions, the CaptionBot, and then moved on to technology that answers questions humans ask about images, such as the location or attributes of objects, which can be especially helpful for blind people.
"Now we want to use the text to generate the image," said Qiuyuan Huang, a postdoctoral researcher in He's group.
Text-to-image generation technology could find practical applications acting as a sort of sketch assistant to painters and interior designers, or as a tool for voice-activated photo refinement, the researchers said.
At the core of Microsoft's drawing bot is a technology known as a Generative Adversarial Network, or GAN.
The network consists of two machine learning models: a generator that creates images from text descriptions, and a discriminator that uses the text descriptions to judge the authenticity of the generated images.
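The interplay between the two models can be sketched in a few lines of Python. This is a toy forward pass under stated assumptions, not the system described in the research paper: the image-producing model (called `generator` here) and the discriminator are each reduced to a single layer of random weights, the "image" is a list of four pixel values, and all names, sizes and numbers are hypothetical. The loss formulas are the standard ones for a GAN.

```python
import math
import random


def sigmoid(x):
    """Squash any real number into the range 0..1."""
    return 1.0 / (1.0 + math.exp(-x))


def generator(text_vec, noise, weights):
    """Map a text vector plus random noise to a toy image (pixel values in 0..1)."""
    inputs = text_vec + noise
    return [sigmoid(sum(w * x for w, x in zip(row, inputs))) for row in weights]


def discriminator(image, text_vec, weights):
    """Return the estimated probability that the image is real and fits the text."""
    inputs = image + text_vec
    return sigmoid(sum(w * x for w, x in zip(weights, inputs)))


rng = random.Random(0)  # fixed seed so the sketch is repeatable
TEXT_DIM, NOISE_DIM, PIXELS = 3, 2, 4  # hypothetical toy sizes

# Untrained, randomly initialised weights for both models.
gen_weights = [
    [rng.uniform(-1, 1) for _ in range(TEXT_DIM + NOISE_DIM)]
    for _ in range(PIXELS)
]
disc_weights = [rng.uniform(-1, 1) for _ in range(PIXELS + TEXT_DIM)]

text_vec = [0.9, 0.1, 0.4]         # stand-in for an encoded caption
real_image = [0.8, 0.7, 0.2, 0.1]  # stand-in for a real photo matching it
noise = [rng.gauss(0, 1) for _ in range(NOISE_DIM)]

fake_image = generator(text_vec, noise, gen_weights)
p_real = discriminator(real_image, text_vec, disc_weights)
p_fake = discriminator(fake_image, text_vec, disc_weights)

# The discriminator is rewarded for scoring real images high and fakes low.
disc_loss = -math.log(p_real) - math.log(1.0 - p_fake)
# The generator is rewarded when its fake image is scored as real.
gen_loss = -math.log(p_fake)
```

In actual training the two losses would be minimised in alternation, so that the generator gradually learns to produce images the discriminator can no longer tell apart from real ones.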

Disclaimer: No Business Standard Journalist was involved in creation of this content

First Published: Jan 19 2018 | 1:35 PM IST
