Researchers at the US Army Research Laboratory and the University of Texas at Austin considered a specific case where a human provides real-time feedback in the form of critique.
First introduced by researchers as Training an Agent Manually via Evaluative Reinforcement (TAMER), the team developed a new algorithm called Deep TAMER.
It is an extension of TAMER that uses deep learning - a class of machine learning algorithms that are loosely inspired by the brain to provide a robot the ability to learn how to perform tasks by viewing video streams in a short amount of time with a human trainer.
Many current techniques in artificial intelligence require robots to interact with their environment for extended periods of time to learn how to optimally perform a task.
During this process, the agent might perform actions that may not only be wrong, like a robot running into a wall for example, but catastrophic like a robot running off the side of a cliff.
As a first step, the researchers demonstrated Deep TAMER's success by using it with 15 minutes of human-provided feedback to train an agent to perform better than humans on the Atari game of bowling - a task that has proven difficult for even state-of-the-art methods in artificial intelligence.
Deep-TAMER-trained agents exhibited superhuman performance, besting both their amateur trainers and, on average, an expert human Atari player.
Within the next one to two years, researchers are interested in exploring the applicability of their newest technique in a wider variety of environments: for example, video games other than Atari Bowling and additional simulation environments to better represent the types of agents and environments found when fielding robots in the real world.
"While both humans and autonomous agents can be trained in advance, the team will inevitably be asked to perform tasks, for example, search and rescue or surveillance, in new environments they have not seen before," he said.
"In these situations, humans are remarkably good at generalising their training, but current artificially- intelligent agents are not," he added.
Disclaimer: No Business Standard Journalist was involved in creation of this content
You’ve reached your limit of {{free_limit}} free articles this month.
Subscribe now for unlimited access.
Already subscribed? Log in
Subscribe to read the full story →
Smart Quarterly
₹900
3 Months
₹300/Month
Smart Essential
₹2,700
1 Year
₹225/Month
Super Saver
₹3,900
2 Years
₹162/Month
Renews automatically, cancel anytime
Here’s what’s included in our digital subscription plans
Exclusive premium stories online
Over 30 premium stories daily, handpicked by our editors


Complimentary Access to The New York Times
News, Games, Cooking, Audio, Wirecutter & The Athletic
Business Standard Epaper
Digital replica of our daily newspaper — with options to read, save, and share


Curated Newsletters
Insights on markets, finance, politics, tech, and more delivered to your inbox
Market Analysis & Investment Insights
In-depth market analysis & insights with access to The Smart Investor


Archives
Repository of articles and publications dating back to 1997
Ad-free Reading
Uninterrupted reading experience with no advertisements


Seamless Access Across All Devices
Access Business Standard across devices — mobile, tablet, or PC, via web or app
