Google teaches robots to learn from each other using a hive mind

By Matthew Griffin | Robo Revolution | 8th October 2016

WHY THIS MATTERS IN BRIEF

Robots learning from other robots will ultimately mean robots will learn new skills faster, accelerating the robot revolution.

The robots of the world are uniting – and that’s either a great thing or a terrifying thing depending on your view.

Google has a plan to speed up robotic learning: get robots to share their experiences via the cloud and collectively improve their capabilities using AI and deep learning. And while the video below might just look like a number of robots learning how to open a door, think of the bigger picture.

Think about thousands, tens of thousands or eventually millions of robots, from drones and autonomous vehicles to manufacturing robots and advanced humanoid robots like ATLAS, learning directly from each other’s experiences via a hive mind. Now think about the potential that has for you, or for us as a species. That’s game changing, however you want to look at it.

Sergey Levine from the Google Brain team, along with collaborators from Alphabet subsidiaries DeepMind and GoogleX, published a blog post on Monday describing an approach for “general-purpose skill learning across multiple robots.”

Teaching robots how to do even the most basic tasks in real world settings such as homes and offices has vexed roboticists for decades. To tackle this challenge, the Google researchers decided to combine two recent technology advances. The first is cloud robotics, a concept that envisions robots sharing data and skills with each other through an online repository. The other is machine learning, and in particular, the application of deep neural networks to let robots learn for themselves.

In a series of experiments carried out by the researchers, individual robotic arms attempted to perform a given task repeatedly. Not surprisingly, each robot was able to improve its own skills over time, learning to adapt to slight variations in the environment and its own motions. But the Google team didn’t stop there. They got the robots to pool their experiences to “build a common model of the skill” that, as the researchers explain, was better and faster than what they could have achieved on their own.

“The skills learned by the robots are still relatively simple – pushing objects and opening doors – but by learning such skills more quickly and efficiently through collective learning, robots might in the future acquire richer behavioural repertoires that could eventually make it possible for them to assist us in our daily lives.”

Overview of the training

Earlier this year, Levine and colleagues from X showed how deep neural nets can help robots teach themselves a grasping task. In that study, a group of robot arms went through some 800,000 grasp attempts, and though they failed a lot in the beginning, their success rate improved significantly as their neural net continuously retrained itself.

In their latest experiments, the Google researchers tested three different scenarios. The first involved robots learning motor skills directly from trial and error practice. Each robot started with a copy of a neural net as it attempted to open a door over and over. At regular intervals, the robots sent data about their performances to a central server, which used the data to build a new neural network that better captured how action and success were related. The server then sent the updated neural net back to the robots.

“Given that this updated network is a bit better at estimating the true value of actions in the world, the robots will produce better behavior,” the researchers wrote.

“This cycle can then be repeated to continue improving on the task.”
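
To make that cycle concrete, here is a minimal Python sketch of the collect, pool, retrain and redistribute loop. It is an illustration under loud assumptions, not the researchers’ implementation: a small table of per-action success estimates stands in for the neural network, and the names `Server`, `Robot`, `train` and the `TRUE_SUCCESS` rates are all hypothetical.

```python
import random

# Hypothetical hidden success rate of each of three candidate actions.
TRUE_SUCCESS = [0.2, 0.5, 0.8]


class Server:
    """Central store that pools experience from every robot."""

    def __init__(self, n_actions):
        self.counts = [0] * n_actions
        self.successes = [0] * n_actions

    def update(self, experiences):
        # Fold the pooled (action, reward) pairs into the shared statistics.
        for action, reward in experiences:
            self.counts[action] += 1
            self.successes[action] += reward
        return self.values()

    def values(self):
        # Estimated success rate per action; 0.5 for actions never tried.
        return [s / c if c else 0.5
                for s, c in zip(self.successes, self.counts)]


class Robot:
    """Holds a local copy of the shared estimates and acts on them."""

    def __init__(self, values, rng, epsilon=0.2):
        self.values = list(values)
        self.rng = rng
        self.epsilon = epsilon

    def attempt(self):
        # Mostly pick the action currently believed best, sometimes explore.
        if self.rng.random() < self.epsilon:
            action = self.rng.randrange(len(self.values))
        else:
            action = max(range(len(self.values)), key=self.values.__getitem__)
        reward = 1 if self.rng.random() < TRUE_SUCCESS[action] else 0
        return action, reward


def train(n_robots=4, rounds=50, attempts_per_round=10, seed=0):
    rng = random.Random(seed)
    server = Server(len(TRUE_SUCCESS))
    robots = [Robot(server.values(), rng) for _ in range(n_robots)]
    for _ in range(rounds):
        pooled = []
        for robot in robots:
            pooled.extend(robot.attempt() for _ in range(attempts_per_round))
        new_values = server.update(pooled)   # server rebuilds the shared model
        for robot in robots:
            robot.values = list(new_values)  # and sends it back to each robot
    return server


if __name__ == "__main__":
    print(train().values())
```

With enough rounds, the pooled estimates in this toy should drift toward the hidden success rates, and every robot benefits from attempts it never made itself.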

In the second scenario, the researchers wanted robots to learn how to interact with objects not only through trial and error but also by creating internal models of the objects, the environment, and their behaviors. Just as with the door opening task, each robot started with its own copy of a neural network as it “played” with a variety of household objects.

The robots then shared their experiences with each other and together built what the researchers describe as a “single predictive model” that gives them an implicit understanding of the physics involved in interacting with the objects. You could probably build the same predictive model by using a single robot, but sharing the combined experiences of many robots gets you there much faster.
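
A toy example can show why pooling helps. The Python sketch below assumes a deliberately simple world in which an object moves a distance proportional to the push force, plus noise; the constants `TRUE_GAIN` and `NOISE_STD` and the functions `play`, `fit_gain` and `pooled_model` are hypothetical, and a one-parameter least-squares fit stands in for the neural network the researchers describe.

```python
import random

TRUE_GAIN = 2.0   # hypothetical: distance moved per unit of push force
NOISE_STD = 0.5   # hypothetical noise on each measured distance


def play(n_pushes, rng):
    """One robot pushes an object n_pushes times and logs (force, distance)."""
    log = []
    for _ in range(n_pushes):
        force = rng.uniform(1.0, 5.0)
        distance = TRUE_GAIN * force + rng.gauss(0.0, NOISE_STD)
        log.append((force, distance))
    return log


def fit_gain(log):
    """Least-squares fit of distance = gain * force (line through the origin)."""
    num = sum(f * d for f, d in log)
    den = sum(f * f for f, _ in log)
    return num / den


def pooled_model(n_robots=8, pushes_per_robot=50, seed=0):
    rng = random.Random(seed)
    logs = [play(pushes_per_robot, rng) for _ in range(n_robots)]
    pooled = [sample for log in logs for sample in log]
    # Return one robot's own estimate, the pooled estimate, and the pool size.
    return fit_gain(logs[0]), fit_gain(pooled), len(pooled)


if __name__ == "__main__":
    single, pooled, n = pooled_model()
    print(f"single robot: {single:.3f}  pooled ({n} samples): {pooled:.3f}")
```

A single robot could reach the pooled estimate’s accuracy by pushing for eight times as long; the fleet gets the same number of samples in one session.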

Finally, the third scenario involved robots learning skills with help from humans. The idea is that people have a lot of intuition about their interactions with objects and the world, and that by assisting robots with manipulation skills we could transfer some of this intuition to robots to let them learn those skills faster. In the experiment, a researcher helped a group of robots open different doors while a single neural network on a central server encoded their experiences. Next, the robots performed a series of trial and error repetitions that were gradually more difficult, helping to improve the network.

“The combination of human guidance with trial and error learning allowed the robots to collectively learn the skill of door opening in just a couple of hours,” the researchers wrote.

“Since the robots were trained on doors that look different from each other, the final policy succeeds on a door with a handle that none of the robots had seen before.”
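
The two-stage idea, human guidance first and trial and error second, can be sketched in a few lines of Python. Everything here is a simplifying assumption made for illustration: the “policy” is a single handle angle rather than a neural network, the demonstrations are a few noisy angles, and `reward`, `init_from_demos` and `refine` are hypothetical names.

```python
import random


def reward(angle, target):
    """Higher is better; zero when the handle is turned to exactly the target."""
    return -(angle - target) ** 2


def init_from_demos(demos):
    """Start the shared policy at the average of the human-guided angles."""
    return sum(demos) / len(demos)


def refine(angle, target, n_robots, rounds, rng, step=2.0):
    """Trial and error: each round, every robot tries a perturbed angle and
    the shared policy keeps the best one only if it beats the current angle."""
    for _ in range(rounds):
        candidates = [angle + rng.uniform(-step, step) for _ in range(n_robots)]
        best = max(candidates, key=lambda a: reward(a, target))
        if reward(best, target) > reward(angle, target):
            angle = best
    return angle


if __name__ == "__main__":
    rng = random.Random(0)
    start = init_from_demos([40.0, 50.0, 48.0])  # hypothetical demonstrations
    final = refine(start, target=45.0, n_robots=4, rounds=20, rng=rng)
    print(start, final)
```

Because the demonstrations put the starting point near a good solution, the trial and error stage only has to make small corrections, which is one plausible reading of why the combination was so quick.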

The Google team explained that the skills their robots have learned are still quite limited. But they hope that, as robots and algorithms improve and become more widely available, the notion of pooling their experiences will prove critical in teaching robots how to do useful tasks.

“In all three of the experiments described above, the ability to communicate and exchange their experiences allows the robots to learn more quickly and effectively. This becomes particularly important when we combine robotic learning with deep learning, as is the case in all of the experiments discussed above. We’ve seen before that deep learning works best when provided with ample training data. For example, the popular ImageNet benchmark uses over 1.5 million labeled examples. While such a quantity of data is not impossible for a single robot to gather over a few years, it is much more efficient to gather the same volume of experience from multiple robots over the course of a few weeks. Besides faster learning times, this approach might benefit from the greater diversity of experience – a real world deployment might involve multiple robots in different places and different settings, sharing heterogeneous, varied experiences to build a single highly generalizable representation,” they said.
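
The “few years” versus “few weeks” comparison is simple arithmetic, sketched below in Python. The collection rate of 2,000 attempts per robot per day and the fleet size of 30 are assumptions chosen purely for illustration; neither figure comes from the researchers.

```python
def days_to_collect(n_samples, n_robots, attempts_per_robot_per_day):
    """Days needed for a fleet to gather n_samples of experience."""
    return n_samples / (n_robots * attempts_per_robot_per_day)


if __name__ == "__main__":
    samples = 1_500_000  # the ImageNet-scale figure quoted above
    rate = 2_000         # hypothetical attempts per robot per day
    print(days_to_collect(samples, 1, rate))   # a single robot
    print(days_to_collect(samples, 30, rate))  # a hypothetical fleet of 30
```

Under those assumed numbers a single robot would need 750 days, roughly two years, while 30 robots working in parallel would need 25 days.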

As robots begin to master the art of learning, it’s inevitable that one day they’ll be able to acquire new skills instantly, and at much, much faster rates than humans have ever been able to.

Comments (5)

Robert

19th July 2017 at 6:45 pm

Robots increase the rate of entropy.
We should tax emissions to protect our environment – https://www.emissionstax.org/pricing-pollution-jobs-robots/

Matthew Griffin

19th July 2017 at 9:08 pm

Hi Robert thanks for the link – it’s an interesting perspective and one that’s not voiced very often, thanks again
