DeepMind's self-learning AI took four hours to learn and master chess

By Matthew Griffin Intelligence and the Senses 19th December 2017

WHY THIS MATTERS IN BRIEF

Generalised AIs that can learn and master complex subjects within hours will have wide-ranging benefits for industry and society alike.

As most people will happily tell you, chess isn’t an easy game, at least by human standards. But for an Artificial Intelligence (AI) powered by what many are starting to call an “alien-like” intelligence, it turns out that the game that has kept some of the best human minds busy for centuries is little more than a trivial distraction that can be learned and mastered in just a few hours.

In a paper published last week, a team of researchers at Google’s DeepMind lab detailed their latest creation, AlphaZero, a tweaked, more generic version of AlphaGo Zero, the self-learning AI that “creates its own knowledge with no human input.” AlphaGo Zero is itself the descendant of the great AlphaGo, which routinely annihilated the world’s best Go players before being annihilated in turn by AlphaGo Zero, 100 games to nil. Given only the rules of chess, AlphaZero took just four hours to teach itself the game before obliterating the open source world champion chess program Stockfish with “superhuman” performance.

Put another way, AlphaZero taught itself chess to a superhuman standard in less time than it takes to drive from London to Manchester. After being programmed with nothing more than the rules of chess it took Stockfish to town over 100 games, winning 28 and drawing the remaining 72, with Stockfish recording no wins and AlphaZero no losses.

“We now know who our new overlord is,” said chess researcher David Kramaley, the CEO of chess science website Chessable. “It will no doubt revolutionise the game, but think about how this could be applied outside chess. This algorithm could run cities, continents, universes.”

DeepMind has been developing and refining its AIs for years, teaching them to dream, imagine and fight one another, giving them human-like memory and the ability to self-learn, and equipping them with superhuman skills that include building new AIs, lip reading, playing games and translating hundreds of languages on the fly, and it’s clear that it’s only just getting warmed up.

Unlike their predecessors, which all learned to play their games by watching and analysing the moves made by human players, AlphaGo Zero and AlphaZero learn with no human input at all. Ironically, learning from humans was intended to help the fledgling AIs master strategy, but it now looks like it was more of a hindrance than a help, and the two newest members of the family are showing us just how devastatingly effective their approach to self-learning actually is.

“It’s like an alien civilisation inventing its own mathematics,” says computer scientist Nick Hynes from MIT. “What we’re seeing here is a model free from human bias and presuppositions. It can learn whatever it determines is optimal, which may indeed be more nuanced than our own conceptions of the same.”

In their latest paper the researchers outline how AlphaZero takes the self-learning technique, called Reinforcement Learning, and applies it much more generally than you’d expect, giving it the ability to solve a broader range of problems. That broader focus means AlphaZero doesn’t just play chess. It also plays shogi, a Japanese form of chess, and Go, and it took only two and eight hours respectively to master those games as well.
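For readers who want a feel for what learning by self-play means, here is a minimal sketch. It is not DeepMind’s method: AlphaZero pairs a deep neural network with tree search, whereas this toy uses a simple lookup table and a tiny stone-taking game chosen purely for illustration. The game, the function names and every parameter value are assumptions made for this example. Like AlphaZero, though, the program is told only the rules and improves by playing against itself.

```python
import random

MAX_TAKE = 3      # rule: a player removes 1, 2 or 3 stones per turn
START_PILE = 10   # rule: every game starts with 10 stones; taking the last stone wins


def legal_moves(pile):
    """Return the numbers of stones that may legally be taken from this pile."""
    return range(1, min(MAX_TAKE, pile) + 1)


def train(episodes=5000, alpha=0.5, epsilon=0.3, seed=0):
    """Learn move values purely from self-play (illustrative hyperparameters)."""
    rng = random.Random(seed)
    # q[pile][take] estimates the result for the player to move: +1 win, -1 loss.
    q = {p: {a: 0.0 for a in legal_moves(p)} for p in range(1, START_PILE + 1)}
    for _ in range(episodes):
        pile = START_PILE
        while pile > 0:
            moves = list(legal_moves(pile))
            if rng.random() < epsilon:
                take = rng.choice(moves)                      # explore
            else:
                take = max(moves, key=lambda a: q[pile][a])   # exploit
            remaining = pile - take
            if remaining == 0:
                target = 1.0   # the mover took the last stone and wins
            else:
                # The opponent moves next, so their best outcome is our worst.
                target = -max(q[remaining].values())
            q[pile][take] += alpha * (target - q[pile][take])
            pile = remaining   # the same table now plays the other side
    return q


def best_move(q, pile):
    """Return the move the learned table currently rates highest."""
    return max(q[pile], key=q[pile].get)


q_table = train()
learned = {pile: best_move(q_table, pile) for pile in (5, 6, 7, 9, 10)}
print(learned)
```

If the learning has settled, each chosen move should leave the opponent with a multiple of four stones, which is the known winning strategy for this game. Nobody told the program that; it is a small-scale version of an AI “creating its own knowledge.”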

“I always wondered how it would be if a superior species landed on Earth and showed us how they played chess,” said chess grandmaster Peter Heine Nielsen. “Now I know.”

Checkmate, mate.

Matthew Griffin / About Author

Matthew Griffin, described as “The Adviser behind the Advisers” and a “Young Kurzweil,” is the founder and CEO of the World Futures Forum and the 311 Institute, a global Futures and Deep Futures consultancy working across the next 50 years. He is an award-winning futurist and author of the “Codex of the Future” series.

Regularly featured in the global media, including AP, BBC, Bloomberg, CNBC, Discovery, RT, Viacom, and WIRED, Matthew’s ability to identify, track, and explain the impacts of hundreds of revolutionary emerging technologies on global culture, industry and society is unparalleled. Recognised for the past six years as one of the world’s foremost futurists, innovation and strategy experts, Matthew is an international speaker who helps governments, investors, multi-nationals and regulators around the world envision, build and lead an inclusive, sustainable future.

A rare talent, Matthew’s recent work includes mentoring Lunar XPrize teams, re-envisioning global education and training with the G20, and helping the world’s largest organisations envision and ideate the future of their products and services, industries, and countries.

Matthew's clients include three Prime Ministers and several governments, including the G7, Accenture, Aon, Bain & Co, BCG, Credit Suisse, Dell EMC, Dentons, Deloitte, E&Y, GEMS, Huawei, JPMorgan Chase, KPMG, Lego, McKinsey, PWC, Qualcomm, SAP, Samsung, Sopra Steria, T-Mobile, and many more.

Comments (7)

Prof. Nick Colosimo CEng FIET FIKE

21st December 2017 at 5:18 pm

It’s a very good achievement albeit very narrow and there is scope to improve the representation of the opponent in the work.

Steve Olson

21st December 2017 at 5:21 pm

Can it beat the IBM chess computer?

Justin Roberts

21st December 2017 at 5:21 pm

How about ridding the earth of that pesky species, homo sapiens? We would be the major competitor after all.

Bhaskar Jyoti Nath

21st December 2017 at 5:23 pm

Hey Mathew san … can you share some link or provide any way so that I get a chance to play Chess against DeepMind? (In case it is available).
