DeepMind's self-learning AI took four hours to learn and master chess

By Matthew Griffin | Intelligence and the Senses | 19th December 2017

WHY THIS MATTERS IN BRIEF

Generalised AIs that can learn and master complex subjects within hours will have wide-ranging benefits for industry and society alike.

As most people will happily tell you, chess isn’t an easy game, at least by human standards. But for an Artificial Intelligence (AI) powered by what many are starting to call an “alien like” intelligence, it turns out that the game that has kept some of the best human minds busy for centuries is little more than a trivial distraction that can be learned and mastered in just a few hours.

In a paper published last week, a team of researchers at Google’s DeepMind lab detailed their latest creation, AlphaZero, a tweaked, more generic version of AlphaGo Zero, the self-learning AI that “creates its own knowledge with no human input.” AlphaGo Zero is itself the descendant of the great AlphaGo, which routinely annihilated the world’s best Go players before being annihilated in turn by AlphaGo Zero, 100 games to nil. Given nothing more than the rules of chess, AlphaZero needed just four hours of training before obliterating the open source world champion chess program Stockfish with “superhuman” performance.

Put another way, AlphaZero went from knowing nothing but the rules of chess to superhuman play in less time than it takes to drive from London to Manchester. It then took Stockfish to town over 100 games, winning 28 and drawing the other 72, with Stockfish recording no wins and AlphaZero no losses.
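To put that result in more familiar chess terms, here is a rough sketch that converts the 28 wins, 72 draws and 0 losses into a match score and an approximate rating gap. The standard logistic Elo formula is an assumption brought in purely for illustration, it is not taken from the DeepMind paper, and a 100 game sample only supports a ballpark figure. The function names are hypothetical.

```python
import math


def match_score(wins, draws, losses):
    """Fraction of available points scored (win = 1, draw = 0.5, loss = 0)."""
    games = wins + draws + losses
    return (wins + 0.5 * draws) / games


def elo_gap(score):
    """Rating difference implied by an expected score under the logistic Elo model.

    Assumption: the usual Elo expectation E = 1 / (1 + 10 ** (-gap / 400)),
    solved for gap. Only defined for 0 < score < 1.
    """
    return -400 * math.log10(1 / score - 1)


# Figures quoted in the article: 28 wins, 72 draws, 0 losses for AlphaZero.
score = match_score(28, 72, 0)
print(f"match score: {score:.2f}")
print(f"implied Elo gap: {elo_gap(score):.0f} points")
```

Under those assumptions the match score is 64 percent, which corresponds to a gap of roughly 100 Elo points in AlphaZero’s favour.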

“We now know who our new overlord is,” said chess researcher David Kramaley, the CEO of chess science website Chessable, “it will no doubt revolutionise the game, but think about how this could be applied outside chess. This algorithm could run cities, continents, universes.”

DeepMind has been developing and refining its AIs for years. It has taught them to dream, fight one another and imagine, given them human-like memory and the ability to self-learn, and equipped them with superhuman skills that include building new AIs, lip reading, playing games and translating hundreds of languages on the fly, and it’s clear that it’s only just getting warmed up.

Unlike their predecessors, which all learned to play their games by watching and analysing the moves made by human players, AlphaGo Zero and AlphaZero learn purely by playing against themselves. Ironically, studying human play was intended to help the fledgling AIs master strategy, but it now increasingly looks like it was more of a hindrance than a help, and the two newest members of the family are showing us just how devastatingly effective their approach to self-learning is.

“It’s like an alien civilisation inventing its own mathematics,” says computer scientist Nick Hynes from MIT, “what we’re seeing here is a model free from human bias and presuppositions. It can learn whatever it determines is optimal, which may indeed be more nuanced than our own conceptions of the same.”

In their latest paper the researchers outline how AlphaZero takes the self-learning technique, called reinforcement learning, and applies it much more generally than you’d expect, giving it the ability to solve a broader range of problems. That broader focus means AlphaZero doesn’t just play chess. It also plays shogi, a Japanese form of chess, and Go, and it took only two and eight hours respectively to master those games as well.
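For readers who want a feel for what learning from the rules alone looks like, here is a minimal sketch of self-play reinforcement learning on a toy game. It is not AlphaZero’s method, which pairs a deep neural network with Monte Carlo tree search. Instead it uses plain tabular Q-learning on a simple take-away game (remove one to three stones per turn, whoever takes the last stone wins). The game, the function names and the parameter values are all illustrative assumptions.

```python
import random

MAX_TAKE = 3  # assumption for this toy game: remove 1, 2 or 3 stones per turn


def legal_moves(stones):
    """Moves available to the player to move with `stones` left on the table."""
    return range(1, min(MAX_TAKE, stones) + 1)


def train_by_self_play(max_stones=10, episodes=5000, alpha=0.5, epsilon=0.3, seed=0):
    """Learn move values purely from games the learner plays against itself."""
    rng = random.Random(seed)
    # q[(stones, take)] estimates the result for the player to move:
    # +1 means a forced win, -1 a forced loss. Everything starts at 0.
    q = {(s, a): 0.0 for s in range(1, max_stones + 1) for a in legal_moves(s)}
    for _ in range(episodes):
        stones = rng.randint(1, max_stones)
        while stones > 0:
            moves = list(legal_moves(stones))
            if rng.random() < epsilon:
                take = rng.choice(moves)  # explore a random move
            else:
                take = max(moves, key=lambda a: q[(stones, a)])  # exploit
            remaining = stones - take
            if remaining == 0:
                target = 1.0  # taking the last stone wins
            else:
                # The opponent moves next, so their best outcome is our worst.
                target = -max(q[(remaining, a)] for a in legal_moves(remaining))
            q[(stones, take)] += alpha * (target - q[(stones, take)])
            stones = remaining  # the same table now plays the other side
    return q


def best_move(q, stones):
    """Greedy move according to the learned values."""
    return max(legal_moves(stones), key=lambda a: q[(stones, a)])


q = train_by_self_play()
for pile in (5, 6, 7, 9, 10):
    print(f"{pile} stones: take {best_move(q, pile)}")
```

No human games or strategy hints go in, only the rules and the win condition, yet the learned moves match the known winning strategy for this game, which is to always leave the opponent a multiple of four stones.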

“I always wondered how it would be if a superior species landed on Earth and showed us how they played chess,” said chess grandmaster Peter Heine Nielsen, “now I know.”

Checkmate, mate.

Matthew Griffin / About Author

Matthew Griffin is a multi-award winning Futurist and expert in Disruption and Innovation, Geopolitics, Leadership, and Technology, who NASA have described as a "walking encyclopaedia of the future" and a "futurist Polymath." 15-time best selling author of the "Codex of the Future" series, Matthew is the Founder and Futurist in Chief of the 311 Institute, a global Futures and Deep Futures advisory firm working with royal households, world leaders, G7, G20, and G77 governments, NGOs, and multi-national mid and mega cap firms to help them explore, shape, and lead the next 50 years of business and society.

An award-winning YouTube creator with over a million followers, with an unrivalled global reach and impact, Matthew is a highly sought-after international keynote speaker, lecturer, and mentor who collaborates with global leaders through the United Nations Alliance of Civilizations (UNAOC) and United Nations General Assembly (UNGA) to shape pivotal initiatives such as the UN’s AI for Humanity program, the United Nations Conference of the Parties (UN COP), and the World Economic Forum in Davos.

As the former Global Head of Cloud, National Security, and Enterprise Sales for companies including Atos, Dell-EMC, and IBM, Matthew has a proven track record of building multi-billion dollar business units and turning failing divisions into market leaders. His ability to identify, analyse, and communicate the implications of hundreds of emerging technologies and trends is unparalleled, and his insights are trusted by many of the world’s most respected organisations, including ABB, Accenture, Adidas, AON, ARM, BCG, Centrica, Citi, Coca-Cola, Dentons, Deloitte, Dow Jones, EY, Google, KPMG, Lego, Legal & General, LinkedIn, Microsoft, PepsiCo, Qualcomm, RWE, Samsung, Siemens AG and Siemens Energy, T-Mobile, UBS, VISA, Walmart, Workday, Worldpay and many others.

Regularly featured in the global media including the AP, BBC, Bloomberg, CNBC, Discovery, Forbes, Khaleej Times, Telegraph, TIME, ViacomCBS, WIRED, and the WSJ, Matthew’s mission is to help organisations create a fair and sustainable future whose benefits are shared by everyone irrespective of their ability, background, or circumstances.

Comments (5)

Prof. Nick Colosimo CEng FIET FIKE

21st December 2017 at 5:18 pm

It’s a very good achievement albeit very narrow and there is scope to improve the representation of the opponent in the work.

Steve Olson

21st December 2017 at 5:21 pm

Can it beat the IBM chess computer?

Justin Roberts

21st December 2017 at 5:21 pm

How about ridding the earth of that pesky species, homo sapiens? We would be the major competitor after all.

Bhaskar Jyoti Nath

21st December 2017 at 5:23 pm

Hey Mathew san … can you share some link or provide any way so that I get a chance to play Chess against DeepMind? (In case it is available).

A decade after AlphaGo AI trained Go players are all thinking alike – Matthew Griffin | Keynote Speaker & Master Futurist

11th March 2026 at 5:08 pm

[…] and refined by playing millions of games against itself. In 2017, its successor, AlphaGo Zero, picked up Go from scratch. Without studying any human games, it learned by playing against itself, with moves based only on […]
