MIT and DeepMind's new AI can't be tricked with weird lighting

By Matthew Griffin Intelligence and the Senses 26th March 2018

WHY THIS MATTERS IN BRIEF

As we produce more systems that rely on machine vision to do their jobs or keep people safe, systems that can’t see the world accurately pose both a danger and a risk.

Machine Vision has come a long way since Imagenet, a large repository of labelled images that researchers use to train their newest Artificial Intelligence (AI) agents with, was released, but still to this day images with bad, tricky or just plain weird lighting can still confuse even the best AI’s algorithms and get them to misreport whatever it is they’re looking at. And there are a multitude of examples where an AI has been tricked or confused, such as an AI that was being used to run an autonomous train prototype that mistook a shadow for a rock and came to a dead stop on the track, and even Nvidia’s DAVE 2.0 self-driving car software that under certain lighting conditions would send a simulated car off a cliff, both of which exemplify the issue that machine vision enthusiasts everywhere still face.

Invidia's Canvas puts a professional AI artist in the palm of your hand

Over the past couple of years in order to try to overcome the issue researchers have either tried to create special hand crafted rules about how light interacts with objects or used data sets that cover as many lighting situations as possible, but there is a nearly limitless combination of items and light in the real world and that handicaps both approaches.

Now though a paper by researchers from MIT and DeepMind has detailed a new AI process that can identify images in different lighting without having to hand craft new rules or train on a huge data set. The process, called a Rendered Intrinsics Network, or RIN for short, automatically separates an image into reflectance, shape, and lighting layers. It then recombines the layers into a reconstruction of the original image.

To train their RIN the researchers started off by creating a data set consisting of five shapes including cones, cubes, cylinders, spheres and torus’s, and rendered each one with ten different orientations and over five hundred different colours.

JLR turns UK motorways into a test track for autonomous cars

As a proof of concept they then showed how breaking down an image into the three layers could help a computer identify what an item in an image is, or at least figure out what the real shape of objects in said image could be. For example, the model also learned to spot and categorise much more complicated objects, such as the classic image test models Stanford bunny, Utah teapot, and Blender’s Suzanne, after being trained on the basic sample shapes, without ever seeing specifically labelled examples.

Beyond offering a new way to overcome the problem of infinite lighting situations for an image RIN is also an example of learning with unlabelled data. Most AI still needs labelled data to learn, and preparing it takes hours of repetitive human labour so finding a way to learn from unlabelled data itself is yet another AI frontier that needs overcome, so the teams, especially DeepMind, who recently created one of the world’s first self-learning AI’s called Alpha Zero, have made progress on both fronts.

Matthew Griffin / About Author

Matthew Griffin is a multi-award winning Futurist and expert in Disruption and Innovation, Geopolitics, Leadership, and Technology, who NASA have described as a "walking encyclopaedia of the future" and a "futurist Polymath." 15-time best selling author of the "Codex of the Future" series, Matthew is the Founder and Futurist in Chief of the 311 Institute, a global Futures and Deep Futures advisory firm working with royal households, world leaders, G7, G20, and G77 governments, NGOs, and multi-national mid and mega cap firms to help them explore, shape, and lead the next 50 years of business and society.

An award-winning YouTube creator with over a million followers, with an unrivalled global reach and impact, Matthew is a highly sought-after international keynote speaker, lecturer, and mentor who collaborates with global leaders through the United Nations Alliance of Civilizations (UNAOC) and United Nations General Assembly (UNGA) to shape pivotal initiatives such as the UN’s AI for Humanity program, the United Nations Conference of the Parties (UN COP), and the World Economic Forum in Davos.

As the former Global Head of Cloud, National Security, and Enterprise Sales for companies including Atos, Dell-EMC, and IBM, Matthew has a proven track record of building multi-billion dollar business units and turning failing divisions into market leaders. His ability to identify, analyse, and communicate the implications of hundreds of emerging technologies and trends is unparalleled, and his insights are trusted by many of the world’s most respected organisations, including ABB, Accenture, Adidas, AON, ARM, BCG, Centrica, Citi, Coca-Cola, Dentons, Deloitte, Dow Jones, EY, Google, KPMG, Lego, Legal & General, LinkedIn, Microsoft, PepsiCo, Qualcomm, RWE, Samsung, Siemens AG and Siemens Energy, T-Mobile, UBS, VISA, Walmart, Workday, Worldpay and many others.

Regularly featured in the global media including the AP, BBC, Bloomberg, CNBC, Discovery, Forbes, Khaleej Times, Telegraph, TIME, ViacomCBS, WIRED, and the WSJ, Matthews mission is to help organisations create a fair and sustainable future whose benefits are shared by everyone irrespective of their ability, background, or circumstances.