Neural networks can now transfer the style of one photo to another

By Matthew Griffin Intelligence and the Senses 30th March 2017

WHY THIS MATTERS IN BRIEF

As we continue to see the rise of new types of synthetic content and tools, style transfer could one day help us create new games and videos faster.

As we continue to see the rise of creative machines that can automatically generate new synthetic content, it’s likely that you’ve heard of an Artificial Intelligence (AI) technique known as “style transfer”, and if you haven’t heard of it then you’ve almost certainly seen it, whether you know it or not.

The process uses deep learning to apply the look and feel of one image to another, and appears in apps like Prisma and Facebook. These style transfers, however, are stylistic rather than photorealistic, and they look good precisely because they look like they’ve been painted. Now a group of researchers from Cornell University and Adobe have extended style transfer so that it can transfer the look of one photo onto another while the result still looks like a photo rather than a painting, and the results are impressive.

The researchers’ work is outlined in a paper called “Deep Photo Style Transfer.” Essentially, they’ve taken the methods of the original style transfer and added another layer to the process, an extra constraint that makes sure that the details of the original image are preserved.

“People are very forgiving when they see [style transfer images] in these painterly styles,” said Cornell professor Kavita Bala, a co-author of the study. “But with real photos there’s a stronger expectation of what we want it to look like, and that’s why it becomes an interesting challenge.”

The added step pays close attention to what Bala calls “local affine patches.” There’s no quick way to accurately translate this phrase, but it basically means that within each small patch of the image the colours may be shifted and rescaled but not rearranged, which keeps the various edges within the image intact, whether that’s the border between a tree and a lake, or a building and the sky. While painterly style transfer tends to play fast and loose with these edges, shifting them back and forth as it pleases, photo style transfer preserves them.
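
To make the “local affine” idea a little more concrete, here is a toy sketch in Python. It is not the researchers’ code, and it makes some loud simplifying assumptions: images are tiny greyscale grids rather than colour photos, patches don’t overlap, and the function names (affine_fit_residual, patches, local_affine_penalty) are made up for illustration. For each patch it finds the best “scale and shift” that turns the input patch into the output patch, then measures what is left over. A simple recolouring leaves almost nothing over, while an output that moves edges around leaves a lot.

```python
def affine_fit_residual(src, dst):
    """Least-squares fit dst ~ a * src + b for one flattened patch.

    Returns (a, b, residual), where residual is the squared error
    that no scale-and-shift of the source patch can explain.
    """
    n = len(src)
    mean_s = sum(src) / n
    mean_d = sum(dst) / n
    var_s = sum((s - mean_s) ** 2 for s in src)
    cov = sum((s - mean_s) * (d - mean_d) for s, d in zip(src, dst))
    # A flat source patch has no slope to fit, so fall back to a pure shift.
    a = cov / var_s if var_s > 1e-12 else 0.0
    b = mean_d - a * mean_s
    residual = sum((a * s + b - d) ** 2 for s, d in zip(src, dst))
    return a, b, residual


def patches(image, size):
    """Yield flattened, non-overlapping size x size patches of a 2D list."""
    height, width = len(image), len(image[0])
    for y in range(0, height - size + 1, size):
        for x in range(0, width - size + 1, size):
            yield [image[y + dy][x + dx]
                   for dy in range(size) for dx in range(size)]


def local_affine_penalty(src_img, out_img, size=2):
    """Sum of per-patch residuals: low means the output is 'locally affine'."""
    return sum(affine_fit_residual(p, q)[2]
               for p, q in zip(patches(src_img, size), patches(out_img, size)))


# Hypothetical 4x4 input with vertical edges (dark and bright columns).
src = [[0.1, 0.9, 0.1, 0.9] for _ in range(4)]

# Photo-like result: same structure, colours rescaled and shifted.
recoloured = [[0.5 * v + 0.3 for v in row] for row in src]

# Painterly-like result: the edges have moved (here, the image is transposed).
warped = [list(col) for col in zip(*src)]

print("recoloured penalty:", local_affine_penalty(src, recoloured))
print("warped penalty:", local_affine_penalty(src, warped))
```

In this sketch the recoloured image scores close to zero and the warped one scores much higher, which is the kind of signal an optimiser could use to discourage edge-bending distortions while still allowing the colours to change.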

There are limits to the technique, of course. The algorithms seem to work best with structures like buildings, and the flaws are more obvious with faces. Furthermore, you can’t use massively different photos for transferring style, otherwise the neural networks have a tougher time analysing elements to copy from picture to picture.

“If you have a picture of a lake and you have a scene where you’re taking the style from, ideally it would also have a water body in it of some sort,” says Bala. “There’s no defined limit, but this is a good open research question. We put the code out because we want people to play with it and try it out.” (The code is available on GitHub.)

The question is, how long will it be until we start seeing these sorts of photo style transfers being made accessible to the public, or being used in game and synthetic content development? My guess would be a couple of years. After all, the original style transfer went from a first research paper to Facebook’s app, reaching hundreds of millions of users, in less than two years. And with Adobe’s involvement in this paper, it’s reasonable to expect that the company is at least a little bit interested in some sort of commercialisation.

For now, though, the researchers are already thinking about what areas photorealistic style transfer could be applied to next.

“The question of how far you can push it is important,” says Bala. “Video is a logical thing for it to go to, and that, I expect, will happen sooner rather than later, and of course game development could also be a fantastic opportunity where it would make a real difference to how fast new developers can create games.”
