Edtech company Udacity uses deepfake tech to create educational videos automatically

By Matthew Griffin, Intelligence and the Senses, 2nd August 2019

WHY THIS MATTERS IN BRIEF

Most online educational courses are text and graphics based, but now Udacity is using deepfake tech to automatically generate educational videos from the content.

Producing content for Massive Open Online Course (MOOC) platforms like Coursera and edX might be academically rewarding, and potentially lucrative, but it’s also hugely time consuming, particularly where videos are involved. So Udacity, following in the footsteps of Soul Machines, who recently created “Will”, the world’s first avatar teacher who has already taught over 250,000 children about energy, has been looking into ways to get Artificial Intelligence (AI) to produce the videos automatically, something that would be a game changer in the academic world.

After all, professional level lecture clips require not only a veritable studio’s worth of equipment, but also significant resources to transfer, edit, and upload footage of each lesson. That’s why research scientists at Udacity, an online learning platform with over 100,000 courses, are investigating a new machine learning framework that automatically generates lecture videos from audio narration alone. For now at least, the tech they’re developing isn’t a million miles away from other so called synthetic content AI generators, like the ones I’ve discussed many times before that are being used to create DeepFakes and next generation Text to Video content, among many other things.

They claim in a preprint paper (“LumièreNet: Lecture Video Synthesis from Audio”) on arXiv.org that their AI system, called LumièreNet, “can synthesise footage of any length by directly mapping between audio and corresponding visuals.”

“In current video production, an AI that semi (or fully) automates lecture video production at scale would be highly valuable to enable agile video content development (rather than reshooting each new video),” wrote the paper’s co-authors. “To [this] end, we propose a new method to synthesise lecture videos from any length of audio narration: … A simple, modular, and fully neural network-based [AI] which produces an instructor’s full pose lecture video given the audio narration input, which has not been addressed before from deep learning perspective, as far as we know.”

The researchers’ model has a pose estimation component, not too dissimilar from Nvidia’s latest GauGAN AI or the so called full body DeepFake tech that recently came out of Japan, which synthesises body figure images from video frames extracted from a training data set, chiefly by detecting and localising major body points to create detailed surface-based human body representations.

Meanwhile, a second module in the model, a bidirectional recurrent long short-term memory (BLSTM) network that processes the sequence in both directions so that each output reflects the context before and after it, takes audio features as input and attempts to learn the relationship between them and the visual elements.
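
To give a feel for what “bidirectional” means here, the following is a minimal sketch and not the researchers’ code. It swaps the LSTM cell for a much simpler decaying-average recurrence, and the function names, the `decay` parameter, and the audio feature values are all hypothetical. It only shows that a left-to-right pass cannot see later inputs, while pairing it with a right-to-left pass lets every time step reflect the whole clip.

```python
from typing import List, Tuple


def run_direction(inputs: List[float], decay: float = 0.5) -> List[float]:
    """Toy recurrent pass (not an LSTM): each state mixes the previous
    state with the current input, so it only reflects inputs seen so far."""
    state = 0.0
    states = []
    for x in inputs:
        state = decay * state + (1.0 - decay) * x
        states.append(state)
    return states


def bidirectional(inputs: List[float], decay: float = 0.5) -> List[Tuple[float, float]]:
    """Pair a left-to-right state with a right-to-left state per time step."""
    forward = run_direction(inputs, decay)
    # Run over the reversed sequence, then flip back to the original order.
    backward = run_direction(inputs[::-1], decay)[::-1]
    return list(zip(forward, backward))


# Hypothetical per-frame audio features; only the last frame differs.
audio = [0.1, 0.4, 0.9, 0.3]
changed = [0.1, 0.4, 0.9, 0.8]

# The forward-only state at frame 0 ignores the later change...
forward_only_same = run_direction(audio)[0] == run_direction(changed)[0]
# ...but the bidirectional state at frame 0 picks it up.
bidirectional_same = bidirectional(audio)[0] == bidirectional(changed)[0]

print(forward_only_same, bidirectional_same)  # True False
```

In a real BLSTM the two passes use learned gates rather than a fixed decay, but the way the two directions are combined at each time step follows the same idea.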

To test LumièreNet, the researchers filmed an instructor’s lecture video for around eight hours at Udacity’s in-house studio. This yielded roughly four hours of video and two narrations for training and validation. The researchers report that the trained AI system produces “convincing” clips with smooth body gestures and realistic hair, but note that its creations, two samples of which have been published, likely won’t fool most observers because the pose estimator can’t capture fine details like eye motion, lips, hair, and clothing. The synthesised lecturers rarely blink, and they tend to move their mouths unnaturally. Worse, their eyes sometimes look in different directions and their hands always appear oddly blurry.

The team posits that the addition of “face keypoints” (i.e., fine details) might lead to better synthesis, and they note that, fortunately, their system’s modular design allows each component to be trained and improved independently.
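
As a rough illustration of why a modular design helps, here is a small sketch under stated assumptions. It is not LumièreNet’s architecture: the `LecturePipeline` class, the two stage functions, and the string “frames” are hypothetical stand-ins for an audio-to-pose model and a pose-to-image renderer. The point is only that one stage can be swapped for an improved version without touching the other.

```python
from dataclasses import dataclass
from typing import Callable, List

AudioFeatures = List[float]
Pose = List[float]
Frame = str  # stand-in for an image


@dataclass
class LecturePipeline:
    """Two independent stages: audio -> poses, then pose -> frame."""
    audio_to_pose: Callable[[AudioFeatures], List[Pose]]
    pose_to_frame: Callable[[Pose], Frame]

    def render(self, audio: AudioFeatures) -> List[Frame]:
        return [self.pose_to_frame(p) for p in self.audio_to_pose(audio)]


def baseline_audio_to_pose(audio: AudioFeatures) -> List[Pose]:
    # Hypothetical stand-in: one two-value pose per audio feature.
    return [[a, 1.0 - a] for a in audio]


def baseline_renderer(pose: Pose) -> Frame:
    return "frame(" + ", ".join(f"{v:.1f}" for v in pose) + ")"


def improved_renderer(pose: Pose) -> Frame:
    # Hypothetical upgrade, e.g. a renderer that also uses face keypoints.
    return baseline_renderer(pose) + "+face"


pipeline = LecturePipeline(baseline_audio_to_pose, baseline_renderer)
frames_v1 = pipeline.render([0.2, 0.7])

# Swap only the rendering stage; the audio-to-pose stage is untouched.
pipeline.pose_to_frame = improved_renderer
frames_v2 = pipeline.render([0.2, 0.7])

print(frames_v1)
print(frames_v2)
```

Because each stage only depends on the other’s input and output types, a better renderer (or a better audio-to-pose model) can be trained separately and dropped in.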

“[M]any future directions are feasible to explore,” wrote the researchers. “Even though our approach is developed with primary intents to support agile video content development, which is crucial in current online MOOC courses, we acknowledge there could be potential misuse of the technologies … We hope that our results will catalyse new developments of deep learning technologies for commercial video content production.”

Matthew Griffin / About Author

Matthew Griffin, multi-award winning Futurist and named Futurist of the Year 2024, has been described as a "Walking encyclopaedia of the future" by NASA and a futurist polymath. One of the world's most renowned futurists and strategic foresight experts, Matthew is the 15-time author of the blockbuster "Codex of the Future" series, and is the Founder and Futurist in Chief of the 311 Institute, a global Futures and Deep Futures advisory firm working across the next 50 years, XPotential University, the world's first free futures and foresight university, and the World Futures Forum, which works with the United Nations to solve the world's greatest challenges. Matthew is an in-demand international keynote speaker, acclaimed university lecturer and mentor, and host of the hit Fanatical Futurist podcast.

A rare talent, in his past Matthew helped build and run several multi-billion dollar business units for Atos, Dell-EMC, and IBM, and his ability to identify, track, and explain the impacts of hundreds of emerging technologies and trends on global business, culture, and society has earned him a powerful reputation and a roster of clients that includes royal households, world leaders, G7, G20, and G77+ governments, and many of the world's most respected brands including ABB, Accenture, Adidas, AON, ARM, BCG, Centrica, Citi Group, Coca Cola, Dentons, Deloitte, Disney, Dow, EY, KPMG, Lego, Legal & General, LinkedIn, Microsoft, PepsiCo, Qualcomm, RWE, Samsung, T-Mobile, UBS, VISA, and many others. He was also the only futurist invited to talk at the UN COP28 held in Dubai alongside world leaders.

Regularly featured in the global media including the AP, BBC, Bloomberg, CNBC, Discovery, Forbes, Khaleej Times, Telegraph, TIME, ViacomCBS, WIRED, and the WSJ, Matthew's mission is to help organisations create a fair and sustainable future whose benefits are shared by everyone irrespective of their ability, background, or circumstances.
