This new AI uses both sight and sound to estimate depression

0 0

By Matthew Griffin Intelligence and the Senses 22nd August 2019

WHY THIS MATTERS IN BRIEF

As mental health issues become more pronounced and more prominent in society researchers are trying to find new ways to identify people who suffer from it.

Interested in the future and want to experience even more?! Watch a keynote, grab a free book, read thousands of articles, and connect!

Detecting emotional arousal from the sound of someone’s voice is one thing — startups like Beyond Verbal, Affectiva, and MIT spinout Cogito are leveraging natural language processing to accomplish just that. But as robots and bots trained in psychology, such as Woebot who’s now helped millions of people, start appearing on the scene to help patients in new ways, there’s an argument to be made that speech alone isn’t enough to diagnose someone with depression – let alone judge its severity.

Enter new research from scientists at the Indian Institute of Technology Patna and the University of Caen Normandy, which examines how non-verbal signs and visuals can drastically improve estimations of depression level.

Google DeepMind's new business unit to assess AI's impact on society

“The steadily increasing global burden of depression and mental illness acts as an impetus for the development of more advanced, personalized and automatic technologies that aid in its detection,” the paper’s authors wrote. “Depression detection is a challenging problem as many of its symptoms are covert.”

The researchers encoded seven modalities — things like downward angling of the head, eye gaze, the duration and intensity of smiles, and self-touches, along with text and verbal cues — which they fed to a machine learning model that fused them together into vectors, or mathematical representations. These fused vectors were then passed onto a second system that predicted the severity of depression based on the Personal Health Questionnaire Depression Scale (PHQ-8), a diagnostic test often employed in large clinical psychology studies.

Europe's new Artificial Intelligence Act can demand AI models are retrained and deleted

To train the various systems, the researchers tapped AIC-WOZ, a depression data set that’s part of a larger corpus — the Distress Analysis Interview Corpus — containing annotated audio snippets, video recordings, and questionnaire responses of 189 clinical interviews supporting the diagnosis of psychological conditions like anxiety, depression, and post-traumatic stress disorder. Each sample contained an enormous amount of data, including a raw audio file, a file containing the coordinates of 68 facial “landmarks” of the interviewee, complete with time stamps, confidence scores, and detection success flags, two files containing head pose and eye gaze features of the participant, a transcript file of the interview, and more.

After several pre-processing steps and model training, the team compared the results of the AI systems using three metrics – Root Mean Squared Error (RMSE), Mean Absolute Error (MAE), and Explained Variance Score (EVS). They report that the fusion of the three modalities — acoustic, text, and visual — helped in giving the “most accurate” estimation of depression level, outperforming the previous state of the art systems by 7.17% on RMSE and 8.08% on MAE.

The "world's most dangerous AI" is now helping automate coding

In the future, they plan to study recent multitask learning architectures and “dig deeper” into novel representations of text data, and if their work bears fruit it’d be a promising development for the more than 300 million people now living with depression — a number that’s sadly on the rise.

Source: arVix

Matthew Griffin / About Author

Matthew Griffin, multi-award winning Futurist and named Futurist of the Year 2024, has been described as a "Walking encyclopaedia of the future" by NASA and a futurist polymath. One of the world's most renowned futurists and strategic foresight experts Matthew is the 15 times author of the blockbuster "Codex of the Future" series, and is the Founder and Futurist in Chief of the 311 Institute, a global Futures and Deep Futures advisory firm working across the next 50 years, XPotential University, the world's first free futures and foresight university, and the World Futures Forum which works with the United Nations to solve the worlds greatest challenges. Matthew is an in demand international keynote, acclaimed university lecturer and mentor, and host of the hit Fanatical Futurist podcast.

A rare talent in his past Matthew helped build and run several multi-billion dollar business units for Atos, Dell-EMC, and IBM, and his ability to identify, track, and explain the impacts of hundreds of emerging technologies and trends on global business, culture, and society has earned him a powerful reputation and a roster of clients that include royal households, world leaders, G7, G20, and G77+ governments, and many of the world's most respected brands including ABB, Accenture, Adidas, AON, ARM, BCG, Centrica, Citi Group, Coca Cola, Dentons, Deloitte, Disney, Dow, EY, KPMG, Lego, Legal & General, LinkedIn, Microsoft, PepsiCo, Qualcomm, RWE, Samsung, T-Mobile, UBS, VISA, and many others. He was also the only futurist invited to talk at the UN COP28 held in Dubai alongside world leaders.

Regularly featured in the global media including the AP, BBC, Bloomberg, CNBC, Discovery, Forbes, Khaleej Times, Telegraph, TIME, ViacomCBS, WIRED, and the WSJ, Matthews mission is to help organisations create a fair and sustainable future whose benefits are shared by everyone irrespective of their ability, background, or circumstances.