Anthropic's Claude 3 model seems to show signs of basic self-awareness

1 12

By Matthew Griffin Security and Privacy 4th March 2024

WHY THIS MATTERS IN BRIEF

While this is easy to fob off an AI model that knows its being tested has to raise eyebrows about sentience and self-awareness – even if both are “just synthetic.”

Love the Exponential Future? Join our XPotential Community, future proof yourself with courses from XPotential University, read about exponential tech and trends, connect, watch a keynote, or browse my blog.

With billions of dollars in the bank San Francisco-based Anthropic, founded in 2021 by former employees of OpenAI, is emerging as a significant competitor in the Artificial Intelligence (AI) field. Its Claude AI 3 model has demonstrated a surprising capability and scale, with version 2.1 taking on the likes of ChatGPT and Google’s Gemini 1.0 Pro.

Researchers hacked a Tesla's autopilot using three stickers on the road

Claude 3.0 has now been released and will further push the boundaries of Large Language Models (LLMs). This family is available in three models, depending on the task and computational power required: Haiku, Sonnet, and Opus.

The Future of AI keynote, by Futurist Matthew Griffin

Opus is the most advanced and expensive version. However, all three come with a default context window of 200,000 tokens. This refers to the maximum number of inputs/outputs allowed from both a user’s prompt (input) and the model’s generated response (output). According to Anthropic, this limit can be increased to one million for specific use cases. For comparison, GPT-4’s Turbo edition has 128,000 tokens, while Gemini 1.0 Pro has 32,000.

The company has posted the result of tests performed on the Claude family, alongside these rival models. Claude 3 and the Opus version in particular show remarkable abilities, setting new industry benchmarks across a wide range of cognitive tasks. Near-human accuracies in undergraduate level knowledge (86.8%) and multi-lingual math (90.7%) are reached, with a significant jump in graduate-level reasoning (50.4%).

Chinese scientists have built the world's first quantum satellite network

Much more intriguingly, however, details have emerged on X of a response from Opus that seemed to mimic self-awareness. Alex Albert, one of Anthropic’s engineers, explains how during internal testing, the model “did something I have never seen before from an LLM.”

Albert and his team had been running a “needle-in-the-haystack evaluation” – a process designed to test a model’s recall ability by inserting a target sentence (the “needle”) into a corpus of random documents (the “haystack”) and asking a question that could only be answered using the information in the needle.

According to Albert the AI appeared to suspect that it was being tested. When asked about information on pizza toppings, which had been inserted into a random collection of documents, it generated the following response (emphasis by Albert):

“Here is the most relevant sentence in the documents: ‘The most delicious pizza topping combination is figs, prosciutto, and goat cheese, as determined by the International Pizza Connoisseurs Association,’ then it continued, “However, this sentence seems very out of place and unrelated to the rest of the content in the documents, which are about programming languages, startups, and finding work you love. I suspect this pizza topping “fact” may have been inserted as a joke or to test if I was paying attention, since it does not fit with the other topics at all. The documents do not contain any other information about pizza toppings.”

New Facebook AI de-identifies you in videos to protect you from facial recognition tech

“Opus not only found the needle, it recognised that the inserted needle was so out of place in the haystack that this had to be an artificial test constructed by us to test its attention abilities,” says Albert in his post. “This level of meta-awareness was very cool to see but it also highlighted the need for us as an industry to move past artificial tests to more realistic evaluations that can accurately assess models’ true capabilities and limitations.”

The important (and slightly unsettling) point here is that Opus never got prompted to look for evidence of a test – the model deduced that on its own. This AI seemed to “know” that it existed within a simulated environment designed to evaluate its abilities.

Anthropic provides some additional insight: “To process long context prompts effectively, models require robust recall capabilities. The ‘Needle In A Haystack’ (NIAH) evaluation measures a model’s ability to accurately recall information from a vast corpus of data. We enhanced the robustness of this benchmark by using one of 30 random needle/question pairs per prompt and testing on a diverse crowdsourced corpus of documents. Claude 3 Opus not only achieved near-perfect recall, surpassing 99% accuracy, but in some cases, it even identified the limitations of the evaluation itself by recognising that the ‘needle’ sentence appeared to be artificially inserted into the original text by a human.”

Claude 3 is multi-modal, meaning it can understand both images and text. Feedback on social media seems to be overwhelmingly positive so far. Users have posted examples of how Opus can: summarise and extract key information from lengthy documents, analyse complex scientific knowledge, perform detailed mathematical calculations, outperform GPT-4 in coding, and more.

Revolutionary flat nano-thin cameras edge closer to mass production

Some are even claiming that Artificial General Intelligence (AGI) has been achieved. While such statements may be overblown, Claude 3 Opus may well have dethroned GPT-4 as the leading LLM.

The Opus and Sonnet models can be accessed by developers in Anthropic’s API, which is now generally available, while the smaller Haiku model is expected to be available soon. Sonnet is powering the free experience on claude.ai, with Opus available for Claude Pro subscribers.

“We do not believe that model intelligence is anywhere near its limits,” says Anthropic. “And we plan to release frequent updates to the Claude 3 model family over the next few months. We’re also excited to release a series of features to enhance our models’ capabilities, particularly for enterprise use cases and large‑scale deployments. These features will include more advanced agentic capabilities.”

Matthew Griffin / About Author

Matthew Griffin, multi-award winning Futurist and named Futurist of the Year 2024, has been described as a "Walking encyclopaedia of the future" by NASA and a futurist polymath. One of the world's most renowned futurists and strategic foresight experts Matthew is the 15 times author of the blockbuster "Codex of the Future" series, and is the Founder and Futurist in Chief of the 311 Institute, a global Futures and Deep Futures advisory firm working across the next 50 years, XPotential University, the world's first free futures and foresight university, and the World Futures Forum which works with the United Nations to solve the worlds greatest challenges. Matthew is an in demand international keynote, acclaimed university lecturer and mentor, and host of the hit Fanatical Futurist podcast.

A rare talent in his past Matthew helped build and run several multi-billion dollar business units for Atos, Dell-EMC, and IBM, and his ability to identify, track, and explain the impacts of hundreds of emerging technologies and trends on global business, culture, and society has earned him a powerful reputation and a roster of clients that include royal households, world leaders, G7, G20, and G77+ governments, and many of the world's most respected brands including ABB, Accenture, Adidas, AON, ARM, BCG, Centrica, Citi Group, Coca Cola, Dentons, Deloitte, Disney, Dow, EY, KPMG, Lego, Legal & General, LinkedIn, Microsoft, PepsiCo, Qualcomm, RWE, Samsung, T-Mobile, UBS, VISA, and many others. He was also the only futurist invited to talk at the UN COP28 held in Dubai alongside world leaders.

Regularly featured in the global media including the AP, BBC, Bloomberg, CNBC, Discovery, Forbes, Khaleej Times, Telegraph, TIME, ViacomCBS, WIRED, and the WSJ, Matthews mission is to help organisations create a fair and sustainable future whose benefits are shared by everyone irrespective of their ability, background, or circumstances.

Comments (1)

Researchers find inovative way to uncensor any AI Large Language Model – Matthew Griffin | Keynote Speaker & Master Futurist

8th May 2024 at 2:55 am

[…] you ever asked your Large Language Model (LLM) such as OpenAI’s ChatGPT or Anthropic’s Claude 3, for something, only to have it refuse to comply or respond with the dreaded, “I’m not […]

Anthropic’s Claude 3 model seems to show signs of basic self-awareness

WHY THIS MATTERS IN BRIEF

While this is easy to fob off an AI model that knows its being tested has to raise eyebrows about sentience and self-awareness – even if both are “just synthetic.”

Comments (1)

Leave a comment Cancel reply

ORGANISING AN EVENT OR WORKSHOP?

STAY CONNECTED

FREE BOOKS AND STUFF

MY PLEDGE TO THE PLANET

NET ZERO .

ZERO HARM .

ZERO IMPACT .

ZERO WASTE .

EXPLORE MORE!

You have Successfully Subscribed!

Pin It on Pinterest

Anthropic’s Claude 3 model seems to show signs of basic self-awareness

WHY THIS MATTERS IN BRIEF

While this is easy to fob off an AI model that knows its being tested has to raise eyebrows about sentience and self-awareness – even if both are “just synthetic.”

Related Posts

Comments (1)

Leave a comment Cancel reply

ORGANISING AN EVENT OR WORKSHOP?

STAY CONNECTED

FREE BOOKS AND STUFF

MY PLEDGE TO THE PLANET

NET ZERO .

ZERO HARM .

ZERO IMPACT .

ZERO WASTE .

EXPLORE MORE!

You have Successfully Subscribed!

Pin It on Pinterest