Scientists trained an AI model using an IBM Quantum Computer and it rocked

By Matthew Griffin Computing 23rd April 2026

WHY THIS MATTERS IN BRIEF

Quantum hardware could shrink the runaway cost of ever-larger AI models instead of forcing more brute-force scaling.

Matthew Griffin is the World’s #1 Futurist Keynote Speaker and Global Advisor for the G7 and Fortune 500, specialising in exponential disruption across 100 countries. Book a Keynote or Advisory Session — Join 1M+ followers on YouTube and explore his 15-book Codex of the Future series.

Researchers have developed a method to reduce uncertainty in Artificial Intelligence (AI) systems by tapping into the power of quantum computers. They say their work represents the first demonstration of “quantum enhancement” in a production-scale, pretrained Large Language Model (LLM). One of the key metrics used to measure the quality and capabilities of AI systems such as Anthropic‘s Claude, OpenAI‘s ChatGPT and similar services is a unit known as “perplexity” – often expressed as PPL. This measures a system’s general ability to properly predict the next word in a sentence or sequence of words.

Ultra energy efficient molecular computer breaks records and blurs boundaries

A system with a low PPL is considered better at predicting the next word, while one with a high PPL is to produce erratic outputs. There are multiple methods to reduce PPL in large AI models, including fine-tuning, training on larger datasets, and adding parameters.

GPT-5.5, for example, is to have somewhere between 2 trillion and 5 trillion parameters. In all standard LLMs, each parameter takes up space in the system’s memory, meaning that as these models become larger and more capable, they require increasingly larger infrastructure like the massive $500 Billion US Stargate Project.

But scientists at Multiverse Computing have found an alternative to scaling up the infrastructure around AI. In a new study uploaded May 7 to the arXiv preprint database, they proposed that a relatively small boost in the number of parameters in an AI model can lead to a significant reduction in perplexity when running them using quantum circuit blocks – the fundamental units of quantum computations.

The world's first Pneumatic computer works using pressure not electricity

“The results reported here constitute, to our knowledge, the first demonstration of end-to-end quantum enhancement of a production-scale, widely-deployed LLM on real superconducting quantum hardware for autoregressive language generation,” the scientists wrote in the study.

“Their significance lies not in the magnitude of the perplexity improvements – which will grow with hardware fidelity and qubit count – but in the fact that they exist at all.”

In the study, the researchers created and executed quantum circuit blocks called Cayley-parameterised unitary adapters (CUAs). Cayley parameters are a set of mathematical matrices that can be “trained” by weighting them towards specific matrix components. They’re inserted into a specific layer of an LLM for training on a classical computer.

The Future of Cyber Security and Quantum Technology Panel by Futurist Keynote Speaker Matthew Griffin

Book a Keynote with Matthew

The LLM’s original parameters are frozen during this process so that they remain unchanged. The new hybrid system containing both the trained Cayley parameters and the original model parameters is then executed on the 156-qubit IBM Quantum System Two superconducting Quantum Processing Unit (QPU).

QuirkLogic's new E-Ink whiteboard is a giant kindle for your office

The resulting Quantum-Classical hybrid model lowered the perplexity of Llama 3.1 8B – an 8 billion-parameter model created by Meta – by 1.4% while adding only 6,000 parameters (a 0.000075% increase).

, a senior research scientist at Multiverse Computing and first author of the study, described the new technique as a proof of concept for further development. Speaking with reporters he explained that quantum computers can provide some advantages over a strictly classical paradigm – but they come with a trade-off.

“The first thing you do is encode [the parameters] in the quantum computer. Once you have encoded the state, you are ready to apply the Cayley unitary adapter, which we train classically and then implement in quantum hardware,” he said.

He explained that these adapters are small, which is important because the bigger the circuit, the more “noise” there is. Noise generated during quantum computations – which can come from interactions with nearby qubits, disturbances from the Earth’s magnetic field, radiation from Wi-Fi or phones, and even cosmic rays – may cause errors and render outputs and measurements meaningless.

A fully autonomous robot dentist just completed its first human procedure

As in much of quantum computing research, quantum error correction is one of the main areas of interest. In this study, mitigating errors caused by noise was the primary obstacle Aizpurua and the Multiverse Computing team were attempting to overcome.

The scientists loaded the classically trained Cayley unitary adapters into the quantum system before end-to-end inference – the phase of AI use where the model executes a response – occurred. Then, the hybrid outputs could be measured against the normal non-quantum-enhanced results.

The researchers discovered that the hybrid model could answer several questions correctly that the base Llama model could not.

In one astronomy question, the original model incorrectly selected an answer indicating that only Saturn has Jovian planet rings. However, the CUA-enhanced model correctly identified all Jovian planets as ringed.

New hot qubits let quantum computers run near room temperature

In another example, the original model incorrectly answered a biology question on the population-genetic consequences of gene flow, selecting “Hardy–Weinberg disruption” while the CUA-enhanced model correctly identified increased genetic homogeneity.

“So here we can see an example in which an AI model doesn’t answer correctly, and then you add something [quantum enhancement] and suddenly it answers correctly,” Aizpurua said.

This result, coupled with the measured 1.4% reduction in perplexity, demonstrates a clear path forward for developing quantum hybrid AI systems, Aizpurua said. He added that this research could help researchers overcome current development bottlenecks where systems are constrained by developers’ ability to scale classical computing infrastructure.

Future research would involve developing methods by which the entire quantum circuit, not just the Cayley unitary adapters, is directly encoded, Aizpurua said. This would ostensibly result in an LLM capable of achieving lower perplexity and higher accuracy, using fewer parameters than any purely classical method.

This new Chinese photonic computer chip out performs Nvidia GPUs by 100x

Ultimately, he said, the goal of the research is to produce higher-quality AI systems capable of reaching Quantum Superiority, a term that describes a quantum-based computer system capable of performing feats unachievable by any classical computer.

Does this mean quantum computers can now make AI models smarter?
Not yet — the improvement was tiny, but it is the first proof that real quantum hardware can lower a deployed model’s error rate at all, and the researchers expect the gains to grow as qubit counts and fidelity improve.

Matthew Griffin / About Author

Matthew Griffin is a multi-award winning Futurist and expert in Disruption and Innovation, Geopolitics, Leadership, and Technology, who NASA have described as a "walking encyclopaedia of the future" and a "futurist Polymath." 15-time best selling author of the "Codex of the Future" series, Matthew is the Founder and Futurist in Chief of the 311 Institute, a global Futures and Deep Futures advisory firm working with royal households, world leaders, G7, G20, and G77 governments, NGOs, and multi-national mid and mega cap firms to help them explore, shape, and lead the next 50 years of business and society.

An award-winning YouTube creator with over a million followers, with an unrivalled global reach and impact, Matthew is a highly sought-after international keynote speaker, lecturer, and mentor who collaborates with global leaders through the United Nations Alliance of Civilizations (UNAOC) and United Nations General Assembly (UNGA) to shape pivotal initiatives such as the UN’s AI for Humanity program, the United Nations Conference of the Parties (UN COP), and the World Economic Forum in Davos.

As the former Global Head of Cloud, National Security, and Enterprise Sales for companies including Atos, Dell-EMC, and IBM, Matthew has a proven track record of building multi-billion dollar business units and turning failing divisions into market leaders. His ability to identify, analyse, and communicate the implications of hundreds of emerging technologies and trends is unparalleled, and his insights are trusted by many of the world’s most respected organisations, including ABB, Accenture, Adidas, AON, ARM, BCG, Centrica, Citi, Coca-Cola, Dentons, Deloitte, Dow Jones, EY, Google, KPMG, Lego, Legal & General, LinkedIn, Microsoft, PepsiCo, Qualcomm, RWE, Samsung, Siemens AG and Siemens Energy, T-Mobile, UBS, VISA, Walmart, Workday, Worldpay and many others.

Regularly featured in the global media including the AP, BBC, Bloomberg, CNBC, Discovery, Forbes, Khaleej Times, Telegraph, TIME, ViacomCBS, WIRED, and the WSJ, Matthews mission is to help organisations create a fair and sustainable future whose benefits are shared by everyone irrespective of their ability, background, or circumstances.