Skip to main content Scroll Top

Scientists trained an AI model using an IBM Quantum Computer and it rocked

WHY THIS MATTERS IN BRIEF

Quantum hardware could shrink the runaway cost of ever-larger AI models instead of forcing more brute-force scaling.

 

Matthew Griffin is the World’s #1 Futurist Keynote Speaker and Global Advisor for the G7 and Fortune 500, specialising in exponential disruption across 100 countries. Book a Keynote or Advisory SessionJoin 1M+ followers on YouTube and explore his 15-book Codex of the Future series.

 


 

Researchers have developed a method to reduce uncertainty in Artificial Intelligence (AI) systems by tapping into the power of quantum computers. They say their work represents the first demonstration of "quantum enhancement" in a production-scale, pretrained Large Language Model (LLM). One of the key metrics used to measure the quality and capabilities of AI systems such as Anthropic's Claude, OpenAI's ChatGPT and similar services is a unit known as "perplexity" – often expressed as PPL. This measures a system's general ability to properly predict the next word in a sentence or sequence of words.

 

RELATED
World first as scientists manage to make carbon fiber using plants, not oil

 

A system with a low PPL is considered better at predicting the next word, while one with a high PPL is to produce erratic outputs. There are multiple methods to reduce PPL in large AI models, including fine-tuning, training on larger datasets, and adding parameters.

GPT-5.5, for example, is to have somewhere between 2 trillion and 5 trillion parameters. In all standard LLMs, each parameter takes up space in the system’s memory, meaning that as these models become larger and more capable, they require increasingly larger infrastructure like the massive $500 Billion US Stargate Project.

But scientists at Multiverse Computing have found an alternative to scaling up the infrastructure around AI. In a new study uploaded May 7 to the arXiv preprint database, they proposed that a relatively small boost in the number of parameters in an AI model can lead to a significant reduction in perplexity when running them using quantum circuit blocks – the fundamental units of quantum computations.

 

RELATED
Researchers built a breakthrough quantum transistor using just a single atom

 

"The results reported here constitute, to our knowledge, the first demonstration of end-to-end quantum enhancement of a production-scale, widely-deployed LLM on real superconducting quantum hardware for autoregressive language generation," the scientists wrote in the study.

"Their significance lies not in the magnitude of the perplexity improvements – which will grow with hardware fidelity and qubit count – but in the fact that they exist at all."

In the study, the researchers created and executed quantum circuit blocks called Cayley-parameterised unitary adapters (CUAs). Cayley parameters are a set of mathematical matrices that can be "trained" by weighting them towards specific matrix components. They’re inserted into a specific layer of an LLM for training on a classical computer.

 

The Future of Cyber Security and Quantum Technology | Sapphire, UK | Matt Griffin | Panel Session, by Futurist Keynote Speaker Matthew Griffin

Book a Keynote with Matthew

The LLM's original parameters are frozen during this process so that they remain unchanged. The new hybrid system containing both the trained Cayley parameters and the original model parameters is then executed on the 156-qubit IBM Quantum System Two superconducting Quantum Processing Unit (QPU).

 

RELATED
ORNL ushers in the exascale computing era with the world's first exaflop supercomputer

 

The resulting Quantum-Classical hybrid model lowered the perplexity of Llama 3.1 8B – an 8 billion-parameter model created by Meta – by 1.4% while adding only 6,000 parameters (a 0.000075% increase).

, a senior research scientist at Multiverse Computing and first author of the study, described the new technique as a proof of concept for further development. Speaking with reporters he explained that quantum computers can provide some advantages over a strictly classical paradigm – but they come with a trade-off.

"The first thing you do is encode [the parameters] in the quantum computer. Once you have encoded the state, you are ready to apply the Cayley unitary adapter, which we train classically and then implement in quantum hardware," he said.

He explained that these adapters are small, which is important because the bigger the circuit, the more "noise" there is. Noise generated during quantum computations – which can come from interactions with nearby qubits, disturbances from the Earth’s magnetic field, radiation from Wi-Fi or phones, and even cosmic rays – may cause errors and render outputs and measurements meaningless.

 

RELATED
Re-igniting Moore's Law, MIT's radical Carbon Nanotube computer leaps from lab to factory

 

As in much of quantum computing research, quantum error correction is one of the main areas of interest. In this study, mitigating errors caused by noise was the primary obstacle Aizpurua and the Multiverse Computing team were attempting to overcome.

The scientists loaded the classically trained Cayley unitary adapters into the quantum system before end-to-end inference – the phase of AI use where the model executes a response – occurred. Then, the hybrid outputs could be measured against the normal non-quantum-enhanced results.

The researchers discovered that the hybrid model could answer several questions correctly that the base Llama model could not.

In one astronomy question, the original model incorrectly selected an answer indicating that only Saturn has Jovian planet rings. However, the CUA-enhanced model correctly identified all Jovian planets as ringed.

 

RELATED
UC Berkeley's atom thin display paves the way for truly invisible displays

 

In another example, the original model incorrectly answered a biology question on the population-genetic consequences of gene flow, selecting “Hardy–Weinberg disruption” while the CUA-enhanced model correctly identified increased genetic homogeneity.

"So here we can see an example in which an AI model doesn't answer correctly, and then you add something [quantum enhancement] and suddenly it answers correctly," Aizpurua said.

This result, coupled with the measured 1.4% reduction in perplexity, demonstrates a clear path forward for developing quantum hybrid AI systems, Aizpurua said. He added that this research could help researchers overcome current development bottlenecks where systems are constrained by developers' ability to scale classical computing infrastructure.

Future research would involve developing methods by which the entire quantum circuit, not just the Cayley unitary adapters, is directly encoded, Aizpurua said. This would ostensibly result in an LLM capable of achieving lower perplexity and higher accuracy, using fewer parameters than any purely classical method.

 

RELATED
This new water based computer chip brings Terahertz computing in reach

 

Ultimately, he said, the goal of the research is to produce higher-quality AI systems capable of reaching Quantum Superiority, a term that describes a quantum-based computer system capable of performing feats unachievable by any classical computer.

 


 

Does this mean quantum computers can now make AI models smarter?
Not yet — the improvement was tiny, but it is the first proof that real quantum hardware can lower a deployed model’s error rate at all, and the researchers expect the gains to grow as qubit counts and fidelity improve.

Related Posts

Leave a comment

Pin It on Pinterest

Share This