Together, the teams solved a 50-year-old grand challenge in biology, and because proteins are at the heart of most medications, they may also have seeded a whole new era of drug development. For the first time, we have unprecedented insight into the protein engines of our cells, many of which had remained impervious to traditional lab techniques.
Yet one glaring detail was missing. Proteins don’t operate alone. They often associate into complexes – small groups that interact to carry out critical tasks in our cells and bodies, and understanding how these work is crucial to unlocking even more mysteries and helping create even more powerful new healthcare treatments. So this month the UW team upped their game.
Tapping into both AlphaFold and RoseTTAFold, they tweaked the programs to predict which proteins are likely to tag-team and sketched up the resulting complexes into a 3D models.
Using AI, the team predicted hundreds of complexes many of which are entirely new that regulate DNA repair, govern the cell’s digestive system, and perform other critical biological functions. These under-the-hood insights could impact the next generation of DNA editors and spur new treatments for neurodegenerative disorders or anti-aging therapies.
“It’s a really cool result,” said Dr. Michael Snyder at Stanford University, who was not involved in the study, to Science.
Like a compass, the results can guide experimental scientists as they test the predictions and search for new insights into how our cells grow, age, die, malfunction, and reproduce. Several predictions further highlighted how our cells absorb external molecules—a powerful piece of information that could help us coerce normally reluctant cells to gulp up medications.
“It…gives you a lot of potential new drug targets,” said study author Dr. Qian Cong at the University of Texas Southwestern Medical Center.
Our bodies are governed by proteins, each of which intricately folds into 3D shapes. Like unique Lego bricks, these shapes allow the proteins to combine into larger structures, which in turn conduct the biological processes that propel life.
Too abstract? An example: when cells live out their usual lifespan, they go through a process called apoptosis – in Greek, the falling of the leaves – in which the cell gently falls apart without disturbing its neighbours by leaking toxic chemicals. The entire process is a cascade of protein-protein interactions. One protein grabs onto another protein to activate it. The now-activated protein is subsequently released to stir up the next protein in the chain, and so on, eventually causing the aging or diseased cell to sacrifice itself.
Another example: in neurons during learning, synapses (the hubs that connect brain cells) call upon a myriad of proteins that form a complex together. This complex, in turn, spurs the neuron’s DNA to make proteins that etch the new memory into the brain.
“Everything in biology works in complexes. So, knowing who works with who is critical,” said Snyder.
For decades, scientists have relied on painfully slow processes to figure out those interactions. One approach is computational: map out a protein’s structure down to the atomic level and predict “hot spots” that might interact with another protein. Another is experimental: using both biological lab prowess and physics ingenuity, scientists can isolate protein complexes from cells – like sugar precipitating from lemonade when there’s too much of it – and use specialized equipment to analyze the proteins. It’s tiresome, expensive, and often plagued with errors.
Deep learning is now shining light on the whole enterprise.
The main idea is deceptively simple. Proteins are made of twisting strands and sheets of a single line of amino acids, like beads strung onto a tangled but semi-predictable mess of yarn. Deep learning can parse how the yarn folds into 3D shapes based on the structure of the amino acid “beads” alone.
Last year, DeepMind and a team from UW led by Dr. David Baker both took a crack at the problem. Without knowing anything else about a protein, the teams’ algorithms, AlphaFold2 and RoseTTaFold, were able to churn out thousands of protein structures. Though both were impressive, compared to AlphaFold2, Baker’s AI wasn’t as accurate for single-protein predictions. But where RoseTTAFold shone was in predicting proteins with multiple sub-units—in essence, a single protein made up of a handful of structures, each physically grabbing onto the next. It’s a perfect jumping-off point for diving into protein handshakes.
At the time, the AI only worked on proteins in simple creatures, like bacteria. In the new study, Baker’s team focused on a more complicated organism – the common yeast, which has a cellular structure similar to that of humans. The choice of focusing on yeast proteins was deliberate: as a lab favorite, its genome is relatively small, and there’s a “gold standard” set of protein interactions to test out the updated algorithm.
Compared to bacteria, which the older AI had tackled, yeast had a far more complicated system for translating its DNA into proteins. Each step added noise. To get around the hiccup, the team used an evolutionary approach. If a protein-protein interaction is important for biology, they reasoned, then the “hands” – the protein interface – where they grab onto each other should change together as species evolve to maintain the interaction.
They compared the amino acid sequences – 20 “letters” total, compared to DNA’s 4 – of over 6,000 yeast proteins to nearly 6,500 proteins in other similar species. Like cracking a cipher, this allowed the team to home in on the amino acids that change in lockstep. They then traced the “letters” to their protein owners and hypothesized that these owners likely formed a complex.
Using both AlphaFold and RoseTTAFold, the team next solved the 3D structure of these protein candidates. Surprisingly, each algorithm on its own struggled in performance and power consumption. But by tag-teaming, with RoseTTAFold screening protein pairs first, followed by AlphaFold, they achieved “excellent performance,” the team said, with a precision of 95 percent for the gold standard set.
They next expanded their test to over eight million co-evolved yeast protein pairs. Together, the new algorithm found over 1,500 pairs likely to interact, and drew up 3D models for about 800 that hadn’t previously been characterized—that is, about half.
The success rate is a triumph for biology. Digging deeper, the team found that most of the newly predicted complexes and interactions “play roles in almost all key processes” and “provide broad insights into biological function.”
Among the AI-predicted complexes are those that control DNA repair after damage, a process dubbed homologous recombination. Recombination is the cellular machinery that CRISPR and its variants tap into. Understanding the protein members and complexes involved could potentially lead to new avenues for gene editing.
Other complexes are involved in the cell’s recycling mechanism, which often goes awry in diseases involving neurodegeneration. Over time, toxic proteins build up and overwhelm vulnerable neurons, causing them to malfunction. Other complexes include those needed for cells to swallow nutrients and medication, those that unwind chromosomes—which house DNA—during reproduction, and those that translate RNA into proteins.
Like any simulation, the results are only hypotheses for now. But they offer unprecedented clues, at a large scale, into potentially new complexes and functions that escaped previous study. These predictions are a great example of the promise of 3D structures, said Dr. John Jumper, one of the lead developers for AlphaFold. Just last month, his team at DeepMind posted a pre-print on AlphaFold-Multimer, an algorithmic variant that predicts protein complexes at about 67 percent accuracy in nearly 4,500 test cases.
The study is just the start. “As with any new method, it is important when interpreting the results to keep in mind the limitations of the approach,” the team warned. For example, the AI doesn’t work as well for protein complexes that only transiently interact or those that have extremely complicated structures. The results have so far only been tested in yeast protein complexes, and may miss those restricted to another species. The AI also isn’t very confident in its predictions—tests show confidence levels of about 70 percent for each complex.
Still, that’s the thrill. Thanks to deep learning, we’re cracking the protein complexes underpinning biology at a massive scale. “It’s a really exciting time,” said Baker.
Matthew Griffin, described as “The Adviser behind the Advisers” and a “Young Kurzweil,” is the founder and CEO of the World Futures Forum and the 311 Institute, a global Futures and Deep Futures consultancy working between the dates of 2020 to 2070, and is an award winning futurist, and author of “Codex of the Future” series.
Regularly featured in the global media, including AP, BBC, Bloomberg, CNBC, Discovery, RT, Viacom, and WIRED, Matthew’s ability to identify, track, and explain the impacts of hundreds of revolutionary emerging technologies on global culture, industry and society, is unparalleled. Recognised for the past six years as one of the world’s foremost futurists, innovation and strategy experts Matthew is an international speaker who helps governments, investors, multi-nationals and regulators around the world envision, build and lead an inclusive, sustainable future.
A rare talent Matthew’s recent work includes mentoring Lunar XPrize teams, re-envisioning global education and training with the G20, and helping the world’s largest organisations envision and ideate the future of their products and services, industries, and countries.
Matthew's clients include three Prime Ministers and several governments, including the G7, Accenture, Aon, Bain & Co, BCG, Credit Suisse, Dell EMC, Dentons, Deloitte, E&Y, GEMS, Huawei, JPMorgan Chase, KPMG, Lego, McKinsey, PWC, Qualcomm, SAP, Samsung, Sopra Steria, T-Mobile, and many more.
FANATICALFUTURIST PODCAST! Hear about ALL the latest futures news and breakthroughs!SUBSCRIBE
1000's of articles about the exponential future, 1000's of pages of insights, 1000's of videos, and 100's of exponential technologies: Get The Email from 311, your no-nonsense briefing on all the biggest stories in exponential technology and science.