New artificial intelligence (AI) tools have emerged that can help scientists discover previously unknown proteins and design entirely new ones. When used properly, this AI might enable more efficient vaccine development, speed up cancer drug research, and even help discover entirely new substances.
In 2020, DeepMind, an AI research institute, a subsidiary of Alphabet, launched AlphaFold, an AI tool that uses deep learning to solve one of the ‘difficulties’ in the biological world. Announcementthat surprised the world. A biological challenge that Alphafold solved was to accurately predict the shape of a protein. Proteins are fundamental to life, and understanding their shape is essential to utilize proteins. Earlier this summer, DeepMind announced that Alphafold might predict the shape of any protein known to the scientific community.
A research team at the University of Washington Two papers published inhereWow here), a new tool, ‘ProteinMPNN’, is expected to perfectly complement AlphaFold’s protein prediction technology.
Two papers published today show the latest examples of how deep learning is revolutionizing protein design by providing scientists with new research tools. Previously, researchers made proteins by slightly modifying proteins that exist in nature. However, protein MPNN will expand the domain of proteins from scratch while allowing researchers to design proteins from scratch.
David Baker, one of the scientists who participated in this paper and director of the Institute for Protein Design at the University of Washington, said, “In nature, everything related to life, from getting energy from sunlight to making molecules, is fundamentally solved. “Everything in biology happens with proteins,” he said.
“Proteins have evolved together over the course of evolution to solve problems faced by organisms as they evolved,” he said. However, we are currently facing a new problem, such as COVID-19. “If we can design proteins that are good at solving new problems just like they are good at solving old problems while evolving together in the course of life, that would be a really powerful weapon.”
Proteins are made up of hundreds to thousands of amino acids, which are linked together in long chains and fold into three-dimensional shapes. AlphaFold provides insight into how proteins will behave while helping researchers predict protein folding structures.
Protein MPNN will help solve the opposite problem. Protein MPNN can help researchers find amino acid sequences that fold into their shape, given that they already know the exact structure of the protein. The system uses a neural network that has been trained from a very large number of examples of amino acid sequences that fold into three-dimensional structures.
But researchers also have to solve another problem. To design a protein that has real-world applications, such as a new enzyme that digests plastics, researchers must first figure out which protein shape has that function.
To do this, researchers from Baker’s lab use two machine learning techniques the team calls ‘constrained hallucination’ and ‘in painting’. These two methods were last published in Articlehas been described in detail in
‘Limited hallucinations’ allows users to select a sequence with a specific function through a random search among all possible protein sequences. This ‘hallucination’ makes all possible protein structure searches possible, powered by machine learning’s ability to process vast datasets. There are a total of 20 types of amino acids, which can be combined into a huge number of different sequences.
“In nature, we only find a small fraction of possible protein sequences,” Baker said. So if you limit your search to sequences that exist in nature, you get nothing.”
‘Inpainting’ works like the autocomplete function of a word processor for protein structures and sequences. This method allows researchers to create entirely new proteins that have never been seen before in nature, such as gigantic ring-shaped structures.
Baker’s team is testing whether such ring-shaped structures can be used as components for tiny machines that operate on the nanoscale. In the future, these nanomachines might be used to unblock arteries, for example.
Using machine learning to design proteins in this way is ‘great work’, said Lynne Regan, professor of biochemistry and bioengineering at the University of Edinburgh.
Machine learning will make the entire process of designing proteins much faster and easier, and will allow researchers to create entirely new proteins and structures on a much larger scale. The software is over 200 times faster than the best tools ever and requires minimal user input, potentially lowering the barrier to entry for protein design.
“This software and other recent advances are changing the field of biomolecular structure prediction and design,” said Jeffrey Gray, professor of chemistry and biomolecular engineering at Johns Hopkins University.
“The implications of these changes for understanding biology, health and disease and designing new molecules to reduce human suffering are impressive,” Gray explains.
By combining deep learning tools developed in his lab with tools developed in Baker’s lab, Gray said he will better understand the immune system and immune-related diseases and use AI to design treatments.
“AlphaFold solves the problem of predicting protein structure and is a transformative role AI and machine learning can play in biology,” said Pushmeet Kohli, team leader for DeepMind’s AI for Science team. “Providing a new era in biology, MPNN, which designs proteins for specific tasks, is another proof of this paradigm shift.”
Current open source software repository GitHubThe protein MPNN, which is available free of charge from “Of course, the important thing is, ‘what are you going to design?'” Baker said. (By Melissa Heikkilä)