De novo protein design by inversion of the AlphaFold structure prediction network

CA Goverde, B Wolf, H Khakzad, S Rosset… - Protein …, 2023 - Wiley Online Library
CA Goverde, B Wolf, H Khakzad, S Rosset, BE Correia
Protein Science, 2023Wiley Online Library
De novo protein design enhances our understanding of the principles that govern protein
folding and interactions, and has the potential to revolutionize biotechnology through the
engineering of novel protein functionalities. Despite recent progress in computational design
strategies, de novo design of protein structures remains challenging, given the vast size of
the sequence‐structure space. AlphaFold2 (AF2), a state‐of‐the‐art neural network
architecture, achieved remarkable accuracy in predicting protein structures from amino acid …
Abstract
De novo protein design enhances our understanding of the principles that govern protein folding and interactions, and has the potential to revolutionize biotechnology through the engineering of novel protein functionalities. Despite recent progress in computational design strategies, de novo design of protein structures remains challenging, given the vast size of the sequence‐structure space. AlphaFold2 (AF2), a state‐of‐the‐art neural network architecture, achieved remarkable accuracy in predicting protein structures from amino acid sequences. This raises the question whether AF2 has learned the principles of protein folding sufficiently for de novo design. Here, we sought to answer this question by inverting the AF2 network, using the prediction weight set and a loss function to bias the generated sequences to adopt a target fold. Initial design trials resulted in de novo designs with an overrepresentation of hydrophobic residues on the protein surface compared to their natural protein family, requiring additional surface optimization. In silico validation of the designs showed protein structures with the correct fold, a hydrophilic surface and a densely packed hydrophobic core. In vitro validation showed that 7 out of 39 designs were folded and stable in solution with high melting temperatures. In summary, our design workflow solely based on AF2 does not seem to fully capture basic principles of de novo protein design, as observed in the protein surface's hydrophobic vs. hydrophilic patterning. However, with minimal post‐design intervention, these pipelines generated viable sequences as assessed experimental characterization. Thus, such pipelines show the potential to contribute to solving outstanding challenges in de novo protein design.
Wiley Online Library