Towards functional de novo designed proteins

https://doi.org/10.1016/j.cbpa.2019.06.011Get rights and content

Our ability to design completely de novo proteins is improving rapidly. This is true of all three main approaches to de novo protein design, which we define as: minimal, rational and computational design. Together, these have delivered a variety of protein scaffolds characterised to high resolution. This is truly impressive and a major advance from where the field was a decade or so ago. That all said, significant challenges in the field remain. Chief amongst these is the need to deliver functional de novo proteins. Such designs might include selective and/or tight binding of specified small molecules, or the catalysis of entirely new chemical transformations. We argue that, whilst progress is being made, solving such problems will require more than simply adding functional side chains to extant de novo structures. New approaches will be needed to target and build structure, stability and function simultaneously. Moreover, if we are to match the exquisite control and subtlety of natural proteins, design methods will have to incorporate multi-state modelling and dynamics. This will require more than black-box methodology, specifically increased understanding of protein conformational changes and dynamics will be needed.

Introduction

De novo protein design is said to have come of age [1]. From the early de novo proteins confirmed by high-resolution structures [2, 3, 4], the field has advanced rapidly with new scaffolds covering all-α [5,6,7], all-β [8], and mixed-α/β and α + β structural space [9,10,11]. In addition, side-chain constellations can be controlled exquisitely to introduce networks of hydrogen bonds throughout target structures [12], which, in turn, can improve the design and characterisation of de novo membrane proteins [13].

However, the ability to design functional de novo proteins from scratch, or to embellish existing de novo scaffolds with new functions, is still in its infancy. Herein, we use terms like ‘functional protein design’ for any stably folded de novo protein frameworks that incorporate interactions with small or large molecules, catalytic activity and so on. With notable exceptions—for example, reports of a functional ion transporter [14], a de novo designed catalytic triad [15••], and a highly efficient de novo enzyme [16••]— general design principles for functional protein design are sparse. Indeed, it may be that overoptimised de novo proteins, which are often hyperthermally stable, may not make good platforms for functional design, as it is known that dynamics play essential roles in ligand binding and catalysis [17, 18, 19].

Herein, we focus on truly de novo proteins rather than those achieved through protein engineering or redesign—that is, where functions are improved in or introduced to natural proteins. Of course, the latter have led to novel enzymes and ligand-binding proteins [20, 21, 22]. Whilst impressive, protein engineering relies on the inherent stability of natural scaffolds and their tolerance to mutation, and often uses the randomness of directed evolution to access the targeted function [23]. By contrast, de novo protein design removes the dependence on naturally evolved scaffolds, and has the potential for a deeper understanding of the contribution that every side chain makes towards the structure, stability and function of de novo proteins. Of course, this is an extremely challenging approach and its goals are ambitious.

Notable advances have also been made in introducing metal-binding and protein–protein interactions into de novo proteins (see recent reviews Refs. [24, 25, 26]). However, these pose different challenges to those laid out herein, and are only mentioned in passing in this review.

Section snippets

From minimal, through rational, to computational design

There is no single approach to protein design. However, the field can be split broadly into three different approaches (Figure 1). In minimal design binary patterns of polar (p) and hydrophobic (h) residues are used to define a target structure [27,28]. α-Helices lend themselves to this as they can be directed to fold and assemble with sequence patterns of the type hpphppp. As a result, the vast majority of work in this area has targeted four-helix bundles. Rational design goes a step further

Minimal design of functional four-helix bundles

DeGrado, Hecht and Dutton have pioneered the concepts of minimal and rational de novo protein design (reviewed extensively in Refs. [27,33,34]). In short, these combine chemical intuition about protein structure and basic sequence-to-structure relationships to deliver straightforward designed protein scaffolds (Figure 1). Key targets in these endeavours have been four-helix bundles, which involve the coalescence of amphipathic helices encoded by self-associating peptides or within single

Rational parametric design of functional assemblies

Rules-based or rational protein design and computational design do not have to be mutually exclusive (Figure 1). By incorporating design rules into computational design algorithms, the number of models that need to be built and scored can be reduced dramatically. Parametric design lends itself to this. Here, target protein folds are described mathematically with a minimal number of parameters. Not surprisingly given their simplicity and potential regularity, de novo four-helix bundles have been

Fragment-based computational design beyond protein engineering

As protein structures increase in complexity, more-sophisticated approaches are needed to access more-elaborate architectures. By harnessing the power of computers, thousands of designs can be generated and analysed in silico at scales beyond minimal and rational design. The most widespread approach is fragment-based design, which has three aspects: libraries of fragments or motifs are taken from structural databases, algorithms are developed to combine these to assemble target structures, and

Designing in function from the beginning

Most of the studies described above focus on the design of the de novo scaffold before functionalisation or improving an already established functional de novo protein. Though there are clear examples to the contrary [15••,31,39], extant de novo designed scaffolds may not have suitable cavities or sites for every targeted function, or the post-incorporation of the necessary functional residues may prove problematic. Indeed, for natural proteins it is well documented that small changes, even

Challenges ahead

The robust and routine design of functional de novo proteins remains an unsolved problem. For instance, to our knowledge, there are no examples of tight binding of small, polar molecules by de novo proteins. The change in approach in the last few years to incorporate the functional aspect of the design at an early stage shows clear potential, which we envisage will be become more evident as design algorithms improve. However, as stated above, accessing functional de novo proteins that work on a

Conflict of interest statement

Nothing declared.

References and recommended reading

Papers of particular interest, published within the period of review, have been highlighted as:

  • • of special interest

  • •• of outstanding interest

Acknowledgements

WMD, GGR and DNW were supported by a European Research Council Advanced Grant (340764). DNW is supported by the EPSRC and BBSRC through the BrisSynBio Synthetic Biology Research Centre (BB/L01386X1). DNW holds a Royal Society Wolfson Research Merit Award (WM140008). We thank Ross Anderson for providing a computational model of the C45 protein.

References (106)

  • C.W. Wood et al.

    ISAMBARD: an open-source computational environment for biomolecular analysis, modelling and design

    Bioinformatics

    (2017)
  • S. Studer et al.

    Evolution of a highly active and enantiospecific metalloenzyme from short peptides

    Science

    (2018)
  • Z. Li et al.

    Energy functions in de novo protein design: current challenges and future prospects

    Annu Rev Biophys

    (2013)
  • A. Leaver-Fay et al.

    Rosetta3: an object-oriented software suite for the simulation and design of macromolecules

    Methods Enzymol

    (2011)
  • M.J. Bick et al.

    Computational design of environmental sensors for the potent opioid fentanyl

    eLife

    (2017)
  • J. Banda-Vázquez et al.

    Redesign of LAOBP to bind novel L-amino acid ligands

    Protein Sci

    (2018)
  • X. Garrabou et al.

    Fast Knoevenagel condensations catalyzed by an artificial Schiff-base-forming enzyme

    J Am Chem Soc

    (2016)
  • S. Eisenbeis et al.

    Potential of fragment recombination for rational design of proteins

    J Am Chem Soc

    (2012)
  • C.J. Reedy et al.

    Heme protein assemblies

    Chem Rev

    (2004)
  • P.-S. Huang et al.

    The coming of age of de novo protein design

    Nature

    (2016)
  • C.P. Hill et al.

    Crystal-structure of α1 – implications for protein design

    Science

    (1990)
  • S. Nautiyal et al.

    Crystal structure of a designed, thermostable, heterotrimeric coiled coil

    Protein Sci

    (1999)
  • B. Kuhlman et al.

    Design of a novel globular protein fold with atomic-level accuracy

    Science

    (2003)
  • A.R. Thomson et al.

    Computational design of water-soluble α-helical barrels

    Science

    (2014)
  • G.G. Rhys et al.

    Maintaining and breaking symmetry in homomeric coiled-coil assemblies

    Nat Commun

    (2018)
  • S.-Q. Zhang et al.

    Designed peptides that assemble into cross-α amyloid-like structures

    Nat Chem Biol

    (2018)
  • E. Marcos et al.

    De novo design of a non-local β-sheet protein with high stability and accuracy

    Nat Struct Mol Biol

    (2018)
  • H. Liang et al.

    De novo design of a βαβ motif

    Angew Chem Int Ed

    (2009)
  • P.S. Huang et al.

    De novo design of a four-fold symmetric TIM-barrel protein with atomic-level accuracy

    Nat Chem Biol

    (2016)
  • E. Marcos et al.

    Prinicples for designing proteins with curved β sheets

    Science

    (2017)
  • S.E. Boyken et al.

    De novo design of protein homo-oligomers with modular hydrogen-bond network–mediated specificity

    Science

    (2016)
  • P. Lu et al.

    Accurate computational design of multipass transmembrane proteins

    Science

    (2018)
  • N.H. Joh et al.

    De novo design of a transmembrane Zn2+-transporting four-helix bundle

    Science

    (2014)
  • A.J. Burton et al.

    Installing hydrolytic activity into a completely de novo protein framework

    Nat Chem

    (2016)
  • D.W. Watkins et al.

    Construction and in vivo assembly of a catalytically proficient and hyperthermostable de novo enzyme

    Nat Commun

    (2017)
  • A. Stank et al.

    Protein binding pocket dynamics

    Acc Chem Res

    (2016)
  • L.E. Kay

    New views of functionally dynamic proteins by solution NMR spectroscopy

    J Mol Biol

    (2016)
  • A.G. Palmer

    Enzyme dynamics from NMR spectroscopy

    Acc Chem Res

    (2015)
  • L. Jiang et al.

    De novo computational design of retro-aldol enzymes

    Science

    (2008)
  • J.B. Siegel et al.

    Computational design of an enzyme catalyst for a stereoselective bimolecular diels-alder reaction

    Science

    (2010)
  • L. Giger et al.

    Evolution of a designed retro-aldolase leads to complete active site remodeling

    Nat Chem Biol

    (2013)
  • A.F.A. Peacock

    Incorporating metals into de novo proteins

    Curr Opin Chem Biol

    (2013)
  • A.G. Tebo et al.

    Artificial metalloenzymes derived from three-helix bundles

    Curr Opin Chem Biol

    (2015)
  • G. Schreiber et al.

    Computational design of protein–protein interactions

    Curr Opin Struct Biol

    (2013)
  • W.F. DeGrado et al.

    De novo design and structural characterization of proteins and metalloproteins

    Annu Rev Biochem

    (1999)
  • F. Thomas et al.

    De novo-designed α-helical barrels as receptors for small molecules

    ACS Synth Biol

    (2018)
  • J. Dou et al.

    De novo design of a fluorescence-activating beta-barrel

    Nature

    (2018)
  • B.M. Discher et al.

    Hydrophilic to amphiphilic design in redox protein maquettes

    Curr Opin Chem Biol

    (2003)
  • K.J. Grayson et al.

    The ascent of man(made oxidoreductases)

    Curr Opin Struct Biol

    (2018)
  • G. Kodali et al.

    Design and engineering of water-soluble light-harvesting protein maquettes

    Chem Sci

    (2017)
  • Cited by (48)

    • Insulin receptor-inspired soluble insulin binder

      2023, European Journal of Cell Biology
    • Exploiting enzyme evolution for computational protein design

      2022, Trends in Biochemical Sciences
      Citation Excerpt :

      This makes de novo protein design once again an exciting area, particularly as the field has matured significantly [65–67], with significant contributions also likely to be made from recent developments in protein structure prediction (such as Alpha Fold [68] and recent developments in protein structure prediction using a three-track neural network [69]). Various approaches have been used to try to design new protein scaffolds [65–67], one of which is modular design to assemble new protein folds [70]. This approach is attractive because it effectively allows new proteins to be assembled as if they are made of molecular LEGO® building blocks.

    • Algorithms for protein design

      2022, Advances in Protein Chemistry and Structural Biology
    • A Brief History of De Novo Protein Design: Minimal, Rational, and Computational: De novo protein design

      2021, Journal of Molecular Biology
      Citation Excerpt :

      Usually, and in the broadest terms, this is done by choosing a target protein shape or assembly and then finding a sequence, or sequences most likely to fold to and stabilise that target. For the next step of introducing function, there are two schools of thought in de novo design18: some advocate generating stably folded scaffolds first and then embellishing these with functions; while others argue that the processes of structural and functional design should be more integrated–in short, the de novo proteins should be built around the functional regions. Second, and before getting too embroiled in the details of protein design, we need some boundaries, definitions and illustrations.

    • Stability of Ligand-induced Protein Conformation Influences Affinity in Maltose-binding Protein

      2021, Journal of Molecular Biology
      Citation Excerpt :

      Thereby, we could expand the methodologies to evolve enzymes towards biotechnological relevant functions. Moreover, our approach could be of value in understanding and guiding the binding behavior of de novo designed proteins.51 Especially, now the existence of multiple conformations are also taken into account during the design process.52

    View all citing articles on Scopus
    View full text