Journal of Molecular Biology
Volume 432, Issue 4, 14 February 2020, Pages 1183-1198
Journal home page for Journal of Molecular Biology

Protein Interaction Energy Landscapes are Shaped by Functional and also Non-functional Partners

https://doi.org/10.1016/j.jmb.2019.12.047Get rights and content

Highlights

  • Functional and nonfunctional protein interactions undergo severe competition.

  • We study how this competition shapes the protein surfaces’ interaction propensity.

  • We present 2D energy maps reflecting the interaction propensity of protein surfaces.

  • The interaction propensity of the whole protein surface is conserved during evolution.

  • Strikingly this feature holds for functional but also nonfunctional protein pairs.

Abstract

In the crowded cell, a strong selective pressure operates on the proteome to limit the competition between functional and non-functional protein-protein interactions. We developed an original theoretical framework in order to interrogate how this competition constrains the behavior of proteins with respect to their partners or random encounters. Our theoretical framework relies on a two-dimensional (2D) representation of interaction energy landscapes, with 2D energy maps, which reflect in a synthetic way the spatial distribution of the interaction propensity of a protein surface for another protein. We realized the interaction propensity mapping of proteins' surfaces in interaction with functional and arbitrary partners and asked whether the distribution of their interaction propensity is conserved during evolution. Therefore, we performed several thousands of cross-docking simulations to systematically characterize the energy landscapes of 103 proteins interacting with different sets of homologs, corresponding to their functional partner’s family or arbitrary protein families. Then, we systematically compared the energy maps resulting from the docking of each protein with the different protein families of the dataset. Strikingly, we show that the interaction propensity not only of the binding sites but also of the rest of the surface is conserved for docking partners belonging to the same protein family. Interestingly, this observation holds for docked proteins corresponding to true but also arbitrary partners. Our theoretical framework enables the characterization of the energy behavior of a protein in interaction with hundreds of proteins and opens the way for the characterization of the behavior of proteins in a specific environment.

Introduction

Biomolecular interactions and particularly protein-protein interactions (PPIs) are central for many physiological processes and are of utmost importance for the functioning of the cell. As such, they have brought about a wealth of studies over the last decades [[1], [2], [3], [4], [5]]. The concentration of proteins in a cell has been estimated to be approximately 2–4 million proteins per cubic micron [6]. In such a highly crowded environment, proteins constantly encounter each other, and numerous nonspecific interactions are then likely to occur [[7], [8], [9], [10]]. For example, in the cytosol of S. cerevisiae, a protein can encounter up to 2000 different proteins [11]. In this complex jigsaw puzzle, each protein has evolved to bind the right piece(s) in the right way at the right moment (positive design), and to prevent misassembly and non-functional interactions (negative design) [[12], [13], [14], [15], [16]].

Positive design constrains the physicochemical properties and the evolution of protein-protein interfaces. Indeed, a strong selection pressure operates on binding sites to maintain the functional assembly, including the functional partner and the functional binding mode. For example, homologs sharing at least 30% sequence identity almost invariably interact in the same way [17]. Conversely, the negative design prevents proteins from being trapped in the numerous competing non-functional interactions inherent to the crowded environment of the cell. Many studies have described the relationship between the propensity of proteins for promiscuous interactions and their abundances or surface properties [[18], [19], [20], [21]]. Particularly, it has been shown that misinteraction avoidance shapes the evolution and physicochemical properties of abundant proteins, resulting in a slower evolution and less sticky surfaces than what is observed for less abundant ones [18,[22], [23], [24], [25], [26]]. The whole surface of abundant proteins is thus constrained, preventing them from engaging in deleterious non-specific interactions that could dramatically impact the cell at high concentrations [25]. For example, it has been shown in E. coli that the net charge, as well as the charge distribution on protein surfaces, affect the diffusion coefficients of proteins in the cytoplasm [19,27]; positively charged proteins move up to 100 times more slowly as they get caught in nonspecific interactions with ribosomes, which are negatively charged, and which, thus shape the composition of the cytoplasmic proteome [27].

All these studies show that both positive and negative designs operate on the whole protein surface. Binding sites are constrained to maintain functional assemblies (i.e., functional binding modes with functional partners) while the rest of the surface is constrained to avoid non-functional assemblies. These constraints should, therefore, shape the energy landscapes of functional but also, non-functional interactions to prevent the latter from prevailing over the former. This should have consequences on the evolution of the whole protein surface propensity to interact with its environment since non-interacting surfaces are in constant competition with functional binding sites. We can thus hypothesize that the distribution over the protein surface of its interaction propensity is constrained during evolution in order to ensure that proteins correctly bind functional partners and do not bind to other proteins to form non-functional assemblies.

In this work, we focus on protein surfaces as a “proxy” for regulating functional and non-functional PPIs. We investigate interaction energy landscapes of proteins with native and non-native partners and ask whether the distribution over the protein surface of its interaction propensity is conserved during evolution. With this aim in mind, we performed large-scale docking simulations to characterize interactions involving either native or native-related (i.e., partners of their homologs) partners or arbitrary partners. Docking simulations have the advantage of being able to simulate the interaction of a protein with arbitrary partners, which is very difficult to address with experimental approaches. Docking algorithms are now fast enough for large-scale applications and allow for the characterization of interaction energy landscapes for thousand of protein couples. Typically, a docking simulation takes from a few minutes to a couple of hours on modern processors [[28], [29], [30]], opening the way for extensive cross-docking experiments [[31], [32], [33], [34], [35]]. Here, we performed a cross-docking experiment involving 103 proteins docked with their native or native-related partners and their homologs, as well as arbitrary partners and their homologs. We represented the interaction energy landscapes resulting from each docking calculation with a two dimensional (2D) energy map in order to (i) realize the interaction propensity mapping of a protein surface in interaction with any protein and (ii) easily compare the energy maps resulting from the docking of a same protein with different sets of homologous partners. Thus, we characterize the evolution of the distribution over a protein surface of its interaction propensity for native or arbitrary partners.

Section snippets

Characterization of the interaction propensity for the whole surface of a protein

If positive and negative designs constrain the propensity of the whole surface of proteins to interact with their functional partners or random encounters, this should shape the evolution of interaction energy landscapes of functional protein pairs but also of random encounter pairs. Consequently, we expect the interaction energy landscape involving two proteins (forming functional or arbitrary partners) to be conserved for a pair of proteins that are homologous to them. Testing this hypothesis

Impact of positive and negative design on the interaction propensity of proteins

In this study, we address the impact of positive and negative design on thousands of interaction energy landscapes with a synthetic and efficient representation of the docking energy landscapes: two-dimensional energy maps. These maps reflect the spatial distribution over the whole surface of a protein (namely the receptor) of its propensity to interact with a given partner (namely, the ligand). We show that the distribution on the protein surface of all regions, including cold, intermediate,

Protein dataset

The dataset comprises 103 protein structures divided into 16 families of structural homologs (see S1 Table for a detailed list of each family). Each family is composed of a monomer (i.e., the master of the family) selected from the protein-protein docking benchmark 5.0 [69] in its unbound forms. Each master protein is a native partner of a master protein of another family (i.e., the 3D structure of the corresponding complex has been characterized experimentally). The two corresponding families

Fundings

SSM work is supported by the “Initiative d’Excellence” program from the French State (Grant “DYNAMO”, ANR-11-LABX-0011-01). HS work was supported by a French government fellowship. This work was performed using HPC resources from GENCI-[TGCC/CINES/IDRIS] (Grant 2017 - [DARI A0030707460]).

Acknowledgments

We thank F. Fraternali, R. Guerois, E. Laine, and M. Montes for their constructive comments on the manuscript.

References (78)

  • H.-X. Zhou et al.

    Modeling protein association mechanisms and kinetics

    Curr. Opin. Struct. Biol.

    (2013)
  • P.L. Kastritis et al.

    Proteins feel more than they see: fine-tuning of binding affinity by properties of the non-interacting surface

    J. Mol. Biol.

    (2014)
  • X. Li et al.

    Protein–protein interactions: hot spots and structurally conserved residues often locate in complemented pockets that pre-organized in the unbound states: implications for docking

    J. Mol. Biol.

    (2004)
  • O. Keskin et al.

    Hot regions in protein–protein interactions: the organization and contribution of structurally conserved hot spot residues

    J. Mol. Biol.

    (2005)
  • S. Sacquin-Mora et al.

    Identification of protein interaction partners and protein–protein interaction sites

    J. Mol. Biol.

    (2008)
  • B. Dubreuil et al.

    Protein abundance biases the amino acid composition of disordered regions to minimize non-functional interactions

    J. Mol. Biol.

    (2019)
  • G. Schreiber et al.

    Protein binding specificity versus promiscuity

    Curr. Opin. Struct. Biol.

    (2011)
  • J. Planas-Iglesias et al.

    Understanding protein–protein interactions using local structural features

    J. Mol. Biol.

    (2013)
  • K. Pawłowski et al.

    Surface map comparison: studying function diversity of homologous proteins

    J. Mol. Biol.

    (2001)
  • T. Vreven et al.

    Updates to the integrated protein–protein interaction benchmarks: docking benchmark version 5 and affinity benchmark version 2

    J. Mol. Biol.

    (2015)
  • A.R. Ferreira da Silva

    A Dirichlet process mixture model for brain MRI tissue classification

    Med. Image Anal.

    (2007)
  • J.I. Garzón et al.

    A computational interactome and functional annotation for the human proteome

    ELife

    (2016)
  • J. Janin et al.

    Protein–protein interaction and quaternary structure

    Q. Rev. Biophys.

    (2008)
  • I. Nobeli et al.

    Protein promiscuity and its implications for biotechnology

    Nat. Biotechnol.

    (2009)
  • I.M.A. Nooren et al.

    New embo MEMBER’S review: diversity of protein-protein interactions

    EMBO J.

    (2003)
  • C.V. Robinson et al.

    The molecular sociology of the cell

    Nature

    (2007)
  • R. Milo

    What is the total number of protein molecules per cell volume? A call to rethink some published values: insights & Perspectives

    Bioessays

    (2013)
  • S.R. McGuffee et al.

    Diffusion, crowding & protein stability in a dynamic molecular model of the bacterial cytoplasm

    PLoS Comput. Biol.

    (2010)
  • I. Yu et al.

    Biomolecular interactions modulate macromolecular structure and dynamics in atomistic model of a bacterial cytoplasm

    ELife

    (2016)
  • J.S. Richardson et al.

    Natural -sheet proteins use negative design to avoid edge-to-edge aggregation

    Proc. Natl. Acad. Sci.

    (2002)
  • S. Pechmann et al.

    Physicochemical principles that regulate the competition between functional and dysfunctional association of proteins

    Proc. Natl. Acad. Sci.

    (2009)
  • E.J. Deeds et al.

    Robust protein protein interactions in crowded cellular environments

    Proc. Natl. Acad. Sci.

    (2007)
  • H. Garcia-Seisdedos et al.

    Proteins evolve on the edge of supramolecular self-assembly

    Nature

    (2017)
  • M. Heo et al.

    Topology of protein interaction network shapes protein abundances and strengths of their functional and nonspecific interactions

    Proc. Natl. Acad. Sci.

    (2011)
  • Q. Wang et al.

    Exploring weak, transient protein–protein interactions in crowded in vivo environments by in-cell nuclear magnetic resonance spectroscopy

    Biochemistry

    (2011)
  • N. Zhang et al.

    Quinary interactions weaken the electric field generated by protein side-chain charges in the cell-like environment

    J. Am. Chem. Soc.

    (2017)
  • X. Mu et al.

    Physicochemical code for quinary protein interactions in Escherichia coli

    Proc. Natl. Acad. Sci. U.S.A.

    (2017)
  • C. Pal et al.

    Highly expressed genes in yeast evolve slowly

    Genetics

    (2001)
  • J. Zhang et al.

    Constraints imposed by non-functional protein–protein interactions on gene expression and proteome size

    Mol. Syst. Biol.

    (2008)
  • View full text