Behaviour of intrinsically disordered proteins in protein–protein complexes with an emphasis on fuzziness
- 2k Downloads
Intrinsically disordered proteins (IDPs) do not, by themselves, fold into a compact globular structure. They are extremely dynamic and flexible, and are typically involved in signalling and transduction of information through binding to other macromolecules. The reason for their existence may lie in their malleability, which enables them to bind several different partners with high specificity. In addition, their interactions with other macromolecules can be regulated by a variable amount of chemically diverse post-translational modifications. Four kinetically and energetically different types of complexes between an IDP and another macromolecule are reviewed: (1) simple two-state binding involving a single binding site, (2) avidity, (3) allovalency and (4) fuzzy binding; the last three involving more than one site. Finally, a qualitative definition of fuzzy binding is suggested, examples are provided, and its distinction to allovalency and avidity is highlighted and discussed.
KeywordsIDP Allovalency Fuzzy complex Signalling Avidity Disorder Kinetics
Signalling and regulation are essential to all living cells and are based on intermolecular interactions, most of which are mediated by proteins. A substantial fraction of proteins include large regions of disorder without clearly defined three-dimensional structure. Such intrinsically disordered proteins (IDPs) are not only very abundant—30–40% of all proteins in the human proteome are disordered or contain intrinsically disordered regions (IDRs) [1, 2]—they also constitute significant parts of membrane proteins [3, 4] and occupy pivotal positions in cellular regulation on all levels . Some even display enzymatic activity . Thus, IDPs are critically involved in key cellular processes and important for understanding life. Although IDP research has grown somewhat independent from traditional biology and biochemistry, it is conceptually important to follow the models, views and nomenclatures used generally for proteins, which have been developed over the past 120 years since Fisher proposed the lock-and-key model for ligand binding . Thus, throughout this review the IDP is referred to as the ligand (L). The residues involved in binding are expected to be disordered, but that does not exclude the presence of ordered regions in other parts of the peptide chain. In the present discussion, ordered regions are assumed not to be involved in the interaction. The binding partner that may or may not be an IDP is referred to as the receptor (R), although this macromolecule does not need to be a receptor per se.
By definition, IDPs have high rotational freedom and sample a wide range of conformations [8, 9, 10]. Their hyper-dynamical nature renders them malleable and thereby potentiates their ability to bind multiple structurally diverse receptors, while retaining specificity. This conjecture implies that IDPs are superior to their folded counterparts when it comes to binding many different partners. Interestingly, the thermodynamics of the interaction between an IDP and a folded partner is essentially similar to the situation when two globular proteins interact, only compromised on average by around 2.5 kcal mol−1 due to loss of conformational entropy originating from the structuring of the disordered chain . However, the distribution of states and the dynamics of the complexes vary. Some IDP binding sites become ordered upon binding to their receptor, a phenomenon called folding upon binding . Several crystal and NMR structures of such complexes exist [13, 14] and they highlight details of the interactions [15, 16, 17, 18]. In terms of kinetics, these are typical examples of simple two-state reactions, where the energy landscape of the complex is presented by a very deep well and one single structure can, in essence, represent the complex. At the other extreme, some ligands never ‘rest’ in complex with a receptor and there is no single conformation for the ‘bound-state’. In this case, the IDP ligand retains conformational freedom in the complex. Such interactions have recently been coined fuzzy complexes [19, 20] and a database has been established, collecting examples of the phenomenon . Between these extremes, other binding modes are found. Earlier work has provided kinetic interpretations of those modes and their mechanisms of binding have been referred to as avidity and allovalency [22, 23]. In the following, we will describe the four different mechanisms in more details.
Simple two-state binding
Avidity was originally used to describe the binding between an antibody and an antigen, and is thus not exclusively an IDP phenomenon . Avidity arises when two or more binding sites are present on the ligand, complementing two or more binding sites on the receptor (Fig. 1b). The binding sites on the ligand are connected by a linker and this linker ensures that once one site is bound to the receptor, other site(s) are spatially close to other receptor sites, resulting in cooperative binding, due principally to a lower entropic cost of binding more than one ligand . Avidity requires the receptor and the ligand to have the same number of binding sites, where each site is unique and the sites cannot exchange. Once the ligand has bound one site, the probability of establishing an additional binding contact is much higher than for the first binding event and so forth, introducing cooperativity.
The first binding event is a second order reaction, whereas subsequent binding events are first order (pseudo-intramolecular) events. Thus, the entropic loss in subsequent binding events is lower.
The values of k esc and k on depend on n, whereas k off and k cap do not, since at any given time, only one ligand site can occupy the binding site on the receptor, and entering the P zone from the F zone is a diffusion process that only depends on [L]. The rate constant k esc thus decreases exponentially with n, introducing the cooperativity of the system.
The defining example of allovalency is Sic1, an IDP from yeast, and its receptor Cdc4. The interaction depends on phosphorylation of up to ten serine and threonine residues on Sic1 . Each of these phosphorylated epitopes can target a single binding pocket on Cdc4. The binding is cooperative, as when less than six sites are phosphorylated there is almost no binding. Phosphorylation of the sixth arbitrary group produces strong binding and further phosphorylation increases the affinity in a non-linear way. The fraction of bound Sic1 to Cdc4 is thus described as:
Allovalent binding where ligand epitopes are created by post-translational modifications has the potential to function as a highly cooperative switch. Please note, that the allovalency model has been discussed and expanded beyond the present formulation by Locasale .
The term fuzzy complex was introduced by Tompa and Fuxreiter in 2008  and the concept has been further refined and discussed, both by Fuxreiter and others [31, 32, 33]. The name is inspired by the mathematical term ‘fuzzy logic’ in which the true answer to a question can be no (0) or yes (1) or any value in between. Thus, analogously in a binding reaction, a ligand can be more than just fully bound to the receptor or completely free. As a further extension to this description, fuzzy complexes are ensembles of complexes, which are all needed to be able to fully describe the bound state (Fig. 1d).
In a wider perspective, all complexes at temperatures above 0 K are fuzzy. In one extreme, atomic vibrations cause fuzziness in a solid-state system. The opposite extreme can be illustrated by non-specific interactions between atoms or molecules in the gas state. In that light, treating fuzziness as a distinct biochemical phenomenon linked to IDPs seems artificial. Fuzzy complexes, however, challenge the view that a protein–ligand complex occupies a single structural state, a notion that is fuelled by the overwhelming amount of crystal structures of protein–ligand complexes deposited in the protein data bank. Obviously, X-ray crystallographic data are biased towards non-dynamic molecules. A fuzzy complex is dynamic in the bound state and occupies several conformational states. Consequently, crystallographic methods are not sufficient for realistic visualization. If a crystal could be grown with a fuzzy complex, the electron density at the binding interface would be the average of all the conformations present in the crystal and hardly possible to interpret. Alternatively, a single state is allowed in the crystal lattice, producing a misleading artefact, at best describing one out of many possible states. Having said that, it is important to mention an interesting study employing SAXS, NMR and X-ray crystallography to investigate the binding of an intrinsically disordered region of ribosomal S6 kinase1 (Rsk1) to its inhibitor S100B. The investigators caught different Rsk1 structures in different crystal forms of the complex and were able to describe the fuzzy complex using data obtained by all three techniques . To our knowledge, the first fuzzy complexes discovered were the homodimerization of the intracellular region of the T cell receptor subunit ζ and, subsequently, the heterodimerization of the same receptor region with a folded protein (Nef protein core from simian immunodeficiency virus) . Although dynamic dimers of IDPs exist , the existence of homo-dimers in the former publication has been challenged  and importantly, so has the initial notion that fuzzy complexes can form without any peak perturbations in the NMR spectra . However, the nature of fuzzy complexes and the degree by which we currently understand them, combined with the degree by which their formation is manifested in changes in measureable parameters, challenge the current toolbox of structural biology. The development of new approaches, in which single molecules analyses are one important road ahead, is needed.
Fuxreiter et al. described fuzzy complexes as ‘protein complexes, where conformational heterogeneity of ID regions is retained and is required for function’ . However, any bond between two functional groups will reduce the number of degrees of freedom of the system by thermodynamic definition. Assuming that conformational heterogeneity is proportional to the number of microstates of the system (definitions of conformational heterogeneity can be found here [39, 40, 41]), conformational heterogeneity cannot be completely retained, not even in a fuzzy complex, because each bound state has lower entropy than each unbound state.
Although the introduction of fuzziness and fuzzy complexes as concepts has been tremendously important for driving our understanding of IDPs, a stricter definition of fuzzy complexes is needed. Thus, to further advance the field, a formal definition of the fuzzy phenomenon in terms of molecular dynamics and kinetics is necessary. This definition must explain the affinity/kinetics and fuel the design of experiments that can directly test for the fuzzy phenomenon. Here we describe fuzziness as two or more ligand binding sites on the receptor being able to bind to two or more receptor binding sites on the ligand. In a sense this is a combination of two allovalency phenomena, one experienced by the ligand and one experienced by the receptor (Fig. 1d). We only describe this conceptually, and present no formalistic description, but refer to Vauquelin et al., who have described the simplest system formalistically where n = 2 for each partner of the complex [42, 43].
What makes fuzzy binding special?
A fuzzy complex consists of an intrinsically disordered ligand and a receptor (which may and may not be disordered itself). The complex, once established, does not lead to a single ligand (and in some cases receptor) conformation, rather the ligand samples a large conformational space as functional groups bind and unbind the receptor. A functional group could be PO4 2−, NH3 +, O−, OH, CH3, a ring system etc., i.e. any functional group in a protein. The ligand-receptor sub-sites recombine during binding and individual interactions within the interface are short lived compared to the overall life-time of the complex. Furthermore, these individual interactions may recombine within the life-time of the complex. So one functional group on one of the proteins may be free to bind different functional groups on the other protein.
How is fuzziness different from the other mechanisms presented in this paper? Of the four modes of interaction described here, fuzziness is most easily confused with (or similar to) allovalency. The difference is that allovalency requires several identical binding sites on the ligand and a single ligand-binding site on the receptor . In the case of allovalency, the cooperative dependency on the number of compatible sub-sites is reflected in k esc, because the probability that an unbound sub-site will rebind to the receptor, and thus prevent diffusion beyond the proximal region increases non-linearly with the number of sub-sites. However, whereas k off is independent on the number of ligand sites in the case of allovalency, this will not be the case for fuzzy binding, if one accepts the definition above. In a fuzzy complex, both the ligand and the receptor contain several ‘sub-binding sites’ or compatible functional groups, and several of those groups can make contact simultaneously. This means that the observed k off is dependent on the number of compatible functional groups and their individual k offs.
Although described individually, we anticipate the discovery of hybrid examples, where two or more receptor binding sites can bind several binding sites on the ligand and where both k off and k esc contribute to the cooperative effect. However, to be able to distinguish between the different mechanisms in a testable frame, we need formalistic descriptions. As far as we know, the exact formalistic definition of fuzzy complex formation in terms of how k off depends on the number of groups has not yet been derived.
One might ask how fuzziness differs from other macromolecular complex formation processes. The difference between a fuzzy complex and unspecific contacts between macromolecules is that a fuzzy complex has a biological consequence. The affinity may be ‘high’ or ‘low’ but the important point is that the result of the interaction has biological outcome. A lower limit for apparent affinity is not possible to define and there is no reason to believe that a higher limit exists beyond which ordered structure is required. Some fuzzy complexes have reported K d values in the nM range .
What can fuzziness offer biological systems that other kinds of complexes cannot? Perhaps the most obvious answer would be binding at low entropic cost, since a high degree of conformational heterogeneity is retained in the complex. However, in general, the negative (unfavourable) entropy change upon binding is more or less the same for IDPs as for folded proteins . With the rather few protein complexes that have been classified as fuzzy so far, it remains to be seen if fuzzy complexes differ in this respect.
Since the cooperativity of fuzziness depends on n, one could imagine that in extreme cases this could lead to very strong binding, pushing the affinity into the pM range for large number of n. Since disorder is maintained in the complex, accessibility to modifying enzymes is not compromised. Thus, even though the affinity of the complex in its unmodified form is high, the lifetime of an individual conformational state is low, which allows for regulation on the fly. In this context, we notice that fuzzy complexes offer a scaffold for ideal rheostats , in which applying or removing post-translational modifications at the binding interface, can tune binding affinity.
In line with the notion that classical interactions between ordered macromolecules and interactions with IDPs in fuzzy complexes represent the extremes of a dynamics trajectory, fuzzy complexes may not require a fundamentally different explanation . In the following, we provide two examples, which according to the definition highlighted above can be classified as fuzzy complexes.
The complex between nucleoporin and nuclear transporter receptors
A protein that needs to enter the nucleus can do so by binding to a soluble protein called a nuclear transport receptor (NTR). In the nuclear pore these can bind to IDPs (or disordered regions in globular proteins) called FG-Nups (phenylalanine-glycine-rich nuclear pore proteins). The interaction between NTRs and FG-Nups were examined in vitro by single molecule FRET, NMR and molecular dynamics simulations [45, 46]. These studies showed that the IDP only undergoes subtle structural and dynamical changes in the complex. Each local interaction—the encounter between complementary functional groups—has low (mM) affinity but the apparent K d is around 100 nM and k on is remarkably high, approaching the theoretical diffusion limit (~109 M−1 s−1). This hints that the interaction may not depend on the relative orientation of the molecules. The authors suggest that these unique kinetic characteristics make it possible to ‘grab’ the NTR proteins with high affinity, still unbind them efficiently and send them along to other Nups in the nuclear pore complex, until they have been transported through the pore with their passenger protein [45, 46]. Thus, these characteristics fulfil the expectations of a fuzzy complex.
Clathrin heavy chain binding by AP180
The process of endocytosis involves dynamic interactions among molecules associated with the membrane. Molecular rearrangements result in invagination of the membrane and ultimately in a vesicle budding off. Central to this process is the association of protein AP180 with the N-terminal domain of clathrin heavy chain (TD). AP180 contains 12 degenerate motifs in the C-terminal 58 kD intrinsically disordered region, and each of these ~23 residues regions contain a DLL/DLF binding motif. Each TD domain has three AP180 receptor sites. Aspects of complex formation has been examined using NMR spectroscopy, analytical ultracentrifugation, isothermal titration calorimetry and X-ray crystallography [47, 48, 49]. NMR spectroscopy studies showed that the TD-bound and free state of an AP180 fragment containing two TD binding motifs retained disorder, but the spectra revealed chemical shift changes. The K d values of the individual sites were determined to be around 200 µM. Interestingly, the k off values were around 3000 s−1 and the k on values were 1–2 × 107 M−1 s−1, approaching those determined for the NTR—FG-Nup interactions described in the example above.
The two examples share very high k on and k off values. This may not be seen as a prerequisite for fuzzy complexes. A fuzzy complex can in principle exist in slow motion. In the case of allovalency, however, k on must be higher than k esc. In spite of the resemblance between the two mechanisms, there are no constraints on k on or k off in fuzzy binding, except of course that k on/k off > 1.
Intrinsically disordered proteins form complexes with other proteins, and may do so by different binding mechanisms. Allovalency and avidity have been described formalistically in the literature whereas the phenomenon of fuzzy complexes has not. It has been put forward conceptually to describe a binding phenomenon associated with IDPs. In the present review, we argued that complete conformational heterogeneity cannot be retained in fuzzy complexes and that the cooperative dependence on the number of groups that can participate in binding (n) arises because both k cap and k off depend on the magnitude of n. Thus, an important conclusion is that IDP complexes with only one receptor-binding site are not strictly fuzzy, but must be described according to the formalisms of allovalency. Notably, both allovalency and fuzzy complexes are dependent on the ligand being an IDP, whereas two-state binding and avidity are not. To this end, the rationale for the existence of fuzzy complexes is discussed. Fuzzy complexes can have very low K d values (nM or lower), but are not restricted to this, and the binding affinity has the potential to be rapidly regulated, for example by post-translational modifications, even when bound. This provides a versatility and swiftness in signal changes, and offers the possibility of rheostat regulation, which may not be possible in the interaction between folded proteins. Although we did not derive a formalistic description of fuzzy binding, we strongly encourage its derivation, which will allow for testable experiments to investigate fuzzy complex formation to the full.
The authors wish to acknowledge the Novo Nordisk Foundation (BBK) and the students working with fuzzy complexes in our lab. We are grateful to Mr. Leif Bolding for providing the graphical input to Fig. 1.
A single receptor site binding to a ligand with several identical binding epitopes.
Two or more receptor sites that bind to two or more ligand epitopes in a cooperative manner.
An IDP (or IDR) binding to a receptor, constantly shifting the coupling of functional compatible groups, thereby retaining some of the conformational heterogeneity of the free molecule.
- 17.Brautigam CA, Wynn RM, Chuang JL et al (2011) Structural and thermodynamic basis for weak interactions between dihydrolipoamide dehydrogenase and subunit-binding domain of the branched-chain-ketoacid dehydrogenase complex. J Biol Chem 286:23476–23488. doi: 10.1074/jbc.M110.202960 CrossRefPubMedPubMedCentralGoogle Scholar
- 35.Sigalov AB, Kim WM, Saline M, Stern LJ (2008) The intrinsically disordered cytoplasmic domain of the T cell receptor ζ chain binds to the Nef protein of simian immunodeficiency virus without a disorder-to-order transition †. Biochemistry 47:12942–12944. doi: 10.1021/bi801602p CrossRefPubMedPubMedCentralGoogle Scholar
- 48.Zhuo Y, Cano KE, Wang L et al (2015) Nuclear magnetic resonance structural mapping reveals promiscuous interactions between clathrin-box motif sequences and the N-terminal domain of the clathrin heavy chain. Biochemistry 54:2571–2580. doi: 10.1021/acs.biochem.5b00065 CrossRefPubMedPubMedCentralGoogle Scholar
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.