Blueprinting extendable nanomaterials with standardized protein blocks

Huddy, Timothy F.; Hsia, Yang; Kibler, Ryan D.; Xu, Jinwei; Bethel, Neville; Nagarajan, Deepesh; Redler, Rachel; Leung, Philip J. Y.; Weidle, Connor; Courbet, Alexis; Yang, Erin C.; Bera, Asim K.; Coudray, Nicolas; Calise, S. John; Davila-Hernandez, Fatima A.; Han, Hannah L.; Carr, Kenneth D.; Li, Zhe; McHugh, Ryan; Reggiano, Gabriella; Kang, Alex; Sankaran, Banumathi; Dickinson, Miles S.; Coventry, Brian; Brunette, T. J.; Liu, Yulai; Dauparas, Justas; Borst, Andrew J.; Ekiert, Damian; Kollman, Justin M.; Bhabha, Gira; Baker, David

doi:10.1038/s41586-024-07188-4

Article
Open access
Published: 13 March 2024

Volume 627, pages 898–904 (2024)

Abstract

A wooden house frame consists of many different lumber pieces, but because of the regularity of these building blocks, the structure can be designed using straightforward geometrical principles. The design of multicomponent protein assemblies, in comparison, has been much more complex, largely owing to the irregular shapes of protein structures¹. Here we describe extendable linear, curved and angled protein building blocks, as well as inter-block interactions, that conform to specified geometric standards; assemblies designed using these blocks inherit their extendability and regular interaction surfaces, enabling them to be expanded or contracted by varying the number of modules, and reinforced with secondary struts. Using X-ray crystallography and electron microscopy, we validate nanomaterial designs ranging from simple polygonal and circular oligomers that can be concentrically nested, up to large polyhedral nanocages and unbounded straight ‘train track’ assemblies with reconfigurable sizes and geometries that can be readily blueprinted. Because of the complexity of protein structures and sequence–structure relationships, it has not previously been possible to build up large protein assemblies by deliberate placement of protein backbones onto a blank three-dimensional canvas; the simplicity and geometric regularity of our design platform now enables construction of protein nanomaterials according to ‘back of an envelope’ architectural blueprints.

Main

There has been considerable recent progress in the design of protein nanomaterials including cyclic oligomers^2,3,4, polyhedral nanocages^5,6,7,8, one-dimensional (1D) fibres^9,10, 2D sheets^11,12 and 3D crystals^10,13 by docking together⁸ or fusing^6,14 protein monomers or cyclic oligomers. Although powerful, these methods have two limitations that arise from the irregularity of almost all protein structures. First, because the shapes of the constituent components are generally complex, they cannot be assembled into higher-order structures on the basis of simple geometric principles; instead, large-scale sampling calculations are required to identify shape-complementary interactions for each case, and there is no guarantee that designable interfaces can be found. Second, as for the myriad protein complexes in nature, the size of a designed protein assembly cannot be readily scaled; it is nearly impossible to make a smaller or larger but otherwise nearly identical version of assemblies generated using current design methods. By contrast, designed materials that extend along just one dimension, such as α-helical coiled coils and repeat proteins, can be grown or shrunk by simply varying the length of the chain. There is a rich history of designing coiled coils using simple geometric principles; this extensibility and designability have made them widely used constituents of designed protein materials¹⁵.

We set out to develop a general approach for designing expandable higher-order protein nanomaterials with the simplicity and programmability of coiled-coil engineering. We reasoned that if a modular and regular toolkit of building blocks and interactions could be generated consisting of (1) linear building blocks constructed from repeating sequence elements that extend without twisting as additional sequence repeats are added (Fig. 1b,c(top),d), (2) curved building blocks that trace out arcs of circles of different radii (Fig. 1c(bottom),d) and (3) non-covalent arrangements that hold two building blocks in pre-specified relative orientations (Fig. 1e), then building up new nanostructures could in principle be carried out by inspection in a manner analogous to blueprinting a house frame (Fig. 1a). Furthermore, as with a house frame, the regular structures of the constituent building blocks could enable scaling the dimensions of the final architecture (area or volume) by simply altering the size of the constituent monomers, and structural reinforcement by placement of additional buttressing elements.

**Fig. 1: Overview of THR protein blocks and interaction modules.**

Design of twistless helix repeats

Natural and previously designed proteins exhibit a wide range of helical geometries with local irregularities, kinks and deviations from linearity¹⁶ that make it difficult to achieve the properties illustrated in Fig. 1 that enable simple nanomaterial scaling (beyond the one dimension accessed by varying the number of repeats in a repeat protein or coiled coil). To achieve these properties, we designed a series of new building blocks constructed from ideal α-helices with all helical axes aligned. Restricting helical geometry to ideal straight helices with zero helical twist in principle considerably limits what types of structure could be built, but this is more than compensated by the great simplification of downstream material design, as illustrated below. We construct twistless helix repeat (THR) protein blocks from identical straight α-helices (typically 2–4 helices in each unit); the length of the blocks can be varied simply by varying the number of repeat units. In contrast to existing natural and designed repeat proteins¹⁷, THRs are constructed to enable modular nanomaterial design: linear blocks are perfectly straight, allowing nanomaterials to be extended and contracted with no alteration in the angles between the constituent monomers; curve blocks have smoothly curving trajectories that stay in-plane; and turn as well as interaction modules enable placement of two blocks in precise relative orientations with angles appropriate for regular material design.

We blueprint THRs by explicit placement of these straight helix structural elements using an extension of the principles used in coiled-coil and helical bundle design^16,18. A first helix a₀, part of the zeroth repeat, is placed at the origin and aligned to the z axis. A copy of a₀ called a₁ is then placed at a new location to set the rigid body transformation between the zeroth and first (and all subsequent) repeat units. After this, any other helices (b₀, c₀ …) that will be part of the repeating unit are placed as appropriate between a₀ and a₁ to provide more helices to pack against for stability, and the helices are connected with loops¹⁹; repetition of this basic unit then generates backbones with the desired geometries¹⁷ (Fig. 1b,c). As the helices are perfectly straight and parallel to the z axis, the overall repeat protein trajectory is fully defined by the following transformation parameters from a₀ to a₁: the distance of displacement in the x–y plane from helical axis to helical axis (d), the change in displacement in the z axis direction (Δh) and the change in helix phase (Δθ; Fig. 1b). The remaining degrees of freedom for the positions of helices b₀, c₀ …, which define the internal geometry of the repeat, are extensively sampled, sequences are designed using Rosetta FastDesign or ProteinMPNN^19,20, and designs are selected for experimental characterization on the basis of packing and sequence–structure consistency metrics (Methods). We obtained synthetic genes encoding the selected designs, expressed them in Escherichia coli and purified the proteins using nickel–nitrilotriacetic acid immobilized metal affinity chromatography. Designs that were solubly expressed were analysed by size-exclusion chromatography (SEC) to determine oligomerization state, and in the case of assemblies a subset was analysed by negative-stain electron microscopy (ns-EM). 
Experimental success rates and structural homogeneity for different classes of designs are summarized in Supplementary Figs. 1 and 2 and Supplementary Discussion.
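The role of the three transformation parameters can be illustrated with a minimal sketch that places only the helix axes a₀, a₁, a₂ … of successive repeats. It is not the authors' blueprinting code: the function name `place_repeat_helices`, the convention that the in-plane step direction rotates by Δθ with each repeat, and the numerical values (d = 10.0 Å, Δh = 3.0 Å, Δθ = 30°) are assumptions made for illustration.

```python
import math

def place_repeat_helices(n_repeats, d, delta_h=0.0, delta_theta_deg=0.0):
    """Return (x, y, z, phase_deg) for helix axes a_0 .. a_{n_repeats-1}.

    Assumed convention (not taken from the paper's code): each repeat steps a
    distance d in the x-y plane along the current heading, rises by delta_h
    along z, and the heading and helix phase both advance by delta_theta.
    All helices stay parallel to the z axis.
    """
    placements = []
    x = y = z = 0.0
    for i in range(n_repeats):
        turn_deg = i * delta_theta_deg
        placements.append((x, y, z, turn_deg % 360.0))
        heading = math.radians(turn_deg)
        x += d * math.cos(heading)
        y += d * math.sin(heading)
        z += delta_h
    return placements

# Illustrative parameter choices (hypothetical values, in angstroms and degrees).
linear = place_repeat_helices(4, d=10.0)                         # straight, in-plane
stepped = place_repeat_helices(4, d=10.0, delta_h=3.0)           # straight, stair-stepping
curved = place_repeat_helices(13, d=10.0, delta_theta_deg=30.0)  # closes after 12 repeats
```

With Δθ = 0 the axes fall on a straight line, a non-zero Δh adds a constant rise per repeat, and a Δθ that divides 360° brings the thirteenth helix back onto the first.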

To generate straight, linear THRs, we set Δθ to zero. As illustrated in Fig. 2a,b, this results in perfectly straight repeat proteins in which each repeat unit is translated but not rotated relative to the previous unit. There are two subclasses: setting Δh = 0 generates repeat proteins with each repeat unit simply displaced in the x–y plane (Fig. 2a), whereas setting Δh to a non-zero value generates repeat proteins that also step along the z axis (Fig. 2b). We tested 33 linear THRs (with Δh = 0) with helices either about 20 or about 40 residues in height; 23 of 33 tested designs were solubly expressed, and 13 of 19 designs analysed by SEC were primarily monomeric as designed (Supplementary Figs. 1a,b and 2). Structural characterization of the linear building blocks by X-ray crystallography individually and/or cryogenic EM (cryo-EM) in the context of assemblies (see below) revealed that both the detailed internal structures and the overall straight linear geometry were successfully achieved. The backbone root mean square deviations (RMSDs) between the design models and crystal structures of three 20-residue helix designs (THR1, THR2 and THR3) and two 40-residue helix designs (THR5 and THR6) were 0.8, 0.8, 0.4, 0.6 and 1.3 Å, respectively, and in all five cases the relative rotation of successive repeats is nearly zero (Fig. 2a and Supplementary Fig. 6a). We found that we could not only control Δθ = 0, but also program values of the inter-repeat distance d: the crystal structure of a design with d set to a compact helix packing value of 8.7 Å had a very close value of 8.6–8.8 Å at its central interior (THR3), in contrast to most others designed at 10.0 Å (Supplementary Fig. 6b). For structural validation of blocks with non-zero Δh, the cryo-EM structure of an assembly constructed from such a block (THR4) exhibited a linear stair-stepping structure nearly identical to the design model (backbone RMSD of 1.0 Å; Fig. 2b and Supplementary Fig. 1a).

To generate turn blocks, we blueprint an additional helix c₀ lined up with a₀ and a₁ that can be assigned any specified phase difference, which can be utilized in fusion operations to produce a turn that is equal to θ_c − θ_a (Supplementary Fig. 5d,e). As for all of the THR blocks described here, because of the ideality of the block construction, the same sequence interactions can be used for the intra-block and inter-block interactions; we refer to blocks in which the terminal repeats have identical sequences to the internal repeats as uncapped, and those in which the terminal helices have polar outward-facing residues to prevent self-association (like the linear blocks above) as capped. We experimentally characterized uncapped turn modules that generate rotations of 360°/n, in which n is 3, 4, 5 or 6; if the geometry is correct, these should oligomerize to form closed polygons with n subunits. ns-EM 2D class averages of the n = 3 designs clearly show the designed triangular shape with flattened corners (Fig. 2c and Supplementary Fig. 1f), and for n = 4, the designed square shapes (Fig. 2c and Supplementary Fig. 1f) including fine details such as the lower density around the corner helix are observed. For n = 5 and n = 6, success rates were lower, probably because their hinge regions involved less-extensive helix–helix interactions, but we did obtain designs with the expected polygonal structures for both after using reinforced corners on the C₆ design (Supplementary Fig. 1f and Supplementary Discussion). Thus, by controlling the phase rotations between adjacent helices, turns can be encoded while maintaining overall parallel helical architecture. We also made polygonal designs with combinations of linear THRs and new straight helix-heterodimer corner junctions instead of turn modules (Supplementary Discussion and Supplementary Figs. 1g, 9 and 10).
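The relation between the helix phase difference and the resulting closed polygon can be written out as a short sketch. The helper names `turn_angle_deg` and `polygon_from_turn` are hypothetical, and the code captures only the idealized planar geometry (a turn of 360°/n per subunit closes after n subunits), not any property of the designed sequences.

```python
def turn_angle_deg(theta_a_deg, theta_c_deg):
    """Turn encoded by a turn block: the phase difference theta_c - theta_a."""
    return (theta_c_deg - theta_a_deg) % 360.0

def polygon_from_turn(turn_deg):
    """Return (n_subunits, interior_angle_deg) for a closed regular polygon.

    Raises ValueError when the turn does not divide 360 degrees evenly, in
    which case identical subunits would not close into a planar polygon.
    """
    if turn_deg <= 0.0:
        raise ValueError("turn must be positive")
    n = 360.0 / turn_deg
    if abs(n - round(n)) > 1e-9:
        raise ValueError("turn does not divide 360 degrees evenly")
    return int(round(n)), 180.0 - turn_deg

for n in (3, 4, 5, 6):
    turn = turn_angle_deg(0.0, 360.0 / n)
    print(n, polygon_from_turn(turn))
```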

To generate curve THRs, we incorporate a phase change (Δθ) between repeating elements (Fig. 1c) that generates a curved trajectory rather than a linear one. We choose Δθ to be a factor of 360° so that perfectly closed rings can be generated. The size of the closed ring can be controlled by specifying Δθ and the distance d between repeats (Supplementary Fig. 7). To access a broad range of d parameter values, we add additional helices to the repeat unit; for circular rings we used four helices per repeat unit. A full curve THR ring with n repeats can be divided into smaller chains each with m repeats, in which m is a factor of n; n/m such uncapped chains can associate to generate the full ring with cyclic symmetry²¹. To facilitate gene synthesis and protein production, we characterized such split oligomeric versions of the rings rather than synthesizing very long single chains. We designed rings with 12, 18, 20 and 30 repeats ranging from 9 to 22 nm in outside diameter. The 12- and 20-repeat rings were tested as C₄ designs, whereas the 18- and 30-repeat rings were tested as C₆ designs. Designs for all four ring sizes were remarkably uniform with ns-EM micrographs densely covered with circular assemblies with few to no defects or alternative structures present (Supplementary Fig. 7). Two-dimensional class averages showed that designs for all four sizes were close to the intended size (Fig. 2d; 10, 1 and 9 unique designs yielded distinct ring shapes for 18-, 20- and 30-repeat rings (Supplementary Figs. 1e and 2)). The smallest rings with 12 repeats have solvent-exposed helices exterior to the ring placed to facilitate outward-facing fusions without disrupting the core packing of the ring; these are clearly visible in the 2D class averages and 5.2-Å-resolution cryo-EM reconstruction of R12B (Fig. 2d and Supplementary Fig. 1e) that matches the designed patterning of the helices. ns-EM of the 18-, 20- and 30-repeat rings (with outside diameters of 12, 14 and 22 nm respectively) showed that many designs formed remarkably monodisperse populations of ring-like structures closely consistent with the design models (Fig. 2e,f and Supplementary Fig. 1e). ns-EM class averages of these designs had the smooth and round shape of the design models, and were in most but not all cases homogeneous (some designs assembled into closed-ring species that ranged by ±1 chain of the desired number, resulting in some slightly oblong shapes; Supplementary Fig. 1e). These designs highlight the control over ring curvature that can be achieved by specifying building block repeat rotation parameters.
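The ring bookkeeping described here reduces to simple arithmetic, sketched below. The function names are hypothetical, and `ring_radius` assumes an idealized geometry in which d is the chord between equivalent helix axes of successive repeats, so it gives the radius of the circle through those axes rather than the outside diameters reported above.

```python
import math

def ring_splits(n_repeats):
    """All ways to split an n-repeat ring into identical chains.

    Returns (m, n // m) pairs: m repeats per chain, n // m chains per ring,
    for every m that divides n (n // m is the cyclic symmetry order).
    """
    return [(m, n_repeats // m)
            for m in range(1, n_repeats + 1)
            if n_repeats % m == 0]

def ring_radius(d, delta_theta_deg):
    """Radius of the circle through helix axes spaced by chord d,
    with a rotation of delta_theta between successive repeats."""
    return d / (2.0 * math.sin(math.radians(delta_theta_deg) / 2.0))

for n in (12, 18, 20, 30):
    delta_theta = 360.0 / n          # phase change per repeat for a closed ring
    print(n, delta_theta, ring_splits(n))
```

For example, the C₄ version of the 12-repeat ring corresponds to the split (3, 4), and the C₆ version of the 30-repeat ring to the split (5, 6).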

The simplicity of our blocks in principle enables the reinforcing of designed materials using struts rigidly linking distinct structural elements. As a first test of this, we sought to build concentric ring assemblies from pairs of rings that have different sizes but repeat numbers that share a large common factor. For example, 2 repeat units of a 20-repeat ring can be combined with 3 repeat units of a 30-repeat ring as 10 copies generate a complete ring in both cases (Fig. 3a, left). Rings were segmented into matching cyclic symmetries, the rotation and z displacement of one ring relative to the other was sampled, and linear THRs were placed to connect the inner and outer rings. We constructed single-component C₁₀ concentric ring assemblies by connecting a three-repeat-unit curved block and a two-repeat-unit curved block that both generate a 36° (360°/10) rotation with a radially oriented strut. Two-dimensional class averages of ns-EM images of the designed strutted assemblies show both rings clearly present (Fig. 3a, right; some 11-subunit rings were observed in addition to the target 10-subunit structure). We similarly connected three repeat units with a 20° rotation per repeat, and five repeat units with a 12° rotation per repeat, with a radial strut; the resulting composite subunits map out a 60° rotation of inner and outer rings such that six subunits generate a full 360° ring. The resulting two-component C₆ strutted assembly yielded 2D class averages that showed both rings with all chains present, and a 5.1-Å cryo-EM reconstruction was very close to the design model (RMSD 2.7 Å) with very similar outer diameter (19.7 nm versus 20.1 nm; Fig. 3b and Supplementary Fig. 8c). The helix positioning in the inner ring and the strut are also very close to the design model (Supplementary Fig. 8c, insets). Thus, the modularity of the THRs enables designing complex structures by inspection, and enables buttressing to increase structural robustness (Supplementary Discussion and Supplementary Fig. 8).
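The pairing of ring sizes for concentric assemblies follows from the greatest common divisor of the two repeat numbers. The sketch below uses a hypothetical helper, `concentric_pairing`, to reproduce the two examples in the text; it covers only the counting argument, not the sampling of rotation and z displacement.

```python
from math import gcd

def concentric_pairing(n_inner, n_outer):
    """Match two rings by their shared cyclic symmetry.

    Returns (copies, inner_repeats, outer_repeats, rotation_deg): the number
    of composite subunits, the repeats each subunit takes from the inner and
    outer ring, and the rotation each subunit spans.
    """
    copies = gcd(n_inner, n_outer)
    return copies, n_inner // copies, n_outer // copies, 360.0 / copies

print(concentric_pairing(20, 30))   # 20- and 30-repeat rings
print(concentric_pairing(18, 30))   # 20 and 12 degrees per repeat
```

The first call gives 10 copies with 2 and 3 repeats each (36° per subunit), and the second gives 6 copies with 3 and 5 repeats each (60° per subunit), matching the C₁₀ and C₆ designs described above.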

**Fig. 3: Design of strutted double rings.**

Expandable nanomaterials

The regularity of our blocks in principle enables scaling the size of nanomaterial designs simply by changing the number of repeats in the constituent THRs without altering any of the inter-block interfaces. How the THRs must be aligned to enable expandability differs for each architecture, as described below.

To construct expandable cyclic assemblies, the linear THRs must be placed such that the propagation axis is normal to the cyclic symmetry axis. For cyclic designs with this property (those built from turn modules or heterodimers; see Supplementary Discussion), adding or removing repeats simply changes the length of the oligomer edge without affecting the interface between monomers. We tested this expandability with a C₄ ‘square’ (sC4) oligomer for which we had obtained a cryo-EM reconstruction with 1.6-Å-backbone RMSD (Supplementary Fig. 10). This subunit consists of a central linear THR flanked by straight-helix heterodimers that produce a 90° turn. To expand this structure, we inserted two additional repeat units (six helices) into the linear THR portion of the subunit. Cryo-EM 2D class averages for both the original and expanded square show close agreement to the design models and clear expansion; the helices clearly remain aligned to the z axis as designed (Fig. 5a).

Architectures with polyhedral nanocage symmetry can be similarly expanded provided that the linear THR propagation axis is parallel to the plane formed by the two symmetry axes spanned by the THR (Supplementary Fig. 11a). To generate such architectures, and enable further access to construction in three dimensions, we designed out-of-plane interactions between building blocks. We first focused on designing C₂ symmetric interfaces in which the angles between linear THRs correspond to the angles needed to generate regular polyhedral symmetry (Fig. 4) when combined with planar C₃ or C₄ components, while also satisfying expandability criteria. For an octahedral ‘cube’ (O₄) built from flat objects with C₄ symmetry that lie on the ‘cube faces’, this angle is 90°. For tetrahedra (T₃), octahedra (O₃) and icosahedra (I₃) built from flat C₃-symmetric objects, the out-of-plane handshake angles that are needed to join the flat objects are 70.5°, 109.5° and 138.2°, respectively^22,23. Handshake C₂ homodimers were generated by fixing this out-of-plane angle and keeping the linear THR propagation axes parallel to each other, sampling only the offset spacing between the THRs⁸ (Supplementary Fig. 12).
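The handshake angles quoted above coincide with the dihedral angles between adjacent faces of the corresponding regular polyhedra, which can be checked with the standard formula for a polyhedron with Schläfli symbol {p, q} (p-sided faces, q faces meeting at each vertex). The function name is an illustrative choice; the formula itself is textbook geometry rather than anything specific to the design pipeline.

```python
import math

def dihedral_angle_deg(p, q):
    """Dihedral angle of the regular polyhedron {p, q}, in degrees.

    Uses sin(theta / 2) = cos(pi / q) / sin(pi / p).
    """
    return math.degrees(2.0 * math.asin(math.cos(math.pi / q) / math.sin(math.pi / p)))

# Faces are the flat cyclic components: C3 rings for {3, q}, C4 rings for {4, 3}.
architectures = {
    "T3 (tetrahedron)": (3, 3),
    "O3 (octahedron)": (3, 4),
    "I3 (icosahedron)": (3, 5),
    "O4 (cube)": (4, 3),
}
for name, (p, q) in architectures.items():
    print(f"{name}: {dihedral_angle_deg(p, q):.1f} degrees")
```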

**Fig. 4: Modular construction of protein nanocages from THRs.**

To generate expandable nanocages, flat cyclic components that form the faces of the cages were linked through noncovalent handshake interactions at the specified angle. For the flat cyclic component, we used a ring design with 12 repeats (R12B; Fig. 2d) constructed from curve units, and split the 12 repeats into either 3 subunits with 4 repeats each (C₃) or 4 subunits with 3 repeats each (C₄; Supplementary Fig. 3d), depending on the desired polyhedral symmetry architecture. We then fused linear THR arms onto each subunit constrained to point outward parallel to a radial vector emanating from the symmetry axis, but offset such that when the C₂ interface is formed, the C₂ axis is along a radial vector (Fig. 4a and Supplementary Fig. 12). Tetrahedral, octahedral ‘cubic’ and icosahedral structures with C₃ rings at respective axes (T₃, O₃ and I₃), and octahedral structures with C₄ rings at the respective axes (O₄) were constructed by incorporation of the appropriate C₂ interface. For example, to make a ‘cubic’ octahedral nanocage, we incorporate into the C₄ ring arm the 90° C₂ handshake module (Fig. 4e) by simple sequence concatenation. Synthetic genes were obtained for 13 nanocage designs; all 13 expressed solubly, 10 had SEC elution profiles that suggested cage formation, 8 yielded particles with the expected size by ns-EM, and 7 gave 2D class averages and symmetric 3D reconstructions that resembled the design models. Successful designs for each architecture are shown in Fig. 4b–e and Supplementary Figs. 1j and 12. Designed geometric features including the spindle-like twofold handshake interface and the flat ‘in-plane’ ring areas with distinct holes are clearly evident. For the T₃ and O₄ cages, the correct species dominated, but in O₃ and I₃ cages there were noticeable populations of species that were either partially formed or broken under ns-EM conditions (Supplementary Fig. 13). A 7.5-Å cryo-EM reconstruction and an experimental model were obtained for the cubic cage built from tetrameric rings on the faces (cage_O4_34) that were very close to the design model, with the straight helices clearly evident and only very slight deviations in the arm alignment (Fig. 4e and Supplementary Fig. 14). A 4.0-Å cryo-EM reconstruction and an experimental model for the tetrahedral cage_T3_101 were similarly very close (Fig. 4b and Supplementary Fig. 41). These results illustrate the robustness of structures that can be assembled from our regularized building blocks using simple ‘snapping together’ of complementary pieces in three dimensions, and show that with additional reinforcing mechanisms such as cooperativity, structural specificity can be achieved without traditional ‘knob-and-hole’ helix–helix interactions²⁴.

We tested the expansion in all three dimensions of the cubic design (Fig. 4e and Supplementary Fig. 14) by increasing the number of repeat units in the linear arm. We generated four different sizes of the cage_O4_34 by increasing the number of THR helices in the arm by +0, +4, +8 or +12 helices (Fig. 5b and Supplementary Fig. 13). For all sizes, ns-EM 2D class averages (Fig. 5b, bottom row) show all three symmetrical views with the designed increases in size but otherwise close preservation of architecture. Three-dimensional ns-EM reconstructions were consistent with corresponding design models, with the overall cube shape and ring circular pore clearly visible in each of the sizes (Fig. 5b, top row). The first three sizes of cage show primarily intact assemblies across the ns-EM grids; for the largest size (+12), some incomplete assembly was also observed (Supplementary Fig. 13). Additional single-component expandable nanocage designs are described in Supplementary Figs. 13, 16 and 17 and the Supplementary Discussion.

**Fig. 5: Extendable THR-based nanomaterials.**

We next designed two-component expandable nanocages by locking the rotation degrees of freedom of a THR-containing building block to maintain the expandability constraint (Methods), and then docking it against a freely sampling partner oligomer to form an O₄₃ architecture (Fig. 5c, Supplementary Discussion and Supplementary Figs. 19–23). Expandability over four different sizes was achieved with cage_O43_129 (+0, +4, +8 and +12 helices). The internal structure of the oligomers is clearly resolved in cryo-EM reconstructions for the first three sizes and in ns-EM reconstruction of the largest size; the distance between the centre of mass of the tetramer component to the centre of mass of the trimer component across the different sizes is 7.9, 9.4, 11.3 and 11.7 nm respectively (Fig. 5c and Supplementary Fig. 21). Views down each of the three symmetry axes (twofold, threefold and fourfold) are clear for each size (except for the threefold view in the largest size) with slight rotational deviations of the fourfold cyclic component compared to the design model, whereas the rotation of the threefold cyclic component holding the THR remains unperturbed as designed (Fig. 5c and Supplementary Fig. 22). A fifth size (16 additional helices) assembled into cage-like structures but the populations were too heterogeneous for detailed characterization (Supplementary Fig. 21).

For unbounded architectures that extend along one or more axes, extensibility requires that the linear THR propagation axes be parallel to the extension axes. We constructed an antiparallel assembly with an overall train track shape from THR modules (Fig. 6a). The ‘rails’ of the track are linear THRs that are uncapped to allow for unbounded linear assembly end-to-end, and C₂ ‘ties’ dock onto branch interfaces on the sides of the rails, organizing them into strutted antiparallel pairs. Adding repeats to the rails increases the spacing between ties (along the helical axis) and adding repeats to the ties increases the separation distance between rails along a different axis (Fig. 6b). We used 12-helix addition to the rail to double the spacing between ties, and 8-helix addition to the tie to roughly double the length of the tie. For the four combinations of component sizes, we obtained ns-EM 2D class averages consistent with the design models (compare Fig. 6b and Fig. 6d). Train track assembly was robust to fusion of mScarlet-i on rails both at termini and in an internal loop (Supplementary Fig. 24b), and sfGFP on the ties^25,26 (as monitored by ns-EM, with density observed for the GFP; Supplementary Fig. 24c).

**Fig. 6: Designed train track fibres.**

Discussion

On determining the first low-resolution model of the structure of a globular protein (myoglobin), John Kendrew wrote in 1958 that “Perhaps the most remarkable features of the molecule are its complexity and its lack of symmetry. The arrangement seems to be almost totally lacking in the kind of regularities which one instinctively anticipates”²⁷. More than six decades of structural biology research have shown this to be a generally appropriate description of protein structure¹. Figures 2–5 show that this complexity is not an inherent feature of the polypeptide chain: the simplicity and regularity of our designed materials approach that of the wooden beams used for constructing a house frame. This enables the resizing of designed materials in two and three dimensions simply by changing the numbers of repeat units in the THR modules with little or no need for detailed design calculations; previously this has been possible only with coiled coils and repeat proteins with open helical symmetries (propagating along a single axis)^9,15,28. The flat surfaces and regular geometry have immediate applications to the design of bio-mineralizing systems: THR monomers presenting carboxylate groups in regular arrays nucleate the mineralization of carbonate into calcite²⁹, and expandable THR systems such as the cubic assemblies in Fig. 5 presenting such arrays could provide a route to hierarchical protein–mineral hybrid materials.

There are exciting paths forward to further increase the capabilities of our programmable THR platform. First, our current multi-subunit assemblies have high symmetry, and assembly of arbitrary nanostructures would require breaking symmetry; one approach to achieving this would be to build heterodimeric and heterotrimeric interfaces between THRs, which would enable considerable shape diversification and addressability of each protein chain³⁰. This would allow access to a broad range of asymmetric nanostructures, as with DNA nanotechnology bricks, tiles and slats^31,32,33,34,35, but with the higher precision and greater functionality of proteins. Second, the materials generated here all form through self-assembly, but as the number of components increases the overall yield of the desired product could decrease. This limitation could potentially be overcome by stepwise solid-phase assembly with crosslinking after addition of each THR component (as in solid-phase peptide or DNA synthesis, but in three dimensions with the location of addition specified by non-covalent interactions between the THRs; the analogue in construction is nailing lumber pieces together after alignment). The combination of symmetry breaking and stepwise assembly would enable the design of a very wide range of protein nanomaterials based on simple geometric sketches that could be readily genetically modified to present a wide variety of functional domains in precisely controllable relative orientations.

Methods

Computational and experimental methods are all provided in the Supplementary Information.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

All data and design models are available in the main text or the Supplementary Information. EM maps have been deposited in the Electron Microscopy Data Bank (R12B: EMD-43318; strut_C6_21: EMD-29893; cage_O4_34: EMD-29915; cage_O4_34_+4: EMD-41907; cage_O43_129: EMD-42906; cage_O43_129_+4: EMD-42944; cage_O43_129_+8: EMD-42031; sC4: EMD-29974; cage_T3_101: EMD-41364; cage_O3_10: EMD-40070 (C₁ asymmetric) and EMD-40071 (octahedral symmetric); cage_T3_5: EMD-40075 (C₁ asymmetric), EMD-40074 (tetrahedral symmetric), EMD-40073 (C₁, 1 chain missing) and EMD-40073 (1 trimer missing); cage_T3_5_+2: EMD-40076). Crystallographic datasets and cryo-EM structures with resolved side chains have been deposited in the Protein Data Bank (THR1: 8G9J; THR2: 8G9K; THR5: 8GA7; THR6: 8GA6; sC4: 8GEL; cage_T3_101: 8TL7; cage_O43_129: 8V2D; cage_O43_129_+4: 8V3B).

Code availability

An example RosettaScripts script and input for generating THR building blocks are provided at https://github.com/tfhuddy/2023-manuscript-materials. The example script was confirmed to successfully run with Rosetta version 3.13 as available at https://rosettacommons.org/ (ref. ¹⁹). Documentation for ProteinMPNN sequence design is available at https://github.com/dauparas/ProteinMPNN (ref. ²⁰). Designs were filtered with AlphaFold 2 available at https://github.com/google-deepmind/alphafold (Supplementary Methods).

References

1. Berman, H. M. et al. The Protein Data Bank. Nucleic Acids Res. 28, 235–242 (2000).
2. Thomson, A. R. et al. Computational design of water-soluble α-helical barrels. Science 346, 485–488 (2014).
3. Wicky, B. I. M. et al. Hallucinating symmetric protein assemblies. Science 378, 56–61 (2022).
4. Fallas, J. A. et al. Computational design of self-assembling cyclic protein homo-oligomers. Nat. Chem. 9, 353–360 (2017).
5. Ljubetič, A. et al. Design of coiled-coil protein-origami cages that self-assemble in vitro and in vivo. Nat. Biotechnol. 35, 1094–1101 (2017).
6. Hsia, Y. et al. Design of multi-scale protein complexes by hierarchical building block fusion. Nat. Commun. 12, 2294 (2021).
7. King, N. P. et al. Computational design of self-assembling protein nanomaterials with atomic level accuracy. Science 336, 1171–1174 (2012).
8. Sheffler, W. et al. Fast and versatile sequence-independent protein docking for nanomaterials design using RPXDock. PLoS Comput. Biol. 19, e1010680 (2023).
9. Bethel, N. P. et al. Precisely patterned nanofibres made from extendable protein multiplexes. Nat. Chem. 15, 1664–1671 (2023).
10. Brodin, J. D. et al. Metal-directed, chemically tunable assembly of one-, two- and three-dimensional crystalline protein arrays. Nat. Chem. 4, 375–382 (2012).
11. Sinclair, J. C., Davies, K. M., Vénien-Bryan, C. & Noble, M. E. M. Generation of protein lattices by fusing proteins with matching rotational symmetry. Nat. Nanotechnol. 6, 558–562 (2011).
12. Ben-Sasson, A. J. et al. Design of biologically active binary protein 2D materials. Nature 589, 468–473 (2021).
13. Li, Z. et al. Accurate computational design of three-dimensional protein crystals. Nat. Mater. 22, 1556–1563 (2023).
14. Padilla, J. E., Colovos, C. & Yeates, T. O. Nanohedra: using symmetry to design self assembling protein cages, layers, crystals, and filaments. Proc. Natl Acad. Sci. USA 98, 2217–2221 (2001).
15. Woolfson, D. N. Understanding a protein fold: the physics, chemistry, and biology of α-helical coiled coils. J. Biol. Chem. 299, 104579 (2023).
16. Grigoryan, G. & Degrado, W. F. Probing designability via a generalized model of helical bundle geometry. J. Mol. Biol. 405, 1079–1100 (2011).
17. Brunette, T. J. et al. Exploring the repeat protein universe through computational protein design. Nature 528, 580–584 (2015).
18. Huang, P.-S. et al. High thermodynamic stability of parametrically designed helical bundles. Science 346, 481–485 (2014).
19. Alford, R. F. et al. The Rosetta all-atom energy function for macromolecular modeling and design. J. Chem. Theory Comput. 13, 3031–3048 (2017).
20. Dauparas, J. et al. Robust deep learning–based protein sequence design using ProteinMPNN. Science 378, 49–56 (2022).
21. Correnti, C. E. et al. Engineering and functionalization of large circular tandem repeat protein nanoparticles. Nat. Struct. Mol. Biol. 27, 342–350 (2020).
22. Coxeter, H. S. M. Regular Polytopes (Courier Corp., 1973).
23. Yeates, T. O. Geometric principles for designing highly symmetric self-assembling protein nanomaterials. Annu. Rev. Biophys. 46, 23–42 (2017).
24. Walshaw, J. & Woolfson, D. N. Extended knobs-into-holes packing in classical and complex coiled-coil assemblies. J. Struct. Biol. 144, 349–361 (2003).
25. Pédelacq, J.-D., Cabantous, S., Tran, T., Terwilliger, T. C. & Waldo, G. S. Engineering and characterization of a superfolder green fluorescent protein. Nat. Biotechnol. 24, 79–88 (2005).
26. Bindels, D. S. et al. mScarlet: a bright monomeric red fluorescent protein for cellular imaging. Nat. Methods 14, 53–56 (2016).
27. Kendrew, J. C. et al. A three-dimensional model of the myoglobin molecule obtained by X-ray analysis. Nature 181, 662–666 (1958).
28. Pyles, H., Zhang, S., De Yoreo, J. J. & Baker, D. Controlling protein assembly on inorganic crystals through designed protein interfaces. Nature 571, 251–256 (2019).
29. Davila-Hernandez, F. A. et al. Directing polymorph specific calcium carbonate formation with de novo protein templates. Nat. Commun. 14, 8191 (2023).
30. Kibler, R. D. et al. Stepwise design of pseudosymmetric protein hetero-oligomers. Preprint at bioRxiv https://doi.org/10.1101/2023.04.07.535760 (2023).
31. Wintersinger, C. M. et al. Multi-micron crisscross structures grown from DNA-origami slats. Nat. Nanotechnol. 18, 281–289 (2023).
32. Bohlin, J., Turberfield, A. J., Louis, A. A. & Šulc, P. Designing the self-assembly of arbitrary shapes using minimal complexity building blocks. ACS Nano 17, 5387–5398 (2023).
33. Petersen, P., Tikhomirov, G. & Qian, L. Information-based autonomous reconfiguration in systems of interacting DNA nanostructures. Nat. Commun. 9, 5362 (2018).
34. Sigl, C. et al. Programmable icosahedral shell system for virus trapping. Nat. Mater. 20, 1281–1289 (2021).
35. Wagenbauer, K. F., Sigl, C. & Dietz, H. Gigadalton-scale shape-programmable DNA assemblies. Nature 552, 78–83 (2017).


Acknowledgements

We thank F. Busch and V. Wysocki for providing native mass spectrometry experiments that helped debug some of our early designs; J. Decarreau for support in trying out optical microscopy with fibres; J. Quispe, S. Dickinson and V. S. Bhatt for assistance with cryo-EM data collection; L. Milles and B. Wicky for wet lab assistance; D. Hicks, H. Pyles and W. Sheffler for computational assistance; S. Boyken, C. Hague, J. Bai and L. Stewart for perspective and discussion; F. Praetorius for manuscript editing; the Arnold and Mabel Beckman Cryo-EM Center at the University of Washington for electron microscope use; and the Advanced Light Source (ALS) beamlines 8.2.1, 5.0.3 and 8.2.2 at Lawrence Berkeley National Laboratory for X-ray crystallography data collection. The Berkeley Center for Structural Biology is supported in part by the National Institutes of Health (NIH), National Institute of General Medical Sciences and the Howard Hughes Medical Institute. The ALS is supported by the Director, Office of Science, Office of Basic Energy Sciences and US Department of Energy (DOE; DE-AC02-05CH11231). This work was supported by the Institute for Protein Design Breakthrough Fund, for “De novo design of 100 nm scale protein assemblies” (T.F.H., Y.H., R.D.K. and D.B.) and “De novo design of selective pores” (Y.L. and D.B.); The Audacious Project at the Institute for Protein Design (T.F.H., J.X., E.C.Y., A.J.B., H.L.H., Z.L., R.M., A.K. and D.B.); The Open Philanthropy Project Improving Protein Design Fund (Y.H., R.R., P.J.Y.L., A.K.B., D.E., G.B. and D.B.); National Science Foundation award CHE-1629214 (D.N. and D.B.); the Helen Hay Whitney Foundation (S.J.C.); a gift from Microsoft (J.D. and D.B.); The Donald and Jo Anne Petersen Endowment for Accelerating Advancements in Alzheimer’s Disease Research (T.J.B. and D.B.); and the Howard Hughes Medical Institute (N.B., A.C., B.C. and D.B.). 
Small-angle X-ray scattering data were collected at the ALS SIBYLS beamline on behalf of US DOE-BER, through the Integrated Diffraction Analysis Technologies programme. Some of the cryo-EM data were collected on a Glacios TEM (Thermo Fisher Scientific) from NIH award S10OD023476 (J.M.K.). Parts of the cryo-EM work were supported by an Open Philanthropy subcontract via the University of Washington. Parts of the cryo-EM data processing were supported by the High Performance Computing facility at NYU School of Medicine. Some of the cryo-EM grids were screened at the Cryo-Electron Microscopy Laboratory Core at NYU School of Medicine (RRID: SCR_019202) and we thank the cryo-EM core staff for their assistance. Some of the cryo-EM data acquisition was carried out at the Simons Electron Microscopy Center and National Resource for Automated Molecular Microscopy and the National Center for cryo-EM Access and Technology located at the New York Structural Biology Center, supported by grants from the Simons Foundation (SF349247) and the NIH National Institute of General Medical Sciences (GM103310, U24 GM129539). This research used resources of the National Energy Research Scientific Computing Center, a US Department of Energy Office of Science User Facility located at Lawrence Berkeley National Laboratory, operated under contract number DE-AC02-05CH11231 using the National Energy Research Scientific Computing Center award BER-ERCAP0022018. This work was supported by the grant DE-SC0018940 MOD03 funded by the US Department of Energy, Office of Science.

Author information

These authors contributed equally: Timothy F. Huddy, Yang Hsia, Ryan D. Kibler, Jinwei Xu

Authors and Affiliations

Department of Biochemistry, University of Washington, Seattle, WA, USA
Timothy F. Huddy, Yang Hsia, Ryan D. Kibler, Jinwei Xu, Neville Bethel, Philip J. Y. Leung, Connor Weidle, Alexis Courbet, Erin C. Yang, Asim K. Bera, S. John Calise, Fatima A. Davila-Hernandez, Hannah L. Han, Kenneth D. Carr, Zhe Li, Ryan McHugh, Gabriella Reggiano, Alex Kang, Miles S. Dickinson, Brian Coventry, T. J. Brunette, Yulai Liu, Justas Dauparas, Andrew J. Borst, Justin M. Kollman & David Baker
Institute for Protein Design, University of Washington, Seattle, WA, USA
Timothy F. Huddy, Yang Hsia, Ryan D. Kibler, Jinwei Xu, Neville Bethel, Philip J. Y. Leung, Connor Weidle, Alexis Courbet, Erin C. Yang, Asim K. Bera, Fatima A. Davila-Hernandez, Hannah L. Han, Kenneth D. Carr, Zhe Li, Ryan McHugh, Gabriella Reggiano, Alex Kang, Brian Coventry, T. J. Brunette, Yulai Liu, Justas Dauparas, Andrew J. Borst & David Baker
M.S. Ramaiah University of Applied Sciences, Bengaluru, India
Deepesh Nagarajan
Department of Cell Biology, NYU School of Medicine, New York, NY, USA
Rachel Redler, Nicolas Coudray & Damian Ekiert
Molecular Engineering and Sciences Institute, University of Washington, Seattle, WA, USA
Philip J. Y. Leung
Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
Alexis Courbet & David Baker
Biological Physics, Structure and Design, University of Washington, Seattle, WA, USA
Erin C. Yang
Applied Bioinformatics Laboratories, NYU School of Medicine, New York, NY, USA
Nicolas Coudray, Damian Ekiert & Gira Bhabha
Division of Precision Medicine, Department of Medicine, NYU Grossman School of Medicine, New York, NY, USA
Nicolas Coudray
Molecular Biophysics and Integrated Bioimaging, Berkeley Center for Structural Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Banumathi Sankaran

Contributions

Conceptualization: T.F.H., Y.H., T.J.B., D.B., R.D.K., J.X., D.N., P.J.Y.L. Methodology: T.F.H., Y.H., J.X., R.D.K., P.J.Y.L., E.C.Y., B.C., T.J.B., J.D. Investigation: T.F.H., Y.H., J.X., R.D.K., N.B., D.N., R.R., P.J.Y.L., C.W., S.J.C., A.C., A.J.B., F.A.D.-H., A.K.B., H.L.H., K.D.C., Z.L., R.M., G.R., A.K., B.S., M.S.D., Y.L. Visualization: T.F.H., Y.H., R.D.K., N.C. Funding acquisition: D.B., G.B., D.E., T.F.H., R.D.K., Y.H., A.J.B. Supervision: D.B., G.B., J.M.K., D.E. Writing (original draft): T.F.H., Y.H. Writing (review and editing): T.F.H., D.B., Y.H., R.D.K., E.C.Y., A.K.B., G.B., P.J.Y.L., S.J.C., N.C.

Corresponding author

Correspondence to David Baker.

Ethics declarations

Competing interests

T.F.H., Y.H., R.D.K. and J.X. are inventors on a provisional patent application submitted by the University of Washington for the design and composition of the proteins created in this study.

Peer review

Peer review information

Nature thanks Mark Bathe, Gustav Oberdorfer and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Methods, Discussion, Figs. 1–43 and Tables 1–6.

Reporting Summary

Supplementary Data

Design names prefixed with a > symbol, followed by the expressed sequence for each design. The sequences include purification tags and the start methionine residue.

Supplementary Data

Design models with ideal backbones as well as experimentally determined structures for comparison. For reference to design names, see Supplementary Fig. 1.

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.


About this article

Cite this article

Huddy, T.F., Hsia, Y., Kibler, R.D. et al. Blueprinting extendable nanomaterials with standardized protein blocks. Nature 627, 898–904 (2024). https://doi.org/10.1038/s41586-024-07188-4


Received: 06 June 2023
Accepted: 09 February 2024
Published: 13 March 2024
Issue Date: 28 March 2024
DOI: https://doi.org/10.1038/s41586-024-07188-4
Springer Nature Limited

Associated content

Protein materials, by blueprint

Research Highlight Nature Reviews Materials 05 April 2024
