Background

Collectin-K1 (CL-K1) is a multifunctional secreted pattern-recognition lectin with key roles in host defence, tissue homeostasis and embryogenesis [1,2]. It binds to pathogen-associated molecular patterns and activates mannan-binding lectin (MBL)-associated serine proteases (MASPs-1, −2 and −3) to initiate the lectin pathway of complement activation, and stimulate immune and inflammatory processes [3,4]. It also binds to apoptotic cells and may function as a scavenger receptor to facilitate their clearance as has been shown for other collectins. Elevated serum levels are associated with a severe clotting disorder called disseminated intravascular coagulation, characterised by systemic coagulation and microvascular thrombi, that often leads to multiple organ failure [5]. During embryogenesis, CL-K1 is highly expressed in craniofacial cartilage, heart, bronchi, kidney, and vertebral bodies and together with MASP-3 probably serves as a guidance cue for neural crest cell migration by recognising specific endogenous carbohydrate epitopes [6]. Separate autosomal mutations in the genes encoding CL-K1 or MASP-3 lead to a rare autosomal recessive disorder called Carnevale, Mingarelli, Malpuech and Michels (3MC) syndrome, associated with craniofacial dysmorphism, mental retardation and genital, renal and limb abnormalities, highlighting the importance of these complexes during development.

CL-K1 is a member of the collectin family of animal lectins with an N-terminal collagen-like domain linked to C-terminal carbohydrate-recognition domains (CRDs) via a helical neck [7]. The collagenous stems are tethered together at the N-terminus to form characteristic bouquet- or fan-shaped structures. CL-K1 can form homooligomers as well as heterooligomers with the related molecule collectin-L1 (CL-L1) [8]. Clusters of CRDs recognise their carbohydrate targets through multiple weak interactions [9]. Ca2+ plays an essential role in this process by forming a ternary complex with the protein and the glycan ligand and by stabilising the CRDs [10-12]. CL-K1 binds to a variety of intact bacteria, fungi, and viruses including Escherichia coli, Candida albicans and Influenza A virus [3]. It also binds to DNA, explaining at least in part how it targets apoptotic cells [13]. Although selective for mannose and fucose, CL-K1 binds only weakly to monosaccharides (IC50 approximately 20 mM) compared to MBL (K D approximately 1 mM) [14] and other collectins, and little is known about its specificity towards oligosaccharides on self- or non-self structures [3].

Three separate disease-associated mutations have been identified in the coding region of the CL-K1 gene. All lead to changes in the primary structure of the CRD: two result in single amino acid substitutions: Ser169Pro and Gly204Ser and the third leads to the deletion of Ser217. CL-K1 is undetectable in the serum of individuals homozygous for the Gly204Ser mutation, implying that it is either not secreted or is degraded in serum. However, the other disease-associated mutations have not been characterised [6].

Here we show that CL-K1 primarily recognises a subset of high-mannose oligosaccharides containing the disaccharide motif: Man(α1-2)Man found on both self and non-self structures. It binds to both sugar moieties, in contrast to other collectins that only recognise the terminal sugar. We also show that all three mutations associated with 3MC syndrome prevent normal secretion from mammalian cells probably as a result of structural changes caused by the failure to bind Ca2+ during biosynthesis. The protein deficiency would prevent the normal recognition processes of CL-K1/MASP-3 complexes during development, leading to the 3MC phenotype.

Results

Sugar specificity of CL-K1 towards self and non-self ligands

To characterise the sugar specificity of CL-K1, we initially screened a selection of glycoproteins with well-characterised glycans (Figure 1A). Blot analysis revealed binding to immunoglobulin M (IgM), thyroglobulin and yeast invertase and mannan, but not to RNAse B, fetuin or IgG. The most discernible difference in the pattern of recognition was the presence of high-mannose oligosaccharides on the glycoproteins that were recognised, compared to predominantly complex sugars on those that were not. An exception was RNAse B, which possesses a single N-linked glycosylation site, mainly occupied by Man5 (approximately 60%) but with some Man6-Man9 [15]. To explore carbohydrate binding further, the binding kinetics was determined for yeast invertase, mannan and gp120 from HIV that all possess high-mannose oligosaccharides. CL-K1 bound tightly to all three glycoproteins, but not to milk lactoferrin that contains complex sugars (Figure 1B). In each case, the data fitted best to a two-complex model and the kinetic parameters are presented in Table 1. To confirm that carbohydrate binding leads to activation of complement, C3b deposition was measured on mannan in whole human serum depleted of endogenous lectins (Figure 1C). CL-K1 activated complement with only 2.5-fold lower activity than human MBL.

Figure 1
figure 1

Glycoprotein binding and complement activation by CL-K1. A) Immunoblot of CL-K1 binding to immobilised mammalian and yeast glycoproteins. B) Surface plasmon resonance of CL-K1 binding to glycoproteins. CL-K1 was immobilised on the chip surface (6,000 response units) and six two-fold dilutions of each glycoprotein were injected at a starting concentration of 200 nM. C) Complement component C3b was measured on mannan following incubation with human serum, depleted of endogenous lectins, and supplemented with either CL-K1 or human recombinant MBL. C3b and its breakdown products were detected by absorbance using an anti-human C3c antibody with alkaline phosphatase-conjugated goat anti-rabbit secondary antibody and p-nitrophenyl phosphate as the substrate. Protein concentrations were calculated based on the molecular masses of CL-K1 and MBL trimers (78 kDa in each case). CL-K1, collectin-K1; MBL, mannan-binding lectin.

Table 1 Kinetics of binding of CL-K1 to glycoproteins

To examine binding to individual glycans, fluorophore-labelled CL-K1 was used to probe a broad screen of mammalian carbohydrates using the Core H Glycan array technology facility at The Consortium for Functional Glycomics [16]. The array consisted of 377 natural and synthetic mammalian glycans attached to the surface of a glass microscope slide via covalent amide linkages. The relative fluorescence signal provides an accurate indication of the order of affinities [17]. Consistent with the glycoprotein data, the best ligands were all high-mannose oligosaccharides (Figure 2A and Additional file 1), but not all of these were recognised. Mammalian high-mannose structures comprise a Man5 core attached to the protein via two GlcNAc residues, with α1-2 linked sugars at the non-reducing termini (Figure 2C). CL-K1 only recognised those structures containing at least one terminal Man(α1-2)Man epitope. For example, it bound tightly to Man6, Man7, Man8 and Man9 but not to the Man5 core (ligand 310) that comprises only α1-3 and α1-6 linkages. It also bound to smaller glycans containing the essential disaccharide epitope, for example, the trisaccharide, Man(α1-2)Man(α1-3)Man (ligand 189). The GlcNAc residues were neither necessary nor enhanced binding (for example, compare Man9GlcNac2, ligand 192, and Man9, ligand 312). This pattern of recognition is different from other collectins, such as MBL and pulmonary-surfactant protein D that typically recognise the terminal sugar alone (mannose, fucose or N-acetyglucosamine) [18,19]. It is also different from other C-type lectins, such as DC-SIGN or DC-SIGNR that target larger high-mannose oligosaccharides preferentially (Man5 to Man9) [17]. To quantify the differences in affinities, we compared binding to three different epitopes that all form part of a high-mannose oligosaccharide. Inhibition by Man(α1-2)Man (IC50 of 0.02 mM) was 10- and 8-fold greater than for the Man3 and Man5 core structures, and >300-fold greater than for mannose alone (Figure 2C). Thus, CL-K1 primarily recognises the terminal disaccharide of the glycan, rather than the terminal sugar alone, explaining its unusual binding properties.

Figure 2
figure 2

Carbohydrate binding by CL-K1. A) Glycan-array analysis of CL-K1. Key: open circle, mannose; closed circle, galactose, closed square, N-acetylglucosamine; closed diamond, N-acetylneuraminic acid; open square, N-acetylgalactosamine; open diamond, glucose; and triangle, fucose. High-mannose and blood group/Lewis antigen ligands are in green and blue. B) Inhibition of CL-K1 binding to immobilized yeast invertase (10 μg per well) by high-mannose epitopes (Man2: α1-2 mannobiose; Man3: α1-3, α1-6 mannotriose; Man5: α1-3, α1-3, α1-6 Mannopentaose). Values are the mean ± error from duplicate measurements. C) Schematic representation of the mammalian high-mannose glycan (Man9GlcNAc2). Boxes show the two Man3 core structures that together form the Man5 core. CL-K1, collectin-K1.

CL-K1 also bound to subsets of fucosylated glycans, including those containing Lewisa, Lewisy and blood group B antigens as well as to a selection of structures containing blood group H antigens (Figure 2A). Most fucosylated sugars were not recognised, however, suggesting that CL-K1 contacts multiple monosaccharide moieties in each glycan and that the chemical environment of the binding epitope is important. Binding to almost all ligands was Ca2+-dependent (see Additional file 1), compatible with the pivotal role of Ca2+ in carbohydrate recognition by C-type lectins. An exception was 9-O-acetylneuraminic acid, which gave a similar binding signal in the presence and absence of Ca2+, implying that it interacts with a distinct and independent binding site on CL-K1.

Structure of CL-K1 in complex with a disaccharide ligand

In order to determine the precise specificity and mechanism of carbohydrate recognition by CL-K1, a recombinant trimeric fragment comprising the neck and CRDs was produced in E. coli, refolded from inclusion bodies and was crystallized in the presence of Ca2+, and structures were determined both with and without its primary ligand, Man(α1-2)Man (PDB: 4YLI and 4YMD for the CL-K1 trimer and the trimer bound to Man(α1-2)Man; Table 2). The structures confirm that the CL-K1 fragment is a homotrimer with the three CRDs linked via an α-helical coiled coil (Figure 3). The asymmetric unit of each crystal contained two trimers with each CRD bound to three Ca2+. Density for the sugar was present for four of the six polypeptides (Figure 3A and B). In the ligand complex, the reducing mannose of the disaccharide binds to the Ca2+ common to all C-type lectins, with the equatorial 3- and 4-OHs of the sugar forming coordination bonds with the Ca2+ and hydrogen bonds to surrounding Asn, Asp and Glu residues that also serve as Ca2+ ligands. The interaction is unusual, however, because it involves the penultimate sugar (internal in a high-mannose oligosaccharide) rather than the terminal sugar (non-reducing) that is recognised by other collectins (Figure 3C and D) [10,19]. In addition, the terminal mannose forms hydrogen bonds to the carboxylate group of Glu244 and the guanidinium group of Arg200 to form a small hydrogen-bonding network that stabilises the interaction compared to mannose alone, providing an explanation for its binding specificity. Comparison of the bound and unbound structures reveals no major conformational changes upon carbohydrate recognition. The interaction is somewhat reminiscent of glycan binding by DC-SIGN and DC-SIGNR, which also target high-mannose oligosaccharides via internal sugar moieties (Figure 3E). The ligand specificities of CL-K1 and DC-SIGN/DC-SIGNR are significantly different, however, as noted above.

Table 2 Data collection and refinement statistics
Figure 3
figure 3

Structure of a trimeric fragment of CL-K1 bound to a mannose disaccharide. A and B) top and side views. The carbohydrate is in white and Ca2+ in pink. Comparison of carbohydrate binding by C) CL-K1; D) mannan-binding lectin (PDB: 1KWY) and E) DC-SIGN (PDB: 2IT5). Polar interactions are shown by dotted lines. The penultimate sugar moiety (internal in a high-mannose oligosaccharide) is in grey. CL-K1, collectin-K1.

Ca2+ stabilises CL-K1 by organising the binding loops of the CRDs

Ca2+ is integral to the structure and function of CL-K1 (Figure 3). It not only interacts directly with the carbohydrate ligand but also binds to loops at the top of the CRD. We used the intrinsic fluorescence of the trimeric fragment as a probe to measure Ca2+ binding. Each CRD contains two tryptophan residues, which together form part of the hydrophobic core and are sensitive to changes in the conformation of the adjacent Ca2+-binding loops. Addition of Ca2+ caused a large enhancement of the native fluorescence (approximately 50%) combined with a shift in λem, max to a shorter wavelength (from 346 nm to 339 nm), as the tryptophans move to a more hydrophobic environment (Figure 4A). The K D was measured as 0.43 mM by titration with Ca2+, with a Hill coefficient of 2.4 (Figure 4B). Thus, although only one Ca2+ interacts with the ligand directly, binding itself is highly cooperative, with all three Ca2+ being important.

Figure 4
figure 4

Ca2+ stabilises the CRDs of CL-K1. A) Fluorescence spectra of the CL-K1 fragment in different buffers. The λex was 280 nm. B) Ca2+ binding to the CL-K1 fragment measured by fluorescence. Measurements were recorded at an λem of 345 nm. C) Urea denaturation in the presence and absence of Ca2+ (2 mM). D) Proteolysis of CL-K1 in the presence of increasing concentrations of Ca2+ (two fold dilutions starting at 5 mM). Samples were separated by SDS-PAGE and gels were scanned to determine the amount of CL-K1 remaining. The amount of trypsin was 0.5% (w/w). The intensities of bands corresponding to the neck and CRD and the CRD alone were combined. A sample gel is shown in F). E) Proteolysis of CL-K1 in the presence and absence of Ca2+. Two fold serial dilutions of trypsin were used at a starting concentration of 2% (w/w). In all experiments data are mean ± error from two separate experiments. CL-K1, collectin-K1; CRD, carbohydrate-recognition domain.

We also investigated the role of Ca2+ towards the stability of the CRDs of CL-K1 using urea denaturation. Unfolding of the CRD/neck fragment was accompanied by a dramatic quenching of the fluorescence and a shift in λem, max to approximately 354 nm as the tryptophans become exposed to the polar solvent (Figure 4A). Unfolding could be explained by a simple two-state transition, both in the presence and absence of Ca2+. However, the midpoint (denaturation concentration) was 2.4 M and 1.4 M urea for the Ca2+-bound and -free forms (Figure 4C), indicating that Ca2+ stabilises the CRDs.

The three Ca2+ interact with residues primarily from loops towards the top of the CRD (Figure 5A). In the absence of Ca2+, these regions are likely to be more flexible. To investigate this possibility, we compared the susceptibility of the trimeric CL-K1 fragment towards trypsin, which preferentially cleaves disordered regions of proteins. The Ca2+-free form was highly sensitive to trypsin (Figure 4E). By contrast, an approximately 15 kDa fragment was generated in the presence of Ca2+ that was relatively resistant to further proteolysis (Figure 4E and F). This fragment was identified as the CRD by Edman degradation, beginning at residue Glu148. By measuring proteolysis in the presence of increasing concentrations of Ca2+, the apparent K D for calcium was determined as approximately 1 mM, in good agreement with fluorescence data (Figure 4D and F). Thus, Ca2+ stabilises the CRD by tethering the binding loops together. In the absence of Ca2+, these loops become disordered and more exposed.

Figure 5
figure 5

No secretion of variant CL-K1s containing mutations associated with 3MC syndrome. A) Structure of the CRD of CL-K1 showing the position of mutations associated with 3MC syndrome. Secondary structure features are labelled based on the structure of MBL [38]. Polar interactions are indicated by dotted lines. B) Secretion of wild type and variant CL-K1s from mammalian cells. Serum-free medium from Chinese hamster ovary cells (20 μL) transfected with wild-type (WT) or variant cDNAs was separated on a 15% SDS-PAGE gel under non-reducing conditions and blotted onto a nitrocellulose membrane. Secreted CL-K1 was detected using rabbit polyclonal anti-CL-K1 antibodies. Purified wild-type CL-K1 (approximately 20 ng) was used as a positive control. CL-K1, collectin-K1; CRD, carbohydrate-recognition domain; MBL, mannan-bindng lectin; 3MC, Carnevale, Mingarelli, Malpuech and Michels.

Mutations associated with 3MC syndrome prevent secretion of CL-K1

As well as explaining the mode of carbohydrate recognition, the structures of CL-K1 also reveal the location of residues that are altered in the 3MC syndrome (Ser169, Gly204 and Ser217). Gly204 and Ser217 are located at the base of loop L1 (Figure 5A) that forms part of the binding site for two of the three Ca2+. The carbonyl group of Gly204 forms a hydrogen bond with the amide of Ser217, helping to anchor the loop to the side of the domain. Ser169 forms part of one of two helices at the base of the CRD, close to the helical neck of the trimer.

To characterise the disease-associated mutations, corresponding changes were introduced into the cDNA of CL-K1, and proteins were expressed in Chinese hamster ovary cells. Immunoblotting of media from transfected cells revealed that although wild-type CL-K1 was secreted as expected, no protein was detected in the media of cells transfected with any of the mutant cDNAs (Figure 5B), implying that the encoded proteins are not secreted. This conclusion would explain the protein deficiency in individuals with the Gly204Ser mutation and suggests that those individuals with Ser169Pro or ΔSer217 mutations are also likely to be deficient in CL-K1.

Defective Ca2+ binding by CL-K1 containing disease-associated mutations probably leads to elimination during biosynthesis

Surprisingly, when recombinant fragments of CL-K1 (neck and CRDs) containing the mutations (subsequently referred to as variant fragments) were produced in E. coli, they refolded like the wild-type fragment, and good yields of purified protein were isolated (1 to 2 mg/L of culture). This finding was unexpected because we anticipated that the mutations would severely destabilise the CRDs, thus preventing folding. It was therefore of interest to characterise these fragments in more detail to attempt to understand why the mutations are deleterious and prevent secretion in vivo.

Collectins normally fold in the endoplasmic reticulum (ER), in the presence of Ca2+ where resident chaperones monitor the folding status of proteins [20]. Misfolded or partially folded proteins with exposed hydrophobic regions are recognised by these chaperones, and are targeted for elimination via the ER-associated protein degradation pathway [21]. Consequently, any structural defects caused by the mutations are likely to prevent secretion. Circular dichroism indicated that all three variants were folded, with spectra characteristic of the α/β structure of the neck + CRD regions (Figure 6B). This finding was confirmed by fluorescence spectroscopy and urea denaturation experiments (Figure 7A and B; described in more detail below). The first clue that they were defective was from gel filtration analysis. Like many lectins, CL-K1 binds to dextran, so is retarded on gel filtration columns such as Superdex. This binding could be prevented by excess mannose as a competitive inhibitor, or by ethylenediaminetetraacetic acid (EDTA) that chelates the Ca2+ (Figure 6A), whereupon the wild-type fragment eluted at the position of a homotrimer (47 kDa). The variant fragments were also trimers, but did not bind to the dextran matrix even in the presence of Ca2+, implying loss of carbohydrate binding. To confirm this possibility, we measured binding to yeast invertase. Both full-length CL-K1 and the wild-type fragment bound to the invertase, but the three variant fragments did not bind (Figure 6C). Thus, although folded and assembled into trimers, the mutations disrupt the CRDs preventing normal interactions with carbohydrate ligands.

Figure 6
figure 6

Characterisation of CL-K1 fragments containing disease-associated mutations. A) Gel filtration on a Superdex 200 (10/30 column) in 50 mM Tris–HCl at pH 7.4, containing 150 mM NaCl. The elution positions of conalbumin (75 kDa), ovalbumin (44 kDa) and ribonuclease A (13.7 kDa) are indicated. Top, The wild-type CL-K1 fragment interacts with the column matrix in buffer containing Ca2+ (2 mM) but not in the absence of Ca2+ (2 mM EDTA) or in the presence of inhibitory concentrations of mannose (200 mM). Bottom, None of the three CL-K1 variants interacts with the column, even in the presence of Ca2+. B) CD spectra of the wild-type CL-K1 fragment and the disease-associated CL-K1 variants. C) Binding of biotinylated invertase to full length CL-K1 and trimeric fragments. Values are the mean ± error from duplicate measurements. Tighter binding of the full-length protein is due to its increased avidity. CL-K1, collectin-K1; EDTA, ethylenediaminetetraacetic acid.

Figure 7
figure 7

Destabilisation of variant CL-K1s caused by the failure to bind to Ca2+. A) Fluorescence spectra of variant CL-K1 trimers in the presence (solid line, 2 mM Ca2+) and absence of Ca2+ (dotted line, 2 mM EDTA) and in 6 M urea (dashed line). The λem, max of Ca2+ bound and Ca2+ free forms of the wild-type fragment are indicated by vertical dotted lines. B) Unfolding of the variant CL-K1 (lower) fragments in urea. All experiments were performed in the presence of 2 mM Ca2+ and data were recorded at 345 nM. Values are mean ± error from duplicate experiments. The denaturation concentrations of Ca2+-bound and Ca2+-free forms of the wild-type fragment are indicated by vertical dotted lines. C) SDS-PAGE gels (17.5%) showing proteolysis of variant CL-K1 fragments with two-fold serial dilutions of trypsin (highest concentration is 4% w/w). CL-K1, collectin-K1; EDTA, ethylenediaminetetraacetic acid.

The most likely reason for the failure to bind carbohydrate is loss of Ca2+ binding. As described above, failure to bind Ca2+ destabilises the CRDs and exposes flexible loop regions. The presence of such defects during biosynthesis would likely lead to elimination of CL-K1 polypeptides via ER resident chaperones. We therefore tested Ca2+ binding using fluorescence assays described above. Addition of Ca2+ to the variant CL-K1s did not cause the changes in quantum yield or λem, max that were observed upon Ca2+ binding by the wild-type fragment (Figure 7A). Furthermore, urea denaturation curves resembled the Ca2+-free form, even in the presence of Ca2+, with denaturation concentrations of 1.5, 1.3 and 1.4 M for Ser169Pro, Gly204Ser, ΔSer217 proteins respectively (Figure 7B). Likewise, each of the variants was equally sensitive to trypsin in the presence or absence of Ca2+ (Figure 7B). Thus, all three mutations induce a common structural defect that arises due to the failure to bind to Ca2+. In vivo, such defects are likely to be detected by chaperones in the ER during biosynthesis leading to elimination. In this way, the mutations would prevent normal secretion of CL-K1, accounting for the protein deficiency in patients with 3MC syndrome.

Discussion

The data described here explain for the first time how CL-K1 is able to target self- and non-self carbohydrate ligands to fulfil its diverse biological functions. Man(α1-2)Man epitopes are found at the non-reducing termini of many, but not all, mammalian high-mannose sugars. They are also found on fungal, viral and bacterial structures. For example, mannan from cell walls of C. albicans, a known target of CL-K1 [3], is rich in α1-2 linked mannose oligosaccharides [22]. Likewise, high-mannose oligosaccharides on viral haemagglutinin and gp120 enable CL-K1 to bind to Influenza A virus and HIV [3]. High-mannose oligosaccharides themselves represent only a small subset of mammalian glycans. Most glycoproteins and glycolipids display complex oligosaccharides that are based on a Man3GlcNac2 core not targeted by CL-K1, in which the two antennae are decorated with other sugars. The difference in specificity of CL-K1 compared to other collectins is due to the presence of an extended binding site on the CRD (Figure 3C). Whilst unusual for a collectin, additional interactions have been described for other C-type lectins including DC-SIGN, selectins and the macrophage receptor Mincle, each with different ligand preferences [23].

Heterocomplexes of CL-K1 and CL-L1 polypeptides may recognise additional ligands that are not targeted by CL-K1 homooligomers as a result of contacts mediated through the CRD of CL-L1. Like CL-K1, CL-L1 has a preference for ligands possessing mannose and/or fucose epitopes, although little is known about its specificity towards oligosaccharides on self or non-self structures. The presence of CL-K1/CL-L1 complexes as well as CL-K1 homooligomers in serum and in other tissue fluids may serve to increase the range of pathogens targeted by the lectin pathway. It is currently unclear, however, if CL-L1 participates in developmental processes or is involved in the aetiology of 3MC syndrome.

Ca2+ is essential for the structure and function of C-type lectins, because it stabilises the CRDs and helps position the loops in the correct arrangement for ligand binding. Although we were unable to crystallise CL-K1 in the absence of Ca2+, the structure of a Ca2+-free form of MBL reveals that the core of the CRD is largely unchanged but the loops are positioned differently with some regions being undetectable in the electron density, implying considerable flexibility [11]. A similar scenario for the CL-K1 variants would explain why the CD spectra are similar to wild type but the CRDs are much more sensitive to proteolysis.

Gly204 and Ser217 are located at the base of Ca2+-binding loop L1 (Figure 5A), so the disease-associated mutations probably interfere with the packing of this loop against the CRD, thereby preventing Ca2+ binding. The effects of the Ser169Pro mutation within Helix 1 are less clear. Nevertheless, the introduction of a proline residue in place of the serine is likely to destabilise the helix and preclude the normal interactions with the CRD. Notably, Tyr163 at the top of the helix forms part of the hydrophobic core and packs against residues in loop 1, forming a hydrogen bond with Asn206. Disruption of these contacts probably prevents Ca2+ binding, leading to the disease phenotype.

Folding of CL-K1, like other collectins, probably proceeds in a C- to N-terminal in which folding of the neck and CRDs nucleates assembly of the N-terminal collagen-like domains [20]. All three variant fragments folded and trimerised in vitro, so folding is still thermodynamically favourable, despite the mutations. It is thus likely that at least some polypeptides fold in vivo. Nevertheless, the CRDs will be defective with exposed, disordered regions, so variants are likely to be retained and subsequently eliminated by the ER-associated protein degradation pathway [21]. The possibility that the mutations prevent folding in vivo (but not in vitro) cannot be completely ruled out. However, this scenario would seem less likely, in particular because none of the mutations are greatly destabilising per se (Figure 7B). Instead, the decrease in stability in each case can be accounted for by the failure to bind to Ca2+.

The finding that CL-K1 binds to Lewis antigens and certain blood group antigens, including groups B and H, is both interesting and unexpected. Lewis y is expressed mainly during embryogenesis, and in adults expression is restricted to granulocytes and epithelial surfaces [24]. Elevated levels are also found in most epithelial-derived human carcinomas, including breast, ovary, prostate, colon cancers, and high levels are correlated with poor prognosis [24]. Platelets and coagulatory factors such as Von Willebrand Factor are all decorated with blood group antigens, and crosslinking of these components possibly leading to complement activation might account for the link between elevated CL-K1 serum levels and disseminated intravascular coagulation [25]. Notably, activation of the lectin pathway itself promotes activation of the clotting cascade potentially compounding the problem [26]. The larger binding pocket on CL-K1 and the resulting enhanced affinity for its glycan ligands compared to other collectins suggest that it may be possible to design inhibitors that selectively block ligand binding. These would have therapeutic potential for the treatment of disseminated intravascular coagulation and other disorders associated with profound activation of the innate immune and/or coagulation systems, such as in ischaemia-reperfusion injury.

Conclusions

In conclusion, we have shown that CL-K1 binds to high-mannose oligosaccharides and fucosylated sugars including blood group antigens and Lewis antigens. It binds to high-mannose oligosaccharides via an extended binding site that recognises the terminal two mannose residues. In addition, we have shown that three naturally occurring mutations associated with a severe developmental disorder syndrome prevent secretion of CL-K1 from mammalian cells, probably as a result of a failure to bind Ca2+ during biosynthesis. The resulting protein deficiency would prevent normal developmental processes mediated via CL-K1/MASP-3 complexes leading to 3MC syndrome.

Methods

Proteins

Bovine fetuin and RNAse B, human milk lactoferrin, yeast invertase and yeast mannan were from Sigma-Aldrich Company Ltd. (Gillingham, UK). Bovine thyroglobulin was from GE Healthcare (Little Chalfont, UK). Human IgM and IgG were from Athens Research & Technology, Inc. (Athens, Georgia, USA). Recombinant HIV gp120 was produced in HEK293 cells [27]. Recombinant human MBL was produced in CHO cells as described for rat MBL [28].

Production of full-length CL-K1 and trimeric fragments

For full-length protein, the cDNA was cloned into the polylinker region of expression vector pED [29] and protein was produced in Chinese hamster ovary cells and purified by affinity chromatography on mannose-Sepharose columns as described previously for MBL [28,30]. A fragment of the cDNA encoding the neck and CRD (beginning at residue Ser116) was cloned into the polylinker of pET28a. Cells were grown in Power Prime broth (Molecular Dimensions Limited (Newmarket, UK)), induced during mid-log phase with IPTG (1 mM), and harvested after growth at 37°C for an additional 16 hours. Inclusion bodies were isolated and resuspended in 50 mM Tris–HCl, containing 8 M urea and 5 mM dithiotreitol (DTT) and protein was refolded by drop dilution into 50 mM Tris–HCl pH 8.0, containing 9.6 mM NaCl, 0.4 mM KCl, 2 mM MgCl2, 2 mM CaCl2, 0.5 M arginine, 0.05% polyethylene glycol 3,550, 1 mM GSH and 0.1 mM GSSH at a final protein concentration of 0.1 mg/ml. Proteins were purified by ion exchange chromatography on a 10 mL Q-Sepharose column, using a 0.05 to 1 M gradient of NaCl in 20 mM Tris–HCl at pH 8.0, followed by gel filtration on a Superdex 75 16/60 column (GE Healthcare) in 20 mM Tris at pH 7.5 containing 50 mM NaCl and 2 mM CaCl2.

Binding of CL-K1 to glycoproteins

Two-fold serial dilutions of glycoprotein (0.5 μl at a starting concentration of 2 mg/ml) were spotted on to a nitrocellulose membrane and allowed to dry for 30 minutes at room temperature. The membrane was blocked with 5% milk powder in 25 mM Tris–HCl, pH 7.4 containing 150 mM NaCl for two hours, washed with buffer + 0.5% Tween-20 and then incubated with biotinylated CL-K1 (0.06 mg/ml) at room temperature for one hour. After washing, the membrane was incubated with Streptavidin-ALP conjugate (2ug/ml; Life Technologies Ltd (Paisley, UK)) and developed with BCIP-NBT substrate (Sigma).

Glycan-array screen

Human CL-K1 directly labelled with Alexα-488 was diluted to 200 μg/ml in 20 mM Tris–HCl, pH 7.4 containing 150 mM NaCl, 2 mM MgCl2, 12 mM CaCl2, 1% BSA and 0.05% Tween 20. Labelled protein (70 μL) was applied to the printed surface of the microarray chip coated with natural and synthetic glycans via amino linkers. The chip was covered and incubated at room temperature in a humidified chamber for one hour. Binding was detected by measuring the fluorescence associated with each glycan. Data are the average of six replicates.

Surface plasmon resonance

CL-K1 was immobilised on the surface of a GLM sensor chip (Bio-Rad Laboratories Ltd (Hemel Hempstead, UK)) by amine coupling. Binding was measured in 10 mM Tris–HCl, pH 7.4 containing 140 mM NaCl, 2 mM CaCl2 and 0.005% Tween 20 at a flow rate of 25 μl/minute and at 25°C. The protein surface was regenerated in buffer containing 1 M NaCl and 5 mM EDTA.

Complement activation

Complement activation was measured by deposition of complement component C3b on immobilised mannan. Microtiter plates (Nunc) were coated with 2 μg mannan overnight and incubated with 1/100 dilution of whole human serum, previously depleted of endogenous lectins by passage through two mannose-Sepharose columns (2 × 0.5 ml matrix/mL of serum) in 4 mM barbital, 145 mM NaCl, 2 mM CaCl2, 1 mM MgCl2, pH 7.4, supplemented with recombinant human MBL or human CL-K1. After incubation at 37°C for one hour and washes with buffer containing 0.5% Tween-20, C3b was detected using rabbit anti-human C3c antibody (Dako UK Ltd (Ely, UK)) with alkaline phosphatase-conjugated goat anti-rabbit secondary antibody (Sigma) and p-nitrophenyl phosphate as the substrate.

Circular dichroism

All measurements were recorded on a Jasco J-715 spectropolarimeter in 10 mM phosphate buffer, pH 7.4 and at 25°C using 6 μM protein.

Fluorescence spectroscopy

All experiments were carried out using a Fluoromax-4 (Horiba Jobin Yvon, Horiba UK Ltd (Middlesex, UK)) instrument with excitation wavelength at 280 nm, slit widths of 2 nm and emission wavelength range of 300 to 400 nm. The λex was 280 nm. Protein samples (0.5 μM) in 10 mM Tris–HCl, pH 7.4 containing 140 mM NaCl were measured in the presence of 2 mM CaCl2, 1 mM EDTA and 6 M urea. For urea denaturation, proteins (0.5 μM) were equilibrated in urea (0 to 6 M) for 15 minutes at room temperature and measurements were recorded at λem of 345 nm.

Proteolysis

CL-K1 (0.5 mg/ml) was incubated with trypsin (maximum concentration of 4% w/w) in 50 mM Tris–HCl pH 7.5, containing 140 mM NaCl for one hour at 37°C.

Binding assays

MaxiSorb microtiter plates (Nunc) were coated with yeast invertase by overnight incubation in 0.2 M carbonate-bicarbonate buffer at pH 9.6 and at 4°C. Wells were blocked with 5% BSA in 10 mM Tris-Cl, 140 mM NaCl, pH 7.4 and incubated with biotinylated CL-K1 (50 nM) mixed with dilutions of carbohydrate ligand for one hour at room temperature. After washing, wells were incubated with 1:2000 dilution of goat peroxidase conjugated anti-biotin antibody (Sigma) and developed with tetramethylbenzidine (Sigma). Absorbance was measured at 450 nM. CL-K1 binding to invertase was carried out in the same way except that plates were coated with CL-K1 and incubated with dilutions of biotinylated invertase.

Crystallization and structure determination

All crystals were grown using the sitting-drop vapour diffusion method by mixing equal volumes (1.2 + 1.2 μL) of protein and reservoir solution. Protein at approximately 3 mg/ml was mixed with 10% (vol/vol) ethanol in 100 mM HEPES pH 7.0 containing 4 mM CaCl2. Similar conditions were used to crystallize the complex, except that Man(α1-2)Man (1 mM) was included in the mixture. All crystals were transferred to reservoir solution containing 20% (vol/vol) glycerol, before storage in liquid nitrogen and were maintained at 100 K during data collection. Ligand (1 mM) was included in the cryoprotection buffer for the complex. Diffraction data were collected at beamlines I04 and I04-1 at Diamond Light Source and were processed with iMosflm [31]. Phases were determined by molecular replacement with Phaser [32] using the structure of pulmonary-surfactant protein D [33] as a search model (PDB: 1B08). Models were optimized by using cycles of manual refinement with Coot [34] and refinement in Refmac5 [35], part of the CCP4 software suite [36], and in Phenix [37].