Biological context

The Notch signaling pathway is highly conserved in the animal kingdom and is responsible for cell–cell communication. The Notch signal is activated when ligands on a signal-sending cell bind to the Notch receptor on a signal-receiving cell, resulting in regulated proteolysis and release of the intracellular domain of Notch (NICD). This protein fragment translocates to the nucleus and following interaction with the transcription factor CSL and a coactivator Mastermind regulates transcription of a subset of genes. Canonical signaling via ligand-mediated activation of the Notch receptor influences many aspects of cell growth, differentiation and death which are vital for embryonic development and maintenance of the adult organism. Dysregulation of the pathway results in both inherited and acquired diseases including cancer. The outcome of the Notch signal is often context specific (Bray 2006; Guruharsha et al. 2012; Kopan and Ilagan 2009). For example, Notch signaling can be oncogenic, as in hematopoietic malignancies, but can also have tumour suppressor functions, such as in solid tumours (Koch and Radtke 2007). In addition, Notch is most likely a cancer stem cell factor (Pannuti et al. 2010).

The downstream consequences of Notch activation have been studied widely, and high resolution structures of the NRR and NICD region (Fig. 1) have facilitated the understanding of receptor activation and signaling (for review see Chillakuri et al. 2012). However, the molecular mechanisms behind receptor/ligand recognition and regulation by O-glycosylation are not yet understood. The membrane distal region of Notch (Fig. 1) consists of 36 EGF-like domains organized as multiple tandem repeats. A critical role for EGF11 and EGF12 in ligand binding was identified by deletion studies which showed these domains were sufficient to confer ligand binding in cell aggregation assays (Rebay et al. 1991). More recently Drosophila Notch EGF8 has been shown to contain an additional binding site for the ligand Serrate (Yamamoto et al. 2012). The high resolution X-ray structure of EGF11–13 showed rigid interdomain contacts (with twist and tilt angles of 119° ± 5° and 17° ± 4° respectively (Cordle et al. 2008), in good agreement with an earlier NMR structure (Hambleton et al. 2004) and a near linear organisation. Other multiple tandem repeats of Notch EGF domains which bind calcium are likely to have similar extended structures since residues involved in interface stabilisation are highly conserved. Subsequent site-directed mutagenesis has mapped the ligand-binding site to the central β-hairpin of hN-1 EGF12 (Whiteman et al. 2013), which is post-translationally modified by O-glycosylation (Taylor et al. 2014).

Fig. 1
figure 1

Modular organization of the extracellular domain of the human Notch-1 receptor. The negative regulatory region and the transmembrane domain (TM) are indicated. Individual domains belonging to the Notch intracellular domain are not indicated separately. Calcium-binding EGF domains are indicated in red. Schematic pictures of the constructs EGF4–7, EGF4–6, EGF5–7 and EGF4–5 used in the NMR study are shown. Each EGF domain contains six highly-conserved cysteine residues paired in a 1–3, 2–4, 5–6 arrangement to stabilize domain structure. There are six residues between the 6th cysteine of an EGF domain and the 1st cysteine of the following domain in EGF4–7. Since EGF5 and EGF7 have a calcium-binding consensus sequence, the linker length for EGF4/EGF5 and EGF6/EGF7 comprises two residues; the other four residues are constrained by calcium binding. In the case of EGF5/EGF6 the conformation of the six amino acids is unknown and will be the subject of a future structural study

Sequence inspection demonstrates that a number of EGF-like domains of hNotch-1 do not bind Ca2+; this includes EGF6, EGF10 and EGF22. This could result in very different interfaces which would alter the overall architecture of the receptor, such as is seen in other EGF-containing proteins (Brandstetter et al. 1995; Kettle et al. 1999; Lee et al. 2004). This information is crucial to understand the molecular mechanisms underlying Notch activation and regulation. We have therefore begun a study of these three regions by NMR to understand their structure and dynamics in solution. In this report we present 1H, 13C and 15N assignments for the quadruple EGF-like domain construct EGF4–7 expressed in E. coli.

Methods and experiments

Protein expression, purification and refolding

Protein expression and purification was carried out as described (Knott et al. 1996; Whiteman et al. 2014). The protein fragment, consisting of EGF4 to EGF7 of human Notch-1, was expressed in E. coli BL21 cells transformed with a pQE30 (Qiagen) based protein expression construct and a pREP4 plasmid for control of expression via the Lac Repressor. The expression vector contained an N-terminal His6 tag for purification, a factor Xa protease recognition site (IEGR) for later removal of the His6 tag, an SA spacer, followed by residue Q140 of EGF4; the last residue in the construct was E294. For the constructs EGF4–6, EGF5–7 and EGF4–5, the first and last residues were Q140/E256, D178/E294 and Q140/R217, respectively; all constructs have an SA spacer at their N-terminus following factor Xa cleavage of the His6 tag.

15N or 15N/13C-double-labeled protein was produced by growing cells in M9 medium containing 0.1 % (w/v) 15NH4Cl and 0.5 % (w/v) unlabeled glucose, or 0.1 % (w/v) 15NH4Cl and 0.5 % (w/v) 13C-glucose (Cambridge Isotope Laboratories), 100 μg/ml ampicillin and 25 μg/ml kanamycin, as described previously (Muranyi et al. 2004). 100 ml of starter culture, grown in labelled medium at 37 °C for ~18 h, was added to 0.5 l of medium. Cells were grown to OD600 ~0.8 and expression was induced with isopropyl-β-D-thiogalactopyranoside (IPTG) at a final concentration of 100 μM. Cells were incubated at 18 °C for at least 6 h. Cells were spun down at 5,000 rpm for 10 min and lysed with 6 M guanidine-HCl, 50 mM sodium phosphate, 5 mM β-mercaptoethanol for 1 h at room temperature (RT). DNA was sheared by sonication for 1 min at 20 Watt (Jencons Ultrasonic Processor) and the cell lysate was spun down at 40,000 rpm for 45 min (Beckman L7–55). The supernatant was loaded onto a Ni2+ chelating Sepharose column (GE Healthcare) and eluted with buffer containing 50 mM EDTA and 100 mM Tris pH 8.3. The protein was reduced for 1 h at RT by addition of dithiothreitol (DTT) to a final concentration of 0.1 M. The pH was then lowered to pH ~ 2, by addition of concentrated HCl, and the solution was dialysed against 0.1 % (v/v) trifluoroacetic acid (TFA) overnight. The solution was filtered through 0.2 μM filters (Millex-GP) and the soluble fraction purified by reverse-phase HPLC using a Beckman Gold system with reverse phase C8 column. Purified, reduced protein was refolded under the following conditions: 100 mM Tris–HCl pH 8.3, 3 mM l-cysteine, 0.3 mM l-cystine, 0.2 mg/ml protein at 37 °C for ~48 h. The refolding mixture was acidified to pH ~ 2 and dialysed against 0.1 % (v/v) TFA for at least 5 h. Protein was then concentrated by ultrafiltration and purified by HPLC. After lyophilisation, the His6 tag was removed by incubation overnight with one unit of bovine factor Xa (DENZYME) per mg of protein in 50 mM Tris–HCl pH 7.5, 0.1 M NaCl, 1 mM CaCl2 at 37 °C. Protein was further purified by cation-exchange fast liquid protein chromatography and HPLC. The final product was analysed by SDS-PAGE in the presence and absence of 5 % (v/v) 2-mercaptoethanol. Electrospray ionisation mass spectrometry was performed to check the correct mass of the constructs (16603 Dalton for the unlabeled EGF4–7 construct).

NMR spectroscopy

Ca2+ binding in EGF4–7 was followed by 1D NMR for a sample in 100 % D2O, 5 mM Tris, 150 mM NaCl, pH 7.5 and a protein concentration of ~0.5 mM. Protein concentrations for resonance assignment experiments were 1 mM for 15N-single-labelled, and 0.5 mM for 15N/13C-double-labelled material, in 95 % H2O/5 % D2O (v/v) at pH 6.1 and a Ca2+ concentration of 30 mM. All NMR experiments were carried out at 298 K on home-built spectrometers with triple-resonance probes and GE/Omega data acquisition systems operating at 1H-operating frequencies of 500, 600, 750 and 950 MHz. In addition, a Bruker Avance 500 MHz spectrometer with a Cryoplatform, equipped with a TCI CryoProbe, was used.

Assignment of the spectrum of EGF4–7 was carried out using a dissection approach; NMR spectra were collected for the full-length EGF4–7 protein but also for a series of shorter constructs EGF4–6, EGF5–7 and EGF4–5 (Fig. 1). Initially, sequential backbone assignments were obtained with data from 3D 15N-edited TOCSY-HSQC and NOESY-HSQC experiments (datasets collected for EGF4–7, EGF4–5 and EGF4–6). Sequential 1H and 15N assignments were confirmed and extended to 13C with 3D HNCA (for EGF4–7, EGF4–6 and EGF5–7), HNCO (for EGF4–7 and EGF5–7), HN(CA)CO (for EGF5–7), CBCANH (for EGF4–7 and EGF5–7), CBCA(CO)NH (for EGF4–7, EGF4–6 and EGF5–7) and HBHA(CO)NH (for EGF4–6 and EGF5–7) experiments with 15N/13C-double labeled material. Side-chain assignments were obtained using these experiments, supplemented with 3D HCCH-TOCSY (for EGF4–6 and EGF5–7), 3D 13C-edited NOESY-HSQC (for EGF4–6 and EGF5–7) and 2D 1H-1H NOESY (for all constructs). NMRPipe was used for NMR data processing (Delaglio et al. 1995) and CcpNmr Analysis for subsequent analysis (Vranken et al. 2005).

Assignments and data deposition

The correct fold of the EGF4–7 construct was confirmed using Ca2+ titrations monitored by 1D NMR (Fig. 2). The chemical shifts of resonances from the consensus aromatic residue located between the 3rd and 4th cysteines of a calcium-binding EGF domain and from the aromatic ‘packing’ residue located between the 5th and 6th cysteine of the preceding domain are known to be sensitive to calcium binding (Knott et al. 1996). Changes in the aromatic region of the spectrum around 6.7–6.8, 7.1 and 7.3 ppm are observed upon addition of 250–500 μM Ca2+ (Fig. 2); peaks in these regions correspond to Y279, the consensus aromatic in EGF7, and F249, the packing aromatic in EGF6. At higher concentrations of added Ca2+, peak shifts are observed around 7.7–7.8 ppm; peaks in this region correspond to H196, which is located near to the calcium-binding site in EGF5, and H171, which is next to the packing aromatic (F170) in EGF4. These data demonstrate calcium binding in EGF5 and EGF7 and formation of interfaces of these domains with EGF4 and EGF6, respectively.

Fig. 2
figure 2

Ca2+ titration of EGF4–7 in 100 % D2O, 5 mM Tris, 150 mM NaCl, pH 7.5. The concentration of added CaCl2 is indicated on the left side of the 1D spectra. Black arrows highlight the chemical shift changes arising from Y279 and F249, which are indicative of calcium binding in EGF7 and the formation of a packing interface with EGF6. At higher concentrations of added calcium, chemical shift changes in the histidine region of the spectrum arise from H196 and H171 are indicative of calcium binding in EGF5 and the formation of a packing interface with EGF4

The spectrum of EGF4–7 was assigned using a dissection approach with a series of shorter constructs EGF4–6, EGF5–7 and EGF4–5 (Fig. 1). In general, peaks in the HSQC spectrum of EGF4–7 could be assigned to a specific EGF domain by comparison with spectra of the smaller constructs. Some construct-dependent chemical shift differences were observed in the proximity (~2–3 residues) of the N- and C-termini and in calcium-binding sites. For example, differences are observed for V199 and G200, located near to the calcium-binding site in EGF5, in the EGF4–7 and EGF5–7 constructs; in the former these residues are involved in a packing interaction with F170 located in EGF4 while in the later construct this interaction is missing due to the absence of EGF4.

Resonances from residues Q140 to E294 were assigned for EGF4–7; Fig. 3 shows a 1H-15N HSQC spectrum of EGF4–7. All non-proline backbone 1HN and 15N resonances were assigned except N180, E181 and C182 at the beginning of EGF5; these residues also gave broad resonances in the shorter EGF4–5 construct that otherwise showed a very high signal-to-noise ratio. Further assignments were 93.1 % of Hα, 93.5 % of Hβ, 55.9 % of Hγ, 58.4 % of Hδ and 39.0 % of Hε, 91.6 % of 13C’, 95.5 % of 13Cα, 94.9 % of 13Cβ, 50.0 % of 13Cγ, 31.8 % of 13Cδ. Most missing assignments corresponded to residues at the C-terminus of EGF5 and at the N-terminus of EGF6, where residues showed line broadening. The chemical shift assignments for EGF4–7 have been deposited in the BioMagResBank (http://www.bmrb.wisc.edu) under the accession number 25172.

Fig. 3
figure 3

500 MHz 1H-15N HSQC spectrum of the human Notch-1 EGF4–7 construct at pH 6.1, 30 mM Ca2+, 298 K, in 95 % H2O/5 % D2O