Analysis of nucleoside-binding proteins by ligand-specific elution from dye resin: application to Mycobacterium tuberculosis aldehyde dehydrogenases

Kim, Chang-Yub; Webster, Cecelia; Roberts, Justin K. M.; Moon, Jin Ho; Alipio Lyon, Emily Z.; Kim, Heungbok; Yu, Minmin; Hung, Li-Wei; Terwilliger, Thomas C.

doi:10.1007/s10969-009-9073-z

Analysis of nucleoside-binding proteins by ligand-specific elution from dye resin: application to Mycobacterium tuberculosis aldehyde dehydrogenases

Open access
Published: 13 November 2009

Volume 10, pages 291–301, (2009)
Cite this article

Download PDF

You have full access to this open access article

Journal of Structural and Functional Genomics

Analysis of nucleoside-binding proteins by ligand-specific elution from dye resin: application to Mycobacterium tuberculosis aldehyde dehydrogenases

Download PDF

Chang-Yub Kim¹,
Cecelia Webster²,
Justin K. M. Roberts²,
Jin Ho Moon³,
Emily Z. Alipio Lyon¹,
Heungbok Kim¹,
Minmin Yu⁴,
Li-Wei Hung⁵ &
…
Thomas C. Terwilliger¹

2358 Accesses
11 Citations
Explore all metrics

Abstract

We show that Cibacron Blue F3GA dye resin chromatography can be used to identify ligands that specifically interact with proteins from Mycobacterium tuberculosis, and that the identification of these ligands can facilitate structure determination by enhancing the quality of crystals. Four native Mtb proteins of the aldehyde dehydrogenase (ALDH) family were previously shown to be specifically eluted from a Cibacron Blue F3GA dye resin with nucleosides. In this study we characterized the nucleoside-binding specificity of one of these ALDH isozymes (recombinant Mtb Rv0223c) and compared these biochemical results with co-crystallization experiments with different Rv0223c-nucleoside pairings. We found that the strongly interacting ligands (NAD and NADH) aided formation of high-quality crystals, permitting solution of the first Mtb ALDH (Rv0223c) structure. Other nucleoside ligands (AMP, FAD, adenosine, GTP and NADP) exhibited weaker binding to Rv0223c, and produced co-crystals diffracting to lower resolution. Difference electron density maps based on crystals of Rv0223c with various nucleoside ligands show most share the binding site where the natural ligand NAD binds. From the high degree of similarity of sequence and structure compared to human mitochondrial ALDH-2 (BLAST Z-score = 53.5 and RMSD = 1.5 Å), Rv0223c appears to belong to the ALDH-2 class. An altered oligomerization domain in the Rv0223c structure seems to keep this protein as monomer whereas native human ALDH-2 is a multimer.

Ion-Exchange Chromatography: Basic Principles and Application

Functionalizing tandem mass tags for streamlining click-based quantitative chemoproteomics

Article Open access 10 April 2024

Nikolas R. Burton & Keriann M. Backus

Identification of potential JNK3 inhibitors through virtual screening, molecular docking and molecular dynamics simulation as therapeutics for Alzheimer’s disease

Article 04 April 2024

Bharti Devi, Kailash Jangid, … Vinod Kumar

Introduction

Nucleosides and their derivatives in cellular metabolism are well recognized as energy carriers in metabolic transactions, essential chemical links for intracellular signals, and constituents of nucleic acids (DNA and RNA) [1, 2]. The metabolism of nucleosides is vital to a cell’s survival, and about half of all enzymes are nucleoside-dependent, representing one of the largest and most important classes of cellular proteins [3]. Determining structure–function relationships of nucleoside-binding proteins is therefore a significant component of the structural genomics of proteins that are crucial to cell function. Two major challenges in such studies are the significant number of target proteins, and the problem of sorting proteins according to their nucleoside specificity.

Nucleoside ligand-binding to individual proteins typically occurs with high specificity, and is often sensitive to slight changes at the protein interaction site [4–6]. Elucidating specific interactions between ligands and proteins helps to derive functional insights for many proteins [7, 8], complementing bioinformatics approaches, and may provide the sole source of functional information for hypothetical proteins.

The aldehyde dehydrogenases (ALDH) comprise a large family of proteins which metabolize various endogenous and exogenous substrates [9, 10]. The human genome contains 19 putative ALDH genes and three pseudogenes. Many of them are regulated in response to oxidative stress and over expressed in various tumors [11, 12]. ALDH enzymes have multiple catalytic and non-catalytic functions in ester hydrolysis, antioxidant properties, xenobiotic bioactivation and UV light absorption, and also play important roles in embryogenesis, development and neurotransmission [13, 14]. Mutations in these genes cause subsequent inborn errors in aldehyde metabolism, such as Sjögren-Larsson syndrome [15], Type II hyperprolinaemia and gamma-hydroxybutyric aciduria [16], and pyridoxine-dependent seizures [17]. Human ALDH-2 is important as a nitroglycerin reductase [18] and an activator of NADPH oxidases [9], and for the major function of elimination of toxic aldehydes which lead to lipid peroxidation, protein/enzyme dysfunction, structural damage and apoptosis in alcohol related disorders [19], such as alcohol liver disease [20], heart disease [21] and gastrointestinal cancer [22].

The Mycobacterium tuberculosis (Mtb) genome encodes ten putative ALDH proteins potentially associated with seven different ALDH classes (Table 1), suggesting that they have diversity comparable to human ALDHs [13, 14]. The presence of a Rossman consensus sequence [23] in four of ten predicted Mtb ALDHs (Table 1) further suggests diversity at the level of nucleotide binding at the cofactor binding site.

Table 1 Predicted ALDH family proteins in the M. tuberculosis Genome^a

Full size table

In a previous report, we used dye-ligand elution chromatography to screen for nucleoside-binding proteins in Mtb cell extracts and to analyze the specificity of nucleoside-protein interactions [24]. That study identified 26 native Mtb proteins binding to Cibacron Blue resin that were specifically eluted with nucleosides. Four of these 26 proteins were members of the ALDH family, as shown in Table 1. The large number of ALDH proteins in the relatively small Mtb genome, and the many essential functions of ALDH proteins in human cells, suggest potential critical roles of Mtb ALDHs for survival in its human host environment.

In this report, we purify one of these Mtb ALDHs (Rv0223c), characterize its nucleoside specificity, use ligands that interact with Rv0223c to improve its crystallization, present the first structure of an Mtb ALDH, and show its close structural similarity to human mitochondrial ALDH-2.

Materials and methods

Cloning and expression of putative Mtb ALDHs

Four Mtb ALDH genes (Rv0223c, Rv0458, Rv2858c, and Rv3293) were previously identified as interacting with nucleosides using dye-resin chromatography and ligand-specific elution [24]. Each targeted ALDH gene was amplified by PCR from a M. tuberculosis H37Rv COSMID library as the template with Pfu proof-reading DNA polymerase (Stratagene), using the 5′ NdeI primer, 5′-AGATATACATATG + (N-terminal 21 bases of target sequence)-3′, and the 3′ BamHI primer, 5′-AATTCGGATCC + (C-terminal 23 bases of target sequence)-3′. The underlined bases represent the NdeI and BamHI sites, respectively. The PCR amplicon was digested with NdeI and BamHI restriction endonucleases (NEB), and cleaned using Qiaquick PCR spin column (Qiagen). The product was ligated into a modified pET-28 vector containing a C-terminal 6-His tag, in frame with the BamHI restriction site using T4 DNA ligase (New England BioLabs), and transformed into BL21(DE3) (Novagen). The expressed proteins contained the C-terminal tag GSHHHHHH, where GS is encoded by the BamHI restriction site (GGATCC). BL21(DE3) 3 ml cell culture was tested for the expression of heterologous protein by binding on a Cobalt-chelated Talon superflow bead slurry (Clontech) and SDS–PAGE analysis.

Cell culture was performed as described by Studier [25] with some modifications. Transformed cells were inoculated into 3 ml seed culture media (1 mM MgSO₄, 0.5% glucose, 17 amino acids of 100 μg/ml for each Na-Glu, Asp, Lys-HCl, Arg-HCl, His-HCl, Ala, Pro, Gly, Thr, Ser, Gln, Asn, Val, Leu, Ile Phe, Trp, metal mix of 50 μM Fe, 20 μM Ca, 10 μM Mn, 10 μM Zn, 2 μM for each Co, Cu, Ni, Mo, Se and B, 5 mM PO₄, 5 mM Na, 2.5 mM K, 2.5 mM NH₄ and 1.25 mM SO₄), and grown overnight at 37°C. From the seed culture, 500 μl was inoculated into 500 ml auto-induction media, containing 1 mM MgSO₄, metal mix (same as seed culture), 0.5% glycerol, 0.5% glucose, 0.2% α-lactose, NPS (same as seed culture), and 35 μg/ml kanamycin. After cells were grown at 37°C until OD₆₀₀ reaches 0.5, the growth was continued at 20°C for approximately 16 h until the OD₆₀₀ reached approximately 15. The cells were harvested and stored at −80°C.

The cell pellet was lysed by sonication in 10 ml of buffer A (20 mM Tris–HCl, pH 8.0, and 100 mM NaCl) per gram of cells for 10 min in 30 s pulses at 10°C. The cell debris was removed by ultra-centrifugation for 30 min at 38,000 rpm using a Ti 60 rotor (Beckman). The clear supernatant was filtered through a 0.45 μm pore membrane and loaded on a 5 ml Talon superflow affinity column equilibrated with buffer A. After washing with 30 ml buffer A and 20 ml buffer B (20 mM Tris–HCl, pH 8.0, 100 mM NaCl and 20 mM imidazole), the His-tagged Rv0223c (and the other ALDHs) was eluted from the cobalt affinity column using Buffer C (20 mM Tris–HCl, pH 8.0, 100 mM NaCl and 300 mM imidazole). The eluted fraction was dialyzed against Buffer D (20 mM Tris–HCl, pH 8.0, 100 mM NaCl and 10 mM β-mercaptoethanol) and purified by gel filtration on a Superdex-75 column (GE Healthcare Inc.) using Buffer D for equilibration and elution. The peak fractions (monitored at OD₂₈₀) were analyzed by SDS–PAGE and the pooled protein fractions were concentrated using a Centricon Plus-20 (Millipore) up to 35 mg/ml, which was measured by Bradford assay with IgG (Bio-Rad) as a standard. The purity of each protein was estimated to be higher than 95% based on densitometry of SDS–PAGE gels [26].

Screening of Rv0223c for interactions with multiple nucleosides and nucleotides

Recombinant proteins were evaluated for their ligand-binding properties using a modified affinity elution chromatography protocol [27]. Individual proteins were diluted to 2 mg/ml in column buffer (CB, containing 50 mM potassium phosphate, pH 7.5, 1 mM MgCl₂ and 2 mM DTT) and adsorbed to multiple small aliquots of F3GA resin (100 μg protein per 10 mg resin) in 2 ml spin-columns (Costar, Fisher Scientific). Binding was for 1 h at 4°C with very gentle vortexing, followed by recovery of unbound protein (flow-through fraction) and washing of the resin (4 × 0.4 ml washes with CB); spin-columns were micro-centrifuged for 30 s at 10,000×g, to recover fractions and change solutions. Individual spin-columns containing resin-bound proteins were then incubated (as for protein binding, above) with 50 μl 1 mM test ligand in CB, and the elution fractions recovered by centrifugation. Protein which remained bound to the resin was recovered by heating at 95°C for 5 min in 100 μl SDS sample buffer, and centrifugation (resin fraction). Aliquots of initial protein, spin-column flow-through and eluate fractions were diluted 1:1 with 2× SDS sample buffer, and loaded in equal proportion (equivalent to 1 μg input protein) on 15% gels and stained with silver.

Rv0223c protein–ligand crystallization and data collection

Crystallization experiments were carried out by the hanging-drop vapor diffusion method [28] at room temperature (25°C) using 24-well plates. Recombinant Rv0223c was tested for the presence of bound nucleotides (see Supplementary Methods); the results indicated that the protein has at most 0.1:1 bound NAD or other nucleotide (see Supplementary Fig. 1). Each protein–ligand solution was prepared by mixing protein (0.68 mM in solution of Buffer D) with the corresponding ligand (20 mM in H₂O) at a molar ratio of 1:2 protein:ligand. The mixtures were incubated at room temperature for 30 min prior to setting up crystallization experiments. The final concentration of protein in each protein–ligand mixture was between 0.60 and 0.63 mM. The ligands used were NAD, NADH, NADP, NADPH, adenosine, AMP, ADP, ATP, GTP, FAD, FMN and Cibacron Blue F3GA (free dye). Crystals were grown from drops consisting of 1 μl protein–ligand solution mixed with 1 μl of reservoir solution against a reservoir containing 0.1 M MES (pH 6.0) and 0.8 M ammonium sulfate, for 3 days at room temperature. For some ligands (e.g. ATP), the effect of Mg⁺⁺ was tested for crystallization, but without any noticeable effect, crystallization experiments reported were executed without Mg⁺⁺. Native and SeMet Rv0223c-NAD complex crystals were flash-cooled in liquid N₂ with the addition of 10% glycerol in the crystallization buffer as cryoprotectant. Three-wavelength selenium multi-wavelength anomalous dispersion (MAD) data was collected at the beam line 5.0.2 at the Advanced Light Source (ALS). A native data set at a resolution of 1.8 Å was collected at the beam line 8.2.1 at the ALS. Both data sets were processed with the HKL2000 program suite [29].

Structure determination and data analysis of Rv0223c protein–ligand complexes

Initial phasing was carried out with the program SOLVE [30] using the MAD data set. The resulting experimental map was density modified and traced using the program RESOLVE [31]. The protein model was further improved and built with the ARP/WARP package [32, 33] against the 1.8 Å native data. Manual model rebuilding was carried out with programs COOT [34]. The final model of this complex is deposited in the Protein Data Bank (http://www.rcsb.org) as entry 3B4W, and has R/R _free values of 0.18/0.20 at a resolution of 1.8 Å, after refinement with the phenix.refine program from the PHENIX software package [35]. Difference electron density maps were calculated for each Rv0223c-ligand complex by refining the structure of the Rv0223c protein (without ligands or solvent molecules) against the observed structure factor amplitudes for each complex. The resulting crystallographic phases were used to construct an (mF _o − DF _φ)e ^iφc difference map [36]. The LigandFit algorithm for automated ligand-fitting in PHENIX [37] was used to identify the location of the largest contiguous regions of high density in the difference map. In this algorithm the contour level for identification of contiguous regions of density is set to a level such that the largest region is approximately the size of the anticipated ligand. In this way, the location of this largest region gives an indication of the location of the ligand. The difference electron density for each complex is shown in the region of the NAD in the Rv0223c protein-NAD complex and is displayed with PyMOL [38]. The overall structure comparison between Mtb Rv0223c and human mitochondrial ALDH-2 was performed by the DaliLite program [39].

Results

Sequence similarities of four Mtb ALDHs captured by dye-ligand chromatography to each other and to human ALDH proteins

In Table 2, we examine the degree of sequence identity among these four Mtb ALDHs and their closest human homologs in more detail. The sequence identities show that the four Mtb proteins have more sequence homology to specific human ALDHs than to each other; for example, Rv0223c, Rv0458, and Rv2858c show distinctively smaller e-values and higher percentages of identical residues when aligned with human ALDH class 1 and 2 protein (than when they are aligned with other Mtb ALDHs) by the NCBI Blastp program, and Rv3293 shows the same result with human ALDH class 7.

Table 2 Homology analysis of Mtb ALDHs being compared with other Mtb ALDH or Human ALDH by the NCBI Blastp program

Full size table

Analysis of the specificity of ligand–protein interactions with recombinant Rv0223c using dye-ligand chromatography

We used a modified version of our dye-resin/ligand-elution procedure to examine the specificity of ligand binding of one of the four Mtb ALDHs (see Materials and Methods). In this ligand-specific elution screen, recombinant Rv0223c was adsorbed on small aliquots of F3GA resin in spin-columns and assayed for elution by twelve nucleotides and nucleosides using one ligand per column. By using purified recombinant protein and one ligand per column, we identify effects of individual ligands on the stability of the dye-Rv0223c protein complex. We expect that those ligands that cause the Rv0223c protein to elute from the resin are likely to bind specifically to the protein, although non-specific interactions could potentially also cause elution to occur [24].

Figure 1 shows that the Rv0223c protein was eluted by NAD and NADH but only weakly by NADP and NADPH. This result is consistent with the dinucleotide preferences of homologous NAD-dependent aldehyde dehydrogenases (E.C. 1.2.1.3) from other organisms [40] and is addressed further below in the context of structural data. Additionally, the elution of Rv0223c protein by AMP/ADP/ATP is consistent with adenylate-binding by other dehydrogenases (cf. [3]).

The differences in the release of Rv0223c from the affinity resin by various ligands were substantial (Fig. 1), and the elution of the purified recombinant Mtb ALDH with ATP is consistent with the observations from experiments with crude cytosolic extracts (Fig. 1; Table 3 of [24]. Comparing the methods used for recombinant proteins versus native proteins in cell extracts, we note that the method for cell extracts has some potential complications. As cell extracts are mixtures of many proteins of various abundances, binding and detection of minor proteins may be difficult. Moreover, some proteins retained on the column may be bound via other proteins in native complexes, rather than directly to the dye, while others may have increased or unchanged affinity for the dye when bound to ligands [24]. Finally, the order of elution, especially when several related ligands are used, could potentially influence gel spot intensity and identification of some proteins due to depletion by earlier ligand(s). This method is therefore only useful as a positive screen; any protein eluted by a specific ligand is a possible in vivo target of that ligand, whereas failure to detect a given protein is uninformative. In contrast, our analysis of recombinant proteins eliminates most of these variables, and allows the direct comparison of a protein’s affinity for an array of ligands at the dye-binding site.

Overall, Fig. 1 indicates that NAD, NADH, ATP, ADP, AMP and FAD all interact with Rv0223c protein, and that the highest affinity is for NADH and ADP. This pattern was identical in 3 replicate experiments. Further it indicates that NADP, NADPH, adenosine, GTP, FMN, and NMN interact only weakly with Rv0223c at the dye-binding site; ligand binding elsewhere in the protein cannot be ruled out.

Crystallization and X-ray diffraction of Mtb Rv0223c with multiple ligands

Ligands that bind specifically to proteins are potentially useful for protein crystallization and structure determination [41]. We therefore examined the effects of the nucleotide/nucleoside ligands shown in Fig. 1 on the crystallization of recombinant Rv0223c protein. We carried out a systematic screen for conditions that led to crystallization of Rv0223c in the presence of NAD, and found one condition that led to crystals of the Rv0223c+NAD complex diffracting to high resolution (1.8 Å; see Materials and Methods; Table 3). We then carried out crystallization trials of the purified protein under this condition, but in the presence of either no ligand or the various nucleoside ligands used in the dye-ligand chromatography assay. As shown in Fig. 2, ligand-free protein generally yielded either very small or no crystals in repeated tests for crystallization (in one experiment, crystals were obtained that diffracted to a relatively low resolution of about 3 Å). In contrast, co-crystallization with several ligands (NAD, NADH, NADP, adenosine, AMP, GTP and FAD) enhanced protein crystal formation, yielding crystals diffracting to high resolution suitable for structural analysis. Other ligands either hardly affected crystallization (NADPH, ADP and ATP) or promoted precipitation (FMN, and non-resin-bound F3GA). NMN was not tested. As summarized in Table 4, (although further screening of crystallization conditions may be required for the interactions between Rv0223c and ADP, ATP and F3GA) the strongly interacting ligands (NAD and NADH) enhanced crystal formation, leading to higher resolution data than the moderately or weakly interacting ligands (AMP, FAD, adenosine, GTP and NADP).

Table 3 Data collection and refinement statistics

Full size table

Table 4 Summary of ligand elution from dye and co-crystallization of Rv0223c

Full size table

The structure of the Rv0223c+NAD complex was determined by MAD phasing at a resolution of 1.8 Å (PDB ID 3B4W; Table 3). The structures of all the complexes, Rv0223c+ligand (NADH, AMP, FAD, adenosine, GTP or NADP) were isomorphic with this one, with less than 1% changes in cell parameters. Consequently, we were able to use the structure of the Rv0223c protein (without ligand) from the Rv0223c+NAD complex to calculate phases and a difference electron density map for each of these structures (after refinement using the observed structure factors for each structure) using the diffraction data indicated in Table 4.

Location of nucleoside ligands in the Rv0223c structures

The crystals obtained with NAD, and with its reduced form NADH, were the largest and best-diffracting crystals (resolution range 1.8–2.0 Å) that we obtained in repeated experiments, and the ligand was most clearly defined in the difference electron density maps for these complexes (Fig. 3) although the crystals could potentially contain NAD because NADH showed ~50% conversion to NAD after about 3 days in the crystallization condition used. These data suggest that Rv0223c forms the most well-defined complexes with NAD and NADH, consistent with the gene annotation (putative aldehyde dehydrogenase).

The other ligand-Rv0223c protein complexes diffracted to 2.3–3.0 Å, and for GTP, NADP and AMP, the strongest difference electron density corresponding to a bound ligand was found at the NAD-binding site (Fig. 3). In the case of adenosine-Rv0223c co-crystals, the second-largest region of contiguous density was at the NAD-binding site, but the density was weak. In a final case (FAD), no clear peaks of difference density were found. The varied electron density of ligands binding to a common site in Rv0223c suggests that the conformation of the ligands at this site may vary and that there may be considerable flexibility in binding at this site. It seems possible that the ligands that are normally involved in the function of Rv0223c might bind in a more specific fashion than those that are binding adventitiously, but we cannot be certain, as we do not know the ligand specificity of Rv0223c necessary for its function.

These structural studies provide independent evidence for various protein–ligand interactions detected using dye-ligand chromatography (Fig. 1). The crystallization data together with structural and biochemical analyses show strong and specific interactions between NAD (or NADH) and Rv0223c, while weaker interactions were also detected between Rv0223c and several other purine nucleosides/nucleotides (adenosine, AMP, GTP, NADP and FAD).

Structural homology between His tagged recombinant Rv0223c and human mitochondrial ALDH

The structure of Mtb Rv0223c is the first reported structure from the Mtb ALDH family. We compared this structure with that of its human homologue, NAD-dependent mitochondrial ALDH, ALDH-2 (PDB ID 1CW3) [42], using the DaliLite server [39] to align the structures, as shown in Fig. 4. The choice of human ALDH-2 for comparison was based on sequence similarity (Table 2), and because Rv0223c has generally been categorized as a class 2 enzyme [43, 44]. The structure of Rv0223c closely overlapped that of human ALDH-2, with an overall rmsd of 1.5 Å over 184 residues in common. Only minor regions of incongruity were found, and these were in peripheral regions of the protein (Fig. 4).

The orientation of NAD in Rv0223c was similar to human ALDH-2 (data not shown) [45]. As Perez-Miller and Hurley described, the adenine ring of NAD as located in the hydrophobic pocket formed by helices αD and αE in the coenzyme domain, with the nicotinamide end facing and interacting with residues of the catalytic domain. The amino acid residues within a 5 Å range of the NAD binding site (residues in red in Fig. 5) were substantially conserved. Of the 25 residues of Rv0223c ALDH within this range, 14 were identical to those in the human protein. The spatial locations of all 14 of these conserved residues were very close in the two structures with rmsd 1.1 Å (versus rmsd 1.5 Å for the structures overall), and had a similar orientation (data not shown), suggesting that these are critical amino acids for the interaction with NAD.

We noticed that Mtb Rv0223c is a monomer, whereas human mitochondrial ALDH is an octamer in crystal lattices [42, 45]. Both samples used for structural analysis were expressed in E. coli, but the Mtb Rv0223c protein has a His-tag at its C-terminus. In the human mitochondrial ALDH, the C-terminus of the protein is part of the oligomerization domain. It is possible that the His-tag of Rv0223c may interfere with oligomerization. However, gel filtration chromatography data with His-tagged and non-tagged Rv0223c proteins indicated that the His-tag does not influence the oligomerization state of the protein as reflected in the monomeric elution times of both proteins from the column (data not shown). It is more likely that the Rv0223c protein is monomeric for another reason, as the C-terminal residues of Mtb Rv0223c (Val486, Thr485, and Tyr484) are on the opposite side of the oligomerization domain from the side which is involved in monomer–monomer contacts based on the human ALDH-2 structure (PDB ID 1CE3). (Note that the His-tag is not visible in the electron density map and is therefore not shown in Fig. 4).

Sequence homology between Mtb ALDH Rv0223c and human mitochondrial ALDH

The partial amino acid sequences of Rv0223c and human mitochondrial ALDH were aligned based on secondary structure for sequence homology analysis, as shown in Fig. 5. The full length Mtb and human sequences are shown (487 and 518 amino acids, respectively), and were determined by NCBI Blastp to be 39% identical and 53% similar. The two ALDH signature sequences identified by Prosite [46], and including cysteine and glutamic acid active site residues, were conserved in both proteins and found at identical locations.

Discussion

The dye-resin chromatography method generates data about the interaction of protein and specific nucleoside ligand(s), and the identified ligand(s) may help stabilize the protein and improve the target protein’s crystallization (Fig. 2). Structural data obtained from such crystals may also show the interactions of the ligand with the protein. In conjunction with functional information derived from the identification of ligands, this may help elucidate the biochemical role of the protein. Additionally, since many drugs are analogues of nucleosides, proteins bound by nucleoside analogue drugs can be identified through this approach as a biomedical application (Fig. 5 of [24].

The native Mtb Rv0223c protein did not crystallize readily. As shown in Fig. 2, co-crystallization with NAD and NADH, which showed strong interaction by dye-ligand chromatography (Fig. 1), greatly improved the crystal quality and yielded structures of the complexes at resolutions of 1.8–2.0 Å (Fig. 3; Table 3). In contrast, NADP, adenosine, GTP, AMP and FAD, which showed relatively weak interactions (Fig. 1), generated lower resolution (2.3 Å or higher) structural data. We note that the ligands ADP and ATP, which showed strong interactions with Rv0223c, did not generate crystals (Table 4). Figure 3 reveals that ligands share a common interaction site within Rv0223c, and that Rv0223c contains a hydrophobic pocket and adenine recognition motif (composed of Gln217 and Glu239) in the coenzyme domain, which is a common motif for binding adenine-derivative coenzymes [47]. This information, in combination with our biochemical data on protein–ligand interactions (Fig. 1) and crystallization data from protein–ligand mixtures (Fig. 2), supports the conclusion that the binding of each ligand to Rv0223c occurs via contacts of the ligand to both the adenine recognition motif and hydrophobic pocket. Further, the stability of different protein–ligand crystals and their quality for high resolution structure determination depends on the degree of stabilization of protein by the interaction of the nicotinamide moiety of NAD(H) with residues in the catalytic domain.

A major bottleneck in structural genomics projects is the production of crystals suitable for analysis by X-ray diffraction. The demonstration in this work that information on ligands generated by dye-ligand chromatography can be used to identify ligands binding to nucleoside-binding proteins and to improve crystallization may be a significant contribution to overcoming this bottleneck.

There are several extensions to our method that we will make the process higher throughput. Using LC-MS (or LC-MS-MS) systems to identify native proteins in cell extracts that are eluted by specific ligands [48] may allow the protein identification process to be finished immediately after the ligand elution. In addition to Cibacron Blue F3GA dye resin, which was used in our dye-ligand chromatography approach, there are several other dye resins which are known to interact with a specific group of proteins [49]. These resins may be useful in binding groups of related proteins in a high throughput manner. The techniques described here and in Roberts et al. [24] could be applied to newly sequenced organisms to identify the metabolically important proteins that interact with nucleoside ligands, and to complement the annotation of each gene based on the interacting ligand(s) [51].

Abbreviations

DNA:: Deoxyribonucleic acid
RNA:: Ribonucleic acid
AMP:: Adenosine-5′-monophosphate
ADP:: Adenosine-5′-diphosphate
ATP:: Adenosine-5′-triphosphate
FAD:: Flavin-adenine dinucleotide
FMN:: Flavin mononucleotide
NMN:: Nicotinamide mononucleotide
GTP:: Guanosine-5′-triphosphate
IPTG:: Isopropyl-β-d-thiogalactoside
NAD:: Nicotinamide adenine dinucleotide
NADP:: Nicotinamide adenine dinucleotide phosphate
LC:: Liquid chromatography
MS:: Mass spectrometry

References

Nelson LD, Cox MM (2005) Lehninger principles of biochemistry, 4th edn. W.H. Freeman & Co
Voet D, Voet JG (2004) Biochemistry, vol. 1, 3rd edn. Wiley, NJ
Google Scholar
Scopes RK (1994) Protein purification: principles and practice, 3rd edn. Springer, NY
Google Scholar
Eppink MH, Overkamp KM, Schreuder HA, Van Berkel WJ (1999) Switch of coenzyme specificity of p-hydroxybenzoate hydroxylase. J Mol Biol 292:87–96
Article CAS PubMed Google Scholar
Goward CR, Nicholls DJ (1994) Malate dehydrogenase: a model for structure, evolution, and catalysis. Protein Sci 3:1883–1888
Article CAS PubMed Google Scholar
Wang X, Kemp RG (1999) Identification of residues of Escherichia coli phosphofructo- kinase that contribute to nucleotide binding and specificity. Biochemistry 38:4313–4318
Article CAS PubMed Google Scholar
George RA, Spriggs RV, Bartlett GJ, Gutteridge A, MacArthur MW, Porter CT, Al-Lazikani B, Thornton JM, Swindells MB (2005) Effective function annotation through catalytic residue conservation. Proc Natl Acad Sci USA 102:12299–12304
Article CAS PubMed Google Scholar
Edgar RC, Batzoglou S (2006) Multiple sequence alignment. Curr Opin Struct Biol 16:368–373
Article CAS PubMed Google Scholar
Kono H, Rusyn I, Yin M, Gabele E, Yamashina S, Dikalova A, Kadiiska MB, Connor HD, Mason RP, Segal BH, Bradford BU, Holland SM, Thurman RG (2000) NADPH oxidase-derived free radicals are key oxidants in alcohol-induced liver disease. J Clin Invest 106:867–872
Article CAS PubMed Google Scholar
Szöcs K, Lassègue B, Wenzel P, Wendt M, Daiber A, Oelze M, Meinertz T, Műnzel T, Baldus S (2007) Increased superoxide production in nitrate tolerance is associated with NAD(P)H oxidase and aldehyde dehysrogenase 2 downregulation. J Mol Cell Cardiol 42:1111–1118
Article PubMed Google Scholar
Jelski W, Szmitkowski M (2008) Alcohol dehydrogenase (ADH) and aldehyde dehydrogenase (ALDH) in the cancer diseases. Clin Chim Acta 395:1–5
Article CAS PubMed Google Scholar
Wenzel P, Műller J, Zurmeyer S, Schuhmacher S, Schulz E, Oelze M, Pautz A, Kawamoto T, Wojnowski L, Kleinert H, Műnzel T, Daiber A (2008) ALDH-2 deficiency increases cardiovascular oxidative stress—evidence for indirect antioxidative properties. Biochem Biophys Res Commun 367:137–143
Article CAS PubMed Google Scholar
Marchitti SA, Brocker C, Stagos D, Vasilou V (2008) Non-P450 aldehyde oxidizing enzymes: the aldehyde dehydrogenase superfamily. Expert Opin Drug Metab Toxicol 4:697–720
Article CAS PubMed Google Scholar
Vasilou V, Nebert DW (2005) Analysis and update of the human aldehyde dehysrogenase (ALDH) gene family. Hum Genomics 2:138–143
Google Scholar
Rizzo WB (2007) Sjögren-Larsson syndrome: molecular genetics and biochemical pathogenesis of fatty aldehyde dehydrogenase deficiency. Mol Genet Metab 90:1–9
Article CAS PubMed Google Scholar
Gibson KM (2005) Gamma-hydroxybutyric aciduria: a biochemistry’s education from a heritable disorder of GABA metabolism. J Inherit Metab Dis 28:247–263
Article CAS PubMed Google Scholar
Kanno J, Kure S, Narisawa A, Kamada F, Takayanagi M, Yamamoto K, Hoshino H, Goto T, Takahashi T, Haginoya K, Tsuchiya S, Baumeister FA, Hasegawa Y, Aoki Y, Yamaguchi S, Matsubara Y (2007) Allelic and non-allelic heterogeneities in pyridoxine dependent seizures revealed by ALDH7A1 mutational analysis. Mol Genet Metab 91:384–389
Article CAS PubMed Google Scholar
Chen Z, Zhang J, Stamler JS (2002) Identification of the enzymic mechanism of nitroglycerin bioactivation. Proc Natl Acad Sci USA 99:8306–8311
Article CAS PubMed Google Scholar
Cederbaum AI (1989) Role of lipid peroxidation and oxidative stress in alcohol toxicity. Free Radic Biol Med 7:537–539
Article CAS PubMed Google Scholar
Tuma DJ (2002) Role of malondialdehyse-acetaldehyde adducts in liver injury. Free Radic Biol Med 32:303–308
Article CAS PubMed Google Scholar
Zhang X, Li SY, Brown RA, Ren J (2004) Ethanol and acetaldehyde in alcohol cardiomyopathy: from bad to ugly en route to oxidative stress. Alcohol 32:175–186
Article CAS PubMed Google Scholar
Seitz HK, Maurer B, Stickel F (2005) Alcohol consumption and cancer of the gastrointestinal tract. Dig Dis 23:297–303
Article PubMed Google Scholar
Kleiger G, Eisenberg D (2002) GXXXG and GXXXA motifs stabilize FAD and NAD(P)-binding Rossman folds through C^α-H···O hydrogen bonds and van der Waals interactions. J Mol Biol 323:69–76
Article CAS PubMed Google Scholar
Roberts JKM, Webster C, Terwilliger TC, Kim C-Y (2009) High-throughput analysis of nucleoside- and nucleotide-binding by proteins. In: Hicks MG, Kettner C (eds) Proceedings of Systems Chemistry Symposium, Beilstein Institute, 26–30 May 2008, Bozen, Italy, pp 207–219 (http://www.beilstein-institut.de/index.php?id=222&L=3)
Studier FW (2005) Protein production by auto-induction in high-density shaking cultures. Protein Expr Purif 41:207–234
Article CAS PubMed Google Scholar
Watanabe T (1995) Analysis of a polyacrylamide gel electorphoretogram of beagle serum protein by laser densitometer. Lab Anim Sci 45:295–298
CAS PubMed Google Scholar
Kim C-Y, Takahashi K, Nguyen TB, Roberts JKM, Webster C (1999) Identification of a nucleic acid binding domain in eukaryotic initiation factor eIFiso4G from wheat. J Biol Chem 274:10603–10608
Article CAS PubMed Google Scholar
McPherson A (1999) Crystallization of biological macromolecules. Cold Spring Harbor Laboratory Press, Plainview
Google Scholar
Otwinowski Z, Minor W (1997) Processing of X-ray diffraction data collected in oscillation mode. Methods Enzym 276:307–326 Academic Press
Article CAS Google Scholar
Terwilliger TC, Berendzen J (1999) Automated MAD and MIR structure solution. Acta Cryst D55:849–861
CAS Google Scholar
Terwilliger TC (1999) Reciprocal-space solvent flattening. Acta Cryst D55:1863–1871
CAS Google Scholar
Lamzin VS, Wilson KS (1993) Automated refinement of protein models. Acta Cryst D49:129–147
CAS Google Scholar
Perrakis A, Sixma TK, Wilson KS, Lamzin VS (1997) wARP: improvement and extension of crystallographic phases by weighted averaging of multiple-refined dummy atomic models. Acta Cryst D53:448–455
CAS Google Scholar
Emsley P, Cowtan K (2004) Coot: model-building tools for molecular graphics. Acta Cryst D60:2126–2132
CAS Google Scholar
Adams PD, Grosse-Kunstleve RW, Hung L-W, Ioerger TR, McCoy AJ, Moriarty NW, Read RJ, Sacchettini JC, Sauter NK, Terwilliger TC (2002) PHENIX: building new software for automated crystallographic structure determination. Acta Cryst D58:1948–1954
CAS Google Scholar
Read RJ (1986) Improved fourier coefficients for maps using phases from partial structures with errors. Acta Cryst A42:140–149
CAS Google Scholar
Terwilliger TC, Klei H, Adams PD, Moriarty NW, Cohn JD (2006) Automated ligand fitting by core-fragment fitting and extension into density. Acta Cryst D62:915–922
CAS Google Scholar
DeLano WL (2002) The PyMol molecular graphics system, http://www.pymol.org
Holm L, Park J (2000) DaliLite workbench for protein structure comparision. Bioinformatics 16:566–567 (http://www.ebi.ac.uk/Tools/dalilite/)
Google Scholar
Perozich J, Kuo I, Lindahl R, Hempel J (2001) Coenzyme specificity in aldehyde dehydrogenase. Chem Biol Interact 130–132:115–124
Article PubMed Google Scholar
Vedadi M, Niesen FH, Allali-Hassani A, Fedorov OY, Finerty JPJ, Wasney GA, Yeung R, Arrowsmith C, Ball LJ, Berglund H, Hui R, Marsden BD, Nordlund P, Sundstrom M, Weigelt J, Edwards AM (2006) Chemical screening methods to identify ligands that promote protein stability, protein crystallization, and structure determination. Proc Natl Acad Sci USA 103:15835–15840
Article CAS PubMed Google Scholar
Zhou NLJ, Hurley TD, Weiner H (1999) Human liver mitochondrial aldehyde dehydrogenase: three-dimensional structure and the restoration of solubility and activity of chimeric forms. Protein Sci 8:2784–2790
Article PubMed Google Scholar
Ni L, Zhou J, Hurley TD, Weiner H (1999) Human liver mitochondrial aldehyde dehydrogenase: three-dimensional structure and the restoration of solubility and activity of chimeric forms. Protein Sci 8:2784–2790
Article CAS PubMed Google Scholar
Yoshida A, Rzhetsky A, Hsu LC, Chang C (1998) Human aldehyde dehydrogenase gene family. Eur J Biochem 251:549–557
Article CAS PubMed Google Scholar
Perez-Miller SJ, Hurley TD (2003) Coenzyme isomerization is integral to catalysis in aldehyde dehydrogenase. Biochemistry 42:7100–7109
Article CAS PubMed Google Scholar
Hulo N, Bairoch A, Bulliard V, Cerutti L, Cuche B, De Castro E, Langendijk-Genevaux PS, Pagni M, Sigrist CJA (2006) The PROSITE database. Nucleic Acids Res 34:D227–D230
Article CAS PubMed Google Scholar
Denessiouk KA, Rantanen V-V, Johnson MS (2001) Adenine recognition: a motif present in ATP-, CoA-, NAD-, NADP-, and FAD-dependent proteins. Proteins 44:282–291
Article CAS PubMed Google Scholar
Wysocki VH, Resing KA, Zhang Q, Cheng G (2005) Mass spectrometry of peptides and proteins. Methods 35:211–222
Article CAS PubMed Google Scholar
Denizli A, Pişkin E (2001) Dye-ligand affinity systems. J Biochem Biophys Methods 49:391–416
Article CAS PubMed Google Scholar
Kabsch W, Sander C (1983) Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22:2577–2637
Article CAS PubMed Google Scholar
Brylinski M, Skolnick J (2008) A threading-based method (FINDSITE) for ligand-binding site prediction and functional annotation. Proc Natl Acad Sci USA 105:129–134
Article CAS PubMed Google Scholar
Vasiliou V, Nebert DW (2005) Analysis and update of the human aldehyde dehydrogenase (ALDH) gene family. Hum Genomics 2:138–143
CAS PubMed Google Scholar

Download references

Acknowledgments

The authors are grateful to N. Maes for her technical assistance. We also would like to thank the staff at the BL 5.0.2 and BL 8.2.1 managed by the Berkeley Center for Structural Biology (BCSB) at the ALS for technical support. The BCSB is supported in part by the National Institutes of Health, National Institute of General Medical Sciences. The ALS is supported by the Director, Office of Science, Office of Basic Energy Sciences, of the US Department of Energy under Contract No. DE-AC02-05CH11231. This work was in part supported by the LANL-UCR CARE program (STB-UC:06-29) and the NIGMS Protein Structure Initiative program (NIH U54 GM074946).

Open Access

This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Author information

Authors and Affiliations

Advanced Measurement Science Group (B-9), Bioscience Division, MS M888, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA
Chang-Yub Kim, Emily Z. Alipio Lyon, Heungbok Kim & Thomas C. Terwilliger
Department of Biochemistry, University of California, Riverside, CA, 92521, USA
Cecelia Webster & Justin K. M. Roberts
Institute of Life Sciences and Natural Resources, College of Life Sciences & Biotechnology, Korea University, Seoul, Korea
Jin Ho Moon
Physical Biosciences Division, Lawrence Berkeley National Laboratory, MS4R0230, Berkeley, CA, 94720, USA
Minmin Yu
Physics Division, MS D454, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA
Li-Wei Hung

Authors

Chang-Yub Kim
View author publications
You can also search for this author in PubMed Google Scholar
Cecelia Webster
View author publications
You can also search for this author in PubMed Google Scholar
Justin K. M. Roberts
View author publications
You can also search for this author in PubMed Google Scholar
Jin Ho Moon
View author publications
You can also search for this author in PubMed Google Scholar
Emily Z. Alipio Lyon
View author publications
You can also search for this author in PubMed Google Scholar
Heungbok Kim
View author publications
You can also search for this author in PubMed Google Scholar
Minmin Yu
View author publications
You can also search for this author in PubMed Google Scholar
Li-Wei Hung
View author publications
You can also search for this author in PubMed Google Scholar
Thomas C. Terwilliger
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chang-Yub Kim.

Electronic supplementary material

Below is the link to the electronic supplementary material.

DOC 24 kb

PDF 50 kb

Rights and permissions

Open Access This is an open access article distributed under the terms of the Creative Commons Attribution Noncommercial License (https://creativecommons.org/licenses/by-nc/2.0), which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Reprints and permissions

About this article

Cite this article

Kim, CY., Webster, C., Roberts, J.K.M. et al. Analysis of nucleoside-binding proteins by ligand-specific elution from dye resin: application to Mycobacterium tuberculosis aldehyde dehydrogenases. J Struct Funct Genomics 10, 291–301 (2009). https://doi.org/10.1007/s10969-009-9073-z

Download citation

Received: 27 May 2009
Accepted: 29 October 2009
Published: 13 November 2009
Issue Date: December 2009
DOI: https://doi.org/10.1007/s10969-009-9073-z

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Analysis of nucleoside-binding proteins by ligand-specific elution from dye resin: application to Mycobacterium tuberculosis aldehyde dehydrogenases

Abstract

Similar content being viewed by others

Ion-Exchange Chromatography: Basic Principles and Application

Functionalizing tandem mass tags for streamlining click-based quantitative chemoproteomics

Identification of potential JNK3 inhibitors through virtual screening, molecular docking and molecular dynamics simulation as therapeutics for Alzheimer’s disease

Introduction

Materials and methods

Cloning and expression of putative Mtb ALDHs

Screening of Rv0223c for interactions with multiple nucleosides and nucleotides

Rv0223c protein–ligand crystallization and data collection

Structure determination and data analysis of Rv0223c protein–ligand complexes

Results

Sequence similarities of four Mtb ALDHs captured by dye-ligand chromatography to each other and to human ALDH proteins

Analysis of the specificity of ligand–protein interactions with recombinant Rv0223c using dye-ligand chromatography

Crystallization and X-ray diffraction of Mtb Rv0223c with multiple ligands

Location of nucleoside ligands in the Rv0223c structures

Structural homology between His tagged recombinant Rv0223c and human mitochondrial ALDH

Sequence homology between Mtb ALDH Rv0223c and human mitochondrial ALDH

Discussion

Abbreviations

References

Acknowledgments

Open Access

Author information

Authors and Affiliations

Corresponding author

Electronic supplementary material

DOC 24 kb

PDF 50 kb

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Analysis of nucleoside-binding proteins by ligand-specific elution from dye resin: application to Mycobacterium tuberculosis aldehyde dehydrogenases

Abstract

Similar content being viewed by others

Ion-Exchange Chromatography: Basic Principles and Application

Functionalizing tandem mass tags for streamlining click-based quantitative chemoproteomics

Identification of potential JNK3 inhibitors through virtual screening, molecular docking and molecular dynamics simulation as therapeutics for Alzheimer’s disease

Introduction

Materials and methods

Cloning and expression of putative Mtb ALDHs

Screening of Rv0223c for interactions with multiple nucleosides and nucleotides

Rv0223c protein–ligand crystallization and data collection

Structure determination and data analysis of Rv0223c protein–ligand complexes

Results

Sequence similarities of four Mtb ALDHs captured by dye-ligand chromatography to each other and to human ALDH proteins

Analysis of the specificity of ligand–protein interactions with recombinant Rv0223c using dye-ligand chromatography

Crystallization and X-ray diffraction of Mtb Rv0223c with multiple ligands

Location of nucleoside ligands in the Rv0223c structures

Structural homology between His tagged recombinant Rv0223c and human mitochondrial ALDH

Sequence homology between Mtb ALDH Rv0223c and human mitochondrial ALDH

Discussion

Abbreviations

References

Acknowledgments

Open Access

Author information

Authors and Affiliations

Corresponding author

Electronic supplementary material

DOC 24 kb

PDF 50 kb

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation