Background

The plant apoplast comprises the cell wall matrix and the intercellular spaces, and plays a major role in a wide range of physiological processes, including water and nutrient transport [1], plant-pathogen interactions, and perception and transduction of environmental signals [2, 3]. Proteins present in the plant apoplast reflect this broad functional diversity. Studies on the dynamic change of apoplast protein composition revealed new insights into plant responses to abiotic stress [47], nutrient supply [810], wounding [11], water deficiency [12, 13], pathogen response [1416] and xylem composition [17, 18]. The selection of a suitable extraction protocol is a crucial step in proteomics surveys as proteins reveal a high degree of biochemical heterogeneity and investigated plant materials can be characterized by the presence of non-protein components interfering with subsequent analytical techniques, e.g. two-dimensional gel electrophoresis (2-DE) or liquid chromatography-mass spectrometry (LC-MS). These biological realities led to the establishment of sample preparation methods for numerous plant species and tissues, such as Arabidopsis leaves [19], papaya leaves [20], sunflower leaves [21], cotton seedlings [22], apple and strawberry fruit [23], potato tuber [24], grapevine leaves and roots [25], grape berry cell wall [26], rubber latex [27], cotton fibers [28], banana meristem [29] and chloroplast [30], among others. Despite their biological significance, investigations on apoplastic proteins are hampered due to their low abundance compared to intracellular protein concentrations. The extraction of proteins from the leaf and root apoplast is mainly based on the principle of vacuum infiltration with an extraction solution, followed by a mild centrifugation step to collect the apoplastic washing fluid. The composition of the infiltration solution is essential as it has to fulfil certain prerequisites, such as maintenance of osmotic pressure to prevent collapsing of plasma membrane and stringency for extracting cell wall-bound proteins. Borderies et al. [31] compared different solutions to extract loosely bound cell wall proteins of Arabidopsis cell suspension cultures and showed that the composition of extraction solution determines the efficiency of preparation. Similarly, Boudart et al. [32] investigated weakly cell wall-bound proteins in rosettes of Arabidopsis. Here, we compared protein extracts obtained by six different infiltration solutions already described for apoplastic proteins from different plant species. We aimed at identifying a protocol most suitable for the extraction of leaf apoplast proteins of maize, a crop of high economic importance. We evaluated the protein patterns as resolved by 1-DE or 2-DE, identified the proteins using LC-MS and located them to cellular compartments.

Results and discussion

In this study, six different solutions were tested for the ability to extract proteins from the maize leaf apoplast: water [8], 20 mM ascorbic acid/20 mM CaCl2 [6], 100 mM sorbitol [4], 25 mM Tris-HCl [9], 100 mM sodium phosphate buffer [16] and 50 mM NaCl [33] (Figure 1). In most cases, the infiltration solutions were applied for wheat leaves and no comparison of the efficiency of protein extraction for each method was performed. Thus, this study focussed on identifying the optimal method for extracting apoplastic proteins from maize leaves.

Figure 1
figure 1

Schematic representation of protein extraction from maize leaf apoplast. Different infiltration solutions were analyzed for their specificity by proteome profiling using gel-based and gel-free approaches.

Proteins from the leaf apoplast and symplast extracted with the six infiltration solutions were compared on 1-DE (Figure 2A, Additional file 1). A sharp band pattern was obtained from all apoplast extracts with a high number of protein bands in each extract. While the yield of protein extraction was similar, the protein profiles showed distinct differences. A prominent band of about 20 kDa was present in extracts of 100 mM sodium phosphate buffer, 25 mM Tris-HCl, 20 mM ascorbic acid/20 mM CaCl2 and 50 mM NaCl, but not in water or 100 mM sorbitol. One protein band of high molecular weight (approximately 100-130 kDa) was apparent in extracts of water, 100 mM sodium phosphate buffer and 100 mM sorbitol, but not in 25 mM Tris-HCl, 20 mM ascorbic acid/20 mM CaCl2 or 50 mM NaCl. While there were similarities, each extract revealed specific protein bands indicating that different subsets of proteins were isolated by the six infiltration solutions. Proteins with a molecular weight < 15 kDa were underrepresented in all extracts and this corresponds to previous proteomic reports on some of the infiltration solutions [4, 16]. The observed selective protein patterns generated by the individual infiltration solutions emphasize the necessity of careful selection of isolation method [34]. Band patterns from symplast preparations did not reveal significant differences among the infiltrates and the overall band patterns were more complex as from apoplastic preparations. This demonstrates an apparent subfractionation of the cellular compartments.

Figure 2
figure 2

Profiles of maize leaf protein extracts as resolved by 1-DE (A) and 2-DE (B). A: SDS-PAGE of apoplastic and symplastic proteins extracted with water (i), 100 mM sodium phosphate buffer (ii), 25 mM Tris-HCl (iii), 100 mM sorbitol (iv), 20 mM ascorbic acid/20 mM CaCl2 (v) or 50 mM NaCl (vi). A total of 10 μg protein per lane was loaded. B: 2-DE profiles of protein extracts from the maize leaf apoplast isoelectric focussed on IPG 3-10 and visualized by Coomassie staining. A total of 25 μg protein per gel was loaded.

Equal amounts of apoplast proteins were separated by 2-DE to assess the protein patterns in more detail (Figure 2B). We found areas of good and poor resolved proteins spots on all 2-D gels. Proteins in the acidic gel region of pH 4-6 showed horizontal streaking. Although all samples were precipitated, dissolved in urea-containing buffer system and dialyzed prior to 2-DE to avoid the contamination with nucleic acids or other interfering substances, these poorly separated spots were observed. Contrary to this, proteins in the basic region of 2-D gels near the pH 6-10 interval showed a superior resolution with minimal streaking. The spot patterns resembled the band patterns to a certain extent, e.g. as observed for the 20 kDa band that was prominent also on 2-D gels of the respective apoplastic extracts. The best resolution of proteins in 25-45 kDa intervals was achieved on extracts of 20 mM ascorbic acid/20 mM CaCl2 infiltration solution, while high molecular weight proteins separated best in extracts of 100 mM sodium phosphate buffer infiltration solution. The latter was applied with success to extract proteins from the leaf apoplast of lupin and resulted in the generation of well resolved protein maps containing about 50 spots to evaluate the effect of water and boron deficiency [9]. Our results showed that this separation was not reached, probably due to substances present in the maize apoplast interfering with isoelectric focusing. As 2-DE did not result in a comprehensible evaluation of the employed infiltration solutions, we used nano-LC-ESI-Q-TOF MS for proteomic analysis of all extracts.

In order to obtain an overview of all proteins present in the six different extracts, we aimed at establishing qualitative protein profiles by LC-MS analysis. An automatic data directed analysis mode was applied as described in materials and methods section. Results exceeding the PLGS score of 12 for protein identification and probability score of 50% for de novo sequencing of peptides were accepted.

A total of 328 proteins were identified from all extracts. Additional file 2 shows the identities of those proteins, along with the predicted subcellular localization and detection in the six apoplastic extracts. Additional file 3 provides the respective identifier, PLGS score, number of peptides, protein coverage, peptide sequences and peptide sequence probability score for all identified proteins. In order to visualize and identify infiltration solutions with similar protein abundance patterns, a hierarchical clustering method was applied. Two main clusters were found, with the first represented by the 100 mM sodium phosphate buffer and the second containing all other infiltration solutions indicating the isolation of a rather different set of proteins by the first one than compared to all other solutions under examination (Figure 3). The most similar abundance patterns derived from leaf infiltration with 25 mM Tris-HCl and 50 mM NaCl reflecting a comparable degree of protein extraction efficiency.

Figure 3
figure 3

Hierarchical clustering analysis of protein abundance patterns. Columns represent LC-MS experiments on protein extracts of indicated infiltration solutions. Rows display the presence (yellow) or absence (black) of proteins in the respective extracts. Additional file 2 provides protein identifications and their detection in the respective apoplast extracts.

The highest number of proteins was found in apoplastic extracts using water as infiltration solution. Here, 171 proteins were detected. Extracts of 25 mM Tris-HCl, 100 mM sorbitol and 20 mM ascorbic acid/20 mM CaCl2 yielded in the identification of a similar number of 131, 133 and 133 proteins, respectively. We found 114 proteins in extracts of 50 mM NaCl solution and 107 proteins in those of 100 mM sodium phosphate buffer. Out of all 328 proteins, only 28 proteins were common across all six extracts (Additional file 4). A similar observation was made for Arabidopsis cell wall proteins when extracted by different solutions; here, only 11 out of 96 proteins were found to be common in all extracts [31]. Exhydrolase II [UniProt: Q9XE93] was found in all extracts and its identification is illustrated in Additional file 5 as an example. Here, the amino acid sequence is shown and the 12 detected peptides are marked within, resulting in protein sequence coverage of 28.7%.

The quality of apoplastic protein preparations is estimated in many cases by enzymatic measurements of specific proteins such as malate dehydrogenase [5, 9] and glucose-6-phosphate dehydrogenase [6]. However, it is known that the activity of those enzymes is detectable in respective cellular compartment as well [35]. To assess the amount of symplast contaminations in our samples, we used topology prediction tools. The identified proteins were classified for their subcellular localization as deduced by Expasy tools Target P and WoLF PSORT (Figure 4). A number of proteins in this study were allocated to other cellular compartments then the apoplast, suggesting considerable amounts of intracellular protein contaminations. However, previous reports using different plant species and extraction methods described the detection of cytosolic, mitochondrial or vacuolar proteins in cell wall or apoplast preparations [31, 3638]. These consistent findings point to the occurrence of non-classical secretory pathways for proteins lacking signal sequences [39, 40] and therefore, differentiation between yet unknown apoplastic proteins and ones resident in other organelles remains difficult. Water-infiltrated leaves revealed 23 protein identifications localized to the apoplast, while a high number of intracellular proteins were detected from the vacuole (19), cytosol (46) and chloroplast (23). This observation is indicative for the disrupture of plasma membrane during the infiltration process. Also, apoplastic extracts with 100 mM sorbitol as infiltration solution contained a superior proportion of chloroplast (26) and cytosolic (41) proteins with only 15 predicted apoplastic proteins. This result was unexpected as the sugar alcohol sorbitol was applied to maintain the osmotic cell pressure. Similar numbers of proteins in infiltrates with 25 mM Tris-HCl, 20 mM ascorbic acid/20 mM CaCl2 and 50 mM NaCl were assigned to the chloroplast (14, 17, 14), the cytosol (36, 34, 27), and the apoplast (25, 31, 25). Of all tested infiltration solutions, 100 mM sodium phosphate buffer contained the lowest number of proteins assigned to intracellular compartments (chloroplast: 10, cytosol: 16) and the highest number of proteins targeted to the extracellular apoplast with 34 identified proteins.

Figure 4
figure 4

Predicted subcellular distribution of identified proteins from the maize leaf apoplast extracted by different infiltration solutions. Topology prediction was performed with Expasy tools Target P (http://www.cbs.dtu.dk/services/TargetP/) and WoLF PSORT (http://wolfpsort.org/).

Table 1 presents the 67 proteins allocated to the apoplast of maize leaves and grouped according to their function into 7 classes. The largest class consisted of 39 proteins related to cell structural processes, including carbohydrate metabolism (e.g.: lichenase 2, alpha N-arabinofuranosidase, beta galactosidase, exoglucanase, exhydrolase II) and cell wall modification (e.g.; pectinesterase, xyloglucan endotransglycosylase hydrolase, peroxidases). Synthesis and integration of polysaccharides into the cell wall and extension of this network during plant growth are the major biological functions of proteins present in the apoplast [41] and our findings reflect this reality. Fifteen proteins were involved in disease and defense reactions, the second prime function of the apoplast [42]. The third class was related to proteins with transporting function and here, 7 proteins were identified. Further classes were related to cell growth/division, protein destination/storage, secondary metabolism and signal transduction.

Table 1 Identification of proteins assigned to the apoplast.

The number of proteins identified exclusively in any of the extracts was compared and revealed that 16 out of 34 apoplastic proteins were found only in extracts of 100 mM sodium phosphate buffer, representing the highest number of unique proteins in all tested infiltration solutions (Table 1). Usage of this infiltration solution appears to prevent damaging the plasma membrane and enables extraction of proteins adhesive to the cell wall. Most polypeptides found in the analysis were annotated as hypothetical based on an in silico match to a genome sequence, or putative due to a homology to a protein with known function (Figure 4). The identification of these proteins in apoplastic preparations reveals the potential inherited in proteomic surveys for establishing comprehensive maps of all translated polypeptides present in a subcellular compartment. A number of 12 proteins with unknown function were exclusively identified using the 100 mM sodium phosphate buffer infiltration solution (see Additional file 2). As this protein fraction performed best regarding contaminations from other cellular compartments and contained most of the apoplastic proteins, we assume that these yet unknown proteins are involved in physiological processes of the apoplast.

Conclusions

The plant apoplast is a dynamic compartment with a broad range of physiological functions. To study proteins involved in nutrition, growth, signaling or transport processes, it is crucial to apply extraction methods selective for apoplastic proteins. In this study, we compared six different infiltration solutions already reported for the isolation of this protein subset. The protein patterns resolved by 1-DE revealed clear differences between apoplast and symplast preparations. We found the lowest number of intracellular protein contaminants with the highest number of extracted proteins present in the apoplastic fluid obtained with 100 mM sodium phosphate buffer. Also, the number of secreted proteins exclusively found in a single fraction was highest for that buffer. Those findings are now employed in comparative proteomic studies aiming at identifying proteins involved in abiotic stress responses.

Materials and methods

Plant cultivation

Maize grains cv. Lector (LG Seeds, http://www.lgseeds.com) were imbibed overnight in aerated 1 mM CaSO4 solution and germinated at 28°C in the dark between two layers of filter paper moistened with 0.5 mM CaSO4. After 4 days, seedlings were transferred to light in constantly aerated plastic pots containing one-fourth concentrated nutrient solution. The concentration of nutrient solution was increased to half and full strength after 2 and 4 days of cultivation, respectively. The full strength nutrient solution had the following concentrations: 2.0 mM Ca(NO3)2, 1.0 mM K2SO4, 0.2 mM KH2PO4, 0.5 mM MgSO4, 2.0 mM CaCl2, 5.0 μM H3BO3, 2.0 μM MnSO4, 0.5 μM ZnSO4, 0.3 μM CuSO4, 0.01 μM (NH4)6Mo7O24, 200 μM Fe-EDTA. Nutrient solution was changed twice a week to avoid nutrient deficiencies. The experiments were carried out under greenhouse conditions with an average day/night temperature of 28/18°C and a photoperiod of 14 h for 5 weeks with relative humidity about 70% ± 5%. The fifth and sixth leaf from medium part of the stem was harvested 16 d after reaching the full nutrient solution for collection of apoplast proteins.

Extraction of apoplastic and symplastic proteins

Apoplastic proteins were collected using the infiltration-centrifugation technique [43] with minor modifications. Leaves were cut into segments of about 5.5 cm and washed with deionised water. For infiltration, leaf segments were placed in plastic syringes (60 ml) filled with 40 ml of the respective infiltrating medium and were infiltrated by pulling the plunger, producing a reduced pressure of estimated about 20 kPa. Leaves were infiltrated either with water, 20 mM ascorbic acid/20 mM CaCl2, 100 mM sorbitol, 0.1 M sodium phosphate buffer (pH 6.5), 25 mM Tris-HCl (pH 8.0) or 50 mM NaCl (Figure 1). Thereafter, intact leave segments were carefully blotted dry, and then placed in a 10 ml plastic vessel and centrifuged immediately at 400 g for 5 min at 5°C. The clear infiltrate, now referred to as apoplast fraction, was collected at the bottom of the tube.

After the extraction of the apoplastic fraction, the residual leaf tissue was shock frozen in liquid nitrogen, thawed, and centrifuged at 715 g for 5 min for cell sap extraction, now referred to as symplast fraction. Four pools of extracts from five plants each were combined for subsequent analyses. Extracts were stored at -80°C until analysis.

Gel electrophoretic protein separation

Proteins contained in the different extracts were precipitated by chloroform/methanol method [44]: 200 μl of sample was mixed with 800 μl MeOH, 400 μl chloroform and 600 μl deionized water. The incubation at 4°C for 5 min was followed by a centrifugation step (9,000 g, 2 min, 4°C). The upper phase was removed and 600 μl MeOH was added to the lower and interphase. A further centrifugation sedimented the proteins, the supernatant was removed and the pellet was dried in a vacuum centrifuge.

For one-dimensional separation of proteins, the pellets were dissolved in 10% glycerol, 2.3% SDS, 5% β-mercaptoethanol, 0.25% bromphenol blue, 63 mM Tris-HCl (pH 6.8). The 2-D Quant Kit (GE Healthcare, http://www.gehealthcare.com) was used for determining the protein concentration. A sample of 10 μg was separated by SDS-PAGE according to Laemmli [45]. The two-dimensional separation of proteins was accomplished as described in Zörb et al. [46] with the following modifications. Protein pellets were first dissolved in 8 M urea, 2 M thiourea, 0.5% IPG (immobilized pH gradient) buffer, 4% w/v CHAPS, 30 mM DTT, 20 mM Tris and then dialyzed using 3.5 kDa cut-off membrane (ZelluTrans, Carl Roth, http://www.carlroth.com) against the same buffer. The protein concentration was determined with the 2-D Quant Kit (GE Healthcare) and 25 μg of protein were separated on IPG strips of 7 cm in length with pH gradient of 3-10. Protein gels were stained according to the hot-staining protocol with Coomassie R350 tablets (PlusOne Coomassie tablets PhastGel Blue R-350, GE Healthcare) [47] and digitized with an Epson Perfection V700 Photo scanner (Epson, http://www.epson.com).

LC-MS-based protein identification

Dialyzed protein extracts were precipitated by chloroform/methanol method and about 30 μg of protein were resolubilized in 50 μl 0.1% Rapigest (Waters Corporation, http://www.waters.com) in 50 mM ammonium bicarbonate. Protein concentrations were determined using the Bradford method [48] and bovine serum albumin as standard protein. Five μg of protein were reduced, alkylated and digested with trypsin over night at 37°C as described earlier [49]. The enzymatic reaction was stopped with 1N HCl and peptide solutions were adjusted to 0.1 μg/μl final concentration.

Three μl of protein digest were used for LC-separation on a nanoAcquity UPLC system (Waters) followed by mass spectrometry analysis on a Q-TOF Premier MS instrument (Waters) in a data directed analysis (DDA) mode, as described in Agrawal et al. [50].

Peptide separation was performed on a 180 μm × 20 mm Symmetry (5 μm) C18 precolumn (Waters) coupled to a 150 mm × 75 μm BEH130 (1.7 μm) C18 column (Waters), with a gradient of 3-40% actonitrile over 90 min. The MS operated in a positive ion mode with a source temperature of 80°C, a cone gas flow of 50 l/h, and a capillary voltage of approximately 3 kV. Mass spectra were acquired in a continuum V-mode and spectra integrated over 1 s intervals using MassLynx 4.1 software (Waters). The instrument was calibrated using selected fragment ions of the CID (collision-induced dissociation) of Glu-Fibrinopeptide B (SIGMA-ALDRICH, http://www.sigmaaldrich.com). Automatic data directed analysis (DDA) was employed for MS/MS analysis on doubly and triply charged precursor ions. The MS spectra were collected from m/z 400 to m/z 1600, and product ion MS/MS spectra were collected from m/z 50 to m/z 1600. Lock mass correction of the precursor and product ions was conducted with 500 fmol/μl Glu-Fibrinopeptide B in 0.1% formic acid in AcN/water (50:50, v/v) respectively, and introduced via the reference sprayer of the NanoLockSpray interface. ProteinLynx GlobalSERVER v2.3 software was used as a software platform for data processing, deconvolution, de novo sequence annotation of the spectra, and database search. A 10 ppm peptide, 0.1 Da fragment tolerance, one missed cleavage, and variable oxidation (Met) and carbamidomethylation (Cys) were used as the search parameters.

The resulting mass spectra were searched against the protein index of the UniProt viridiplantae database (release: July 2010 with 722.718 protein sequences) for protein identification applying the algorithm implemented in the ProteinLynxGlobalServer software (PLGS, Waters Cooperation). All samples were run as technical triplicates. Protein identifications consistent in two out of three LC-MS runs were considered as present in that sample. The false discovery rate was set to 4% of proteins included in the database.

Hierarchical clustering of protein abundances was performed using Gene Expression Similarity Investigation Suite Genesis v1.7.6 [51]. Average linkage clustering was applied for LC-MS experiments and protein abundances.