Background

Rhizobia are nitrogen-fixing soil bacteria that show intracellular symbiosis with their host legume. This symbiotic interaction has become a model system to identify and characterize the attractive mechanism employed by invasive bacteria during chronic host interactions [1]. This symbiosis begins with the secretion of flavonoids by the legume. Subsequently, nod genes of rhizobia are activated, and Nod factors (i.e. lipopolysaccharides; LPS) are secreted by rhizobia as signals [2]. After signal exchange between host and symbiont, rhizobia infect the host legume, escaping the vegetative defense responses. The host then produces nodules to maintain symbionts and endocytically incorporates rhizobia into the nodules [3]. In a legume nodule, the host provides C4 dicarboxylates to symbiotic rhizobia as the carbon source; rhizobia fix atmospheric nitrogen and provide ammonia to the host as a nitrogen source in return [4]. Thus, the host plants are able to overcome their nitrogen deficiency.

Lotus japonicus and Mesorhizobium loti are model organisms of legume-rhizobia symbiosis. The entire genome structures of L. japonicus MG-20 and M. loti MAFF303099 have been reported previously [5, 6], and the database is maintained by the Kazusa DNA Research Institute (Rhizobase; http://genome.microbedb.jp/rhizobase). Transcriptome analysis of M. loti by DNA microarray revealed that most of the transposase genes and nif, fix, fdx, and rpoN on the symbiosis island were highly upregulated under the symbiotic condition, while genes for cell wall synthesis, cell division, DNA replication, and flagella formation were strongly repressed under the symbiotic condition [7]. However, less information is available about M. loti than about other genera of rhizobia, such as Sinorhizobium meliloti, Rhizobium leguminosarum, and Bradyrhizobium japonicum.

In addition to transcriptome analysis, proteome analysis has recently attracted much attention. While genome sequencing has provided a considerable amount of useful information to clarify biological phenomena, proteins, rather than genes, actually function in vivo. Thus, it is necessary to analyze the entire set of produced proteins [8]. Proteome analysis of M. loti in mid-growth phase has been reported [9], but it has not been performed for the symbiotic phase. Proteome analyses of other rhizobia, such as B. japonicum[1014], and S. meliloti[1520], have been previously reported. They employed 2-dimensional polyacrylamide gel electrophoresis (2D-PAGE)-based analysis combined with matrix-assisted laser desorption and ionization time-of-flight mass spectrometry (MALDI-MS), but time-consuming steps, such as gel spot isolation and individual measurement, are necessary in this method. In addition, previous 2D-PAGE-based analyses have only identified up to 500 proteins [13]. Another report employed liquid chromatography-tandem mass spectrometry (LC-MS/MS)-based technology combined with prefractionation, such as multidimensional chromatography [21] or gel-based separation [22], but these prefractionation steps decreased throughput. Furthermore, all of them included a complicated isolation step of the bacteroid (a symbiotic form of rhizobia) from the nodule, and the step required a large amount of biological samples, such as 1–5 g nodules collected from approximately 40 plants [23]. Detection of small amount of proteins present in complex biological samples remains difficult and requires a combination of prefractionation steps.

To solve the problems, we used a nanoLC-MS/MS system equipped with a long monolithic silica capillary column (200 cm long, 0.1 mm ID). Monolithic silica materials offer high separation efficiency in long column formats because of their high permeability [24], and they have been successfully applied to separate tryptic fragments in highly complex samples with a shallow gradient. As this high-resolution system does not require any additional prefractionation prior to the separation and detection step by LC-MS/MS, this approach can simplify the workflow of shotgun proteomics and minimize the sample amount, as well as total analysis time [25]. Using this system, we have successfully performed proteome analysis of Candida albicans[26] and Clostridium cellulovorans[27].

Here, we report the first comparative proteome analysis of M. loti under the free-living and symbiotic conditions by using our system. Our data should accelerate functional and comprehensive studies focused on molecular mechanisms of L. japonicus - M. loti symbiosis.

Results and discussion

Identification of proteins extracted from free-living and symbiotic M. loti

The tryptic digests were injected to a LC-MS/MS system equipped with a long monolithic silica capillary column; 1,658 proteins were successfully identified by efficient separation (Additional file 1). Specifically, 1,533 proteins were identified under the free-living condition, and 847 proteins were identified by the analytes extracted from nodules without bacteroid isolation and prefractionation (Figure 1). Many proteins encoded in the symbiosis island were also identified. The symbiosis island of M. loti MAFF303099 is one of the notable features, which occurs by integration of a horizontally transferred DNA segment, and is located on a 610,975-bp DNA segment of the chromosome at coordinates 4,644,702 to 5,255,766 [5]. A total of 582 protein-encoding genes were located on the symbiosis island. Mapping the identified proteins to the symbiosis island showed that 74 proteins (8.7% of 847 proteins) were produced under the symbiotic condition, whereas only 22 proteins (1.4% of 1,533 proteins) were produced under the free-living condition. From the viewpoint of reproducibility, our data show highly-reproducible result with the strict criteria for protein identification (Additional file 2). As shown in this figure, 87% of proteins were identified from 3 data set under the free-living conditions, although the previous report indicated that protein profile of free-living M. loti in stationary phase was not reproducible [9]. And identified proteins under the symbiotic condition also show high-reproducibility because 84% of proteins were identified at all measurements. These results indicated that the protein profile successfully obtained with our system reflected the free-living and the symbiotic conditions.

Figure 1
figure 1

Venn diagram of proteins identified in M. loti. A total of 1,658 proteins were identified. Although 722 proteins were commonly identified under the free-living and symbiotic conditions, 811 and 125 proteins were uniquely identified under the free-living and symbiotic conditions, respectively.

KEGG pathway analysis

For further investigation about the lifestyle of rhizobia under each condition, the identified proteins were classified according to the Kyoto Encyclopedia of Genes and Genomes (KEGG; http://www.genome.jp/kegg/), and metabolic pathways were compared under the free-living and symbiotic conditions. The number of classified enzymes in each pathway is shown in Table 1, and the annotated genes in Table 1 are listed in Additional file 3.

Table 1 The number of classified enzymes detected by proteome analysis

Central carbon metabolism

Most enzymes classified in carbon metabolism, such as glycolysis, gluconeogenesis, TCA cycle, pentose phosphate (PP), and Entner-Doudoroff (ED) pathways, were commonly identified (Figure 2). It is assumed that the same pathways located in central carbon metabolism remained largely unchanged, irrespective of conditions.

Figure 2
figure 2

The map of central carbon metabolic pathways under the free-living and/or symbiotic conditions. The map of central carbon metabolic pathways involving gluconeogenesis and the Embden-Meyerhof-Parnas (EMP) pathway, the Entner-Doudoroff (ED) pathway, the pentose phosphate (PP) pathway, and the TCA cycle are shown. Symbols represent the following: fully-filled box (■), enzymes that were commonly identified under each condition; boxes filled in the bottom-right corner (◪), enzymes identified only under the free-living condition; boxes filled in the upper-left corner (◩), enzymes that were identified only under the symbiotic condition; open box (□), enzymes not identified in this study but proposed in M. loti by KEGG pathway analysis. Abbreviations are as follows: DHAP, dihydroxyacetone phosphate; GAP, glyceraldehyde-3-phosphate; PEP, phosphoenolpyruvate; KDPG, 2-dehydro-3-deoxy-phosphogluconate; ACP, acyl carrier protein; PHB, polyhydroxybutyrate.

To investigate the functional distribution, identified proteins under each condition were classified into 15 major functional categories according to Rhizobase (Figure 3). There was no significant difference between the functional profiles under each condition. (Statistical significances were determined using Pearson’s chi-square test, p > 0.01). This indicated that the metabolic pathways, which constitute the backbone of life, were commonly used under both conditions.

Figure 3
figure 3

Functional classification according to Rhizobase. Relative frequency of genes/proteins belonging to a category is given for 2 data sets: the proteins detected under the free-living condition (1,533) (dark gray) and in the L. japonicus nodule (847) (light gray). The relative frequencies were calculated by dividing the number of proteins into each category by the total number of identified proteins.

Nitrogen fixation

Nitrogenase complex core subunits (NifH, NifD, NifK) and the electron donor proteins (FixA, FixB, FixC), which transfer electrons to the nitrogenase complex, were detected only under the symbiotic condition (Figure 4a). Fixation of atmospheric nitrogen is a characteristic feature of rhizobia only under the symbiotic condition [7]. The proteins related to nitrogen fixation, such as nitrogenase construction (NifN, NifX, NifS, NifW) [28], electron donation (FixX, FixP), and symbiosis-unique ferredoxins (mlr5869, mlr5930, msl8750), were also found to be unique to the symbiotic condition. In addition, NifA and RpoN, which are known to cooperatively regulate nif and fix genes, were detected only under the symbiotic condition [29]. The protein profile strongly reflected the phenotype that was predicted by transcriptome analysis [7].

Figure 4
figure 4

The map of metabolic pathways under the symbiotic and/or free-living conditions. The map of metabolic pathways is shown: (a) nitrogen fixation, (b) ubiquinone biosynthesis, (c) amino sugar metabolism, (d) peptidoglycan biosynthesis. Box symbols indicate the same things as in Figure 2. Daggers (†) indicate the reactions that have universally existed but have not been proposed in M. loti by KEGG pathway analysis. Asterisks indicate the focused enzymes discussed in the “Results and discussion” section. Abbreviations are as follows: GlcN, Glucosamine; GlcNAc, N-acetyl-D-glucosamine; MurNAc, N-acetylmuramic acid.

Farnesyl diphosphate (FPP) biosynthesis

It is generally known that rhizobia provide ammonia and other amino acids as a nitrogen source to the host [4], while no other compound is known to be provided. However, the obtained protein profile suggested that FPP might be provided from rhizobia to plant root cells. In the quinone biosynthetic pathway, the enzymes necessary to FPP biosynthesis, such as isopentenyl pyrophosphate isomerase (mlr6371) and geranyltransferase (mlr6368), which are located in the rhizobia symbiosis island, were uniquely detected under the symbiotic condition (Figure 4b). These enzymes produce FPP from isopentenyl diphosphate and dimethyl allyl diphosphate. FPP is an intermediate in the mevalonate pathway, which is present in all higher eukaryotes and many bacteria. FPP is used for the biosynthesis of ubiquinone in M. loti. However, the enzymes which catalyze the ubiquinone biosynthesis reactions from FPP (shown in asterisks in Figure 4b) were not detected at the protein level. Additionally, the symbiosis island does not include genes encoding octaprenyl-diphosphate synthase (mlr7426) and 4-hydroxybenzoate polyprenyltransferase (mll7442), which are involved in the pathway of ubiquinone biosynthesis. On the other hand, higher plants utilize FPP as the intermediate precursor of many secondary metabolites, such as sesquiterpenes, triterpenes, and sterols [30]. It is reasonable to suppose that FPP is provided to the host legume from rhizobia as a source of secondary metabolites because FPP was synthesized only under the symbiotic condition, as the enzymes that metabolize FPP after its production were not detected in M. loti at the protein level. However, the estimation is just based on the obtained protein profile, and further investigation of the migration of FPP will be carried out by using deletion mutants, and by analysis at mRNA and metabolite levels.

Nucleotide sugar metabolism and peptidoglycan biosynthesis

On the other hand, the enzymes involved in uridine diphosphate (UDP) sugar metabolism were not produced under the symbiotic condition (Figure 4c), and LPS transporters (mll3197, mll7564, mll7866) were not produced under the symbiotic condition. UDP-N-acetylglucosamine (UDP-MurNAc) is the starting material for LPS biosynthesis. LPS is known as one of the “nod factors,” which is secreted by the rhizobial body when it perceives the root through the flavonoid groups secreted from host legume [2]. The secretion of LPS is likely unnecessary under the symbiotic condition (after infection). In addition, UDP-N-acetylmuramic acid, the end product of this pathway, is the starting material of peptidoglycan biosynthesis. The enzymes of peptidoglycan biosynthesis were uniquely detected under the free-living condition (Figure 4d).

Under the symbiotic condition, rhizobia are differentiated into a bacteroid, and the peribacteroid membrane (PBM)-enclosed bacteroids are essentially a nitrogen-fixing intracellular organelle, termed the ‘symbiosome.’ In PBM, bacteroids are stationary and become slightly larger than the free-living rhizobia [31]. However, the remarkable structural changes have not been confirmed at the protein level. Proteome data could detect the proteins involved in the structural changes, as well as changes in metabolic pathway; thus, we focused on cell surface structure.

From our data, it was predicted that peptidoglycan was not biosynthesized under the symbiotic condition described above (Figure 4d). Peptidoglycan, which is the main material of bacterial cell wall, plays an important role in the maintenance of structure by providing tolerance to osmotic pressure and mechanical stress, and it is also involved in cell division during growth [32]. The inactivation of the peptidoglycan biosynthetic pathway under the symbiotic condition is supported by the following: (1) the neogenesis of peptidoglycan is unnecessary because fully symbiotic rhizobia cease their cell division, (2) symbiotic rhizobia are able to avoid mechanical stress because of enclosure by PBM and immobility, and (3) the host legume might control the surrounding environment not to impose an osmotic stress on rhizobia. The protein profile indicates that the interruption of peptidoglycan biosynthesis in symbiotic M. loti occurs at the protein level, and rhizobia under the symbiotic condition might lose its cell wall.

Flagellum and pilus components

We investigated structural proteins, such as flagellum and pilus components. The flagellum is connected to bacterial motility and attachment of rhizobia to developing root hairs, which is one of the first steps of nitrogen-fixing root nodule symbiosis [33]. The pilus is a hair-like appendage found on the surface of many bacteria and is related to the process of bacterial conjugation. Rhizobia have not only conjugative pili but also type IV pili, which generate motile forces called twitching motility, in which the pilus works as a grappling hook to bind to a variety of surfaces [34]. The flagellum component proteins, FlaA (mlr2925, mlr2927), FlgL (mlr2939), FlgK (mlr2938), MotB (mlr3926), and FliN (mll2902), were detected only under the free-living condition. DNA microarray analysis has shown that the gene of flagellar L-ling protein (FlgH; mll2921) is repressed at the mRNA level [7]. Therefore, the obtained protein profile confirmed that under the symbiotic condition, rhizobia repress flagellum genes, and it also indicated that structural proteins of the flagellum are not present under the symbiotic condition. In addition, the pilus assembly proteins, CpaB (mll5595), CpaD (mll5598), and CpaE (mll5600), were also detected only under the free-living condition. Flagella and pili were lost under the symbiotic condition because rhizobia under the symbiotic condition would have no need for conjugation, infection, and motility in PBM. In contrast, rhizobia under the free-living condition require the flagellum and pilus component proteins.

Conclusion

In order to detect the changes in M. loti between free-living and symbiotic conditions, we performed proteome analysis of M. loti. We used our LC-MS/MS system, equipped with a long monolithic silica capillary column, to successfully identify 1,658 proteins without bacteroid isolation and prefractionation. This analytical system opens up a new horizon for symbiotic proteome analysis from small amounts of unpurified crude biological samples. The protein profile indicated some interesting and unexpected results associated with the cell surface structure and metabolism, in accordance with the external environment of each condition (Figure 5). The data set revealed that M. loti under the symbiotic condition simplifies the components of the cell surface, such as flagellum, pilus, and cell wall. In addition, we found that M. loti under the symbiotic condition provided not only a nitrogen source but also FPP, which is a source of secondary metabolism. Our data should be helpful in carrying out detailed studies on the change of these 2 conditions of rhizobia.

Figure 5
figure 5

Schematic representation of the lifestyle under the symbiotic condition compared to the free-living condition. The illustration shows the changes in the lifestyles of M. loti: the lifestyle model under the (a) free-living and (b) symbiotic conditions. The central carbon metabolic pathway is essential under both conditions. Under the symbiotic condition, nitrogen is fixed by electrons from the TCA cycle or other energy metabolism and is provided to the host legume or used for amino acid biosynthesis. Moreover, the flagellum and pilus are lost, and the cell wall, which is mainly composed of peptidoglycan, may become thin or disappear. In contrast, FPP is synthesized to provide to the host legume. Under the free-living condition, LPS is secreted extracellularly as a nod factor to infect the host legume.

Methods

Strains and growth conditions

M. loti MAFF303099 was cultured in tryptone-yeast extract (TY) medium [35] at 28°C. Cells were harvested in the early stationary phase for 72 h. Cells were subjected to sample preparation in the free-living condition.

For the symbiotic condition, L. japonicus MG-20 Miyakojima [36] seeds were sterilized, germinated, and inoculated with M. loti and grown in MM1 [37] medium at 25°C with a 16-h light/8-h dark cycle. Root nodules from several plants were harvested at 7 weeks post-inoculation. Nodules from 3 independently grown pools of plants were collected and processed in parallel.

Nodules were frozen with liquid nitrogen, homogenized with an ice-cold mortar, and subjected to sample preparation.

Sample preparation

Collected cells were resuspended with 500 μL of lysis buffer (2% (w/v) 3-(3-cholamidopropyl)dimethylammonio-1-propanesulfonate, 10 mM dithiothreitol, 1% (v/v) protease inhibitor cocktail (Sigma-Aldrich, St. Louis, MO, USA), 7 M urea, and 2 M thiourea in 50 mM Tris–HCl (Nacalai tesque, Kyoto, Japan)). The solution was mixed with an equal volume of 0.5-mm glass beads (Tomy Seiko, Tokyo, Japan). The cells were then disrupted mechanically in triplicate by using BeadSmash 12 (Wakenyaku, Kyoto, Japan) at 4°C, 4,000 × g for 1 min. The solution was centrifuged at 14,000 × g for 10 min, and the supernatant was collected. The supernatant was filtered by 0.45 μm Ultrafree-MC (Millipore, Billerica, MA, USA). The filtered solution was subjected to ultrafiltration using Amicon Ultra YM-10 (Millipore) and buffer-exchanged by 200 mM triethyl ammonium bicarbonate (TEAB; Sigma-Aldrich). The proteins were reduced by adding 10 mM tris-(2-carboxyethyl)phosphine (Thermo Fisher Scientific, Waltham, MA, USA) and incubated at 55°C for 1 h. After the reaction, 20 mM iodoacetamide was added to the solution, and incubated for 30 min. The reactant was mixed with 1 mL of ice-cold acetone and incubated at −20°C for 3 h to precipitate proteins. The precipitated proteins were resuspended with 100 μL of 200 mM TEAB and mixed with 2 μl (1 μg μL-1) of sequencing grade modified trypsin (Promega, Madison, WI, USA) at 37°C overnight. The peptide concentration of the tryptic digests was measured using Protein Assay Bicinchoninate Kit (Nacalai tesque). The concentrations of the injected digests were 1.06 ± 0.12 μg μL-1 digest for free-living M. loti and 4.96 ± 0.90 μg μL-1 digest for nodules, respectively. (mean ± SD, N = 3).

LC-MS/MS analysis

Proteome analyses were performed by a liquid chromatography (UltiMate3000 RSLCnano system (Thermo Fisher Scientific))/mass spectrometry (LTQ Velos mass spectrometer (Thermo Fisher Scientific)) system equipped with a long monolithic silica capillary column (200-cm long, 0.1-mm ID) [24, 27]. 10 and 5 μL of tryptic digests were injected for free-living and symbiotic conditions, respectively, and separated by reversed-phase chromatography at a flow rate of 500 nL min-1. The gradient was provided by changing the mixing ratio of the 2 eluents: A, 0.1% (v/v) formic acid and B, 80% (v/v) acetonitrile containing 0.1% (v/v) formic acid. The gradient was started with 5% B, increased to 50% B for 600 min, further increased to 95% B to wash the column, then returned to the initial condition, and held for re-equilibration. The separated analytes were detected on a mass spectrometer with a full scan range of 350–1,500 m/z. For data-dependent acquisition, the method was set to automatically analyze the top 5 most intense ions observed in the MS scan. An ESI voltage of 2.4 kV was applied directly to the LC buffer end of the chromatography column by using a MicroTee (Upchurch Scientific, Oak Harbor, WA, USA). The ion transfer tube temperature was set to 300°C. Triplicate analyses were done for each sample of 3 biological replicates, and blank runs were inserted between different samples.

Data analysis

The mass spectrometry data of each sample were used for protein identification using MASCOT (Matrix Science, London, UK), working on Proteome Discoverer (Thermo Fisher Scientific) against the database at Rhizobase containing 7,283 sequences with a peptide tolerance of 1.2 Da, MS/MS tolerance of 0.8 Da, and maximum number of missed cleavages of 2. For trypsin digestion, cysteine carbamidomethylation (+57.021 Da) and methionine oxidation (+15.995 Da) were set as a variable modification. The data were then filtered at a q-value ≤ 0.01 corresponding to 1% false discovery rate on a spectral level. Moreover, proteins identified by at least 2 peptides per protein or identified by a single peptide per protein at any 3 data points were accepted as ‘identified proteins.’

The pathway analysis of identified proteins is performed by using the pathway mapping tool on KEGG (http://www.genome.jp/kegg/). The functional classification of proteins was performed by using Rhizobase at Kazusa DNA Research Institute (http://www.kazusa.or.jp/e/index.html).