Data Processing for 3D Mass Spectrometry Imaging
Data processing for three dimensional mass spectrometry (3D-MS) imaging was investigated, starting with a consideration of the challenges in its practical implementation using a series of sections of a tissue volume. The technical issues related to data reduction, 2D imaging data alignment, 3D visualization, and statistical data analysis were identified. Software solutions for these tasks were developed using functions in MATLAB. Peak detection and peak alignment were applied to reduce the data size, while retaining the mass accuracy. The main morphologic features of tissue sections were extracted using a classification method for data alignment. Data insertion was performed to construct a 3D data set with spectral information that can be used for generating 3D views and for data analysis. The imaging data previously obtained for a mouse brain using desorption electrospray ionization mass spectrometry (DESI-MS) imaging have been used to test and demonstrate the new methodology.
Key words3D Mass Spectrometry Imaging Tissue imaging Statistical Analysis
Mass spectrometry (MS) imaging brings the advantages of MS to microscopy and provides the spatial distribution of chemicals on a surface without the need for fluorescent or radioactive labeling [1, 2, 3, 4]. The development of 2D-MS imaging of biological tissue analysis provides highly specific molecular information on the distribution of proteins [5, 6, 7, 8, 9, 10], lipids [11, 12, 13, 14, 15, 16, 17, 18, 19], and therapeutic drugs [20, 21, 22, 23, 24] in the material. This information serves as a powerful tool for finding disease biomarkers as well as for understanding and developing drug delivery systems [25, 26]. While 2D-MS imaging has been widely applied to the analysis of thin tissue sections, it has also been recognized that it is highly valuable to acquire 3D spatial distributions of the chemicals in a tissue volume or in an entire organ [27, 28, 29, 30, 31, 32, 33, 34]. The two basic approaches used for 3D-MS imaging are, first, depth profiling using an ionization source that ablates tissue and second, recording a sequence of 2D images from serial sections taken from a tissue volume and then combining this information. In the depth profiling experiments, ablation of the tissue material is used to expose lower layers of tissue for analysis; this has been achieved with high energy ion beams in secondary ion mass spectrometry (SIMS) imaging [29, 33, 35] or with lasers in methods that include matrix assisted laser desorption (MALDI) [5, 36, 37], laser ablation electrospray ionization (LAESI) [28, 38], and laser ablation followed by atmospheric-pressure afterglow (LA-FAPA) . In the alternative serial-sectioning approach, a volume of tissue is sliced into thin sections and each of the sections is imaged using standard 2D-MS imaging. The information obtained from the 2D images is then processed to construct the 3D distributions of the chemicals. By appropriately selecting a representative number of sections for analysis, a large volume of tissue can be analyzed in a relatively short time with adequate information being acquired to reconstruct the 3D chemical distributions. This approach has been implemented using MALDI [27, 30, 32, 40] and desorption electrospray ionization (DESI) .
Data processing for MS imaging is important and also challenging since a large amount of raw data is acquired and needs to be processed and analyzed. Software tools for 2D-MS image data processing are readily available. In addition to the software provided with commercial mass spectrometers , free software such as BioMap [43, 44], Datacube Explorer , and MITICS  have been used widely for generating 2D-MS images. Currently there is no software available for processing MS data to assemble 3D images directly. In recent studies [32, 40, 41], 2D images for selected ions on a series of sections were first generated using 2D data processing software, and then the color distribution information was further processed by image software to generate the 3D images. Note that the m/z values and corresponding ion abundance information for distributions of multiple compounds are not represented in the 3D data set constructed in this approach. To obtain 3D images for different ions, different 2D images have to be generated first, and their color distributions are used to represent difference for the 3D image construction. Application of advanced data analysis methods to a 3D volume, such as principal component analysis (PCA), is not possible because the original mass spectral information is not retained through the data processing. As discussed for a previous study of peptide and protein imaging in rat brain , many more extensive data processing procedures are required for true 3D data processing that retains the MS spectral information. These methods could include, but are not limited to, spectral smoothing, intra-section registration (2D image rotating and rescaling), inter-section registration (alignment, quality measurement), and validation (surface rendering), etc.
In this study, we explored the methods to reconstruct a 3D data set retaining the mass spectral information for 3D-MS imaging. With the accurate masses and the abundances of the ions representative for the compounds of interest, 3D images can be instantly produced with arbitrary views, and statistical analysis can be performed in the 3D volume. The key steps necessary for the data processing were identified, and solutions were developed and implemented using selected capabilities of MATLAB so that they can be integrated into a complete software package. The data set previously acquired for 36 sections of a mouse brain with DESI imaging was used here to test the methods and to demonstrate the new software solutions. Data reduction, tissue section alignment, data visualization, as well as statistical analysis using PCA and cluster analysis (CA) have been developed.
2 Data Registration and Storage
Similar to 2D-MS imaging, the spectra recorded for each point in a 3D volume tissue needs to be co-registered with the position of each sampling point. When using a series of sections from a tissue volume, the x and y coordinates are registered with the individual spectra acquired while 2D imaging is being performed on each section. The actual z coordinate value of a section needs to be registered together with the data recorded for each point on that section. A point close to the bottom-left corner of each tissue section is set as the reference point (0, 0) in the program we developed, while a relative x-y position system is used to register the points in that section. There is a challenge in aligning the x-y positions between different sections, for which a solution will be discussed later in this paper. Typically, multiple spectra are recorded for each point of the section and the averaged spectra are used for data processing. The entire data set can be stored in a data base defined in various ways. The data used in our study were recorded using an LTQ mass spectrometer (Thermo Fisher Scientific, Inc., San Jose, CA, USA) equipped with a homebuilt DESI imaging source. Thirty-six tissue sections of a mouse brain were imaged in the negative ion mode, with a total of 50 rows of scans and 69 spectra recorded per row . The x and y positions corresponding to each spectrum were determined by the scanning speed and step length of the moving stage in the x and y directions, respectively. An index file was created to correlate each file name with the x-y coordinates. During the data processing, the peaks were identified and a single file was created for each analyte with its intensities at every point in the 3D volume.
3 Data Reduction
Data reduction has been shown to be necessary for 2D-MS imaging , and it is even more desirable for 3D imaging, especially when high resolution mass spectra are recorded using FTICR, Orbitrap, or TOF analysis, since a significantly larger amount of raw data is then collected. Use of the raw spectra causes problem in data storage as well as in subsequent data analysis, which is typically limited by the memory size and data transfer speed of the computer. The binning method is commonly used for reduction of raw data. For each spectrum, the bin width is first selected based on the mass resolution of the instrument, and one peak with a nominal m/z value centered within every bin window is assumed to represent the information with the peak intensity being defined as either the maximum or the sum of the signal intensity across the bin width.
Peak alignment can be performed after peak identification to further decrease the data size while retaining the accurate m/z values of the compounds detected in the tissue. Mass shifts exist for some compounds in spectra acquired from different spots on a tissue section, and they can be caused by the conditions used for mass analysis and the composition of the sample matrix [51, 52, 53, 54, 55]. Peak alignment allows the assignment of the correct m/z value to a compound uniformly for all the pixels on a tissue section or in a tissue volume, based on the statistical analysis of the spectra and the mass accuracy and resolution of the mass spectrometer. This process plays an important role in subsequent data analysis. As an example, peak alignment for phosphatidylinisitols (PS) 18:0/22:6 was performed using the method shown in Figure 1c. The distribution of the peak positions obtained from all 3450 spectra acquired for a tissue section covers a narrow mass range around m/z 834.6 and could be fitted to a Gaussian distribution. The corresponding m/z value at the maximum in the distribution was then assigned to all the peaks counted to this distribution for all the spectra, while retaining the original measured peak intensity. If internal references can be used for mass calibration of each spectrum, the statistical analysis shown in Figure 1c is not necessary, but the peak positions still need to be identified as shown in Figure 1b. Practically, identifying multiple endogenous calibrators for in situ calibration can be difficult for MS imaging while adding external calibrators can also be cumbersome.
Data Reduction for Bin and Peak Detection and Peak Alignment (PD&PA) Methods, for LTQ and Orbitrap
Pixels per tissue section
Raw data size
4 Section Alignment
In our study, an unsupervised, self-organizing feature map (SOFM) artificial neural network method was applied to classify the imaged area into the sample and substrate region, so that the shape and location of the sample region can be used for the inter-section alignment. SOFM is different from other artificial neural networks in the sense that it uses a neighborhood function to preserve the topologic or morphologic properties of a data space, which is useful for producing low-dimension views through classification of high-dimensional data [56, 57]. A significant advantage for using SOFM is that no training process is required to generate the low-dimensional views. This makes SOFM very suitable for identifying the main morphologic features universal in all tissue sections that can be used to differentiate sample regions from non-sample regions. The MATLAB Neural Network Training Tool was used to implement the SOFM. Identification of the tissue sample area is done with the SOFM using the spectra with their original intensities, with instruction set for two features into two categories (neuronal structure 1 × 2). As shown in Figure 1c, the sample region is clearly separated from the substrate background. More detailed morphologic features can be extracted using SOFM with the spectral intensity first normalized for the tissue region (Figure 2d, e, f). A potential limitation for using SOFM routinely in 3D tissue imaging is that it could be time consuming, depending on the number of categories that need to be identified.
To align the 36 tissues sections for the 3D data construction, SOFM is applied twice to provide images of the regions of white and grey matter. A program written in MATLAB allows for the overlay of two images from adjacent tissue sections (Figure 2g, h) and their relative movement and rotation. The program also calculates the number of pixels with color mismatch between these two images (Figure 2i), which is minimized when best alignment is achieved (Figure 2j). The x-y coordinates are then corrected and saved for the 3D data reconstruction.
This alignment method provides a process with a quantitative measure, the number of misaligned pixels, which can be implemented to achieve automated alignment. It has been applied for aligning two tissue sections with different sample areas, such as those at z = 2.22 mm and z = 3.04 mm shown in Figure 1a. The symmetry in distribution of the mismatched pixels can be used to assist the alignment process. When images with three or more categories identified are used in alignment, empirically it is found that number of mismatched pixels is also minimized when the sections of different sample areas are best aligned. In some cases, the observed sample area is enlarged due to stretching of the tissue section during sectioning, instead of the actual change in the original shape of the tissue volume. Additional instructions need to be included to reshape the sample area; however, the rule of achieving minimum number of misaligned pixels can still be used as a measure during that process.
5 Data Interpolation
The interpolation for the virtual layers helps to generate images with better smoothness. The biologically meaningful interpolation has to be validated with comparison between images with all real layers and with a mixture of real and virtual layers. This could also be sample-specific and analyte-specific. In this study, we demonstrate how to enable the interpolation capability and use three classic methods as examples. No significance was observed among them for the imaging using the 19 lipid peaks. For various biological studies, the interpolation method can be easily switched based on the user’s knowledge about the sample and the distribution of the biomarkers.
6 3D Visualization and Data Analysis
In this work, we explored a procedure and developed tools for data processing in 3D mass spectrometry imaging. The reconstruction of the 3D data set containing the mass spectral information, viz. the accurate masses and abundances, for all the compounds of interest is the critical step. The identification of the peaks and the alignment of the masses are performed based on the statistical analysis of the 2D imaging data acquired over an entire tissue section, which is important for reducing the data size while retaining the accurate mass information. Appropriate solutions were also identified for other technical challenges, including aligning the section data, producing continuous images, and generating arbitrary 3D views. These capabilities and the results of utilizing the various procedures and software tools were demonstrated with the 3D-MS imaging data acquired for a mouse brain using DESI-MS imaging. Though only data by DESI imaging are used in the demonstrations, the capabilities of the software and methods are not limited by mass range or resolution. They can be applied to data acquired by MALDI and other imaging methods, with proper m/z windows selected for the peak alignment based on the specified resolution and mass accuracy of the mass spectrometer used to record the data. In future development, the strategies for the proper interpolation of data and insertion of the virtual layers need to be explored and validated. The capability allowing direct comparison of 3D images acquired by a variety of technologies, such as mass spectrometry, MRI (magnetic resonance imaging), and spectroscopic imaging methods, would provide comprehensive morphologic and molecular information for the biological study.
The authors acknowledge support for this work by National Science Foundation (projects CHE-0847205 and DBI-0852740), National Institute of Health (project 5R21RR031246), National Natural Science Foundation of China (project No. 20728505), and the Walther Cancer Institute (grant 205017).
- 10.van Remoortere, A., van Zeijl, R.J.M., van den Oever, N., Franck, J., Longuespée, R., Wisztorski, M., Salzet, M., Deelder, A.M., Fournier, I., McDonnell, L.A.: MALDI Imaging and Profiling MS of Higher Mass Proteins from Tissue. J. Am. Soc. Mass Spectrom. 21(11), 1922–1929 (2010)Google Scholar
- 18.Amaya, K.R., Monroe, E.B., Sweedler, J.V., Clayton, D.F.: Lipid imaging in the zebra finch brain with secondary ion mass spectrometry. Int. J. Mass Spectrom. 260(2/3), 121–127 (2007)Google Scholar
- 19.Eberlin, L.S., Ferreira, C.R., Dill, A.L., Ifa, D.R., Cooks, R.G.: Desorption electrospray ionization mass spectrometry for lipid characterization and biological tissue imaging. Biochim. Biophys. Acta Mol. Cell Biol. Lipids 1811(11), 946–960 (2011)Google Scholar
- 20.Zha, X.H., Ausserer, W.A., Morrison, G.H.: Quantitative Imaging of a Radiotherapeutic Drug, Na2b12h11sh, at Subcellular Resolution in Tissue–Cultures Using Ion Microscopy. Cancer Res. 52(19), 5219–5222 (1992)Google Scholar
- 21.Smith, D.R., Chandra, S., Barth, R.F., Yang, W.L., Joel, D.D., Coderre, J.A.: Quantitative imaging and microlocalization of boron-10 in brain tumors and infiltrating tumor cells by SIMS ion microscopy: Relevance to neutron capture therapy. Cancer Res. 61(22), 8179–8187 (2001)Google Scholar
- 22.Kim, D.W., Huamani, J., Reyzer, M.L., Mi, D., Caprioli, R.M., Hallahan, D.E.: Imaging mass spectrometry to map distribution of radiation enhancing vasculature targeted drug and protein biomarkers of response to therapy in prostate cancer. Int. J. Radiat. Oncol. Biol. Phys. 66(3), 2620 (2006)Google Scholar
- 24.Wiseman, J.M., Ifa, D.R., Zhu, Y.X., Kissinger, C.B., Manicke, N.E., Kissinger, P.T., Cooks, R.G.: Mass Spectrometry Across the Sciences Special Feature: Desorption electrospray ionization mass spectrometry: Imaging drugs and metabolites in tissues (vol 105, pg 18120, 2008). Proc. Natl. Acad. Sci. U. S. A. 106(14), 6022–6022 (2009)CrossRefGoogle Scholar
- 25.Chaurand, P., Rahman, M.A., Hunt, T., Mobley, J.A., Gu, G., Latham, J.C., Caprioli, R.M., Kasper, S.: Monitoring mouse prostate development by profiling and imaging mass spectrometry. Mol. Cell. Proteomics 7(2), 411–423 (2008)Google Scholar
- 37.Chaurand, P., Caprioli, R.: Profiling and Imaging of proteins in tissue sections using MALDI mass spectrometry.Direct applications to clinical diagnosis and drug discovery. Protein Sci. 13, 583 (2004)Google Scholar
- 47.Klinkert, I., McDonnell, L.A., Luxembourg, S.L., Altelaar, A.F.M., Amstalden, E.R., Piersma, S.R., Heeren, R.M.A.: Tools and strategies for visualization of large image data sets in high-resolution imaging mass spectrometry. Rev. Sci. Instrum. 78(5) (2007)Google Scholar
- 48.Levi-Setti, R., Crow, G., Wang, Y.L.: Progress in high resolution scanning ion microscopy and secondary ion mass spectrometry imaging microanalysis. Scan. Electron Microsc., 535–51 (1985)Google Scholar
- 52.Plass, W.R., Li, H.Y., Cooks, R.G.: Theory, simulation and measurement of chemical mass shifts in rf quadrupole ion traps. Int. J. Mass Spectrom. 228(2/3), 237–267 (2003)Google Scholar
- 58.http://www.vtk.org, March 3, 2011