Robust Numerical Features for Description and Classification of Subcellular Location Patterns in Fluorescence Microscope Images
The ongoing biotechnology revolution promises a complete understanding of the mechanisms by which cells and tissues carry out their functions. Central to that goal is the determination of the function of each protein that is present in a given cell type, and determining a protein's location within cells is critical to understanding its function. As large amounts of data become available from genome-wide determination of protein subcellular location, automated approaches to categorizing and comparing location patterns are urgently needed. Since subcellular location is most often determined using fluorescence microscopy, we have developed automated systems for interpreting the resulting images. We report here improved numeric features for describing such images that are fairly robust to image intensity binning and spatial resolution. We validate these features by using them to train neural networks that accurately recognize all major subcellular patterns with an accuracy higher than any previously reported. Having validated the features by using them for classification, we also demonstrate using them to create Subcellular Location Trees that group similar proteins and provide a systematic framework for describing subcellular location.
Unable to display preview. Download preview PDF.
- 1.M.V. Boland, M.K. Markey, and R.F. Murphy, “Classification of Protein Localization Patterns Obtained via Fluorescence Light Microscopy,” in 19th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Chicago, IL, USA, 1997, pp. 594–597.Google Scholar
- 3.R.F. Murphy, M.V. Boland, and M. Velliste, “Towards a Systematics for Protein Subcellular Location: Quantitative Description of Protein Localization Patterns and Automated Analysis of Fluorescence Microscope Images,” in Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology,San Diego, 2000, pp. 251–259.Google Scholar
- 5.J.W. Jarvik, S.A. Adler, C.A. Telmer, V. Subramaniam, and A.J. Lopez, “CD-Tagging: A New Approach to Gene and Protein Discovery and Analysis,” Biotechniques, vol. 20, 1996, pp. 896–904.Google Scholar
- 6.C.A. Telmer, P.B. Berget, B. Ballou, R.F. Murphy, and J.W. Jarvik, “Epitope Tagging Genomic DNA Using a CD-Tagging Tn10 Minitransposon,” Biotechniques, vol. 32, 2002, pp. 422–430.Google Scholar
- 7.J.W. Jarvik, G.W. Fisher, C. Shi, L. Hennen, C. Hauser, S. Adler, and P.B. Berget, “In Vivo Functional Proteomics: Mammalian Genome Annotation Using CD-Tagging,” BioTechniques, vol. 33, 2002, pp. 852–867.Google Scholar
- 11.R.I. Jennrich, “Stepwise Discriminant Analysis,” in Statistical Methods for Digital Computers, K. Enslein, A. Ralston, and H.S. Wilf (Eds.), John Wiley & Sons: New York, 1977, pp. 77–95.Google Scholar
- 13.R.F. Murphy, M. Velliste, J. Yao, and G. Porreca, “Searching Online Journals for Fluorescence Microscope Images Depicting Protein Subcellular Locations,” in Proceedings of the 2nd IEEE International Symposium on Bio-Informatics and Biomedical Engineering (BIBE 2001), Bethesda, MD, USA, 2001, pp. 119–128.Google Scholar
- 16.M. Velliste and R.F. Murphy, “Automated Determination of Protein Subcellular Locations from 3D Fluorescence Microscope Images,” in Proceedings of the 2002 IEEE International Symposium on Biomedical Imaging (ISBI-2002), Bethesda, MD, USA, 2002, pp. 867–870.Google Scholar