Randomized Tree Ensembles for Object Detection in Computational Pathology
Modern pathology broadly searches for biomarkers which are predictive for the survival of patients or the progression of cancer. Due to the lack of robust analysis algorithms this work is still performed manually by estimating staining on whole slides or tissue microarrays (TMA). Therefore, the design of decision support systems which can automate cancer diagnosis as well as objectify it pose a highly challenging problem for the medical imaging community.
In this paper we propose Relational Detection Forests (RDF) as a novel object detection algorithm, which can be applied in an off-the-shelf manner to a large variety of tasks. The contributions of this work are twofold: (i) we describe a feature set which is able to capture shape information as well as local context. Furthermore, the feature set is guaranteed to be generally applicable due to its high flexibility. (ii) we present an ensemble learning algorithm based on randomized trees, which can cope with exceptionally high dimensional feature spaces in an efficient manner. Contrary to classical approaches, subspaces are not split based on thresholds but by learning relations between features.
The algorithm is validated on tissue from 133 human clear cell renal cell carcinoma patients (ccRCC) and on murine liver samples of eight mice. On both species RDFs compared favorably to state of the art methods and approaches the detection accuracy of trained pathologists.
KeywordsRenal Cell Carcinoma Object Detection Clear Cell Renal Cell Carcinoma Voronoi Tessellation Trained Pathologist
Unable to display preview. Download preview PDF.
- 3.Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features (2001)Google Scholar
- 5.Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR 2005: Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), Washington, DC, USA, vol. 1, pp. 886–893. IEEE Computer Society, Los Alamitos (2005)Google Scholar
- 6.Freund, Y., Schapire, R.: Experiments with a new boosting algorithm. In: Machine Learning: Proceedings of the Thirteenth International Conference, pp. 148–156 (1996)Google Scholar
- 11.R Development Core Team: R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria (2009) ISBN 3-900051-07-0Google Scholar
- 12.Hall, B., Chen, W., Reiss, M., Foran, D.J.: A clinically motivated 2-fold framework for quantifying and classifying immunohistochemically stained specimens. In: Ayache, N., Ourselin, S., Maeder, A. (eds.) MICCAI 2007, Part II. LNCS, vol. 4792, pp. 287–294. Springer, Heidelberg (2007)CrossRefGoogle Scholar
- 13.Yang, L., Chen, W., Meer, P., Salaru, G., Feldman, M.D., Foran, D.J.: High throughput analysis of breast cancer specimens on the grid. Med. Image Comput. Comput. Assist. Interv. Int. Conf. Med. Image Comput. Comput. Assist Interv. 10, 617–625 (2007)Google Scholar