Abstract
In telepresence applications each user is immersed in a rendered 3D-world composed from representations transmitted from remote sites. The challenge is to compute dense range data at high frame rates, since participants cannot easily communicate if the processing cycle or network latencies are long. Moreover, errors in new stereoscopic views of the remote 3D-world should be hardly perceptible. To achieve the required speed and accuracy, we use trinocular stereo, a matching algorithm based on the sum of modified normalized cross-correlations, and subpixel disparity interpolation. To increase speed we use Intel IPL functions in the pre-processing steps of background subtraction and image rectification as well as a four-processor parallelization. To evaluate our system we have developed a test-bed which provides a set of registered dense “ground-truth” laser data and image data from multiple views.
Similar content being viewed by others
References
Ayache, N. 1991. Artificial Vision for Mobile Robots: Stereo Vision and Multisensory Perception. The MIT Press: Cambridge, MA.
Banks, J. and Corke, P. 2001. Quantitative evaluation of matching methods and validity measures. The International Journal of Robotics Research, 20(7):512–532.
Barron, J.L., Fleet, D.J., and Beauchemin, S.S. 1994. Performance of optical flowtechniques. International Journal of Computer Vision, 12:43–78.
Belhumeur, P. 1996. A bayesian approach to binocular stereopsis. Intl. J. of Computer Vision, 19(3):237–260.
Cheung, G., Kanade, T., Bouguet, J., and Holler, M. 2000. A real time system for robust 3d voxel reconstruction of human motions. In IEEE Conf. Computer Vision and Pattern Recognition, Hilton Head Island, SC, June 13–15, 2000, pp. 714–720.
Dhond, U. and Aggrawal, J. 1989. Structure from stereo: A review. IEEE Transactions on Systems, Man, and Cybernetics, 19(6):1489–1510.
Faugeras, O. 1993. Three-Dimensional Computer Vision. MIT-Press: Cambridge, MA.
Faugeras, O. 1993. Three-Dimensional Computer Vision: A Geometric Viewpoint. The MIT Press: Cambridge, MA.
Kutulakos, K.N. and Seitz, S.M. 1999. A theory of shape by space carving. In Proc. Int. Conf. on Computer Vision, pp. 307–314.
Lanier, J. 2001. Virtually there. Scientific American, 284:66–75.
Leclerc, Y., Luong, Q.T., and Fua, P. 2000. Measuring the self-consistency of stereo algorithms. In Proc. Sixth European Conference on Computer Vision, Dublin, Ireland, pp. 282–298.
Martins, F.C.M., Nickerson, B.R., Bostrom, V., and Hazra, R. 1999. Implementation of a real-time foreground/background segmentation system on the intel architecture. In IEEE ICCV99 Frame Rate Workshop, Kerkyra, Greece, Sept. 1999.
Matthies, L. 1992. Stereo vision for planetary rovers: Stochastic modeling to near real-time implementation. International Journal of Computer Vision, 8:71–91.
Moezzi, S., Tai, L.-C., and Gerard, Ph. 1997. Virtual view generation from 3d digital video. IEEE Multimedia, 4:18–26.
Mulligan, J. and Daniilidis, K. 2000. Predicting disparity windows for real-time stereo. In Proc. Sixth European Conference on Computer Vision, Dublin, Ireland, pp. 220–235.
Mulligan, J. and Daniilidis, K. 2000. View-independent scene acquisition for tele-presence. In Proc. Int. Symposium on Augmented Reality, Munich, Germany, Oct. 5–6, 2000, pp. 105–110.
Mulligan, J., Isler, V., and Daniilidis, K. 2001. Performance evaluation of stereo for tele-presence. In Proc. Int. Conf. on Computer Vision, Vancouver, Canada, July 9–12, 2001, accepted.
Narayanan, P., Rander, P., and Kanade, T. 1998. Constructing virtual worlds using dense stereo. In Proc. Int. Conf. on Computer Vision, pp. 3–10.
Ohta, Y., Watanabe, M., and Ikeda, K. 1986. Improving depth map by right-angled trinocular stereo. In Proceedings of the 8th International Conference on Pattern Recognition (ICPR'86), Vol. I, Paris, France, Oct. 1986, pp. 519–521.
Okutomi, M. and Kanade, T. 1993. A multiple-baseline stereo. IEEE Trans. on Pattern Analysis and Machine Intelligence, 15(4):353–363.
Okutomi, M. and Kanade, T. 1993. A multiple-baseline stereo. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 15(4):353–363.
Raskar, R., Welch, G., Cutts, M., Lake, A., Stesin, L., and Fuchs, H. 1998. The office of the future: A unified approach to image-based modeling and spatially immersive displays. In ACM SIGGRAPH, pp. 179–188.
Scharstein, D., Szeliski, R., and Zabih, R. 2001. A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. In Proceedings of the IEEE Workshop on Stereo and Multi-Baseline Vision, Kauai, HI, Dec. 2001.
Seitz, S.M. and Dyer, C.R. 1997. Photorealistic scene reconstruction by voxel coloring. In IEEE Conf. Computer Vision and Pattern Recognition, Puerto Rico, June 17–19, 1997, pp. 1067–1073.
Snow, D., Viola, P., and Zabih, R. 2000. Exact voxel occupancy with graph cuts. In IEEE Conf. Computer Vision and Pattern Recognition, Hilton Head Island, SC, June 13–15, 2000.
Szeliski, R. 1999. Prediction error as a quality metric for motion and stereo. In Proc. Int. Conf. on Computer Vision, Kerkyra, Greece, Sept. 20–23, 1999.
Szeliski, R. and Zabih, R. 1999. An experimental comparison of stereo algorithms. In Workshop on Visual Algorithms.
Tomasi, C. and Manduchi, R. 1996. Stereo without search. Proc. European Conf. Computer Vision.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Mulligan, J., Isler, V. & Daniilidis, K. Trinocular Stereo: A Real-Time Algorithm and its Evaluation. International Journal of Computer Vision 47, 51–61 (2002). https://doi.org/10.1023/A:1014525320885
Issue Date:
DOI: https://doi.org/10.1023/A:1014525320885