Progress in Artificial Intelligence

, Volume 1, Issue 4, pp 259–265 | Cite as

Effective real-time visual object detection

  • Francisco MartínEmail author
  • Manuela Veloso
Regular Paper


Autonomous mobile robots equipped with visual perception aim at detecting objects towards intelligently acting in their environments. Such real-time vision processing continues to offer challenges in terms of getting the object detection algorithm to process images at the frame rate of live video. Our work contributes a novel algorithm that is capable of making use of all the frames, where each frame is efficiently processed as a “continuation” of the processing of the previous frames. From the 2D camera images as captured by the robot, our algorithm, Wave3D, maintains 3D hypotheses of the presence of the objects in the real 3D world relative to the robot. The algorithm does not ignore any new frame and continues its object detection on each frame by projecting the 3D hypotheses back into the 2D images to focus the object detection. We can view Wave3D as validating the 3D hypotheses in each of the images in the live video. Wave3D outperforms the static single-image classical approach in processing effort and detection accuracy, in particular for moving objects. In addition, the resulting reduced vision processing time translates into more computation available for task-related behaviors, as greatly needed in situated autonomous intelligent robot agents. We conduct targeted experiments using the humanoid NAO robot that illustrate the effectiveness of Wave3D.


Robot perception Humanoid Robot soccer 


  1. 1.
    Rowley, H.A., Baluja, S., Kanade, T.: Neural Network-Based Face Detection. IEEE Trans. Pattern Anal. Mach. Intell 20, 23–38 (1998)CrossRefGoogle Scholar
  2. 2.
    Hieselem, B., Ho, P., Poggio, T., Ho, P., Poggio, T.: Face recognition with support vector machines: global versus component-based approach. In: Proceedings of 8th International Conference on Computer Vision, pp. 688–694 (2001)Google Scholar
  3. 3.
    Ratsch, M., Blumer, C., Teschke, G., Vetter, T.: 3D Cascaded condensation tracking for multiple objects. In: Proceedings of 7th IASTED International Conference Signal Processing, Pattern Recognition and Applications, pp. 361–368 (2010)Google Scholar
  4. 4.
    Denman, S., Lamb, T., Fookes, C.B., Sridharan, S., Chandran, V.: Multi-sensor tracking using a scalable condensation filter. In: Proceedings of the International Conference on Signal Processing and Communication Systems (2007)Google Scholar
  5. 5.
    Ghosh, P., Manjunath, B.S., Ramakrishnan, K.R.: A compact image signature for RTS-invariant image retrieval. In: IEEE International Conference on Visual Information Engineering (VIE 2006), pp. 304–308 (2006)Google Scholar
  6. 6.
    Coifman, B., Beymer, D., McLauchlan, P., Malik, J.: A real-time computer vision system for vehicle tracking and traffic surveillance. J. Transp. Res. Part C: Emerg. Technol. 6, 271–281 (1998)CrossRefGoogle Scholar
  7. 7.
    Fuentes, L.M.,Velastin, S.A.: People tracking in surveillance applications. J Image Vision Compu 24(11), 1165–1171 (2006)CrossRefGoogle Scholar
  8. 8.
    Hastings, W.K.: Monte Carlo sampling methods using Markov chains and their applications. Biometrika 57(1), 97–109 (1970)zbMATHCrossRefGoogle Scholar
  9. 9.
    Dellaert, F., Seitz, S.M., Thorpe, C.E., Thrun, S.: EM,MCMC, and chain flipping for structure from motion with unknown correspondence. J. Machine Learn 50(1–2), 45–71 (2003)zbMATHCrossRefGoogle Scholar
  10. 10.
    Hue, C., Le Cadre, J.P., Perez, P.: Sequential Monte Carlo methods for multiple target tracking and data fusion. IEEE Trans. Signal Process. 50(2), 309–325 (2002)Google Scholar
  11. 11.
    Barrera, P., Caas, J.M., Matellán, V., Martín, F.: Multicamera 3d tracking using particle filter. In: International Conference on Multimedia, Image Processing and Computer Vision (2005)Google Scholar
  12. 12.
    Nilsson, N.J.: Shakey The Robot. Technical Report, AI Center, SRI International (1984)Google Scholar
  13. 13.
    Bruce, J., Balch, T., Veloso, M.: Fast and inexpensive color image segmentation for interactive robots. In: Proceedings of IROS-2000, pp. 2061–2066 (2000)Google Scholar
  14. 14.
    Akin, H.L., Mericli, T., Özkucur, N.E., Kavakhoglu, C., Gökce, B.: Cerberus’10 SPL Team, Technical Report Bogazici University, Department of Computer Engineering (2010)Google Scholar
  15. 15.
    Laue, T., Jeffry de Haas, T., Burchardt, A., Graf, C., Röfer, T., Härtl, A., Rieskamp, A.: Efficient and reliable sensor models for humanoid soccer robot self-localization. In: Proceedings of the Fourth Workshop on Humanoid Soccer Robots in conjunction with the 2009 IEEE-RAS International Conference on Humanoid Robots. pp. 22–29 (2009)Google Scholar

Copyright information

© Springer-Verlag 2012

Authors and Affiliations

  1. 1.Robotics LabGSyC, Universidad Rey Juan CarlosMóstoles, MadridSpain
  2. 2.Computer Science DepartmentCarnegie Mellon UniversityPittsburghUSA

Personalised recommendations