Skip to main content

Distributed Mobile Computer Vision: Advances, Challenges and Applications

  • Chapter
  • First Online:
Distributed Embedded Smart Cameras

Abstract

The role of mobile devices has shifted from purely passively transmitting text messages and voice calls to proactively providing any kind of information that is also accessible to a PC. The recent advances in the field of micro technology have also made possible to include a camera sensor in any mobile device. This innovation is now attracting both the research community and the industries that aim to develop mobile applications that exploit recent computer vision algorithms. In this chapter we provide an analysis of the recent advances of mobile computer vision, then we discuss the current challenges that the community is currently dealing with. Next, an analysis of two recent case studies where mobile vision is used for augmented reality and surveillance applications is discussed. Finally, we introduce the next challenges in mobile vision where the mobile devices are part of a visual sensor network.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Agrawal M, Konolige K, Blas MR (2008) CenSurE: center surround extremas for realtime feature detection and matching. In: Forsyth D, Torr P, Zisserman A (eds) European conference on computer vision. Lecture notes in computer science, vol 5305. Springer, Berlin, pp 102–115. doi:10.1007/978-3-540-88693-8

  2. Alahi A, Ortiz R, Vandergheynst P (2012) FREAK: fast retina keypoint. In: International conference on computer vision and pattern recognition, pp 510–517. doi:10.1109/CVPR.2012.6247715, http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=6247715

  3. Amazon: SnapTell (2007) http://www.A9.com

  4. Bay H, Ess A, Tuytelaars T, Van Gool L (2008) Speeded-up robust features (SURF). Comput Vis Image Underst 110(3):346–359. doi:10.1016/j.cviu.2007.09.014, http://linkinghub.elsevier.com/retrieve/pii/S1077314207001555

  5. Bay H, Fasel B, Gool LV (2006) Interactive museum guide: fast and robust recognition of museum objects. In: International workshop on mobile vision

    Google Scholar 

  6. Bolliger P, Köhler M, Römer K (2007) Facet: towards a smart camera network of mobile phones. In: 1st international conference on Autonomic computing and communication systems, p 17

    Google Scholar 

  7. Bosch A, Zisserman A, Munoz X (2007) Image classification using random forests and ferns. In: International conference on computer vision, pp 1–8. doi:10.1109/ICCV.2007.4409066

  8. Brown M, Lowe DG (2006) Automatic panoramic image stitching using invariant features. Int J Comput Vis 74(1):59–73. doi:10.1007/s11263-006-0002-3

    Article  MathSciNet  Google Scholar 

  9. Bruns E, Bimber O (2008) Adaptive training of video sets for image recognition on mobile phones. Pers Ubiquitous Comput 13(2):165–178. doi:10.1007/s00779-008-0194-3

    Article  Google Scholar 

  10. Davison AJ, Reid ID, Molton ND, Stasse O (2007) MonoSLAM: real-time single camera SLAM. IEEE Trans Pattern Anal Mach Intell 29(6):1052–67. doi:10.1109/TPAMI.2007.1049, http://www.ncbi.nlm.nih.gov/pubmed/17431302

  11. Fritz G, Seifert C, Paletta L (2006) A mobile vision system for urban detection with informative local descriptors. In: International conference on computer vision systems, pp 30–30. doi:10.1109/ICVS.2006.5, http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=1578718

  12. Girod B, Chandrasekhar V, Chen D, Cheung NM, Grzeszczuk R, Reznik Y, Takacs  G, Tsai S, Vedantham R (2011) Mobile visual search. IEEE Sig Process Magazine 28(4):61–76. doi:10.1109/MSP.2011.940881, http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=5888642

  13. Google: Google Goggles (2009). http://www.google.com/mobile/goggles

  14. Greene K (2006) Hyperlinking reality via phones. MIT Technology Review, USA

    Google Scholar 

  15. Gruber L, Richter-Trummer T, Schmalstieg D (2012) Real-time photometric registration from arbitrary geometry. In: 2012 IEEE international symposium on mixed and augmented reality (ISMAR), pp 119–128. doi:10.1109/ISMAR.2012.6402548

  16. Hagbi N, Bergig O, El-Sana J, Billinghurst M (2011) Shape recognition and pose estimation for mobile Augmented Reality. IEEE Trans Vis Comput Graph 17(10):1369–79. doi:10.1109/TVCG.2010.241, http://www.ncbi.nlm.nih.gov/pubmed/21041876

  17. Hall SP, Anderson E (2009) Operating systems for mobile computing. J Comput Sci Coll 25(2):64–71

    Google Scholar 

  18. Hoff WA, Nguyen K, Lyon T (1996) Computervision-based registration techniques for augmented reality. In: Casasent DP (ed) Intelligent robots and computer vision XV, 2904, pp 538–548, doi:10.1117/12.256311, http://proceedings.spiedigitallibrary.org/proceeding.aspx?

  19. IOnRoad: iOnRoad. http://www.ionroad.com/

  20. Kooaba: Kooaba (2007). http://www.kooaba.com

  21. LeafSnap: LeafSnap. http://www.leafsnap.com/

  22. Lepetit V (2008) On computer vision for augmented reality. In: 2008 international symposium on ubiquitous virtual reality, pp 13–16 (2008). doi:10.1109/ISUVR.2008.10, http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=4568635

  23. Leutenegger S, Chli M, Siegwart RY (2011) BRISK: binary robust invariant scalable keypoints. In: 2011 International conference on computer vision, pp 2548–2555. doi:10.1109/ICCV.2011.6126542, http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=6126542

  24. Li Z, Yap KH (2012) Content and context boosting for mobile landmark recognition. In: IEEE Sig Process Lett 19(8):459–462. doi:10.1109/LSP.2012.2203120, http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=6213072

  25. Liu X, Kulkarni P, Shenoy P, Ganesan D (2006) Snapshot: a self-calibration protocol for camera sensor networks. In: IEEE 3rd international conference on broadband communications, networks and systems, 2006 (BROADNETS 2006), pp 1–10

    Google Scholar 

  26. Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110. doi:10.1023/B:VISI.0000029664.99615.94

    Article  Google Scholar 

  27. Martinel N, Micheloni C (2012) Re-identify people in wide area camera network. In: 2012 IEEE computer society conference on computer vision and pattern recognition workshops, pp 31–36. IEEE, Providence, RI. doi:10.1109/CVPRW.2012.6239203, http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=6583277

  28. Martinel N, Micheloni C, Foresti GL (2013) Robust painting recognition and registration for mobile augmented reality. IEEE Sig Process Lett 20(11):1022–1025. doi:10.1109/LSP.2013.2279014, http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=6604402

  29. Martinel N, Micheloni C, Piciarelli C, Foresti GL (2013) Camera selection for adaptiveHuman-computer interface. In: IEEE transactions on systems, man, and cybernetics: systems pp 1–1. doi:10.1109/TSMC.2013.2279661, http://www.sciencedirect.com/science/article/pii/S0893608011002693

  30. Micheloni C, Rani A, Kumar S, Foresti GL (2012) A balanced neural tree for pattern classification. Neural Netw 27:81–90. doi:10.1016/j.neunet.2011.10.007

    Article  Google Scholar 

  31. Micheloni C, Remagnino P, Eng HL, Geng J (2010) Intelligent monitoring of complex environments. IEEE Intell Syst 25(3):12–14. doi:10.1109/MIS.2010.85

    Article  Google Scholar 

  32. Miksik O, Mikolajczyk K (2012) Evaluation of local detectors and descriptors for fast feature matching. In: International conference on pattern recognition. Tsukuba, Japan, pp 2681–2684. ISBN:978-1-4673-2216-4

    Google Scholar 

  33. Neurotechnology: Neurotechnology Sentisight. http://www.neurotechnology.com/sentisight.html

  34. Nokia: Nokia Point and Find (2006) https://betalabs.nokia.com/trials/nokia-point-and-find

  35. Qualcomm: Qualcomm Vuforia. http://www.qualcomm.com/solutions/augmented-reality

  36. Römer K (2001) Time synchronization in ad hoc networks. In: Proceedings of the 2nd ACM international symposium on Mobile ad hoc networking and computing, pp 173–182. ACM (2001)

    Google Scholar 

  37. Rosten E, Porter R, Drummond T (2010) Faster and better: a machine learning approach to corner detection. IEEE Trans Pattern Anal Mach Intell 32(1):105–19, doi:10.1109/TPAMI.2008.275, http://www.ncbi.nlm.nih.gov/pubmed/19926902

  38. Rublee E, Rabaud V, Konolige K, Bradski G (2011) ORB: an efficient alternative to SIFT or SURF. In: IEEE International conference on computer vision. Barcellona, Spain, pp 2564–2571. doi:10.1109/ICCV.2011.6126544, http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=6126544

  39. Schroth G, Huitl R, Chen D, Abu-Alqumsan M, Al-Nuaimi A, Steinbach E (2011) Mobile visual location recognition. IEEE Sig Process Mag 28(4):77–89. doi:10.1109/MSP.2011.940882, http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=5888650

  40. Seifert C, Paletta L, Jeitler A, Hödl E, Andreu JP, Luley P, Almer A (2004) Visual object detection for mobile road sign inventory. In: International symposium on mobile, human–computer interaction, pp 491–495. doi:

    Google Scholar 

  41. Skrypnyk I, Lowe D (2004) Scene modelling, recognition and tracking with invariant image features. In: IEEE International symposium on mixed and augmented reality, pp 110–119. doi:10.1109/ISMAR.2004.53, http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=1383048

  42. Takacs G, Chandrasekhar V, Gelfand N, Xiong Y, Chen WC, Bismpigiannis T, Grzeszczuk R, Pulli K, Girod B (2008) Outdoors augmented reality on mobile phone using loxel-based visual feature organization. In: Proceeding of the 1st ACM international conference on Multimedia information retrieval—MIR’08, p 427. doi:10.1145/1460096.1460165, http://portal.acm.org/citation.cfm?doid=1460096.1460165

  43. Takacs G, Chandrasekhar V, Girod B, Grzeszczuk R (2007) Feature tracking for mobile augmented reality using video coder motion vectors. In: IEEE international symposium on mixed and augmented reality, pp 1–4. doi:10.1109/ISMAR.2007.4538838, http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=4538838

  44. W3C: Mobile Web Initiative Device Description Working Group Charter (2005) http://www.w3.org/2005/01/DDWGCharter/

  45. W3C: W3C Extensible multiModal annotation markup language (2009) http://www.w3.org/TR/emma/

  46. Wagner D, Reitmayr G, Mulloni A, Drummond T, Schmalstieg D (2010) Real-time detection and tracking for augmented reality on mobile phones. IEEE Trans Visual Comput Graph 16(3):355–68 (2010). doi:10.1109/TVCG.2009.99, http://www.ncbi.nlm.nih.gov/pubmed/20224132

  47. Xu L, Oja E (1993) Randomized hough transform (RHT): basic mechanisms, algorithms, and computational complexities. CVGIP Image Underst 57(2):131–154. doi:10.1006/ciun.1993.1009, http://linkinghub.elsevier.com/retrieve/pii/S1049966083710090

  48. Yeh T, Tollmar K, Darrell T (2004) Searching the web with mobile images for location recognition. In: IEEE International conference on computer vision and pattern recognition, vol 2, pp. 76–81. doi:10.1109/CVPR.2004.1315147

  49. You B, Lane ND, Chen F, Wang R, Chen Z, Bao TJ, Cheng Y, Lin M, Torresani L, Campbell AT (2013) CarSafe App: alerting drowsy and distracted drivers using dual cameras on smartphones. http://now.dartmouth.edu/2012/09/dartmouth-smartphone-app-targets-driver-safety/

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Niki Martinel .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer Science+Business Media New York

About this chapter

Cite this chapter

Martinel, N., Prati, A., Micheloni, C. (2014). Distributed Mobile Computer Vision: Advances, Challenges and Applications. In: Bobda, C., Velipasalar, S. (eds) Distributed Embedded Smart Cameras. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-7705-1_5

Download citation

  • DOI: https://doi.org/10.1007/978-1-4614-7705-1_5

  • Published:

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4614-7704-4

  • Online ISBN: 978-1-4614-7705-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics