Compressive Sensing for Background Subtraction

  • Volkan Cevher
  • Aswin Sankaranarayanan
  • Marco F. Duarte
  • Dikpal Reddy
  • Richard G. Baraniuk
  • Rama Chellappa
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5303)


Compressive sensing (CS) is an emerging field that provides a framework for image recovery using sub-Nyquist sampling rates. The CS theory shows that a signal can be reconstructed from a small set of random projections, provided that the signal is sparse in some basis, e.g., wavelets. In this paper, we describe a method to directly recover background subtracted images using CS and discuss its applications in some communication constrained multi-camera computer vision problems. We show how to apply the CS theory to recover object silhouettes (binary background subtracted images) when the objects of interest occupy a small portion of the camera view, i.e., when they are sparse in the spatial domain. We cast the background subtraction as a sparse approximation problem and provide different solutions based on convex optimization and total variation. In our method, as opposed to learning the background, we learn and adapt a low dimensional compressed representation of it, which is sufficient to determine spatial innovations; object silhouettes are then estimated directly using the compressive samples without any auxiliary image reconstruction. We also discuss simultaneous appearance recovery of the objects using compressive measurements. In this case, we show that it may be necessary to reconstruct one auxiliary image. To demonstrate the performance of the proposed algorithm, we provide results on data captured using a compressive single-pixel camera. We also illustrate that our approach is suitable for image coding in communication constrained problems by using data captured by multiple conventional cameras to provide 2D tracking and 3D shape reconstruction results with compressive measurements.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Elgammal, A., Harwood, D., Davis, L.: Non-parametric model for background subtraction. In: IEEE FRAME-RATE Workshop. Springer, Heidelberg (1999)Google Scholar
  2. 2.
    Piccardi, M.: Background subtraction techniques: a review. In: IEEE International Conference on Systems, Man and Cybernetics, vol. 4 (2004)Google Scholar
  3. 3.
    Cheung, G.K.M., Kanade, T., Bouguet, J.Y., Holler, M.: Real time system for robust 3 D voxel reconstruction of human motions. In: CVPR, pp. 714–720 (2000)Google Scholar
  4. 4.
    Wakin, M.B., Laska, J.N., Duarte, M.F., Baron, D., Sarvotham, S., Takhar, D., Kelly, K.F., Baraniuk, R.G.: An architecture for compressive imaging. In: ICIP, Atlanta, GA, October 2006, pp. 1273–1276 (2006)Google Scholar
  5. 5.
    Candes, E.: Compressive sampling. In: Proceedings of the International Congress of Mathematicians (2006)Google Scholar
  6. 6.
    Donoho, D.L.: Compressed Sensing. IEEE Trans. Info. Theory 52(4), 1289–1306 (2006)MathSciNetCrossRefMATHGoogle Scholar
  7. 7.
    Mallat, S., Zhang, S.: Matching pursuits with time-frequency dictionaries. Matching pursuits with time-frequency dictionaries 41(12), 3397–3415 (1993)MATHGoogle Scholar
  8. 8.
    Aggarwal, A., Biswas, S., Singh, S., Sural, S., Majumdar, A.K.: Object Tracking Using Background Subtraction and Motion Estimation in MPEG Videos. In: ACCV, pp. 121–130. Springer, Heidelberg (2006)Google Scholar
  9. 9.
    Lamarre, M., Clark, J.J.: Background subtraction using competing models in the block-DCT domain. In: ICPR (2002)Google Scholar
  10. 10.
    Wang, W., Chen, D., Gao, W., Yang, J.: Modeling background from compressed video. In: IEEE Int. Workshop on VSPE of TS, pp. 161–168 (2005)Google Scholar
  11. 11.
    Töreyin, B.U., Çetin, A.E., Aksay, A., Akhan, M.B.: Moving object detection in wavelet compressed video. Signal Processing: Image Communication 20(3), 255–264 (2005)Google Scholar
  12. 12.
    Oliver, N., Rosario, B., Pentland, A.: A Bayesian Computer Vision System for Modeling Human Interactions. In: ICVS. Springer, Heidelberg (1999)Google Scholar
  13. 13.
    Uttam, S., Goodman, N.A., Neifeld, M.A.: Direct reconstruction of difference images from optimal spatial-domain projections. In: Proc. SPIE, San Diego, CA, August 2008, vol. 7096 (2008)Google Scholar
  14. 14.
    Chen, S.S., Donoho, D.L., Saunders, M.A.: Atomic Decomposition by Basis Pursuit. SIAM Journal on Scientific Computing 20, 33 (1998)MathSciNetCrossRefMATHGoogle Scholar
  15. 15.
    Mallat, S.: A Wavelet Tour of Signal Processing. Academic Press, London (1999)MATHGoogle Scholar
  16. 16.
    Cevher, V., Chellappa, R., McClellan, J.H.: Gaussian approximations for energy-based detection and localization in sensor networks. In: IEEE Statistical Signal Processing Workshop, Madison, WI, August 26–29 (2007)Google Scholar
  17. 17.
    Van Trees, H.L.: Detection, Estimation, and Modulation Theory, Part I. John Wiley & Sons, Inc., Chichester (1968)MATHGoogle Scholar
  18. 18.
    Hale, E.T., Yin, W., Zhang, Y.: A fixed-point continuation method for ℓ1-regularized minimization with applications to compressed sensing. Technical Report TR07-07, Rice University Department of Computational and Applied Mathematics, Houston, TX (2007)Google Scholar
  19. 19.
    Joo, S., Zheng, Q.: A Temporal Variance-Based Moving Target Detector. In: Proc. IEEE Int. Workshop on Performance Evaluation of Tracking and Surveillance (PETS) (2005)Google Scholar
  20. 20.
    Matsumoto, M., Nishimura, T.: Mersenne Twister: A 623-Dimensionally Equidistributed Uniform Pseudo-Random Number Generator. ACM Transactions on Modeling and Computer Simulation 8(1), 3–30 (1998)CrossRefMATHGoogle Scholar
  21. 21.
    Martin, D., Fowlkes, C., Tal, D., Malik, J.: A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. In: Proc. 8th Int’l Conf. Computer Vision, July 2001, vol. 2, pp. 416–423 (2001)Google Scholar
  22. 22.
    Khan, S.M., Shah, M.: A multi-view approach to tracking people in crowded scenes using a planar homography constraint. In: ECCV 2006, vol. 4, pp. 133–146 (2006)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Volkan Cevher
    • 1
  • Aswin Sankaranarayanan
    • 2
  • Marco F. Duarte
    • 1
  • Dikpal Reddy
    • 2
  • Richard G. Baraniuk
    • 1
  • Rama Chellappa
    • 2
  1. 1.Rice University, ECE, Houston
  2. 2.University of Maryland, UMIACS

Personalised recommendations