A Fast Random Sampling Algorithm for Sparsifying Matrices

  • Sanjeev Arora
  • Elad Hazan
  • Satyen Kale
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4110)


We describe a simple random-sampling based procedure for producing sparse matrix approximations. Our procedure and analysis are extremely simple: the analysis uses nothing more than the Chernoff-Hoeffding bounds. Despite the simplicity, the approximation is comparable and sometimes better than previous work.

Our algorithm computes the sparse matrix approximation in a single pass over the data. Further, most of the entries in the output matrix are quantized, and can be succinctly represented by a bit vector, thus leading to much savings in space.


Error Parameter Input Matrix Unit Eigenvector Eigenvector Computation Lanczos Iteration 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. [AHK05]
    Arora, S., Hazan, E., Kale, S.: Fast algorithms for approximate semidefinite programming using the multiplicative weights update method. In: 46th FOCS, pp. 339–348 (2005)Google Scholar
  2. [AM01]
    Achlioptas, D., McSherry, F.: Fast computation of low rank matrix approximations. In: 32nd STOC, pp. 611–618 (2001)Google Scholar
  3. [DFK+04]
    Drineas, P., Frieze, A.M., Kannan, R., Vempala, S., Vinay, V.: Clustering large graphs via the singular value decomposition. Machine Learning 56(1-3), 9–33 (2004)MATHCrossRefGoogle Scholar
  4. [DK03]
    Drineas, P., Kannan, R.: Pass efficient algorithms for approximating large matrices. In: SODA, pp. 223–232 (2003)Google Scholar
  5. [DMM06]
    Drineas, P., Mahoney, M., Muthukrishnan, S.: Column-based relative-error. In: Díaz, J., Jansen, K., Rolim, J.D.P., Zwick, U. (eds.) APPROX 2006 and RANDOM 2006. LNCS, vol. 4110, Springer, Heidelberg (2006)CrossRefGoogle Scholar
  6. [DV06]
    Deshpande, A., Vempala, S.: Adaptive sampling and fast low-rank matrix approximation. In: Díaz, J., Jansen, K., Rolim, J.D.P., Zwick, U. (eds.) APPROX 2006 and RANDOM 2006. LNCS, vol. 4110, Springer, Heidelberg (2006)CrossRefGoogle Scholar
  7. [FKV04]
    Frieze, A.M., Kannan, R., Vempala, S.: Fast monte-carlo algorithms for finding low-rank approximations. J. ACM 51(6), 1025–1041 (2004)MATHCrossRefMathSciNetGoogle Scholar
  8. [FO05]
    Feige, U., Ofek, E.: Spectral techniques applied to sparse random graphs. Random Structures and Algorithms 27(2), 251–275 (2005)MATHCrossRefMathSciNetGoogle Scholar
  9. [Hoe63]
    Hoeffding, W.: Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association 58(301), 13–30 (1963)MATHCrossRefMathSciNetGoogle Scholar
  10. [MR95]
    Motwani, R., Raghavan, P.: Randomized Algorithms. Cambridge University Press, Cambridge (1995)MATHGoogle Scholar
  11. [TB97]
    Trefethen, L.N., Bau, D.: Numerical Linear Algebra. SIAM, Philadelphia (1997)MATHCrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Sanjeev Arora
    • 1
  • Elad Hazan
    • 1
  • Satyen Kale
    • 1
  1. 1.Computer Science DepartmentPrinceton UniversityPrincetonUSA

Personalised recommendations