Abstract
The goal of this chapter is to demonstrate how concentration of measure plays a central role in these modern randomized algorithms. There is a convergence of sensing, computing, networking and control. Data base is often neglected in traditional treatments in estimation, detection, etc.
Modern scientific computing demands efficient algorithms for dealing with large datasets—Big Data. Often these datasets can be fruitfully represented and manipulated as matrices; in this case, fast low-error methods for making basic linear algebra computations are key to efficient algorithms. Examples of such foundational computational tools are low-rank approximations, matrix sparsification, and randomized column subset selection.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Outer products x y T of two vectors x and y are rank-one matrices.
Bibliography
N. Nguyen, P. Drineas, and T. Tran, “Tensor sparsification via a bound on the spectral norm of random tensors,” arXiv preprint arXiv:1005.4732, 2010.
D. Hsu, S. Kakade, and T. Zhang, “Tail inequalities for sums of random matrices that depend on the intrinsic dimension,” 2011.
N. Nguyen, P. Drineas, and T. Tran, “Matrix sparsification via the khintchine inequality,” 2009.
D. Donoho et al., “High-dimensional data analysis: The curses and blessings of dimensionality,” AMS Math Challenges Lecture, pp. 1–32, 2000.
P. Drineas, R. Kannan, and M. Mahoney, “Fast monte carlo algorithms for matrices i: Approximating matrix multiplication,” SIAM Journal on Computing, vol. 36, no. 1, p. 132, 2006.
C. Boutsidis and A. Gittens, “Im@articlegittens2012var, title=Var (Xjk), author=GITTENS, A., year=2012,” arXiv preprint arXiv:1204.0062, 2012.
A. GITTENS, “Var (xjk),” 2012.
M. Magdon-Ismail, “Using a non-commutative bernstein bound to approximate some matrix algorithms in the spectral norm,” Arxiv preprint arXiv:1103.5453, 2011.
M. Magdon-Ismail, “Row sampling for matrix algorithms via a non-commutative bernstein bound,” Arxiv preprint arXiv:1008.0587, 2010.
C. Faloutsos, T. Kolda, and J. Sun, “Mining large time-evolving data using matrix and tensor tools,” in ICDM Conference, 2007.
M. Mahoney, “Randomized algorithms for matrices and data,” Arxiv preprint arXiv:1104.5557, 2011.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer Science+Business Media New York
About this chapter
Cite this chapter
Qiu, R., Wicks, M. (2014). Database Friendly Data Processing. In: Cognitive Networked Sensing and Big Data. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-4544-9_12
Download citation
DOI: https://doi.org/10.1007/978-1-4614-4544-9_12
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-4543-2
Online ISBN: 978-1-4614-4544-9
eBook Packages: EngineeringEngineering (R0)