Database Friendly Data Processing

Qiu, Robert; Wicks, Michael

doi:10.1007/978-1-4614-4544-9_12

Abstract

The goal of this chapter is to demonstrate how concentration of measure plays a central role in modern randomized algorithms. Sensing, computing, networking, and control are converging. Databases are often neglected in traditional treatments of estimation, detection, etc.

Modern scientific computing demands efficient algorithms for dealing with large datasets—Big Data. Often these datasets can be fruitfully represented and manipulated as matrices; in this case, fast low-error methods for making basic linear algebra computations are key to efficient algorithms. Examples of such foundational computational tools are low-rank approximations, matrix sparsification, and randomized column subset selection.
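As a rough illustration of one of the tools named above, the following sketch shows a very simple form of matrix sparsification: each entry is kept independently with a fixed probability and rescaled so that the sparse matrix is an unbiased estimate of the original. The function names (`sparsify`, `frobenius_error`, `average`), the uniform sampling rule, and the test matrix are assumptions made for this example and are not taken from the chapter. The sketch reports only a Frobenius-norm error; a spectral-norm analysis of the kind that concentration of measure supports is not attempted here.

```python
import random


def sparsify(matrix, keep_prob, rng):
    """Keep each entry independently with probability keep_prob.

    Kept entries are divided by keep_prob, so the expected value of the
    output equals the input matrix (an unbiased estimate).
    """
    if not 0.0 < keep_prob <= 1.0:
        raise ValueError("keep_prob must lie in (0, 1]")
    return [
        [value / keep_prob if rng.random() < keep_prob else 0.0 for value in row]
        for row in matrix
    ]


def frobenius_error(a, b):
    """Frobenius norm of the entrywise difference a - b."""
    return sum((x - y) ** 2 for ra, rb in zip(a, b) for x, y in zip(ra, rb)) ** 0.5


def average(matrices):
    """Entrywise mean of a list of equally sized matrices."""
    n = len(matrices)
    rows, cols = len(matrices[0]), len(matrices[0][0])
    return [[sum(m[i][j] for m in matrices) / n for j in range(cols)] for i in range(rows)]


# Hypothetical 20 x 20 test matrix with entries drawn uniformly from [-1, 1].
rng = random.Random(0)
A = [[rng.uniform(-1.0, 1.0) for _ in range(20)] for _ in range(20)]

single = sparsify(A, 0.5, rng)  # one sparse sample, about half the entries are zero
mean_of_many = average([sparsify(A, 0.5, rng) for _ in range(2000)])

print("error of one sample:     ", frobenius_error(A, single))
print("error of the sample mean:", frobenius_error(A, mean_of_many))
```

Averaging many independent samples drives the error toward zero, which is the unbiasedness property at work. Practical schemes typically choose entry-dependent keep probabilities rather than the uniform rule assumed here.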

Notes

1. The outer product xy^T of two nonzero vectors x and y is a rank-one matrix.
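The note above can be checked on a small example. The sketch below builds an outer product, confirms that every 2 x 2 minor vanishes (so the rank is at most one), and also shows the standard identity that a matrix product equals a sum of such rank-one terms. All function names and the example vectors and matrices are hypothetical and chosen only for illustration.

```python
def outer(x, y):
    """Outer product x y^T, returned as a list of rows."""
    return [[xi * yj for yj in y] for xi in x]


def matmul(a, b):
    """Textbook matrix product using row-by-column inner products."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def matmul_by_outer_products(a, b):
    """Same product, accumulated as a sum of rank-one terms a[:, k] b[k, :]."""
    result = [[0] * len(b[0]) for _ in range(len(a))]
    for k in range(len(b)):
        column_k = [row[k] for row in a]
        term = outer(column_k, b[k])
        result = [[r + t for r, t in zip(rr, tr)] for rr, tr in zip(result, term)]
    return result


def all_2x2_minors_vanish(m):
    """True when every 2 x 2 minor is zero, i.e. rank(m) <= 1."""
    rows, cols = len(m), len(m[0])
    return all(
        m[i][j] * m[k][l] - m[i][l] * m[k][j] == 0
        for i in range(rows) for k in range(i + 1, rows)
        for j in range(cols) for l in range(j + 1, cols)
    )


x, y = [1, 2, 3], [4, 5]
M = outer(x, y)  # every row is a multiple of y
print(M, all_2x2_minors_vanish(M))

A = [[1, 2], [3, 4], [5, 6]]
B = [[7, 8, 9], [10, 11, 12]]
print(matmul_by_outer_products(A, B) == matmul(A, B))
```

Integer inputs are used so that the equality checks are exact. Writing a product as a sum of rank-one terms is what makes it possible to approximate the product by sampling only some of those terms.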

Author information

Authors and Affiliations

Tennessee Technological University, Cookeville, Tennessee, USA
Robert Qiu
Utica, NY, USA
Michael Wicks

About this chapter

Cite this chapter

Qiu, R., Wicks, M. (2014). Database Friendly Data Processing. In: Cognitive Networked Sensing and Big Data. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-4544-9_12

DOI: https://doi.org/10.1007/978-1-4614-4544-9_12
Published: 27 June 2013
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-4543-2
Online ISBN: 978-1-4614-4544-9
eBook Packages: Engineering, Engineering (R0)
