Low-Rank Matrix Factorization and Co-clustering Algorithms for Analyzing Large Data Sets

Donavalli, Archana; Rege, Manjeet; Liu, Xumin; Jafari-Khouzani, Kourosh

doi:10.1007/978-3-642-27872-3_41

Archana Donavalli¹⁸,
Manjeet Rege¹⁸,
Xumin Liu¹⁸ &
…
Kourosh Jafari-Khouzani¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 6411))

Included in the following conference series:

International Conference on Data Engineering and Management

1462 Accesses
2 Citations

Abstract

With the ever increasing data, there is a greater need for analyzing and extracting useful and meaningful information out of it. The amount of research being conducted in extracting this information is commendable. From clustering to bi and multi clustering, there are a lot of different algorithms proposed to analyze and discover the hidden patterns in data, in every which way possible. On the other hand, the size of the data sets is increasing with each passing day and hence it is becoming increasingly difficult to try and analyze all this data and find clusters in them without the algorithms being computationally prohibitive. In this study, we have tried to study both the domains and understand the development of the algorithms and how they are being used. We have compared the different algorithms to try and get a better idea of which algorithm is more suited for a particular situation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Berry, M.W., Stewart, G.W., Pulatova, S.A.: Algorithm 844: Computing sparse reduced-rank approximations to sparse matrices. ACM Transactions on Mathermatical Software 31 (2005)
Google Scholar
Dhillon, I.S.: Co-clustering documents and words using bipartite spectral graph partitioning. In: 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), San Francisco (2001)
Google Scholar
Drineas, P., Kannan, R., Mahoney, M.W.: Fast monte carlo algorithms for matrices iii: Computing a compressed approximate matrix decomposition. Society for Industrial and Applied Mathematics (SIAM) 36, 184–206 (2006)
MATH Google Scholar
Gu, Q., Zhou, J.: Co-clustering of manifolds. In: 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (ICDM), Paris (2009)
Google Scholar
Long, B., Zhang, Z., Yu, P.S.: Co-clustering by block value decomposition. In: 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), Chicago (2005)
Google Scholar
Pan, F., Zhang, X., Wang, W.: Crd: Fast co-clustering of large datasets utilizing sampling-based matrix decomposition. In: ACM SIGMOD/PODS Conference, Vancouver (2008)
Google Scholar
Rege, M., Dong, M., Fotouhi, F.: Co-clustering documents and words using bipartite isoperimetric graph partitioning. In: 6th IEEE International Conference on Data Mining (ICDM), Hong Kong (2006)
Google Scholar
Sun, J., Xie, Y., Zhang, H., Faloutsos, C.: Less is more: Complex matrix decomposition for large sparse graphs. In: 7th SIAM International Conference on Data Mining (ICDM), Minneapolis (2007)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Rochester Institute of Technology, Rochester, NY, USA
Archana Donavalli, Manjeet Rege & Xumin Liu
Department of Radiology Research, Henry Ford Hospital, Detroit, Michigan, USA
Kourosh Jafari-Khouzani

Authors

Archana Donavalli
View author publications
You can also search for this author in PubMed Google Scholar
Manjeet Rege
View author publications
You can also search for this author in PubMed Google Scholar
Xumin Liu
View author publications
You can also search for this author in PubMed Google Scholar
Kourosh Jafari-Khouzani
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, Bishop Heber College(Autonomous), 620017, Tiruchirappalli, India
Rajkumar Kannan
National Institute of Informatics (NII), 101-8430, Tokyo, Japan
Frederic Andres

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Donavalli, A., Rege, M., Liu, X., Jafari-Khouzani, K. (2012). Low-Rank Matrix Factorization and Co-clustering Algorithms for Analyzing Large Data Sets. In: Kannan, R., Andres, F. (eds) Data Engineering and Management. ICDEM 2010. Lecture Notes in Computer Science, vol 6411. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27872-3_41

Download citation

DOI: https://doi.org/10.1007/978-3-642-27872-3_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-27871-6
Online ISBN: 978-3-642-27872-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics