Abstract
With the ever increasing data, there is a greater need for analyzing and extracting useful and meaningful information out of it. The amount of research being conducted in extracting this information is commendable. From clustering to bi and multi clustering, there are a lot of different algorithms proposed to analyze and discover the hidden patterns in data, in every which way possible. On the other hand, the size of the data sets is increasing with each passing day and hence it is becoming increasingly difficult to try and analyze all this data and find clusters in them without the algorithms being computationally prohibitive. In this study, we have tried to study both the domains and understand the development of the algorithms and how they are being used. We have compared the different algorithms to try and get a better idea of which algorithm is more suited for a particular situation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Berry, M.W., Stewart, G.W., Pulatova, S.A.: Algorithm 844: Computing sparse reduced-rank approximations to sparse matrices. ACM Transactions on Mathermatical Software 31 (2005)
Dhillon, I.S.: Co-clustering documents and words using bipartite spectral graph partitioning. In: 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), San Francisco (2001)
Drineas, P., Kannan, R., Mahoney, M.W.: Fast monte carlo algorithms for matrices iii: Computing a compressed approximate matrix decomposition. Society for Industrial and Applied Mathematics (SIAM) 36, 184–206 (2006)
Gu, Q., Zhou, J.: Co-clustering of manifolds. In: 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (ICDM), Paris (2009)
Long, B., Zhang, Z., Yu, P.S.: Co-clustering by block value decomposition. In: 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), Chicago (2005)
Pan, F., Zhang, X., Wang, W.: Crd: Fast co-clustering of large datasets utilizing sampling-based matrix decomposition. In: ACM SIGMOD/PODS Conference, Vancouver (2008)
Rege, M., Dong, M., Fotouhi, F.: Co-clustering documents and words using bipartite isoperimetric graph partitioning. In: 6th IEEE International Conference on Data Mining (ICDM), Hong Kong (2006)
Sun, J., Xie, Y., Zhang, H., Faloutsos, C.: Less is more: Complex matrix decomposition for large sparse graphs. In: 7th SIAM International Conference on Data Mining (ICDM), Minneapolis (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Donavalli, A., Rege, M., Liu, X., Jafari-Khouzani, K. (2012). Low-Rank Matrix Factorization and Co-clustering Algorithms for Analyzing Large Data Sets. In: Kannan, R., Andres, F. (eds) Data Engineering and Management. ICDEM 2010. Lecture Notes in Computer Science, vol 6411. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27872-3_41
Download citation
DOI: https://doi.org/10.1007/978-3-642-27872-3_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-27871-6
Online ISBN: 978-3-642-27872-3
eBook Packages: Computer ScienceComputer Science (R0)