Comparison of Non-negative Matrix Factorization Methods for Clustering Genomic Data
Non-negative matrix factorization (NMF) is a dimensionality reduction method that has been widely used in fields such as pattern recognition and data mining. Compared with traditional methods, its non-negativity constraint yields additive, parts-based representations that are easier to interpret. Many improved NMF variants have been proposed in recent years, each with merits and drawbacks that depend on the application. Clustering is a common way to evaluate the properties of these methods. However, no dedicated comparison of NMF-based clustering on genomic data has been reported. In this paper, we analyze the characteristics of basic NMF and its classical variants. We then present clustering results obtained from the coefficient matrices that these methods produce on genomic datasets, and we compare the methods in terms of clustering accuracy and running time.
Keywords: Non-negative matrix factorization; Clustering; Genomic data; Dimensionality reduction
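To make the idea of clustering from the coefficient matrix concrete, the following is a minimal sketch and not the implementation evaluated in the paper. It assumes a features-by-samples data matrix `X`, uses the multiplicative update rules of Lee and Seung for the Frobenius-norm objective, and assigns each sample to the component with the largest coefficient. The function names, the toy data, and the argmax assignment rule are illustrative assumptions; the paper may instead cluster the coefficient matrix with another method such as k-means.

```python
import numpy as np

def nmf_multiplicative(X, k, n_iter=200, eps=1e-10, seed=0):
    """Basic NMF, X ~ W @ H, via Lee-Seung multiplicative updates.

    X is (features x samples), W is (features x k), H is (k x samples).
    Hypothetical helper for illustration only.
    """
    rng = np.random.default_rng(seed)
    m, n = X.shape
    # Strictly positive random initialization keeps the updates well defined.
    W = rng.random((m, k)) + eps
    H = rng.random((k, n)) + eps
    for _ in range(n_iter):
        # eps in the denominators guards against division by zero.
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ H @ H.T + eps)
    return W, H

def cluster_from_coefficients(H):
    """Assign each sample (column of H) to its dominant component."""
    return np.argmax(H, axis=0)

# Toy stand-in for expression data (assumed, not from the paper):
# 40 features, 20 samples, two sample groups with distinct active features.
rng = np.random.default_rng(1)
X = rng.random((40, 20)) * 0.05
X[:20, :10] += 1.0
X[20:, 10:] += 1.0

W, H = nmf_multiplicative(X, k=2)
labels = cluster_from_coefficients(H)
rel_error = np.linalg.norm(X - W @ H) / np.linalg.norm(X)
```

The argmax rule is the simplest way to read clusters off the coefficient matrix; it works when each component corresponds to one cluster, which is the usual motivation for setting `k` to the expected number of clusters.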
This work was supported in part by the NSFC under grant Nos. 61572284, 61502272, 61572283 and 61272339; the Shandong Provincial Natural Science Foundation, under grant Nos. BS2014DX004 and BS2014DX005; Shenzhen Municipal Science and Technology Innovation Council (No. JCYJ20140417172417174); the Project of Shandong Province Higher Educational Science and Technology Program (No. J13LN31).