Versatile Sparse Matrix Factorization and Its Applications in High-Dimensional Biological Data Analysis

  • Yifeng Li
  • Alioune Ngom
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7986)


Non-negative matrix factorization and sparse representation models have been successfully applied in high-throughput biological data analysis. In this paper, we propose our versatile sparse matrix factorization (VSMF) model for biological data mining. We show that many well-known sparse models are specific cases of VSMF. Through tuning parameters, sparsity, smoothness, and non-negativity can be easily controlled in VSMF. Our computational experiments corroborate the advantages of VSMF.


versatile sparse matrix factorization non-negative matrix factorization sparse representation feature extraction feature selection biological processes identification 


  1. 1.
    Alon, U.: Broad patterns of gene expression revealed by clustering of tumor and normal colon tissues probed by oligonucleotide arrays. PNAS 96(12), 6745–6750 (1999)CrossRefGoogle Scholar
  2. 2.
    Bertsekas, D.P.: Nonlinear Programming. Athena Scientific, 2nd edn., Belmont, MA (2008)Google Scholar
  3. 3.
    Brunet, J., Tamayo, P., Golub, T., Mesirov, J.: Metagenes and molecular pattern discovery using matrix factorization. PNAS 101(12), 4164–4169 (2004)CrossRefGoogle Scholar
  4. 4.
    Carmona-Saez, P., Pascual-Marqui, R.D., Tirado, F., Carazo, J.M., Pascual-Montano, A.: Biclustering of gene expression data by non-smooth non-negative matrix factorization. BMC Bioinformatics 7, 78 (2006)CrossRefGoogle Scholar
  5. 5.
    Ding, C., Li, T., Jordan, M.I.: Convex and semi-nonnegative matrix factorizations. TPAMI 32(1), 45–55 (2010)CrossRefGoogle Scholar
  6. 6.
    Elad, M.: Sparse and Redundant Representations: From Theory to Applications in Signal and Image Processing. Springer, New York (2010)zbMATHCrossRefGoogle Scholar
  7. 7.
    Hu, Z.: The molecular portraits of breast tumors are conserved across microarray platforms. BMC Genomics 7, 96 (2006)Google Scholar
  8. 8.
    Kim, H., Park, H.: Sparse non-negatice matrix factorization via alternating non-negativity-constrained least aquares for microarray data analysis. SIAM J. Matrix Analysis and Applications 23(12), 1495–1502 (2007)Google Scholar
  9. 9.
    Kim, P., Tidor, B.: Subsystem identification through dimensionality reduction of large-scale gene expression data. Genome Research 13, 1706–1718 (2003)CrossRefGoogle Scholar
  10. 10.
    Lee, D.D., Seung, S.: Learning the parts of objects by non-negative matrix factorization. Nature 401, 788–791 (1999)CrossRefGoogle Scholar
  11. 11.
    Lee, D., Seung, S.: Algorithms for non-negative matrix factorization. In: Advances in Neural Information Processing Systems, pp. 556–562. MIT Press (2001)Google Scholar
  12. 12.
    Li, Y., Ngom, A.: Non-negative matrix and tensor factorization based classification of clinical microarray gene expression data. In: BIBM, pp. 438–443. IEEE Press, Piscataway (2010)Google Scholar
  13. 13.
    Li, Y., Ngom, A.: A new kernel non-negative matrix factorization and its application in microarray data analysis. In: CIBCB, pp. 371–378. IEEE Press, Piscataway (2012)Google Scholar
  14. 14.
    Li, Y., Ngom, A.: The non-negative matrix factorization toolbox for biological data mining. BMC Source Code for Biology and Medicine 8, 10 (2013)CrossRefGoogle Scholar
  15. 15.
    Li, Y., Ngom, A.: Sparse representation approaches for the classification of high-dimensional biological data. BMC Systems Biology (in press, 2013)Google Scholar
  16. 16.
    Ochs, M., Fertig, E.: Matrix factorization for transcriptional regulatory network inference. In: CIBCB, pp. 387–396. IEEE Press, Piscataway (2012)Google Scholar
  17. 17.
    Ochs, M., Rink, L., Tarn, C., Mburu, S., Taguchi, T., Eisenberg, B., Godwin, A.: Detection of treatment-induced changes in signaling pathways in sastrointestinal stromal tumors using transcripttomic data. Cancer Res. 69(23), 9125–9132 (2009)CrossRefGoogle Scholar
  18. 18.
    Zou, H., Hastie, T.: Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society - Series B: Statistical Methodology 67(2), 301–320 (2005)MathSciNetzbMATHCrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Yifeng Li
    • 1
  • Alioune Ngom
    • 1
  1. 1.School of Computer ScienceUniversity of WindsorWindsorCanada

Personalised recommendations