Parallelized Preconditioned Model Building Algorithm for Matrix Factorization

Kaya, Kamer; İlker Birbil, Ş.; Kaan Öztürk, M.; Gohari, Amir

doi:10.1007/978-3-319-72926-8_31

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 10710)

Included in the following conference series:

International Workshop on Machine Learning, Optimization, and Big Data

Abstract

Matrix factorization is a common task underlying several machine learning applications, such as recommender systems, topic modeling, and compressed sensing. Given a large and possibly sparse matrix A, we seek two smaller matrices W and H such that their product is as close to A as possible. The objective is to minimize the sum of squared errors in the approximation. Such problems typically involve hundreds of thousands of unknowns, so an optimizer must be exceptionally efficient. In this study, a new algorithm, Preconditioned Model Building (PMB), is adapted to factorize matrices composed of movie ratings in the MovieLens data sets with 1, 10, and 20 million entries. We present experiments that compare the sequential MATLAB implementation of the PMB algorithm with other algorithms in the minFunc package. We also employ a lock-free sparse matrix factorization algorithm and provide a scalable shared-memory parallel implementation. We show that (a) the optimization performance of the PMB algorithm is comparable to the best algorithms in common use, and (b) the computational performance can be significantly increased with parallelization.
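
To make the objective in the abstract concrete, the following is a minimal Python sketch of the sum of squared errors over the observed entries of a sparse A, together with its gradients with respect to W and H. It is an illustration only and is not taken from the authors' MATLAB or parallel implementations. The function name, the coordinate-list input layout (rows, cols, vals), and the use of NumPy are assumptions made for this example, and the PMB algorithm itself is not shown.

```python
import numpy as np


def mf_objective_and_gradient(rows, cols, vals, W, H):
    """Squared-error loss over observed entries of A, and its gradients.

    Hypothetical helper for illustration (not from the paper).
    rows, cols, vals: 1-D arrays listing the observed entries A[rows[e], cols[e]] = vals[e].
    W: (m, k) array and H: (k, n) array, so that A is approximated by W @ H.
    """
    W_e = W[rows, :]        # (e, k): row of W for each observed entry
    H_e = H[:, cols].T      # (e, k): column of H for each observed entry

    # Predicted value and residual for each observed entry.
    pred = np.sum(W_e * H_e, axis=1)
    resid = pred - vals
    loss = float(np.sum(resid ** 2))

    # Each observed entry (i, j) contributes 2 * resid * H[:, j] to the
    # gradient of row i of W, and 2 * resid * W[i, :] to column j of H.
    # np.add.at accumulates correctly when an index appears more than once.
    grad_W = np.zeros_like(W)
    grad_Ht = np.zeros((H.shape[1], H.shape[0]), dtype=H.dtype)
    np.add.at(grad_W, rows, 2.0 * resid[:, None] * H_e)
    np.add.at(grad_Ht, cols, 2.0 * resid[:, None] * W_e)
    return loss, grad_W, grad_Ht.T
```

A function of this shape, returning the objective value together with its gradient, is the kind of input that general-purpose unconstrained optimizers typically consume. Only the observed entries are touched, so the cost per evaluation scales with the number of ratings rather than with the full size of A.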

Notes

1. https://github.com/sibirbil/PMBSolve
2. The PMB results in this section are obtained with the MATLAB implementation, which is not parallelized; they therefore differ from the results given in Sect. 4.
3. We repeated this experiment by performing eight iterations at once, but no further improvement was observed.

References

Berry, M.W., Browne, M., Langville, A.N., Pauca, V.P., Plemmons, R.J.: Algorithms and applications for approximate nonnegative matrix factorization. Comput. Stat. Data Anal. 52(1), 155–173 (2007)
Bertsekas, D.P.: Nonlinear Programming. Athena Scientific, Belmont (1999)
Duff, I.S., Erisman, A.M., Reid, J.K.: Direct Methods for Sparse Matrices. Oxford University Press Inc., New York (1986)
Harper, F.M., Konstan, J.A.: The MovieLens datasets: history and context. ACM Trans. Interact. Intell. Syst. 5(4), 19:1–19:19 (2015)
Hernando, A., Bobadilla, J., Ortega, F.: A non negative matrix factorization for collaborative filtering recommender systems based on a Bayesian probabilistic model. Knowl.-Based Syst. 97, 188–202 (2016)
Koren, Y., Bell, R., Volinsky, C.: Matrix factorization techniques for recommender systems. Computer 42(8), 30–37 (2009)
Lee, D.D., Seung, H.S.: Learning the parts of objects by non-negative matrix factorization. Nature 401(6755), 788–791 (1999)
Liu, D.C., Nocedal, J.: On the limited-memory BFGS method for large scale optimization. Math. Program. 45, 503–528 (1989)
Öztoprak, F.: Parallel Algorithms for Nonlinear Optimization. Ph.D. thesis. Sabancı University (2011)
Öztoprak, F., Birbil, S.I.: An alternative globalization strategy for unconstrained optimization. arXiv preprint, arXiv:1705.05158 (2017) (To appear in Optimization)
Schmidt, M.: minFunc: unconstrained differentiable multivariate optimization in MATLAB. http://www.cs.ubc.ca/~schmidtm/Software/minFunc.html. Accessed 22 March 2017

Author information

Authors and Affiliations

Faculty of Engineering and Natural Sciences, Sabancı University, Istanbul, Turkey
Kamer Kaya, Ş. İlker Birbil, M. Kaan Öztürk & Amir Gohari

Corresponding author

Correspondence to Kamer Kaya.

Editor information

Editors and Affiliations

University of Catania, Catania, Italy
Giuseppe Nicosia
University of Florida, Gainesville, FL, USA
Panos Pardalos
University of Catania, Catania, Italy
Giovanni Giuffrida
Harvard University, Cambridge, MA, USA
Renato Umeton

About this paper

Cite this paper

Kaya, K., İlker Birbil, Ş., Kaan Öztürk, M., Gohari, A. (2018). Parallelized Preconditioned Model Building Algorithm for Matrix Factorization. In: Nicosia, G., Pardalos, P., Giuffrida, G., Umeton, R. (eds) Machine Learning, Optimization, and Big Data. MOD 2017. Lecture Notes in Computer Science, vol 10710. Springer, Cham. https://doi.org/10.1007/978-3-319-72926-8_31

DOI: https://doi.org/10.1007/978-3-319-72926-8_31
Published: 21 December 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-72925-1
Online ISBN: 978-3-319-72926-8
eBook Packages: Computer Science, Computer Science (R0)
