Abstract
In this paper, we consider the problem of maximizing a monotone submodular function subject to a knapsack constraint in a streaming setting. In such a setting, elements arrive sequentially and at any point in time, and the algorithm can store only a small fraction of the elements that have arrived so far. For the special case that all elements have unit sizes (i.e., the cardinality-constraint case), one can find a \((0.5-\varepsilon )\)-approximate solution in \(O(K\varepsilon ^{-1})\) space, where K is the knapsack capacity (Badanidiyuru et al. KDD 2014). The approximation ratio is recently shown to be optimal (Feldman et al. STOC 2020). In this work, we propose a \((0.4-\varepsilon )\)-approximation algorithm for the knapsack-constrained problem, using space that is a polynomial of K and \(\varepsilon \). This improves on the previous best ratio of \(0.363-\varepsilon \) with space of the same order. Our algorithm is based on a careful combination of various ideas to transform multiple-pass streaming algorithms into a single-pass one.
Similar content being viewed by others
Notes
The approximation ratio is recently improved to 0.405 [40].
References
Alon, N., Gamzu, I., Tennenholtz, M.: Optimizing budget allocation among channels and influencers. In: Proceedings of the 21st International Conference on World Wide Web (WWW), pp. 381–388 (2012)
Badanidiyuru, A., Mirzasoleiman, B., Karbasi, A., Krause, A.: Streaming submodular maximization: massive data summarization on the fly. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pp. 671–680 (2014)
Badanidiyuru, A., Vondrák, J.: Fast algorithms for maximizing submodular functions. In: Proceedings of the 25th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pp. 1497–1514 (2013)
Balkanski, E., Rubinstein, A., Singer, Y.: An exponential speedup in parallel running time for submodular maximization without loss in approximation. In: Proceedings of the 30th Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2019, San Diego, California, USA, January 6-9, 2019, pp. 283–302 (2019)
Balkanski, E., Singer, Y.: The adaptive complexity of maximizing a submodular function. In: Proceedings of the 50th Annual ACM Symposium on Theory of Computing, STOC 2018, pp. 1138–1151. ACM, New York, NY, USA (2018)
Barbosa, R., Ene, A., Le Nguyen, H., Ward, J.: The power of randomization: Distributed submodular maximization on massive datasets. In: Proceedings of the 32nd International Conference on International Conference on Machine Learning - Volume 37, ICML’15, pp. 1236–1244. JMLR.org (2015). http://dl.acm.org/citation.cfm?id=3045118.3045250
Barbosa, R.D.P., Ene, A., Nguyen, H.L., Ward, J.: A new framework for distributed submodular maximization. In: 2016 IEEE 57th Annual Symposium on Foundations of Computer Science (FOCS), pp. 645–654 (2016)
Bateni, M., Esfandiari, H., Mirrokni, V.: Almost optimal streaming algorithms for coverage problems. In: Proceedings of the 29th ACM Symposium on Parallelism in Algorithms and Architectures, SPAA ’17, pp. 13–23. ACM, New York, NY, USA (2017)
Calinescu, G., Chekuri, C., Pál, M., Vondrák, J.: Maximizing a monotone submodular function subject to a matroid constraint. SIAM J. Comput. 40(6), 1740–1766 (2011)
Chakrabarti, A., Kale, S.: Submodular maximization meets streaming: matchings, matroids, and more. Math. Program. 154(1–2), 225–247 (2015)
Chan, T.H.H., Huang, Z., Jiang, S.H.C., Kang, N., Tang, Z.G.: Online submodular maximization with free disposal: Randomization beats for partition matroids online. In: Proceedings of the 28th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pp. 1204–1223 (2017)
Chan, T.H.H., Jiang, S.H.C., Tang, Z.G., Wu, X.: Online submodular maximization problem with vector packing constraint. In: Annual European Symposium on Algorithms (ESA), pp. 24:1–24:14 (2017)
Chekuri, C., Gupta, S., Quanrud, K.: Streaming algorithms for submodular function maximization. In: Proceedings of the 42nd International Colloquium on Automata, Languages, and Programming (ICALP), vol. 9134, pp. 318–330 (2015)
Chekuri, C., Quanrud, K.: Submodular function maximization in parallel via the multilinear relaxation. In: Proceedings of the 30th Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2019, San Diego, California, USA, January 6-9, 2019, pp. 303–322 (2019)
Chekuri, C., Vondrák, J., Zenklusen, R.: Submodular function maximization via the multilinear relaxation and contention resolution schemes. SIAM J. Comput. 43(6), 1831–1879 (2014)
Ene, A., Nguy\(\tilde{\hat{{\rm e}}}\)n, H.L.: A nearly-linear time algorithm for submodular maximization with a knapsack constraint. In: The 46th International Colloquium on Automata, Languages and Programming (ICALP 2019), to appear (2019)
Ene, A., Nguy\(\tilde{\hat{{\rm e}}}\)n, H.L.: Submodular maximization with nearly-optimal approximation and adaptivity in nearly-linear time. In: Proceedings of the 30th Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2019, San Diego, California, USA, January 6-9, 2019, pp. 274–282 (2019)
Feldman, M., Karbasi, A., Kazemi, E.: Do less, get more: Streaming submodular maximization with subsampling. In: Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, NeurIPS 2018, 3-8 December 2018, Montréal, Canada., pp. 730–740 (2018). http://papers.nips.cc/paper/7353-do-less-get-more-streaming-submodular-maximization-with-subsampling
Feldman, M., Norouzi-Fard, A., Svensson, O., Zenklusen, R.: The one-way communication complexity of submodular maximization with applications to streaming and robustness. In: Makarychev K., Makarychev Y., Tulsiani M., Kamath G., Chuzhoy J. (eds.) Proccedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing, STOC 2020, Chicago, IL, USA, June 22-26, 2020, pp. 1363–1374. ACM (2020)
Filmus, Y., Ward, J.: A tight combinatorial algorithm for submodular maximization subject to a matroid constraint. SIAM J. Comput. 43(2), 514–542 (2014)
Fisher, M.L., Nemhauser, G.L., Wolsey, L.A.: An analysis of approximations for maximizing submodular set functions i. Math. Program. 14, 265–294 (1978)
Fisher, M.L., Nemhauser, G.L., Wolsey, L.A.: An analysis of approximations for maximizing submodular set functions ii. Math. Program. Study 8, 73–87 (1978)
Huang, C.C., Kakimura, N.: Multi-pass streaming algorithms for monotone submodular function maximization (2018). arXiv:1802.06212
Huang, C.C., Kakimura, N., Yoshida, Y.: Streaming algorithms for maximizing monotone submodular functions under a knapsack constraint. In: The 20th International Workshop on Approximation Algorithms for Combinatorial Optimization Problems(APPROX2017) (2017)
Kazemi, E., Mitrovic, M., Zadimoghaddam, M., Lattanzi, S., Karbasi, A.: Submodular streaming in all its glory: Tight approximation, minimum memory and low adaptive complexity. In: International Conference on Machine Learning (ICML2019), pp. 3311–3320 (2019)
Kempe, D., Kleinberg, J., Tardos, É.: Maximizing the spread of influence through a social network. In: Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pp. 137–146 (2003)
Krause, A., Singh, A.P., Guestrin, C.: Near-optimal sensor placements in gaussian processes: Theory, efficient algorithms and empirical studies. J. Mach. Learn. Res. 9, 235–284 (2008)
Kulik, A., Shachnai, H., Tamir, T.: Maximizing submodular set functions subject to multiple linear constraints. In: Proceedings of the 20th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pp. 545–554 (2013)
Kumar, R., Moseley, B., Vassilvitskii, S., Vattani, A.: Fast greedy algorithms in mapreduce and streaming. ACM Trans. Parallel Comput. 2(3), 14:1–14:22 (2015)
Lee, J.: Maximum Entropy Sampling. Encyclopedia of Environmetrics, vol. 3, pp. 1229–1234. John Wiley & Sons, Ltd., New Jersey (2006)
Lee, J., Sviridenko, M., Vondrák, J.: Submodular maximization over multiple matroids via generalized exchange properties. Math. Oper. Res. 35(4), 795–806 (2010)
Lin, H., Bilmes, J.: Multi-document summarization via budgeted maximization of submodular functions. In: Proceedings of the 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), pp. 912–920 (2010)
Lin, H., Bilmes, J.: A class of submodular functions for document summarization. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL-HLT), pp. 510–520 (2011)
McGregor, A.: Graph stream algorithms: A survey. SIGMOD Rec. 43(1), 9–20 (2014)
McGregor, A., Vu, H.T.: Better streaming algorithms for the maximum coverage problem. In: International Conference on Database Theory (ICDT) (2017)
Mirzasoleiman, B., Jegelka, S., Krause, A.: Streaming non-monotone submodular maximization: Personalized video summarization on the fly. In: Proceedings of the International Conference on Artificial Intelligence (AAAI) (2018)
Nutov, Z., Shoham, E.: Practical budgeted submodular maximization (2020). arXiv:2007.04937
Soma, T., Kakimura, N., Inaba, K., Kawarabayashi, K.: Optimal budget allocation: Theoretical guarantee and efficient algorithm. In: Proceedings of the 31st International Conference on Machine Learning (ICML), pp. 351–359 (2014)
Sviridenko, M.: A note on maximizing a submodular set function subject to a knapsack constraint. Oper. Res. Lett. 32(1), 41–43 (2004)
Tang, J., Tang, X., Lim, A., Han, K., Li, C., Yuan, J.: Revisiting modified greedy algorithm for monotone submodular maximization with a knapsack constraint (2020). arXiv:2008.05391
Wolsey, L.: Maximising real-valued submodular functions: primal and dual heuristics for location problems. Math. Oper. Res. 7, 410–425 (1982)
Yoshida, Y.: Maximizing a monotone submodular function with a bounded curvature under a knapsack constraint (2016). https://epubs.siam.org/doi/10.1137/16M1107644
Yu, Q., Xu, E.L., Cui, S.: Streaming algorithms for news and scientific literature recommendation: Submodular maximization with a \(d\)-knapsack constraint. IEEE Global Conference on Signal and Information Processing (2016)
Acknowledgements
The authors thank the referees for their valuable comments on this manuscript.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
A preliminary version appears in The Algorithms and Data Structures Symposium (WADS) 2019. The first author is supported by ANR-19-CE48-0016 and ANR-18-CE40-0025-01 from the French National Research Agency (ANR). The second author is supported by JSPS KAKENHI Grant Numbers JP17K00028 and JP18H05291.
Rights and permissions
About this article
Cite this article
Huang, CC., Kakimura, N. Improved Streaming Algorithms for Maximizing Monotone Submodular Functions under a Knapsack Constraint. Algorithmica 83, 879–902 (2021). https://doi.org/10.1007/s00453-020-00786-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00453-020-00786-4