Abstract
In the problem of maximizing regularized two-stage submodular functions in streams, we assemble a family \({\cal F}\) of m functions each of which is submodular and is visited in a streaming style that an element is visited for only once. The aim is to choose a subset S of size at most ℓ from the element stream \({\cal V}\), so as to maximize the average maximum value of these functions restricted on S with a regularized modular term. The problem can be formally casted as \({\max _{S \subseteq V,\left| S \right| \leqslant \ell }}{1 \over m}\sum\nolimits_{i = 1}^m {{{\max }_{T \subseteq S,\left| T \right| \leqslant k}}\left[ {{f_i}\left( T \right) - c\left( T \right)} \right]} \), where \(c:{\cal V} \to {\mathbb{R}_ + }\) is a non-negative modular function and \({f_i}:{2^{\cal V}} \to {\mathbb{R}_ + },\forall i \in \left\{ {1, \ldots ,m} \right\}\) is a non-negative monotone non-decreasing submodular function. The well-studied regularized problem of \({\max _{S \subseteq {\cal V},\left| S \right| \leqslant k}}f(S) - c(S)\) is exactly a special case of the above regularized two-stage submodular maximization by setting m = 1 and ℓ = k. Although f(·) − c(·) is submodular, it is potentially negative and non-monotone and admits no constant multiplicative factor approximation. Therefore, we adopt a slightly weaker notion of approximation which constructs S such that f(S) − c(S) ⩾ ρ · f(O) − c(O) holds against optimum solution O for some ρ ∈ (0, 1). Eventually, we devise a streaming algorithm by employing the distorted threshold technique, achieving a weaker approximation ratio with ρ = 0.2996 for the discussed regularized two-stage model.
Similar content being viewed by others
References
Kempe D, Kleinberg J, Tardos É. Maximizing the spread of influence through a social network. In: Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington DC, 2003. 137–146
Lin H, Bilmes J. A class of submodular functions for document summarization. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1, Portland, 2011. 510–520
Krause A, McMahan H B, Guestrin C, et al. Robust submodular observation selection. J Machine Learn Res, 2008, 9: 2761–2801
Balkanski E, Mirzasoleiman B, Krause A, et al. Learning sparse combinatorial representations via two-stage submodular maximization. In: Proceedings of the 33rd International Conference on International Conference on Machine Learning, New York City, 2016. 2207–2216
Badanidiyuru A, Mirzasoleiman B, Karbasi A, et al. Streaming submodular maximization: massive data summarization on the fly. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York City, 2014. 671–680
Nemhauser G L, Wolsey L A, Fisher M L. An analysis of approximations for maximizing submodular set functions-I. Math Programming, 1978, 14: 265–294
Calinescu G, Chekuri C, Pál M, et al. Maximizing a monotone submodular function subject to a matroid constraint. SIAM J Comput, 2011, 40: 1740–1766
Sviridenko M. A note on maximizing a submodular set function subject to a knapsack constraint. Operations Res Lett, 2004, 32: 41–43
Buchbinder N, Feldman M, Schwartz R. Online submodular maximization with preemption. In: Proceedings of the 26th Annual ACM-SIAM Symposium on Discrete Algorithms, San Diego, 2015. 1202–1216
Norouzi-Fard A, Tarnawski J, Mitrović S, et al. Beyond 1/2-approximation for submodular maximization on massive data streams. In: Proceedings of the 35th International Conference on Machine Learning, Stockholm, 2018. 3826–3835
Kazemi E, Mitrovic M, Zadimoghaddam M, et al. Submodular streaming in all its glory: tight approximation, minimum memory and low adaptive complexity. In: Proceedings of International Conference on Machine Learning, Long Beach, 2019. 3311–3320
Chekuri C, Gupta S, Quanrud K. Streaming algorithms for submodular function maximization. In: Proceedings of International Colloquium on Automata, Languages and Programming, Kyoto, 2015. 318–330
Huang C C, Kakimura N. Improved streaming algorithms for maximizing monotone submodular functions under a knapsack constraint. In: Proceedings of the 16th International Symposium Workshop on Algorithms and Data Structures, Edmonton, 2019. 438–451
Huang C C, Kakimura N, Yoshida Y. Streaming algorithms for maximizing monotone submodular functions under a knapsack constraint. Algorithmica, 2020, 82: 1006–1032
Kumar R, Moseley B, Vassilvitskii S, et al. Fast greedy algorithms in MapReduce and streaming. ACM Trans Parallel Comput, 2015, 2: 1–22
Mirzasoleiman B, Jegelka S, Krause A. Streaming non-monotone submodular maximization: personalized video summarization on the fly. In: Proceedings of the 32nd AAAI Conference on Artificial Intelligence, New Orleans, 2018. 1379–1386
Feldman M. Guess free maximization of submodular and linear sums. In: Proceedings of Workshop on Algorithms and Data Structures, Edmonton, 2019. 380–394
Sviridenko M, Vondrák J, Ward J. Optimal approximation for submodular and supermodular optimization with bounded curvature. Math OR, 2017, 42: 1197–1218
Harshaw C, Feldman M, Ward J, et al. Submodular maximization beyond non-negativity: guarantees, fast algorithms, and applications. In: Proceedings of the 36th International Conference on Machine Learning, Long Beach, 2019. 2634–2643
Kazemi E, Minaee S, Feldman M, et al. Regularized submodular maximization at scale. In: Proceedings of International Conference on Machine Learning, 2021. 5356–5366
Stan S, Zadimoghaddam M, Krause A, et al. Probabilistic submodular maximization in sub-linear time. In: Proceedings of International Conference on Machine Learning, Sydney, 2017. 3241–3250
Mitrovic M, Kazemi E, Zadimoghaddam M, et al. Data summarization at scale: a two-stage submodular approach. In: Proceedings of the 35th International Conference on Machine Learning, Stockholm, 2018. 3593–3602
Gong S, Nong Q, Liu W, et al. Parametric monotone function maximization with matroid constraints. J Glob Optim, 2019, 75: 833–849
Yang R, Xu D, Guo L, et al. Parametric streaming two-stage submodular maximization. In: Proceedings of the 16th Annual Conference on Theory and Applications of Models of Computation (TAMC), Changsha, 2020. 193–204
Acknowledgements
Ruiqi YANG was supported by National Natural Science Foundation of China (Grant No. 12101587), China Postdoctoral Science Foundation (Grant No. 2021M703167), and Fundamental Research Funds for the Central Universities (Grant No. EIE40108X2). Dachuan XU was supported by National Natural Science Foundation of China (Grant No. 12131003) and Beijing Natural Science Foundation Project (Grant No. Z200002). Longkun GUO was supported by National Natural Science Foundation of China (Grant No. 61772005) and Outstanding Youth Innovation Team Project for Universities of Shandong Province (Grant No. 2020KJN008). Dongmei ZHANG was supported by National Natural Science Foundation of China (Grant No. 11871081).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Yang, R., Xu, D., Guo, L. et al. Regularized two-stage submodular maximization under streaming. Sci. China Inf. Sci. 65, 140602 (2022). https://doi.org/10.1007/s11432-020-3420-9
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11432-020-3420-9