Know When to Persist: Deriving Value from a Stream Buffer

Georgiou, Konstantinos; Karakostas, George; Kranakis, Evangelos; Krizanc, Danny

doi:10.1007/978-3-319-41168-2_9

Konstantinos Georgiou¹⁶,
George Karakostas¹⁷,
Evangelos Kranakis¹⁸ &
…
Danny Krizanc¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9778))

Included in the following conference series:

International Conference on Algorithmic Applications in Management

384 Accesses
1 Citations

Abstract

We consider Persistence, a new online problem concerning optimizing weighted observations in a stream of data when the observer has limited buffer capacity. A stream of weighted items arrive one at a time at the entrance of a buffer with two holding locations. A processor (or observer) can process (observe) an item at the buffer location it chooses, deriving this way the weight of the observed item as profit. The main constraint is that the processor can only move synchronously with the item stream. Persistence is the online problem of scheduling the processor movements through the buffer so that its total derived value is maximized under this constraint. We study the performance of the straight-forward heuristic Threshold, i.e., forcing the processor to “follow” an item through the whole buffer only if its value is above a threshold. We analyze both the optimal offline and Threshold algorithms in the cases where the input stream is either a random permutation, or its items are iid valued. We show that in both cases the competitive ratio achieved by the Threshold algorithm is at least 2/3 when the only statistical knowledge of the items is the median of all possible values. We generalize our results by showing that Threshold, equipped with some minimal statistical advice about the input, achieves competitive ratios in the whole spectrum between 2/3 and 1, following the variation of a newly defined density-like measure of the input. This result is a significant improvement over the case of arbitrary input streams, where we show that no online algorithm can achieve a competitive ratio better than 1/2.

K. Georgiou, G. Karakostas, and E. Kranakis—Research supported in part by NSERC Discovery grant.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Ajtai, M., Jayram, T.S., Kumar, R., Sivakumar, D.: Approximate counting of inversions in a data stream. In: Reif, J.H. (ed.) Proceedings of 34th Annual ACM Symposium on Theory of Computing, Montréal, Québec, Canada, 19–21 May 2002, pp. 370–379. ACM (2002)
Google Scholar
Alon, N., Duffield, N., Lund, C., Thorup, M.: Estimating arbitrary subset sums with few probes. In: ACM (ed.) Proceedings of the Twenty-Fourth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems: PODS 2005, Baltimore, Maryland, 13–15 June 2005, pp. 317–325. ACM Press (2005)
Google Scholar
Alon, N., Matias, Y., Szegedy, M.: The space complexity of approximating the frequency moments. J. Comput. Syst. Sci. 58(1), 137–147 (1999)
Article MathSciNet MATH Google Scholar
Babaioff, M., Immorlica, N., Kleinberg, R.: Matroids, secretary problems, and online mechanisms. In: Bansal, N., Pruhs, K., Stein, C. (eds.) SODA, pp. 434–443. SIAM (2007)
Google Scholar
Bateni, M., Hajiaghayi, M., Zadimoghaddam, M.: Submodular secretary problem and extensions. ACM Trans. Algorithms 9(4), Art. 32, 23 (2013)
Google Scholar
Bearden, J.N.: A new secretary problem with rank-based selection and cardinal payoffs. J. Math. Psychol. 50(1), 58–59 (2006)
Article MathSciNet MATH Google Scholar
Cormode, G., Muthukrishnan, S., Rozenbaum, I.: Summarizing and mining inverse distributions on data streams via dynamic inverse sampling. In: Böhm, K., Jensen, C.S., Haas, L.M., Kersten, M.L., Larson, P.-Å., Ooi, B.C. (eds.) VLDB, pp. 25–36. ACM (2005)
Google Scholar
Devanur, N.R., Jain, K., Sivan, B., Wilkens, C.A.: Near optimal online algorithms and fast approximation algorithms for resource allocation problems. In: Shoham, Y., Chen, Y., Roughgarden, T. (eds.) EC, pp. 29–38. ACM (2011)
Google Scholar
Ferguson, T.S.: Who solved the secretary problem? Stat. Sci. 4, 282–289 (1989)
Article MathSciNet MATH Google Scholar
Freeman, P.R.: The secretary problem and its extensions: a review. Int. Stat. Rev./Revue Internationale de Statistique 51, 189–206 (1983)
MathSciNet MATH Google Scholar
Georgiou, K., Karakostas, G., Kranakis, E., Krizanc, D.: Know when to persist: deriving value from a stream buffer. CoRR, abs/1604.03009 (2016)
Google Scholar
Gianini, J., Samuels, S.M.: The infinite secretary problem. Ann. Probab. 4, 418–432 (1976)
Article MathSciNet MATH Google Scholar
Greenwald, M., Khanna, S.: Space-efficient online computation of quantile summaries. In: Mehrotra, S., Sellis, T.K. (eds.) SIGMOD Conference, pp. 58–66. ACM (2001)
Google Scholar
Hoffman, M., Muthukrishnan, S., Raman, R.: Location streams: Models and algorithms. Technical report, DIMACS TR (2004)
Google Scholar
Kleinberg, R.D.: A multiple-choice secretary algorithm with applications to online auctions. In: SODA, pp. 630–631. SIAM (2005)
Google Scholar
Manku, G.S., Motwani, R.: Approximate frequency counts over data streams. In: Bernstein, P.A., et al. (eds.) VLDB 2002: Proceedings of the Twenty-Eighth International Conference on Very Large Data Bases, Hong Kong SAR, China, 20–23 August 2002, pp. 346–357. Morgan Kaufmann Publishers (2002)
Google Scholar
Muthukrishnan, S.: Data Streams: Algorithms and Applications, vol. 1 (2005)
Google Scholar
Muthukrishnan, S.: Data stream algorithms (notes from a series of lectures). In: The 2009 Barbados Workshop on Computational Complexity, 1–8 March 2009
Google Scholar
Smith, M.: A secretary problem with uncertain employment. J. Appl. Probab. 12, 620–624 (1975)
Article MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Department of Mathematics, Ryerson University, Toronto, ON, Canada
Konstantinos Georgiou
Department of Computing and Software, McMaster University, Hamilton, ON, Canada
George Karakostas
School of Computer Science, Carleton University, Ottawa, ON, Canada
Evangelos Kranakis
Department of Mathematics and Computer Science, Wesleyan University, Middletown, CT, USA
Danny Krizanc

Authors

Konstantinos Georgiou
View author publications
You can also search for this author in PubMed Google Scholar
George Karakostas
View author publications
You can also search for this author in PubMed Google Scholar
Evangelos Kranakis
View author publications
You can also search for this author in PubMed Google Scholar
Danny Krizanc
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Konstantinos Georgiou .

Editor information

Editors and Affiliations

Università degli Studi di Bergamo , Bergamo, Italy
Riccardo Dondi
Université de Nantes , Nantes, France
Guillaume Fertin
Università degli Studi di Milano-Bicocca , Milano, Italy
Giancarlo Mauri

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Georgiou, K., Karakostas, G., Kranakis, E., Krizanc, D. (2016). Know When to Persist: Deriving Value from a Stream Buffer. In: Dondi, R., Fertin, G., Mauri, G. (eds) Algorithmic Aspects in Information and Management. AAIM 2016. Lecture Notes in Computer Science(), vol 9778. Springer, Cham. https://doi.org/10.1007/978-3-319-41168-2_9

Download citation

DOI: https://doi.org/10.1007/978-3-319-41168-2_9
Published: 05 July 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-41167-5
Online ISBN: 978-3-319-41168-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics