Adaptive Algorithm of Tracking the Best Experts Trajectory

V’yugin, V. V.; Stel’makh, I. A.; Trunov, V. G.

doi:10.1134/S1064226917120117

Adaptive Algorithm of Tracking the Best Experts Trajectory

Mathematical Models and Computational Methods
Published: 27 February 2018

Volume 62, pages 1434–1447, (2017)
Cite this article

Journal of Communications Technology and Electronics Aims and scope Submit manuscript

V. V. V’yugin¹,
I. A. Stel’makh¹ &
V. G. Trunov¹

69 Accesses
2 Citations
Explore all metrics

Abstract

The problem of decision theoretic online learning is discussed. There is the set of methods, experts, and algorithms capable of making solutions (or predictions) and suffering losses due to the inaccuracy of their solutions. An adaptive algorithm whereby expert solutions are aggregated and sustained losses not exceeding (to a certain quantity called a regret) those of the best combination of experts distributed over the prediction interval is proposed. The algorithm is constructed using the Fixed-Share method combined with the Ada-Hedge algorithm used to exponentially weight expert solutions. The regret of the proposed algorithm is estimated. In the context of the given approach, there are no any stochastic assumptions about an initial data source and the boundedness of losses. The results of numerical experiments concerning the mixing of expert solutions with the help of the proposed algorithm are presented. The strategies of games on financial markets, which were suggested in our previous papers, play the role of expert strategies.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

A survey of Bayesian Network structure learning

Article Open access 17 January 2023

Data-driven distributionally robust optimization using the Wasserstein metric: performance guarantees and tractable reformulations

Article Open access 07 July 2017

References

V. V. V’yugin and V. G. Trunov, “Applications of combined financial strategies based on universal adaptive forecasting,” Autom. Remote Control 77, 1428–1446 (2016).
Article MathSciNet MATH Google Scholar
V. V. V’yugin and I. A. Stel’makh, “Tracking the best expert trajectory with the help of the AdaHedge algorithm,” Information Technologies and Systems 2016, 40th Interdisciplinary Conference and School, September 25–30, Repino, St. Petersburg, Russia (http://itas2016.iitp.ru/pdf/1570282312.pdf).
O. Bousquet and M. K. Warmuth, “Tracking a small set of experts by mixing past posteriors,” J. Machine Learning Res. 3, 31–47 (2003).
MathSciNet MATH Google Scholar
N. Cesa-Bianchi and G. Lugosi, Prediction, Learning, and Games (Cambridge Univ. Press, Cambridge, 2006).
Book MATH Google Scholar
Y. Freund and R. E. Schapire, “A decision-theoretic generalization of on-line learning and an application to boosting,” J. Comput. Syst. Sci. 55, 119–139 (1997).
Article MathSciNet MATH Google Scholar
I. Herbster and M. Warmuth, “Tracking the best expert,” Machine Learning 32 (2), 151–178 (1998).
Article MATH Google Scholar
N. Littlestone and M. Warmuth, “The weighted majority algorithm,” Inf. Comput. 108, 212–261 (1994).
Article MathSciNet MATH Google Scholar
S. de Rooij, T. van Erven, D. Grunwald, and M. Koolen, “Follow the leader if you can, hedge if you must,” J. Machine Learning Res. 15, 1281–1316 (2014).
MathSciNet MATH Google Scholar
V. Vovk, “Aggregating strategies,” in Proc. 3rd Ann. Workshop on Comput. Learning Theory, San Mateo, CA, 1990 (Morgan Kaufmann, 1990), pp. 371–383.
Google Scholar
V. Vovk, “A game of prediction with expert advice,” J. Comput. Syst. Sci. 56, 153–173 (1998).
Article MathSciNet MATH Google Scholar
V. Vovk, “Derandomizing stochastic prediction strategies,” Machine Learning 35, 247–282 (1999).
Article MATH Google Scholar
V. V’yugin and V. Trunov, “Universal algorithmic trading,” J. Investment Strategies. Winter 2012/13 2 (1), 63–88 (2013).
Article Google Scholar
V. V. V’yugin, “Universal Algorithm for Trading in Stock Market Based on the Method of Calibration,” Lecture Notes in Artificial Intelligence (LNAI), 8139, 53–67 (2013).
MathSciNet MATH Google Scholar
V. V. V’yugin, “The following the perturbed leader algorithm and its application for constructing game strategies,” J. Commun. Technol. Electron. 60, 647–657 (2015).
Article Google Scholar

Download references

Author information

Authors and Affiliations

Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, 127051, Russia
V. V. V’yugin, I. A. Stel’makh & V. G. Trunov

Authors

V. V. V’yugin
View author publications
You can also search for this author in PubMed Google Scholar
I. A. Stel’makh
View author publications
You can also search for this author in PubMed Google Scholar
V. G. Trunov
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to V. V. V’yugin.

Additional information

Original Russian Text © V.V. V’yugin, I.A. Stel’makh, V.G. Trunov, 2016, published in Informatsionnye Protsessy, 2016, Vol. 16, No. 3, pp. 260–280.

Rights and permissions

Reprints and permissions

About this article

Cite this article

V’yugin, V.V., Stel’makh, I.A. & Trunov, V.G. Adaptive Algorithm of Tracking the Best Experts Trajectory. J. Commun. Technol. Electron. 62, 1434–1447 (2017). https://doi.org/10.1134/S1064226917120117

Download citation

Received: 26 September 2016
Published: 27 February 2018
Issue Date: December 2017
DOI: https://doi.org/10.1134/S1064226917120117

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Adaptive Algorithm of Tracking the Best Experts Trajectory

Abstract

Access this article

Similar content being viewed by others

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

A survey of Bayesian Network structure learning

Data-driven distributionally robust optimization using the Wasserstein metric: performance guarantees and tractable reformulations

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Adaptive Algorithm of Tracking the Best Experts Trajectory

Abstract

Access this article

Similar content being viewed by others

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

A survey of Bayesian Network structure learning

Data-driven distributionally robust optimization using the Wasserstein metric: performance guarantees and tractable reformulations

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation