Advertisement

Abstract

We describe a model-free approach to bidding in the Ad-Auctions Trading Agents Competition: First, a simple and robust yet high-performing agent using a Regret Minimization optimization algorithm for the 2010 competition, followed by our top performing agent for the 2011 competition, still using simplified modeling and optimization methods. Specifically, we model the user populations using particle filters, but base the observations on a Nearest Neighbor estimator (instead of game specific parameters). We implement a simple and effective bid optimization algorithm by applying the equimarginal principle combined with perplexity-based regularization. The implementation of our 2011 agent also remains model-free in the sense that we do not attempt to model the competing agents behavior for estimating costs and associated game parameters.

Keywords

User Population Trading Agent Regularization Factor Reserve Prex Neighbor Estimator 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Berg, J., Greenwald, A., Naroditskiy, V., Sodomka, E.: A First Approach to Autonomous Bidding in Ad Auctions. In: ACM EC 2010 Workshop on Trading Agent Design and Analysis (2010)Google Scholar
  2. 2.
    Pardoe, D., Chakraborty, D., Stone, P.: TacTex09: A Champion Bidding Agent for Ad Auctions. In: Proceedings of the Ninth International Conference on Autonomous Agents and Multiagent Systems, Toronto, Canada (2010)Google Scholar
  3. 3.
    Pardoe, D., Stone, P.: A Particle Filter for Bid Estimation in Ad Auctions with Periodic Ranking Observations. In: EC 2010 Workshop on Trading Agent Design and Analysis (TADA), Cambridge, Massachusetts (2010)Google Scholar
  4. 4.
    Jordan, P.R., Wellman, M.P.: Designing an ad auctions game for the trading agent competition. In: David, E., Gerding, E., Sarne, D., Shehory, O. (eds.) AMEC 2009. LNBIP, vol. 59, pp. 147–162. Springer, Heidelberg (2010)CrossRefGoogle Scholar
  5. 5.
    Blum, A., Mansour, Y.: Learning, Regret minimization, and Equilibria. In: Algorithmic Game Theory, ch. 4 (2007)Google Scholar
  6. 6.
    Littlestone, N., Warmuth, M.: The weighted majority algorithm. Info. and Computation 108(2), 212–261 (1994)MathSciNetCrossRefzbMATHGoogle Scholar
  7. 7.
    Freund, Y., Schapire, R.: A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Comp. and System Sciences 55(1), 119–139 (1997)MathSciNetCrossRefzbMATHGoogle Scholar
  8. 8.
    Cesa-Bianchi, N., Lugosi, G.: Prediction, learning, and games. Cambridge University Press (2006)Google Scholar
  9. 9.
    Vorobeychik, Y.: A Game Theoretic Bidding Agent for the Ad Auction Game. In: Third International Conference on Agents and Articial Intelligence (2011)Google Scholar
  10. 10.
    Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA Data Mining Software: An Update. SIGKDD Explorations 11(1) (2009)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Mariano Schain
    • 1
  • Shai Hertz
    • 1
  • Yishay Mansour
    • 1
  1. 1.Tel Aviv UniversityIsrael

Personalised recommendations