Skip to main content
Log in

Learning competitive dynamic airline pricing under different customer models

  • Research Article
  • Published:
Journal of Revenue and Pricing Management Aims and scope

Abstract

There is a tendency to focus on the overly simplistic dynamic airline pricing games or to even ignore competition completely, because of the difficulty in solving game theoretic models. Recent changes in the industry mean that airlines can no longer ignore competitors in their model. This article adds more complex customer model aspects – that is, customer choice using a logit model, customer demand using a linear probabilistic demand model and market size using a binary random function – into an existing solvable airline pricing game; originally, this game only used a simple customer model. The newly formed games were solved using a reinforcement learning algorithm with mixed results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Figure 1
Figure 2
Figure 3
Figure 4
Figure 5

Similar content being viewed by others

References

  • Bansal, M. and Maglaras, C. (2008) Dynamic pricing when customers strategically time their purchase: Asymptotic optimality of a two-price policy. Journal of Revenue and Pricing Management 8 (1): 42–66.

    Article  Google Scholar 

  • Bellman, R. (1954) The theory of dynamic programming. Bulletin of American Mathematical Society 60: 503–516.

    Article  Google Scholar 

  • Bertsimas, D. and de Boer, S. (2005) Simulation-based booking limits for airline revenue management. Operations Research 53 (1): 90–106.

    Article  Google Scholar 

  • Boyd, E.A. (2007) Perspectives on the future of pricing. Presented at the 7th Annual INFORMS Pricing and Revenue Management Conference; 28–29 June, Barcelona, Spain.

  • Boyd, E.A. and Kallesen, R. (2004) The science of revenue management when passengers purchase the lowest available fare. Journal of Revenue & Pricing Management 3 (2): 171–177.

    Article  Google Scholar 

  • Bridle, J.S. (1990) Training stochastic model recognition algorithms as networks can lead to maximum mutual information estimation of parameters. In: D.S. Touretzky (ed.) Advances in Neural Information Processing Systems: Proceedings of the 1989 Conference. San Mateo, CA: Morgan Kaufmann, pp. 211–217.

    Google Scholar 

  • Brown, G.W. (1951) Iterative solution of game by fictitious play. In: T.C. Koopmans (ed.) Activity Analysis of Production and Allocation. New York: Wiley, pp. 374–376.

    Google Scholar 

  • Burger, B. and Fuchs, M. (2005) Dynamic pricing – A future airline business model. Journal of Revenue & Pricing Management 4 (1): 39–53.

    Article  Google Scholar 

  • Chinthalapati, V.L.R., Yadati, N. and Karumanchi, R. (2006) Learning dynamic prices in multiseller electronic retail markets with price sensitive customers, stochastic demands, and inventory replenishments. IEEE Transactions on Systems, Man, and Cybernetics – Part C: Applications and Reviews 36 (1): 92–106.

    Article  Google Scholar 

  • Collins, A.J. (2009) Evaluating reinforcement learning for game theory application learning to price airline seats under competition. PhD thesis, University of Southampton, UK.

  • Collins, A.J. and Thomas, L. (2012) Comparing reinforcement learning approaches for solving game theoretic models: A dynamic airline pricing game example. Journal of the Operational Research Society 63 (8): 1165–1173.

    Article  Google Scholar 

  • Cooper, W.L., Homem-de-Mello, T. and Kleywegt, A.J. (2006) Models of the spiral-down effect in revenue management. Operations Research 54 (5): 968–987.

    Article  Google Scholar 

  • Currie, C., Cheng, R.C.H. and Smith, H.K. (2008) Dynamic pricing of airline tickets with competition. Journal of the Operational Research Society 59 (8): 1026–1037.

    Article  Google Scholar 

  • Debreu, G. (1960) Review of individual choice behaviour: A theoretical analysis by R. Duncan Luce. The American Economic Review 50: 186–188.

    Google Scholar 

  • Donnelly, S., James, A. and Binnion, C. (2004) bmi's response to the changing European airline marketplace. Journal of Revenue & Pricing Management 3 (1): 10–17.

    Article  Google Scholar 

  • Dunleavy, H. and Phillips, G. (2009) The future of airline revenue management. Journal of Revenue & Pricing Management 8 (4): 388–395.

    Article  Google Scholar 

  • Fudenberg, D. and Levine, D.K. (1998) The Theory of Learning in Games. London: MIT Press.

    Google Scholar 

  • Gao, H., Ball, M.O. and Karaesmen, I.Z. (2010) Competitive seat inventory control decisions under the regret criterion. Journal of Revenue and Pricing Management 9 (1–2): 49–65.

    Article  Google Scholar 

  • Garrow, L.A. and Koppelman, F.S. (2004) Multinomial and nested logit models of airline passengers’ no-show and standby behaviour. Journal of Revenue and Pricing Management 3 (3): 237–253.

    Article  Google Scholar 

  • Gosavi, A. (2003) Simulation-Based Optimization: Parametric Optimization Techniques and Reinforcement Learning. Norwell, MA: Springer.

    Book  Google Scholar 

  • Gosavi, A., Bandia, N. and Das, T.K. (2002) A reinforcement learning approach to a single leg airline revenue management problem with multiple fare classes and overbooking. IIE Transactions 34 (9): 729–742.

    Google Scholar 

  • Gosavi, A., Bandia, N. and Das, T.K. (2007) Simulation optimization for revenue management of airlines with cancellation and overbooking. OR Spectrum 29 (1): 21–38.

    Article  Google Scholar 

  • Isler, K. and Imhof, H. (2008) A game theoretic model for airline revenue management and competitive pricing. Journal of Revenue and Pricing Management 7 (4): 384–396.

    Article  Google Scholar 

  • Joe, H. (1997) Multivariate Models and Multivariate Dependence Concepts. Boca Raton, FL: Chapman and Hall/CRC.

    Book  Google Scholar 

  • Kononen, V. (2006) Dynamic pricing based on asymmetric multiagent reinforcement learning. International Journal of Intelligent Systems 21 (1): 73–98.

    Article  Google Scholar 

  • Kunnumkal, S. and Topaloglu, H. (2010) A stochastic approximation algorithm for making pricing decisions in network revenue management problems. Journal of Revenue and Pricing Management 9 (5): 419–442.

    Article  Google Scholar 

  • Luce, R.D. (1959) Individual Choice Behaviour. Oxford: Wiley.

    Google Scholar 

  • Manski, C.F. (1977) The structure of random utility models. Theory and Decision 8 (3): 229–254.

    Article  Google Scholar 

  • Maynard Smith, J. (1982) Evolution and the Theory of Games. Cambridge: Cambridge University Press.

    Book  Google Scholar 

  • McFadden, D. (1980) Econometric models of probabilistic choice among products. Journal of Business 53: S13–S29.

    Article  Google Scholar 

  • Nash, J. (1951) Non-cooperative games. The Annals of Mathematics 54 (2): 286–295.

    Article  Google Scholar 

  • Netessine, S. and Shumsky, R.A. (2005) Revenue management games: Horizontal and vertical competition. Management Science 51 (5): 813–831.

    Article  Google Scholar 

  • Oum, T.H. (1979) A warning on the use of linear logit models in transport mode choice studies. Bell Journal of Economics 10: 374–388.

    Article  Google Scholar 

  • Pavlov, I.P. (1927) Conditioned Reflexes. Oxford: Oxford University Press.

    Google Scholar 

  • Pidd, M. (1996) Tools for Thinking: Modelling in Management Science. Chichester, UK: Wiley.

    Google Scholar 

  • Ratliff, R. and Vinod, B. (2005) Future of revenue management: Airline pricing and revenue management: A future outlook. Journal of Revenue and Pricing Management 4 (3): 302–307.

    Article  Google Scholar 

  • Rodriguez-Fernández, J.L. (1999) Ockham's razor. Endeavour 23 (3): 121–125.

    Article  Google Scholar 

  • Rummery, G.A. (1995) Problem solving with reinforcement learning. PhD thesis, Cambridge University, UK.

  • Salt, J.D. (2008) The seven habits of highly defective simulation projects. Journal of Simulation 2 (3): 155–161.

    Article  Google Scholar 

  • Shenk, D. (2007) The Immortal Game: A History of Chess. New York: Anchor.

    Google Scholar 

  • Sridharan, M. and Tesauro, G. (2000) Multi-Agent Q-learning and Regression Trees for Automated Pricing Decisions, In: Proceedings of the 17th International Conference on Machine Learning; 29 June – 2 July, Stanford, CA, pp. 927–934.

  • Sutton, R.S. and Barto, A. (1998) Reinforcement Learning: An Introduction. Cambridge: MIT Press.

    Google Scholar 

  • Talluri, K.T. and van Ryzin, G.L. (2004) Theory and Practice of Revenue Management. New York: Springer-Verlag.

    Book  Google Scholar 

  • Tesauro, G. and Kephart, J.O. (2002) Pricing in agent economies using multi-agent Q-learning. Autonomous Agents and Multi-Agent Systems 5 (3): 289–304.

    Article  Google Scholar 

  • Thurstone, L.L. (1927a) A law of comparative judgement. Psychological Review 34: 273–286.

    Article  Google Scholar 

  • Thurstone, L.L. (1927b) Psychological analysis. American Journal of Psychology 38: 368–389.

    Article  Google Scholar 

  • van Ryzin, G. and Vulcano, G. (2008) Simulation-based optimization of virtual nesting controls for network revenue management. Operations Research 56 (4): 865–880.

    Article  Google Scholar 

  • Vinod, B., Narayan, C.P. and Ratliff, R.M. (2009) Pricing decision support: Optimising fares in competitive markets. Journal of Revenue and Pricing Management 8 (4): 295–312.

    Article  Google Scholar 

  • von Neumann, J. and Morgenstern, O. (1944) Theory of Games and Economic Behaviour. Princeton, NJ: Princeton University Press.

    Google Scholar 

  • von Stackelberg, H.F. (1934) Marktform und Gleichgewicht. Vienna: Julius Springer.

    Google Scholar 

  • Walczak, D., Mardan, S. and Kallesen, R. (2010) Customer choice, fare adjustments and the marginal expected revenue data transformation: A note on using old yield management techniques in the brave new world of pricing. Journal of Revenue and Pricing Management 9 (1–2): 94–109.

    Article  Google Scholar 

  • Watkins, C.J.C.H. (1989) Learning from delayed rewards. PhD thesis, Cambridge University, UK.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Andrew Collins.

Additional information

2has been the Professor of Management Science at the University of Southampton since 2000. He was formerly the president of the Operational Research (OR) Society (1994–1995) and was awarded the Beale Medal of OR Society in 2008. Specialist research areas include credit scoring and credit control, and use of management science techniques in financial and banking areas.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Collins, A., Thomas, L. Learning competitive dynamic airline pricing under different customer models. J Revenue Pricing Manag 12, 416–430 (2013). https://doi.org/10.1057/rpm.2013.10

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1057/rpm.2013.10

Keywords

Navigation