Inter-carrier SLA negotiation using Q-learning

Pouyllau, Hélia; Carofiglio, Giovanna

doi:10.1007/s11235-011-9505-5

Inter-carrier SLA negotiation using Q-learning

Published: 15 June 2011

Volume 52, pages 611–622, (2013)
Cite this article

Telecommunication Systems Aims and scope Submit manuscript

Hélia Pouyllau¹ &
Giovanna Carofiglio¹

246 Accesses
11 Citations
Explore all metrics

Abstract

Inter-domain high performance services (e.g. telepresence) are not sustainable over the current Internet architecture. The Quality of Service (QoS) guarantees they demand require to settle on end-to-end Service Level Agreements (SLAs) among providers (aka. carriers) and across different networks. This process is critical since it must provide the most benefits while dealing with heterogeneous operators’ business interests and confidentiality constraints. In this paper, we propose, in the frame of a cooperative organizational model called federation, a composition technique for inter-carrier SLAs that respects end-user’s QoS requirements while maximizing network operators’ long-term benefits. We formulate the dynamic optimization problem as a Markov Decision process (MDP). This latter allows to provide an iterative near-optimal solution through reinforcement learning (more precisely, Q-learning). The SLA composition is thus performed taking into account customers and network providers’ utilities. We also propose a version including several negotiation rounds and observe how it affects the results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Bakiras, S., & Li, V. (2004). A scalable architecture for end-to-end QoS provisioning. Computer Communications, 27, 1330–1340.
Article Google Scholar
Barth, D., Echabbi, L., & Hamlaoui, C. (2008). Optimal transit price negotiation: The distributed learning perspective. Journal of Universal Computer Science, 14, 745–765.
Google Scholar
Bertsekas, D. P., & Tsitsiklis, J. N. (1996). Neuro-dynamic programming. Nashua: Athena Scientific.
Google Scholar
Djarallah, N., & Pouyllau, H. (2009). Algorithms for SLA composition to provide inter-domain services. In IEEE IM mini-conference.
Google Scholar
Casier, K. et al. (2006). A fair cost allocation scheme for CapEx and OpEx for a network service provider. In Conference on telecommunication techno-economics.
Google Scholar
Howarth, M. P. et al. (2006). End-to-end quality of service provisioning through inter-provider traffic engineering. Computer Communications.
Even-dar, E., & Mansour, Y. (2003). Learning rates for q-learning. Journal of Machine Learning Research, pp. 1–25.
Fei, Y., Wong, V., & Leung, V. (2006). Efficient QoS provisioning for adaptive multimedia in mobile communication networks by reinforcement learning. Mobile Networks and Applications, 11, 101–110.
Article Google Scholar
Telemanagement forum. www.tmforum.org/.
Kumar, N., & Saraph, G. (2006). End-to-end QoS in interdomain routing. In ICNS. Los Alamitos: IEEE Computer Society.
Google Scholar
Ma, R., Chiu, D., Lui, J., Misra, V., & Rubenstein, D. (2007). Internet economics: The use of Shapley value for ISP settlement. In ACM conference on emerging network experiment and technology.
Google Scholar
Mellouk, A., Hoceini, S., & Larynouna, S. (2006). Flow based routing for irregular traffic using reinforcement learning approach in dynamic networks. In ISCC.
Google Scholar
Pouyllau, H., & Douville, R. (2010). End-to-end qos negotiation in network federations. In IEEE NOMS bandwidth on demand (BoD) workshop.
Google Scholar
Le Sauze, N., Chiosi, A., Douville, R., Pouyllau, H., Lonsethagen, H., Fantini, P., Palas-ciano, C., Cimmino, A., Callejo Rodriguez, M. A., Dugeon, O., Kofman, D., Gadefait, X., Cuer, P., Ciulli, N., Carrozzo, G., Soppera, A., Briscoe, B., Bornstaedt, F., Andreou, M., Stamoulis, G., Courcoubetis, C., Reichl, P., Gojmerac, I., Rougier, J. L., Vaton, S., Barth, D., & Orda, A. (2010). Etics: Qos-enabled interconnection for future Internet services. In Future network and mobile summit.
Google Scholar
Shakkottai, S., & Srikant, R. (2005). Economics of network pricing with multiple ISPs. In INFOCOM.
Google Scholar
Sutton, R. S., & Barto, A. G. (1998). Reinforcement learning: an introduction (adaptive computation and machine learning). Cambridge: MIT Press.
Google Scholar
Tesauro, G., Jong, N. K., Das, R., & Bennani, M. N. (2006). A hybrid reinforcement learning approach to autonomic resource allocation. In ICAC ’06: Proceedings of the 2006 IEEE international conference on autonomic computing. Los Alamitos: IEEE Computer Society.
Google Scholar
Tong, H., & Brown, T. X. (2002). Reinforcement learning for call admission control and routing under quality of service constraints in multimedia networks. Journal of Machine Learning Research, pp. 111–139.
Watkins, C. J. C. H., & Dayan, P. (1992). Technical note: Q-learning. Journal of Machine Learning Research, pp. 279–292.
Williamson, O. E. (1991). Strategizing, economizing and economic organization. Strategic Management Journal, 12, 75–94. (Special Issue).
Article Google Scholar
Xiao, J., & Boutaba, R. (2005). QoS-aware service composition and adaptation in autonomic communication. IEEE Journal on Selected Areas in Communications 23.

Download references

Author information

Authors and Affiliations

Alcatel-Lucent Bell Labs France, Centre de Villarceaux, 91620, Nozay, France
Hélia Pouyllau & Giovanna Carofiglio

Authors

Hélia Pouyllau
View author publications
You can also search for this author in PubMed Google Scholar
Giovanna Carofiglio
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hélia Pouyllau.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Pouyllau, H., Carofiglio, G. Inter-carrier SLA negotiation using Q-learning. Telecommun Syst 52, 611–622 (2013). https://doi.org/10.1007/s11235-011-9505-5

Download citation

Published: 15 June 2011
Issue Date: February 2013
DOI: https://doi.org/10.1007/s11235-011-9505-5

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Inter-carrier SLA negotiation using Q-learning

Abstract

Access this article

Similar content being viewed by others

Availability-aware and energy-aware dynamic SFC placement using reinforcement learning

DM-CSAT: a LTE-U/Wi-Fi coexistence solution based on reinforcement learning

Approximate Q-learning-based (AQL) network slicing in mobile edge-cloud for delay-sensitive services

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Inter-carrier SLA negotiation using Q-learning

Abstract

Access this article

Similar content being viewed by others

Availability-aware and energy-aware dynamic SFC placement using reinforcement learning

DM-CSAT: a LTE-U/Wi-Fi coexistence solution based on reinforcement learning

Approximate Q-learning-based (AQL) network slicing in mobile edge-cloud for delay-sensitive services

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation