Non-linear Continuous Action Spaces for Reinforcement Learning in Type 1 Diabetes

Hettiarachchi, Chirath; Malagutti, Nicolo; Nolan, Christopher J.; Suominen, Hanna; Daskalaki, Elena

doi:10.1007/978-3-031-22695-3_39

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13728))

Included in the following conference series:

Australasian Joint Conference on Artificial Intelligence

1586 Accesses
2 Altmetric

Abstract

Artificial Pancreas Systems (APS) aim to improve glucose regulation and relieve people with Type 1 Diabetes (T1D) from the cognitive burden of ongoing disease management. They combine continuous glucose monitoring and control algorithms for automatic insulin administration to maintain glucose homeostasis. The estimation of an appropriate control action—or—insulin infusion rate is a complex optimisation problem for which Reinforcement Learning (RL) algorithms are currently being explored due to their performance capabilities in complex, uncertain environments. However, insulin requirements vary markedly according to sleep patterns, meal and exercise events. Hence, a large dynamic range of insulin infusion rates is required necessitating a large continuous action space which is challenging for RL algorithms. In this study, we introduced the use of non-linear continuous action spaces as a method to tackle the problem of efficiently exploring the large dynamic range of insulin towards learning effective control policies. Three non-linear action space formulations inspired by clinical patterns of insulin delivery were explored and analysed based on their impact to performance and efficiency in learning. We implemented a state-of-the-art RL algorithm and evaluated the performance of the proposed action spaces in-silico using an open-source T1D simulator based on the UVA/Padova 2008 model. The proposed exponential action space achieved a 24% performance improvement over the linear action space commonly used in practice, while portraying fast and steady learning. The proposed action space formulation has the potential to enhance the performance of RL algorithms for APS.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Automatic glycemic regulation for the pediatric population based on switched control and time-varying IOB constraints: an in silico study

Article 24 July 2020

A metamodel-based flexible insulin therapy for type 1 diabetes patients subjected to aerobic physical activity

Article Open access 16 May 2022

Blood Glucose Regulation Models in Artificial Pancreas for Type-1 Diabetic Patients

Article 01 January 2023

References

Bothe, M.K., Dickens, L., et al.: The use of reinforcement learning algorithms to meet the challenges of an artificial pancreas. Expert Rev. Med. Devices 10(5), 661–673 (2013)
Article Google Scholar
Brew-Sam, N., Chhabra, M., et al.: Experiences of young people and their caregivers of using technology to manage type 1 diabetes mellitus: systematic literature review and narrative synthesis. JMIR Diabetes 6(1), e20973 (2021)
Article Google Scholar
Brockman, G., et al.: OpenAI gym. arXiv Eprint arXiv:1606.01540 (2016)
Cobelli, C., Renard, E., Kovatchev, B.: Artificial pancreas: past, present, future. Diabetes 60(11), 2672–2682 (2011)
Article Google Scholar
DiMeglio, L.A., Evans-Molina, C., Oram, R.A.: Type 1 diabetes. Lancet 391(10138), 2449–2462 (2018)
Article Google Scholar
Dulac-Arnold, G., Mankowitz, D., Hester, T.: Challenges of real-world reinforcement learning. arXiv preprint arXiv:1904.12901 (2019)
Fox, I., Wiens, J.: Reinforcement learning for blood glucose control: challenges and opportunities. In: Reinforcement Learning for Real Life (RL4RealLife) Workshop in the 36th International Conference on Machine Learning (2019)
Google Scholar
Fox, I., et al.: Deep reinforcement learning for closed-loop blood glucose control. In: Machine Learning for Healthcare Conference, pp. 508–536. PMLR (2020)
Google Scholar
Kovatchev, B.P., Breton, M., et al.: In silico preclinical trials: a proof of concept in closed-loop control of type 1 diabetes. J. Diabetes Sci. Technol. 3(1), 44–55 (2009)
Article Google Scholar
Kovatchev, B.P., Clarke, W.L., et al.: Quantifying temporal glucose variability in diabetes via continuous glucose monitoring: mathematical methods and clinical application. Diabetes Technol. Ther. 7(6), 849–862 (2005)
Article Google Scholar
Lazaric, A., Restelli, M., Bonarini, A.: Reinforcement learning in continuous action spaces through sequential monte carlo methods. In: Advances in Neural Information Processing Systems, vol. 20 (2007)
Google Scholar
Lee, S., Kim, J., et al.: Toward a fully automated artificial pancreas system using a bioinspired reinforcement learning design: in silico validation. IEEE J. Biomed. Health Inform. 25(2), 536–546 (2020)
Article Google Scholar
Lillicrap, T.P., Hunt, J.J., Pritzel, A., et al.: Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971 (2015)
Lim, M.H., Lee, W.H., et al.: A blood glucose control framework based on reinforcement learning with safety and interpretability: in silico validation. IEEE Access 9, 105756–105775 (2021)
Article Google Scholar
Mann, H.B., Whitney, D.R.: On a test of whether one of two random variables is stochastically larger than the other. Ann. Math. Stat. 18, 50–60 (1947)
Article MATH Google Scholar
Naik, A., Shariff, R., et al.: Discounted reinforcement learning is not an optimization problem. arXiv preprint arXiv:1910.02140 (2019)
Nathan, D., Genuth, S., et al.: The effect of intensive treatment of diabetes on the development and progression of long-term complications in insulin-dependent diabetes mellitus. N. Engl. J. Med. 329(14), 977–986 (1993)
Article Google Scholar
Online: Insulin pump comparison. http://www.betterlivingnow.com/forms/Insulin-Pump-Comparison.pdf. Accessed 24 Mar 2022
Rorsman, P., Eliasson, L., Renstrom, E., Gromada, J., Barg, S., Gopel, S.: The cell physiology of biphasic insulin secretion. Physiology 15(2), 72–77 (2000)
Article Google Scholar
Schrittwieser, J., Antonoglou, I., et al.: Mastering Atari, go, chess and shogi by planning with a learned model. Nature 588(7839), 604–609 (2020)
Article Google Scholar
Schulman, J., Moritz, P., Levine, S., et al.: High-dimensional continuous control using generalized advantage estimation. arXiv preprint arXiv:1506.02438 (2015)
Schulman, J., Wolski, F., et al.: Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347 (2017)
Shah, R.B., Patel, M., et al.: Insulin delivery methods: past, present and future. Int. J. Pharm. Investig. 6(1), 1–9 (2016)
Article Google Scholar
Shapiro, S.S., Wilk, M.B.: An analysis of variance test for normality (complete samples). Biometrika 52(3/4), 591–611 (1965)
Article MATH Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (2018)
MATH Google Scholar
Tassa, Y., Doron, Y., Muldal, A., Erez, T., et al.: DeepMind control suite. arXiv preprint arXiv:1801.00690 (2018)
Todorov, E., Erez, T., Tassa, Y.: MuJoCo: a physics engine for model-based control. In: 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 5026–5033. IEEE (2012)
Google Scholar
Vajapey, A.: Predicting optimal sedation control with reinforcement learning. Ph.D. thesis, Massachusetts Institute of Technology (2019)
Google Scholar
Xie, J.: Simglucose v0. 2.1 (2018). https://github.com/jxx123/simglucose. Accessed 13 Jan 2022
Zahavy, T., et al.: Learn what not to learn: action elimination with deep reinforcement learning. In: Advances in Neural Information Processing Systems, vol. 31 (2018)
Google Scholar
Zhu, T., Li, K., Georgiou, P.: A dual-hormone closed-loop delivery system for type 1 diabetes using deep reinforcement learning. arXiv preprint arXiv:1910.04059 (2019)
Zhu, T., Li, K., Herrero, P., Georgiou, P.: Basal glucose control in type 1 diabetes using deep reinforcement learning: an in silico validation. IEEE J. Biomed. Health Inform. 25(4), 1223–1232 (2020)
Article Google Scholar

Download references

Acknowledgments

This research was funded in part by the Australian National University and the Our Health in Our Hands initiative.

Author information

Authors and Affiliations

Australian National University, Canberra, Australia
Chirath Hettiarachchi, Nicolo Malagutti, Christopher J. Nolan, Hanna Suominen & Elena Daskalaki
University of Turku, Turku, Finland
Hanna Suominen

Authors

Chirath Hettiarachchi
View author publications
You can also search for this author in PubMed Google Scholar
Nicolo Malagutti
View author publications
You can also search for this author in PubMed Google Scholar
Christopher J. Nolan
View author publications
You can also search for this author in PubMed Google Scholar
Hanna Suominen
View author publications
You can also search for this author in PubMed Google Scholar
Elena Daskalaki
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chirath Hettiarachchi .

Editor information

Editors and Affiliations

University of New South Wales, Sydney, NSW, Australia
Haris Aziz
University of Western Australia, Perth, WA, Australia
Débora Corrêa
University of Western Australia, Perth, WA, Australia
Tim French

Ethics declarations

Code Availability

A repository of code used in this study, and further supplementary material, is available at https://github.com/chirathyh/G2P2C.

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hettiarachchi, C., Malagutti, N., Nolan, C.J., Suominen, H., Daskalaki, E. (2022). Non-linear Continuous Action Spaces for Reinforcement Learning in Type 1 Diabetes. In: Aziz, H., Corrêa, D., French, T. (eds) AI 2022: Advances in Artificial Intelligence. AI 2022. Lecture Notes in Computer Science(), vol 13728. Springer, Cham. https://doi.org/10.1007/978-3-031-22695-3_39

Download citation

DOI: https://doi.org/10.1007/978-3-031-22695-3_39
Published: 03 December 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-22694-6
Online ISBN: 978-3-031-22695-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Non-linear Continuous Action Spaces for Reinforcement Learning in Type 1 Diabetes

Abstract

Access this chapter

Similar content being viewed by others

Automatic glycemic regulation for the pediatric population based on switched control and time-varying IOB constraints: an in silico study

A metamodel-based flexible insulin therapy for type 1 diabetes patients subjected to aerobic physical activity

Blood Glucose Regulation Models in Artificial Pancreas for Type-1 Diabetic Patients

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Ethics declarations

Code Availability

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Non-linear Continuous Action Spaces for Reinforcement Learning in Type 1 Diabetes

Abstract

Access this chapter

Similar content being viewed by others

Automatic glycemic regulation for the pediatric population based on switched control and time-varying IOB constraints: an in silico study

A metamodel-based flexible insulin therapy for type 1 diabetes patients subjected to aerobic physical activity

Blood Glucose Regulation Models in Artificial Pancreas for Type-1 Diabetic Patients

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Ethics declarations

Code Availability

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation