Automated Insulin Delivery (AID) systems have shown great promise in managing diabetes by automating insulin administration. However, a significant challenge remains: preventing hypoglycemia or hyperglycemia during dynamic glucose fluctuations while minimizing the daily insulin dosage, or control effort. This study explores ways to enhance AID systems using a Reinforcement Learning (RL) algorithm called Maskable Proximal Policy Optimization (Maskable PPO), based on invalid action masking. Our findings demonstrate that this approach leads to a safety-aware framework for AIDs, providing highly realistic simulation scenarios for individual adult, adolescent, and child patients. The results show improved Time In Range (TIR) metrics (96.39%, 96.85%, and 54.43%), prevention of emergency bolus administration in adolescent and adult patients, and a reduction in Total Injected Insulin (TII) (17.32 U, 13.52 U, and 4.35 U per day).
A Safety Aware Deep Reinforcement Learning Technique for Automated Insulin Delivery / Lops, Giada; Manfredi, Gioacchino; Racanelli, Vito Andrea; De Cicco, Luca; Mascolo, Saverio. - (2025), pp. 120-125. ( 2025 33rd Mediterranean Conference on Control and Automation (MED) Tangier, Morocco June 10-13, 2025) [10.1109/med64031.2025.11073297].
A Safety Aware Deep Reinforcement Learning Technique for Automated Insulin Delivery
Lops, Giada
;Manfredi, Gioacchino;Racanelli, Vito Andrea;De Cicco, Luca;Mascolo, Saverio
2025
Abstract
Automated Insulin Delivery (AID) systems have shown great promise in managing diabetes by automating insulin administration. However, a significant challenge remains: preventing hypoglycemia or hyperglycemia during dynamic glucose fluctuations while minimizing the daily insulin dosage, or control effort. This study explores ways to enhance AID systems using a Reinforcement Learning (RL) algorithm called Maskable Proximal Policy Optimization (Maskable PPO), based on invalid action masking. Our findings demonstrate that this approach leads to a safety-aware framework for AIDs, providing highly realistic simulation scenarios for individual adult, adolescent, and child patients. The results show improved Time In Range (TIR) metrics (96.39%, 96.85%, and 54.43%), prevention of emergency bolus administration in adolescent and adult patients, and a reduction in Total Injected Insulin (TII) (17.32 U, 13.52 U, and 4.35 U per day).I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

