使用深钢筋学习中的1型糖尿病中的基底葡萄糖控制：硅验证

论文标题

使用深钢筋学习中的1型糖尿病中的基底葡萄糖控制：硅验证

Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement Learning: An In Silico Validation

论文作者

Zhu, Taiyu, Li, Kezhi, Herrero, Pau, Georgiou, Pantelis

论文摘要

患有1型糖尿病（T1D）的人需要定期对胰岛素进行外源性输注，以在治疗范围足够的靶范围内保持其血糖浓度。尽管事实证明，人工胰腺和连续的葡萄糖监测可以有效地实现闭环控制，但由于葡萄糖动力学和技术中的局限性的高度复杂性，仍然存在重大挑战。在这项工作中，我们提出了一种新颖的深钢筋学习模型，用于单激素（胰岛素）和双激素（胰岛素和胰高血糖素）的递送。特别是，通过双重Q学习和扩张的复发神经网络开发了交付策略。为了设计和测试目的，采用了FDA所接受的UVA/PADOVA 1型模拟器。首先，我们进行了长期的广义培训以获得人口模型。然后，该模型是通过小型数据集的特定于特定数据集的个性化的。在计算机结果中表明，与具有低葡萄糖胰岛素悬浮液的标准基底胶疗法相比，单一和双激素的递送策略可获得良好的葡萄糖控制。具体而言，在成人队列（n = 10）中，目标范围[70，180] mg/dl的百分比从单激素控制的77.6％提高到80.9％，并具有双激素控制的$ 85.6 \％\％。在青少年队列（n = 10）中，单激素对照的目标范围的百分比从55.5％提高到65.9％，并在双激素控制方面提高到78.8％。在所有情况下，观察到低血糖的显着降低。这些结果表明，使用深钢筋学习是T1D中闭环葡萄糖控制的可行方法。

People with Type 1 diabetes (T1D) require regular exogenous infusion of insulin to maintain their blood glucose concentration in a therapeutically adequate target range. Although the artificial pancreas and continuous glucose monitoring have been proven to be effective in achieving closed-loop control, significant challenges still remain due to the high complexity of glucose dynamics and limitations in the technology. In this work, we propose a novel deep reinforcement learning model for single-hormone (insulin) and dual-hormone (insulin and glucagon) delivery. In particular, the delivery strategies are developed by double Q-learning with dilated recurrent neural networks. For designing and testing purposes, the FDA-accepted UVA/Padova Type 1 simulator was employed. First, we performed long-term generalized training to obtain a population model. Then, this model was personalized with a small data-set of subject-specific data. In silico results show that the single and dual-hormone delivery strategies achieve good glucose control when compared to a standard basal-bolus therapy with low-glucose insulin suspension. Specifically, in the adult cohort (n=10), percentage time in target range [70, 180] mg/dL improved from 77.6% to 80.9% with single-hormone control, and to $85.6\%$ with dual-hormone control. In the adolescent cohort (n=10), percentage time in target range improved from 55.5% to 65.9% with single-hormone control, and to 78.8% with dual-hormone control. In all scenarios, a significant decrease in hypoglycemia was observed. These results show that the use of deep reinforcement learning is a viable approach for closed-loop glucose control in T1D.

下载PDF全文

下载文献需遵守相关版权规定

论文标题