论文标题
使用深钢筋学习中的1型糖尿病中的基底葡萄糖控制:硅验证
Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement Learning: An In Silico Validation
论文作者
论文摘要
患有1型糖尿病(T1D)的人需要定期对胰岛素进行外源性输注,以在治疗范围足够的靶范围内保持其血糖浓度。尽管事实证明,人工胰腺和连续的葡萄糖监测可以有效地实现闭环控制,但由于葡萄糖动力学和技术中的局限性的高度复杂性,仍然存在重大挑战。在这项工作中,我们提出了一种新颖的深钢筋学习模型,用于单激素(胰岛素)和双激素(胰岛素和胰高血糖素)的递送。特别是,通过双重Q学习和扩张的复发神经网络开发了交付策略。为了设计和测试目的,采用了FDA所接受的UVA/PADOVA 1型模拟器。首先,我们进行了长期的广义培训以获得人口模型。然后,该模型是通过小型数据集的特定于特定数据集的个性化的。在计算机结果中表明,与具有低葡萄糖胰岛素悬浮液的标准基底胶疗法相比,单一和双激素的递送策略可获得良好的葡萄糖控制。具体而言,在成人队列(n = 10)中,目标范围[70,180] mg/dl的百分比从单激素控制的77.6%提高到80.9%,并具有双激素控制的$ 85.6 \%\%。在青少年队列(n = 10)中,单激素对照的目标范围的百分比从55.5%提高到65.9%,并在双激素控制方面提高到78.8%。在所有情况下,观察到低血糖的显着降低。这些结果表明,使用深钢筋学习是T1D中闭环葡萄糖控制的可行方法。
People with Type 1 diabetes (T1D) require regular exogenous infusion of insulin to maintain their blood glucose concentration in a therapeutically adequate target range. Although the artificial pancreas and continuous glucose monitoring have been proven to be effective in achieving closed-loop control, significant challenges still remain due to the high complexity of glucose dynamics and limitations in the technology. In this work, we propose a novel deep reinforcement learning model for single-hormone (insulin) and dual-hormone (insulin and glucagon) delivery. In particular, the delivery strategies are developed by double Q-learning with dilated recurrent neural networks. For designing and testing purposes, the FDA-accepted UVA/Padova Type 1 simulator was employed. First, we performed long-term generalized training to obtain a population model. Then, this model was personalized with a small data-set of subject-specific data. In silico results show that the single and dual-hormone delivery strategies achieve good glucose control when compared to a standard basal-bolus therapy with low-glucose insulin suspension. Specifically, in the adult cohort (n=10), percentage time in target range [70, 180] mg/dL improved from 77.6% to 80.9% with single-hormone control, and to $85.6\%$ with dual-hormone control. In the adolescent cohort (n=10), percentage time in target range improved from 55.5% to 65.9% with single-hormone control, and to 78.8% with dual-hormone control. In all scenarios, a significant decrease in hypoglycemia was observed. These results show that the use of deep reinforcement learning is a viable approach for closed-loop glucose control in T1D.