论文标题
早产儿的姿势估计具有时空特征
Preterm infants' pose estimation with spatio-temporal features
论文作者
论文摘要
目的:早产儿在新生儿重症监护病房(NICUS)中的肢体监测对于评估婴儿的健康状况和运动/认知发展至关重要。本文中,我们提出了一种新的方法,用于早产儿的肢体姿势估计,该方法具有时空信息,以从具有高可靠性的深度视频中检测和跟踪肢体关节。方法:分别使用检测和回归卷积神经网络(CNN)组成的深度学习框架进行粗糙和精确的关节定位,进行肢体置换估计。通过3D卷积,实现了CNN以在时间方向上编码连接性。对拟议框架的评估是通过一项综合研究进行的,其中16个早产儿(Babypose数据集)在实际临床实践中获得了16个深度视频。结果:当应用于姿势估计时,在所有四肢,估计和地面真相姿势之间计算的中值均方根距离为9.06像素,仅基于空间特征(仅11.27像素)来克服方法。结论:结果表明,时空特征对姿势估计性能有重大影响,尤其是在具有挑战性的情况下(例如,均质图像强度)。意义:本文通过引入使用时空特征来自动评估早产儿的健康状况,从而显着增强了艺术状态,并成为第一个使用实际临床实践中获得的深度视频进行肢体估计的深度视频的研究。 Babypose数据集已作为婴儿姿势估计的第一个注释数据集发布。
Objective: Preterm infants' limb monitoring in neonatal intensive care units (NICUs) is of primary importance for assessing infants' health status and motor/cognitive development. Herein, we propose a new approach to preterm infants' limb pose estimation that features spatio-temporal information to detect and track limb joints from depth videos with high reliability. Methods: Limb-pose estimation is performed using a deep-learning framework consisting of a detection and a regression convolutional neural network (CNN) for rough and precise joint localization, respectively. The CNNs are implemented to encode connectivity in the temporal direction through 3D convolution. Assessment of the proposed framework is performed through a comprehensive study with sixteen depth videos acquired in the actual clinical practice from sixteen preterm infants (the babyPose dataset). Results: When applied to pose estimation, the median root mean squared distance, computed among all limbs, between the estimated and the ground-truth pose was 9.06 pixels, overcoming approaches based on spatial features only (11.27pixels). Conclusion: Results showed that the spatio-temporal features had a significant influence on the pose-estimation performance, especially in challenging cases (e.g., homogeneous image intensity). Significance: This paper significantly enhances the state of art in automatic assessment of preterm infants' health status by introducing the use of spatio-temporal features for limb detection and tracking, and by being the first study to use depth videos acquired in the actual clinical practice for limb-pose estimation. The babyPose dataset has been released as the first annotated dataset for infants' pose estimation.