论文标题
毯子下的多模式内姿势和形状估计
Multimodal In-bed Pose and Shape Estimation under the Blankets
论文作者
论文摘要
人类在床上花费大量时间 - 平均大约是一生的三分之一。此外,静止的人在许多医疗保健应用中至关重要。通常,在休息时,人类被毯子覆盖,为此,我们提出了一种多模式的方法来揭示受试者,因此可以在不遮挡上面的毯子的情况下查看其静止的身体。我们提出了一种金字塔方案,以最好地利用多模式传感器捕获的知识的方式有效地融合不同的方式。具体而言,首先将两种最有用的方式(即深度和红外图像)融合来产生良好的初始姿势和形状估计。然后,压力图和RGB图像进一步融合,以分别为覆盖部分提供遮挡不变的信息,并分别为未覆盖的部分提供准确的形状信息,以完善结果。但是,即使有了多模式数据,由于身体的极端阻塞,检测到静止的人体的任务仍然非常具有挑战性。为了进一步减少毯子闭塞的负面影响,我们采用了基于注意力的重建模块来产生发现的模式,这些模式进一步融合以通过循环方式更新当前估计。广泛的实验验证了所提出的模型的优越性,而不是其他模型。
Humans spend vast hours in bed -- about one-third of the lifetime on average. Besides, a human at rest is vital in many healthcare applications. Typically, humans are covered by a blanket when resting, for which we propose a multimodal approach to uncover the subjects so their bodies at rest can be viewed without the occlusion of the blankets above. We propose a pyramid scheme to effectively fuse the different modalities in a way that best leverages the knowledge captured by the multimodal sensors. Specifically, the two most informative modalities (i.e., depth and infrared images) are first fused to generate good initial pose and shape estimation. Then pressure map and RGB images are further fused one by one to refine the result by providing occlusion-invariant information for the covered part, and accurate shape information for the uncovered part, respectively. However, even with multimodal data, the task of detecting human bodies at rest is still very challenging due to the extreme occlusion of bodies. To further reduce the negative effects of the occlusion from blankets, we employ an attention-based reconstruction module to generate uncovered modalities, which are further fused to update current estimation via a cyclic fashion. Extensive experiments validate the superiority of the proposed model over others.