论文标题
4D-MultispectralNet:使用人口罩的多光谱立体差异估计
4D-MultispectralNet: Multispectral Stereoscopic Disparity Estimation using Human Masks
论文作者
论文摘要
多光谱立体镜是一个新兴领域。在经典的立体镜检查中已经完成了许多工作,但是多光谱立体镜的经常研究。这种类型的立体镜可以在自动驾驶汽车中使用,以完成RGB摄像机给出的信息。当情况更加困难时,例如在夜面场景中,它有助于识别周围环境中的物体。本文着重于RGB-LWIR频谱。 RGB-LWIR立体镜具有与经典立体镜的挑战,即遮挡,无纹理表面和重复性模式,以及与不同模态相关的特定模式。在两个频谱之间查找匹配增加了另一层复杂性。颜色,纹理和形状更有可能从光谱到另一个频谱变化。为了应对这一额外的挑战,本文着重于估计场景中存在的人们的差异。鉴于人们在RGB和LWIR中都捕获了人们的形状,我们提出了一种新颖的方法,该方法在频谱中使用人类的分割掩码,而不是将它们与暹罗网络第一层之前的原始图像相连。此方法有助于提高准确性,尤其是在一个像素误差范围内。
Multispectral stereoscopy is an emerging field. A lot of work has been done in classical stereoscopy, but multispectral stereoscopy is not studied as frequently. This type of stereoscopy can be used in autonomous vehicles to complete the information given by RGB cameras. It helps to identify objects in the surroundings when the conditions are more difficult, such as in night scenes. This paper focuses on the RGB-LWIR spectrum. RGB-LWIR stereoscopy has the same challenges as classical stereoscopy, that is occlusions, textureless surfaces and repetitive patterns, plus specific ones related to the different modalities. Finding matches between two spectrums adds another layer of complexity. Color, texture and shapes are more likely to vary from a spectrum to another. To address this additional challenge, this paper focuses on estimating the disparity of people present in a scene. Given the fact that people's shape is captured in both RGB and LWIR, we propose a novel method that uses segmentation masks of the human in both spectrum and than concatenate them to the original images before the first layer of a Siamese Network. This method helps to improve the accuracy, particularly within the one pixel error range.