论文标题
高保真爆头场景的新颖查看综合
Novel View Synthesis for High-fidelity Headshot Scenes
论文作者
论文摘要
从任意角度来看,用高质量的人脸渲染场景是许多现实世界应用的实用技术。最近,一种使用神经网络近似经典射线追踪的渲染技术神经辐射场(NERF)被认为是从一组稀疏图像中综合新视图的有希望的方法之一。我们发现NERF可以在保持几何一致性的同时呈现新的视图,但是它不能适当地维护皮肤细节,例如痣和毛孔。这些细节对于面孔特别重要,因为当我们查看面部的图像时,我们对细节的敏感性要比查看其他物体时要敏感得多。另一方面,基于传统网格和纹理的3D Morpable模型(3DMM)在皮肤细节方面的表现良好,尽管它具有较少的精确几何形状,并且无法用背景覆盖头部和整个场景。基于这些观察结果,我们提出了一种使用NERF和3DMM的方法,以综合具有脸部场景的高保真小说。我们的方法学习了一个生成的对抗网络(GAN),以混合NERF合成的图像和3DMM渲染的图像,并产生一个逼真的场景,并带有脸部保留皮肤细节。各种现实场景的实验证明了我们方法的有效性。该代码将在https://github.com/showlab/headshot上提供。
Rendering scenes with a high-quality human face from arbitrary viewpoints is a practical and useful technique for many real-world applications. Recently, Neural Radiance Fields (NeRF), a rendering technique that uses neural networks to approximate classical ray tracing, have been considered as one of the promising approaches for synthesizing novel views from a sparse set of images. We find that NeRF can render new views while maintaining geometric consistency, but it does not properly maintain skin details, such as moles and pores. These details are important particularly for faces because when we look at an image of a face, we are much more sensitive to details than when we look at other objects. On the other hand, 3D Morpable Models (3DMMs) based on traditional meshes and textures can perform well in terms of skin detail despite that it has less precise geometry and cannot cover the head and the entire scene with background. Based on these observations, we propose a method to use both NeRF and 3DMM to synthesize a high-fidelity novel view of a scene with a face. Our method learns a Generative Adversarial Network (GAN) to mix a NeRF-synthesized image and a 3DMM-rendered image and produces a photorealistic scene with a face preserving the skin details. Experiments with various real-world scenes demonstrate the effectiveness of our approach. The code will be available on https://github.com/showlab/headshot .