论文标题
机器人手术中可变形组织的立体3D重建的神经渲染
Neural Rendering for Stereo 3D Reconstruction of Deformable Tissues in Robotic Surgery
论文作者
论文摘要
内窥镜立体视频中软组织在机器人手术中的重建对于许多应用,例如术中导航和图像引导的机器人手术自动化很重要。此任务的先前工作主要依赖于基于SLAM的方法,这些方法难以处理复杂的手术场景。受神经渲染的最新进展的启发,我们提出了一个新型的框架,用于在单视图设置下从机器人手术中的双眼捕获中进行可变形的组织重建。我们的框架采用动态神经辐射场来表示MLP中可变形的手术场景,并以基于学习的方式优化形状和变形。除了非刚性变形外,从单个角度来看,工具阻塞和较差的3D线索也是软组织重建的特殊挑战。为了克服这些困难,我们提出了一系列工具面膜引导的射线铸造,立体声深度提示射线行进和立体声深度避免优化的策略。通过关于Davinci机器人手术视频的实验,我们的方法显着优于处理各种复杂非刚性变形的当前最新重建方法。据我们所知,这是利用神经渲染的第一项工作,用于手术场景3D重建,具有显着的潜力。代码可在以下网址获得:https://github.com/med-air/endonerf。
Reconstruction of the soft tissues in robotic surgery from endoscopic stereo videos is important for many applications such as intra-operative navigation and image-guided robotic surgery automation. Previous works on this task mainly rely on SLAM-based approaches, which struggle to handle complex surgical scenes. Inspired by recent progress in neural rendering, we present a novel framework for deformable tissue reconstruction from binocular captures in robotic surgery under the single-viewpoint setting. Our framework adopts dynamic neural radiance fields to represent deformable surgical scenes in MLPs and optimize shapes and deformations in a learning-based manner. In addition to non-rigid deformations, tool occlusion and poor 3D clues from a single viewpoint are also particular challenges in soft tissue reconstruction. To overcome these difficulties, we present a series of strategies of tool mask-guided ray casting, stereo depth-cueing ray marching and stereo depth-supervised optimization. With experiments on DaVinci robotic surgery videos, our method significantly outperforms the current state-of-the-art reconstruction method for handling various complex non-rigid deformations. To our best knowledge, this is the first work leveraging neural rendering for surgical scene 3D reconstruction with remarkable potential demonstrated. Code is available at: https://github.com/med-air/EndoNeRF.