论文标题

了解互联网视频中的3D对象发音

Understanding 3D Object Articulation in Internet Videos

论文作者

Qian, Shengyi, Jin, Linyi, Rockwell, Chris, Chen, Siyi, Fouhey, David F.

论文摘要

我们建议研究和表征普通视频中物体的3D平面表达的检测和表征。尽管对于人类来说似乎很容易,但这个问题对计算机构成了许多挑战。我们建议通过组合自上而下的检测系统来解决此问题,该检测系统可以找到可以阐明的平面以及解决3D平面的优化方法,该方法可以解释一系列观察到的关节。我们表明,可以将该系统组合到视频和3D扫描数据集的组合。当在具有挑战性的互联网视频和Charades数据集的数据集上进行测试时,我们的方法获得了强劲的性能。项目网站:https://jasonqsy.github.io/articulation3d

We propose to investigate detecting and characterizing the 3D planar articulation of objects from ordinary videos. While seemingly easy for humans, this problem poses many challenges for computers. We propose to approach this problem by combining a top-down detection system that finds planes that can be articulated along with an optimization approach that solves for a 3D plane that can explain a sequence of observed articulations. We show that this system can be trained on a combination of videos and 3D scan datasets. When tested on a dataset of challenging Internet videos and the Charades dataset, our approach obtains strong performance. Project site: https://jasonqsy.github.io/Articulation3D

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源