论文标题
SGDRAW:使用面向对象的表示形式的场景图绘图接口
SGDraw: Scene Graph Drawing Interface Using Object-Oriented Representation
论文作者
论文摘要
场景理解是计算机视觉中的必不可少的任务。为了提供图像的视觉基本图形结构,由于其强大的语义表示,场景图引起了人们的关注。但是,很难为图像检索,图像生成和多模式应用绘制适当的场景图。传统的场景图表注释接口不容易在图像注释中使用,并且使用深神经网络的自动场景图生成方法很容易生成冗余内容,同时忽略了细节。在这项工作中,我们建议使用面向对象的场景图表表示SGDRAD,这是一个场景图绘图界面,以帮助用户交互绘制和编辑场景图。对于所提出的面向对象的表示,我们将对象的对象,属性和关系视为结构单元。 SGDRAD提供了基于Web的场景图表注释和生成工具,以了解场景理解应用程序。为了验证提出的界面的有效性,我们与常规工具和用户体验研究进行了比较研究。结果表明,SGDRAD可以帮助生成更丰富的细节的场景图,并比传统的边界框注释更准确地描述图像。我们认为,提出的SGDRAD在各种视觉任务(例如图像检索和产生)中很有用。
Scene understanding is an essential and challenging task in computer vision. To provide the visually fundamental graphical structure of an image, the scene graph has received increased attention due to its powerful semantic representation. However, it is difficult to draw a proper scene graph for image retrieval, image generation, and multi-modal applications. The conventional scene graph annotation interface is not easy to use in image annotations, and the automatic scene graph generation approaches using deep neural networks are prone to generate redundant content while disregarding details. In this work, we propose SGDraw, a scene graph drawing interface using object-oriented scene graph representation to help users draw and edit scene graphs interactively. For the proposed object-oriented representation, we consider the objects, attributes, and relationships of objects as a structural unit. SGDraw provides a web-based scene graph annotation and generation tool for scene understanding applications. To verify the effectiveness of the proposed interface, we conducted a comparison study with the conventional tool and the user experience study. The results show that SGDraw can help generate scene graphs with richer details and describe the images more accurately than traditional bounding box annotations. We believe the proposed SGDraw can be useful in various vision tasks, such as image retrieval and generation.