论文标题
室内家具布局生成的结构化图形自动编码器
Structured Graph Variational Autoencoders for Indoor Furniture layout Generation
论文作者
论文摘要
我们提出了一个结构化图形自动编码器,用于生成室内3D场景的布局。鉴于房间类型(例如,客厅或图书馆)和房间布局(例如,地板和墙壁等房间元素),我们的建筑产生了与房间类型和布局一致的物体集合(例如,沙发,桌子和椅子)。这是一个具有挑战性的问题,因为生成的场景应满足多个约束,例如,每个对象都必须位于房间内,并且两个对象不能占据相同的卷。为了应对这些挑战,我们提出了一个深层生成模型,将这些关系编码为属性图上的软约束(例如,节点捕获房间和家具元素的属性,例如类,姿势和大小,而边缘捕获了几何关系,例如相对方向)。该体系结构由一个图形编码器组成,该图形将输入图映射到结构化的潜在空间,以及一个生成家具图的图形解码器,给定一个潜在代码和房间图。潜在空间以自动回归先验建模,这有助于产生高度结构化的场景。我们还提出了一个有效的培训程序,将匹配和受约束的学习结合在一起。 3D前数据集的实验表明,我们的方法会产生多样的场景,并适合房间布局。
We present a structured graph variational autoencoder for generating the layout of indoor 3D scenes. Given the room type (e.g., living room or library) and the room layout (e.g., room elements such as floor and walls), our architecture generates a collection of objects (e.g., furniture items such as sofa, table and chairs) that is consistent with the room type and layout. This is a challenging problem because the generated scene should satisfy multiple constrains, e.g., each object must lie inside the room and two objects cannot occupy the same volume. To address these challenges, we propose a deep generative model that encodes these relationships as soft constraints on an attributed graph (e.g., the nodes capture attributes of room and furniture elements, such as class, pose and size, and the edges capture geometric relationships such as relative orientation). The architecture consists of a graph encoder that maps the input graph to a structured latent space, and a graph decoder that generates a furniture graph, given a latent code and the room graph. The latent space is modeled with auto-regressive priors, which facilitates the generation of highly structured scenes. We also propose an efficient training procedure that combines matching and constrained learning. Experiments on the 3D-FRONT dataset show that our method produces scenes that are diverse and are adapted to the room layout.