论文标题

分叉或失败:与多一对映射的循环一致培训

Fork or Fail: Cycle-Consistent Training with Many-to-One Mappings

论文作者

Guo, Qipeng, Jin, Zhijing, Wang, Ziyu, Qiu, Xipeng, Zhang, Weinan, Zhu, Jun, Zhang, Zheng, Wipf, David

论文摘要

循环一致的训练被广泛用于共同学习两个感兴趣域之间的前向和反映射,而无需收集每个域内匹配对的匹配对。在这方面,隐式假设是(至少)存在(大约)基地的两者,以便可以通过连续应用各个映射来准确地重建来自任何域的给定输入。但是在许多应用中,没有预期这种两次射击的存在,并且大型重建错误会损害周期一致训练的成功。作为此限制的一个重要实例,我们考虑在域之间存在多一对一或过渡性的映射的实际情况。为了解决该制度,我们开发了一种有条件的变异自动编码器(CVAE)方法,可以将其视为将汇聚映射转换为隐式界面,从而将两个方向的重建错误都可以最小化,并且可以作为一种天然的副产品,可以在一对一方向上获得现实的输出多样性。作为理论动机,我们分析了一个简化的方案,从而使所提出的基于CVAE的能量函数的最小化与地面真相映射的恢复保持一致。在经验方面,我们考虑了一个具有已知地面真相的合成图像数据集,以及一个现实世界中的应用,涉及从知识图中生成自然语言,反之亦然,这是一种典型的过滤案例。对于后者,我们的CVAE管道可以在周期训练期间捕获如此多一对一的映射,同时促进图形到文本任务的质地多样性。我们的代码可在github.com/qipengguo/cyclegt上找到 *AISTATS 2021已接受了本文的凝结版本。此版本包含其他内容和更新。

Cycle-consistent training is widely used for jointly learning a forward and inverse mapping between two domains of interest without the cumbersome requirement of collecting matched pairs within each domain. In this regard, the implicit assumption is that there exists (at least approximately) a ground-truth bijection such that a given input from either domain can be accurately reconstructed from successive application of the respective mappings. But in many applications no such bijection can be expected to exist and large reconstruction errors can compromise the success of cycle-consistent training. As one important instance of this limitation, we consider practically-relevant situations where there exists a many-to-one or surjective mapping between domains. To address this regime, we develop a conditional variational autoencoder (CVAE) approach that can be viewed as converting surjective mappings to implicit bijections whereby reconstruction errors in both directions can be minimized, and as a natural byproduct, realistic output diversity can be obtained in the one-to-many direction. As theoretical motivation, we analyze a simplified scenario whereby minima of the proposed CVAE-based energy function align with the recovery of ground-truth surjective mappings. On the empirical side, we consider a synthetic image dataset with known ground-truth, as well as a real-world application involving natural language generation from knowledge graphs and vice versa, a prototypical surjective case. For the latter, our CVAE pipeline can capture such many-to-one mappings during cycle training while promoting textural diversity for graph-to-text tasks. Our code is available at github.com/QipengGuo/CycleGT *A condensed version of this paper has been accepted to AISTATS 2021. This version contains additional content and updates.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源