通过个性化扩展图对比度学习

论文标题

通过个性化扩展图对比度学习

Graph Contrastive Learning with Personalized Augmentation

论文作者

Zhang, Xin, Tan, Qiaoyu, Huang, Xiao, Li, Bo

论文摘要

图形对比学习（GCL）已成为学习图形无监督表示的有效工具。关键思想是通过数据扩展最大化每个图的两个增强视图之间的一致性。现有的GCL模型主要集中在给定情况下的所有图表上应用\ textit {相同的增强策略}。但是，实际图通常不是单态的，而是各种本质的抽象。即使在相同的情况下（例如，大分子和在线社区），不同的图形可能需要多种增强才能执行有效的GCL。 Thus, blindly augmenting all graphs without considering their individual characteristics may undermine the performance of GCL arts.To deal with this, we propose the first principled framework, termed as \textit{G}raph contrastive learning with \textit{P}ersonalized \textit{A}ugmentation (GPA), to advance conventional GCL by allowing each graph to choose its own suitable augmentation operations.In本质，GPA Inders通过其拓扑构造和节点属性为每个图定制的增强策略，这是一个可学习的增强选择器，这是一个插件模块，可以通过下游GCL模型端到端有效地训练。来自不同类型和域的11个基准图的广泛实验证明了GPA与最先进的竞争对手的优越性。此外，通过可视化不同类型的数据集跨不同类型的数据集的学习增强分布，我们表明GPA可以根据其特征有效地识别每个图的最合适的增强。

Graph contrastive learning (GCL) has emerged as an effective tool for learning unsupervised representations of graphs. The key idea is to maximize the agreement between two augmented views of each graph via data augmentation. Existing GCL models mainly focus on applying \textit{identical augmentation strategies} for all graphs within a given scenario. However, real-world graphs are often not monomorphic but abstractions of diverse natures. Even within the same scenario (e.g., macromolecules and online communities), different graphs might need diverse augmentations to perform effective GCL. Thus, blindly augmenting all graphs without considering their individual characteristics may undermine the performance of GCL arts.To deal with this, we propose the first principled framework, termed as \textit{G}raph contrastive learning with \textit{P}ersonalized \textit{A}ugmentation (GPA), to advance conventional GCL by allowing each graph to choose its own suitable augmentation operations.In essence, GPA infers tailored augmentation strategies for each graph based on its topology and node attributes via a learnable augmentation selector, which is a plug-and-play module and can be effectively trained with downstream GCL models end-to-end. Extensive experiments across 11 benchmark graphs from different types and domains demonstrate the superiority of GPA against state-of-the-art competitors.Moreover, by visualizing the learned augmentation distributions across different types of datasets, we show that GPA can effectively identify the most suitable augmentations for each graph based on its characteristics.

下载PDF全文

下载文献需遵守相关版权规定

论文标题