论文标题

MG-GCN:使用混合粒的聚合器快速有效学习,用于训练大图卷积网络

MG-GCN: Fast and Effective Learning with Mix-grained Aggregators for Training Large Graph Convolutional Networks

论文作者

Huang, Tao, Zhang, Yihan, Wu, Jiajing, Fang, Junyuan, Zheng, Zibin

论文摘要

Graph卷积网络(GCN)最近被用作许多基于图的应用程序的重要工具。受卷积神经网络(CNN)的启发,GCN通过按一层汇总其邻居的信息来生成节点的嵌入。但是,由于GCN层的递归邻里扩展,GCN的高计算和记忆成本使其无法在大图上进行训练。为了解决此问题,已经提出了在信息聚合过程中的几种抽样方法以迷你批量随机梯度下降(SGD)方式训练GCN。然而,这些抽样策略有时会引起人们对信息收集不足的担忧,这可能会阻碍学习绩效的准确性和融合。为了解决准确性和效率之间的困境,我们建议使用具有不同粒度的聚合器来收集不同层的邻里信息。然后,构建了一种基于学位的抽样策略,该策略避免了指数复杂性,用于对固定数量的节点进行采样。提出的模型结合了上述两种机制,称为混合元素GCN(MG-GCN),通过在四个常用的基准数据集和新的Ethereum DataSet上的四个常用基准数据集和新的Ethereum DataSet上的全面实验来实现最先进的性能。

Graph convolutional networks (GCNs) have been employed as a kind of significant tool on many graph-based applications recently. Inspired by convolutional neural networks (CNNs), GCNs generate the embeddings of nodes by aggregating the information of their neighbors layer by layer. However, the high computational and memory cost of GCNs due to the recursive neighborhood expansion across GCN layers makes it infeasible for training on large graphs. To tackle this issue, several sampling methods during the process of information aggregation have been proposed to train GCNs in a mini-batch Stochastic Gradient Descent (SGD) manner. Nevertheless, these sampling strategies sometimes bring concerns about insufficient information collection, which may hinder the learning performance in terms of accuracy and convergence. To tackle the dilemma between accuracy and efficiency, we propose to use aggregators with different granularities to gather neighborhood information in different layers. Then, a degree-based sampling strategy, which avoids the exponential complexity, is constructed for sampling a fixed number of nodes. Combining the above two mechanisms, the proposed model, named Mix-grained GCN (MG-GCN) achieves state-of-the-art performance in terms of accuracy, training speed, convergence speed, and memory cost through a comprehensive set of experiments on four commonly used benchmark datasets and a new Ethereum dataset.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源