达加德：图形异常检测的数据增强

论文标题

达加德：图形异常检测的数据增强

DAGAD: Data Augmentation for Graph Anomaly Detection

论文作者

Liu, Fanzhen, Ma, Xiaoxiao, Wu, Jia, Yang, Jian, Xue, Shan, Beheshti, Amin, Zhou, Chuan, Peng, Hao, Sheng, Quan Z., Aggarwal, Charu C.

论文摘要

本文中的图异常检测旨在区分与良性淋巴结不同的淋巴结，这些淋巴结与良性的淋巴结有所不同。从图形数据中学习信息性的异常行为时，受到学术界和行业的关注越来越多，但现有的研究仍然存在两个关键问题。一方面，由于其微妙的异常行为和背景知识的短缺，通常很难捕获异常，这会导致严重的异常样本稀缺性。同时，实际图中的绝大多数对象都是正常的，这也带来了类不平衡问题。 To bridge the gaps, this paper devises a novel Data Augmentation-based Graph Anomaly Detection (DAGAD) framework for attributed graphs, equipped with three specially designed modules: 1) an information fusion module employing graph neural network encoders to learn representations, 2) a graph data augmentation module that fertilizes the training set with generated samples, and 3) an imbalance-tailored learning module to discriminate the distributions of the minority （异常）和多数（正常）类。在三个数据集上进行的一系列实验证明，达加德的表现优于十个有关各种大多数指标的最先进的基线探测器，以及一项广泛的消融研究验证了我们提出的模块的强度。

Graph anomaly detection in this paper aims to distinguish abnormal nodes that behave differently from the benign ones accounting for the majority of graph-structured instances. Receiving increasing attention from both academia and industry, yet existing research on this task still suffers from two critical issues when learning informative anomalous behavior from graph data. For one thing, anomalies are usually hard to capture because of their subtle abnormal behavior and the shortage of background knowledge about them, which causes severe anomalous sample scarcity. Meanwhile, the overwhelming majority of objects in real-world graphs are normal, bringing the class imbalance problem as well. To bridge the gaps, this paper devises a novel Data Augmentation-based Graph Anomaly Detection (DAGAD) framework for attributed graphs, equipped with three specially designed modules: 1) an information fusion module employing graph neural network encoders to learn representations, 2) a graph data augmentation module that fertilizes the training set with generated samples, and 3) an imbalance-tailored learning module to discriminate the distributions of the minority (anomalous) and majority (normal) classes. A series of experiments on three datasets prove that DAGAD outperforms ten state-of-the-art baseline detectors concerning various mostly-used metrics, together with an extensive ablation study validating the strength of our proposed modules.

下载PDF全文

下载文献需遵守相关版权规定

论文标题