论文标题
GraphChallenge.org三角计数性能
GraphChallenge.org Triangle Counting Performance
论文作者
论文摘要
图形分析系统的兴起创造了一种新方法来衡量和比较图形处理系统的功能。 MIT/Amazon/IEEE图挑战已开发出来,以提供一个定义明确的社区场所,以刺激研究并突出图形分析软件,硬件,算法和系统中的创新。 GraphChallenge.org提供了广泛的预先放置图形数据集,图形生成器,数学定义的图形算法,以多种语言的示例序列实现以及用于衡量性能的特定指标。 GraphChallenge.org的三角计数组件测试了图形处理系统的性能,以计算图中所有三角形的计算,并练习许多图形算法中发现的关键图形操作。在2017年,2018年和2019年,从众多作者和组织收到了许多三角计数的提交。本文介绍了这些意见书中表现最好的人的性能分析。这些提交表明,他们最先进的三角计数执行时间,$ t _ {\ rm tri} $,是图中边缘数量的强大功能,$ n_e $,从2017年开始显着提高($ t _ {\ t _ {\ rm tri}} tri} \大约N_e/10^9 $),并且在2018年至2019年之间保持可比性。图形挑战提供了当前的图形分析系统的清晰图片,并强调了对在非常大图上实现高性能的新创新的需求。
The rise of graph analytic systems has created a need for new ways to measure and compare the capabilities of graph processing systems. The MIT/Amazon/IEEE Graph Challenge has been developed to provide a well-defined community venue for stimulating research and highlighting innovations in graph analysis software, hardware, algorithms, and systems. GraphChallenge.org provides a wide range of pre-parsed graph data sets, graph generators, mathematically defined graph algorithms, example serial implementations in a variety of languages, and specific metrics for measuring performance. The triangle counting component of GraphChallenge.org tests the performance of graph processing systems to count all the triangles in a graph and exercises key graph operations found in many graph algorithms. In 2017, 2018, and 2019 many triangle counting submissions were received from a wide range of authors and organizations. This paper presents a performance analysis of the best performers of these submissions. These submissions show that their state-of-the-art triangle counting execution time, $T_{\rm tri}$, is a strong function of the number of edges in the graph, $N_e$, which improved significantly from 2017 ($T_{\rm tri} \approx (N_e/10^8)^{4/3}$) to 2018 ($T_{\rm tri} \approx N_e/10^9$) and remained comparable from 2018 to 2019. Graph Challenge provides a clear picture of current graph analysis systems and underscores the need for new innovations to achieve high performance on very large graphs.