论文标题

A1:分布式内存中图数据库

A1: A Distributed In-Memory Graph Database

论文作者

Buragohain, Chiranjeeb, Risvik, Knut Magne, Brett, Paul, Castro, Miguel, Cho, Wonhee, Cowhig, Joshua, Gloy, Nikolas, Kalyanaraman, Karthik, Khanna, Richendra, Pao, John, Renzelmann, Matthew, Shamis, Alex, Tan, Timothy, Zheng, Shuheng

论文摘要

A1是Bing搜索引擎使用的内存分布式数据库,用于支持结构化数据的复杂查询。 A1的关键推动因素是商品硬件中便宜的DRAM和高速RDMA(远程直接内存访问)网络的可用性。 A1使用农场作为其基础存储层,并在顶部构建图形抽象和查询引擎。内存存储和RDMA访问的组合需要重新思考如何在大型分布式系统中分配,组织和查询数据。一个单个A1群集可以存储数百亿个顶点和边缘,并支持每秒350百万个顶点读取的吞吐量,而单位数毫秒中的端到端查询延迟。在本文中,我们描述了A1数据模型,RDMA优化数据结构和查询执行。

A1 is an in-memory distributed database used by the Bing search engine to support complex queries over structured data. The key enablers for A1 are availability of cheap DRAM and high speed RDMA (Remote Direct Memory Access) networking in commodity hardware. A1 uses FaRM as its underlying storage layer and builds the graph abstraction and query engine on top. The combination of in-memory storage and RDMA access requires rethinking how data is allocated, organized and queried in a large distributed system. A single A1 cluster can store tens of billions of vertices and edges and support a throughput of 350+ million of vertex reads per second with end to end query latency in single digit milliseconds. In this paper we describe the A1 data model, RDMA optimized data structures and query execution.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源