论文标题

NFSlicer:浅网络功能的数据移动优化

NFSlicer: Data Movement Optimization for Shallow Network Functions

论文作者

Sarma, Anirudh, Seyedroudbari, Hamed, Gupta, Harshit, Ramachandran, Umakishore, Daglis, Alexandros

论文摘要

商品服务器上的网络功能(NF)部署已在数据中心和企业设置中变得无处不在。许多常用的NFS,例如防火墙,负载平衡器和NAT都很浅 - 即,尽管整个数据包在服务器上和服务器上都被传输了,但它们仅检查数据包的标头。结果,处理大数据包的移动数据和检查数据之间的差距超过20倍。在现代网络速度下,这种过多的数据移动对性能有害,使大数据包的平均尾部潜伏期和90%的尾部潜伏期高达1.7倍。我们的彻底性能分析将NIC服务器PCIE接口和服务器内存层次结构上的高竞争性视为主要瓶颈。 我们介绍了NFSlicer,这是一种数据运动优化,作为NIC扩展而实施,以减轻来自数据移动在浅NF在商品服务器上的部署中造成的瓶颈。 NFSLICER仅通过将数据包的有效载荷切片并将其临时存储在NIC内存中,将部署NFS实际检查的每个数据包的小部分转移。当服务器随后发送处理后的数据包时,NFSlicer将其拼接到先前切成薄片的有效载荷上。我们开发了一个基于软件的仿真平台,并证明NFSlicer可以有效地最大程度地减少NIC和服务器之间的数据移动,从而弥合小数据包NF处理之间的延迟差距。在一系列浅的NFS处理1518B数据包上,NFSlicer分别将平均值和90%的尾巴潜伏期降低了17% / 29%。

Network Function (NF) deployments on commodity servers have become ubiquitous in datacenters and enterprise settings. Many commonly used NFs such as firewalls, load balancers and NATs are shallow - i.e., they only examine the packet's header, despite the entire packet being transferred on and off the server. As a result, the gap between moved and inspected data when handling large packets exceeds 20x. At modern network rates, such excess data movement is detrimental to performance, hurting both the average and 90% tail latency of large packets by up to 1.7x. Our thorough performance analysis identifies high contention on the NIC-server PCIe interface and in the server's memory hierarchy as the main bottlenecks. We introduce NFSlicer, a data movement optimization implemented as a NIC extension to mitigate the bottlenecks stemming from data movement deluge in deployments of shallow NFs on commodity servers. NFSlicer only transfers the small portion of each packet that the deployed NFs actually inspect, by slicing the packet's payload and temporarily storing it in on-NIC memory. When the server later transmits the processed packet, NFSlicer splices it to its previously sliced payload. We develop a software-based emulation platform and demonstrate that NFSlicer effectively minimizes data movement between the NIC and the server, bridging the latency gap between small and large packet NF processing. On a range of shallow NFs handling 1518B packets, NFSlicer reduces average and 90% tail latency by up to 17% / 29%, respectively.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源