论文标题

基于大象流量检测的流量表使用减少算法的边界

Boundaries of Flow Table Usage Reduction Algorithms Based on Elephant Flow Detection

论文作者

Jurkiewicz, Piotr

论文摘要

大多数互联网流量是由相对较少的流量(所谓的大象流)引起的。可以利用这种现象来促进交通工程:资源成本的单个流动转发条目只能为大象创建,同时在最短的路径上为小鼠服务。尽管这个想法已经出现在提出的TE系统中,但并未单独检查它。流量表占用的程度和操作数量降低,或者如何选择阈值或采样率以覆盖所需的流量部分仍然未知。在本文中,我们使用了从30天的校园痕迹获得40亿个流量的可再现交通模型来回答这些问题。我们在达到预定义的计数阈值或通过采样来检测大象后,建立了自第一个数据包以来流动流量降低算法的理论边界。一个重要的发现是,简单的数据包采样在逼真的流量上表现出色,将流入量的数量减少了400个,但仍覆盖了80%的流量。我们还提供了一个开源软件包,允许复制我们的实验或对其他算法或流量分布进行类似评估。

The majority of Internet traffic is caused by a relatively small number of flows (so-called elephant flows). This phenomenon can be exploited to facilitate traffic engineering: resource-costly individual flow forwarding entries can be created only for elephants while serving mice over the shortest paths. Although this idea already appeared in proposed TE systems, it was not examined by itself. It remains unknown what extent of flow table occupancy and operations number reduction can be achieved or how to select thresholds or sampling rates to cover the desired fraction of traffic. In this paper, we use reproducible traffic models obtained from a 30-day-long campus trace covering 4 billion flows, to answer these questions. We establish theoretical boundaries for flow table usage reduction algorithms that classify flows since the first packet, after reaching a predefined counter threshold or detect elephants by sampling. An important finding is that simple packet sampling performs surprisingly well on realistic traffic, reducing the number of flow entries by a factor up to 400, still covering 80% of the traffic. We also provide an open-source software package allowing the replication of our experiments or the performing of similar evaluations for other algorithms or flow distributions.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源