论文标题
批处理平衡分配:简化和广义
Balanced Allocations in Batches: Simplified and Generalized
论文作者
论文摘要
我们考虑将$ M $ Balls(Jobs)分配到$ n $ bins(服务器)中。在两项选择过程中,对于$ m $依次到达的球中的每一个,都采样了两个随机选择的垃圾箱,并将球放在负载最少的垃圾箱中。众所周知,最大负载为$ m/n + \ log_2 \ log n + o(1)$ W.H.P. Berenbrink,Czumaj,Englert,Friedetzky和Nagel(2012)引入了此过程的平行版本,其中$ m $ Balls连续到达$ B = N $的连续批次。使用批次开始时的垃圾箱的负载信息并行分配同一批内的球。他们证明了此过程的差距为$ O(\ log n)$,概率很高。 在这项工作中,我们对此环境进行了新的分析,该分析基于指数势能。这使我们能够以不同的方式简化和推广[BCE12]的分析: $ \ Quad 1. $我们的分析涵盖了一类流程。这不仅包括两种选择,还包括较少的bin样品(例如$(1+β)$)的过程,这些过程只能从每个bin样本和图形分配中接收一点点信息,其中bins对应于图中的顶点。 $ \ Quad 2. $球可能具有不同的权重,只要它们的权重是来自分布的独立样本,该样本满足了其力矩生成功能的技术条件。 $ \ Quad 3. $对于任意批处理大小$ b \ geq n $,我们证明了$ o(b/n \ cdot \ log n)$的差距。对于[n,n^3] $中的任何$ b \,我们将其改进到$ o(b/n + \ log n)$,并表明它对于一个流程家庭来说都是紧张的。这意味着为例如$(1+β)$带有常数$β\在(0,1] $中,差距为$θ(\ log n)$的所有$ b \ in [n,n,n \ log n] $。 我们还进行了支持我们的理论结果的实验,甚至暗示了大批量尺寸的$(1+β)$(1+β)$的优势。
We consider the allocation of $m$ balls (jobs) into $n$ bins (servers). In the Two-Choice process, for each of $m$ sequentially arriving balls, two randomly chosen bins are sampled and the ball is placed in the least loaded bin. It is well-known that the maximum load is $m/n+\log_2 \log n + O(1)$ w.h.p. Berenbrink, Czumaj, Englert, Friedetzky and Nagel (2012) introduced a parallel version of this process, where $m$ balls arrive in consecutive batches of size $b=n$ each. Balls within the same batch are allocated in parallel, using the load information of the bins at the beginning of the batch. They proved that the gap of this process is $O(\log n)$ with high probability. In this work, we present a new analysis of this setting, which is based on exponential potential functions. This allows us to both simplify and generalize the analysis of [BCE12] in different ways: $\quad 1.$ Our analysis covers a broad class of processes. This includes not only Two-Choice, but also processes with fewer bin samples like $(1+β)$, processes which can only receive one bit of information from each bin sample and graphical allocation, where bins correspond to vertices in a graph. $\quad 2.$ Balls may be of different weights, as long as their weights are independent samples from a distribution satisfying a technical condition on its moment generating function. $\quad 3.$ For arbitrary batch sizes $b \geq n$, we prove a gap of $O(b/n \cdot \log n)$. For any $b \in [n , n^3]$, we improve this to $O(b/n + \log n)$ and show that it is tight for a family of processes. This implies the unexpected result that for e.g. $(1+β)$ with constant $β\in (0, 1]$, the gap is $Θ(\log n)$ for all $b \in [n,n \log n]$. We also conduct experiments which support our theoretical results, and even hint at a superiority of less powerful processes like $(1+β)$ for large batch sizes.