论文标题
常见问题的拓扑范围
Topology Dependent Bounds For FAQs
论文作者
论文摘要
在本文中,我们证明了拓扑依赖于计算Abo Khamis等人研究的功能骨料查询(FAQ)所需的回合数量。 [PODS 2016]在Chattopadhyay等人考虑的模型下的同步分布式网络中。 [FOCS 2014,汽水2017]。与Chattopadhyay等人在大规模并行计算模型中计算数据库查询的最新工作不同,节点只能通过私人点对点通道进行通信,并且我们对在{\ em tunsutary}通信拓扑的范围内感兴趣。这是在该分布式模型中考虑更实际动机问题的第一项工作。为了展示,我们重点介绍了本文的两个特殊问题:概率图形模型(PGMS)中的布尔结合性查询(BCQ)和计算变量/因子边缘。只要查询的基础超图为$ o(1)$ - 否认,并且具有$ o(1)$ - arity,我们就会获得计算此类查询所需的回合数量的紧密界限。特别是,$ o(1)$ - 退化条件涵盖了大多数研究的查询,这些查询在集中式计算模型(如具有恒定树宽的查询)中可以有效地计算。这些紧密的界限取决于针对无环超图的广义大型分解(GHD)的“宽度”(即内部节点宽度)的新概念,该概念可最大程度地减少GHD子类内部节点的数量。据我们所知,在理论数据库文献中尚未明确研究此宽度。最后,我们考虑了使用基于最小的基于最小值的参数来计算具有一系列矩阵(在两个元素的有限场上)(在两个元素的有限字段)上计算载体的乘积的问题。
In this paper, we prove topology dependent bounds on the number of rounds needed to compute Functional Aggregate Queries (FAQs) studied by Abo Khamis et al. [PODS 2016] in a synchronous distributed network under the model considered by Chattopadhyay et al. [FOCS 2014, SODA 2017]. Unlike the recent work on computing database queries in the Massively Parallel Computation model, in the model of Chattopadhyay et al., nodes can communicate only via private point-to-point channels and we are interested in bounds that work over an {\em arbitrary} communication topology. This is the first work to consider more practically motivated problems in this distributed model. For the sake of exposition, we focus on two special problems in this paper: Boolean Conjunctive Query (BCQ) and computing variable/factor marginals in Probabilistic Graphical Models (PGMs). We obtain tight bounds on the number of rounds needed to compute such queries as long as the underlying hypergraph of the query is $O(1)$-degenerate and has $O(1)$-arity. In particular, the $O(1)$-degeneracy condition covers most well-studied queries that are efficiently computable in the centralized computation model like queries with constant treewidth. These tight bounds depend on a new notion of `width' (namely internal-node-width) for Generalized Hypertree Decompositions (GHDs) of acyclic hypergraphs, which minimizes the number of internal nodes in a sub-class of GHDs. To the best of our knowledge, this width has not been studied explicitly in the theoretical database literature. Finally, we consider the problem of computing the product of a vector with a chain of matrices and prove tight bounds on its round complexity (over the finite field of two elements) using a novel min-entropy based argument.