论文标题
Kafka消费者集团Autoscaler
Kafka Consumer Group Autoscaler
论文作者
论文摘要
消息经纪人通过为有序队列分配消息,在分布式环境中在分布式环境中启用异步通信。消息经纪系统通常提供机制,以使消费者之间的任务并行,以提高消耗数据的速率。消费率必须超过生产率,否则队列将无限期增长。尽管如此,消费者还是昂贵的,应将其数量最小化。我们将确定所需数量的消费者数量和分区式分配的问题建模为可变项目尺寸bin包装变体。当队列迁移到另一个消费者时,无法读取数据。因此,我们建议R-评分指标来解释这些重新平衡成本。然后,我们引入了各种基于R分数的算法,并将其性能与为本应用程序的bin包装问题建立的启发式方法进行了比较。我们在现有系统中实例化方法,证明其有效性。我们的方法保证了足够的消费率,以前系统无法以较低的运营成本所能获得的某些东西。
Message brokers enable asynchronous communication between data producers and consumers in distributed environments by assigning messages to ordered queues. Message broker systems often provide with mechanisms to parallelize tasks between consumers to increase the rate at which data is consumed. The consumption rate must exceed the production rate or queues would grow indefinitely. Still, consumers are costly and their number should be minimized. We model the problem of determining the required number of consumers, and the partition-consumer assignments, as a variable item size bin packing variant. Data cannot be read when a queue is being migrated to another consumer. Hence, we propose the R-score metric to account for these rebalancing costs. Then, we introduce an assortment of R-score based algorithms, and compare their performance to established heuristics for the Bin Packing Problem for this application. We instantiate our method within an existing system, demonstrating its effectiveness. Our approach guarantees adequate consumption rates something the previous system was unable to at lower operational costs.