论文标题

部分可观测时空混沌系统的无模型预测

Incentive Compatible Queues Without Money

论文作者

Grosof, Isaac, Mitzenmacher, Michael

论文摘要

对于工作调度系统,工作需要一定的处理然后离开系统,每个用户很自然地估算其工作时间要求以帮助调度程序。但是,如果没有真实性的激励机制,则每个用户将有动力提供估算的估算值,以便在时间表中赋予他们的工作优先级,以便尽早完成工作。 我们研究了在自然排队理论框架下如何使此类调度系统激励兼容,而无需使用货币指控。在我们的设置中,每个用户都可以估算其工作的运行时间,但是此估计可能是不正确的。我们研究了调度政策,如果工作超过其估计值,则有一些概率“受到惩罚”并在其他工作后重新安排,以免低估工作时间的低估。但是,由于用户估计可能是不正确的(没有任何恶意意图),因此过度的惩罚可能会激励用户高估其工作时间,从而导致计划效率较低。我们描述了两种自然的调度策略,即盲和测量值。我们表明,对于这两种策略,鉴于系统的参数,我们可以有效地确定兼容激励的惩罚概率集,因为用户被激励以提供其工作时间的实际估计。此外,我们证明了测量值的限制,随着估计的收敛到完美的准确性,激励兼容兼容的惩罚概率范围会收敛到$ [0,1] $。我们的形式主义建立了一个框架,用于研究利用用户的工作时间估算的进一步基于队列的调度问题,并且该系统需要激励估算的真实报告。

For job scheduling systems, where jobs require some amount of processing and then leave the system, it is natural for each user to provide an estimate of their job's time requirement in order to aid the scheduler. However, if there is no incentive mechanism for truthfulness, each user will be motivated to provide estimates that give their job precedence in the schedule, so that the job completes as early as possible. We examine how to make such scheduling systems incentive compatible, without using monetary charges, under a natural queueing theory framework. In our setup, each user has an estimate of their job's running time, but it is possible for this estimate to be incorrect. We examine scheduling policies where if a job exceeds its estimate, it is with some probability "punished" and re-scheduled after other jobs, to disincentivize underestimates of job times. However, because user estimates may be incorrect (without any malicious intent), excessive punishment may incentivize users to overestimate their job times, which leads to less efficient scheduling. We describe two natural scheduling policies, BlindTrust and MeasuredTrust. We show that, for both of these policies, given the parameters of the system, we can efficiently determine the set of punishment probabilities that are incentive compatible, in that users are incentivized to provide their actual estimate of the job time. Moreover, we prove for MeasuredTrust that in the limit as estimates converge to perfect accuracy, the range of punishment probabilities that are incentive compatible converges to $[0,1]$. Our formalism establishes a framework for studying further queue-based scheduling problems where job time estimates from users are utilized, and the system needs to incentivize truthful reporting of estimates.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源