通过卷积镜下降对双基PID控制器的分析

论文标题

通过卷积镜下降对双基PID控制器的分析

Analysis of Dual-Based PID Controllers through Convolutional Mirror Descent

论文作者

Balseiro, Santiago R., Lu, Haihao, Mirrokni, Vahab, Sivan, Balasubramanian

论文摘要

基于双重的成比例综合衍生（PID）控制器通常在实践中用于解决在线分配问题，例如在线广告中的预算加快。但是，控制器以启发式方式使用，并且没有可证明的保证。本文为在线分配问题的双基PID控制器的性能提供了第一个遗憾。我们首先建立了基于双基的PID控制器与一种用于在线凸优化的新的一阶算法之间的基本联系，称为\ emph {卷积镜下降}（CMD），该算法根据过去梯度的加权移动平均值来更新迭代。在特殊情况下，CMD恢复了具有动量和乐观的镜像下降的在线镜像。我们建立了足够的条件，在这些条件下，CMD对对抗性输入的一般在线凸优化问题感到遗憾。我们利用这一新结果使基于双基的PID控制器的首次遗憾解决在线分配问题。作为我们证明的副产品，我们为非平滑凸优化的CMD提供了第一个遗憾，这可能具有独立的利益。

Dual-based proportional-integral-derivative (PID) controllers are often employed in practice to solve online allocation problems with global constraints, such as budget pacing in online advertising. However, controllers are used in a heuristic fashion and come with no provable guarantees on their performance. This paper provides the first regret bounds on the performance of dual-based PID controllers for online allocation problems. We do so by first establishing a fundamental connection between dual-based PID controllers and a new first-order algorithm for online convex optimization called \emph{Convolutional Mirror Descent} (CMD), which updates iterates based on a weighted moving average of past gradients. CMD recovers, in a special case, online mirror descent with momentum and optimistic mirror descent. We establish sufficient conditions under which CMD attains low regret for general online convex optimization problems with adversarial inputs. We leverage this new result to give the first regret bound for dual-based PID controllers for online allocation problems. As a byproduct of our proofs, we provide the first regret bound for CMD for non-smooth convex optimization, which might be of independent interest.

下载PDF全文

下载文献需遵守相关版权规定

论文标题