可信赖的联合学习通过区块链

论文标题

可信赖的联合学习通过区块链

Trustworthy Federated Learning via Blockchain

论文作者

Yang, Zhanpeng, Shi, Yuanming, Zhou, Yong, Wang, Zixin, Yang, Kai

论文摘要

人工智能（AI）（例如自动驾驶，物联网，智能医疗保健等）的关键安全情景已提出了值得信赖的AI的关键要求，以通过可靠的决定来保证隐私和安全性。作为值得信赖的AI的新生分支，联邦学习（FL）被视为一个有前途的隐私保护框架，用于培训与协作设备的全球AI模型。但是，FL框架中仍然存在安全挑战，例如，恶意设备的拜占庭式攻击，以及模型从恶意服务器篡改攻击，该攻击将降低或破坏受过训练的全球AI模型的准确性。在本文中，我们将通过使用安全的全球聚合算法来抵抗恶意设备，并在多个边缘服务器之间使用高效能和低能消耗来防止模型篡改恶意服务器，从而提出一个基于分散的区块链的FL（B-FL）架构。但是，要在网络边缘实施B-FL系统，区块链共识协议中的多轮交叉验证将导致较长的训练潜伏期。因此，我们制定了一个网络优化问题，该问题共同考虑带宽和功率分配，以最大程度地减少包括渐进式学习回合的长期平均训练潜伏期。我们进一步建议将网络优化问题转变为马尔可夫决策过程，并利用基于深入学习的算法以低计算复杂性提供高系统性能。仿真结果表明，B-FL可以抵抗边缘设备和服务器的恶意攻击，并且与基线算法相比，基于强化的基于强化学习的算法可以显着降低B-FL的训练潜伏期。

The safety-critical scenarios of artificial intelligence (AI), such as autonomous driving, Internet of Things, smart healthcare, etc., have raised critical requirements of trustworthy AI to guarantee the privacy and security with reliable decisions. As a nascent branch for trustworthy AI, federated learning (FL) has been regarded as a promising privacy preserving framework for training a global AI model over collaborative devices. However, security challenges still exist in the FL framework, e.g., Byzantine attacks from malicious devices, and model tampering attacks from malicious server, which will degrade or destroy the accuracy of trained global AI model. In this paper, we shall propose a decentralized blockchain based FL (B-FL) architecture by using a secure global aggregation algorithm to resist malicious devices, and deploying practical Byzantine fault tolerance consensus protocol with high effectiveness and low energy consumption among multiple edge servers to prevent model tampering from the malicious server. However, to implement B-FL system at the network edge, multiple rounds of cross-validation in blockchain consensus protocol will induce long training latency. We thus formulate a network optimization problem that jointly considers bandwidth and power allocation for the minimization of long-term average training latency consisting of progressive learning rounds. We further propose to transform the network optimization problem as a Markov decision process and leverage the deep reinforcement learning based algorithm to provide high system performance with low computational complexity. Simulation results demonstrate that B-FL can resist malicious attacks from edge devices and servers, and the training latency of B-FL can be significantly reduced by deep reinforcement learning based algorithm compared with baseline algorithms.

下载PDF全文

下载文献需遵守相关版权规定

论文标题