论文标题
评估联邦学习算法的沟通效率
Evaluating the Communication Efficiency in Federated Learning Algorithms
论文作者
论文摘要
在高级技术的时代,移动设备配备了计算和传感功能,可以收集大量数据。这些数据量适合培训不同的学习模型。这些学习模型与深度学习的进步(DL)合作,赋予了许多有用的应用,例如图像处理,语音识别,医疗保健,车辆网络等。传统上,机器学习(ML)方法要求数据集中在基于云的数据中心中。但是,这些数据的数量和隐私敏感性通常很大,这阻止了登录这些数据中心以训练学习模型。反过来,这导致了高潜伏期和沟通效率低下的关键问题。最近,鉴于许多国家的新隐私立法,已经引入了联邦学习的概念(FL)。在FL中,移动用户有权通过汇总其本地模型来学习全球模型,而无需共享对隐私敏感的数据。通常,这些移动用户与维护全局模型的数据中心具有缓慢的网络连接。此外,在复杂而大规模的网络中,涉及具有各种能量限制的异质设备。这引发了大规模实施FL时沟通成本的挑战。为此,在这项研究中,我们从FL的基本原理开始,然后,我们重点介绍了最近的FL算法,并通过详细的比较评估了他们的沟通效率。此外,我们提出了一系列解决方案,以减轻从交流的角度和隐私角度来减轻现有的FL问题。
In the era of advanced technologies, mobile devices are equipped with computing and sensing capabilities that gather excessive amounts of data. These amounts of data are suitable for training different learning models. Cooperated with advancements in Deep Learning (DL), these learning models empower numerous useful applications, e.g., image processing, speech recognition, healthcare, vehicular network and many more. Traditionally, Machine Learning (ML) approaches require data to be centralised in cloud-based data-centres. However, this data is often large in quantity and privacy-sensitive which prevents logging into these data-centres for training the learning models. In turn, this results in critical issues of high latency and communication inefficiency. Recently, in light of new privacy legislations in many countries, the concept of Federated Learning (FL) has been introduced. In FL, mobile users are empowered to learn a global model by aggregating their local models, without sharing the privacy-sensitive data. Usually, these mobile users have slow network connections to the data-centre where the global model is maintained. Moreover, in a complex and large scale network, heterogeneous devices that have various energy constraints are involved. This raises the challenge of communication cost when implementing FL at large scale. To this end, in this research, we begin with the fundamentals of FL, and then, we highlight the recent FL algorithms and evaluate their communication efficiency with detailed comparisons. Furthermore, we propose a set of solutions to alleviate the existing FL problems both from communication perspective and privacy perspective.