基于XGBoost的公平有效的混合联合学习框架，用于分布式电力预测

论文标题

基于XGBoost的公平有效的混合联合学习框架，用于分布式电力预测

A Fair and Efficient Hybrid Federated Learning Framework based on XGBoost for Distributed Power Prediction

论文作者

Liu, Haizhou, Zhang, Xuan, Shen, Xinwei, Sun, Hongbin

论文摘要

在现代电力系统中，有关发电/消费及其相关功能的实时数据存储在各种分布式各方，包括家用电表，变压器站和外部组织。为了充分利用这些分布式数据的基本模式进行准确的功率预测，需要将联合学习作为一种协作但保护隐私的培训计划。但是，当前的联合学习框架是为了解决数据的水平或垂直分离的两极分化，并且倾向于忽略两者都存在的情况。此外，在主流水平联合学习框架中，仅采用人工神经网络来学习数据模式，与表格数据集中的基于树的模型相比，这些模式被认为不准确和可解释。为此，我们提出了一个基于XGBoost的混合联合学习框架，以实时外部功能分布式功率预测。除了引入增强树以提高准确性和可解释性外，我们还结合了水平和垂直的联合学习，以解决该场景，在该场景中，在本地异质方和样本中分散的特征分散在各个地方地区。此外，我们设计了一个动态的任务分配方案，以便每个方获得相当一部分信息，并且可以完全利用各方的计算能力来提高培训效率。提出了一项后续案例研究，以证明采用拟议框架的必要性是合理的。还确认了拟议框架在公平，效率和准确性绩效方面的优势。

In a modern power system, real-time data on power generation/consumption and its relevant features are stored in various distributed parties, including household meters, transformer stations and external organizations. To fully exploit the underlying patterns of these distributed data for accurate power prediction, federated learning is needed as a collaborative but privacy-preserving training scheme. However, current federated learning frameworks are polarized towards addressing either the horizontal or vertical separation of data, and tend to overlook the case where both are present. Furthermore, in mainstream horizontal federated learning frameworks, only artificial neural networks are employed to learn the data patterns, which are considered less accurate and interpretable compared to tree-based models on tabular datasets. To this end, we propose a hybrid federated learning framework based on XGBoost, for distributed power prediction from real-time external features. In addition to introducing boosted trees to improve accuracy and interpretability, we combine horizontal and vertical federated learning, to address the scenario where features are scattered in local heterogeneous parties and samples are scattered in various local districts. Moreover, we design a dynamic task allocation scheme such that each party gets a fair share of information, and the computing power of each party can be fully leveraged to boost training efficiency. A follow-up case study is presented to justify the necessity of adopting the proposed framework. The advantages of the proposed framework in fairness, efficiency and accuracy performance are also confirmed.

下载PDF全文

下载文献需遵守相关版权规定

论文标题