论文标题
使用机器学习对云的有效能源管理的热预测
Thermal Prediction for Efficient Energy Management of Clouds using Machine Learning
论文作者
论文摘要
超尺度云数据中心中的热管理是一个关键问题。宿主温度升高会产生热点,从而大大提高冷却成本并影响可靠性。准确的宿主温度预测对于有效管理资源至关重要。由于数据中心的热变化,温度估计是一个非平凡的问题。由于其计算复杂性和缺乏准确的预测,现有的温度估计解决方案效率低下。但是,用于温度预测的数据驱动的机器学习方法是一种有前途的方法。在这方面,我们从私有云中收集和研究数据,并显示出热变化的存在。我们研究了几种机器学习模型,以准确预测宿主温度。具体而言,我们为温度预测提出了一个梯度提升机学习模型。实验结果表明,我们的模型可以准确预测温度,平均RMSE值为0.05或平均预测误差为2.38摄氏度,与现有理论模型相比,摄氏6度少6度。此外,我们提出了一种动态调度算法,以最大程度地减少宿主的峰值温度。结果表明,与基线算法相比,我们的算法将峰值温度降低6.5度,并且消耗的能量降低了34.5%。
Thermal management in the hyper-scale cloud data centers is a critical problem. Increased host temperature creates hotspots which significantly increases cooling cost and affects reliability. Accurate prediction of host temperature is crucial for managing the resources effectively. Temperature estimation is a non-trivial problem due to thermal variations in the data center. Existing solutions for temperature estimation are inefficient due to their computational complexity and lack of accurate prediction. However, data-driven machine learning methods for temperature prediction is a promising approach. In this regard, we collect and study data from a private cloud and show the presence of thermal variations. We investigate several machine learning models to accurately predict the host temperature. Specifically, we propose a gradient boosting machine learning model for temperature prediction. The experiment results show that our model accurately predicts the temperature with the average RMSE value of 0.05 or an average prediction error of 2.38 degree Celsius, which is 6 degree Celsius less as compared to an existing theoretical model. In addition, we propose a dynamic scheduling algorithm to minimize the peak temperature of hosts. The results show that our algorithm reduces the peak temperature by 6.5 degree Celsius and consumes 34.5% less energy as compared to the baseline algorithm.