当前位置: X-MOL 学术Future Gener. Comput. Syst. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Holistic thermal-aware workload management and infrastructure control for heterogeneous data centers using machine learning
Future Generation Computer Systems ( IF 7.5 ) Pub Date : 2021-01-12 , DOI: 10.1016/j.future.2021.01.007
SeyedMorteza MirhoseiniNejad , Ghada Badawy , Douglas G. Down

Two key contributors to the energy expenditure in data centers are information technology (IT) equipment and cooling infrastructures. The standard practice of data centers lacks a tight correlation between these two entities, resulting in considerable power wastage. Considering the cooling cost of different locations inside a data center (cooling heterogeneity) and various cooling capabilities of servers (server heterogeneity) has significant potential for saving power, yet has not been studied thoroughly in the literature. There is a necessity for state-of-the-art approaches to integrate the control of IT and cooling units. Moreover, the literature still lacks an accurate and fast thermal model for temperature prediction inside a data center. In this paper, innovative approaches to quantify data center thermal heterogeneities are presented. Using data center thermal models the cost of providing cold air at the front of servers can be (indirectly) calculated, and the capability of servers to be cooled is formulated. Our approach assigns jobs to locations that are efficient to cool (from the perspectives of both servers and cooling units) and tunes cooling unit parameters. The method, called holistic data center infrastructure control (HDIC), has the potential to save a considerable amount of power by exploiting synergies between the workload scheduler and operational parameters of cooling units.



中文翻译:

使用机器学习对异构数据中心进行整体热感知工作负载管理和基础架构控制

数据中心能耗的两个关键因素是信息技术(IT)设备和冷却基础设施。数据中心的标准做法在这两个实体之间缺乏紧密的联系,从而导致大量的功率浪费。考虑到数据中心内部不同位置的冷却成本(冷却异构性)和服务器的各种冷却能力(服务器异构性)具有显着的节能潜力,但尚未在文献中进行深入研究。有必要采用最先进的方法来集成IT和冷却设备的控制。此外,文献仍然缺乏用于数据中心内部温度预测的准确且快速的热模型。在本文中,提出了量化数据中心热异构性的创新方法。使用数据中心热模型,可以(间接)计算在服务器前端提供冷空气的成本,并确定服务器的冷却能力。我们的方法将作业分配到有效冷却的位置(从服务器和冷却单元的角度来看),并调整冷却单元参数。该方法被称为整体数据中心基础设施控制(HDIC),它有潜力通过利用工作负载调度程序和冷却单元的运行参数之间的协同作用来节省大量电量。我们的方法将作业分配到有效冷却的位置(从服务器和冷却单元的角度来看),并调整冷却单元参数。该方法被称为整体数据中心基础设施控制(HDIC),它有潜力通过利用工作负载调度程序和冷却单元的运行参数之间的协同作用来节省大量电量。我们的方法将作业分配到有效冷却的位置(从服务器和冷却单元的角度来看),并调整冷却单元参数。该方法被称为整体数据中心基础设施控制(HDIC),它有潜力通过利用工作负载调度程序和冷却单元的运行参数之间的协同作用来节省大量电量。

更新日期:2021-01-19
down
wechat
bug