EE-HPC a Framework for Energy Efficient HPC System Management

Terboven C, Liem R, Gracia J, Haldar K, Engels JF, Giesselmann P, Brayford D, Wilde T, Simmendinger C, Marquardt M, Eitzinger J, Gruber T (2024)


Publication Type: Conference contribution

Publication year: 2024

Publisher: Institute of Electrical and Electronics Engineers Inc.

Pages Range: 1878-1882

Conference Proceedings Title: Proceedings of SC 2024-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis

Event location: Atlanta, GA US

ISBN: 9798350355543

DOI: 10.1109/SCW63240.2024.00236

Abstract

Energy consumption has become a major cost factor in the procurement and operation of large scale HPC data centers. In addition, funding bodies and governments are starting to focus on assessment and improvement of energy efficiency, as well as reducing the overall environmental impact of data centers, like carbon usage reduction. The goal of the EE-HPC project is to develop a targeted job specific control and optimization of the hardware to enable a more efficient energy usage of HPC systems. The project started at the end of 2022 and builds upon the existing stable software components ClusterCockpit [1] and LIKWID [2] developed by FAU. It provides a simple, robust, secure and scalable monitoring & energy control framework for hybrid HPC cluster management. The EE-HPC project is developing energy aware software components that will be integrated with ClusterCockpit for power monitoring and reducing the energy consumption of the system. The framework is complemented with an instrumentation library for fine grained analysis, phase detection and tuning of MPI & OpenMP regions. The effectiveness of the approach is evaluated against a set of representative HPC applications ranging from molecular dynamics to earth system modelling.

Authors with CRIS profile

Involved external institutions

How to cite

APA:

Terboven, C., Liem, R., Gracia, J., Haldar, K., Engels, J.F., Giesselmann, P.,... Gruber, T. (2024). EE-HPC a Framework for Energy Efficient HPC System Management. In Proceedings of SC 2024-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (pp. 1878-1882). Atlanta, GA, US: Institute of Electrical and Electronics Engineers Inc..

MLA:

Terboven, C., et al. "EE-HPC a Framework for Energy Efficient HPC System Management." Proceedings of the 2024 Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC Workshops 2024, Atlanta, GA Institute of Electrical and Electronics Engineers Inc., 2024. 1878-1882.

BibTeX: Download