Close

Presentation

Benchmarking and Continuous Performance Monitoring of Ookami, an ARM Fujitsu A64FX Testbed Cluster
DescriptionContinuous performance monitoring is critical for maintaining optimal performance of High-Performance Computing resources. This is especially important for technological test bed systems, in which software updates occur often, and performance degradation in one place can be masked by performance improvement in other places. This paper reports on our experience running continuous performance monitoring on Ookami, an ARM Fujitsu A64FX machine (the first ARM CPU with SVE-512 support) using XDMoD. After over three years of monitoring, we found that the applications and numerical library performance improved the most on the initial release with new technology support, followed by a series of smaller performance gains. Another interesting observation about numeric libraries is that the most invested vendors produce optimized code faster than community codes.
Event Type
Workshop
TimeFriday, 22 November 202411:10am - 11:30am EST
LocationB309
Tags
Debugging and Correctness Tools
Hardware Technologies
Resource Management
State of the Practice
Registration Categories
W