Presentation
Benchmarking and Continuous Performance Monitoring of Ookami, an ARM Fujitsu A64FX Testbed Cluster
Description
Continuous performance monitoring is critical for keeping High-Performance Computing resources running at optimal performance. This is especially important for technology testbed systems, where software updates are frequent and a performance degradation in one place can be masked by a performance improvement elsewhere. This paper reports on our experience running continuous performance monitoring with XDMoD on Ookami, an ARM Fujitsu A64FX machine (the first ARM CPU with 512-bit SVE support). After more than three years of monitoring, we found that application and numerical library performance improved the most with the initial release that added support for the new technology, followed by a series of smaller performance gains. Another interesting observation about numerical libraries is that the most invested vendors deliver optimized code sooner than community projects do.