Presentation
UNR: Unified Notifiable RMA Library for HPC
SessionScale–Out Interconnects
DescriptionRemote Memory Access (RMA) enables direct access to remote memory to achieve high performance for HPC applications. However, most modern parallel programming models lack schemes for the remote process to detect the completion of RMA operations. Many previous works have proposed programming models and extensions to notify the communication peer, but they did not solve the multi-NIC aggregation, portability, hardware-software co-design, and usability problems. In this work, we proposed a Unified Notifiable RMA (UNR) library for HPC to address these challenges. In addition, we demonstrate the best practice of utilizing UNR within a real-world scientific application, PowerLLEL. We deployed UNR across four HPC systems, each with a different interconnect. The results show that PowerLLEL powered by UNR achieves up to a 36% acceleration on 1728 nodes of the Tianhe-Xingyi supercomputing system.
Event Type
Paper
TimeThursday, 21 November 20244:30pm - 5pm EST
LocationB312-B313A
Data Compression
Data Movement and Memory
Distributed Computing
Message Passing
Network
TP
Archive
view