Close

Presentation

Hydrogen: Contention-Aware Hybrid Memory for Heterogeneous CPU-GPU Architectures
DescriptionIntegrating hybrid memories with heterogeneous processors could leverage heterogeneity in both compute and memory domains for better system efficiency. To ensure performance isolation, we introduce Hydrogen, a novel hardware architecture to optimize the allocation of hybrid memory resources to heterogeneous CPU-GPU systems. Hydrogen supports efficient capacity and bandwidth partitioning between CPUs and GPUs in both memory tiers. With the key observation that CPUs and GPUs exhibit distinct preferences to memory capacity and bandwidth, Hydrogen enables decoupled capacity and bandwidth allocation between CPUs and GPUs with flexible partitioning ratios. It also throttles overly excessive data migration from GPUs with a token-based mechanism. To effectively explore the large, multi-dimensional design space and support dynamically varying application behaviors, Hydrogen uses epoch-based online search for optimized configurations, and incorporates lightweight reconfiguration with reduced data movements.
Combining these novel techniques, Hydrogen significantly outperforms existing designs by 1.16× on average, and up to 1.31×.
Event Type
Paper
TimeTuesday, 19 November 202411am - 11:30am EST
LocationB311
Tags
Architecture
Codesign
Data Movement and Memory
Energy Efficiency
Green Computing
Linear Algebra
Registration Categories
TP