Close

Session

This content is available for: Workshop Reg Pass. Upgrade Registration
Workshop: PMBS24: The 15th International Workshop on Performance Modeling, Benchmarking, and Simulation of High-Performance Computer Systems
DescriptionThe PMBS24 workshop is concerned with the comparison of high-performance computing systems through performance modeling, benchmarking or through the use of tools such as simulators. We are particularly interested in research which reports the ability to measure and make tradeoffs in software/hardware co-design to improve sustained application performance. We are also keen to capture the assessment of future systems. The aim of this workshop is to bring together researchers, from industry and academia, concerned with the qualitative and quantitative evaluation and modeling of high-performance computing systems. Authors are invited to submit novel research in all areas of performance modeling, benchmarking and simulation, and we welcome research that brings together current theory and practice. We recognize that the term 'performance' has broadened to include power consumption and reliability, and that performance modeling is practiced through analytical methods and approaches based on software tools and simulators.
Event TypeWorkshop
TimeMonday, 18 November 20249am - 5:30pm EST
LocationB303
Tags
Accelerators
Modeling and Simulation
Performance Evaluation and/or Optimization Tools
Registration Categories
W
Presentations
9:00am - 9:30am ESTLLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators
9:30am - 10:00am ESTComprehensive Performance Modeling and System Design Insights for Foundation Models
10:00am - 10:30am ESTPMBS24 — Morning Break
10:30am - 10:50am ESTSystem-Wide Roofline Profiling: A Case Study on NERSC’s Perlmutter Supercomputer
10:50am - 11:10am ESTMicroarchitectural Comparison and In-Core Modeling of State-of-the-Art CPUs: Grace, Sapphire Rapids, and Genoa
11:10am - 11:30am ESTBenchmarking the Evolution of Performance and Energy Efficiency Across Recent Generations of Intel Xeon Processors
11:30am - 12:00pm ESTPerformance Analysis of Runtime Handling of Zero-Copy for OpenMP Programs on MI300A APUs
12:00pm - 12:30pm ESTPonte Vecchio Across the Atlantic: Single-Node Benchmarking of Two Intel GPU Systems
12:30pm - 2:00pm ESTPMBS24 — Lunch Break
2:00pm - 2:30pm ESTHello SME! Generating Fast Matrix Multiplication Kernels Using the Scalable Matrix Extension
2:30pm - 3:00pm ESTAI-Assisted Design-Space Analysis of High-Performance Arm Processors
3:00pm - 3:30pm ESTPMBS24 — Afternoon Break
3:30pm - 4:00pm ESTImpact of Varying BLAS Precision on DCMESH
4:00pm - 4:30pm ESTAssessing the GPU Offload Threshold of GEMM and GEMV Kernels on Modern Heterogeneous HPC Systems
4:30pm - 5:00pm ESTUnderstanding VASP Power Profiles on NVIDIA A100 GPUs
5:00pm - 5:30pm ESTWorkload-Adaptive Scheduling for Efficient Use of Parallel File Systems in High-Performance Computing Clusters