BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:America/New_York
X-LIC-LOCATION:America/New_York
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:19700308T020000
RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=2SU
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:19701101T020000
RRULE:FREQ=YEARLY;BYMONTH=11;BYDAY=1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20250626T234540Z
LOCATION:B303
DTSTART;TZID=America/New_York:20241118T105000
DTEND;TZID=America/New_York:20241118T111000
UID:submissions.supercomputing.org_SC24_sess748_ws_pmbss111@linklings.com
SUMMARY:Microarchitectural Comparison and In-Core Modeling of State-of-the
 -Art CPUs: Grace, Sapphire Rapids, and Genoa
DESCRIPTION:Jan Laukemann (University of Erlangen-Nuremberg, Germany; Erla
 ngen National High Performance Computing Center) and Georg Hager and Gerha
 rd Wellein (Erlangen National High Performance Computing Center)\n\nWith N
 VIDIA’s release of the Grace Superchip, all three big semiconductor compan
 ies in HPC (AMD, Intel, NVIDIA) are currently competing in the race for th
 e best CPU. In this work we analyze the performance of these state-of-the-
 art CPUs and create an accurate in-core performance model for their microa
 rchitectures (Zen 4, Golden Cove, and Neoverse V2), extending the Open Sou
 rce Architecture Code Analyzer (OSACA) tool and comparing it with LLVM-MCA
 . Starting from the peculiarities and upsides/downsides of a single core, 
 we extend our comparison by a variety of microbenchmarks and the capabilit
 ies of a full node. The “write-allocate (WA) evasion” feature, which can a
 utomatically reduce the memory traffic caused by write misses, receives sp
 ecial attention; we show that the Grace Superchip has a next-to-optimal im
 plementation of WA evasion, and that the only way to avoid write allocates
  on Zen 4 is the explicit use of non-temporal stores.\n\nTag: Accelerators
 , Modeling and Simulation, Performance Evaluation and/or Optimization Tool
 s\n\nRegistration Category: Workshop Reg Pass\n\nSession Chairs: Simon Ham
 mond (National Nuclear Security Administration (NNSA)); Sascha Hunold (Tec
 hnical University of Vienna); and Steven A. Wright (University of York, En
 gland)\n\n
END:VEVENT
END:VCALENDAR
