BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:America/New_York
X-LIC-LOCATION:America/New_York
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:19700308T020000
RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=2SU
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:19701101T020000
RRULE:FREQ=YEARLY;BYMONTH=11;BYDAY=1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20250626T234540Z
LOCATION:B306
DTSTART;TZID=America/New_York:20241117T103000
DTEND;TZID=America/New_York:20241117T105000
UID:submissions.supercomputing.org_SC24_sess734_ws_pawatm102@linklings.com
SUMMARY:Speeding-Up LULESH on HPX: Useful Tricks and Lessons Learned using
  a Many-Task-Based Approach
DESCRIPTION:Torben Kalkhof and Andreas Koch (Technical University Darmstad
 t)\n\nCurrent programming models face challenges in dealing with modern su
 percomputers' growing parallelism and heterogeneity. Emerging programming 
 models, like the task-based programming model found in the asynchronous ma
 ny-task HPX programming framework, offer new ways to express parallelism, 
 enhance scalability, and mask synchronization and communication latency on
  multi-core and distributed systems.\n\nRegular high-performance computing
  benchmarks are often unsuitable for comparing different programming model
 s due to their limited code complexity. However, real-world scientific app
 lications are usually too complex. As a middle ground, proxy applications 
 model the behavior of actual scientific problems, while reducing code comp
 lexity.\n\nIn our research on using HPX to program machines with heterogen
 eous compute units (e.g., GPU and FPGA/AI Engines), we have also substanti
 ally optimized a pure HPX-based software baseline of the LULESH proxy appl
 ication. This paper discusses the techniques we applied, yielding single-n
 ode speed-ups of 1.33x to 2.25x for different problem sizes relative to th
 e LULESH OpenMP reference implementation.\n\nTag: Heterogeneous Computing,
  Parallel Programming Methods, Models, Languages and Environments,
  PAW-Full
 , Task Parallelism\n\nRegistration Category: Workshop Reg Pass\n\nSession 
 Chairs: Engin Kayraklioglu (Hewlett Packard Enterprise (HPE)); Daniele Lez
 zi (Barcelona Supercomputing Center (BSC)); Karla Vanessa Morris Wright (S
 andia National Laboratories); Irene Moulitsas (Cranfield University); Elli
 ott Slaughter (SLAC National Accelerator Laboratory); and Kenjiro Taura (T
 he University of Tokyo, Japan)\n\n
END:VEVENT
END:VCALENDAR
