BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:America/New_York
X-LIC-LOCATION:America/New_York
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:19700308T020000
RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=2SU
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:19701101T020000
RRULE:FREQ=YEARLY;BYMONTH=11;BYDAY=1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20260422T143052Z
LOCATION:B309
DTSTART;TZID=America/New_York:20241119T133000
DTEND;TZID=America/New_York:20241119T150000
UID:submissions.supercomputing.org_SC24_sess403@linklings.com
SUMMARY:Compiler Analysis and Code Generation
DESCRIPTION:Moirae: Generating High-Performance Composite Stencil Programs
  with Global Optimizations\n\nStencil computation is one of the most unive
 rsal computation motifs in scientific applications such as weather predict
 ion. Due to the complexity of scientific simulation, the stencil computati
 on can contain a set of complex stencil operations that form a directed ac
 yclic graph (referred to composite...\n\n\nXiaoyan Liu, Xinyu Yang, Kejie 
 Ma, Shanghao Liu, Kaige Zhang, Hailong Yang, Yi Liu, Zhongzhi Luan, and De
 pei Qian (Beihang University)\n---------------------\nAutomated Code Gener
 ation of High-Order Stencils for a Dataflow Architecture\n\nFinite-differe
 nce methods based on high-order stencils are widely used in seismic simula
 tions, weather forecasting, and computational fluid dynamics. Recently, mu
 ltiple research groups have begun exploring the use of dataflow architectu
 res, such as Cerebras' wafer-scale engine, to accelerate stencil...\n\n\nR
 yuichi Sai, John Mellor-Crummey, and Jinfan Xu (Rice University) and Mauri
 cio Araya-Polo (TotalEnergies E&P Research and Technology USA, LLC)\n-----
 ----------------\nautoGEMM: Pushing the Limits of Irregular Matrix Multipl
 ication on Arm Architectures\n\nThis paper presents an open-source library
  that pushes the limits of performance portability for irregular General M
 atrix Multiplication (GEMM) computations on the widely-used Arm architectu
 res. autoGEMM generates optimized kernels for various hardware configurati
 ons by auto-combining fragments of a...\n\n\nDu Wu (Tokyo Institute of Tec
 hnology, RIKEN); Jintao Meng (Shenzhen Institute of Advanced Technology, C
 hinese Academy of Sciences); Wenxi Zhu and Minwen Deng (Tencent); Xiao Wan
 g (Oak Ridge National Laboratory (ORNL)); Tao Luo (Agency for Science, Tec
 hnology and Research (A*STAR)); Mohamed Wahib (RIKEN); and Yanjie Wei (She
 nzhen Institute of Advanced Technology, Chinese Academy of Sciences)\n\nTa
 g: Accelerators, Compilers, Embedded and/or Reconfigurable Systems, Linear
  Algebra, Performance Evaluation and/or Optimization Tools\n\nRegistration
  Category: Tech Program Reg Pass\n\nSession Chair: Keita Teranishi (Oak Ri
 dge National Laboratory (ORNL))
END:VEVENT
END:VCALENDAR
