BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:America/New_York
X-LIC-LOCATION:America/New_York
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:19700308T020000
RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=2SU
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:19701101T020000
RRULE:FREQ=YEARLY;BYMONTH=11;BYDAY=1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20250626T234540Z
LOCATION:B311
DTSTART;TZID=America/New_York:20241119T160000
DTEND;TZID=America/New_York:20241119T163000
UID:submissions.supercomputing.org_SC24_sess382_pap443@linklings.com
SUMMARY:Rapid GPU-Based Pangenome Graph Layout
DESCRIPTION:Jiajie Li (Cornell University); Jan-Niklas Schmelzle (Technica
 l University of Munich); Yixiao Du (Cornell University); Simon Heumos (Uni
 versity of Tübingen, Germany); Andrea Guarracino (University of Tennessee 
 Health Science Center); Giulia Guidi (Cornell University); Pjotr Prins and
  Erik Garrison (University of Tennessee Health Science Center); and Zhiru 
 Zhang (Cornell University)\n\nComputational Pangenomics is an emerging fie
 ld that studies genetic variation using a graph structure encompassing mul
 tiple genomes. Visualizing pangenome graphs is vital for understanding gen
 ome diversity. Yet, handling large graphs can be challenging due to the hi
 gh computational demands of the graph layout process. \n\nIn this work, we
  conduct a thorough performance characterization of a state-of-the-art pan
 genome graph layout algorithm, revealing significant data-level parallelis
 m, which makes GPUs a promising option for compute acceleration. However, 
 irregular data access and the algorithm's memory-bound nature present sign
 ificant hurdles. To overcome these challenges, we develop a solution imple
 menting three key optimizations: a cache-friendly data layout, coalesced r
 andom states, and warp merging. Additionally, we propose a quantitative me
 tric for scalable evaluation of pangenome layout quality.\n\nEvaluated on 
 24 human whole-chromosome pangenomes, our GPU-based solution achieves a 57
 .3x speedup over the state-of-the-art multithreaded CPU baseline without l
 ayout quality loss, reducing execution time from hours to minutes.\n\nTag:
  Accelerators, Applications and Application Frameworks, Graph Algorithms, 
 Modeling and Simulation, Numerical Methods\n\nRegistration Category: Tech 
 Program Reg Pass\n\nSession Chair: Wenqian Dong (Oregon State University)\
 n\n
END:VEVENT
END:VCALENDAR
