BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:America/New_York
X-LIC-LOCATION:America/New_York
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:19700308T020000
RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=2SU
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:19701101T020000
RRULE:FREQ=YEARLY;BYMONTH=11;BYDAY=1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20250626T234539Z
LOCATION:B302-B305
DTSTART;TZID=America/New_York:20241119T120000
DTEND;TZID=America/New_York:20241119T170000
UID:submissions.supercomputing.org_SC24_sess487_post259@linklings.com
SUMMARY:Predicting Dataset Popularity for Improved Distributed Content Cac
 hing in High Energy Physics
DESCRIPTION:Malavikha Sudarshan (University of California, Berkeley)\n\nIn
  High Energy Physics (HEP), large-scale experiments generate massive amoun
 ts of data that are distributed globally. To reduce redundant data transfe
 rs and improve analysis efficiency, a disk caching system named XCache is 
 used to manage data accesses. By analyzing 11 months of access logs (4.5 m
 illion requests), we identified patterns in dataset usage and developed a 
 predictive model to forecast the popularity of frequently accessed dataset
 s. \n\nBased on extensive exploratory data analysis, we found that pinging
  the most popular datasets (pinning these in the cache) could significantl
 y improve access efficiency, and we implemented an LSTM model to predict d
 ataset accesses and optimize cache policies. \n\nThe model demonstrates st
 rong predictive performance with a low mean relative error of 0.779 across
  training and test datasets. Future work will incorporate anomaly detectio
 n techniques to improve robustness. This study highlights the potential of
  LSTM models in optimizing distributed content caching in HEP.\n\nRegistra
 tion Category: Tech Program Reg Pass, Exhibits Reg Pass\n\nSession Chairs:
  Ayesha Afzal (Friedrich-Alexander University, Erlangen-Nuremberg; Erlange
 n National High Performance Computing Center); Sally Ellingson (University
  of Kentucky); and Alan Sussman (University of Maryland)\n\n
END:VEVENT
END:VCALENDAR
