Close

Session

Workshop: PDSW24: The 9th International Parallel Data Systems Workshop
DescriptionEfficient data storage and data management are crucial to scientific productivity in both traditional simulation-oriented HPC environments and Cloud, AI/ML/Big Data analysis environments. This issue is further exacerbated by the growing volume of experimental and observational data, the widening gap between the performance of computational hardware and storage hardware, and the emergence of new data-driven algorithms in machine learning. The goal of this workshop is to facilitate research and development that addresses the most critical challenges in large-scale data storage and data processing. PDSW will continue to build on the successful tradition established by its predecessor workshops: the Petascale Data Storage Workshop (PDSW, 2006-2015) and the Data-Intensive Scalable Computing Systems (DISCS 2012-2015) workshop. These workshops were successfully combined in 2016, and the resulting joint workshop has attracted up to 38 full paper submissions and 195 attendees per year from 2016 to 2023.
Event TypeWorkshop
TimeSunday, 17 November 20249am - 5:30pm EST
LocationB309
Tags
Data Movement and Memory
I/O, Storage, Archive
Registration Categories
W
Presentations
9:00am - 9:10am ESTPDSW 2024 Welcome
Presenter
9:10am - 10:00am ESTInvited Talk: Bridging the Data Gaps in Computing for Science, Education and Society
10:00am - 10:30am ESTPDSW24 — Morning Break
10:30am - 11:00am ESTFault-Tolerant Deep Learning Cache with Hash Ring for Load Balancing in HPC Systems
11:00am - 11:30am ESTMOSAIC: Detection and Categorization of I/O Patterns in HPC Applications
11:30am - 12:00pm ESTExploring DAOS Interfaces and Performance
12:00pm - 12:05pm ESTScalable RPC Layer Towards Millions of IOPS per Server
12:05pm - 12:10pm ESTReducing I/O Bottleneck for Pretraining AI Foundation Models for Climate
12:10pm - 12:15pm ESTBULKI: Binary Unified Layout for Key-Value Interchange
12:15pm - 12:20pm ESTDistributed, Resilient and In-Memory Storage of Key-Value Data for HPC
12:20pm - 12:25pm ESTA Global In-Memory Cache and Computation Tier for DAOS
12:25pm - 12:30pm ESTAre Streaming Engines and Vector Databases Integrated Well?
12:30pm - 2:00pm ESTPDSW24 — Lunch Break
2:00pm - 2:30pm ESTInitial Experiences with DAOS Object Storage on Aurora
2:30pm - 3:00pm ESTUnderstanding and Predicting Cross-Application I/O Interference in HPC Storage Systems
3:00pm - 3:30pm ESTPDSW24 — Afternoon Break
3:30pm - 4:00pm ESTCopper: Cooperative Caching Layer for Scalable Data Loading in Exascale Supercomputers
4:00pm - 4:05pm ESTJarvis: Towards a Shared, User-Friendly, and Reproducible I/O Infrastructure
4:05pm - 4:10pm ESTDAOS Project Update - One Year in the DAOS Foundation
4:10pm - 4:15pm ESTImproving SQL Query Execution of Distributed Query Engines on Object-Based Computational Storage through Multi-Layered Offloading
4:15pm - 4:20pm ESTLustre for Grace Hopper: Current Status Report
Author/Presenters
4:20pm - 4:25pm ESTExploring the Proactive Data Containers Runtime System in VAST - A Case Study
Author/Presenters
4:25pm - 4:30pm ESTSilent Errors to Scientific Applications: Impacts of PFS Metadata Corruptions
4:30pm - 4:35pm ESTWhen Stream Processing Engine Meets Log-Structured Merge-Tree as State Store
Author/Presenters
4:35pm - 5:30pm ESTPanel: Data, Data Everywhere