Presentation
Leveraging DOE infrastructure for Large-Scale Data Analysis in Collaborative Science
DescriptionIn this talk, I will explore how the DOE Joint Genome Institute (JGI) advances large-scale genomics data generation while connecting the scientific community beyond sequence production. JGI can enable real-time exploration of vast datasets through collaborations with platforms such as the National Microbiome Data Collaborative (NMDC) and KBase. This is made possible by the infrastructure we are building to ensure that workflow analysis is portable, reproducible, and shareable.
The JGI Analysis Workflow Service (JAWS), a scalable, centralized framework is at the core of this infrastructure. JAWS was developed to streamline the execution and management of computational workflows across DOE resources by simplifying complex workflows in DOE high-performance computing (HPC) clusters and cloud environments.
This approach empowers scientists to rapidly identify patterns or filter results, leveraging HPC to expedite time-sensitive analyses. JGI’s infrastructure is becoming increasingly relevant to support urgent scientific efforts, from bioenergy research to addressing climate resilience.
The JGI Analysis Workflow Service (JAWS), a scalable, centralized framework is at the core of this infrastructure. JAWS was developed to streamline the execution and management of computational workflows across DOE resources by simplifying complex workflows in DOE high-performance computing (HPC) clusters and cloud environments.
This approach empowers scientists to rapidly identify patterns or filter results, leveraging HPC to expedite time-sensitive analyses. JGI’s infrastructure is becoming increasingly relevant to support urgent scientific efforts, from bioenergy research to addressing climate resilience.