Presentation
AIOps and Sustainability: Transforming Data Centers for a Greener Future
Session: Sustainable Supercomputing
Description: Enterprise and high-performance computing data centers must handle thousands of sensor metrics and their associated data. A top-end target for exascale machines is 10 million data points per second. The escalating volume and velocity of this data make data centers harder to monitor and operate, and outages are increasing. Uptime Institute's Outage Analysis report, published in June 2022, states that 30% of all outages in 2021 lasted more than 24 hours, a disturbing increase from 8% in 2017. Equipment that sits idle during downtime often continues to draw power, particularly for cooling, which wastes energy and raises operational costs. We propose an AIOps solution that applies data analytics, machine learning, and deep learning methods to build automated anomaly detection and predictive tools for data centers. These tools operate at scale and speed and improve data center resiliency and energy efficiency, thereby promoting the sustainability of data centers.