NERC - Sunday 04/27/2025- Unexpected Overnight Compute Node Outage at MGHPCC Due to Chiller/Power Issue – Incident details

GETTING HELP
Email: help@nerc.mghpcc.org or, using the NERC's Support Ticketing System
NERC Documentation: https://nerc-project.github.io/nerc-docs/
Status page for the New England Research Cloud (NERC) and other resources.
Please scroll down to see details on any Incidents or maintenance notices.

Sunday 04/27/2025- Unexpected Overnight Compute Node Outage at MGHPCC Due to Chiller/Power Issue

Resolved
Operational
Started 17 days agoLasted less than a minute

Affected

NERC OPENSTACK

Operational from 5:05 AM to 5:05 AM

NERC OPENSTACK COMPUTE SERVICE (NOVA)

Operational from 5:05 AM to 5:05 AM

OPENSTACK NETWORKING SERVICE (NEUTRON)

Operational from 5:05 AM to 5:05 AM

OPENSTACK BLOCK STORAGE (CINDER)

Operational from 5:05 AM to 5:05 AM

OPENSTACK OBJECT STORE (SWIFT)

Operational from 5:05 AM to 5:05 AM

OPENSTACK IMAGE SERVICE (GLANCE)

Operational from 5:05 AM to 5:05 AM

Updates
  • Resolved
    Resolved

    There was a chiller issue at the MGHPCC overnight that caused all non-UPS power to be shut off (at 1:05AM on Sunday 04/27/2025).  The chiller has been fixed and non-UPS power was restored at 5:43am. We have manually powered on all affected nodes. So if you still facing any issues please let us know via email help@nerc.mghpcc.org.

    This incident has been resolved.