ITS Service Status Report

Emergency Maintenance — Resolved

Slurm maintenance on the ARC HPC Clusters

Services Affected: Research HPC

Start Time: 05/03/2022 7:00 am

Maintenance Completed: 05/03/2022 7:37 am

Issue Symptoms: Degradation

Slurm will be unavailable on all three ARC HPC clusters (Great Lakes, Armis2, and Lighthouse) while we update the Slurm software version. The downtime is expected to last 30 minutes or less.

  • Users can remain logged in and access their files during the updates. 
  • Jobs that cannot complete before 7 a.m. on May 3 will not start until maintenance is completed, at which point they become eligible to start.

 

Who is Impacted? ARC HPC Researchers

Next Update: 05/03/2022 7:30 am

Technical Details

Service Type: Production

Server Name: Great Lakes, Armis2, and Lighthouse

Comments:

All clusters have been updated and returned to production.

Report Additional Impacts

Contact the ITS Service Center for more information or to report additional impacts.