[Update 9/14/2023 1:02pm]
The cables on Phoenix and Hive storage have been replaced with no interruption to production.
[Update 9/14/2023 5:54pm]
WHAT’S HAPPENING?
Two cables on Phoenix’s Lustre storage and one cable on Hive’s storage need to be replaced. The replacement is expected to take about 2 hours.
WHEN IS IT HAPPENING?
Thursday, September 14, 2023, starting at 10 AM EDT.
WHY IS IT HAPPENING?
Required maintenance.
WHO IS AFFECTED?
All users may experience a storage access outage, followed by temporarily decreased performance.
WHAT DO YOU NEED TO DO?
During the cable replacement on each of the Phoenix and Hive clusters, one of the controllers will be shut down and the redundant controller will take all the traffic. Data access should be preserved, but there have been cases where storage has become inaccessible. If storage becomes unavailable during the replacement, your job may fail or run without making progress. If you have such a job, please cancel it and resubmit it once storage availability is restored.
WHO SHOULD YOU CONTACT FOR QUESTIONS?
For any questions, please contact PACE at pace-support@oit.gatech.edu.