[Update – 02/11/2019] Our updated quarterly scheduled maintenance task list will include the following:
Compute
- (no user action needed) Vendor will replace defective components on groups of servers
Network
- (no user action needed) Ethernet network reconfiguration
Storage
- (no user action needed) GPFS / DDN enclosure reset
- (no user action needed) NAS maintenance and reconfiguration
Other
- (no user action needed) PACE VMWare reconfiguration to remove out of support hosts
[Original Post – 01/18/2019] We are preparing for a short maintenance day on February 15, 2019. Unlike our regular schedule, which starts on Thursdays and takes three days, this maintenance will start on a Friday and take only two days.
As usual, jobs with long walltimes will be held by the scheduler to ensure that no active jobs will be running when systems are powered off. These jobs will be released as soon as the maintenance activities are complete.
In general, we’ll perform maintenance on the GPFS storage, migrate some Virtual Machines to new servers, perform hardware changes on one of the clusters, and finalize the migration of “/usr/local”, which is network attached mount point on all machines, to a more reliable storage pool.
While we are still working on finalizing the task list and details, none of these tasks are expected to require any user actions.
We’ll update this post as we have more details.