Dear PACE Users,
Our scheduled maintenance has completed ahead of the schedule! All Coda datacenter clusters are ready for research. As usual, we have released all users jobs that were held by the scheduler. We appreciate everyone’s patience as we worked through these maintenance activities.
Our next maintenance period is tentatively scheduled to begin at 6:00AM on Wednesday, 11/03/2021, and it is tentatively scheduled to conclude by 11:59PM on Friday, 11/05/2021.
Here is an update on the tasks performed during this maintenance period.
ITEMS REQUIRING USER ACTION:
- None.
ITEMS NOT REQUIRING USER ACTION:
- [Complete] [Datacenter] Databank will need to replace components of one of the transformers feeding the room that will require a complete power off for the research hall that includes the PACE managed clusters.
- [Complete] [Storage] Upgrade controller for the storage appliances: SFA200NV, SFA18KE
- [Complete] [Storage] Replace a miniSAS cable on Hive storage appliance: SFA14KXE
- [Complete] [Storage] Replace a failed hard drive on a pre-production OSG cluster
- [Complete] [System/Security] Operating system patch installs
- [Complete] [System/Security] Endpoint Protection Updates
- [Complete] [Benchmarks] Conduct IO500 and HPCG benchmarks for Hive and Phoenix clusters
- [Complete] [System] Update NVidia drivers and add NVidia specific libraries
- [Complete] [System] Reboot scheduler nodes
If you have any questions or concerns, please do not hesitate to contact us at pace-support@oit.gatech.edu.
Best,
The PACE Team