All nodes and GPFS filesystem issues affected from the power failure should be resolved as of late Friday evening (June 16) . If you are still experiencing problems, please let us know at pace-support@oit.gatech.edu.
June 19, 2017
Storage (GPFS) and datacenter problems resolved
June 16, 2017
PACE is experiencing storage (GPFS) problems
We are experiencing intermittent problems with the GPFS storage system that hosts most of the project directories.
We are working with the vendor to investigate the ongoing issues. At this moment we don’t know whether they are related to yesterday’s power/cooling failures or not, but we will update the PACE community as we find out more.
This issue has potential impact on running jobs and we are sorry for this inconvenience.
PACE datacenter experienced a power/cooling failure
Impacted Queues:
June 7, 2017
Large Scale Problem
Update (6/7/2017, 1:20pm): The network issues are now addressed and systems are back in normal operation.Please check your jobs and resubmit failed jobs as needed. If you continue to experience any problems, or need our assistance for anything else, please contact us at pace-support@oit.gatech.edu. We are sorry for this inconvenience and thank you once again for your patience.
Original message: We are experiencing a large scale network problem impacting multiple storage servers and software repository with a potential impact on running jobs. We are currently actively working to get it resolved and will provide updates as much as possible. We appreciate your patience and understanding, and are committed to resolving the issue as soon as we possibly can.