Summary: The Hive scheduler became unresponsive yesterday evening and was restored at approximately 8:30 AM today.
Details: The Torque resource manager on the Hive scheduler stopped responding around 7:00 PM yesterday. The PACE team restored it around 8:30 AM this morning and is continuing to monitor its status. The scheduler was fully functional for some time after yesterday afternoon's system utility repair, and it is not clear whether the two issues are connected.
Impact: Commands such as “qsub” and “qstat” would not have worked during the outage, so new jobs could not be submitted, including via Hive Open OnDemand. Running jobs were not interrupted.
Thank you for your patience last night. Please contact us at pace-support@oit.gatech.edu with any questions.