[8/12/22 5:00 PM Update]
PACE continues work to deploy Hive-Slurm. Maintenance on Hive-Slurm only will be extended into next week, as we complete setting up the new environment. At this time, please use the existing (Moab/Torque) Hive, which was released earlier today. We will provide another update next week when the Slurm cluster is ready for research, along with details about how to access and use the new scheduler and updated software stack.
The Slurm Orientation session previously announced for Tuesday, August 16, will be rescheduled for a later time.
If you have any questions or concerns, please contact us at pace-support@oit.gatech.edu.
[8/12/22 2:20 PM Update]
The Phoenix, existing Hive (Moab/Torque), Firebird, PACE-ICE, COC-ICE, and Buzzard clusters are now ready for research and learning. We have released all jobs that were held by the scheduler.
We are continuing to work on launching the Hive-Slurm cluster, and we will provide another update to Hive researchers later today. Maintenance on the existing Hive (Moab/Torque) cluster has completed, and researchers can resume using it.
The next maintenance period for all PACE clusters is November 2, 2022, at 6:00 AM through November 4, 2022, at 11:59 PM. Additional maintenance periods are tentatively scheduled for 2023 on January 31 – February 2, May 9-11, August 8-10, and October 31 – November 2.
Status of activities:
ITEMS REQUIRING USER ACTION:
- [Complete][Utilities] PACE will merge the functionality of pace-whoami into the pace-quota utility. Please use the pace-quota command to find out all relevant information about your account, including storage directories and usage, job charge or tracking accounts, and other relevant information. Running pace-whoami will now report the same output as pace-quota.
- [In progress][Hive] Slurm migration and software stack update first phase for Hive cluster – see recent announcement for details
ITEMS NOT REQUIRING USER ACTION:
- [Complete][Hive][Storage] Cable replacement for GPFS (project/scratch) controller
- [Complete][Datacenter] Transformer repairs
- [Complete][Datacenter] Cooling tower cleaning
- [Complete][Scheduler] Accounting database maintenance
- [Complete][Firebird][Network] Add VPC redundancy
- [Complete][Phoenix][Storage] Replace redundant power supply on Lustre storage system
If you have any questions or concerns, please contact us at pace-support@oit.gatech.edu.
[8/9/22 Update]
This is a reminder that our next PACE maintenance period is scheduled to begin tomorrow at 6:00 AM on Wednesday, August 10, and end at 11:59 PM on Friday, August 12. As usual, jobs that request durations that would extend into the maintenance period will be held by the scheduler to run after maintenance is complete. During the maintenance window, access to all PACE-managed computational and storage resources will be unavailable. This includes Phoenix, Hive, Firebird, PACE-ICE, COC-ICE, and Buzzard.
Tentative list of activities:
ITEMS REQUIRING USER ACTION:
- [Utilities] PACE will merge the functionality of pace-whoami into the pace-quota utility. Please use the pace-quota command to find out all relevant information about your account, including storage directories and usage, job charge or tracking accounts, and other relevant information. Running pace-whoami will now report the same output as pace-quota.
- [Hive] Slurm migration and software stack update first phase for Hive cluster – see recent announcement for details
ITEMS NOT REQUIRING USER ACTION:
- [Hive][Storage] Cable replacement for GPFS (project/scratch) controller
- [Datacenter] Transformer repairs
- [Datacenter] Cooling tower cleaning
- [Scheduler] Accounting database maintenance
- [Firebird][Network] Add VPC redundancy
- [Phoenix][Storage] Replace redundant power supply on Lustre storage system
If you have any questions or concerns, please contact us at pace-support@oit.gatech.edu.
[8/3/22 Update]
This is a reminder that our next PACE maintenance period is scheduled to begin at 6:00 AM on Wednesday, August 10, and end at 11:59 PM on Friday, August 12. As usual, jobs that request durations that would extend into the maintenance period will be held by the scheduler to run after maintenance is complete. During the maintenance window, access to all PACE-managed computational and storage resources will be unavailable. This includes Phoenix, Hive, Firebird, PACE-ICE, COC-ICE, and Buzzard.
Tentative list of activities:
ITEMS REQUIRING USER ACTION:
- [Hive] Slurm migration and software stack update first phase for Hive cluster – see recent announcement for details
ITEMS NOT REQUIRING USER ACTION:
- [Hive][Storage] Cable replacement for GPFS (project/scratch) controller
- [Datacenter] Transformer repairs
- [Datacenter] Cooling tower cleaning
- [Scheduler] Accounting database maintenance
- [Firebird][Network] Add VPC redundancy
- [Phoenix][Storage] Replace redundant power supply on Lustre storage system
If you have any questions or concerns, please contact us at pace-support@oit.gatech.edu.
[7/27/22 Update]
As previously announced, our next PACE maintenance period is scheduled to begin at 6:00 AM on Wednesday, August 10, and end at 11:59 PM on Friday, August 12. As usual, jobs that request durations that would extend into the maintenance period will be held by the scheduler to run after maintenance is complete. During the maintenance window, access to all PACE-managed computational and storage resources will be unavailable. This includes Phoenix, Hive, Firebird, PACE-ICE, COC-ICE, and Buzzard.
Tentative list of activities:
ITEMS REQUIRING USER ACTION:
• [Hive] Slurm migration and software stack update first phase for Hive cluster – see recent announcement for details
ITEMS NOT REQUIRING USER ACTION:
• [Hive][Storage] Cable replacement for GPFS (project/scratch) controller
• [Datacenter] Transformer repairs
• [Datacenter] Cooling tower cleaning
• [Scheduler] Accounting database maintenance
• [Firebird][Network] Add VPC redundancy
If you have any questions or concerns, please contact us at pace-support@oit.gatech.edu.
[7/18/22 Early reminder]
Dear PACE Users,
This is friendly reminder that our next Maintenance period is scheduled to begin at 6:00AM on Wednesday, 08/10/2022, and it is tentatively scheduled to conclude by 11:59PM on Friday, 08/12/2022. As usual, jobs with resource requests that would be running during the Maintenance Period will be held until after the maintenance by the scheduler. During this Maintenance Period, access to all the PACE managed computational and storage resources will be unavailable.
As we get closer to this maintenance, we will share further details on the tasks, which will be posted here.
If you have any questions or concerns, please do not hesitate to contact us at pace-support@oit.gatech.edu.
Best,
The PACE Team