We now have a small cluster available for use by all PACE participants. The TestFlight cluster is a small testing resource intended to provide an insulated environment for debugging and validating both user code and system updates.
Currently, it is running a slightly updated software stack relative to our other clusters. Its main features are standard updates from Red Hat and a newer version of the Panasas client we use to access the high-performance scratch storage. We are considering this software stack for roll-out at our next maintenance window on April 19, and ask that users test their current codes between now and then. Users may test the new software by submitting their jobs to a new queue named “testflight”.
Because resources on this cluster are limited, the initial configuration of this queue limits each user to two (2) actively running jobs, each of which may run for no longer than 6 hours. Additional jobs may be queued into the TestFlight cluster, but at most two will run at a time for any one user. The scheduler will run jobs for as many different users as the queued jobs and available resources allow. The details of this queue, as well as our other queues, are published in our user guide, at http://share-it.gatech.edu/oit/pace/user-guide/the-job-scheduler/queue-names.
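To make the queue limits concrete, here is a minimal Python sketch of how such a policy could behave. It is only an illustration, not the scheduler's actual implementation: the function name `select_jobs_to_start`, the job tuple layout, and the single `free_slots` count standing in for available resources are all assumptions made for this example. Only the two limits (two running jobs per user, 6 hours per job) come from the queue configuration described above.

```python
from collections import defaultdict

# Limits taken from the testflight queue description.
MAX_RUNNING_PER_USER = 2
MAX_WALLTIME_HOURS = 6


def select_jobs_to_start(queued, running_counts, free_slots):
    """Return the IDs of queued jobs that may start now (hypothetical model).

    queued         -- list of (job_id, user, walltime_hours), in queue order
    running_counts -- dict mapping user -> number of jobs already running
    free_slots     -- how many more jobs the cluster can start right now
                      (an assumed simplification of real resource matching)
    """
    counts = defaultdict(int, running_counts)
    started = []
    for job_id, user, walltime_hours in queued:
        if len(started) >= free_slots:
            break  # no resources left
        if walltime_hours > MAX_WALLTIME_HOURS:
            continue  # over the 6-hour limit; never eligible in this model
        if counts[user] >= MAX_RUNNING_PER_USER:
            continue  # user already has two running jobs; job stays queued
        counts[user] += 1
        started.append(job_id)
    return started


if __name__ == "__main__":
    queue = [
        ("a1", "alice", 4),
        ("a2", "alice", 6),
        ("a3", "alice", 1),  # waits: alice would exceed two running jobs
        ("b1", "bob", 8),    # skipped: requests more than 6 hours
        ("b2", "bob", 2),
    ]
    print(select_jobs_to_start(queue, {}, free_slots=4))
```

In this model, a user's third job simply waits in the queue until one of that user's running jobs finishes, while jobs from other users can still start if resources remain.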
Equipment details of the TestFlight cluster may be found on our web site, at http://pace.gatech.edu/testflight-cluster.
Unless we find significant issues, these updates will be deployed on the clusters listed below. Please note that the Atlas cluster is excluded from this list.
- aryabhata
- athena
- atlantis
- biocluster
- cygnus
- force
- joe
- uranus