Job queues and parallel job environments at Numerikserver
Preliminary notes about the architecture of the Numerikserver
- SGE provides several special environments for parallel job types, the parallel environments (PE). Each process of a parallel job is bound to one slot. In the current setup, a slot corresponds to one processor core.
- Parallel jobs must specify the corresponding parallel environment in their batch job script.
- The Numerikserver is divided into 21 clusters, each containing nodes with a specific hardware architecture.
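As an illustration of the second note, a batch job script might request a parallel environment roughly as sketched below. The `#$` directives `-N`, `-cwd` and `-pe` and the `NSLOTS` variable are standard SGE features. The job name `pe_demo` is hypothetical, and the local setup may require further options that are not documented on this page.

```shell
#!/bin/bash
# Job name (hypothetical).
#$ -N pe_demo
# Run the job in the directory it was submitted from.
#$ -cwd
# Parallel environment and number of slots: here 9 slots on a single node.
#$ -pe mp 9

# SGE sets NSLOTS to the number of granted slots at run time.
# Default to 1 so the script also runs outside the scheduler.
slots="${NSLOTS:-1}"
echo "running with ${slots} slot(s)"
```

Such a script would typically be submitted with `qsub`, for example `qsub job.sh`, where `job.sh` is a placeholder file name.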
About queues and parallel environments in general
- Queues classify jobs by their run-time (wall clock time).
- Parallel environments (PE) classify jobs by their type of parallelism.
Query list of available queues
- type the command:
qconf -sql
Query properties of a single queue
- type the command:
qconf -sq [name_of_queue]
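The two queries above can be combined into a small helper that prints one line per queue. This is only a sketch: the function name `show_queue_limits` is hypothetical, and it assumes that the run-time limit of a queue is stored in the `h_rt` attribute of the `qconf -sq` output, which this page does not confirm.

```shell
# Print "<queue>: <h_rt value>" for every queue known to SGE.
# Assumption: the run-time limit is configured as h_rt in the queue definition.
show_queue_limits() {
    local q limit
    for q in $(qconf -sql); do
        # Pick the value of the h_rt attribute from the queue definition.
        limit=$(qconf -sq "$q" | awk '$1 == "h_rt" { print $2 }')
        printf '%s: %s\n' "$q" "${limit:-unknown}"
    done
}

# Only call the helper where the SGE client tools are installed.
if command -v qconf >/dev/null 2>&1; then
    show_queue_limits
fi
```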
Available queues - different run-time limits
- IMPORTANT NOTE: The appropriate queue is chosen automatically according to the run-time requirements of your job. There is no need to specify the queue explicitly.
- short: 12 hours
- medium: 72 hours (3 d)
- long: 220 hours (9 d 4 h)
- verylong (only on three clusters): 1344 hours (56 d)
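To see which queue a given run-time would fall into, the limits listed above can be written as a small lookup. This is a sketch for orientation only, not the scheduler's actual logic: the function name `queue_for_hours` is hypothetical, and it assumes that the limits are inclusive and that the shortest fitting queue is chosen. It also ignores that verylong exists only on three clusters.

```shell
# Map a requested wall clock time (whole hours) to the queue whose limit covers it.
queue_for_hours() {
    local hours="$1"
    if   [ "$hours" -le 12 ];   then echo short
    elif [ "$hours" -le 72 ];   then echo medium
    elif [ "$hours" -le 220 ];  then echo long
    elif [ "$hours" -le 1344 ]; then echo verylong
    else
        echo "no queue allows ${hours} h" >&2
        return 1
    fi
}

queue_for_hours 48    # prints: medium
```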
Defining parallel job environments (PE)
- General syntax defining a PE:
[mp|ompi]<clusterid|*>_<noprocs-per-node> [number_of_processes|range_number_of_processes]
(mandatory arguments are given in [] and optional arguments in <>)
- Several examples:
- SMP/MPI job intranode (request 9 slots on a single node within any cluster):
mp 9
- MPI job internode (request 18 slots on a variable number of nodes in any cluster; NB: all nodes reside within the same cluster):
ompi 18
or
ompi* 18
- MPI job internode on a specific cluster (request 20 slots on a variable number of nodes residing in cluster 4):
ompi4 20
- MPI job internode with explicit partitioning (request 32 slots in total, using 8 slots on each node):
ompi*_8 32
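The naming scheme and the arithmetic behind explicit partitioning can be sketched as two small helpers. Both function names, `pe_name` and `nodes_needed`, are hypothetical, and the check that the total must be a multiple of the per-node count is an assumption about how a fixed per-node allocation behaves here.

```shell
# Build a PE name from type, optional cluster id (or *), optional processes per node.
pe_name() {
    local type="$1" cluster="${2:-}" per_node="${3:-}"
    local name="${type}${cluster}"
    if [ -n "$per_node" ]; then
        name="${name}_${per_node}"
    fi
    printf '%s\n' "$name"
}

# Number of nodes implied by a total slot count and a fixed per-node count.
# Assumption: the total must be a multiple of the per-node count.
nodes_needed() {
    local total="$1" per_node="$2"
    if [ $(( total % per_node )) -ne 0 ]; then
        echo "total slots must be a multiple of slots per node" >&2
        return 1
    fi
    echo $(( total / per_node ))
}

pe_name ompi '*' 8      # prints: ompi*_8
nodes_needed 32 8       # prints: 4
```

When a PE name containing `*` is passed on the command line rather than in a job script, it should be quoted so that the shell does not expand it, for example `qsub -pe 'ompi*_8' 32 job.sh` with `job.sh` as a placeholder.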
Query list of available PE
- type the command:
qconf -spl
Query properties of a single PE
- type the command:
qconf -sp [name_of_PE]
Query resources of a specific cluster node
- type the command:
qhost
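Called without arguments, `qhost` lists all nodes. To narrow the output to one node, the standard `qhost` options `-h` (restrict to a host) and `-F` (show resource values) can be combined, as in the sketch below. The function name `node_resources` and the host name in the usage line are hypothetical.

```shell
# Show the resources of a single node.
node_resources() {
    local node="${1:-}"
    if [ -z "$node" ]; then
        echo "usage: node_resources <hostname>" >&2
        return 2
    fi
    # -h restricts the listing to the given host, -F adds the resource values.
    qhost -h "$node" -F
}
```

A call would look like `node_resources node01`, where `node01` stands for one of the host names printed by `qhost`.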
date of revision: 06-19-2019 © kraus