Platform LSF allowed you to pack or spread jobs as required. Before that, just a few terms to defined
- Packing means always placing jobs on the hosts with the least available slots first. Packing jobs can make room for bigger parallel jobs.
- Spreading tries to spread jobs out and places jobs on the hosts with the most available slots first. Spreading jobs maximizes the performance of individual jobs.
I will deal with one situation where I want to pack all the serial jobs neatly in as few nodes as possible.
But here are some terms from the LSF Wiki
The slots keyword represents available slots on each host and it is a built-in numeric decreasing resource. When a job occupies some of the job slots on the host, the slots resource value is decreased accordingly. For example, if MXJ of an LSF host is defined as 8, the slots value will be 8 when the host is empty. When 6 LSF job slots have been occupied, slots becomes 2. The slots resource can only be used in select and order sections of a job resource requirement string. To apply a job packing or spreading policy, you can use the order section in the job resource requirement. For example, -R “order[-slots]” will order candidate hosts based on the least available slots, while –R “order[slots]” will order candidate hosts based on the hosts with the most available slots.
To use ! in an order clause, you must set SCHED_PER_JOB_SORT=Y in lsb.params. To make the parameter take effect, run badmin mbdrestart or badmin reconfig on the master host to reconfigure mbatchd.
The following is an example of using the slots resource:
Step 1: Configure RES_REQ in a Queue section of lsb.queues.
Begin Queue QUEUE_NAME = myqueue … RES_REQ = order[-slots] … End Queue
Step 2: Make the configuration take effects
# badmin reconfig
Step 3: Check whether the configuration take effects.
# bqueues -l myqueue QUEUE: myqueue … RES_REQ: order[-slots]
You can also do it at the application level. For more information, please read the PLatfrom LSF Wiki