4. Custom Job Submission

The Job Submission (submit.yml.erb) configuration file describes how the batch job should be submitted to your cluster. The location of this file must be specified in the respective /etc/ood/config/apps/bc_desktop/my_cluster.yml form configuration file, so that when a user submits the form, the specified submission configuration is used when submitting the batch job.

To customize job submission we will need to first edit our custom desktop app User Form (form.yml.erb) YAML file as such:

# /etc/ood/config/apps/bc_desktop/my_cluster.yml
---
title: "My Cluster Desktop"
cluster: "my_cluster"
submit: "submit/my_submit.yml.erb"

Notice we included the configuration option submit that points to our custom Job Submission (submit.yml.erb) YAML configuration file. This can be an absolute file path or a relative file path with respect to the /etc/ood/config/apps/bc_desktop/ directory. It is important to notice you must use this or some other directory outside the app's root.

Note

The *.erb file extension will cause the YAML configuration file to be processed using the eRuby (Embedded Ruby) templating system. This allows you to embed Ruby code into the YAML configuration file for flow control, variable substitution, and more.

Danger

Do not put the Job Submission (submit.yml.erb) configuration file directly underneath /etc/ood/config/apps/bc_desktop. If you do, OOD will think this a different app's form.yml. Instead we typically create the directory submit/ underneath the app's root directory and put our Job Submission (submit.yml.erb) configuration files underneath that.

We can now create and modify the Job Submission (submit.yml.erb) configuration file at:

/etc/ood/config/apps/bc_desktop/submit/my_submit.yml.erb

Since it has the extension .erb we can take advantage of the Ruby language to make the configuration file dynamic. In particular, you will now have access to the user-submitted form arguments defined as:

bc_num_hours

Default: "1"

A Ruby String containing the number of hours a user requested for the Desktop batch job to run.

bc_num_slots

Default: "1"

A Ruby String containing either the number of nodes or processors (depending on the type of resource manager the cluster uses) a user requested.

bc_account

Default: ""

A Ruby String that holds the account the user supplied to charge the job against.

bc_queue

Default: ""

A Ruby String that holds the queue the user requested for the job to run on.

bc_email_on_started

Default: "0"

A Ruby String that can either be "0" (do not send the user an email when the job starts) or "1" (send an email to the user when the job starts).

node_type

Default: ""

A Ruby String that can be used for more advanced job submission. This is an advanced option that is disabled by default and does nothing if you do enable it, unless you add it to a custom job submission configuration file.

Some examples on how to submit jobs using the above form attributes are given in the following sections for the given resource manager.

Slurm

For most cases of Slurm you will want to modify how the bc_num_slots (number of nodes) is submitted to the batch server.

This can be handled in your custom job submission configuration file as such:

# /etc/ood/config/apps/bc_desktop/submit/my_submit.yml.erb
---
script:
  native:
    - "-N"
    - "<%= bc_num_slots.blank? ? 1 : bc_num_slots.to_i %>"

All batch script options are underneath the script configuration option. In particular since there is no option to modify number of nodes, we need to directly interact with the native command line arguments. This is specified as an array of sbatch arguments.

Note

It is recommended you use the corresponding batch script options before using the native fallback.

Torque

For most cases of Torque you will want to modify how the bc_num_slots (number of nodes) is submitted to the batch server.

This can be handled in your custom job submission configuration file as such:

# /etc/ood/config/apps/bc_desktop/submit/my_submit.yml.erb
---
script:
  native:
    resources:
      nodes: "<%= bc_num_slots.blank? ? 1 : bc_num_slots.to_i %>:ppn=28"

All batch script options are underneath the script configuration option. In particular since there is no option to modify number of nodes, we need to directly interact with the native command line arguments.

For more information on the available options for the native attribute when using Torque please see the pbs-ruby documentation.

Note

It is recommended you use the corresponding batch script options before using the native fallback.

PBS Professional

For most cases of PBS Professional you will want to modify how the bc_num_slots (number of CPUs on a single node) is submitted to the batch server.

This can be handled in your custom job submission configuration file as such:

# /etc/ood/config/apps/bc_desktop/submit/my_submit.yml.erb
---
script:
  native:
    - "-l"
    - "select=1:ncpus=<%= bc_num_slots.blank? ? 1 : bc_num_slots.to_i %>"

All batch script options are underneath the script configuration option. In particular since there is no option to modify number of nodes or CPUs, we need to directly interact with the native command line arguments. This is specified as an array of qsub arguments.

If you would like to mimic how Torque handles bc_num_slots (number of nodes), then we will first need to change the form label of bc_num_slots that the user sees in the form. This can be done by modifying our Desktop app local YAML configuration file:

# /etc/ood/config/apps/bc_desktop/submit/my_submit.yml.erb
---
title: "Cluster1 Desktop"
cluster: "cluster1"
attributes:
  bc_num_slots:
    label: "Number of nodes"
submit: "submit/my_submit.yml.erb"

Now when we go to the Desktop app form in our browser it will have the new label "Number of nodes" instead of "Number of CPUs on a single node".

Next we will need to handle how we submit the bc_num_slots since it means something different now. So now modify the job submission configuration file as such:

# /etc/ood/config/apps/bc_desktop/submit/my_submit.yml.erb
---
script:
  native:
    - "-l"
    - "select=<%= bc_num_slots.blank? ? 1 : bc_num_slots.to_i %>:ncpus=28"

You can also append mem=...gb to the select=... statement if you'd like.

Note

It is recommended you use the corresponding batch script options before using the native fallback.

LinuxHost Adapter

If you're using the LinuxHost you actually don't need a specialized submit.yml.erb. There is no need to specify resources like the other adapters above.

You can however, use it to override the adapter's global fields for mount binding and specifying which container use.

# /etc/ood/config/apps/bc_desktop/submit/linuxhost_submit.yml.erb
 ---
 batch_connect:
   native:
      singularity_bindpath: /etc,/media,/mnt,/opt,/run,/srv,/usr,/var,/fs,/home
      singularity_container: /usr/local/modules/netbeans/netbeans_2019.sif