Job Submission (submit.yml.erb
)
The configuration file submit.yml.erb
controls the content of the batch
script as well as the submission arguments used when submitting the batch job.
It is located in the root of the application directory.
Tip
This page is an introduction to the submit.yml.erb
, it’s contents
and it’s use.
Reference documentation holds all configuration items for submit.yml.erb.
Assuming we already have a sandbox Interactive App deployed under:
${HOME}/ondemand/dev/my_app
The submit.yml.erb
configuration file can be found at:
${HOME}/ondemand/dev/my_app/submit.yml.erb
The .erb
file extension will cause the YAML configuration file to be
processed using the eRuby (Embedded Ruby) templating system. This allows you
to embed Ruby code into the YAML configuration file for flow control, variable
substitution, and more.
Configuration
The three possible configuration options that can be used in the
submit.yml.erb
file are given as:
- batch_connect (Hash)
The configuration describing the batch script content. Reference documentation holds all configuration items for submit.yml.erb.
- Example
Use the default basic web server template
# ${HOME}/ondemand/dev/my_app/submit.yml.erb --- batch_connect: template: "basic"
- script (Hash)
The configuration describing the job submission parameters for the batch script. Reference documentation holds all configuration items for submit.yml.erb.
- Example
Set the job’s charged account and queue
# ${HOME}/ondemand/dev/my_app/submit.yml.erb --- script: accounting_id: "PZS0001" queue_name: "parallel"
- cluster (String) (Optional)
the cluster to submit to
Tip
Use this field when you need to choose the cluster in a more dynamic way than what’s provided in the form. Or when you want to choose the cluster for the user given some choices like the example below.
- Example
Submit the job to the the large cluster if requesting more than 29 cores, else submit to the small cluster.
# ${HOME}/ondemand/dev/my_app/submit.yml.erb <%- cluster = if num_cores >= 29 "large_cluster" else "small_cluster" end -> --- cluster: "<%= cluster %>"
Each of these configuration options take a set of their own configuration options described below.
Batch Connect Template types
Open OnDemand has three distinct batch connect template types. These template types determine the resulting shell scripts that wrap your batch connect application.
For example the vnc
type will provide a script that boots up a
VNC server, exports a DISPLAY
variable and other utilities to
ensure that the job has been bootstrapped appropriately to
start GUI applications.
The options available are basic
, vnc
or vnc_container
.
vnc_container
is basically just like vnc
only in that it
bootstraps the VNC programs inside a container instead of on the
host machine.
- template (String)
The template used to create the wrapper script for your batch connect application.
- Default
Generate a basic wrapper script for HTTP applications.
--- batch_connect: template: "basic"
- Example
Generate a wrapper script for a VNC interactive application.
# ${HOME}/ondemand/dev/my_app/submit.yml.erb --- batch_connect: template: "vnc"
Reference documentation holds all configuration items for submit.yml.erb.
Note
The configuration template: "vnc"
comes with more batch_connect
configuration options which can be found under the code documentation for
Batch Connect VNC Options.
Configure Script
The script
configuration option defines the batch job submission parameters
(e.g., number of nodes, wall time, queue, …). The list of all possible
options can be found under the code documentation for Batch Connect Script Options.
It is recommended to refrain from using the native
option to best keep your
Interactive App as portable as possible. Although we understand this may not be
possible for all job submission parameters (e.g., number of nodes, memory, GPU)
it would be best to use the respective option corresponding to the submission
parameter if it is available.
For example, if I want to specify the charged account for the job, it is recommended I use:
# ${HOME}/ondemand/dev/my_app/submit.yml.erb
---
script:
accounting_id: "PZS0001"
as this is resource manager agnostic. But this can also be added for a Slurm resource manager as:
# ${HOME}/ondemand/dev/my_app/submit.yml.erb
---
script:
native: ["-A", "PZS0001"]
but now this app may not work at a center with a different resource manager.
Warning
Care must be taken when using the native
option as this is resource
manager specific. For all supported resource managers (e.g., Slurm, LSF,
PBSPro, …) other than Torque, the native
option is specified as an
array of command line arguments that are fed to the resource manager’s batch
submission tool (e.g., sbatch, qsub, bsub,
…)
So for Slurm, the following configuration will submit a job to 5 nodes with
feature c12
:
# ${HOME}/ondemand/dev/my_app/submit.yml.erb
---
script:
native: ["-N", "5", "-C", "c12"]
Examples
The simplest example consists of submitting a batch script built from the basic web server template using all the default options for the cluster’s batch job submission tool (e.g., sbatch, qsub, bsub, …).
# ${HOME}/ondemand/dev/my_app/submit.yml.erb
---
batch_connect:
template: "basic"
VNC Server
To submit a batch script built from the VNC server template:
# ${HOME}/ondemand/dev/my_app/submit.yml.erb
---
batch_connect:
template: "vnc"
Change Executable for Main Script
When the batch script is rendered from the template, one of the possible configuration options is specifying the executable command called for the main script it forks off into the background. This can be configured with:
# ${HOME}/ondemand/dev/my_app/submit.yml.erb
---
batch_connect:
template: "basic"
script_file: "./my_custom_script.sh"
Specify Job Submission Parameters
Cherry-picking some possible options from Batch Connect Script Options gives a batch job built from the basic web server template submitted with the following parameters:
# ${HOME}/ondemand/dev/my_app/submit.yml.erb
---
batch_connect:
template: "basic"
script:
wall_time: 3600
queue_name: "debug"
email_on_started: true
job_environment:
LICENSE_FILE: "1234@license.center.edu"
Dynamically Set Submission Parameters
Feel free to take advantage of the eRuby (Embedded Ruby) templating system
in the submit.yml.erb
file. You have access to all the
User Form (form.yml.erb) attributes.
For example, if you had a form attribute called number_of_hours
that you
had the user fill out. You can add this to the submission parameters as such:
# ${HOME}/ondemand/dev/my_app/submit.yml.erb
---
batch_connect:
template: "basic"
script:
wall_time: <%= (number_of_hours.blank? ? 1 : number_of_hours.to_i) * 3600 %>
We have to be careful here, because all form attributes are returned as Ruby Strings. So we need to:
First determine if the user filled in the attribute (check if it is
#blank?
).If they did, then we need to convert the string to an integer (
#to_i
) before performing arithmetic operations on it.Finally we convert hours to seconds.
Another scenario would be if the user specified the queue directly with a
custom form attribute called my_queue
. We can then add this user-supplied
queue conditionally to the submission parameters as such:
# ${HOME}/ondemand/dev/my_app/submit.yml.erb
---
batch_connect:
template: "basic"
script:
wall_time: 3600
<%- unless my_queue.blank? -%>
queue_name: <%= my_queue %>
<%- end -%>
email_on_started: true
In this case, queue_name
will only be added to the submission parameters if
the user supplied a non-blank value to the form attribute my_queue
.
Note
Most of the common form attributes that manipulate the job submission
parameters are provided for you as
Predefined Attributes. These
special attributes fill-in the script
configuration options internally,
so you do not have to.
For example, if you used the predefined form attribute bc_queue
, you do
not need to specify queue_name:
in the submit.yml.erb
.
Setting Batch Connect Options Globally
All of these configuration items can also be applied globally to the entire cluster
in the cluster definition files under /etc/ood/config/clusters.d/
. If set globally,
the option is applied to all applications in that cluster.
Here’s an example of how to set the header
configuration for both vnc
and basic
templates.
v2: batch_connect: basic: header: "#!/bin/bash" vnc: header: "#!/bin/bash"