Transferring data with HTCondor
Overview
This page introduces users to transferring files with HTCondor's built-in mechanism for moving binaries and files to and from compute nodes. If users have relatively small amounts of data and binaries to transfer (<100MB) or need to do ad-hoc job submissions, this mechanism can be effective.
Conventions
- In the examples used on this page, text in red is a placeholder and needs to be replaced with user-specific information (e.g. username)
- Names of servers are denoted using blue text (e.g. login.cms.ci-connect.org)
- Directory or file names are denoted using green text (e.g. ~/my_file)
Preliminaries
Before getting started, users should log in to login.cms.ci-connect.org and get a copy of the tutorial files:
% ssh login.cms.ci-connect.org
$ tutorial htcondor-transfer
$ cd tutorial-htcondor-transfer
Word Distribution Example
This example uses the HTCondor transfer mechanisms to transfer a binary (distribution) and a file with a list of words (random_words) to the compute nodes that run the jobs. The HTCondor submit file used for this example is transfer.submit.
The key part of the submit file is the transfer_input_files parameter, which gives a comma-separated list of paths to the files that will be transferred. In addition, ShouldTransferFiles needs to be set to YES and when_to_transfer_output needs to be set to ON_EXIT so that HTCondor returns the output when the job exits.
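As a rough illustration of how these three settings fit together, a submit file along the following lines could be used. This is a sketch, not the tutorial's actual transfer.submit: the wrapper script name (run_distribution.sh), the output, error, and log file names, and the queue count are all hypothetical. Only transfer_input_files, ShouldTransferFiles, when_to_transfer_output, and the two input file names come from the text above.

```
# Sketch of a submit file using HTCondor's built-in file transfer.
# Assumed: run_distribution.sh is a hypothetical wrapper that runs
# ./distribution on random_words once both files reach the compute node.
universe                = vanilla
executable              = run_distribution.sh

# Comma-separated list of paths, relative to the submit directory.
transfer_input_files    = distribution, random_words
ShouldTransferFiles     = YES
when_to_transfer_output = ON_EXIT

# Assumed file names; $(Cluster) and $(Process) keep jobs from overwriting each other.
output                  = job.$(Cluster).$(Process).out
error                   = job.$(Cluster).$(Process).err
log                     = job.$(Cluster).log

# Assumed job count.
queue 1
```

Comments are kept on their own lines because a trailing `#` after a value is generally treated as part of that value rather than as a comment.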
Path warning
You must run condor_submit in the same directory in which you created the files and directories. Otherwise, HTCondor will report an error because it cannot find the distribution and random_words files.
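One way to catch this problem before submitting is to confirm that the input files are present in the current directory. The helper below is a minimal sketch; check_inputs is a hypothetical function name, not part of HTCondor or the tutorial files.

```shell
# Hypothetical helper: report any listed file that is missing from the
# current directory. Returns 0 if all files exist, 1 otherwise.
check_inputs() {
    missing=0
    for f in "$@"; do
        if [ ! -e "$f" ]; then
            echo "missing: $f" >&2
            missing=1
        fi
    done
    return "$missing"
}
```

For example, `check_inputs distribution random_words transfer.submit && condor_submit transfer.submit` would submit only when all three files are found in the current directory.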
$ condor_submit transfer.submit
When the jobs are completed, verify the output: