[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Jobs are not spread across various machines




However, when I submit some jobs, they run on only on the node they are
submitted on.

The most common reason that I have seen is that Condor has been configured to have a different FILESYSTEM_DOMAIN on each machine--it's set to be the full hostname. If you don't tell Condor to transfer files, it assumes that there isn't a shared filesystem between computers (like NFS) if the FILESYSTEM_DOMAINs on two machines are different.


So: if you have a shared filesystem, FILESYSTEM_DOMAIN should be the same. If you don't have a shared filesystem, you need to tell Condor to transfer files.

See Sections 2.5.3 and 2.5.4 of the Condor 6.5 manual for more information.

There are some other common reasons for this problem. See this question in the FAQ from the manual.

http://www.cs.wisc.edu/condor/manual/v6.7/7_3Running_Condor.html#SECTION00833000000000000000

Also, make sure that DNS is working in two ways: you can lookup a name from an IP address and an IP address from a name.

-alain