Re: [condor-users] Problem - jobs only running on one machine




But when I run a sequence of 20 jobs on the pool, they only execute on one machine: this one.

Any ideas?

The most common reason I have seen is that Condor has been configured with a different FILESYSTEM_DOMAIN on each machine, because it is set to the machine's full hostname. Unless you tell Condor to transfer files, it assumes a job needs a filesystem shared with the submit machine (like NFS), and it treats two machines as sharing a filesystem only if their FILESYSTEM_DOMAINs are the same. When every machine has a different FILESYSTEM_DOMAIN, the only machine that qualifies is the one you submitted from.


So: if you have a shared filesystem, FILESYSTEM_DOMAIN should be set to the same value on every machine that shares it. If you don't have a shared filesystem, you need to tell Condor to transfer files.
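
To make the rule concrete, here is a rough Python sketch of the behaviour described above. It is a simplified model for illustration only, not Condor's actual matchmaking code; the function name eligible_machines and the example.edu hostnames are made up.

```python
def eligible_machines(submit_domain, machine_domains, transfer_files=False):
    """Return the machines a job could run on under the simplified rule.

    submit_domain   -- FILESYSTEM_DOMAIN of the submit machine
    machine_domains -- dict mapping machine name to its FILESYSTEM_DOMAIN
    transfer_files  -- True if the job asks Condor to transfer its files
    """
    if transfer_files:
        # With file transfer, no shared filesystem is needed,
        # so FILESYSTEM_DOMAIN does not restrict the match.
        return sorted(machine_domains)
    # Without file transfer, only machines in the same FILESYSTEM_DOMAIN
    # as the submit machine are assumed to share a filesystem.
    return sorted(
        name for name, domain in machine_domains.items()
        if domain == submit_domain
    )


# Hypothetical pool where FILESYSTEM_DOMAIN is each machine's full hostname.
per_host = {
    "node1.example.edu": "node1.example.edu",
    "node2.example.edu": "node2.example.edu",
    "node3.example.edu": "node3.example.edu",
}

# The same hypothetical pool with one FILESYSTEM_DOMAIN for all machines.
shared = {name: "example.edu" for name in per_host}

print(eligible_machines("node1.example.edu", per_host))
print(eligible_machines("example.edu", shared))
print(eligible_machines("node1.example.edu", per_host, transfer_files=True))
```

In this model, the per-hostname pool leaves only the submit machine eligible, which matches the symptom in the question. Either a common FILESYSTEM_DOMAIN or file transfer makes the other machines eligible again.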

See Sections 2.5.3 and 2.5.4 of the Condor 6.5 manual for more information.

There are some other common reasons for this problem. See this question in the FAQ from the manual.

http://www.cs.wisc.edu/condor/manual/v6.5/7_3Running_Condor.html#SECTION00833000000000000000

-alain


---------------------------------------------------------------------
Alain Roy                     University of Wisconsin-Madison
Condor Project Research       Department of Computer Sciences
roy@xxxxxxxxxxx               1210 W. Dayton St.
+1 608 265 5736               Madison, WI 53706-1685 USA
