
[Condor-users] condor_submit hangs / condor_q hangs



Hi everyone,

I'm having a problem with a fresh Condor install.
My setup is this:

1 master node (master)
2 interactive nodes (submit only: n00 and n01)
64 worker nodes (execute only: n02-n66)

Config structure:

master: /home/condor is NFS-exported to all nodes. Local configs are in
/home/condor/$HOSTNAME/condor_config.local


Whenever I use condor_submit, it hangs on "Submitting job.." and eventually
times out, saying that it failed to connect to the local machine on port x
and failed to fetch ads from localhost on port x.
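
One way to confirm this symptom independently of Condor would be to probe the
port directly from the submit node. The following is only a minimal sketch,
assuming Python is available there; can_connect is a hypothetical helper name,
and the host and port are placeholders to be filled in from the actual error
message, since the real port is not given above.

```python
import socket
import sys


def can_connect(host: str, port: int, timeout: float = 5.0) -> bool:
    """Return True if a TCP connection to (host, port) succeeds within timeout."""
    try:
        # create_connection resolves the host and attempts a TCP connect.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False


# Only runs when invoked as: python probe.py HOST PORT
# HOST and PORT are placeholders taken from the condor_submit error message.
if __name__ == "__main__" and len(sys.argv) == 3:
    host, port = sys.argv[1], int(sys.argv[2])
    state = "reachable" if can_connect(host, port) else "NOT reachable"
    print(f"{host}:{port} is {state}")
```

If the probe also fails, that would suggest nothing is accepting connections on
that port; if it succeeds, the problem is more likely above the TCP level.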

Whenever I use condor_q, it hangs, but if I run condor_q -global, it returns
the status of my Condor pool and shows some of the nodes in use.

The daemon list in my config on the submit nodes is as follows:
DAEMON_LIST = MASTER, SCHEDD

There are no errors in the local logs (log/*).

Can anybody make sense of this? Please let me know if more info is needed.

--
Adam Yates
Systems Administrator -- Research Infrastructure
Center for Computation and Technology
232 Johnston Hall,
Baton Rouge, LA 70803
W: 225.578.8235    C: 225.663.0218
<yates@xxxxxxxxxxx>