[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] jobs are only running at condor_master machine



Hello
i have setup a condor pool of two machines.
1st is condor master 
2nd is slave node.

when i submit the jobs  through condor master  it runs but at at
condor master machine.
jobs donot go any other machine even the machine are idle.

when i submit the jobs with the 2nd machine they remains idle in the
Que and never runs even on the same machine .

in either case i have found same error message in 

---------- Started Negotiation Cycle ----------
8/29 20:16:45 Phase 1:  Obtaining ads from collector ...
8/29 20:16:45   Getting all public ads ...
8/29 20:16:45   Sorting 7 ads ...
8/29 20:16:45   Getting startd private ads ...
8/29 20:16:45 Got ads: 7 public and 2 private
8/29 20:16:45 Public ads include 1 submitter, 2 startd
8/29 20:16:45 Phase 2:  Performing accounting ...
8/29 20:16:45 Phase 3:  Sorting submitter ads by priority ...
8/29 20:16:45 Phase 4.1:  Negotiating with schedds ...
8/29 20:16:45   Negotiating with condor@xxxxxxxxxxxxxxxxxxxxxxx at
<**.26.146.226:1173>
8/29 20:17:15 select returns 0, connect failed
8/29 20:17:15 Will keep trying for 30 seconds...
8/29 20:17:16 Connect failed for 30 seconds; returning FALSE
8/29 20:17:16     Failed to connect to <**.26.146.226:1173>
8/29 20:17:16   Error: Ignoring schedd for this cycle
8/29 20:17:16 ---------- Finished Negotiation Cycle ----------

what is  the problem here
why the central manger is unable to connect with  other  machine nodes
in the pool.
if I see the condor_status then it shows both computer in the list

Name          OpSys       Arch   State      Activity   LoadAv Mem   ActvtyTime

masterpc   LINUX       INTEL  Unclaimed  Idle       0.050   750  0+00:30:04
slavepc        LINUX       INTEL  Unclaimed  Idle       0.000   750  0+00:25:15

                     Machines Owner Claimed Unclaimed Matched Preempting

         INTEL/LINUX        2     0       0         2       0          0

               Total        2     0       0         2       0          0


any help
thanks in advance

Narunjan