[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] condor_status: startd not showing up...



Hi,
 
 New to Condor, but have it working (sort of) on serveral Linux servers.
 
 In the local config file for nps20235 I have the following daemons; DAEMON_LIST = MASTER,STARTD,SCHEDD. When I do a 'ps' on nps20235 they are running and there are no errors in the respective log files.
 
 But, when I run the condor_status command, the '-startd' option doesn't show any output.
 
$ condor_status -master nps20235
nps20235.netezza.com
$ condor_status -schedd nps20235
Name                 Machine    TotalRunningJobs TotalIdleJobs TotalHeldJobs
nps20235.netezza.com nps20235.n                0             0              0
                      TotalRunningJobs      TotalIdleJobs      TotalHeldJobs
                   
               Total                 0                  0                  0
$ condor_status -startd nps20235
$ condor_status -direct nps20235
condor_status: Can't find address for startd nps20235.netezza.com
 
 The .startd_address file is there. Permission are fine, and the content is ok.
 
 All other servers are perfectly fine. Except this one.
 
 The only difference with nps20235 is that it's got a '172' IP address and the other are all on '192'. I've set the appropriate HOSTALLOW_READ and HOSTALLOW_WRITE for the 192.168.*.* and 172.16.*.* networks.
 
The firewall in between is 100% open. Verified that with IT, and some 'socket' tests programs.
 
 The server nps20235 is located in the UK and the other servers are in the US. There is a fast T1 hooking up the network.
 
 Any ideas as to where to start debugging this?


Thanks,
Paul Wolmering

Cell: 617-803-3671
Email: pwolmering@xxxxxxxxx


Do you Yahoo!?
Yahoo! Mail - You care about security. So do we.