
[condor-users] Flocking



Hello all!

I'm trying to set up flocking between a Solaris pool and a Linux pool.
I configured it in both directions, as shown below:

 In the Linux global config file (on "ipc654"):

   FLOCK_FROM = isun01.inf.uni-jena.de
   FLOCK_TO   = isun01.inf.uni-jena.de

 In the Solaris global config file (on "isun01"):

   FLOCK_FROM = ipc654.inf-ra.uni-jena.de
   FLOCK_TO   = ipc654.inf-ra.uni-jena.de


Are there any other variables that I have to configure? And how can I see
whether flocking works? On Solaris I tried the command

  "condor_status -pool ipc654.inf-ra.uni-jena.de"

and on Linux:

  "condor_status -pool isun01.inf.uni-jena.de"

I get the following error message:
--------------------------------------------------------------------------
Error: Couldn't contact the condor_collector on ipc654.inf-ra.uni-jena.de.

Extra Info: the condor_collector is a process that runs on the central
manager of your Condor pool and collects the status of all the machines
and ......
--------------------------------------------------------------------------
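
One way to narrow down this error is to check whether the collector's port
can be reached at all from the other pool. The following is only a minimal
sketch: it assumes the collector listens on Condor's default port 9618
(a pool may be configured differently), and the function name
collector_reachable is made up for this example. It tests plain TCP
connectivity only, not Condor's own protocol or its host-based permissions.

```python
import socket

# Assumption: 9618 is the default condor_collector port; a pool may override it.
DEFAULT_COLLECTOR_PORT = 9618


def collector_reachable(host, port=DEFAULT_COLLECTOR_PORT, timeout=5.0):
    """Return True if a TCP connection to host:port can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers DNS failures, refused connections, and timeouts.
        return False


# Example use (hostnames taken from the configuration above):
#   collector_reachable("ipc654.inf-ra.uni-jena.de")
#   collector_reachable("isun01.inf.uni-jena.de")
```

If this returns False from one pool to the other, the problem is more likely
in name resolution, routing, or a firewall than in the FLOCK_* settings.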


As a test I wrote the following submit file. Both binaries are in the
same directory as the submit file, and I submitted it from "isun01".
The job runs on Solaris, which is why I commented out the Solaris
requirement, to see whether flocking works. It does not. I have put the
relevant part of the SchedLog after the submit file.

Please help me solve this problem with flocking.
Sorry about my bad English.

Thank you

- degi


################
#
# Condor submit file for simple test job example
#
################

Universe        = vanilla
Executable      = hello.$$(OpSys).$$(Arch)

Requirements    = (Arch == "INTEL" && OpSys == "LINUX")
#(Arch == "SUN4u" && OpSys == "SOLARIS28") ||\


input           = /dev/null
output          = he.out
error           = he.error
log             = he.log

Queue
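
For reference, the $$(OpSys) and $$(Arch) macros in the Executable line are
filled in from the ClassAd of the machine the job is matched to. The sketch
below only illustrates that substitution so the expected binary names are
clear. It is not Condor's implementation, the helper expand_executable is
made up for this example, and the attribute values are the ones named in the
Requirements lines above.

```python
import re


def expand_executable(template, machine_ad):
    """Replace each $$(Attr) with the value of Attr from the matched machine's ad."""
    def lookup(match):
        return str(machine_ad[match.group(1)])
    return re.sub(r"\$\$\((\w+)\)", lookup, template)


template = "hello.$$(OpSys).$$(Arch)"
linux_ad = {"OpSys": "LINUX", "Arch": "INTEL"}        # values from Requirements
solaris_ad = {"OpSys": "SOLARIS28", "Arch": "SUN4u"}  # values from the commented line

print(expand_executable(template, linux_ad))    # hello.LINUX.INTEL
print(expand_executable(template, solaris_ad))  # hello.SOLARIS28.SUN4u
```

So the two binaries in the submit directory have to be named exactly
hello.LINUX.INTEL and hello.SOLARIS28.SUN4u.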

-----------------------------------------------------------------------
SchedLog file

10/2 18:55:09 DaemonCore: Command received via UDP from host <141.35.14.22:41440>
10/2 18:55:09 DaemonCore: received command 421 (RESCHEDULE), calling handler (reschedule_negotiator)
10/2 18:55:09 Sent ad to central manager for baatarts@xxxxxxxxxxxxxxx
10/2 18:55:09 Called reschedule_negotiator()
10/2 18:55:09 Activity on stashed negotiator socket
10/2 18:55:09 Negotiating for owner: baatarts@xxxxxxxxxxxxxxx
10/2 18:55:09 Checking consistency running and runnable jobs
10/2 18:55:09 Tables are consistent
10/2 18:55:09 Out of servers - 0 jobs matched, 1 jobs idle, 1 jobs rejected
10/2 19:00:09 Activity on stashed negotiator socket
10/2 19:00:09 Negotiating for owner: baatarts@xxxxxxxxxxxxxxx
10/2 19:00:09 Checking consistency running and runnable jobs
10/2 19:00:09 Tables are consistent
10/2 19:00:09 Out of servers - 0 jobs matched, 1 jobs idle, 1 jobs rejected
10/2 19:00:09 Sent ad to central manager for baatarts@xxxxxxxxxxxxxxx
10/2 19:04:08 DaemonCore: Command received via UDP from host <141.35.14.22:43792>
10/2 19:04:08 DaemonCore: received command 421 (RESCHEDULE), calling handler (reschedule_negotiator)
10/2 19:04:08 Sent ad to central manager for baatarts@xxxxxxxxxxxxxxx
10/2 19:04:08 Called reschedule_negotiator()
10/2 19:04:08 Activity on stashed negotiator socket
10/2 19:04:08 Negotiating for owner: baatarts@xxxxxxxxxxxxxxx
10/2 19:04:08 Checking consistency running and runnable jobs
10/2 19:04:08 Tables are consistent
10/2 19:04:08 Out of servers - 1 jobs matched, 1 jobs idle, 1 jobs rejected
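
To compare the negotiation cycles at a glance, the "Out of servers" summary
lines can be pulled out of the SchedLog. This is only a rough sketch that
assumes the line format shown in the excerpt above. The function name
negotiation_summaries is made up, and the pattern allows the line break that
mail wrapping can put before "rejected".

```python
import re

# Matches the summary line, allowing any whitespace (including a line break)
# between the three counters.
SUMMARY = re.compile(
    r"(\d+/\d+ \d+:\d+:\d+) Out of servers - "
    r"(\d+) jobs matched,\s+(\d+) jobs idle,\s+(\d+) jobs\s+rejected"
)


def negotiation_summaries(log_text):
    """Return (timestamp, matched, idle, rejected) for each negotiation cycle."""
    return [
        (ts, int(matched), int(idle), int(rejected))
        for ts, matched, idle, rejected in SUMMARY.findall(log_text)
    ]


sample = """10/2 18:55:09 Out of servers - 0 jobs matched, 1 jobs idle, 1 jobs
rejected
10/2 19:04:08 Out of servers - 1 jobs matched, 1 jobs idle, 1 jobs
rejected
"""
for row in negotiation_summaries(sample):
    print(row)
```

On the excerpt above, this shows no match in the 18:55:09 and 19:00:09
cycles and one matched job in the 19:04:08 cycle.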










