[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] quill on farm with multiple schedds



Dear Condor team


I set up quill on a small test farm consisting of 4 nodes. Jobs can be submitted from all 4 of the nodes so schedd is running on all of them. In addition each
of them is running the quill daemon with a unique quill name for each node.
One node also serves as the master and one also serves the postgres data base.

When issuing the condor_q command there seems to be communication to all quills (see below) The problem is only that the response doesn't represent the status of the farm.
I know that 8 jobs are running just fine.

So the question now is
- what am I doing wrong?
- how to diagnose it
- how to fix it




wenzel@hotdog47 tcondor]$ /opt/condor/bin/condor_status

Name OpSys Arch State Activity LoadAv Mem ActvtyTime

vm1@hotdog47. LINUX INTEL Claimed Busy 0.000 334 0+00:00:02 vm2@hotdog47. LINUX INTEL Claimed Busy 0.000 334 0+00:00:02 vm3@hotdog47. LINUX INTEL Unclaimed Idle 0.000 334 0+00:25:06 vm1@hotdog48. LINUX INTEL Claimed Busy 0.000 334 0+00:03:13 vm2@hotdog48. LINUX INTEL Claimed Busy 0.000 334 0+00:02:31 vm3@hotdog48. LINUX INTEL Unclaimed Idle 0.010 334 0+00:25:06 vm1@hotdog49. LINUX INTEL Claimed Busy 0.000 334 0+00:00:02 vm2@hotdog49. LINUX INTEL Claimed Busy 0.000 334 0+00:00:03 vm3@hotdog49. LINUX INTEL Unclaimed Idle 0.010 334 0+00:25:06 vm1@hotdog54. LINUX INTEL Claimed Busy 0.000 334 0+00:00:43 vm2@hotdog54. LINUX INTEL Claimed Busy 0.000 334 0+00:00:43 vm3@hotdog54. LINUX INTEL Unclaimed Idle 0.000 334 0+00:25:06

Total Owner Claimed Unclaimed Matched Preempting Backfill

INTEL/LINUX 12 0 8 4 0 0 0

Total 12 0 8 4 0 0 0


---------------------------------------------------------------------------
[wenzel@hotdog47 tcondor]$ /opt/condor/bin/condor_q -submitter wenzel


-- Quill: hotdog49_quilld@xxxxxxxxxxxxxxxxx : <131.225.206.133:5432> : quill
ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD

0 jobs; 0 idle, 0 running, 0 held


-- Quill: hotdog48_quilld@xxxxxxxxxxxxxxxxx : <131.225.206.133:5432> : quill
ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD

0 jobs; 0 idle, 0 running, 0 held


-- Quill: hotdog54_quilld@xxxxxxxxxxxxxxxxx : <131.225.206.133:5432> : quill
ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD

0 jobs; 0 idle, 0 running, 0 held


-- Quill: hotdog47_quilld@xxxxxxxxxxxxxxxxx : <131.225.206.133:5432> : quill
ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD

0 jobs; 0 idle, 0 running, 0 held
--------------------------------------------------------------------------------

begin:vcard
fn:Dr. Hans  Wenzel
n:Wenzel;Dr. Hans 
org:;CD/CMS
adr:P.O. Box 500 ;;Mail Station: 205;Batavia ;Illinois;60510;USA
email;internet:wenzel@xxxxxxxx
title:Dr
tel;work:630 840 6034
tel;home:630 393 1756
tel;cell:630 253 1984
url:http://home.fnal.gov/~wenzel/
version:2.1
end:vcard