[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] Claimed condor processes not showing up in condor_status



Why do I see a number of jobs from the scheduler node with condor_q and 
can see that the jobs are processing on the worker nodes, but a 
condor_status shows these nodes to be almost empty and the VMs idle (see 
below - there are 21 jobs running but condor_status only sees 4)?  
Almost the same information for condor_status is derived from the CM node 
(it sees 6 VMs claimed).

I switched to UPDATE_COLLECTOR_WITH_TCP over the weekend to avoid some
network issues on our site where we seem to drop lots of UDP packets.  
Seems unlikely this is the cause though???

Condor version: 6.7.3

% condor_q -run
-- Submitter: bigmac-lcg-ce.physics.utoronto.ca : <10.0.11.34:52723> : 
bigmac-lcg-ce.physics.utoronto.ca
 ID      OWNER            SUBMITTED     RUN_TIME HOST(S)         
4930.0   zeussgm         5/2  09:49   0+00:58:23 vm4@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
4939.0   zeussgm         5/2  09:52   0+00:54:15 vm2@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
4941.0   zeussgm         5/2  09:52   0+00:59:18 vm4@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
4944.0   zeussgm         5/2  09:53   0+00:47:09 vm3@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
4945.0   zeussgm         5/2  09:53   0+00:56:39 vm4@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
4946.0   zeussgm         5/2  09:53   0+00:45:48 vm4@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
4947.0   zeussgm         5/2  09:53   0+00:48:05 vm2@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
4952.0   zeussgm         5/2  09:54   0+00:44:44 vm3@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
4955.0   zeussgm         5/2  10:06   0+00:35:45 vm3@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
4956.0   zeussgm         5/2  10:07   0+00:36:08 vm2@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
4960.0   zeussgm         5/2  10:07   0+00:28:52 vm2@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
4969.0   zeussgm         5/2  10:08   0+00:35:54 vm4@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
4970.0   zeussgm         5/2  10:08   0+00:33:47 vm4@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
4973.0   zeussgm         5/2  10:08   0+00:23:11 vm3@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
4986.0   zeussgm         5/2  10:10   0+00:42:25 vm2@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
4988.0   zeussgm         5/2  10:10   0+00:42:29 vm3@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
4996.0   zeussgm         5/2  10:38   0+00:16:24 vm2@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
4997.0   zeussgm         5/2  10:38   0+00:14:20 vm2@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
4998.0   zeussgm         5/2  10:38   0+00:16:15 vm3@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
4999.0   zeussgm         5/2  10:38   0+00:07:44 vm2@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
5001.0   zeussgm         5/2  10:45   0+00:08:15 vm3@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

% condor_status
Name          OpSys       Arch   State      Activity   LoadAv Mem   
ActvtyTime

vm1@xxxxxxxxx LINUX       INTEL  Owner      Idle       0.000   501  1+23:34:10
vm2@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:05
vm3@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:05
vm4@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:05
vm1@xxxxxxxxx LINUX       INTEL  Owner      Idle       0.000   501  1+23:34:11
vm2@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:06
vm3@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:06
vm4@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:06
vm1@xxxxxxxxx LINUX       INTEL  Owner      Idle       0.000   501  1+23:34:15
vm2@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:10
vm3@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:10
vm4@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:10
vm1@xxxxxxxxx LINUX       INTEL  Owner      Idle       1.000   501  1+23:34:14
vm2@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.130   501  1+23:34:08
vm3@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:08
vm4@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:08
vm1@xxxxxxxxx LINUX       INTEL  Owner      Idle       0.000   501  1+23:34:13
vm2@xxxxxxxxx LINUX       INTEL  Claimed    Idle       0.000   501  1+23:31:21
vm3@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:30:23
vm4@xxxxxxxxx LINUX       INTEL  Claimed    Idle       0.000   501  1+23:32:10
vm1@xxxxxxxxx LINUX       INTEL  Owner      Idle       0.000   501  1+23:34:13
vm2@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:08
vm3@xxxxxxxxx LINUX       INTEL  Claimed    Idle       0.000   501  1+23:32:10
vm4@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:31:25
vm1@xxxxxxxxx LINUX       INTEL  Owner      Idle       0.000   501  1+23:34:09
vm2@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:03
vm3@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:31:17
vm4@xxxxxxxxx LINUX       INTEL  Claimed    Busy       0.000   501  1+23:32:03
vm1@xxxxxxxxx LINUX       INTEL  Owner      Idle       0.000   501  1+23:34:15
vm2@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:09
vm3@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:09
vm4@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:09
vm1@xxxxxxxxx LINUX       INTEL  Owner      Idle       0.000   501  1+23:34:16
vm2@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:10
vm3@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:10
vm4@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:10
vm1@xxxxxxxxx LINUX       INTEL  Owner      Idle       0.000   501  1+23:34:10
vm2@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:04
vm3@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:04
vm4@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:04
vm1@xxxxxxxxx LINUX       INTEL  Owner      Idle       0.000   501  1+23:34:12
vm2@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:06
vm3@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:06
vm4@xxxxxxxxx LINUX       INTEL  Unclaimed  Idle       0.000   501  1+23:34:06

                     Machines Owner Claimed Unclaimed Matched Preempting
         INTEL/LINUX       44    11       4        29       0          0
               Total       44    11       4        29       0          0

Thanks
Leslie

-- 
   ,-~~-.___.       ________________________________________________
  / |  '     \      groer@xxxxxxxxxxxxxxxxxxx  Department of Physics
 (  )        0           Tel: +1-416-978-2959  University of Toronto
  \_/-, ,----'           Fax: +1-416-978-8221  60 St. George Street
     ====           //                         Toronto, ON M5S 1A7
    /  \-'~;    /~~~(O)                        Canada
   /  __/~|   /       |  Office: McLennan Physics Lab Room 911
 =(  _____| (_________|  http://home.fnal.gov/~groer
     Leslie S. Groer