[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[HTCondor-users] absent nodes



Hi,

we use the 'condor_status -absent' command to check if we lost some nodes from time to time. 

>From my understanding, once the startd succeeds to send a classadd to the collector the 'absent' state should be deletetd. 

I feel that is not always the case (?) 

Here is an example of a very happy-job-running candidate that is still listed in the absent state: 

[root@bird-htc-master01 ~]# condor_status -absent | grep bird574
slot1@xxxxxxxxxxxxxxx              LINUX      X86_64    5/25 09:57  6/24 09:57
slot2@xxxxxxxxxxxxxxx              LINUX      X86_64    5/25 09:57  6/24 09:57

Here is what I find in the collectorlog about these slots: 

[root@bird-htc-master01 ~]# grep slot1@xxxxxxxxxxxxxxx  /var/log/condor/CollectorLog
05/23/20 12:27:06 		**** Removing stale ad: "< slot1@xxxxxxxxxxxxxxx , 131.169.77.56 >"
05/23/20 12:27:06 Added ad to persistent store key=<slot1@xxxxxxxxxxxxxxx,131.169.77.56>
05/23/20 12:27:06 		**** Removing stale ad: "< slot1@xxxxxxxxxxxxxxx , 131.169.77.56 >"
05/23/20 12:27:06 		**** Removing stale ad: "< slot1@xxxxxxxxxxxxxxx , 131.169.77.56 >"
05/23/20 12:29:37 StartdPvtAd  : Inserting ** "< slot1@xxxxxxxxxxxxxxx , 131.169.77.56 >"
05/23/20 12:29:37 Removed ad from persistent store key=<slot1@xxxxxxxxxxxxxxx,131.169.77.56>
05/25/20 08:27:08 		**** Removing stale ad: "< slot1@xxxxxxxxxxxxxxx , 131.169.77.56 >"
05/25/20 08:27:08 Added ad to persistent store key=<slot1@xxxxxxxxxxxxxxx,131.169.77.56>
05/25/20 08:27:08 		**** Removing stale ad: "< slot1@xxxxxxxxxxxxxxx , 131.169.77.56 >"
05/25/20 08:27:08 		**** Removing stale ad: "< slot1@xxxxxxxxxxxxxxx , 131.169.77.56 >"
05/25/20 09:23:58 StartdAd     : Inserting ** "< slot1@xxxxxxxxxxxxxxx , 127.0.0.1 >"
05/25/20 09:23:58 StartdPvtAd  : Inserting ** "< slot1@xxxxxxxxxxxxxxx , 127.0.0.1 >"
05/25/20 09:36:10 StartdPvtAd  : Inserting ** "< slot1@xxxxxxxxxxxxxxx , 131.169.77.56 >"
05/25/20 09:36:10 Removed ad from persistent store key=<slot1@xxxxxxxxxxxxxxx,131.169.77.56>
05/25/20 09:57:08 		**** Removing stale ad: "< slot1@xxxxxxxxxxxxxxx , 127.0.0.1 >"
05/25/20 09:57:08 Added ad to persistent store key=<slot1@xxxxxxxxxxxxxxx,127.0.0.1>
05/25/20 09:57:08 		**** Removing stale ad: "< slot1@xxxxxxxxxxxxxxx , 127.0.0.1 >"
05/25/20 09:57:08 		**** Removing stale ad: "< slot1@xxxxxxxxxxxxxxx , 127.0.0.1 >"
05/25/20 17:42:09 		**** Removing stale ad: "< slot1@xxxxxxxxxxxxxxx , 131.169.77.56 >"
05/25/20 17:42:09 Added ad to persistent store key=<slot1@xxxxxxxxxxxxxxx,131.169.77.56>
05/25/20 17:42:09 		**** Removing stale ad: "< slot1@xxxxxxxxxxxxxxx , 131.169.77.56 >"
05/25/20 17:42:09 		**** Removing stale ad: "< slot1@xxxxxxxxxxxxxxx , 131.169.77.56 >"
05/26/20 07:52:28 StartdPvtAd  : Inserting ** "< slot1@xxxxxxxxxxxxxxx , 131.169.77.56 >"
05/26/20 07:52:28 Removed ad from persistent store key=<slot1@xxxxxxxxxxxxxxx,131.169.77.56>

Best
Christoph

-- 
Christoph Beyer
DESY Hamburg
IT-Department

Notkestr. 85
Building 02b, Room 009
22607 Hamburg

phone:+49-(0)40-8998-2317
mail: christoph.beyer@xxxxxxx