[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] After negotiator problems and restart condor_userprio reports wrong values



Hi all,

in a set-up with multiple condor negotiators we had a situation where
the system had this problem:

condor_userprio: "Can't find address for negotiator"

We (carefully) restarted condor on that machine and everything looked
fine at first glance except:

(1) the user prio factors were all reset to default values
(2) we lost a lot of our "history" on that node:

Number of users: 17  1178    530122.60  4/06/2008 14:31        ???

on the other node:
Number of users: 24  1179   4369145.74  5/27/2008 23:36        ???

Finally:

on the "deranged" node we have this line in userprio:

user@xxxxxxxxxxx  500.00     0.50      1000.00    0   -202356.46
6/26/2008 15:41  7/15/2008 15:43

Please note the negative accumulated usage!

We will reboot the "deranged" node due to other reasons in a few
minutes, but my questions are:

* Was the Accountantnew.log file damaged (if that is the correct one)?
* How can we ensure that all nodes show again the same numbers (at leat
roughly)?

Thanks for any help

Carsten

-- 
Dr. Carsten Aulbert - Max Planck Institute for Gravitational Physics
Callinstrasse 38, 30167 Hannover, Germany
Phone/Fax: +49 511 762-17185 / -17193
http://www.top500.org/system/9234 | http://www.top500.org/connfam/6/list/31