
Re: [HTCondor-users] All jobs idle



On Apr 9, 2018, at 9:53 AM, Justin Fisher <justin0419@xxxxxxxxx> wrote:

Can anyone help? I recently updated my Condor version and I'm now having trouble getting it to work. Caveat - there were other OS (CentOS) packages I updated at the same time.

I have 48 slots all are reported Unclaimed and Idle. This is just a rerun of something that ran ok a few months ago - so I'm a bit lost.

What version of Condor did you upgrade from?

The security errors you're seeing suggest some changes need to be made to your authorization settings.
But the current problem with your idle jobs is shown by these lines in the NegotiatorLog:

04/09/18 16:45:07 Request 00295.00000: autocluster 1 (request count 1 of 2700)

04/09/18 16:45:07 Rejected 295.0 group_ANALOG.jfisher@xxxxxxxxxxxxxx<192.168.1.206:9618?addrs=192.168.1.206-9618+[--1]-9618&noUDP&sock=1437_3daf_4>: no match found

Something in the Requirements expression of your machines or your jobs is indicating that they do not match each other.
You can use the condor_q -analyze command to help figure out where the mismatch is.
You can look at the full ClassAd descriptions of the machines and jobs using condor_status -long and condor_q -long, respectively.
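
If there are many rejections in the NegotiatorLog, a small script can pull out the affected job IDs so you know which ones to pass to condor_q -analyze. The following is only a rough sketch: it assumes rejection lines have the same shape as the one quoted above, and the regular expression and the rejected_jobs helper are illustrative names, not part of HTCondor.

```python
import re

# Assumed shape of a rejection line (based only on the excerpt above):
#   <date> <time> Rejected <cluster>.<proc> <submitter and address>: <reason>
REJECTED = re.compile(
    r"^(?P<date>\S+) (?P<time>\S+) Rejected (?P<job>\d+\.\d+) .*: (?P<reason>[^:]+)$"
)


def rejected_jobs(lines):
    """Yield (job_id, reason) for every rejection line found in the log."""
    for line in lines:
        m = REJECTED.match(line.strip())
        if m:
            yield m.group("job"), m.group("reason").strip()


# The two log lines from this thread, used as sample input.
sample = [
    "04/09/18 16:45:07 Request 00295.00000: autocluster 1 (request count 1 of 2700)",
    "04/09/18 16:45:07 Rejected 295.0 group_ANALOG.jfisher@xxxxxxxxxxxxxx"
    "<192.168.1.206:9618?addrs=192.168.1.206-9618+[--1]-9618&noUDP&sock=1437_3daf_4>"
    ": no match found",
]

for job, reason in rejected_jobs(sample):
    # Print the follow-up command to run by hand for each rejected job.
    print(f"job {job} rejected ({reason}); try: condor_q -analyze {job}")
```

To run it against a real log, replace sample with the lines read from your NegotiatorLog file; its location depends on your configuration.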

Thanks and regards,
Jaime Frey
UW-Madison HTCondor Project