[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] All jobs idle



Hi James.

Thanks so much for your help. It turns out that I had a slight mismatch in my request_memory line when compared to slot_type ram. I now know what to look for in the future!


--
Kind regards,

Justin Fisher.

On Mon, Apr 9, 2018 at 11:26 PM, Jaime Frey <jfrey@xxxxxxxxxxx> wrote:
On Apr 9, 2018, at 9:53 AM, Justin Fisher <justin0419@xxxxxxxxx> wrote:

Can anyone help? I recently updated my Condor version and I'm now having trouble getting it to work. Caveat â there were other OS (CentOS) packages I updated at the same time.

I have 48 slots all are reported Unclaimed and Idle. This is just a rerun of something that ran ok a few months ago â so I'm a bit lost.

What version of Condor did you upgrade from?

The security errors youâre seeing suggest some changes need to be made to your authorization settings.
But the current problem with your idle jobs is shown by these lines in the NegotiatorLog:

04/09/18 16:45:07 Request 00295.00000: autocluster 1 (request count 1 of 2700)


Something in the Requirements _expression_ of your machines or your jobs is indicating that they do match match each other.
You can use the condor_q -analyze command to help figure out where the mismatch is.
You can look at the full ClassAd descriptions of the machines and jobs using condor_status -long and condor_q -longer, respectively.

Thanks and regards,
Jaime Frey
UW-Madison HTCondor Project


_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@cs.wisc.edu with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/