[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] unwelcome job classad



Hi!
That pasted text is the output of the better-analyze command.Â
Im unable to find the source of the checkpoint requirement. All users' jobs are idle because of that. Please help about how to find where that came from.

- Nagaraj
..
sent from cellphone. pls excuse brevity/typos

On Wed, 24 Mar, 2021, 13:12 Beyer, Christoph, <christoph.beyer@xxxxxxx> wrote:
Hi Nagaraj,

please try:

condor_q -better-analyze <jobid> -reverse -machine <slot-id>@<workernode.fq.dn>

This will show you in detail why this job will not run in the named slot ...

Best
christoph

--
Christoph Beyer
DESY Hamburg
IT-Department

Notkestr. 85
Building 02b, Room 009
22607 Hamburg

phone:+49-(0)40-8998-2317
mail: christoph.beyer@xxxxxxx

----- UrsprÃngliche Mail -----
Von: "Nagaraj Panyam" <pn@xxxxxxxxxxx>
An: "htcondor-users" <htcondor-users@xxxxxxxxxxx>
Gesendet: Mittwoch, 24. MÃrz 2021 07:15:52
Betreff: [HTCondor-users] unwelcome job classad

Hi,

All of a sudden all jobs of all users are idle and I find the reason as
below. Point is, I myself have put in the requirement for INO_USER, but
I have not put in the requirement for IsValidcheckpointPlatform. Could
it be in some updated packages? How to take it out?

==================================================

The Requirements _expression_ for this slot is

ÂÂÂÂ (START) &&
ÂÂÂÂ (IsValidCheckpointPlatform)

ÂÂ START is
ÂÂÂÂ INO_USER == true

ÂÂ IsValidCheckpointPlatform is
ÂÂÂÂ (TARGET.JobUniverse isnt 1 ||
ÂÂÂÂÂÂ ((MY.CheckpointPlatform isnt undefined) &&
ÂÂÂÂÂÂÂÂ ((TARGET.LastCheckpointPlatform is MY.CheckpointPlatform) ||
ÂÂÂÂÂÂÂÂÂÂ (TARGET.NumCkpts == 0))))

This slot defines the following attributes:

ÂÂÂÂ CheckpointPlatform = "LINUX X86_64 3.10.0-1160.11.1.el7.x86_64
normal N/A avx avx2 ssse3 sse4_1 sse4_2"

The Requirements _expression_ for this slot reduces to these conditions:

ÂÂÂÂÂÂÂ Clusters
Step Matched Condition
-----Â --------Â ---------
[0]ÂÂÂÂÂÂÂÂÂÂ 0Â INO_USER == true
[1]ÂÂÂÂÂÂÂÂÂÂ 0Â IsValidCheckpointPlatform

======================================================

Thanks a lot!

Nagaraj

_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/

_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/