
Re: [HTCondor-users] unwelcome job classad



Hi Nagaraj,

you can use
  condor_config_val -summary -config {-expanded}
to dump the configuration that the local Condor daemons picked up, together with the files it was read from. If a ClassAd attribute was set on a specific node, it should show up there along with its source file.
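
To avoid reading the whole dump by eye, the output can be filtered for the attribute in question. This is only a minimal sketch, assuming condor_config_val is on the PATH of the node being checked. The helper names matching_lines and search_config are made up for illustration and are not part of HTCondor.

```python
import shutil
import subprocess
from typing import List


def matching_lines(text: str, needle: str) -> List[str]:
    """Return the lines of text that mention needle, ignoring case."""
    needle = needle.lower()
    return [line for line in text.splitlines() if needle in line.lower()]


def search_config(needle: str) -> List[str]:
    """Run condor_config_val -summary and keep the lines mentioning needle."""
    if shutil.which("condor_config_val") is None:
        return []  # HTCondor tools are not installed or not on PATH
    result = subprocess.run(
        ["condor_config_val", "-summary"],
        capture_output=True,
        text=True,
        check=False,
    )
    return matching_lines(result.stdout, needle)


if __name__ == "__main__":
    for line in search_config("CheckpointPlatform"):
        print(line)
```

The filter only keeps the matching lines, so the source file of a hit still has to be looked up in the full output. An empty result would suggest that the attribute is not set in any local configuration file.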

Cheers,
  Thomas


On 24/03/2021 09.44, nagaraj.tifr@xxxxxxxxx wrote:
Hi!
That pasted text is the output of the better-analyze command.
I'm unable to find the source of the checkpoint requirement. All users' jobs are idle because of it. Please help me find where it came from.

- Nagaraj
..
sent from cellphone. pls excuse brevity/typos

On Wed, 24 Mar, 2021, 13:12 Beyer, Christoph, <christoph.beyer@xxxxxxx> wrote:

    Hi Nagaraj,

    please try:

    condor_q -better-analyze <jobid> -reverse -machine <slot-id>@<workernode.fq.dn>

    This will show you in detail why this job will not run in the named
    slot ...

    Best
    christoph

-- Christoph Beyer
    DESY Hamburg
    IT-Department

    Notkestr. 85
    Building 02b, Room 009
    22607 Hamburg

    phone:+49-(0)40-8998-2317
    mail: christoph.beyer@xxxxxxx

    ----- Original Message -----
    From: "Nagaraj Panyam" <pn@xxxxxxxxxxx>
    To: "htcondor-users" <htcondor-users@xxxxxxxxxxx>
    Sent: Wednesday, 24 March 2021 07:15:52
    Subject: [HTCondor-users] unwelcome job classad

    Hi,

    All of a sudden, all jobs of all users are idle, and the analysis below
    shows the reason. I put in the requirement for INO_USER myself, but I
    did not put in the requirement for IsValidCheckpointPlatform. Could it
    have come from some updated packages? How can I take it out?

    ==================================================

    The Requirements expression for this slot is

        (START) &&
        (IsValidCheckpointPlatform)

      START is
        INO_USER == true

      IsValidCheckpointPlatform is
        (TARGET.JobUniverse isnt 1 ||
          ((MY.CheckpointPlatform isnt undefined) &&
            ((TARGET.LastCheckpointPlatform is MY.CheckpointPlatform) ||
              (TARGET.NumCkpts == 0))))

    This slot defines the following attributes:

        CheckpointPlatform = "LINUX X86_64 3.10.0-1160.11.1.el7.x86_64 normal N/A avx avx2 ssse3 sse4_1 sse4_2"

    The Requirements expression for this slot reduces to these conditions:

           Clusters
    Step    Matched  Condition
    -----  --------  ---------
    [0]           0  INO_USER == true
    [1]           0  IsValidCheckpointPlatform

    ======================================================

    Thanks a lot!

    Nagaraj
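
The IsValidCheckpointPlatform expression in the analysis output above can be traced by hand. What follows is a rough sketch in Python, not HTCondor code: jobs and slots are modelled as plain dicts, a missing key or None stands for the ClassAd value undefined, and the function name is made up for illustration. It relies on the fact that JobUniverse 1 is the standard universe and that "is" / "isnt" always yield true or false, while "==" against undefined yields undefined.

```python
from typing import Optional

STANDARD_UNIVERSE = 1  # JobUniverse value of the standard universe


def is_valid_checkpoint_platform(job: dict, slot: dict) -> Optional[bool]:
    """Mirror the slot's IsValidCheckpointPlatform clause.

    job plays the role of TARGET and slot the role of MY.
    Returns True, False, or None (None stands for undefined).
    """
    # TARGET.JobUniverse isnt 1
    if job.get("JobUniverse") != STANDARD_UNIVERSE:
        return True
    # MY.CheckpointPlatform isnt undefined
    if slot.get("CheckpointPlatform") is None:
        return False
    # TARGET.LastCheckpointPlatform is MY.CheckpointPlatform
    if job.get("LastCheckpointPlatform") == slot["CheckpointPlatform"]:
        return True
    # TARGET.NumCkpts == 0 (undefined propagates through "==")
    num_ckpts = job.get("NumCkpts")
    if num_ckpts is None:
        return None
    return num_ckpts == 0


if __name__ == "__main__":
    vanilla_job = {"JobUniverse": 5}  # 5 is the vanilla universe
    slot = {"CheckpointPlatform": "LINUX X86_64 ..."}  # value shortened
    print(is_valid_checkpoint_platform(vanilla_job, slot))
```

Read this way, the clause is true for any job whose JobUniverse is not 1, so for such jobs it should not be what prevents a match. That would leave INO_USER == true as the condition to check against the job ads.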

    _______________________________________________
    HTCondor-users mailing list
    To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
    subject: Unsubscribe
    You can also unsubscribe by visiting
    https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users

    The archives can be found at:
    https://lists.cs.wisc.edu/archive/htcondor-users/




