[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] Cannot specify a compute node



Dear users,

I want to submit jobs to computeÂnode01~ 05 first, and then if the first five nodes are occupied, then I want to run the jobs on node06~10.Â

How can I achieve this in an easy way?

Thank you in advance,

Best regards,





On Tue, Jun 22, 2021 at 3:59 PM Myunggi Yi <myunggi@xxxxxxxxxxxxx> wrote:
Now I have another question.
I am testing Rank.
Again, job always goes to node03 not node01

Can you help?


Executable      Â= sleep.sh
Log          = test.log
Error         = test.err
Output        Â= test.out
# NFS
+IwdFlusNFSCache   Â= False
Should_transfer_files = no
Requirements     Â= TARGET.UidDomain == "synapse" &&             TARGET.FileSystemDomain == "synapse"
Rank = ((machine == "node01.synapse")*3) + ((machine == "node02.synapse")*2) Â Â Â+ ((machine == "node03.synapse")*2) + ((machine == "node04.synapse")*2) Â Â Â+ ((machine == "node05.synapse")*2)
GetEnv        Â= True
# Prevent re-run
periodic_remove    = JobStatus == 1 && NumJobStarts > 0
Queue


Best wishes,



On Tue, Jun 22, 2021 at 2:52 PM <ervikrant06@xxxxxxxxx> wrote:
Your last requirement must be overriding the first and second requirement mentioned in the submit file. Please add the requirements in a single line like below. It should work and make the job run on the desired node.Â

Requirements     Â= NumJobStarts == 0 && TARGET.UidDomain == "synapse" &&             TARGET.FileSystemDomain == "synapse" && machine == "node01.synapse"

Also if you are initially submitting the jobs I don't think you need to give NumJobStarts moreover only specifyingÂmachine requirements should be sufficient.Â

Thanks & Regards,
Vikrant Aggarwal


On Tue, Jun 22, 2021 at 10:46 AM Myunggi Yi <myunggi@xxxxxxxxxxxxx> wrote:
Dear users,

I just installed htcondor ver. 9.0.1 on a linux cluster with five nodes.
The following is my submit script.
The job always goes to a certain machine (node03), not to the one I choose (node01).

How can I fix thisÂproblem?
Many thanks in advance.



Executable      Â= sleep.sh
Log          = test.log
Error         = test.err
Output        Â= test.out
Requirements = machine == "node01.synapse"
# NFS
+IwdFlusNFSCache   Â= False
Should_transfer_files = no
Requirements     Â= TARGET.UidDomain == "synapse" &&             TARGET.FileSystemDomain == "synapse"
GetEnv        Â= True
#Request_GPUs     Â= 1
# Prevent re-run
requirements     Â= NumJobStarts == 0
periodic_remove    = JobStatus == 1 && NumJobStarts > 0
# Email
Notification     Â= Always
Notify_user      = myunggi@xxxxxxxxxxxxx
Queue


_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/
_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/