Re: [HTCondor-users] stuck submit jobs



Dear Nate,

Thank you very much for your reply.
The "condor_q -bet" command reports that no machines are available to run the job.
The output is below.
Could you tell me how to match the jobs to machines?

Thank you,
Best regards,

Hiroshi

---
[root@bepp01 ~]# condor_q -bet 11
-- Submitter: bepp01.bepp.rcapp.kyushu-u.ac.jp : <192.168.12.1:48988> : bepp01.bepp.rcapp.kyushu-u.ac.jp
User priority for hyamaguc@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx is not available, attempting to analyze without it.
---
011.000:  Run analysis summary.  Of 384 machines,
      0 are rejected by your job's requirements
      0 reject your job because of their own requirements
      0 match and are already running your jobs
      0 match but are serving other users
      0 are available to run your job
        No successful match recorded.
        Last failed match: Tue Oct  7 23:51:05 2014

        Reason for last match failure: no match found

The Requirements expression for your job is:

    ( TARGET.Arch == "X86_64" ) && ( TARGET.OpSys == "LINUX" ) &&
    ( TARGET.Disk >= RequestDisk ) && ( TARGET.Memory >= RequestMemory ) &&
    ( ( TARGET.HasFileTransfer ) ||
      ( TARGET.FileSystemDomain == MY.FileSystemDomain ) )

Your job defines the following attributes:

    FileSystemDomain = "bepp01.bepp.rcapp.kyushu-u.ac.jp"
    DiskUsage = 1000
    ImageSize = 1000
    RequestDisk = 1000
    RequestMemory = 1

The Requirements expression for your job reduces to these conditions:

         Slots
Step    Matched  Condition
-----  --------  ---------
[0]         384  TARGET.Arch == "X86_64"
[1]         384  TARGET.OpSys == "LINUX"
[3]         384  TARGET.Disk >= RequestDisk
[5]         384  TARGET.Memory >= RequestMemory
[7]         384  TARGET.HasFileTransfer

Suggestions:

    Condition                         Machines Matched Suggestion
    ---------                         ---------------- ----------
1   ( TARGET.Arch == "X86_64" )       384
2   ( TARGET.OpSys == "LINUX" )       384
3   ( TARGET.Disk >= 1000 )           384
4 ( TARGET.Memory >= ifthenelse(MemoryUsage isnt undefined,MemoryUsage,1) )
                                      384
5 ( ( TARGET.HasFileTransfer ) || ( TARGET.FileSystemDomain == "bepp01.bepp.rcapp.kyushu-u.ac.jp" ) )
                                      384

Hiroshi Yamaguchi

On 2014/10/08 0:28, Nathan Yehle wrote:

Hi, you can also try condor_q -bet <jobid> to see how many machines match your requirements. Maybe they just don't match?
-Nate

On Tue, 7 Oct 2014, Hiroshi Yamaguchi wrote:

Dear Ben,

Many thanks for your comment.
Do you mean that I should set "IP_ADDRESS = 192.168.12.1" and "ALLOW_READ = $(CONDOR_HOST), $(IP_ADDRESS)"?
HTCondor does not work with this configuration.

Thanks again,
Best regards,

On 2014/10/07 23:04, Ben Cotton wrote:
 Hiroshi,

 Do the private IP addresses map to the host names? From the logs, it
 looks like the daemons are communicating over the private addresses,
 but you only have host names configured in your ALLOW_* lines. If you
 add the appropriate IP addresses (or CIDR blocks), does it work?
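
 As an illustration only, the suggestion could look like the fragment
 below. This is a sketch, not the poster's actual configuration: the
 192.168.12.0/24 block is an assumption inferred from the 192.168.12.1
 address that appears in the condor_q output, and the file it would go
 in (for example a local config file) is likewise assumed.

 ```
 # Hypothetical fragment: keep the host name and add the private subnet.
 # 192.168.12.0/24 is assumed from the 192.168.12.1 address in the log.
 ALLOW_READ  = $(CONDOR_HOST), 192.168.12.0/24
 ALLOW_WRITE = $(CONDOR_HOST), 192.168.12.0/24
 ```

 After a change like this, running condor_reconfig should make the
 daemons re-read the configuration.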


 Thanks,
 BC


_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/
