
Re: [Condor-users] Why does machine reject job for unknown reasons



Hi,

Sorry to bother you again with my question, but this problem still persists. So far I have received no suggestions on how to find out why Condor jobs are rejected ...

Cheers
  Alex

On 5/14/07, Alexander Dietz <Alexander.Dietz@xxxxxxxxxxxxxx> wrote:
Hi,

Thanks for this suggestion, but the output does not really help me further (see below). It looks like 150 machines are suitable for running the jobs, but they still reject the job for unknown reasons! I need the jobs to start immediately because of a time-limited online demonstration of the work I am doing.
Any other suggestions?

Cheers
  Alex

> condor_q -better-analyze 1082109.0

1082109.000:  Run analysis summary.  Of 152 machines,
      2 are rejected by your job's requirements
      0 reject your job because of their own requirements
      0 match but are serving users with a better priority in the pool
    150 match but reject the job for unknown reasons
      0 match but will not currently preempt their existing job
      0 are available to run your job

The Requirements expression for your job is:

( target.Arch == "X86_64" ) && ( target.OpSys == "LINUX" ) &&
( ( target.CkptArch == target.Arch ) || ( target.CkptArch is undefined ) ) &&
( ( target.CkptOpSys == target.OpSys ) || ( target.CkptOpSys is undefined ) ) &&
( target.Disk >= DiskUsage ) && ( ( target.Memory * 1024 ) >= ImageSize )

    Condition                         Machines Matched    Suggestion
    ---------                         ----------------    ----------
1   ( target.Disk >= 10000 )          150
2   ( target.Arch == "X86_64" )       152
3   ( target.OpSys == "LINUX" )       152
4   ( ( target.CkptArch == target.Arch ) || ( target.CkptArch is undefined ) )
                                      152
5   ( ( target.CkptOpSys == target.OpSys ) || ( target.CkptOpSys is undefined ) )
                                      152
6   ( ( 1024 * target.Memory ) >= 10000 )  152
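
For anyone who wants to track these counts across many jobs, here is a minimal Python sketch that turns the "Run analysis summary" block above into a dictionary. It assumes the text layout is exactly as pasted in this message; other Condor versions may print it differently. The names parse_summary, HEADER, COUNT and SAMPLE are hypothetical helpers made up for this illustration and are not part of Condor.

```python
import re

# Sample text copied from the condor_q -better-analyze output in this thread.
SAMPLE = """\
1082109.000:  Run analysis summary.  Of 152 machines,
      2 are rejected by your job's requirements
      0 reject your job because of their own requirements
      0 match but are serving users with a better priority in the pool
    150 match but reject the job for unknown reasons
      0 match but will not currently preempt their existing job
      0 are available to run your job
"""

# Assumed layout: "<cluster>.<proc>:  Run analysis summary.  Of <N> machines,"
HEADER = re.compile(r"^(\d+\.\d+):\s+Run analysis summary\.\s+Of (\d+) machines,")
# Assumed layout of each following row: "<count> <free-text reason>"
COUNT = re.compile(r"^\s*(\d+)\s+(\S.*?)\s*$")


def parse_summary(text):
    """Return (job_id, total_machines, {reason: count}) for the first summary found."""
    job_id, total, counts = None, None, {}
    for line in text.splitlines():
        header = HEADER.match(line)
        if header:
            job_id, total = header.group(1), int(header.group(2))
            continue
        if job_id is None:
            continue  # skip anything printed before the summary header
        row = COUNT.match(line)
        if not row:
            break  # the first non-count line ends the summary
        counts[row.group(2)] = int(row.group(1))
    return job_id, total, counts


job_id, total, counts = parse_summary(SAMPLE)
# Print only the non-zero categories, largest first.
for reason, n in sorted(counts.items(), key=lambda kv: -kv[1]):
    if n:
        print(f"job {job_id}: {n} of {total} machines {reason}")
```

The sketch only reorganizes what condor_q already printed; it does not explain the "unknown reasons" category any further than the original output does.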





On 5/14/07, Ian Chesal <ian.chesal@xxxxxxxxx> wrote:


On 5/14/07, Alexander Dietz <Alexander.Dietz@xxxxxxxxxxxxxx> wrote:
Hi,

I have a problem when submitting a DAG to Condor: before any of the jobs get executed, they are rejected for unknown reasons, as the following messages suggest:

> condor_q -analyze 1076700.0

Alex,

If you're running 6.8.x on Linux you can use the -better-analyze option which is infinitely more helpful than -analyze:

condor_q -better-analyze 1076700.0

- Ian

