[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] about classAd error: MY.Rank > MY.CurrentRank



Hi All,

I got this information from the startLog.  The problem is caused by
the unauthenticated user to excute command 442 on local DAEMON.
After the match between jobClassAd and machineClassAd, the schedd on
submit machine begin to talk with startd on execute machine.
and there is user authoritation require for schedd to execute some
method on remote daemon, which is specified by ALLOW_DEAMON
I specified the ALLOW_DEAMON = *, then the problem was gone.

Hope this will help for other guys have the same problem.

Kimaru

07/27 08:55:49 slot1: State change: match notification protocol successful
07/27 08:55:49 slot1: Changing state: Unclaimed -> Matched
07/27 08:55:49 PERMISSION DENIED to unauthenticated user from host
192.168.223.2 for command 442 (REQUEST_CLAIM), access level DAEMON:
reason: cached result for DAEMON; see first case for the full reason
07/27 08:56:09 slot2: State change: match timed out
07/27 08:56:09 slot2: Changing state: Matched -> Owner


On Mon, Jul 26, 2010 at 12:05 AM, fly zebra <flyzebra9@xxxxxxxxx> wrote:
> Hi all,
>
> I submit a simple vanilla job to Condor Windows cluster. The job has
> been submitted to the job queue, but just keep idle, and never run.
> The following is debug message with condor_q -better-ana -long
>
> -- Submitter: h01.test.com : <10.198.234.6:8080> : h01.test.com
> slot8@h01- Failed rank condition: MY.Rank > MY.CurrentRank
> ---
> 086.000:  Run analysis summary.  Of 24 machines,
>      0 are rejected by your job's requirements
>      2 reject your job because of their own requirements
>      0 match but are serving users with a better priority in the pool
>     22 match but reject the job for unknown reasons
>      0 match but will not currently preempt their existing job
>      0 match but are currently offline
>      0 are available to run your job
>        Last successful match: Sun Jul 25 23:45:36 2010
>
>
> my condor_config:
>
> START           = True
> SUSPEND                 = False
> CONTINUE        = True
> PREEMPT         = False
> KILL            = False
> WANT_SUSPEND    = False
> WANT_VACATE     = False
> RANK            = 0.000000
>
> I have checked the mailing list archives, but there is no answer to
> MY.Rank > MY.CurrentRank.
> Any hints for the problem will be highly aprreciated.
>
> --
> Thanks,
> Kimaru
>



-- 
Thanks,
Kimaru