[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] condor_q -better requires kerberos when condor_q doesn't.



I've got a strange situation where condor_q returns valid data without a
kerberos ticket but condor_q -better complains about being able to fetch
startd ads unless you have a kerberos ticket.  (The 'krb' command is a
script that does "kinit -l 10h -r 2d -f $prnc" followed by "aklog".)

Script started on Thu 16 Aug 2007 03:04:30 PM PDT
: || nomad@hawk ~ [1] ; condor_config_val SEC_DEFAULT_AUTHENTICATION
REQUIRED
: || nomad@hawk ~ [2] ; condor_config_val SEC_DEFAULT_AUTHENTICATION_METHODS
FS, KERBEROS
: || nomad@hawk ~ [3] ; kdestroy
kdestroy: No credentials cache found while destroying cache
: || nomad@hawk ~ [4] ; condor_submit condor/condortest && sleep 2 &&
condor_q
Submitting job(s)
WARNING: Log file /users/nomad/condor.log.540 is on NFS.
This could cause log file corruption and is _not_ recommended.
.
Logging submit event(s).
1 job(s) submitted to cluster 540.


-- Submitter: hawk.ee.washington.edu : <128.208.232.98:32945> :
hawk.ee.washington.edu
 ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD
 540.0   nomad           8/16 15:06   0+00:00:00 I  0   9.8
condortest.sh full

1 jobs; 1 idle, 0 running, 0 held

: || nomad@hawk ~ [5] ; condor_q -better
Error:  Could not fetch startd ads

: || nomad@hawk ~ [6] ; krb
Enter your EE department password. Ticket will be valid for 10 hours.
Password for nomad@xxxxxxxxxxxxxxxxx:
: || nomad@hawk ~ [7] ; condor_q -better


-- Submitter: hawk.ee.washington.edu : <128.208.232.98:32945> :
hawk.ee.washington.edu
---
540.000:  Run analysis summary.  Of 377 machines,
    287 are rejected by your job's requirements
      4 reject your job because of their own requirements
      0 match but are serving users with a better priority in the pool
      0 match but reject the job for unknown reasons
     64 match but will not currently preempt their existing job
     22 are available to run your job

The Requirements expression for your job is:

( ( MY.RESOURCE_GROUP == TARGET.JOB_GROUP ) ) && ( target.Arch ==
"X86_64" ) &&
( target.OpSys == "LINUX" ) && ( target.Disk >= DiskUsage ) &&
( ( target.Memory * 1024 ) >= ImageSize ) &&
( TARGET.FileSystemDomain == MY.FileSystemDomain )

    Condition                         Machines Matched    Suggestion
    ---------                         ----------------    ----------
1   ( target.Arch == "X86_64" )       90
2   ( target.OpSys == "LINUX" )       374
3   ( ( "ssli" == TARGET.JOB_GROUP ) )377
4   ( target.Disk >= 10000 )          377
5   ( ( 1024 * target.Memory ) >= 10000 )377
6   ( TARGET.FileSystemDomain == "ee.washington.edu" )
                                      377
: || nomad@hawk ~ [8] ; exit

Script done on Thu 16 Aug 2007 03:07:08 PM PDT


Any hints or advice would be appreciated.

Thanks,
nomad