[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] Submit from (linux or Windows) to a Linux Manager FAILED with "Failed rank condition: MY.Rank > MY.CurrentRank"



Hi Folks, 

I setup a simple infrastructure having three stations:
 - Station1: Ubuntu 8.04.2 / Kernel 2.6.24-17-generic / i686
 - Station2: Ubuntu 8.04.2 / Kernel 2.6.24-23-generic / Intel 64 (bi-processors)
 - Station3: Windows XP Pro SP3 / Intel core Duo
 
I installed the following respective package:
 - condor-7.2.0-linux-x86-rhel5-dynamic.tar.gz
 - condor-7.2.0-linux-x86_64-rhel5-dynamic.tar.gz
 - condor-7.2.0-winnt50-x86.msi

On all installation, the security has been setup using the following:
 
SEC_DEFAULT_AUTHENTICATION = NEVER
SEC_DEFAULT_ENCRYPTION = NEVER
SEC_DEFAULT_INTEGRITY = NEVER
SEC_DEFAULT_NEGOTIATION = REQUIRED
QUEUE_ALL_USERS_TRUSTED = True

All station can run 'condor_status'

Only station1 managed to submit successfully the condor script described below. Both station1 and station3 managed to submit the job but this one stays in the queue with "Idle" state forever. 

Running the command 'condor_q -l -analyse', I obtained the following error message on both station: 'Failed rank condition: MY.Rank > MY.CurrentRank'

Condor Script 
---------------------------------------
executable=script_to_run2.sh
universe=vanilla
arguments=Example.$(Cluster).$(Process) 100
output=results.output.$(Process)
error=results.error.$(Process)
log=results.log
notification=never
Requirements = TARGET.UidDomain == "XXXXXX.XXX" && \
               TARGET.FileSystemDomain == "XXXXXX.XXX" && \
               TARGET.Arch =="X86_64" && TARGET.OpSys == "LINUX"
should_transfer_files=YES
when_to_transfer_output = ON_EXIT
queue
---------------------------------------

The shell script associated
---------------------------------------
#! /bin/sh

echo "I'm process id $$ on" `hostname`
echo "This is sent to standard error" 1>&2
date
echo "Running as binary $0" "$@"
echo "My name (argument 1) is $1"
echo "My sleep duration (argument 2) is $2"
sleep $2
echo "Sleep of $2 seconds finished.  Exiting"
exit 42
---------------------------------------

Nota: It seems there is no suspicious message in the different log file.

I would appreciate any hints regarding the possible cause of the  'Failed rank condition: MY.Rank > MY.CurrentRank' error. 

Thanks for you help
J-Chris