
[HTCondor-users] Fwd: Condor held my job every time





Hello everybody,



I arranged to present a workshop about HTCondor next Wednesday. While preparing my presentation I was shocked to find that my Condor doesn't execute any jobs:
it just submits the job to the cluster and nothing is executed!

I installed Condor as a personal Condor, and I want to run a simple job to completion so I can show it in my presentation.


When I run condor_status:

Name               OpSys      Arch   State     Activity LoadAv Mem   ActvtyTime

slot1@NovelHIC-02  WINDOWS    X86_64 Unclaimed Idle     0.000  2034  0+00:00:11
slot2@NovelHIC-02  WINDOWS    X86_64 Unclaimed Idle     0.000  2034  0+00:09:24
slot3@NovelHIC-02  WINDOWS    X86_64 Unclaimed Idle     0.000  2034  0+00:10:06
slot4@NovelHIC-02  WINDOWS    X86_64 Unclaimed Idle     0.000  2034  0+00:05:07
slot5@NovelHIC-02  WINDOWS    X86_64 Unclaimed Idle     0.000  2034  0+00:05:08
slot6@NovelHIC-02  WINDOWS    X86_64 Unclaimed Idle     0.000  2034  0+00:05:09
slot7@NovelHIC-02  WINDOWS    X86_64 Unclaimed Idle     0.000  2034  0+00:05:10
slot8@NovelHIC-02  WINDOWS    X86_64 Unclaimed Idle     0.000  2034  0+00:10:03
                     Total Owner Claimed Unclaimed Matched Preempting Backfill

      X86_64/WINDOWS     8     0       0         8       0          0        0

               Total     8     0       0         8       0          0        0

The output of condor_q -analyze is:

---
035.000:  Request is held.

Hold reason: Error from slot2@NovelHIC-02: Failed to open 'C:\Users\Administrator\Desktop\Condor workshop KSU\Job.1\input.txt' as standard input: Permission denied (errno 13)

---
036.000:  Request is held.

Hold reason: Error from slot1@NovelHIC-02: Failed to open 'C:\Users\Administrator\Desktop\Condor workshop KSU\Job.1\input.txt' as standard input: Permission denied (errno 13)

---
037.000:  Request is held.

Hold reason: Error from slot1@NovelHIC-02: Failed to open 'C:\Users\Administrator\Desktop\Condor workshop KSU\Job.1\input.txt' as standard input: Permission denied (errno 13)

---
038.000:  Request is held.

Hold reason: Error from slot1@NovelHIC-02: Failed to open 'C:\Users\Administrator\Desktop\Condor workshop KSU\Job.1\input.txt' as standard input: Permission denied (errno 13)


I then read something about condor_credd that told me to modify my condor_config.local file with this:
######################################################################
##
##  condor_config.credd
##
##  This is the default local configuration file for the machine
##  running the condor_credd.  You should copy this file to the
##  appropriate location and customize it for your needs.  
##
######################################################################

## Note: The following settings will need to be present in your
## global config file:
##
##   CREDD_HOST = my-credd.cs.wisc.edu
##   STARTER_ALLOW_RUNAS_OWNER = True
##   CREDD_CACHE_LOCALLY = True
##
## You'll also need to ensure that clients are configured to use
## PASSWORD authentication on any machine that can run jobs as the
## submitting user. For example,
##
##   SEC_CLIENT_AUTHENTICATION_METHODS = NTSSPI, PASSWORD
##
## And finally, you'll need to enable CONFIG-level access for all
## machines in the pool so that the pool password can be stored:
##
##   ALLOW_CONFIG = Administrator@*
##   SEC_CONFIG_NEGOTIATION = REQUIRED
##   SEC_CONFIG_AUTHENTICATION = REQUIRED
##   SEC_CONFIG_ENCRYPTION = REQUIRED
##   SEC_CONFIG_INTEGRITY = REQUIRED
##
## See the "Executing Jobs as the Submitting User" section of the
## Condor manual for further details.

## CREDD_SETTINGS

## CREDD logging settings
## Customize these if you wish.
CREDD_LOG = $(LOG)/CreddLog
CREDD_DEBUG = D_COMMAND
MAX_CREDD_LOG = 50000000

#################################################
## CREDD Expert settings
## Everything below is for the UBER-KNOWLEDGEABLE only!
## Do not change these unless you know what you do!
#################################################


DAEMON_LIST = $(DAEMON_LIST), CREDD
#DC_DAEMON_LIST = \
#MASTER, STARTD, SCHEDD, KBDD, COLLECTOR, NEGOTIATOR, EVENTD, \
#VIEW_SERVER, CONDOR_VIEW, VIEW_COLLECTOR, HAWKEYE, CREDD, HAD, \
#QUILL

CREDD    = $(SBIN)/condor_credd.exe

# Timeout session quickly since we normally only get contacted
# once per starter
SEC_CREDD_SESSION_TIMEOUT = 10


# Set security settings so that full security to the credd is required
CREDD.SEC_DEFAULT_AUTHENTICATION = REQUIRED
CREDD.SEC_DEFAULT_ENCRYPTION = REQUIRED
CREDD.SEC_DEFAULT_INTEGRITY = REQUIRED
CREDD.SEC_DEFAULT_NEGOTIATION = REQUIRED 

# Require PASSWORD auth for password fetching
CREDD.SEC_DAEMON_AUTHENTICATION_METHODS = PASSWORD

# Only honor password fetch requests to the trusted "condor_pool" user
CREDD.ALLOW_DAEMON = condor_pool@$(UID_DOMAIN)

# Require NTSSPI for storing credentials
CREDD.SEC_DEFAULT_AUTHENTICATION_METHODS = NTSSPI

I just copied this from the example file and pasted it in,

and I added "run_as_owner = $(CREDD_HOST)" to my submit file.
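
For context, a minimal sketch of a submit file for this kind of job might look like the following. It is not the exact file used here: the executable name and the output, error, and log file names are placeholders, and only input.txt is taken from the hold messages above. The HTCondor manual documents run_as_owner as a boolean submit command (True or False), so the sketch uses True rather than $(CREDD_HOST).

```
# Hypothetical submit description file; all names except input.txt are placeholders.
universe     = vanilla
executable   = myjob.exe

# input.txt is read as the job's standard input, as in the hold messages.
input        = input.txt
output       = output.txt
error        = error.txt
log          = job.log

# Boolean: ask HTCondor to run the job under the submitting user's account.
run_as_owner = True

queue
```

Running as the owner on Windows also assumes that the submitting user's password has already been stored with condor_store_cred add.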

The condor_credd is running now (I checked in Task Manager), but a new problem has appeared:

all jobs start running at first and then go on hold!





Could anyone help me? I have a workshop about Condor tomorrow and my Condor is now giving errors :(