
[HTCondor-users] Hold reason: Error from slot1@... Failed to open /var/lib/condor/spool...



Hello HTCondor list,

I'm trying to submit a simple job with the -remote option of condor_submit, but I get this error message:

$ condor_submit -remote loginicx.ib0.cm.linea.gov.br vj.sub
Submitting job(s)
ERROR: Failed to connect to queue manager loginicx.ib0.cm.linea.gov.br
AUTHENTICATE:1003:Failed to authenticate with any method
AUTHENTICATE:1004:Failed to authenticate using GSI
GSI:5003:Failed to authenticate.  Globus is reporting error (851968:38).  There is probably a problem with your credentials.  (Did you run grid-proxy-init?)
AUTHENTICATE:1004:Failed to authenticate using KERBEROS
AUTHENTICATE:1004:Failed to authenticate using FS

While looking for a solution, I found this thread: https://www-auth.cs.wisc.edu/lists/htcondor-users/2016-July/msg00060.shtml and, after setting the parameters below, I can now submit a job.

# on the server
SEC_DEFAULT_AUTHENTICATION_METHODS = FS, CLAIMTOBE, $(SEC_DEFAULT_AUTHENTICATION_METHODS)

# on the client
SEC_CLIENT_AUTHENTICATION_METHODS = FS, CLAIMTOBE, $(SEC_CLIENT_AUTHENTICATION_METHODS)


However, as the output below shows, HTCondor puts the job on hold. If I submit the same submit file directly from the submit host, it runs without errors.

---
47128.000:  Job is held.
Hold reason: Error from slot1@xxxxxxxxxxxxxxxxxxxxxxxxx: Failed to open '/var/lib/condor/spool/7128/0/cluster47128.proc0.subproc0/_condor_stdout' as standard output: No such file or directory (errno 2)



Any hints on how to solve this problem?


My HTCondor version is:
$CondorVersion: 8.8.1 Feb 18 2019 BuildID: 461773 PackageID: 8.8.1-1 $
$CondorPlatform: x86_64_RedHat7 $



Thanks,


--
C. Adean