
Re: [HTCondor-users] Condor runs immediately go to "Held" state



Please advise; I still need assistance with this as my customer is under a deadline. Thank you. Please see below for a sample of the log file after trying a run:

 

“0  -  Run Bytes Received By Job

...

007 (1863.018.000) 12/27 09:30:12 Shadow exception!

Error from slot19@xxxxxxxxxxxxxxxxx: Failed to open '/some/path/filename.inp' as standard input: No such file or directory (errno 2)”
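For anyone reading this log excerpt: errno 2 is ENOENT, so the path could not be found on the machine that tried to open it. The "Error from slot19@..." prefix suggests that machine was the execute slot, not the submit host. Below is a minimal sketch of a check that can be run on the head node and on a compute node to compare what each one sees. The helper name `check_input` is hypothetical and is not part of HTCondor. The path is the placeholder from the log. The sketch assumes a POSIX shell is available on each node.

```shell
#!/bin/sh
# Hypothetical helper (not an HTCondor tool): report whether a job's
# standard-input file is present and readable on the machine running this.
check_input() {
    path="$1"
    if [ ! -e "$path" ]; then
        # Same condition the log reports as errno 2 (ENOENT).
        echo "missing: $path"
        return 2
    fi
    if [ ! -r "$path" ]; then
        # The file exists, but the current user cannot read it.
        echo "not readable: $path"
        return 1
    fi
    echo "ok: $path"
}
```

Usage, with the placeholder path replaced by the real one: `check_input /some/path/filename.inp`. If the head node and a compute node report different results, that may mean the directory is not mounted or not visible on the compute node. This is only one possible explanation.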

 

Some context: I have one head node (this is what we log into to submit runs) that is running RHEL6, and 9 compute nodes.

 

Thanks for any assistance you can provide.

 

 

 

Nate Mobley

Millennium Engineering & Integration Company

ISSO/Systems Administrator

Desk: (256) 489-7847

Cell (Voice Only): (256) 655-5570

MEI Help Desk:  (703) 413-7771

nmobley@xxxxxxxxxxxxxx

www.meicompany.com

 

From: Mobley, Nate (Millennium)
Sent: Wednesday, December 27, 2017 10:46 AM
To: 'htcondor-admin@xxxxxxxxxxx' <htcondor-admin@xxxxxxxxxxx>; htcondor-users@xxxxxxxxxxx
Subject: Condor runs immediately go to "Held" state

 

I’ve rebooted all 9 of my compute nodes and my head node, which usually clears up this issue. See below for how I’m submitting test runs, and the error I receive when changing permissions on the data before the run. I’m not very experienced with HTCondor yet, so any advice will be appreciated. Also, attached is an excerpt of the log file from my test run this morning.

 

Command: cd /some/path/

 

Command: chmod -R ugo+rwx *

 

Sample of error: chmod: changing permissions of ` /some/path/filename': Operation not permitted

 
