[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] couldn't create dir, permission denied



I am running the central manager on Linux and a submission host on Windows XP (condor1.mydomain.com). When I submit the job from Windows XP, the job could not start as the permission is denied to create a directory. I could see the job status turn from "running" to "idle".
 
The shadow log indicates that the request is accepted. And directory "/home/aa0586/pool/execute/" allows owner/group/others full access RWX.
 
Help is appreciated.
 
Thanks and regards,
 
Zhifeng

 
 
starter log
 
4/1 21:36:36 Communicating with shadow <192.168.0.25:1776>
4/1 21:36:36 Submitting machine is "condor1.mydomain.com"
4/1 21:36:36 setting the orig job name in starter
4/1 21:36:36 setting the orig job iwd in starter
4/1 21:36:36 couldn't create dir /home/aa0586/pool/execute/dir_3255: Permission denied
4/1 21:36:36 Failed to initialize JobInfoCommunicator, aborting
4/1 21:36:36 Unable to start job.
 
 
Shadow log
4/1 22:10:18 (4.0) (5732): Request to run on <192.168.0.20:32893> was ACCEPTED
4/1 22:10:18 (4.0) (5732): Can no longer talk to condor_starter <192.168.0.20:32893>
4/1 22:10:18 (4.0) (5732): Trying to reconnect to disconnected job
4/1 22:10:18 (4.0) (5732): LastJobLeaseRenewal: 1207102218 Tue Apr 01 22:10:18 2008
4/1 22:10:18 (4.0) (5732): JobLeaseDuration: 1200 seconds
4/1 22:10:18 (4.0) (5732): JobLeaseDuration remaining: 1200
4/1 22:10:18 (4.0) (5732): Attempting to locate disconnected starter
4/1 22:10:18 (4.0) (5732): locateStarter(): ClaimId (<192.168.0.20:32893>#1207105608#4#68048866) and GlobalJobId ( amd#1207102215#4.0 ) not found
4/1 22:10:18 (4.0) (5732): Reconnect FAILED: Job not found at execution machine
4/1 22:10:18 (4.0) (5732): **** condor_shadow (condor_SHADOW) EXITING WITH STATUS 107