[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] shadow started and die with signal 11



Hi,
i have installed condor-6-6.10 under mandrake 9.2 without shared file system. all daemons starts and every think is okay but when i test hello.c example (in standard or vanilla mode)i does not run and schedlog show exception like this:
 
1/6 19:08:24 Shadow pid 5389 died with signal 11
1/6 19:08:24 Started shadow for job 2.0 on "<192.168.0.2:1036>", (shadow pid = 5390)
1/6 19:08:24 Shadow pid 5390 died with signal 11
1/6 19:08:24 Started shadow for job 2.0 on "<192.168.0.2:1036>", (shadow pid = 5391)

for submitting job in vanilla mode the exception is:
 
1/6 19:32:53 Checking consistency running and runnable jobs
1/6 19:32:53 Tables are consistent
1/6 19:32:53 Out of jobs - 1 jobs matched, 0 jobs idle, flock level = 0
1/6 19:32:55 Started shadow for job 5.0 on "<192.168.0.1:1271>", (shadow pid = 6188)
1/6 19:32:56 Shadow pid 6188 for job 5.0 exited with status 44
1/6 19:32:56 ERROR: Shadow had fatal error writing to its log file.
1/6 19:32:57 Started shadow for job 5.0 on "<192.168.0.1:1271>", (shadow pid = 6189)
1/6 19:32:57 Shadow pid 6189 for job 5.0 exited with status 44
1/6 19:32:57 ERROR: Shadow had fatal error writing to its log file.
1/6 19:32:57 Started shadow for job 5.0 on "<192.168.0.1:1271>", (shadow pid = 6190)
1/6 19:32:57 Shadow pid 6190 for job 5.0 exited with status 44
1/6 1! 9:32:57 ERROR: Shadow had fatal error writing to its log file.
1/6 19:32:57 Started shadow for job 5.0 on "<192.168.0.1:1271>", (shadow pid = 6191)
1/6 19:32:57 Shadow pid 6191 for job 5.0 exited with status 44
1/6 19:32:57 ERROR: Shadow had fatal error writing to its log file.
1/6 19:32:57 Started shadow for job 5.0 on "<192.168.0.1:1271>", (shadow pid = 6192)
1/6 19:32:58 Sent ad to central manager for condor@xxxxxxxxxxxxxxxxx
1/6 19:32:58 Shadow pid 6192 for job 5.0 exited with status 44
1/6 19:32:58 ERROR: Shadow had fatal error writing to its log file.
1/6 19:32:58 Match for cluster 5 has had 5 shadow exceptions, relinquishing.
1/6 19:32:58 Sent RELEASE_CLAIM to startd on <192.168.0.1:1271>
1/6 19:32:58 Match record (<192.168.0.1:1271>, 5, 0) deleted
1/6 19:32:58 DaemonCore: Command received via TCP from host <192.168.0.1:1347>
1/6 19:32:58 DaemonCore: received command 443 (VACATE_SERVICE), calling handler (vacate_service)
1/6 19:32:58 Got VACATE_SERVICE from <192.168.0.1:1347>
1/6 19:37:58 Sent ad to central manager for condor@xxxxxxx.
 
Please help me because i have not found response in mailing list
(i did the test on further machines..errors are same)
thanks.
 
 


Nouveau : téléphonez moins cher avec Yahoo! Messenger ! Découvez les tarifs exceptionnels pour appeler la France et l'international. Téléchargez la version beta.