[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] Failed to send REQUEST_CLAIM to startd



When the schedd said "hey, I want to run a job", the startd said, "You
cannot run here".  So the schedd says the match is bad and deletes it,
so that it can try to match the job the next time it talks to the
negotiator.

You need to log into the helike.physics.tamu.edu machine, if possible,
and check the StartLog to see what the exact problem is on the execute
side. Otherwise, you will need to talk to the admin of that machine.

Nathan Panike

On Wed, May 15, 2013 at 09:13:14AM -0500, Cody Belcher wrote:
> Can someone explain to me what this means and how to fix it? I've
> been trying to figure out a relyable way to submit Mathematica jobs
> to condor so that I can write a how to for my users, but the jobs
> stay in idle state. I believe this is the reason why.
> 
> 05/15/13 09:07:02 (pid:71980) Sent ad to central manager for
> codytrey@xxxxxxxxxxxxxxxx
> 05/15/13 09:07:02 (pid:71980) Sent ad to 1 collectors for
> codytrey@xxxxxxxxxxxxxxxx
> 05/15/13 09:07:02 (pid:71980) Response problem from startd when
> requesting claim slot1@xxxxxxxxxxxxxxxxxxxxxxx
> <128.194.151.209:49154> for codytrey 20.0.
> 05/15/13 09:07:02 (pid:71980) Failed to send REQUEST_CLAIM to startd
> slot1@xxxxxxxxxxxxxxxxxxxxxxx <128.194.151.209:49154> for codytrey:
> CEDAR:6004:failed reading from socket
> 05/15/13 09:07:02 (pid:71980) Match record
> (slot1@xxxxxxxxxxxxxxxxxxxxxxx <128.194.151.209:49154> for codytrey,
> 20.0) deleted
> 
> Much thanks,
> 
> Cody Belcher