[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Jobs fetched with a hook being killed after 20 minutes



When you turn on D_FULLDEBUG you don't see the startd at least attempting to fix up the lease time?

You can also throw D_DAEMONCORE into the mix to see what is being done with the timer.

Best,


matt

Ian Chesal wrote:
3/26 11:04:34 State change: Finished fetching work successfully
3/26 11:04:34 Changing state: Unclaimed -> Claimed
3/26 11:04:34 Warning: starting ClaimLease timer before lease duration
set.
3/26 11:04:34 Remote job ID is 40899.0
3/26 11:04:34 Got universe "VANILLA" (5) from request classad
3/26 11:04:34 Changing activity: Idle -> Busy

And 6.5 minutes later it's still running. No claim expired messages in
the
StartLog.

No change in behaviour. Exactly 20 minutes later I get:

3/26 11:24:34 State change: claim lease expired (condor_schedd gone?)

-  Ian

Confidentiality Notice.
This message may contain information that is confidential or otherwise protected from disclosure. If you are not the intended recipient, you are hereby notified that any use, disclosure, dissemination, distribution,  or copying  of this message, or any attachments, is strictly prohibited.  If you have received this message in error, please advise the sender by reply e-mail, and delete the message and any attachments.  Thank you.

_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at: https://lists.cs.wisc.edu/archive/condor-users/