[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[condor-users] invalid sessions causing job preemption



Hi,

On a Condor 6.6.0 pool running on top of RedHat 9 we are seeing
the following in some of the StartLogs:

2/9 10:33:37 DC_AUTHENTICATE: attempt to open invalid session node34:6631:1076347765:1271, failing.
2/9 10:38:38 DC_AUTHENTICATE: attempt to open invalid session node34:6631:1076347765:1271, failing.
2/9 10:43:37 State change: claim timed out (condor_schedd gone?)
2/9 10:43:37 Changing state and activity: Claimed/Busy -> Preempting/Killing
2/9 10:43:38 Got ALIVE while in Preempting state, ignoring.
2/9 10:43:49 DC_AUTHENTICATE: attempt to open invalid session node34:6631:1076347835:1272, failing.
2/9 10:43:49 Starter pid 32656 exited with status 0
2/9 10:43:49 State change: starter exited
2/9 10:43:49 State change: No preempting claim, returning to owner
2/9 10:43:49 Changing state and activity: Preempting/Killing -> Owner/Idle
2/9 10:43:49 State change: IS_OWNER is false
2/9 10:43:49 Changing state: Owner -> Unclaimed

So it looks like some problem with an invalid session is causing jobs
to be preempted.

Can you tell us what can cause these problems with invalid sessions?

Thanks,

Scott

Condor Support Information:
http://www.cs.wisc.edu/condor/condor-support/
To Unsubscribe, send mail to majordomo@xxxxxxxxxxx with
unsubscribe condor-users <your_email_address>