[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] stdout/stderr for evicted jobs



Am 26.07.22 um 22:51 schrieb Todd L Miller:
1. How can one make a grid job fail on its very first eviction?
I do not want HTCondor to try and "rescue" such jobs...

 ÂÂÂÂThere's not, AFAIK, a way to specify this directly. It may work to set periodic_remove to something like NumJobStarts > 1.

2. How can one get the stdout/stderr for jobs that ended up evicted?
I am trying to debug something that would be helped by having the
the stdout/stderr as they were at the time of eviction...
when_to_transfer_output = ON_EXIT_OR_EVICT does not work for that.

 ÂÂÂÂIf these jobs are also grid universe, I'm not aware of any way to do this.

In case they are not Grid universe, I think that either the two lines:

 WHEN_TO_TRANSFER_OUTPUT = ON_EXIT_OR_EVICT
 +SpoolOnEvict = False

or the two lines:

 stream_output = True
 stream_error = True

should work (the latter being more expensive, but may be helpful for debugging).

Cheers and hope that helps,
	Oliver


- ToddM
_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/


--
Oliver Freyermuth
UniversitÃt Bonn
Physikalisches Institut, Raum 1.047
NuÃallee 12
53115 Bonn
--
Tel.: +49 228 73 2367
Fax:  +49 228 73 7869
--