[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[HTCondor-users] "aborted by the user" in successful job



Dear htcondor users,

Is there a reason why I would see abort events (see [1]) in the logs of my successful condor jobs. I have not run condor_rm on the job below, which is why it finished normally and returns the desired output. The full order of log events is below at [2].Â

I am trying to parse the log files to determine which jobs were aborted and need to be re-run. However, the abort event (i.e 009) is appearing in log files that were not aborted and so I cannot use that as a handle for identifying user aborted jobs.Â

Thanks,
Alex

[1]
009 (986318.954.000) 04/08 13:20:03 Job was aborted by the user.
    via condor_rm (by user alarmstr)

[2]
000 (986321.018.000) 04/08 13:16:38 Job submitted from host:
028 (986321.018.000) 04/08 13:16:38 Job ad information event triggered.
001 (986321.018.000) 04/08 13:17:19 Job executing on host:
028 (986321.018.000) 04/08 13:17:19 Job ad information event triggered.
006 (986321.018.000) 04/08 13:17:27 Image size of job updated: 34912
028 (986321.018.000) 04/08 13:17:27 Job ad information event triggered.
024 (986318.954.000) 04/08 13:20:03 Job reconnection failed
028 (986318.954.000) 04/08 13:20:03 Job ad information event triggered.
009 (986318.954.000) 04/08 13:20:03 Job was aborted by the user.
028 (986318.954.000) 04/08 13:20:03 Job ad information event triggered.
006 (986321.018.000) 04/08 13:22:27 Image size of job updated: 991768
028 (986321.018.000) 04/08 13:22:27 Job ad information event triggered.
005 (986321.018.000) 04/08 13:24:19 Job terminated.
(1) Normal termination (return value 0)
028 (986321.018.000) 04/08 13:24:19 Job ad information event triggered.