[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Apparent bug in job log file formatting for universe=grid



On Tue, 30 May 2006, Gabriel Mateescu wrote:

> I am using Condor 6.7.17 in two cases:
>
>   (1) with
>
>         universe = vanilla
>         log_xml = True
>
>
>    (2) with
>
>          Universe = grid
>          grid_type = gt4
>          log_xml = True
>
>
> The job log file in case (1) is indeed XML-formatted,
> but in case (2) the log_xml attribute is applied only
> to the "SubmitEvent", the other events are not
> XML-formatted, as shown below:
>
>
>
> 1. Good job log file for case (1)
>
> <c>
>     <a n="MyType"><s>SubmitEvent</s></a>
>     <a n="EventTypeNumber"><i>0</i></a>
>     <a n="EventTime"><s>2006-05-29T17:19:57</s></a>
>     <a n="Cluster"><i>68</i></a>
>     <a n="Proc"><i>0</i></a>
>     <a n="Subproc"><i>0</i></a>
>     <a n="SubmitHost"><s>&lt;IP_ADDR:32772&gt;</s></a>
> </c>
> <c>
>     <a n="MyType"><s>ExecuteEvent</s></a>
>     <a n="EventTypeNumber"><i>1</i></a>
>     <a n="EventTime"><s>2006-05-29T17:20:02</s></a>
>     <a n="Cluster"><i>68</i></a>
>     <a n="Proc"><i>0</i></a>
>     <a n="Subproc"><i>0</i></a>
>     <a n="ExecuteHost"><s>&lt;IP_ADDR:32773&gt;</s></a>
> </c>
> <c>
>     <a n="MyType"><s>JobTerminatedEvent</s></a>
>     <a n="EventTypeNumber"><i>5</i></a>
>     <a n="EventTime"><s>2006-05-29T17:20:02</s></a>
>     <a n="Cluster"><i>68</i></a>
>     <a n="Proc"><i>0</i></a>
>     <a n="Subproc"><i>0</i></a>
>     <a n="TerminatedNormally"><b v="t"/></a>
>     <a n="ReturnValue"><i>0</i></a>
>     <a n="RunLocalUsage"><s>Usr 0 00:00:00, Sys 0 00:00:00</s></a>
>     <a n="RunRemoteUsage"><s>Usr 0 00:00:00, Sys 0 00:00:00</s></a>
>     <a n="TotalLocalUsage"><s>Usr 0 00:00:00, Sys 0 00:00:00</s></a>
>     <a n="TotalRemoteUsage"><s>Usr 0 00:00:00, Sys 0 00:00:00</s></a>
>     <a n="SentBytes"><r>0.000000000000000E+00</r></a>
>     <a n="ReceivedBytes"><r>0.000000000000000E+00</r></a>
>     <a n="TotalSentBytes"><r>0.000000000000000E+00</r></a>
>     <a n="TotalReceivedBytes"><r>0.000000000000000E+00</r></a>
> </c>
>
>
>
> 2. Partially XML-formatted log file in case (2)
>
> <c>
>     <a n="MyType"><s>SubmitEvent</s></a>
>     <a n="EventTypeNumber"><i>0</i></a>
>     <a n="EventTime"><s>2006-05-29T19:42:52</s></a>
>     <a n="Cluster"><i>71</i></a>
>     <a n="Proc"><i>0</i></a>
>     <a n="Subproc"><i>0</i></a>
>     <a n="SubmitHost"><s>&lt;IP_ADDR:32772&gt;</s></a>
> </c>
> 017 (071.000.000) 05/29 19:43:28 Job submitted to Globus
>     RM-Contact: FQDN:8443
>     JM-Contact:
> https://IP_ADDR:8443/wsrf/services/ManagedExecutableJobService?\
> e6dd7330-ef6c-11da-96e8-dda4735ffe52
>     Can-Restart-JM: 0
> ...
> 027 (071.000.000) 05/29 19:43:28 Job submitted to grid resource
>     GridResource: gt4 FQDN:8443 Fork
>     GridJobId: gt4
> https://IP_ADDRE:8443/wsrf/services/ManagedExecutableJobService?\
> e6dd7330-ef6c-11da-96e8-dda4735ffe52
> ...
> 001 (071.000.000) 05/29 19:43:35 Job executing on host: gt4 FQDN:8443
> Fork
> ...
> 005 (071.000.000) 05/29 19:43:45 Job terminated.
>         (1) Normal termination (return value 0)
>                 Usr 0 00:00:00, Sys 0 00:00:00  -  Run Remote Usage
>                 Usr 0 00:00:00, Sys 0 00:00:00  -  Run Local Usage
>                 Usr 0 00:00:00, Sys 0 00:00:00  -  Total Remote Usage
>                 Usr 0 00:00:00, Sys 0 00:00:00  -  Total Local Usage
>         0  -  Run Bytes Sent By Job
>         0  -  Run Bytes Received By Job
>         0  -  Total Bytes Sent By Job
>         0  -  Total Bytes Received By Job

Okay, thanks for the report!  We will take a look at this...  I can't
think of any workaround offhand besides just not using XML userlogs.

Kent Wenger
Condor Team