[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Are UTC and local time mixed in the condor log file?



Hi Jose,
 I just finished some tests, trying to get your "error".
 The only way to get a job finished before the submit time is that the
submit node's date change When the job is already running.
 The finish time is not report from the execute nodes, and if you run
the submit and change the date, the job simply will not start.
 So, for some reason your submit node's date "was" changed after your
jobs start.

I hope this help you.
Example:
 I start this task (ls -lR /usr) and when condor_q show the job running I ran:
 date 11171113
 in the submit/master node. And I got same behavior like you.

001 (008.001.000) 11/17 12:13:02 Job executing on host: <192.168.2.12:50208>
...
006 (008.001.000) 11/17 12:13:11 Image size of job updated: 2824
005 (008.001.000) 11/17 11:13:20 Job terminated.
        (1) Normal termination (return value 0)
                Usr 0 00:00:01, Sys 0 00:00:04  -  Run Remote Usage
                Usr 0 00:00:00, Sys 0 00:00:00  -  Run Local Usage
                Usr 0 00:00:01, Sys 0 00:00:04  -  Total Remote Usage
                Usr 0 00:00:00, Sys 0 00:00:00  -  Total Local Usage
        10428298  -  Run Bytes Sent By Job
        0  -  Run Bytes Received By Job
        10428298  -  Total Bytes Sent By Job
        0  -  Total Bytes Received By Job
--------------------
 Jose are you in GridColombia's list?

On 11/17/10, Jose Caballero <jcaballero.hep@xxxxxxxxx> wrote:
> Hi,
>
> let's have a look to this log file
>
> ---------------------------------------------------------------------------------------------------------------------------------------------------
>
> 000 (428178.000.000) 11/17 07:15:49 Job submitted from host:<
> 130.199.205.51:60618>
> ...
> 001 (428178.000.000) 11/17 02:25:56 Job executing on host:<
> 130.199.205.53:38320>
> ...
> 006 (428178.000.000) 11/17 02:26:05 Image size of job updated: 204180
> ...
> 006 (428178.000.000) 11/17 02:31:06 Image size of job updated: 362200
> ...
> 005 (428178.000.000) 11/17 02:36:06 Job terminated.
>     (1) Normal termination (return value 0)
>         Usr 0 00:00:00, Sys 0 00:00:00  -  Run Remote Usage
>         Usr 0 00:00:00, Sys 0 00:00:00  -  Run Local Usage
>         Usr 0 00:00:00, Sys 0 00:00:00  -  Total Remote Usage
>         Usr 0 00:00:00, Sys 0 00:00:00  -  Total Local Usage
>     6942  -  Run Bytes Sent By Job
>     4997  -  Run Bytes Received By Job
>     6942  -  Total Bytes Sent By Job
>     4997  -  Total Bytes Received By Job
> ...
>
> ---------------------------------------------------------------------------------------------------------------------------------------------------
>
>
>
> Seems to me like two different time formats are mixed.
> Is that possible or there is a rational explanation for this?
> Otherwise, I cannot see how is possible the job was submitted 5 hours
> "after" it started to run....
>
>
> Any comment is more than welcome.
> Cheers,
> Jose
>


-- 
----
Edier Alberto Zapata Hernández
Est. Ingeniería de Sistemas
Universidad de Valle