[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[HTCondor-users] Specific Nodes don't run job!



Hi All,

Running tasks on two specific nodes in my cluster gives me the below log. As it can be seen it actually doesn't do anything!! This happens only with these specific two nodes. Same task is OK with any other node in the cluster. It is interesting that even running the task manually on these two nodes is OK.
I can exclude these two nodes from the cluster but I am looking for a proper solution to get them work as well. Any suggestions?

Regards,

Mosy


000 (184617.000.000) 04/14 14:48:44 Job submitted from host: <xxx.xx.xxx.xxx:xxxx>
...
001 (184617.000.000) 04/14 14:49:07 Job executing on host: <yyy.yy.yyy.yyy:yyyyy>
...
006 (184617.000.000) 04/14 14:49:08 Image size of job updated: 2
0 Â- ÂMemoryUsage of job (MB)
0 Â- ÂResidentSetSize of job (KB)
...
005 (184617.000.000) 04/14 14:49:08 Job terminated.
(1) Normal termination (return value 1)
Usr 0 00:00:00, Sys 0 00:00:00 Â- ÂRun Remote Usage
Usr 0 00:00:00, Sys 0 00:00:00 Â- ÂRun Local Usage
Usr 0 00:00:00, Sys 0 00:00:00 Â- ÂTotal Remote Usage
Usr 0 00:00:00, Sys 0 00:00:00 Â- ÂTotal Local Usage
79 Â- ÂRun Bytes Sent By Job
22692 Â- ÂRun Bytes Received By Job
79 Â- ÂTotal Bytes Sent By Job
22692 Â- ÂTotal Bytes Received By Job
Partitionable Resources :  ÂUsage ÂRequest     Â
 Cpus         :         1     Â
 Disk (KB)      Â:    25    25     Â
 Memory (MB)     Â:    Â0    Â0     Â
...