[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] I: Job not executing on host.....but central manager submit it



Any suggestion?

This is StartLog file on a worker node.

4/20 07:24:11 ******************************************************
4/20 07:24:11 ** condor_startd (CONDOR_STARTD) STARTING UP
4/20 07:24:11 ** /usr/sbin/condor_startd
4/20 07:24:11 ** SubsystemInfo: name=STARTD type=STARTD(7) class=DAEMON(1)
4/20 07:24:11 ** Configuration: subsystem:STARTD local:<NONE> class:DAEMON
4/20 07:24:11 ** $CondorVersion: 7.2.4 Aug 22 2009 $
4/20 07:24:11 ** $CondorPlatform: I386-LINUX_DEBIAN_UNKNOWN $
4/20 07:24:11 ** PID = 994
4/20 07:24:11 ** Log last touched 4/16 14:17:49
4/20 07:24:11 ******************************************************
4/20 07:24:11 Using config source: /etc/condor/condor_config
4/20 07:24:11 Using local config sources:
4/20 07:24:11    /etc/condor_config.local
4/20 07:24:11 DaemonCore: Command Socket at <10.195.111.210:45619>
4/20 07:24:11 ioctl(SIOCETHTOOL/GWOL) failed: Operation not supported (95)
4/20 07:24:11 You can safely ignore the above error if you're not using hibernation
4/20 07:24:19 New machine resource allocated
4/20 07:24:19 Unable to calculate keyboard/mouse idle time due to them both being USB or not present, assuming infinite idle time for these devices.
4/20 07:24:19 About to run initial benchmarks.
4/20 07:24:25 Completed initial benchmarks.
4/20 07:24:25 State change: IS_OWNER is false
4/20 07:24:25 Changing state: Owner -> Unclaimed
4/20 07:28:02 Got SIGHUP.  Re-reading config files.

Thanks.

--- Lun 19/4/10, michele pierri <pierm4ci@xxxxxxxx> ha scritto:

Da: michele pierri <pierm4ci@xxxxxxxx>
Oggetto: Job not executing on host.....but central manager submit it
A: condor-users@xxxxxxxxxxx
Data: Lunedì 19 Aprile 2010, 17:45

Hi,
I have a central manager that can submit job and one node.
This machine are hosted on amazon ec2 and they are based on ubuntu 9.10.

I am trying to send a job from the manager/submit machine to the node,
But I have a problem:
When I launch
condor_submit file.job        I receive:

Submitting job(s).
Logging submit event(s).
1 job(s) submitted to cluster 2.
ubuntu@domU-xxx:~$ condor_q

-- Submitter: domU-12-31-39-00-4C-F2.compute-1.internal : <10.254.83.0:41197> : domU-12-31-39-00-4C-F2.compute-1.internal
 ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD              
   2.0   ubuntu          4/19 15:33   0+00:00:00 I  0   0.0  batchj.sh        

1 jobs; 1 idle, 0 running, 0 held

The problem is that the status is always idle and my job is never running.

Log file shows only:
000 (002.000.000) 04/19 15:33:33 Job submitted from host: <10.254.83.0:41197>
...

So I think that the job is submitted but the node can't receive or execute it...

What can I do?
What is wrong?

I have set HOSTALLOW_READ and HOSTALLOW_WRITE to *.

Thanks a lot.