[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[HTCondor-users] Job Submission in parallel Universe



I do add these entries in /etc/condor/condor_config.local

DedicatedScheduler = “DedicatedScheduler@muak@submit.foo.com”
muak is my user in linux
STARTD_EXPRS = $(START_EXPRS), DedicatedScheduler

And confirm with this command

$ condor_status –const ‘DedicatedScheduler =!= NULL’

But it wont show anything...

Once I tried job completely executed but didn't give output.
Here is my condor_config.local and log file
Please do check and guide me where did I do anything wrong...

I take guidence from presentation in this link
http://www.docstoc.com/docs/43721750/Condor-Parallel-Universe-European-Condor-Week-2008-Barcelona
000 (018.000.000) 04/09 23:32:14 Job submitted from host: <10.0.0.1:54389>
...
014 (018.000.000) 04/09 23:32:16 Node 0 executing on host: <10.0.0.1:55426>
...
001 (018.000.000) 04/09 23:32:16 Job executing on host: MPI_job
...
015 (018.000.000) 04/09 23:32:16 Node 0 terminated.
	(1) Normal termination (return value 127)
		Usr 0 00:00:00, Sys 0 00:00:00  -  Run Remote Usage
		Usr 0 00:00:00, Sys 0 00:00:00  -  Run Local Usage
		Usr 0 00:00:00, Sys 0 00:00:00  -  Total Remote Usage
		Usr 0 00:00:00, Sys 0 00:00:00  -  Total Local Usage
	0  -  Run Bytes Sent By Node
	0  -  Run Bytes Received By Node
	0  -  Total Bytes Sent By Node
	0  -  Total Bytes Received By Node
...
005 (018.000.000) 04/09 23:32:17 Job terminated.
	(1) Normal termination (return value 127)
		Usr 0 00:00:00, Sys 0 00:00:00  -  Run Remote Usage
		Usr 0 00:00:00, Sys 0 00:00:00  -  Run Local Usage
		Usr 0 00:00:00, Sys 0 00:00:00  -  Total Remote Usage
		Usr 0 00:00:00, Sys 0 00:00:00  -  Total Local Usage
	0  -  Run Bytes Sent By Job
	0  -  Run Bytes Received By Job
	0  -  Total Bytes Sent By Job
	0  -  Total Bytes Received By Job
	Partitionable Resources :    Usage  Request          
	   Cpus                 :                 1          
	   Disk (KB)            :       10       10          
	   Memory (MB)          :        0        0          
...
000 (019.000.000) 04/10 11:45:26 Job submitted from host: <10.0.0.1:47407>
...
##  What machine is your central manager?

#CONDOR_HOST = $(FULL_HOSTNAME)

CONDOR_HOST = pucitServer.CentOSWorld.com

DedicatedScheduler = "DedicatedScheduler@muak@pucitServer.CentOSWorld.com"

STARTD_EXPRS = $(START_EXPRS), DedicatedScheduler 

## Pool's short description

COLLECTOR_NAME = Personal Condor at $(FULL_HOSTNAME)

##  When is this machine willing to start a job? 

START = TRUE


##  When to suspend a job?

SUSPEND = FALSE


##  When to nicely stop a job?
##  (as opposed to killing it instantaneously)

PREEMPT = FALSE


##  When to instantaneously kill a preempting job
##  (e.g. if a job is in the pre-empting stage for too long)

KILL = FALSE

##  This macro determines what daemons the condor_master will start and keep its watchful eyes on.
##  The list is a comma or space separated list of subsystem names

DAEMON_LIST = COLLECTOR, MASTER, NEGOTIATOR, SCHEDD, STARTD