[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Ref: Condor Idle job state



Hi,

to tell u, i have fired a job i.e.

$ cat simple_submit1
Universe   = vanilla
Executable = simple1
#Requirements = ( OpSys == 'LINUX' )||( Arch == 'Intel' )
Arguments  = 10 320
Log        = simple1.log
Output     = simple1.$(Process).out
Error      = simple1.$(Process).error
notification=never
should_transfer_files=YES
when_to_transfer_output = ON_EXIT
Queue
$condor_q
 
 
-- Submitter: pal-w2p.corp.cdac.in : <192.168.7.131:32939> : pal-w2p.corp.cdac.in
 ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD
   6.0   condor          8/10 10:23   0+00:00:00 I  0   9.8  simple1 10 320
 
1 jobs; 1 idle, 0 running, 0 held

$condor_q -analyze 6.0
-- Submitter: pal-w2p.corp.cdac.in : <192.168.7.131:32939> : pal-w2p.corp.cdac.in
 ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD
---
006.000 :  Run analysis summary.  Of 5 machines,
      0 are rejected by your job's requirements
      5 reject your job because of their own requirements
      0 match but are serving users with a better priority in the pool
      0 match but reject the job for unknown reasons
      0 match but will not currently preempt their existing job
      0 are available to run your job
        No successful match recorded.
        Last failed match: Thu Aug 10 10:23:01 2006
        Reason for last match failure: no match found
 
WARNING:  Be advised:   Request 6.0 did not match any resource's constraints

could you please suggest me wht is the probable way to run the job...
Pranav

On 8/9/06, Diego Bello <dbello@xxxxxxxxx> wrote:
On 8/9/06, Pranav Aggarwal <pranav.aggwl@xxxxxxxxx> wrote:
> Hi,
>
> I am facing some problem in running jobs on Condor while submitting job from
> Condor...
>
> Well, I have installed condor on linux machine and configured as well.
>
> I submit job using
> $ condor_submit submit
>
> content of 'submit' file is:
>
> Universe = vanilla
> Executable = simple
> Arguments = 4 10
> Log = simple.log
> Output = simple.out
> Error = simple.error
> Queue
>
>
> it is able to create simple.log, simple.out and simple.error, but no content
> in simple.out and simple.error
>
> $condor_q
>
> -- Submitter: sskadam-w2p.corp.cdac.in : < 192.168.7.130:34498> :
> sskadam-w2p.corp.cdac.in
>  ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD
>  14.0 condor 8/5 13:50 0+00:00:00 I 0 9.8 simple2 4 10
>  15.0 condor 8/5 14:09 0+00:00:00 I 0 9.8 simple 4 10
>  16.0 condor 8/5 14:10 0+00:00:00 I 0 9.8 simple 4 10
>  17.0 condor 8/5 14:10 0+00:00:00 I 0 9.8 simple 4 10
>
> 4 jobs; 4 idle, 0 running, 0 held
>
> the state is idle for several hours... with no changes,
> hey can i use the machine on which the job for other purpose, or that
> machine has to remain idle i.e. no other work should be done !?!
>
> Then, I realized that machine should be left idle, so i left the machine
> idle for few hours. To my surprise the job did executed. But what came to
> surprise that, whenever I query the machine, it always says that job is idle
> and not running.
>
> Please tell me how should i solve the above problem, I just need job should
> be in run state and give the output and not remain in idle state...
>
> hope u can help me ,
>
> Thanks
> Pranav

Look the output of the condor_q -analyze JOB_NUMBER
replace JOB_NUMBER with 14, 15, 16 and 17. It should show you what is going on.


--
Diego Bello Carreño
Estudiante Memorista de Ingeniería Civil Informática
UTFSM, Valparaíso, Chile
Usuario #294897 counter.li.org

_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at either
https://lists.cs.wisc.edu/archive/condor-users/
http://www.opencondor.org/spaces/viewmailarchive.action?key=CONDOR