[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: [Condor-users] jobs wait in idle mode unecessarily



It's a vanilla job and the file permissions are OK (it's
under win 2k). Also there are no nice user options
specified. Unfortunately I can't seem to reproduce it at
the moment but I'm getting a similar possibly related
problem that killed jobs hang around in the idle state.

C:\Condor\ics>condor_q -analyze
-- Submitter: 102153-71130c.liv.ac.uk : <138.253.102.153:1042> : 102153-71130c.l
iv.ac.uk
ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD
---
186.000: Run analysis summary. Of 2 machines,
1 are rejected by your job's requirements
0 reject your job because of their own requirements
0 match, but are serving users with a better priority in the pool
1 match, but prefer another specific job despite its worse user-priority
0 match, but will not currently preempt their existing job
0 are available to run your job
Last successful match: Mon Jun 21 12:31:39 2004


1 jobs; 1 idle, 0 running, 0 held

This from SchedLog looks pertinent:

6/21 12:22:09 DaemonCore: Command received via TCP from host <138.253.102.153:2309>
6/21 12:22:09 DaemonCore: received command 443 (VACATE_SERVICE), calling handler (vacate_service)
6/21 12:22:09 Got VACATE_SERVICE from <138.253.102.153:2309>
6/21 12:22:09 Sent RELEASE_CLAIM to startd on <138.253.102.153:1041>
6/21 12:22:09 Match record (<138.253.102.153:1041>, 183, 0) deleted
6/21 12:22:09 DaemonCore: Command received via UDP from host <138.253.102.153:2311>
6/21 12:22:09 DaemonCore: received command 60001 (DC_PROCESSEXIT), calling handler (HandleProcessExitCommand())
6/21 12:22:09 Scheduler::Relinquish - mrec is NULL, can't relinquish
6/21 12:22:09 Null parameter --- match not deleted


any ideas,

regards,

-ian.

--On 21 June 2004 11:31 +0100 "Kewley, J (John)" <J.Kewley@xxxxxxxx> wrote:

      1 match, but prefer another specific job despite its worse
user-priority

I think there are quite a number of things that cause this.


There may be some hints in the log files about a possible problem.

Is it a vanilla job? If so do you have write permissions on your
log, error and output files?

JK
_______________________________________________
Condor-users mailing list
Condor-users@xxxxxxxxxxx
http://lists.cs.wisc.edu/mailman/listinfo/condor-users