[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] ERROR: Failed to commit job submission into the queue.



Hello,
I have set up a small(4 node) test grid using condor - 4 linux(2.6 kernel)
machines using a shared file system, running condor 6.8. On Friday, I tested a
job in the java universe, which ran a number (~20) of times quite happily. I
then ramped up the number of jobs, somewhat optimistically, to 300,000 and
left for the day. I've come back in to find the following error and zero
output:
ERROR: Failed to commit job submission into the queue.

1) Is there a limit on the job queue length in condor?
2) If so, is this by design, or determined by an installation specific factor,
such as the O/S or available memory?
3) Where is this documented? Sorry, but I cannot find it anywhere in the
manual, or forum history.

Any help would be much appreciated.
Many thanks,
Dan
------------------------------------------------------
   Dan Scarborough
   Research IT
   Deutsche Bank
   +44 (0)20 754 55914
------------------------------------------------------

---

This e-mail may contain confidential and/or privileged information. If you are
not the intended recipient (or have received this e-mail in error) please
notify the sender immediately and delete this e-mail. Any unauthorized
copying, disclosure or distribution of the material in this e-mail is strictly
forbidden.

Please refer to http://www.db.com/en/content/eu_disclosures.htm for additional
EU corporate and regulatory disclosures.