[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] heavy loads



I forgot to mention that this is using condor 6.7.18, and there are > 1300 jobs in the queue right now (all but 200 are idle).

--Mike

Michael Thomas wrote:
While doing some stress testing on our 200-node cluster using condor-g, we have noticed some extremely large loads on the cluster. The large load seems to be caused by 500+ globus-job-manager processes, with sometimes 2 or 3 globus-job-manager processes for each job.

condor_config contains the line:
GRIDMANAGER_MAX_JOBMANAGERS_PER_RESOURCE = 10
...but that seems to be ignored.

Why would we have multiple globus-job-managers for a single job, and what can we do to reduce the number of globus-job-manager processes so that our gatekeeper doesn't get quite so overloaded?

--Mike


------------------------------------------------------------------------

_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at either
https://lists.cs.wisc.edu/archive/condor-users/
http://www.opencondor.org/spaces/viewmailarchive.action?key=CONDOR

Attachment: smime.p7s
Description: S/MIME Cryptographic Signature