Mailing List Archives Public Access	UW Madison Computer Sciences Department Computer Systems Lab

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] GCB Performance

Date: Wed, 18 Jan 2006 14:29:20 -0600
From: Se-Chang Son <sschang@xxxxxxxxxxx>
Subject: Re: [Condor-users] GCB Performance

I sent this only to Chris. So, I am posting this to the group.

Log files say that each job runs about 10sec. In order not to throttlesubmit machine with too many processes running and too many files intransit, Condor, by default, puts 2 second delay between jobinvocations. This is what the manual says:

"This integer-valued macro--JOB_START_DELAY--works together with theJOB_START_COUNT macro to throttle job starts. The condor_ schedd daemonstarts $(JOB_START_COUNT) jobs at a time, then delays for$(JOB_START_DELAY) seconds before starting the next set of jobs. Thisdelay prevents a sudden, large load on the submit machine as it spawnsmany condor_ shadow daemons simultaneously, and it prevents having todeal with their start up activity all at once. The resulting job startrate averages as fast as ($(JOB_START_COUNT)/$(JOB_START_DELAY))jobs/second. This configuration variable is also used during thegraceful shutdown of the condor_ schedd daemon. During gracefulshutdown, this macro determines the wait time in between requesting eachcondor_ shadow daemon to gracefully shut down. It is defined in terms ofseconds and defaults to 2. Setting this macro to a lower value is notadvised, as it can overwhelm the condor_ schedd daemon."

With this default configuration, your job finishes before Condorlaunches all matched jobs (making machines available for jobs that arewaiting for next match). Therefore, you just need 5 ~ 6 VMs to maximizeperformance in your case. Adding more machines contribute nothing andthat's why you get basically the same performance with 20 VMs and 40VMs.



Chris Miles wrote:

Ok. The condor pool is made up off exactly the same spec machines. Itsan IBM Cluster.
I firstly ran a test to see how long my 50 jobs would take on just onemachine (2 VMs)
and it took 5m 11s
I then loaded up 10 nodes -- Jobs took 2m 8s
I then loaded up 20 nodes -- Jobs took 2m 22s
Find attached are the logs for the submission machine from the 10 and 20node tests.
thanks
Chris

Follow-Ups:
- Re: [Condor-users] GCB Performance
  - From: Chris Miles

References:
- Re: [Condor-users] GCB Performance
  - From: Chris Miles

Prev by Date: Re: [Condor-users] GCB Performance
Next by Date: Re: [Condor-users] GCB Performance
Previous by thread: Re: [Condor-users] GCB Performance
Next by thread: Re: [Condor-users] GCB Performance
Index(es):
- Date
- Thread

Mailing List Archives

Public Access

Re: [Condor-users] GCB Performance