[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Can a job send a trigger to let other jobs start?



Hi Greg,

On Tuesday 15 December 2009 17:44:44 Greg Thain wrote:
> What a great question!  You could use a DAGman prescript on each node to
> poll for a certain load threshold, and as long as the load is above some
> threshold, sleep for a random period and re-poll.  The script could poll
> the data server's load directly, perhaps, if there's a way to do that.
> Or, it could run condor_q, and count the number of jobs that have been
> running for less than an hour (if the startup phase is about an hour).
> Or, perhaps the jobs themselves could use chirp or condor_qedit to set a
> job attribute in the schedd to indicate which phase they are in, and the
> prescript could poll for that.

What a great reply! :)

Many ideas to try. I like the idea of self-modification via condor_qedit. So 
far, we never used chirp.

Sounds like interesting Xmas projects ;)

cheers

Carsten