
[HTCondor-users] Whole memory request for wholeMemory job.



Hi, 

I am new to Condor, but I have managed to get the basic setup running without any problems.

Now I want to allow my users to submit whole-machine jobs while keeping the same setup. That is, if a whole-machine job arrives at a machine where single-slot jobs are still running, Condor should simply suspend the whole-machine job and start it once the single-slot jobs have finished. With the help of the following document I have managed to do this:

https://nmi.cs.wisc.edu/node/1482

Things are now working, but it turns out that when a whole-machine job starts on slot1 (with the rest of the slots empty), it still uses only the memory assigned to slot1.
I would expect a whole-machine job to get the whole machine's memory as well.

Can someone please guide me on how to fix this? My configuration follows.

----------------
# Require that whole-machine jobs only match to Slot1
START = ($(START)) && (TARGET.RequiresWholeMachine =!= TRUE || SlotID == 1)

# have the machine advertise when it is running a whole-machine job
STARTD_JOB_EXPRS = $(STARTD_JOB_EXPRS) RequiresWholeMachine

# Export the job expr to all other slots
STARTD_SLOT_EXPRS = RequiresWholeMachine

# Suspend the whole-machine job until the other slots are empty
SUSPEND = ($(SUSPEND)) || (SlotID == 1 && Slot1_RequiresWholeMachine =?= True && \
           (Slot2_Activity =?= "Busy" || Slot3_Activity =?= "Busy" || Slot4_Activity =?= "Busy" ) )

# suspend existing single-cpu jobs when there is a whole-machine job
SUSPEND = ($(SUSPEND)) || (SlotID != 1 && Slot1_RequiresWholeMachine =?= True)

CONTINUE = ( $(SUSPEND) =!= True )
---------------
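
For reference, the START expression above tests a custom job attribute, RequiresWholeMachine, so a whole-machine job has to set it in its submit description file. A minimal sketch of such a submit file follows; the executable name is hypothetical and only for illustration, and the rest of the file would depend on the actual job.

----------------
# Hypothetical submit file for a whole-machine job (illustrative only)
universe   = vanilla
# Placeholder name, not a real script from my setup
executable = whole_machine_job.sh

# Custom job attribute that the START and SUSPEND expressions look for
+RequiresWholeMachine = True

queue
---------------

Jobs that do not set this attribute are treated as ordinary single-slot jobs, since the START expression uses =!= TRUE and so also accepts jobs where the attribute is undefined.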

Regards..
Salman.